Generating robust parsers using island grammars

Source model extraction, the automated extraction of information from system artifacts, is a common phase in reverse engineering tools. One of the major challenges of this phase is creating extractors that can deal with irregularities in the artifacts that are typical for the reverse engineering dom...

Full description

Saved in:
Bibliographic Details
Published inProceedings Eighth Working Conference on Reverse Engineering pp. 13 - 22
Main Author Moonen, L.
Format Conference Proceeding
LanguageEnglish
Published IEEE 2001
Subjects
Online AccessGet full text
ISBN0769513034
9780769513034
ISSN1095-1350
DOI10.1109/WCRE.2001.957806

Cover

More Information
Summary:Source model extraction, the automated extraction of information from system artifacts, is a common phase in reverse engineering tools. One of the major challenges of this phase is creating extractors that can deal with irregularities in the artifacts that are typical for the reverse engineering domain (for example, syntactic errors, incomplete source code, language dialects and embedded languages). The paper proposes a solution in the form of island grammars, a special kind of grammar that combines the detailed specification possibilities of grammars with the liberal behavior of lexical approaches. We show how island grammars can be used to generate robust parsers that combine the accuracy of syntactical analysis with the speed, flexibility and tolerance usually only found in lexical analysis. We conclude with a discussion of the development of MANGROVE, a generator for source model extractors based on island grammars and describe its application to a number of case studies.
ISBN:0769513034
9780769513034
ISSN:1095-1350
DOI:10.1109/WCRE.2001.957806