On the look-ahead problem in lexical analysis (Q1899099): Difference between revisions

Modern programming languages use regular expressions to define valid tokens. Traditional lexical analyzers based on minimum deterministic finite automata for regular expressions cannot handle the look-ahead problem. The scanner writer needs to explicitly identify the look-ahead states and codes the buffering and re-scanning operations by hand. We identify the class of finite look-ahead finite automata, which is general enough to include all finite automata of practical lexical analyzers. Finite look-ahead finite automata are then transformed into suffix finite automata. A new lexical analyzer makes use of the suffix finite automata to identify tokens. The new lexical analyzer solves the look-ahead problem in a table-driven approach and it can detect lexical errors at an earlier time than traditional lexical analyzers. The extra cost of the new lexical analyzers is the larger state transition table and three additional one-dimensional tables. Incremental lexical analysis is also discussed.

0 references

zbMATH Keywords

deterministic finite automata

0 references

describes a project that uses

Modula

0 references

Flex

0 references

Identifiers

zbMATH Open document ID

0829.68018

0 references

DOI

10.1007/BF01213079

0 references

Mathematics Subject Classification ID

0 references

0 references

0 references

Sitelinks

Mathematics(1 entry)

mardi Publication:1899099

Revision as of 23:01, 28 February 2024 SwMATHimport240215 (talk \| contribs) Bots 507,965 edits ‎Changed an Item ← Older edit	Revision as of 03:36, 29 February 2024 SwMATHimport240215 (talk \| contribs) Bots 507,965 edits ‎Changed an Item Newer edit →
	Property / describes a project that uses
		Flex
	Property / describes a project that uses: Flex / rank
		Normal rank