z-logo
open-access-imgOpen Access
RE2C: A lexer generator based on lookahead-TDFA
Author(s) -
Ulya Trofimovich
Publication year - 2020
Publication title -
software impacts
Language(s) - English
Resource type - Journals
ISSN - 2665-9638
DOI - 10.1016/j.simpa.2020.100027
Subject(s) - computer science , compiler , generator (circuit theory) , state (computer science) , algorithm , programming language , physics , power (physics) , quantum mechanics
RE2C is a regular expression compiler: it transforms regular expressions into finite state machines and encodes them as programs in the target language. At the core of RE2C is the lookahead-TDFA algorithm that allows it to perform fast and lightweight submatch extraction. This article describes the algorithm used in RE2C and gives an example of TDFA construction.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom