Data Language Specification via Terminal Attribution

Alexander Sakharov,Timothy Sakharov
DOI: https://doi.org/10.48550/arXiv.1511.00909
2015-11-03
Abstract:Unstructured data have to be parsed in order to become usable. The complexity of grammar notations and the difficulty of grammar debugging limit the use of parsers for data preprocessing. We introduce a notation in which grammars are defined by simply dividing terminals into predefined classes and then splitting elements of some classes into multiple layered sub-groups. These LL(1) grammars are designed for data languages. They simplify the task of developing data parsers.
Formal Languages and Automata Theory
What problem does this paper attempt to address?