A Customised Automata Algorithm and Toolkit for Language Learning and Application

Ruoyu Wang,Guoqiang Li,Jianwen Xiang,Hongming Cai
DOI: https://doi.org/10.1504/ijbdi.2018.10008173
2018-01-01
International Journal of Big Data Intelligence
Abstract:Automata are abstract computing machines. They play a basic role in computability theory and programming language theory. More recently in data analytics, data automata have become a formal way to represent pipelines and workflows. However, in researches involved with automata, there are still situations where redundant work and ununified standards occur. In order to solve that problem, we propose a new toolkit: CAT, which provides a simple and unified framework for automaton construction and customisation. We adopted both structural and behavioural analysis in order to design the body structure. Several calculus algorithms are implemented according to the theoretical accomplishments and designed as overloaded operators. To test the correctness and performance of this toolkit, several bare automata were constructed and compared with 'GREP' in Ubuntu Linux. The result showed that CAT has realised most of the design purposes and presents a more illustrative way for writing codes of automata construction and calculation.
What problem does this paper attempt to address?