Abstract:The literature on word-representable graphs is quite rich, and a number of variations of the original definition have been proposed over the years. We are initiating a systematic study of such variations based on formal languages. In our framework, we can associate a graph class to each language over the binary alphabet \{0,1\}. All graph classes that are language-representable in this sense are hereditary and enjoy further common properties. Besides word-representable graphs and, more generally, 1^k- or k-11-representable graphs, we can identify many more graph classes in our framework, like (co)bipartite graphs, (co)comparability graphs, to name a few. It was already known that any graph is 111- or 2-11-representable. When such representations are considered for storing graphs, 111- or 2-11-representability bears the disadvantage of being significantly inferior to standard adjacency matrices or lists. We prove that quite famous languages like the palindromes, the copy language or the Lyndon words can match the efficiency of standard graph representations. The perspective of language theory allows us to prove general results that hold for all graph classes that can be defined in this way. This includes certain closure properties (e.g., all language-definable graph classes are hereditary) as well as certain limitations (e.g., all language-representable graph classes contain graphs of arbitrarily large treewidth and of arbitrarily large degeneracy, except a trivial case). As each language describes a graph class, we can also ask decidability questions concerning graph classes, given a concrete presentation of a formal language. We also present a systematic study of graph classes that can be represented by languages in which each letter occurs at most twice. Here, we find graph classes like interval, permutation, circle, bipartite chain, convex, and threshold graphs.
What problem does this paper attempt to address?
The main problem that this paper attempts to solve is to explore and systematize Generalized Word - Representable Graphs based on formal language theory. Specifically, the authors associate formal languages over a binary alphabet with graph classes by introducing a new framework, in order to study the properties and characteristics of graph classes defined by different languages.
### Decomposition of the Main Problem
1. **Define New Types of Graph Classes**:
- Through formal language theory, a new graph class \( G_L \) is defined, where \( L \) is a binary language. This way of definition allows many known graph classes (such as interval graphs, permutation graphs, etc.) to be incorporated into a unified framework for study.
2. **Study the Common Properties of Graph Classes**:
- It is proved that all graph classes defined by languages are hereditary, that is, these graph classes are closed under induced subgraphs.
- Some closure properties and other constraints of these graph classes are explored, for example, they contain arbitrarily large tree - width graphs and degenerate graphs.
3. **Evaluate the Effectiveness of Different Language Representations**:
- The efficiency of different languages (such as palindrome languages, copy languages, Lyndon words, etc.) in representing graphs is compared, and it is shown that these languages can match the efficiency of standard graph representation methods (such as adjacency matrices or adjacency lists) in general.
4. **Explore the Characteristics of Graph Classes Defined by Specific Languages**:
- The graph classes defined by certain specific languages (such as binary languages in which each letter appears at most twice) are studied, and some well - known graph classes (such as interval graphs, permutation graphs, circular graphs, etc.) are discovered.
5. **Propose New Research Directions and Open Problems**:
- Some new research directions and open problems are proposed, for example, given a language described by a context - free grammar, how to determine whether the graph class it defines contains only graphs with a tree - width of no more than 2.
### Formula Summary
There are many formal language and graph theory concepts involved in the paper. The following is a summary of some key formulas and symbols:
- **Binary Language Representation**: \( L\subseteq \{0, 1\}^* \)
- **Graph Class Definition**: \( G_L=\{G(L, w)\mid w\in V^*\} \), where \( G(L, w) \) represents the graph defined by the language \( L \) and the word \( w \).
- **Projection Homomorphism**: \( h_{u, v}(w) \), which maps the letters in the word \( w \) other than \( u \) and \( v \) to the empty word.
- **Frequency Set**: \( \text{freq}(L)=\{n\in\mathbb{N}_{\geq1}\mid\exists w\in L:|w|_0 = n\} \)
Through these formulas and definitions, the authors construct a powerful framework to study and understand graph classes defined by different languages and provide a rich theoretical basis for future research.