Abstract:This document describes a sizable grammar of English written in the TAG formalism and implemented for use with the XTAG system. This report and the grammar described herein supersedes the TAG grammar described in an earlier 1995 XTAG technical report. The English grammar described in this report is based on the TAG formalism which has been extended to include lexicalization, and unification-based feature structures. The range of syntactic phenomena that can be handled is large and includes auxiliaries (including inversion), copula, raising and small clause constructions, topicalization, relative clauses, infinitives, gerunds, passives, adjuncts, it-clefts, wh-clefts, PRO constructions, noun-noun modifications, extraposition, determiner sequences, genitives, negation, noun-verb contractions, sentential adjuncts and imperatives. This technical report corresponds to the XTAG Release 8/31/98. The XTAG grammar is continuously updated with the addition of new analyses and modification of old ones, and an online version of this report can be found at the XTAG web page at <a class="link-external link-http" href="http://www.cis.upenn.edu/~xtag/" rel="external noopener nofollow">this http URL</a>
What problem does this paper attempt to address?
The problem that this paper attempts to solve is to construct a Lexicalized Tree Adjoining Grammar (LTAG) system for English in order to better handle syntactic analysis problems in natural language processing. Specifically, the goals of the paper include:
1. **Formal Definition and Implementation**: Through the introduction of features and unification mechanisms, describe in detail the formal definition of LTAG and its specific implementation methods in English.
2. **Development of the Grammar System**: Develop a comprehensive, feature - based Lexicalized Tree Adjoining Grammar system (XTAG) for parsing English sentence structures.
3. **Handling Complex Syntactic Phenomena**: For complex syntactic phenomena in English (such as verb phrases, passive voices, interrogative sentences, relative clauses, etc.), provide detailed analysis and solutions.
4. **Improving Parsing Efficiency and Accuracy**: By improving the grammar and algorithms, enhance the parsing efficiency and accuracy of natural language processing systems for English sentences.
### Key Point Summary
- **TAG Formalization**: The paper introduces the basic concepts of Tree Adjoining Grammar (TAG), including elementary trees, substitution, and adjunction operations.
- **Lexicalization**: Closely combine grammar rules with lexical items, so that each grammar structure is associated with specific lexical items, thereby enhancing the expressive power and flexibility of the grammar.
- **Unification Mechanism**: Introduce features and unification mechanisms to handle the constraint conditions in grammar structures and ensure that the generated sentences conform to grammar rules.
- **System Architecture**: Describe the overall architecture of the XTAG system, including modules such as tree selection, tree database, tree grafting, and grammar development environment.
Through these efforts, the paper aims to provide a powerful and flexible syntactic analysis tool for natural language processing of English, thereby promoting the development of natural language processing technology.