COMPAS (COMpiler for PArsing Schemata)

On this page, you can download the source code and binary files for the COMPAS parsing schema compiler.

About the COMPAS system

COMPAS (COMpiler for PArsing Schemata) is a system that can be used to automatically compile formal specifications of parsing algorithms (in the form of parsing schemata) to efficient Java implementations of the corresponding parsers.
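To give a rough idea of what a parsing schema specifies, the sketch below hand-codes a naive deductive engine for one well-known schema, CYK recognition over a grammar in Chomsky normal form. Items have the form [A, i, j], and two deduction steps derive new items until no more can be added. This is only an illustration under assumptions: the class and method names (CykSchemaSketch, recognize, combine) are hypothetical, the choice of CYK is ours, and the code does not reproduce COMPAS's input syntax or the indexed, optimized code that it generates. It requires Java 16 or later because it uses records.

```java
import java.util.ArrayDeque;
import java.util.Deque;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

// Hypothetical illustration; not code produced by COMPAS.
final class CykSchemaSketch {
    /** Item [A, i, j]: nonterminal A derives the input words between positions i and j. */
    record Item(String symbol, int start, int end) {}

    /** Binary rule A -> B C of a grammar in Chomsky normal form. */
    record BinaryRule(String lhs, String left, String right) {}

    /** Lexical rule A -> a. */
    record LexicalRule(String lhs, String word) {}

    static boolean recognize(List<String> words, List<LexicalRule> lexical,
                             List<BinaryRule> binary, String startSymbol) {
        Set<Item> chart = new HashSet<>();       // every item deduced so far
        Deque<Item> agenda = new ArrayDeque<>(); // items not yet used as premises

        // Initial step: a rule A -> a and the word a at position i yield [A, i, i+1].
        for (int i = 0; i < words.size(); i++) {
            for (LexicalRule rule : lexical) {
                if (rule.word().equals(words.get(i))) {
                    Item item = new Item(rule.lhs(), i, i + 1);
                    if (chart.add(item)) {
                        agenda.add(item);
                    }
                }
            }
        }

        // Binary step: [B, i, k], [C, k, j] and a rule A -> B C yield [A, i, j].
        // Each item taken from the agenda is tried against the whole chart,
        // both as the left and as the right premise.
        while (!agenda.isEmpty()) {
            Item current = agenda.poll();
            for (Item other : List.copyOf(chart)) {
                for (BinaryRule rule : binary) {
                    combine(current, other, rule, chart, agenda);
                    combine(other, current, rule, chart, agenda);
                }
            }
        }

        // Goal item: the start symbol spanning the whole input.
        return chart.contains(new Item(startSymbol, 0, words.size()));
    }

    private static void combine(Item left, Item right, BinaryRule rule,
                                Set<Item> chart, Deque<Item> agenda) {
        if (left.end() == right.start()
                && rule.left().equals(left.symbol())
                && rule.right().equals(right.symbol())) {
            Item derived = new Item(rule.lhs(), left.start(), right.end());
            if (chart.add(derived)) {
                agenda.add(derived);
            }
        }
    }

    public static void main(String[] args) {
        // Toy grammar: S -> A B, A -> a, B -> b.
        List<LexicalRule> lexical = List.of(new LexicalRule("A", "a"), new LexicalRule("B", "b"));
        List<BinaryRule> binary = List.of(new BinaryRule("S", "A", "B"));
        System.out.println(recognize(List.of("a", "b"), lexical, binary, "S"));
    }
}
```

The engine above scans the whole chart for every item, which is simple but slow. A compiler for parsing schemata is useful precisely because it can generate this kind of code automatically, and with more efficient item lookup, from the declarative description alone.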

The system was registered with the Xunta de Galicia Intellectual Property Office, with registration number C-40-2009.

The system's design and implementation are described in this journal article:

Carlos Gómez-Rodríguez, Jesús Vilares and Miguel A. Alonso,
A compiler for parsing schemata,
Software: Practice and Experience, 39(5):441-470, 2009. ISSN 0038-0644. DOI 10.1002/spe.904
[GomVilAlo2009a.pdf]

Some design aspects are described in less detail in the following publications:

Carlos Gómez-Rodríguez, Jesús Vilares and Miguel A. Alonso,
Compiling Declarative Specifications of Parsing Algorithms,
in R. Wagner, R. Newell and G. Pernul (eds.), Database and Expert Systems Applications, volume 4653 of Lecture Notes in Computer Science, pp. 529-538, Springer-Verlag, Berlin-Heidelberg-New York, 2007. ISSN 0302-9743.
[GomVilAlo2007a.pdf]

Carlos Gómez-Rodríguez, Miguel A. Alonso and Manuel Vilares,
Generation of indexes for compiling efficient parsers from formal specifications,
in Roberto Moreno-Díaz, Franz Pichler, and Alexis Quesada-Arencibia (eds.), Computer Aided Systems Theory, volume of Lecture Notes in Computer Science, Springer-Verlag, Berlin-Heidelberg-New York, 2007. ISSN 0302-9743.
[GomAloVil2007b.pdf]

Carlos Gómez-Rodríguez, Jesús Vilares and Miguel A. Alonso,
Prototyping Efficient Natural Language Parsers,
in Proc. of International Conference RANLP 2007, Recent Advances in Natural Language Processing, Borovets, Bulgaria, 2007.
[GomVilAlo2007b.pdf]

Carlos Gómez-Rodríguez, Jesús Vilares and Miguel A. Alonso,
Automatic Generation of Natural Language Parsers from Declarative Specifications,
in Loris Penserini, Pavlos Peppas and Anna Perini (eds.), STAIRS 2006 - Proceedings of the Third Starting AI Researchers' Symposium, Riva del Garda, Italy, August 28-29, 2006, volume 142 of Frontiers in Artificial Intelligence and Applications, pp. 259-260, IOS Press, Amsterdam/Berlin/Oxford/Tokyo/Washington DC, 2006. ISSN 0922-6389 / ISBN 1-58603-645-9.
[GomVilAlo2006a.pdf]

Additionally, the system has been used to perform the empirical tests reported in the following publications:

Carlos Gómez-Rodríguez, Miguel A. Alonso and Manuel Vilares,
On Theoretical and Practical Complexity of TAG Parsers,
in Paola Monachesi, Gerald Penn, Giorgio Satta and Shuly Wintner (eds.), FG 2006: The 11th conference on Formal Grammar. Malaga, Spain, July 29-30, 2006, chapter 5, pp. 61-75, Center for the Study of Language and Information, Stanford, 2006.
[GomAloVil2006b.pdf]

Carlos Gómez-Rodríguez, Miguel A. Alonso and Manuel Vilares,
Generating XTAG Parsers from Algebraic Specifications,
in Proceedings of the 8th International Workshop on Tree Adjoining Grammar and Related Formalisms. Sydney, July 2006, pp. 103-108, Association for Computational Linguistics, East Stroudsburg, PA, 2006. ISBN: 1-932432-85-X.
[GomAloVil2006a.pdf]

Both a description of the system and the empirical tests conducted with it are reported in the following Ph.D. thesis:

Carlos Gómez-Rodríguez,
Parsing schemata for practical text analysis,
Ph.D. Thesis, Departamento de Computación, Universidade da Coruña, Spain, 2009 (xviii + 290 pp).

How to use

The COMPAS system is written in Java, so it runs on any system for which a Java Virtual Machine (JVM) is available, including Windows and Linux.

You need to have the following software installed in order to use the system:
Once you have this software, download the system and refer to "readme.txt" for detailed usage instructions. If you understand Spanish, you can also download a user manual in that language.

Download (source code + binaries)

The COMPAS system can be downloaded here. The archive contains both the binaries and complete source code.

A 20-page user manual in Spanish is also available; download it here.

Licensing terms

COMPAS is free software: you can redistribute it and/or modify it under the terms of the GNU General Public License (version 3) as published by the Free Software Foundation.

COMPAS is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.

The text of the GNU General Public License can be found inside the distribution or at http://www.gnu.org/licenses/

Contact

The system is maintained by Carlos Gómez Rodríguez. Please direct any questions or comments by email to <cgomezr@udc.es>.

Last update: 2009-06-14.