ALPAGE's tools and resources

This document lists the collection of tools and resources developed by INRIA Project-Team ALPAGE for linguistic processing.

Table of content

package dependencies

DyALog

DyALog is an environment to compile and run tabular Logic Programs and Parsers, for Definite Clause Grammars [DCG], Tree Adjoining Grammars [TAG], and Range Concatenation Grammars [RCG].

Tabulation means that traces of computation are tabulated in order to share common sub-computations and detect (most kinds of) loops. By keeping traces of computations, it is also possible, after a computation, to extract, as a shared forest, the set of all successful proofs or parse trees. These characteristics are especially useful to handle highly recursive and ambiguous grammars for Natural Language.

Version 1.13 (beta) - released 2010-02-17
Version 1.12 (beta) - released 2008-08-21
Version 1.11.4 (beta) - released 2008-04-01
Version 1.11.3 (beta) - released 2007-05-31
Version 1.11.2 (beta) - released 2006-12-08
Version 1.11.0 (beta) - released 2006-03-06
Version 1.10.7 (beta) - released 2005-11-03
Version 1.10.6 (beta) - released 2005-09-19
Version 1.10.5 (beta) - released 2004-12-03
Version 1.10.4 (stable) - released 2004-03-26

[Top]

dyalog-xml

dyalog-xml is a DyALog module providing predicates over LibXML API. This module, while very preliminary, may be used to read, process and print XML documents with DyALog.

Practical informations
Version 1.1.0 (beta) - released 2010-02-17
Version 1.0.4 (beta) - released 2008-08-22
Version 1.0.3 (beta) - released 2007-06-01
Version 1.0.2 (beta) - released 2006-12-11
Version 1.0.1 (beta) - released 2006-03-22
Version 1.0.0 (beta) - released 2006-03-22

[Top]

dyalog-sqlite

dyalog-sqlite is a DyALog module providing predicates over SQLITE3 API. This module, while very preliminary, may be used to create and access sqlite3 databases, for instance lexicons or lexical preference databases.

Practical informations
Version 1.1.0 (beta) - released 2008-02-17
Version 1.0.1 (beta) - released 2008-08-22
Version 1.0.0 (beta) - released 2006-03-22

[Top]

SYNTAX

SYNTAX is an environment for generating efficient parsers from a grammatical description. Formalisms dealt with by SYNTAX include, among others, CFGs (deterministic or ambiguous), TAGs, LFGs, RCGs.

Practical informations
Version 6.0b7 (beta) - released 2010-02-17
Version 6.0b5 (beta) - released 2008-08-20
Version 6.0b4 (beta) - released 2008-04-02
Version 6.0b3 (beta) - released 2008-03-14
Version 6.0b2 (beta) - released 2007-06-27
Version 6.0b (beta) - released 2007-04-25

[Top]

SxLFG

SxLFG is an LFG parser generator based on SYNTAX which benefits from extra modules like a chunker module and a bootstrappable n-best module. Includes a meta-LFG grammar for French (which relies on the Lefff) and the compiler from meta-LFG to LFG.

Version 2.0 (beta) - released tomorrow

[Top]

alpi

alpi is a Perl script to help users install locally specific Alpage software or ALPAGE French linguistic processing chain based on DyALog, frmg, Lefff, lefff-frmg, sxpipe.

Practical informations
Version 3.0.1 - released 2010-03-11
Version 2.0.0 - released 2008-09-04
Version 0.1 - released 2008-02-28

[Top]

forest_utils

forest_utils is a set of Perl scripts to convert between various formats for shared derivation forest produced by parsers for Tree Adjoining Grammars [TAG].

Practical informations
Version 0.2 - released 2010-02-17
Version 0.1 - released 2008-08-28
Version 0.09 (beta) - released 2007-07-11
Version 0.08 (beta) - released 2006-12-08
Version 0.07 (beta) - released 2006-03-31
Version 0.06 (beta) - released 2006-03-22
Version 0.05 (beta) - released 2005-11-03
Version 0.04 (beta) - released 2004/04/08

[Top]

parserd

parserd is a set of Perl scripts to setup a server of parsers. Web CGI clients are available, providing graphical display of shared forests.

Practical informations
Version 2.3.0 - released 2010-02-17
Version 2.2.1 - released 2008-08-28
Version 2.1.5 (beta) - released 2006-03-24
Version 2.1.4 (beta) - released 2006-03-24
Version 2.1.3 (beta) - released 2005-11-28
Version 2.0.2 (beta) - released 2004-11-26

[Top]

lexed

lexed may be used to build and consult lexicons.

Practical informations
  • Maintainer: Lionel Clement [Lionel.Clement@inria.fr]
  • Home page
  • SVN (use the second command if you have an account on INRIA GForge and replace "login" by you login on Gforge)
    # svn co svn://scm.gforge.inria.fr/svn/lingwb/lexed/trunk
    # svn co svn+ssh://login@scm.gforge.inria.fr/svn/lingwb/lexed/trunk
          
    
  • SVN Web
Version 4.7 - released 2006-12-15
Version 4.6 - released 2004/11/23
Version 4.3.2

[Top]

sxpipe

sxpipe is a pre-parsing processing chain that handles segmentation and tokenization, spelling error correction, and named entities regognition. It is designed to transform in a robust way raw corpus to DAG of lexical entries.

Practical informations
Version 2.1 - released 2010-02-17
Version 2.0b3 - released 2008-08-22
Version 2.0b2 - released 2008-04-02
Version 2.0b - released 2008-03-14
Version 1.2.1 - released 2007-06-27
Version 1.2 - released 2007-05-11
Version 1.0.0 - released 2005-03

[Top]

mafd
A small demonstrator for MAF
Practical informations
  • Maintainer: Eric de la Clergerie [Eric.De_La_Clergerie@inria.fr]
  • SVN (use the second command if you have an account on INRIA GForge and replace "login" by you login on Gforge)
    # svn co svn://scm.gforge.inria.fr/svn/lingwb/mafd/trunk
    # svn co svn+ssh://login@scm.gforge.inria.fr/svn/lingwb/mafd/trunk
          
    
  • SVN Web

[Top]

Lingua::Features
Lingua::Features implements natural languages features in Perl. It brings the following advantages:
  • type verification
  • features and values normalization
  • smart comparisons between structures
Practical informations
  • Maintainer: Guillaume Rousse [grousse@cpan.org]
  • Home page
  • CVS: use something like
    # cvs -d:pserver:anoncvs@atollcvs.inria.fr:/atoll login
    # cvs -d:pserver:anoncvs@atollcvs.inria.fr:/atoll co Lingua-Features
    
  • CVS Web
Version 0.3.1 - released 2008-08-22
Version 0.3 - released 2004-06-10

[Top]

Lingua::TagSet
Lingua::TagSet allows to convert values between different tagsets used in natural language processing, using Lingua::Features as a pivot format.
Practical informations
  • Maintainer: Guillaume Rousse [grousse@cpan.org]
  • Home page
  • CVS: use something like
    # cvs -d:pserver:anoncvs@atollcvs.inria.fr:/atoll login
    # cvs -d:pserver:anoncvs@atollcvs.inria.fr:/atoll co Lingua-TagSet
    
  • CVS Web
Version 0.3.2 - released 2006-09-01
Version 0.3.1 - released 2006-08-22
Version 0.3 - released 2004-06-10

[Top]

Lingua::MAF
Lingua::MAF allows to easily produces MAF-format documents. MAF (Morpho-Syntactic Annotation Framework) is a standard proposal developed by ISO TC37SC4 commitee, see http://www.tc37sc4.org for further details.
Practical informations
  • Maintainer: Guillaume Rousse [grousse@cpan.org]
  • Home page
  • CVS: use something like
    # cvs -d:pserver:anoncvs@atollcvs.inria.fr:/atoll login
    # cvs -d:pserver:anoncvs@atollcvs.inria.fr:/atoll co Lingua-MAF
    
  • CVS Web
Version 0.1.1 - released 2006-09-01
Version 0.1 - released 2005-05-25

[Top]

Lingua::Matcher
Lingua::Matcher allows to assemble matchers based on regular expressions and instanciated from data.
Practical informations
  • Maintainer: Guillaume Rousse [grousse@cpan.org]
  • SVN (use the second command if you have an account on INRIA GForge and replace "login" by you login on Gforge)
    # svn co svn://scm.gforge.inria.fr/svn/lingwb/Lingua-Matcher/trunk
    # svn co svn+ssh://login@scm.gforge.inria.fr/svn/lingwb/Lingua-Matcher/trunk
          
    
  • SVN Web
Version 0.1 - released 2007-02-14

[Top]

Dict::Lexed
Dict::Lexed is a perl wrapper around lexed.
Practical informations
  • Maintainer: Guillaume Rousse [grousse@cpan.org]
  • Home page
  • SVN (use the second command if you have an account on INRIA GForge and replace "login" by you login on Gforge)
    # svn co svn://scm.gforge.inria.fr/svn/lingwb/Dict-Lexed/trunk
    # svn co svn+ssh://login@scm.gforge.inria.fr/svn/lingwb/Dict-Lexed/trunk
          
    
  • SVN Web
Version 0.2.2 - released 2006-08-22
Version 0.2.1 - released 2005-05-25

[Top]

Dict::FSA
Dict::FSA is a perl wrapper around fsa, a set of tools based on finite state automata (See this page).
Practical informations
  • Maintainer: Guillaume Rousse [grousse@cpan.org]
  • Home page
  • SVN (use the second command if you have an account on INRIA GForge and replace "login" by you login on Gforge)
    # svn co svn://scm.gforge.inria.fr/svn/lingwb/Dict-FSA/trunk
    # svn co svn+ssh://login@scm.gforge.inria.fr/svn/lingwb/Dict-FSA/trunk
          
    
  • SVN Web
Version 0.1.2 - released 2006-08-22
Version 0.1.1 - released 2005-05-25

[Top]

mgcomp

mgcomp is a DyALog compiler for Linguistic Meta-Grammars, with some additional files for a full processing chain.

Practical informations
Version 1.5.0 (beta) - released 2010-02-17
Version 1.4.3 (beta) - released 2008-04-01
Version 1.4.2 (beta) - released 2007-04-05
Version 1.4.1 (beta) - released 2006-03-22
Version 1.4.0 (beta) - released 2004-12-03
Version 1.3.0 (beta) - released 2004-04-23
Version 1.2.1 (stable) - released 2004-04-13

[Top]

tag_utils

tag_utils is a set of Perl scripts to convert between various formats for Tree Adjoining Grammars, in particular the XML TAGML format

Practical informations
Version 1.13 (beta) - released 2010-02-17
Version 1.12 (beta) - released 2008-08-28
Version 1.11 (beta) - released 2006-12-11
Version 1.10 (beta) - released 2006-05-02

[Top]

mgtools
mgtools collects several useful tools and scripts to edit and visualize Meta Grammars
  • smg2xml to convert MG from a simple format (smg) to XML format
  • mg.el an Emacs mode to edit MG (in both XML and SMG formats)
  • mg2smg.xsl an XSLT stylesheet to convert from XML to SMG
  • mgviewer.pl a graphical viewer/editor for MG
  • mg*2html.xsl several XSLT stylesheets to get HTML views on MG
  • mg2*.xsl several XSLT stylesheets to convert to other formats
Practical informations
  • Maintainer: Eric de la Clergerie [Eric.De_La_Clergerie@inria.fr]
  • Other author: François Thomasset []
  • SVN (use the second command if you have an account on INRIA GForge and replace "login" by you login on Gforge)
    # svn co svn://scm.gforge.inria.fr/svn/mgkit/mgtools/trunk
    # svn co svn+ssh://login@scm.gforge.inria.fr/svn/mgkit/mgtools/trunk
          
    
  • SVN Web
Version 2.2.2 (beta) - released 2010-02-17
Version 2.2.1 (beta) - released 2008-08-28
Version 2.2.0 (beta) - released 2007-06-19
Version 2.1.0 (beta) - released 2007-04-05
Version 2.0.0 (beta) - released 2006-12-11
Version 1.0.1 (beta) - released 2004-12-07
Version 0.0.1 (beta) - released 2004-12-02

[Top]

frmg

frmg is a a French grammar generated from a MetaGrammar and compiled with DyALog.

It is based on a previous MetaGrammar (frenchmg), itself based on an previous version developped by Lionel Clément.

A new simplified notation has been used to simplify the writing of the MetaGrammar, with still conversion to an XML exchange format.

The MetaGrammar exploits several new functionalities allowed by mgcomp (MG compiler) and DyALog to be more compact. We can cite MG classes with namespace (to use several times a same class but with different namespaces in a terminal class), free node ordering when possible (interleaving), conditional nodes, ...

Practical informations
  • Maintainer: Eric de la Clergerie [Eric.De_La_Clergerie@inria.fr]
  • SVN (use the second command if you have an account on INRIA GForge and replace "login" by you login on Gforge)
    # svn co svn://scm.gforge.inria.fr/svn/mgkit/frmg/trunk
    # svn co svn+ssh://login@scm.gforge.inria.fr/svn/mgkit/frmg/trunk
          
    
  • SVN Web
Version 2.0.1 - released 2010-02-17
Version 1.2.0 - released 2008-08-28
Version 1.1.0 (alpha) - released 2008-04-01
Version 1.0.2 (alpha) - released 2006-12-11
Version 1.0.0 (alpha) - released 2006-03-24
Version 0.0.4 (alpha) - released 2004-12-08
Version 0.0.1 (alpha) - released 2004-11-25

[Top]

biomg

biomg is a French grammar generated from a MetaGrammar and compiled with DyALog, to be used to parse botanical descriptions.

It is based on frmg, a general MetaGrammar for French.

Practical informations
  • Maintainer: Eric de la Clergerie [Eric.De_La_Clergerie@inria.fr]
  • SVN (use the second command if you have an account on INRIA GForge and replace "login" by you login on Gforge)
    # svn co svn://scm.gforge.inria.fr/svn/mgkit/biomg/trunk
    # svn co svn+ssh://login@scm.gforge.inria.fr/svn/mgkit/biomg/trunk
          
    
  • SVN Web
Version (alpha) - released tomorrow

[Top]

french-toy-tag

french-toy-tag is a toy Tree Adjoining Grammar for French that may be compiled with DyALog. Two flavors of parsers (TAG and hybrid TAG/TIG) may be built and registered in a server of parsers. The parser may run in robust mode (for partial parsing).

Practical informations
Version 1.0.4 (beta) - released 2006-12-21
Version 1.0.3 (beta) - released 2004-04-14
Version 1.0.2 (beta) - released 2004-04-14
Version 1.0.1 (beta) - released 2004/04/11

[Top]

Lefff
A French Morphological and Syntactic Lexicon
Version 3.0.2 - released 2010-02-17
Version 2.5 - released 2007-04-19
Version 2.2.1 - released 2006-09-19
Version 2 - released 2004-01
Version 1 - released 2004-01

[Top]

lefff-frmg
An adaptation of Lefff for frmg
Practical informations
Version 0.4 - released 2010-02-17
Version 0.3 - released 2008-08-28
Version 0.2 - released 2007-07-31

[Top]

english-tag

english-tag packages a small English Tree Adjoining Grammar (developed in group XTAG at UPenn). The grammar may be compiled with DyALog to get tabular parsers. Two flavors of parsers (TAG and hybrid TAG/TIG) may be built and registered in a server of parsers. The parsers may run in robust mode (for partial parsing).

english-tag also illustrates the use of package tag_utils to build the parsers from XML sources

Practical informations
Version 1.0.3 (beta) - released 2006-12-21
Version 1.0.2 (beta) - released 2004-04-14
Version 1.0.1 (beta) - released 2004/04/11

[Top]

frenchmg

frenchmg may be used to build tabular parsers with DyALog starting from a Meta Grammar. Two flavors of parsers (TAG and hybrid TAG/TIG) may be build and registered in a server of parsers. The parsers may run in robust mode (for partial parsing).

frenchmg also illustrates the use of package mgcomp to compile the Meta Grammar into a Tree Adjoining Grammar. The grammar is converted from XML TAGML format to DyALog input format using tag_utils before being compiled by DyALog.

Practical informations
Version 1.1.1 (alpha) - released 2004-03-24
Version 1.0.2 (alpha) - released 2004/04/11

[Top]

Event::XML::Sync

Event::XML::Sync is a small Perl module to run synchronized XML streams.

This module may be used to develop application wrappers to be installed in XML pipelines.

The wrapper expects some XML input. It sends some bits of information to the wrapped application and information the returned information (when arriving) in the input XML stream (by adding, modifying or deleting XML elements). Output XML is produced for the next wrapper in the pipeline.

The key point is that wrappers are non-blocking, i.e. a wrapper do not stop waiting for information to be returned by the wrapped application. To achieve that, the wrapper needs some kind of synchronization to correlate the returned information with the input XML stream.

Practical informations
Version 1.0 (beta) - released 2004/04/08

[Top]

tokenizer
A tokenizer for French
Practical informations
Version 5.4.1 - released 2004/11/25
Version 5.3.2

[Top]

XLFG
A LFG parser for French
Practical informations
Version 3.8.0
Version 3.6.1

[Top]

yamge
Yet Another Meta Grammar Editor
Practical informations
Version 2.0.2 (beta)

[Top]

WWW::Google::Images
WWW::Google::Images may be used search images on Google. Its interface is heavily inspired from WWW::Google::Groups.
Practical informations
  • Maintainer: Guillaume Rousse [grousse@cpan.org]
  • Home page
  • CVS: use something like
    # cvs -d:pserver:anoncvs@atollcvs.inria.fr:/atoll login
    # cvs -d:pserver:anoncvs@atollcvs.inria.fr:/atoll co WWW-Google-Images
    
  • CVS Web
Version 0.6.2 - released 2005-05-23
Version 0.4 - released 2004-04-08

[Top]

WWW::Orphea
WWW::Orphea may be used search images on an Orphea web server, a Digital Asset Management (DAM) software suite published by Algoba Systems. Its interface is adapted from WWW::Google::Images, itself inspired from WWW::Google::Groups.
Practical informations
  • Maintainer: Guillaume Rousse [grousse@cpan.org]
  • Home page
  • CVS: use something like
    # cvs -d:pserver:anoncvs@atollcvs.inria.fr:/atoll login
    # cvs -d:pserver:anoncvs@atollcvs.inria.fr:/atoll co WWW-Orphea
    
  • CVS Web
Version 0.3.2 - released 2005-05-23
Version 0.3 - released 2004-04-08

[Top]

Last update: 2010-07-15+02:00 - Version: 1.0 - Maintainer: Eric de la Clergerie