WOLF (Wordnet Libre du Français, Free French Wordnet) is a free
semantic lexical resource (wordnet) for French.
The WOLF has been built from the Princeton WordNet (PWN) and various multilingual resources (Sagot and Fišer 2008a, Sagot and Fišer 2008b, Fišer and Sagot 2008). Polysemous literals have been dealt with by an approach based on word-aligning a parallel corpora in 5 languages. The extracted multilingual lexical has been semantically disambiguated thanks to wordnets for the languages involved. Moreover, a bilingual approach was sufficient for building new entries for monosemous words. To achieve this, we extracted bilingual lexicons from Wikipedia and thesauri. The resulting wordnet has been evaluated against the French wordnet developed during the EuroWordNet project.
In 2009, a specific work has been done on adverbial synsets (Sagot, Fort et Venant 2009a, Sagot, Fort et Venant 2009b).
The WOLF contains all PWN synsets, including those for which no French literal is known.
The WOLF is in the XML format used in the BalkaNet project. For now, SENSE elements are filled with information on the sources thanks to which the lexeme was found, and not with sense numbers.
The WOLF is a free resource, distributed under the Cecill-C license (LGPL compatible).
For now, the WOLF and the Lefff are not mapped. In the following months, Lefff entries should receive WOLF (i.e. PWN) synset ids.
DOWNLOADLast distributed version (0.1.5)