Multiflex
Multiflex is a formalism and its implementation for an automatic inflection of (contiguous) Multi-Word Expressions. It has been developed in C and is integrated into 3 linguistic platforms:- Unitex (and its industry-oriented version GramLab), a multiligual corpus processor available under the GNU LGPL v3 license (see also the Unitex/GramLab forum)
- Toposław, a Polish lexicographic framework distributed under the GNU GLP v3 license,
- LeXimir, a Serbian tool for lexical resource management and query expansion (available from the author)
XMLCorrector
XMLCorrector is an implementation of an algorithm allowing to correct an XML document with respect to schema constraints expressed as a DTD. Namely, given a well-formed XML document t seen as a tree, a schema S and a non negative threshold th the algorithm finds every tree t' valid with respect to S such that the edit distance between t and t' is no bigger than th. The algorithm is based on a recursive exploration of the finite-state automata representing structural constraints imposed by the schema, as well as on the construction of an edit distance matrix storing edit sequences leading to correction candidate trees.
The implementation is released under the GNU LGPL v3 license.