January 31, 2011
The TUTtoPENN converter was developed within the
Turin University Treebank
(TUT) project to convert the TUT native format into the format of the
Penn Treebank adapted to Italian (called TUT-Penn). It can also convert
into a set of other constituency-based formats, each corresponding to one step of the conversion.
These formats are described on the
TUT web site and in the publications listed below.
The TUTtoPENN converter is licensed under the GNU General Public License (Version 3), and it
should be downloaded and used in accordance with this licence.
Download the TUTtoPENN converter
Converter user manual: this
text document briefly describes the organization and implementation of the converter and explains
how to use the tool to obtain the various formats.
- C. Bosco. Multiple-step treebank conversion: from dependency to Penn format. In
Proceedings of the Linguistic Annotation Workshop (LAW) at ACL'07, Prague, Czech Republic, 2007
- C. Bosco. Linguistic knowledge extraction from corpus parallel annotations.
In Proceedings of the XL Congresso della Società di Linguistica Italiana, Vercelli,
Italy, 2006, pdf-zip
- C. Bosco, V. Lombardo.
Comparing linguistic information in treebank annotations. In Proceedings
of LREC'06, Genova, Italy, 2006, ps-zip