Access  |
Bwananet |
|
|
The Corpus Tècnic query interface. The Corpus
Tècnic contains written texts from the fields of
Law, Economy, Genomics, Medicine and Environment as well
as a contrastive corpus from the press. The languages
of the texts in the corpus are Catalan, Spanish, English,
French and German. |
 |
 |
Access  |
Tools
for Catalan and Spanish corpus processing
New! |
|
|
A package of tools for Catalan and Spanish corpus processing.
It includes a text handling module and a probabilistic
POS tagger. It also allows consulting POS tagger dictionary
data. |
 |
 |
 |
Treebank-IULA |
|
|
A new tool for the processing of the Corpus Tècnic
in Catalan, Spanish and English. |
 |
 |
Access  |
PALIC |
|
|
A package of tools for the processing of the Corpus
Tècnic in Catalan and Spanish. It includes a preprocessor,
a PosTagger and a linguistic disambiguator. |
 |
 |
 |
Desambigua |
|
|
A bank of disambiguation linguistic rules for Catalan
and Spanish. |
 |
 |
Access  |
Jaguar
New! |
|
|
A tool for statistical corpus exploitation. It offers
concordances, counts ngrams, extracts collocations and
gives association, distribution and similarity measures. |
 |
 |
 |
COLDIC |
|
|
A management tool from the collection of dictionaries
used for the linguistic processing of the Corpus Tècnic
and other projects of IULA. The dictionaries in electronic
format have either been lent by institutions collaborating
with IULA or developed by the different research groups
of IULA. |
 |
 |
 |
Syntactic
analyser for Spanish |
|
|
An open-source HPSG grammar for Spanish implemented
within the LKB system. |
 |
 |
Access  |
Alinea |
|
|
A tool for parallelizing translated texts, which has
been specially designed for specialized corpora and also
as a translation validator. |
 |
 |
 |
Wüska |
|
|
A package of tools for the automatic retrieval and classification
of specialized texts. |
 |
 |
Access  |
Poppins |
|
|
An experimental design of a document classifier with
supervised learning. |