University of Szeged Natural Language Processing Group Hungarian Academy of Sciences

Downloads

Software

magyarlanc: a toolkit for the basic linguistic processing of Hungarian

A Named Entity Recognition tool for English and Hungarian

A HTML annotation tool (Firefox extension)

TextAnnotator: a tool for the linguistic annotation of natural language texts

A CRF-based tool for identifying light verb constructions in English texts

Language Resources

The Szeged Treebank

The Szeged Dependency Treebank

The Hungarian WordNet

Corpora for uncertainty detection

Hungarian Named Entity corpora

Named Entity lemmatization database

Corpora of multiword expressions

Hungarian word sense disambiguated corpus

The affiliation HTML corpus

The SzegedParalell English-Hungarian corpus

The HunOr Russian-Hungarian parallel corpus

Hungarian forum corpus for Opinion Mining

A dataset for opinionated keyphrase extraction

HunLearner, a learners' corpus of Hungarian