The Natural Language Processing group has been involved in human language technology research (primarily information extraction (IE)) since 1998, and by now, it has become one of the leading workshops of Hungarian computational linguistics. The Group is engaged in processing Hungarian and English texts. Its general objective is to develop language-independent or easily adaptable technologies.
IE aims at obtaining useful information, correlations automatically from running texts. It proves most expedient when lengthy texts are to be scanned in order to find the necessary information. In the case of a suitably sized training database, the amount of human labor to be expended can be radically reduced by means of machine learning algorithms. Applications comprise the following important fields: information extraction from business news, biological publications, medical reports and from the Internet (forums, blogs, etc.). With the development of the manually annotated Szeged Corpus and TreeBank, as well as the Hungarian WordNet, SzegedNE and other corpora it has become possible to apply machine learning based methods for the syntactic and semantic analysis of Hungarian texts.
Senior and junior researchers, Ph.D. students, programmers and linguistic experts enrich the team with their knowledge. The group has more than 100 relevant international publications, it participated in about 20 text processing related national RTD programs and delivered several industrial R&D solutions.
The Group has organised
The head of the group: János CSIRIK
Tel: +36-62-544126, +36-62-546396
Fax: +36 62 546737
Address: Szeged, 6720, Árpád tér 2.
The members of the Human Language Technology group are:
CSIRIK János, full professor, the head of the group
FARKAS Richárd, senior researcher
VINCZE Veronika, senior researcher
BEREND Gábor, junior research fellow
HANGYA Viktor, PhD student
SZÁNTÓ Zsolt, PhD student
SIMKÓ Katalin Ilona, PhD student
ZSIBRITA János, software developer