The Croatian Valency Lexicon
of Verbs, Version 2.0008 (CROVALLEX 2.0008) is an attempt at a formal
description of the valency frames of Croatian verbs.
CROVALLEX 2.0008 was developed as part of the PhD thesis titled Approaches
to the Development of the Machine Lexicon for Croatian Language, written by Nives Mikelic Preradovic
and supervised by Prof. Dr. Sc. Damir Boras at the Department
of Information Sciences, Faculty of Humanities
and Social Sciences.
The Functional Generative Description (FGD), developed by the Czech linguist Petr Sgall and his collaborators since the 1960s, serves as the background theory for the description of valency frames of selected verbs in CROVALLEX 2.0008.
CROVALLEX 2.0008 contains roughly 1740 verbs. They were selected from the Croatian frequency dictionary, according to their number of occurrences.
The logical structure of the CROVALLEX data is described here. The structure is based on the Valency Lexicon of Czech Verbs (VALLEX 1.0), developed by Zdeněk Žabokrtský and Markéta Lopatková at the Institute of Formal and Applied Linguistics (ÚFAL), Faculty of Mathematics and Physics, Charles University in Prague, 2004.
If you have problems viewing any of the accented characters in the verbs, please download the appropriate font here.
The lexicon is available in the following formats:
The logical structure of CROVALLEX gives a short description of the lexicon. A more detailed description is available in the PhD thesis titled Approaches to the Development of the Machine Lexicon for Croatian Language, written by Nives Mikelic Preradovic.
Although considerable effort was spent on removing annotation errors, some still remain in the lexicon. If you find any, please report them to firstname.lastname@example.org. Any other comments are welcome as well.