State of the art in HLT for Croatian
language resources
- corpora (Institute of Linguistics, FF Zg)
- Croatian National Corpus (www.hnk.ffzg.hr) (MZT 130718)
- test version ᡃ Mw
- ᡖ Mw of contemporary Croatian by end of 2000
- towards 100 Mw in 2001/2002
- Croatian-English Parallel Corpus (www.hnk.ffzg.hr/hr-en)
- 3.5 Mw of translations
- aligned at sentence level
- Croatian-Slovene Parallel Corpus (www.hnk.ffzg.hr/hr-si)
- 1 Mw of translations (MZT 130821)
- in its initial stage
- dictionaries
- Croatian Morphological Lexicon
- ca. 30,000 headwords with generated word forms
- by end of 2000