Book ; Online: Hybrid lemmatization in HuSpaCy
2023
Abstract: Lemmatization is still not a trivial task for morphologically rich languages. Previous studies showed that hybrid architectures usually work better for these languages and can yield great results. This paper presents a hybrid lemmatizer utilizing both a ... ...
Abstract | Lemmatization is still not a trivial task for morphologically rich languages. Previous studies showed that hybrid architectures usually work better for these languages and can yield great results. This paper presents a hybrid lemmatizer utilizing both a neural model, dictionaries and hand-crafted rules. We introduce a hybrid architecture along with empirical results on a widely used Hungarian dataset. The presented methods are published as three HuSpaCy models. Comment: published at the conference XIX. Magyar Sz\'am\'it\'og\'epes Nyelv\'eszeti Konferencia (XIX. Hungarian Computational Linguistics Conference) |
---|---|
Keywords | Computer Science - Computation and Language ; 68T50 ; I.2.7 |
Publishing date | 2023-06-13 |
Publishing country | us |
Document type | Book ; Online |
Database | BASE - Bielefeld Academic Search Engine (life sciences selection) |
Full text online
More links
Kategorien
Inter-library loan at ZB MED
Your chosen title can be delivered directly to ZB MED Cologne location if you are registered as a user at ZB MED Cologne.