You can’t suggest that?! Comparisons and improvements of speller error models

Pirinen, Flammie; Moshagen, Sjur Nørstebø; Kaalep, Heiki-Jaan

dc.contributor.author	Pirinen, Flammie
dc.contributor.author	Moshagen, Sjur Nørstebø
dc.contributor.author	Kaalep, Heiki-Jaan
dc.date.accessioned	2023-01-26T10:15:22Z
dc.date.available	2023-01-26T10:15:22Z
dc.date.issued	2022-08-30
dc.description.abstract	In this article, we study correction of spelling errors, specifically on how the spelling errors are made and how can we model them computationally in order to fix them. The article describes two different approaches to generating spelling correction suggestions for three Uralic languages: Estonian, North Sámi and South Sámi. The first approach of modelling spelling errors is rule-based, where experts write rules that describe the kind of errors are made, and these are compiled into finite-state automaton that models the errors. The second is data-based, where we show a machine learning algorithm a corpus of errors that humans have made, and it creates a neural network that can model the errors. Both approaches require collection of error corpora and understanding its contents; therefore we also describe the actual errors we have seen in detail. We find that while both approaches create error correction systems, with current resources the expert-build systems are still more reliable.	en_US
dc.identifier.citation	Pirinen, Moshagen, Kaalep. You can’t suggest that?! Comparisons and improvements of speller error models . Nordlyd. 2022
dc.identifier.cristinID	FRIDAID 2114174
dc.identifier.doi	10.7557/12.6349
dc.identifier.issn	0332-7531
dc.identifier.issn	1503-8599
dc.identifier.uri	https://hdl.handle.net/10037/28381
dc.language.iso	eng	en_US
dc.publisher	Septentrio Academic Publishing	en_US
dc.relation.journal	Nordlyd
dc.rights.holder	Copyright 2022 The Author(s)	en_US
dc.rights.uri	https://creativecommons.org/licenses/by-nc/4.0	en_US
dc.rights	Attribution-NonCommercial 4.0 International (CC BY-NC 4.0)	en_US
dc.title	You can’t suggest that?! Comparisons and improvements of speller error models	en_US
dc.type.version	publishedVersion	en_US
dc.type	Journal article	en_US
dc.type	Tidsskriftartikkel	en_US
dc.type	Peer reviewed	en_US

File(s) in this item

Name:: article.pdf
Size:: 211.7Kb
Format:: PDF

View/Open

This item appears in the following collection(s)

Artikler, rapporter og annet (språk og kultur) [1477]

Show simple item record

Except where otherwise noted, this item's license is described as Attribution-NonCommercial 4.0 International (CC BY-NC 4.0)