• The GiellaLT infrastructure: A multilingual infrastructure for rule-based NLP 

      Nørstebø Moshagen, Sjur; Pirinen, Flammie; Antonsen, Lene; Gaup, Børre; Mikkelsen, Inga Lill Sigga; Trosterud, Trond; Wiechetek, Linda; Hiovain-Asikainen, Katri (Journal article; Tidsskriftartikkel; Peer reviewed, 2023)
      This article gives an overview of the GiellaLT infrastructure, the main parts of it, and how it has been and can be used to support a large number of indigenous and minority languages, from keyboards to speech technology and advanced proofing tools. A special focus is given to languages with few or non-existing digital resources, and it is shown that many tools useful to the daily digital life of ...
    • Mii *eai leat gal vuollánan – Vi *ha neimen ikke gitt opp 

      Wiechetek, Linda; Pirinen, Flammie; Gaup, Børre; Argese, Chiara; Omma, Thomas (Journal article; Tidsskriftartikkel; Peer reviewed, 2022-08-30)
      Machine learning is the dominating paradigm in natural language processing nowadays. It requires vast amounts of manually annotated or synthetically generated text data. In the GiellaLT infrastructure, on the other hand, we have worked with rule-based methods, where the linguistis have full control over the development the tools. In this article we uncover the myth of machine learning being cheaper ...
    • Recent advances in Apertium, a free/open-source rule-based machine translation platform for low-resource languages 

      Khanna, Tanmai; Washington, Jonathan North; Tyers, Francis Morton; Bayatlı, Sevilay; Swanson, Daniel; Pirinen, Flammie; Tang, Irene; Alos i Font, Héctor (Journal article; Tidsskriftartikkel; Peer reviewed, 2021-10-08)
      This paper presents an overview of Apertium, a free and open-source rule-based machine translation platform. Translation in Apertium happens through a pipeline of modular tools, and the platform continues to be improved as more language pairs are added. Several advances have been implemented since the last publication, including some new optional modules: a module that allows rules to process recursive ...
    • You can’t suggest that?! Comparisons and improvements of speller error models 

      Pirinen, Flammie; Moshagen, Sjur Nørstebø; Kaalep, Heiki-Jaan (Journal article; Tidsskriftartikkel; Peer reviewed, 2022-08-30)
      In this article, we study correction of spelling errors, specifically on how the spelling errors are made and how can we model them computationally in order to fix them. The article describes two different approaches to generating spelling correction suggestions for three Uralic languages: Estonian, North Sámi and South Sámi. The first approach of modelling spelling errors is rule-based, where ...