ub.xmlui.mirage2.page-structure.muninLogoub.xmlui.mirage2.page-structure.openResearchArchiveLogo
    • EnglishEnglish
    • norsknorsk
  • Velg spraaknorsk 
    • EnglishEnglish
    • norsknorsk
  • Administrasjon/UB
Vis innførsel 
  •   Hjem
  • Fakultet for humaniora, samfunnsvitenskap og lærerutdanning
  • Institutt for språk og kultur
  • Artikler, rapporter og annet (språk og kultur)
  • Vis innførsel
  •   Hjem
  • Fakultet for humaniora, samfunnsvitenskap og lærerutdanning
  • Institutt for språk og kultur
  • Artikler, rapporter og annet (språk og kultur)
  • Vis innførsel
JavaScript is disabled for your browser. Some features of this site may not work without it.

Recent advances in Apertium, a free/open-source rule-based machine translation platform for low-resource languages

Permanent lenke
https://hdl.handle.net/10037/22990
DOI
https://doi.org/10.1007/s10590-021-09260-6
Thumbnail
Åpne
article.pdf (1.256Mb)
Publisert versjon (PDF)
Dato
2021-10-08
Type
Journal article
Tidsskriftartikkel
Peer reviewed

Forfatter
Khanna, Tanmai; Washington, Jonathan North; Tyers, Francis Morton; Bayatlı, Sevilay; Swanson, Daniel; Pirinen, Flammie; Tang, Irene; Alos i Font, Héctor
Sammendrag
This paper presents an overview of Apertium, a free and open-source rule-based machine translation platform. Translation in Apertium happens through a pipeline of modular tools, and the platform continues to be improved as more language pairs are added. Several advances have been implemented since the last publication, including some new optional modules: a module that allows rules to process recursive structures at the structural transfer stage, a module that deals with contiguous and discontiguous multi-word expressions, and a module that resolves anaphora to aid translation. Also highlighted is the hybridisation of Apertium through statistical modules that augment the pipeline, and statistical methods that augment existing modules. This includes morphological disambiguation, weighted structural transfer, and lexical selection modules that learn from limited data. The paper also discusses how a platform like Apertium can be a critical part of access to language technology for so-called low-resource languages, which might be ignored or deemed unapproachable by popular corpus-based translation technologies. Finally, the paper presents some of the released and unreleased language pairs, concluding with a brief look at some supplementary Apertium tools that prove valuable to users as well as language developers. All Apertium-related code, including language data, is free/open-source and available at https://github.com/apertium.
Forlag
Springer
Sitering
Khanna, Washington, Tyers, Bayatlı, Swanson, Pirinen, Tang, Alos i Font. Recent advances in Apertium, a free/open-source rule-based machine translation platform for low-resource languages. Machine Translation. 2021
Metadata
Vis full innførsel
Samlinger
  • Artikler, rapporter og annet (språk og kultur) [1472]
Copyright 2021 The Author(s)

Bla

Bla i hele MuninEnheter og samlingerForfatterlisteTittelDatoBla i denne samlingenForfatterlisteTittelDato
Logg inn

Statistikk

Antall visninger
UiT

Munin bygger på DSpace

UiT Norges Arktiske Universitet
Universitetsbiblioteket
uit.no/ub - munin@ub.uit.no

Tilgjengelighetserklæring