Show simple item record

dc.contributor.advisorTrosterud, Trond
dc.contributor.authorAntonsen, Lene
dc.date.accessioned2018-06-15T12:43:22Z
dc.date.available2018-06-15T12:43:22Z
dc.date.issued2018-06-26
dc.description.abstract<p>Nákkosgirjjis guorahalan leago grammatihkalaš giellamodealla vástádus sámegielaid ja eará davvimáilmmi eamiálbmotgielaid giellateknologalaš dárbbuide? Grámmatihkalaš giellamodealla lea huksejuvvon Finite state transduserin (FST:n). Suokkardalan hástalusaid mat čuožžilit go galgá hukset ja heivehit sámegielaid grammatihkalaš giellamodeallaid duohta giellamáilbmái.<p> <p>FST:in sáhttá modelleret morfofonologalaš rievddademiid ja analyseret teakstakorpusiid vaikko teavsttain lea ollu lingvisttalaš variašuvdna. FST:in sáhttá maiddái genereret sátnehámiid mat eai gávdno korpusis, muhto gávdnojit gielas. Syntávssalaš analysáhtor mii lea huksejuvvon Constraint Grammariin, ii gáibit stuorra teakstačoakkáldaga, ja lea gierdilis vuohki disambigueret morfologalaš analysaid gaskkas.<p> <p>Čájehan mo sáhttit ávkkástallat sámegielaid grámmatihkalaš giellamodeallain geavaheddjiidprográmmain. Grammatihkalaš gilkorat dahket vejolažžan addit metalingvisttalaš dieđuid ja máhcahaga geavaheddjiide. FST ferte heivehit nu ahte dat buorebut speadjalastá duohta giela. FST lahkonanvuogi čuolbma lea badjelmearálaš sátnehámiid genereren, ja dan dihte lea dehálaš gáržžidit giellamodealla. Go ráhkada giellateknologalaš reaidduid giellaoahpahallamii, de lea ávkkálaš dasa siskkildit giellaoahpahalli gaskagiela. Suokkardalan muhtun vugiid mo sáhttá gáržžidit ja viiddidit giellamodealla vai buorebut sulastahttá duohta giellamáilmmi. <p> <p>Čájehan ahte FST huksen vástida máŋgga davvimáilmmi eamiálbmotgiela dárbbuide oažžut giellateknologalaš reaidduid. FST lahkonanvuohki oktan buriin vuođđostruktuvrrain mainna ávkkástallá vuođđobarggus mii lea dahkkon sámegielaide, dahká alladásat geavaheddjiidprográmmaid olámuddui vehádatgielaide main lea rikkes morfologiija, vátna teakstakorpus ja unnán hállit.<p> <p><b>ENGLISH</b>:<p> In this thesis I investigate whether grammatical language modeling is an appropriate response to the need for language technology for the Saami languages ​​and other circumpolar indigenous languages. The grammatical language models under investigation are built as Finite state transducers (FST). I examine the challenges of building such grammatical models for Saami languages and adapting them to real-world linguistic issues.<p> <p>A finite state transducer (FST) makes it possible to model morphophonological alternations and analyse text in a corpus, even when there is considerable linguistic variation in the text. One can also generate word forms that are not found in the corpus, but exist in the language. A syntactic analyser based on Constraint Grammar does not require a large text corpus, and does robust disambiguation of multiple morphological analyses.<p> <p>I show how Saami grammatical language models can be implemented in various user programs. Grammatical tags make it possible to provide both metalinguistic information and immediate feedback to the user's input. It is necessary to adapt the FSTs to real language usage. The FST approach causes overgeneration, which is why it is important to limit the language model. Including the learner's interlanguage is also useful for language learning tools based on language technology. I have examined a number of ways both to limit and to expand the language models.<p> <p>I show that the construction of an FST is the key answer to the need for language technology tools for circumpolar indigenous languages. With the appropriate infrastructure available, which also makes it possible to port results achieved for the Saami language to other languages as well, the FST approach places advanced user applications with the reach of minority languages ​​with complex morphology, meagre corpus resources, and few speakers.<p>en_US
dc.description.doctoraltypeph.d.en_US
dc.description.popularabstractSamiske språk og andre sirkumpolare urfolksspråk har ikke de tekstressursene som kreves for å bruke statistiske metoder, som er grunnlaget for majoritetsspråkenes språkteknologi. Doktorgradsavhandlingen handler om hvordan man ved å bygge en språkmodell kan lage språkteknologiske verktøy for disse språkene. Avhandlingen behandler utfordringer i byggingen av slike modeller og ser på måter å tilpasse dem til den virkelige språkverdenen: Modellen skal dekke variasjonen i språket, men samtidig skal man unngå at den lager ord som ikke finnes i språket. Avhandlingen viser at språkmodellen kan brukes i programmer som ordretteprogram, ordbøker med grammatikk og språklæringsprogrammer. Denne typen språkmodeller (kalt FST), kombinert med en god og lett tilgjengelig infrastruktur, gjør det mulig å skape moderne språkteknologiske verktøy for minoritetsspråk med mye ordbøying og få talere.en_US
dc.identifier.urihttps://hdl.handle.net/10037/12884
dc.language.isosmien_US
dc.publisherUiT Norges arktiske universiteten_US
dc.publisherUiT The Arctic University of Norwayen_US
dc.rights.accessRightsopenAccessen_US
dc.rights.holderCopyright 2018 The Author(s)
dc.rights.urihttps://creativecommons.org/licenses/by-nc-sa/3.0en_US
dc.rightsAttribution-NonCommercial-ShareAlike 3.0 Unported (CC BY-NC-SA 3.0)en_US
dc.subjectVDP::Humaniora: 000::Språkvitenskapelige fag: 010en_US
dc.subjectVDP::Humanities: 000::Linguistics: 010en_US
dc.subjectVDP::Humaniora: 000::Språkvitenskapelige fag: 010en_US
dc.subjectVDP::Humanities: 000::Linguistics: 010en_US
dc.titleSámegielaid modelleren – huksen ja heiveheapmi duohta giellamáilbmáien_US
dc.typeDoctoral thesisen_US
dc.typeDoktorgradsavhandlingen_US


File(s) in this item

Thumbnail
Thumbnail
Thumbnail
Thumbnail
Thumbnail
Thumbnail
Thumbnail
Thumbnail
Thumbnail
Thumbnail

This item appears in the following collection(s)

Show simple item record

Attribution-NonCommercial-ShareAlike 3.0 Unported (CC BY-NC-SA 3.0)
Except where otherwise noted, this item's license is described as Attribution-NonCommercial-ShareAlike 3.0 Unported (CC BY-NC-SA 3.0)