Normative language work in the age of machine learning
Author
Trosterud, TrondAbstract
Neural nets have, during the last few years, given us both an improved Google Translate, better search algorithms, better speech technology and doubtless many other things. The approach dominates current language technology to the extent that no other approach is visible. Being data driven, the hidden assumption behind this approach when used in proofing tools is that the language is used correctly in the text material, in other words, usage equals the norm. Although this approach is able to provide useful help for the largest languages, it leads to some serious problems. For indigenous and often also for other minority languages, the assumption does not hold. The written norm is weakly established and cannot be reliably found in usage. For normative bodies responsible for defining the written norm of a given language, usage-based proofing tools will not be able to implement the explicit norm they have defined. The present article discusses the current trend within proofing tools and looks at some alternatives.
Description
Source at https://efnil.org/documents/publications/.
Publisher
EFNILCitation
Trosterud T: Normative language work in the age of machine learning. In: Željko, Kirchmeier. The Role of National Language Institutions in the Digital Age, 2021. Hungarian Research Centre for Linguistics p. 61-70Metadata
Show full item recordCollections
Copyright 2021 The Author(s)