dc.contributor.advisor | Willassen, Nils Peder | |
dc.contributor.author | Klemetsen, Terje | |
dc.date.accessioned | 2021-12-01T10:24:56Z | |
dc.date.available | 2021-12-01T10:24:56Z | |
dc.date.issued | 2021-12-10 | |
dc.description.abstract | This thesis introduces the MAR databases as marine-specific resources in the genomic landscape. Paper 1 describes the curation effort and development leading to the MAR databases being created. It results in the highly valued reference database MarRef, the broader MarDB, and the marine gene catalog MarCat. Definition of a marine environment, the curation process, and the Marine Metagenomics Portal as a public web-service are described. It facilitates scientists to find marine sequence data for prokaryotes and to explore rich contextual information, secondary metabolites, updated taxonomy, and helps in evaluating genome quality. Many of these database advancements are covered in Paper 2. This includes new entries and development of specific databases on marine fungi (MarFun) and salmon related prokaryotes (SalDB). With the implementation of metagenome assembled and single amplified genomes it leads up to the database quality evaluation discussed in Paper 3. The lack of quality control in primary databases is here discussed based on estimated completeness and contamination in the genomes of the MAR databases.
Paper 4 explores the microbiota of skin and gut mucosa of Atlantic salmon. By using a database dependent amplicon analysis, the full-length 16 rRNA gene proved accurate, but not a game-changer in taxonomic classification for this environmental niche. The proportion of dataset sequences lacking clear taxonomic classification suggests lack of diversity in current-day databases and inadequate phylogenetic resolution. Advancing phylogenetic resolution was the subject of Paper 5. Here the highly similar species of genus Aliivibrio became delineated using six genes in a multilocus sequence analysis. Five potentially novel species could in this way be delineated, which coincided with recent genome-wide taxonomy listings. Thus, Paper 4 and 5 parallel those of the MAR databases by providing insight into the inter-relational framework of bioinformatic analysis and marine database sources. | en_US |
dc.description.doctoraltype | ph.d. | en_US |
dc.description.popularabstract | The process of sequencing and digitally storing DNA from sampled microbiota is happening on a progressively larger scale. With a multitude of institutions submitting their data to public archives, primary databases struggle to facilitate the use of descriptive contextual data – preventing the selection of desired sequence data. The development and maintenance of the MAR databases presented in this work explain how biocuration and best practices in data hosting has been used to create databases for marine sequence data. Enriched with contextual information and analyzed by state-of-the-art tools, scientists can access high-value marine resources that can aid in the discovery of novel molecules and help provide data for meta-analysis. In parallel, the MAR databases are discussed in context with amplicon analysis of Atlantic salmon and the phylogenetic revision of genus Aliivibrio. These studies reflect on the need to update databases and make use of the latest knowledge and technology. | en_US |
dc.description.sponsorship | ELIXIR-EXCELERATE, ELIXIR-CONVERGE | en_US |
dc.identifier.isbn | 978-82-8236-461-4 (pdf) | |
dc.identifier.uri | https://hdl.handle.net/10037/23232 | |
dc.language.iso | eng | en_US |
dc.publisher | UiT Norges arktiske universitet | en_US |
dc.publisher | UiT The Arctic University of Norway | en_US |
dc.relation.haspart | <p>Paper 1: Klemetsen, T., Raknes, I.A., Fu, J., Agafonov, A., Balasundaram, S.V., Tartari, G., ... Willassen, N.P. (2018). The MAR databases: development and implementation of databases specific for marine metagenomics. <i>Nucleic Acids Research, 46</i>(D1), D692-D699. Also available in Munin at <a href=https://hdl.handle.net/10037/13567>https://hdl.handle.net/10037/13567</a>.
<p>Paper 2: Klemetsen, T., Fu, J., Agafonov, A., Balasundaram, S.V., Robertsen, E. & Willassen, N.P. The MAR databases: A manually curated resource for marine microbial genomics and metagenomics. (Submitted manuscript).
<p>Paper 3: Klemetsen, T., Robertsen, M.E. & Willassen, N.P. A substantial quality assessment of prokaryotic genomes in the MAR databases reveals an urgent need for submission quality control. (Submitted manuscript).
<p>Paper 4: Klemetsen, T., Willassen, N.P. & Karlsen, C.R. (2019). Full-length 16S rRNA gene classification of Atlantic salmon bacteria and effects of using different 16S variable regions on community structure analysis. <i>MicrobiologyOpen, 8</i>(10), e898. Also available in Munin at <a href=https://hdl.handle.net/10037/15948>https://hdl.handle.net/10037/15948</a>.
<p>Paper 5: Klemetsen, T., Karlsen, C.R. & Willassen, N.P. (2021). Phylogenetic Revision of the Genus Aliivibrio: Intra- and Inter-Species Variance Among Clusters Suggest a Wider Diversity of Species. <i>Frontiers in Microbiology, 12</i>, 626759. Also available at <a href=https://doi.org/10.3389/fmicb.2021.626759>https://doi.org/10.3389/fmicb.2021.626759</a>. | en_US |
dc.relation.isbasedon | Sequence data used in this research is described in detail by the included articles and are hosted publicly by the International Nucleotide Sequence Database Collaboration. | en_US |
dc.rights.accessRights | openAccess | en_US |
dc.rights.holder | Copyright 2021 The Author(s) | |
dc.rights.uri | https://creativecommons.org/licenses/by-nc-sa/4.0 | en_US |
dc.rights | Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0) | en_US |
dc.subject | VDP::Mathematics and natural science: 400::Basic biosciences: 470::Bioinformatics: 475 | en_US |
dc.subject | VDP::Mathematics and natural science: 400::Basic biosciences: 470::Genetics and genomics: 474 | en_US |
dc.subject | VDP::Mathematics and natural science: 400::Information and communication science: 420::Databases and multimedia systems: 428 | en_US |
dc.title | Developing the MAR databases – Augmenting Genomic Versatility of Sequenced Marine Microbiota | en_US |
dc.type | Doctoral thesis | en_US |
dc.type | Doktorgradsavhandling | en_US |