Show simple item record

dc.contributor.authorGlusman, Gustavo
dc.contributor.authorEl-Gewely, M. Raafat
dc.contributor.authorQin, Shizhen
dc.contributor.authorSiegel, Andrew F.
dc.contributor.authorRoach, Jared C.
dc.contributor.authorHood, Leroy
dc.contributor.authorSmit, Arian F.A.
dc.date.accessioned2007-05-16T10:35:08Z
dc.date.available2007-05-16T10:35:08Z
dc.date.issued2006-03
dc.description.abstractThe identification and characterization of the complete ensemble of genes is a main goal of deciphering the digital information stored in the human genome. Many algorithms for computational gene prediction have been described, ultimately derived from two basic concepts: (1) modeling gene structure and (2) recognizing sequence similarity. Successful hybrid methods combining these two concepts have also been developed. We present a third orthogonal approach to gene prediction, based on detecting the genomic signatures of transcription, accumulated over evolutionary time. We discuss four algorithms based on this third concept: Greens and CHOWDER, which quantify mutational strand biases caused by transcription-coupled DNA repair, and ROAST and PASTA, which are based on strand-specific selection against polyadenylation signals. We combined these algorithms into an integrated method called FEAST, which we used to predict the location and orientation of thousands of putative transcription units not overlapping known genes. Many of the newly predicted transcriptional units do not appear to code for proteins. The new algorithms are particularly apt at detecting genes with long introns and lacking sequence conservation. They therefore complement existing gene prediction methods and will help identify functional transcripts within many apparent ‘‘genomic deserts.’’en
dc.format.extent2648061 bytes
dc.format.mimetypeapplication/pdf
dc.identifier.urihttps://hdl.handle.net/10037/964
dc.identifier.urnURN:NBN:no-uit_munin_792
dc.language.isoengen
dc.publisherPublic Library of Scienceen
dc.relation.ispartofseriesPloS computational biology, 2(2006)nr 3, pp 160-173en
dc.rights.accessRightsopenAccess
dc.subjectVDP::Medisinske fag: 700::Basale medisinske, odontologiske og veterinærmedisinske fag: 710::Medisinsk genetikk: 714en
dc.titleA third approach to gene prediction suggests thousands of additional human transcribed regionsen
dc.typeJournal articleen
dc.typeTidsskriftartikkelen
dc.typePeer reviewed


File(s) in this item

Thumbnail
Thumbnail

This item appears in the following collection(s)

Show simple item record