Vis enkel innførsel

dc.contributor.authorMeng, Wei
dc.contributor.authorRay, Mithlesh Kumar
dc.contributor.authorFenton, Christopher Graham
dc.contributor.authorAnderssen, Endre
dc.contributor.authorPaulssen, Ruth H
dc.date.accessioned2023-10-06T06:13:20Z
dc.date.available2023-10-06T06:13:20Z
dc.date.issued2022
dc.description.abstract<p><i>Background -</i> Analysis of methylation data is dependent on two inputs. The total number of reads at any given cytosine site (coverage), and how many of those cytosines are methylated (methylation). Both the large number of sites and low coverage can cause problems in the analysis of methylation data. Likewise finding DMR is computationally and statistically challenging. Here we present a new method (MethyBlock) that highlights candidate regions by balancing co-variation and chromosomal distance. <p><i>Methods -</i> Raw data were processed by Bismark. Sample sites under a user-defined coverage were set to zero. Furthermore, sites with too many coverage zeros across all samples were removed. Data is divided into chromosomes, each chromosome divided into segments based on the distance to the neighboring CpG site. Relative methylation level data is calculated by dividing methylation counts by coverage counts. Missing values were imputed using K-nearest neighbor (KNN) per segment. Imputed segments were divided into blocks by balancing covariability and distance. Blocks were further filtered for outliers. The blocks and the imputed relative methylation matrix are kept for further analysis. <p><i>Results -</i> With imputation on a certain percentage of the samples, MethyBlock returned more base pairs but grouped into smaller regions than the DMRs generated from DMRseq. This may allow the finding of more specific blocks within large CpG regions. MethyBlock also returned a similar percentage of CpG islands, regulatory regions, and functional annotations. Comparison with DMRseq shows a similar percentage in UCSC hg38 overlapped regulatory regions. <p><i>Conclusions -</i> MethyBlock simplifies the downstream computational and statistical analysis by reducing the data to the region level. This method improves the power of statistical tests by reducing the impact of multiple testing.en_US
dc.descriptionPoster presented at the Norwegian Bioinformatics Days 2022, Sandvollen, 28-30 September 2022.en_US
dc.identifier.cristinIDFRIDAID 2082662
dc.identifier.urihttps://hdl.handle.net/10037/31476
dc.language.isoengen_US
dc.rights.accessRightsopenAccessen_US
dc.rights.holderCopyright 2022 The Author(s)en_US
dc.titleAnalysis of methylation data with imputation with blocks (MethylBlocks)en_US
dc.typeConference objecten_US
dc.typeKonferansebidragen_US


Tilhørende fil(er)

Thumbnail

Denne innførselen finnes i følgende samling(er)

Vis enkel innførsel