Normalization of gene expression data revisited: the three viewpoints of the transcriptome in human skeletal muscle undergoing load-induced hypertrophy and why they matter

Khan, Yusuf; Hammarström, Daniel; Ellefsen, Stian; Ahmad, Rafi

dc.contributor.author	Khan, Yusuf
dc.contributor.author	Hammarström, Daniel
dc.contributor.author	Ellefsen, Stian
dc.contributor.author	Ahmad, Rafi
dc.date.accessioned	2022-11-22T15:01:07Z
dc.date.available	2022-11-22T15:01:07Z
dc.date.issued	2022-06-18
dc.description.abstract	Background - The biological relevance and accuracy of gene expression data depend on the adequacy of data normalization. This is both due to its role in resolving and accounting for technical variation and errors, and its defining role in shaping the viewpoint of biological interpretations. Still, the choice of the normalization method is often not explicitly motivated although this choice may be particularly decisive for conclusions in studies involving pronounced cellular plasticity. In this study, we highlight the consequences of using three fundamentally different modes of normalization for interpreting RNA-seq data from human skeletal muscle undergoing exercise-training-induced growth. Briefly, 25 participants conducted 12 weeks of high-load resistance training. Muscle biopsy specimens were sampled from m. vastus lateralis before, after two weeks of training (week 2) and after the intervention (week 12), and were subsequently analyzed using RNA-seq. Transcript counts were modeled as (1) per-library-size, (2) per-total-RNA, and (3) per-sample-size (per-mg-tissue).<p> <p>Result - Initially, the three modes of transcript modeling led to the identification of three unique sets of stable genes, which displayed differential expression profiles. Specifically, genes showing stable expression across samples in the per-library-size dataset displayed training-associated increases in per-total-RNA and per-sample-size datasets. These gene sets were then used for normalization of the entire dataset, providing transcript abundance estimates corresponding to each of the three biological viewpoints (i.e., per-library-size, per-total-RNA, and per-sample-size). The different normalization modes led to different conclusions, measured as training-associated changes in transcript expression. Briefly, for 27% and 20% of the transcripts, training was associated with changes in expression in per-total-RNA and per-sample-size scenarios, but not in the per-library-size scenario. At week 2, this led to opposite conclusions for 4% of the transcripts between per-library-size and per-sample-size datasets (↑ vs. ↓, respectively).<p> <p>Conclusion - Scientists should be explicit with their choice of normalization strategies and should interpret the results of gene expression analyses with caution. This is particularly important for data sets involving a limited number of genes or involving growing or differentiating cellular models, where the risk of biased conclusions is pronounced.	en_US
dc.identifier.citation	Khan, Hammarström, Ellefsen, Ahmad. Normalization of gene expression data revisited: the three viewpoints of the transcriptome in human skeletal muscle undergoing load-induced hypertrophy and why they matter. BMC Bioinformatics. 2022;23(1)	en_US
dc.identifier.cristinID	FRIDAID 2050588
dc.identifier.doi	10.1186/s12859-022-04791-y
dc.identifier.issn	1471-2105
dc.identifier.uri	https://hdl.handle.net/10037/27482
dc.language.iso	eng	en_US
dc.publisher	BMC	en_US
dc.relation.journal	BMC Bioinformatics
dc.rights.accessRights	openAccess	en_US
dc.rights.holder	Copyright 2022 The Author(s)	en_US
dc.rights.uri	https://creativecommons.org/licenses/by/4.0	en_US
dc.rights	Attribution 4.0 International (CC BY 4.0)	en_US
dc.title	Normalization of gene expression data revisited: the three viewpoints of the transcriptome in human skeletal muscle undergoing load-induced hypertrophy and why they matter	en_US
dc.type.version	publishedVersion	en_US
dc.type	Journal article	en_US
dc.type	Tidsskriftartikkel	en_US
dc.type	Peer reviewed	en_US

File(s) in this item

Name:: article.pdf
Size:: 715.3Kb
Format:: PDF

View/Open

This item appears in the following collection(s)

Artikler, rapporter og annet (klinisk medisin) [1975]

Show simple item record

Except where otherwise noted, this item's license is described as Attribution 4.0 International (CC BY 4.0)