Gene dispersion is the key determinant of the read count bias in differential expression analysis of RNA-seq data.

Citation data:

BMC genomics, ISSN: 1471-2164, Vol: 18, Issue: 1, Page: 408

Publication Year:
2017
Usage 48
Abstract Views 36
Full Text Views 10
Link-outs 2
Captures 32
Readers 30
Exports-Saves 2
Mentions 1
Blog Mentions 1
Social Media 6
Tweets 6
Repository URL:
http://scholarworks.unist.ac.kr/handle/201301/22265
PMID:
28545404
DOI:
10.1186/s12864-017-3809-0
Author(s):
Yoon, Sora; Nam, Dougu
Publisher(s):
Springer Nature; BIOMED CENTRAL LTD
Tags:
Biochemistry, Genetics and Molecular Biology; RNA-seq; Differential expression analysis; Read count bias; Gene length bias; Dispersion
Most Recent Tweet View All Tweets
Most Recent Blog Mention
article description
In differential expression analysis of RNA-sequencing (RNA-seq) read count data for two sample groups, it is known that highly expressed genes (or longer genes) are more likely to be differentially expressed which is called read count bias (or gene length bias). This bias had great effect on the downstream Gene Ontology over-representation analysis. However, such a bias has not been systematically analyzed for different replicate types of RNA-seq data.