Controlling the false-discovery rate by procedures adapted to the length bias of RNA-Seq

Cited 2 time in scopus
Metadata Downloads

Full metadata record

DC FieldValueLanguage
dc.contributor.authorT Y Yang-
dc.contributor.authorSeongmun Jeong-
dc.date.accessioned2018-04-19T05:18:57Z-
dc.date.available2018-04-19T05:18:57Z-
dc.date.issued2018-
dc.identifier.issn1226-3192-
dc.identifier.uri10.1016/j.jkss.2017.08.001ko
dc.identifier.urihttps://oak.kribb.re.kr/handle/201005/17725-
dc.description.abstractIn RNA-Seq experiments, the number of mapped reads for a given gene is proportional to its expression level and length. Because longer genes contribute more sequencible fragments than do shorter ones, it is expected that even if two genes have the same expression level, the longer gene will have a greater number of total reads. This characteristic creates a length bias such that the proportion of significant genes increases with the gene length. However, genes with a long length are not more biologically meaningful than genes with a short length. Therefore, the length bias should be properly corrected to determine the accurate list of significant genes in RNA-Seq. For this purpose, we proposed two multiple-testing procedures based on a weighted-FDR and a separate-FDR approach. These two methods use prior information on differential gene length while keeping the false-discovery rate (FDR) controlled at α. In the weighted-FDR controlling procedure, we incorporated prior weights for the length of each gene. These weights increase the power when the gene's length is short and decrease the power when its length is long. In the separate-FDR controlling procedure, we sequentially ordered all genes according to their lengths and then split these genes into two subgroups of short and long genes. The adaptive Benjamini?Hochberg procedure was then performed separately for each subgroup. The proposed procedures were compared with existing methods and evaluated in two numerical examples and one simulation study. We concluded that the weighted p-value procedure properly reduced the length bias of RNA-Seq-
dc.publisherSpringer-
dc.titleControlling the false-discovery rate by procedures adapted to the length bias of RNA-Seq-
dc.title.alternativeControlling the false-discovery rate by procedures adapted to the length bias of RNA-Seq-
dc.typeArticle-
dc.citation.titleJournal of Korean Statistical Society-
dc.citation.number1-
dc.citation.endPage23-
dc.citation.startPage13-
dc.citation.volume47-
dc.contributor.affiliatedAuthorSeongmun Jeong-
dc.contributor.alternativeName양태영-
dc.contributor.alternativeName정성문-
dc.identifier.bibliographicCitationJournal of Korean Statistical Society, vol. 47, no. 1, pp. 13-23-
dc.identifier.doi10.1016/j.jkss.2017.08.001-
dc.subject.keywordCommon weight-
dc.subject.keywordIndividual weight-
dc.subject.keywordLength bias-
dc.subject.keywordRNA-Seq-
dc.subject.keywordSeparate procedure-
dc.subject.keywordWeighted procedure-
dc.subject.localCommon weight-
dc.subject.localIndividual weight-
dc.subject.localLength bias-
dc.subject.localRNA-seq-
dc.subject.localRNA-Seq-
dc.subject.localSeparate procedure-
dc.subject.localWeighted procedure-
dc.description.journalClassY-
Appears in Collections:
1. Journal Articles > Journal Articles
Files in This Item:
  • There are no files associated with this item.


Items in OpenAccess@KRIBB are protected by copyright, with all rights reserved, unless otherwise indicated.