Comparison and evaluation of pathway-level aggregation methods of gene expression data

Cited 17 times in Scopus

Title: Comparison and evaluation of pathway-level aggregation methods of gene expression data

Author(s): Seungwoo Hwang

Bibliographic Citation: BMC Genomics, vol. 13, no. Suppl 7, p. S26

Publication Year: 2012

Abstract: Microarray experiments produce expression measurements on a genomic scale. One way to derive functional understanding from the data is to focus on functional sets of genes, such as pathways, instead of individual genes. The common practice for pathway-level analysis has been functional enrichment analysis, such as over-representation analysis and gene set enrichment analysis, but an alternative approach has also been explored. In this approach, gene expression data are first aggregated at the pathway level, transforming the original data into a compact representation in which each row corresponds to a pathway instead of a gene. The pathway expression data can then be used for differential expression and classification analyses in pathway space, leveraging existing algorithms usually applied to gene expression data. Although several studies have proposed pathway-level aggregation methods, it remains unclear how they compare with one another, since earlier evaluations were limited in scope. This study therefore presents a comprehensive evaluation of the six most prominent aggregation methods. The compared methods include five existing methods--mean of all member genes (Mean all), mean of condition-responsive genes (Mean CORGs), analysis of sample set enrichment scores (ASSESS), principal component analysis (PCA), and partial least squares (PLS)--and a variant of an existing method (Mean top 50%, which averages the top half of member genes). Comprehensive and stringent benchmarking was performed by collecting seven pairs of related but independent datasets encompassing various phenotypes. Aggregation was done in the space of KEGG pathways. Performance of the methods was assessed by classification accuracy, validated both internally and externally, and by examining the extent to which pathway signatures correlate between the paired datasets.
The assessment revealed that (i) the best accuracy and correlation were obtained from ASSESS and Mean top 50%, (ii) Mean all showed the lowest accuracy, and (iii) Mean CORGs and PLS gave rise to the largest discordance in the pathway signature correlation. The two best-performing methods (ASSESS and Mean top 50%) are therefore recommended. The benchmarking analysis also suggests that there is both room and need for developing a novel method for pathway-level aggregation.
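
The aggregation step described in the abstract can be illustrated with a minimal sketch of the two simplest methods, Mean all and Mean top 50%. The abstract does not state how the "top half" of member genes is ranked, so the sketch assumes ranking by absolute Welch t-statistic between two sample classes; that ranking, the function names, and the toy data are illustrative assumptions, not the paper's implementation.

```python
import numpy as np

def mean_all(expr, members):
    """Mean all: average every member gene of a pathway, per sample."""
    return expr[members].mean(axis=0)

def welch_t(expr, labels):
    """Per-gene Welch t-statistic between class 0 and class 1 samples."""
    a, b = expr[:, labels == 0], expr[:, labels == 1]
    se2 = a.var(axis=1, ddof=1) / a.shape[1] + b.var(axis=1, ddof=1) / b.shape[1]
    return (b.mean(axis=1) - a.mean(axis=1)) / np.sqrt(se2)

def mean_top_half(expr, members, labels):
    """Mean top 50%: average the top half of member genes.

    Assumption: genes are ranked by |Welch t|; the abstract does not
    specify the ranking criterion.
    """
    members = np.asarray(members)
    t = np.abs(welch_t(expr[members], labels))
    k = max(1, int(np.ceil(len(members) / 2)))
    top = members[np.argsort(-t)[:k]]
    return expr[top].mean(axis=0)

# Toy data (hypothetical): 4 genes (rows) x 4 samples (columns).
expr = np.array([
    [0.0, 1.0, 10.0, 11.0],   # strongly differential
    [5.0, 6.0, 5.0, 6.0],     # not differential
    [1.0, 2.0, 3.0, 4.0],     # moderately differential
    [3.0, 1.0, 2.0, 4.0],     # weakly differential
])
labels = np.array([0, 0, 1, 1])           # two phenotype classes
pathways = {"toy_pathway": [0, 1, 2, 3]}  # pathway name -> gene row indices

# One row per pathway instead of one row per gene.
pathway_expr = {
    name: {
        "mean_all": mean_all(expr, genes),
        "mean_top_half": mean_top_half(expr, genes, labels),
    }
    for name, genes in pathways.items()
}
```

In this toy case, Mean top 50% keeps only the two most differential genes, so the class difference is less diluted than under Mean all. Note that ranking genes with class labels must be done on training samples only when the pathway scores feed a classifier, and that genes with zero variance in both classes would need separate handling in `welch_t`.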

ISSN: 1471-2164

Publisher: Springer-BMC

Full Text Link: http://dx.doi.org/10.1186/1471-2164-13-S7-S26

Type: Article

Appears in Collections: Journal Articles
