Avoiding the pitfalls of gene set enrichment analysis with SetRank

Background The purpose of gene set enrichment analysis (GSEA) is to find general trends in the huge lists of genes or proteins generated by many functional genomics techniques and bioinformatics analyses. Results Here we present SetRank, an advanced GSEA algorithm which is able to eliminate many fal...

Full description

Saved in:
Bibliographic Details
Published inBMC bioinformatics Vol. 18; no. 1; p. 151
Main Authors Simillion, Cedric, Liechti, Robin, Lischer, Heidi E.L., Ioannidis, Vassilios, Bruggmann, Rémy
Format Journal Article
LanguageEnglish
Published London BioMed Central 04.03.2017
Springer Nature B.V
Subjects
Online AccessGet full text
ISSN1471-2105
1471-2105
DOI10.1186/s12859-017-1571-6

Cover

More Information
Summary:Background The purpose of gene set enrichment analysis (GSEA) is to find general trends in the huge lists of genes or proteins generated by many functional genomics techniques and bioinformatics analyses. Results Here we present SetRank, an advanced GSEA algorithm which is able to eliminate many false positive hits. The key principle of the algorithm is that it discards gene sets that have initially been flagged as significant, if their significance is only due to the overlap with another gene set. The algorithm is explained in detail and its performance is compared to that of other methods using objective benchmarking criteria. Furthermore, we explore how sample source bias can affect the results of a GSEA analysis. Conclusions The benchmarking results show that SetRank is a highly specific tool for GSEA. Furthermore, we show that the reliability of results can be improved by taking sample source bias into account. SetRank is available as an R package and through an online web interface.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
content type line 23
ISSN:1471-2105
1471-2105
DOI:10.1186/s12859-017-1571-6