Exploring the functional landscape of gene expression: directed search of large microarray compendia

Motivation: The increasing availability of gene expression microarray technology has resulted in the publication of thousands of microarray gene expression datasets investigating various biological conditions. This vast repository is still underutilized due to the lack of methods for fast, accurate...

Full description

Saved in:
Bibliographic Details
Published inBioinformatics Vol. 23; no. 20; pp. 2692 - 2699
Main Authors Hibbs, Matthew A., Hess, David C., Myers, Chad L., Huttenhower, Curtis, Li, Kai, Troyanskaya, Olga G.
Format Journal Article
LanguageEnglish
Published Oxford Oxford University Press 15.10.2007
Oxford Publishing Limited (England)
Subjects
Online AccessGet full text
ISSN1367-4803
1367-4811
1460-2059
1367-4811
DOI10.1093/bioinformatics/btm403

Cover

More Information
Summary:Motivation: The increasing availability of gene expression microarray technology has resulted in the publication of thousands of microarray gene expression datasets investigating various biological conditions. This vast repository is still underutilized due to the lack of methods for fast, accurate exploration of the entire compendium. Results: We have collected Saccharomyces cerevisiae gene expression microarray data containing roughly 2400 experimental conditions. We analyzed the functional coverage of this collection and we designed a context-sensitive search algorithm for rapid exploration of the compendium. A researcher using our system provides a small set of query genes to establish a biological search context; based on this query, we weight each dataset's relevance to the context, and within these weighted datasets we identify additional genes that are co-expressed with the query set. Our method exhibits an average increase in accuracy of 273% compared to previous mega-clustering approaches when recapitulating known biology. Further, we find that our search paradigm identifies novel biological predictions that can be verified through further experimentation. Our methodology provides the ability for biological researchers to explore the totality of existing microarray data in a manner useful for drawing conclusions and formulating hypotheses, which we believe is invaluable for the research community. Availability: Our query-driven search engine, called SPELL, is available at http://function.princeton.edu/SPELL Contact: ogt@genomics.princeton.edu Supplementary information: Several additional data files, figures and discussions are available at http://function.princeton.edu/SPELL/supplement
Bibliography:istex:79DDF8FE52A44B1EFB38C144515105E0213A6B83
To whom correspondence should be addressed.
Associate Editor: David Rocke
ark:/67375/HXZ-5LZN9PTR-L
ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
content type line 23
ISSN:1367-4803
1367-4811
1460-2059
1367-4811
DOI:10.1093/bioinformatics/btm403