Discretization of gene expression data revised

Gene expression measurements represent the most important source of biological data used to unveil the interaction and functionality of genes. In this regard, several data mining and machine learning algorithms have been proposed that require, in a number of cases, some kind of data discretization t...

Full description

Saved in:
Bibliographic Details
Published inBriefings in bioinformatics Vol. 17; no. 5; pp. 758 - 770
Main Authors Gallo, Cristian A., Cecchini, Rocio L., Carballido, Jessica A., Micheletto, Sandra, Ponzoni, Ignacio
Format Journal Article
LanguageEnglish
Published England 01.09.2016
Subjects
Online AccessGet full text
ISSN1467-5463
1477-4054
1477-4054
DOI10.1093/bib/bbv074

Cover

More Information
Summary:Gene expression measurements represent the most important source of biological data used to unveil the interaction and functionality of genes. In this regard, several data mining and machine learning algorithms have been proposed that require, in a number of cases, some kind of data discretization to perform the inference. Selection of an appropriate discretization process has a major impact on the design and outcome of the inference algorithms, as there are a number of relevant issues that need to be considered. This study presents a revision of the current state-of-the-art discretization techniques, together with the key subjects that need to be considered when designing or selecting a discretization approach for gene expression data.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:1467-5463
1477-4054
1477-4054
DOI:10.1093/bib/bbv074