Identification of Protein Coding Regions in the Human Genome by Quadratic Discriminant Analysis

A new method for predicting internal coding exons in genomic DNA sequences has been developed. This method is based on a prediction algorithm that uses the quadratic discriminant function for multivariate statistical pattern recognition. Substantial improvements have been made (with only 9 discrimin...

Full description

Saved in:
Bibliographic Details
Published inProceedings of the National Academy of Sciences - PNAS Vol. 94; no. 2; pp. 565 - 568
Main Author Zhang, M. Q.
Format Journal Article
LanguageEnglish
Published United States National Academy of Sciences of the United States of America 21.01.1997
National Acad Sciences
National Academy of Sciences
The National Academy of Sciences of the USA
Subjects
Online AccessGet full text
ISSN0027-8424
1091-6490
1091-6490
DOI10.1073/pnas.94.2.565

Cover

More Information
Summary:A new method for predicting internal coding exons in genomic DNA sequences has been developed. This method is based on a prediction algorithm that uses the quadratic discriminant function for multivariate statistical pattern recognition. Substantial improvements have been made (with only 9 discriminant variables) when compared with existing methods: HEXON [Solovyev, V. V., Salamov, A. A. & Lawrence, C. B. (1994) Nucleic Acids Res. 22, 5156-5163] (based on linear discriminant analysis) and GRAIL2 [Uberbacher, E. C. & Mural, R. J. (1991) Proc. Natl. Acad. Sci. USA 88, 11261-11265] (based on neural networks). A computer program called MZEF is freely available to the genome community and allows users to adjust prior probability and to output alternative overlapping exons.
Bibliography:SourceType-Scholarly Journals-1
ObjectType-Feature-1
content type line 14
ObjectType-Article-2
content type line 23
ObjectType-Article-1
ObjectType-Feature-2
James D. Watson, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY
ISSN:0027-8424
1091-6490
1091-6490
DOI:10.1073/pnas.94.2.565