A multi-array multi-SNP genotyping algorithm for Affymetrix SNP microarrays

Bibliographic Details
Published in Bioinformatics Vol. 23; no. 12; pp. 1459-1467
Main Authors Xiao, Yuanyuan, Segal, Mark R., Yang, Y.H., Yeh, Ru-Fang
Format Journal Article
Language English
Published Oxford Oxford University Press 15.06.2007
Oxford Publishing Limited (England)
ISSN 1367-4803
1367-4811
1460-2059
DOI 10.1093/bioinformatics/btm131

Abstract
Motivation: Modern strategies for mapping disease loci require efficient genotyping of a large number of known polymorphic sites in the genome. The sensitive and high-throughput nature of hybridization-based DNA microarray technology provides an ideal platform for such an application by interrogating up to hundreds of thousands of single nucleotide polymorphisms (SNPs) in a single assay. Similar to the development of expression arrays, these genotyping arrays pose many data analytic challenges that are often platform specific. Affymetrix SNP arrays, e.g. use multiple sets of short oligonucleotide probes for each known SNP, and require effective statistical methods to combine these probe intensities in order to generate reliable and accurate genotype calls.
Results: We developed an integrated multi-SNP, multi-array genotype calling algorithm for Affymetrix SNP arrays, MAMS, that combines single-array multi-SNP (SAMS) and multi-array, single-SNP (MASS) calls to improve the accuracy of genotype calls, without the need for training data or computation-intensive normalization procedures as in other multi-array methods. The algorithm uses resampling techniques and model-based clustering to derive single array based genotype calls, which are subsequently refined by competitive genotype calls based on (MASS) clustering. The resampling scheme caps computation for single-array analysis and hence is readily scalable, important in view of expanding numbers of SNPs per array. The MASS update is designed to improve calls for atypical SNPs, harboring allele-imbalanced binding affinities, that are difficult to genotype without information from other arrays. Using a publicly available data set of HapMap samples from Affymetrix, and independent calls by alternative genotyping methods from the HapMap project, we show that our approach performs competitively to existing methods.
Availability: R functions are available upon request from the authors.
Contact: yxiao@itsa.ucsf.edu and rufang@biostat.ucsf.edu
Supplementary information: Supplementary data are available at Bioinformatics online.
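Illustrative note: the abstract mentions model-based clustering of genotypes for one SNP across several arrays. The sketch below is a generic illustration of that idea and is not the authors' R code. It assumes (these are assumptions, not details from the record) that each array is summarized by an allele contrast (A - B) / (A + B), and that the three genotypes can be modelled as a three-component one-dimensional Gaussian mixture fitted by EM. All function names, starting values and the variance floor are hypothetical, and the resampling step of the single-array analysis is not shown.

```python
import math


def allele_contrast(a, b):
    """Relative allele signal in [-1, 1]: +1 is pure allele A, -1 is pure allele B."""
    return (a - b) / (a + b)


def _weighted_density(x, weight, mean, sd):
    """Mixture weight times the normal density at x."""
    z = (x - mean) / sd
    return weight * math.exp(-0.5 * z * z) / (sd * math.sqrt(2.0 * math.pi))


def fit_mixture(values, n_iter=50, min_sd=0.05):
    """Fit a three-component 1-D Gaussian mixture by EM (illustrative only)."""
    means = [-0.8, 0.0, 0.8]         # hypothetical starting centers: BB, AB, AA
    sds = [0.2, 0.2, 0.2]            # hypothetical starting spreads
    weights = [1 / 3, 1 / 3, 1 / 3]
    for _ in range(n_iter):
        # E-step: posterior probability of each cluster for each array.
        resp = []
        for x in values:
            dens = [_weighted_density(x, w, m, s)
                    for w, m, s in zip(weights, means, sds)]
            total = sum(dens)
            if total == 0.0:
                resp.append([1 / 3, 1 / 3, 1 / 3])  # guard against underflow
            else:
                resp.append([d / total for d in dens])
        # M-step: update weight, mean and spread of each cluster.
        for k in range(3):
            nk = sum(r[k] for r in resp)
            if nk < 1e-8:
                continue  # keep an empty cluster at its previous parameters
            weights[k] = nk / len(values)
            means[k] = sum(r[k] * x for r, x in zip(resp, values)) / nk
            var = sum(r[k] * (x - means[k]) ** 2
                      for r, x in zip(resp, values)) / nk
            sds[k] = max(math.sqrt(var), min_sd)  # floor avoids collapse
    return weights, means, sds


def call_genotypes(values):
    """Assign BB, AB or AA to each contrast by highest posterior cluster."""
    weights, means, sds = fit_mixture(values)
    # Label clusters by increasing mean contrast.
    order = sorted(range(3), key=lambda k: means[k])
    label_of = {k: lab for k, lab in zip(order, ("BB", "AB", "AA"))}
    calls = []
    for x in values:
        scores = [_weighted_density(x, w, m, s)
                  for w, m, s in zip(weights, means, sds)]
        calls.append(label_of[scores.index(max(scores))])
    return calls


# Made-up (A, B) intensity pairs for one SNP on six arrays.
intensities = [(100, 900), (120, 880), (500, 500),
               (520, 480), (900, 100), (870, 130)]
contrasts = [allele_contrast(a, b) for a, b in intensities]
print(call_genotypes(contrasts))
```

With fixed, symmetric starting centers the sketch cannot adapt to a SNP whose clusters are strongly shifted, which is the allele-imbalanced case the abstract singles out as hard; handling that would need data-driven initialization or a more flexible model.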
Author affiliations (Xiao, Yuanyuan; Segal, Mark R.; Yang, Y.H.; Yeh, Ru-Fang) Department of Epidemiology and Biostatistics, Center for Bioinformatics and Molecular Biostatistics, University of California, 185 Berry Street, Lobby 4, Suite 5700, San Francisco, CA 94107, USA and School of Mathematics and Statistics, University of Sydney, NSW 2006, Australia
CODEN BOINFP
ContentType Journal Article
Copyright 2007 The Author(s)
2008 INIST-CNRS
Discipline Biology
EISSN 1367-4811
1460-2059
Genre Research Support, Non-U.S. Gov't
Journal Article
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Issue 12
Keywords Measurement
Allelic imbalance
DNA chip
Genotype
Gene expression
Microarray
Algorithm
Clusterin
Original document
Computer program
Single nucleotide polymorphism
Bioinformatics
Comparative study
Language English
License CC BY 4.0
Notes Associate Editor: Chris Stoeckert
OpenAccessLink https://proxy.k.utb.cz/login?url=https://academic.oup.com/bioinformatics/article-pdf/23/12/1459/16857913/btm131.pdf
PMID 17459966
PageCount 9
References Carvalho (2023041105083042600_) 2006
Rabbee (2023041105083042600_) 2006; 22
Liu (2023041105083042600_) 2003; 19
Iafrate (2023041105083042600_) 2004; 39
Rousseeuw (2023041105083042600_) 1987; 20
Nicolae (2023041105083042600_) 2006; 22
Irizarry (2023041105083042600_) 2003; 4
Fraley (2023041105083042600_) 2002; 97
Meaburn (2023041105083042600_) 2006; 34
Huber (2023041105083042600_) 2002; 1
Banfield (2023041105083042600_) 1993; 49
Dempster (2023041105083042600_) 1977; 39
Hua (2023041105083042600_) 2006; 23
Di (2023041105083042600_) 2005; 21
Affymetrix (2023041105083042600_) 2006
The International HapMap Consortium (2023041105083042600_) 2003; 426
Nannya (2023041105083042600_) 2005; 65
LaFramboise (2023041105083042600_) 2005; 1
References_xml – volume: 1
  start-page: 1
  year: 2002
  ident: 2023041105083042600_
  article-title: Variance stabilization applied to microarray data calibration and to the quantification of differetial expression
  publication-title: Bioinformatics
– volume-title: Technical report.
  year: 2006
  ident: 2023041105083042600_
  article-title: BRLMM: an improved genotype calling method for the genechip human mapping 500 k array set
– volume: 49
  start-page: 803
  year: 1993
  ident: 2023041105083042600_
  article-title: Model-based gaussian and non-gaussian clustering
  publication-title: Biometrics
  doi: 10.2307/2532201
– volume: 4
  start-page: 249
  year: 2003
  ident: 2023041105083042600_
  article-title: Exploration, normalization, and summaries of high density oligonucleotide array probe level data
  publication-title: Biostatistics
  doi: 10.1093/biostatistics/4.2.249
– volume: 22
  start-page: 7
  year: 2006
  ident: 2023041105083042600_
  article-title: A genotype calling algorithm for Affymetrix SNP arrays
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/bti741
– volume: 21
  start-page: 1958
  year: 2005
  ident: 2023041105083042600_
  article-title: Dynamic model based algorithms for screening and genotyping over 100 k SNPs on oligonucleotide microarrays
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/bti275
– volume: 39
  start-page: 1
  year: 1977
  ident: 2023041105083042600_
  article-title: Maximum likelihood from incomplete data via the EM algorithm (with discussion)
  publication-title: J. R. Stat. Soc. B
  doi: 10.1111/j.2517-6161.1977.tb01600.x
– volume: 20
  start-page: 53
  year: 1987
  ident: 2023041105083042600_
  article-title: Silhouettes: a graphical aid to the interpretation and validation of cluster analysis
  publication-title: J. Comput. Appl. Math
  doi: 10.1016/0377-0427(87)90125-7
– volume-title: Technical report.
  year: 2006
  ident: 2023041105083042600_
  article-title: Exploration, normalization, and genotype calls of high density oligonucleotide SNP array data
– volume: 1
  start-page: e65
  year: 2005
  ident: 2023041105083042600_
  article-title: Allele-specific amplification in cancer revealed by SNP array analysis
  publication-title: PLoS Comput. Biol
  doi: 10.1371/journal.pcbi.0010065
– volume: 34
  start-page: e28
  year: 2006
  ident: 2023041105083042600_
  article-title: Genotyping pooled DNA using 100 k SNP microarrays: A step towards genomewide association scans
  publication-title: Nucleic Acids Res
  doi: 10.1093/nar/gnj027
– volume: 65
  start-page: 6071
  year: 2005
  ident: 2023041105083042600_
  article-title: A robust algorithm for copy number detection for high-density oligonucleotide single nucleotide polymorphism genotyping arrays
  publication-title: Cancer Res
  doi: 10.1158/0008-5472.CAN-05-0465
– volume: 22
  start-page: 1942
  year: 2006
  ident: 2023041105083042600_
  article-title: GEL: a novel genotype calling algorithm using empirical likelihood
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/btl341
– volume: 23
  start-page: 57
  year: 2006
  ident: 2023041105083042600_
  article-title: SNiPer-HD: improved genotype calling accuracy by an expectation-maximization algorithm for high-density SNP arrays
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/btl536
– volume: 39
  start-page: 949
  year: 2004
  ident: 2023041105083042600_
  article-title: Detection of large-scale variation in the human genome
  publication-title: Nat. Genet
  doi: 10.1038/ng1416
– volume: 19
  start-page: 2397
  year: 2003
  ident: 2023041105083042600_
  article-title: Algorithms for large-scale genotyping microarrays
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/btg332
– volume: 97
  start-page: 611
  year: 2002
  ident: 2023041105083042600_
  article-title: Model-based clustering, discriminant analysis, and density estimation
  publication-title: JASA
  doi: 10.1198/016214502760047131
– volume: 426
  start-page: 789
  year: 2003
  ident: 2023041105083042600_
  article-title: The International HapMap Project
  publication-title: Nature
  doi: 10.1038/nature02168
SSID ssj0005056
ssj0051444
Score 2.0555067
Snippet Motivation: Modern strategies for mapping disease loci require efficient genotyping of a large number of known polymorphic sites in the genome. The sensitive...
SourceID unpaywall
proquest
pubmed
pascalfrancis
crossref
oup
istex
SourceType Open Access Repository
Aggregation Database
Index Database
Enrichment Source
Publisher
StartPage 1459
SubjectTerms Algorithms
Alleles
Bioinformatics
Biological and medical sciences
Cluster Analysis
Computational Biology - methods
Fundamental and applied biological sciences. Psychology
General aspects
Genotype
Haplotypes
Humans
Mathematics in biology. Statistical analysis. Models. Metrology. Data processing in biology (general aspects)
Models, Genetic
Models, Statistical
Oligonucleotide Array Sequence Analysis
Polymorphism, Single Nucleotide
Statistical methods
Title A multi-array multi-SNP genotyping algorithm for Affymetrix SNP microarrays
URI https://api.istex.fr/ark:/67375/HXZ-ND1QK6WR-3/fulltext.pdf
https://www.ncbi.nlm.nih.gov/pubmed/17459966
https://www.proquest.com/docview/198668344
https://www.proquest.com/docview/19735203
https://www.proquest.com/docview/33289220
https://www.proquest.com/docview/70706438
https://academic.oup.com/bioinformatics/article-pdf/23/12/1459/16857913/btm131.pdf
UnpaywallVersion publishedVersion
Volume 23
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
journalDatabaseRights – providerCode: PRVAFT
  databaseName: Open Access Digital Library
  customDbUrl:
  eissn: 1367-4811
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0005056
  issn: 1367-4811
  databaseCode: KQ8
  dateStart: 19960101
  isFulltext: true
  titleUrlDefault: http://grweb.coalliance.org/oadl/oadl.html
  providerName: Colorado Alliance of Research Libraries
– providerCode: PRVEBS
  databaseName: Inspec with Full Text
  customDbUrl:
  eissn: 1367-4811
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0005056
  issn: 1367-4811
  databaseCode: ADMLS
  dateStart: 19980101
  isFulltext: true
  titleUrlDefault: https://www.ebsco.com/products/research-databases/inspec-full-text
  providerName: EBSCOhost
– providerCode: PRVBFR
  databaseName: Free Medical Journals
  customDbUrl:
  eissn: 1367-4811
  dateEnd: 20241102
  omitProxy: true
  ssIdentifier: ssj0005056
  issn: 1367-4811
  databaseCode: DIK
  dateStart: 19960101
  isFulltext: true
  titleUrlDefault: http://www.freemedicaljournals.com
  providerName: Flying Publisher
– providerCode: PRVFQY
  databaseName: GFMER Free Medical Journals
  customDbUrl:
  eissn: 1367-4811
  dateEnd: 20241102
  omitProxy: true
  ssIdentifier: ssj0005056
  issn: 1367-4811
  databaseCode: GX1
  dateStart: 19960101
  isFulltext: true
  titleUrlDefault: http://www.gfmer.ch/Medical_journals/Free_medical.php
  providerName: Geneva Foundation for Medical Education and Research
– providerCode: PRVAQN
  databaseName: PubMed Central
  customDbUrl:
  eissn: 1367-4811
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0005056
  issn: 1367-4811
  databaseCode: RPM
  dateStart: 20070101
  isFulltext: true
  titleUrlDefault: https://www.ncbi.nlm.nih.gov/pmc/
  providerName: National Library of Medicine
– providerCode: PRVOVD
  databaseName: Journals@Ovid LWW All Open Access Journal Collection Rolling
  customDbUrl:
  eissn: 1367-4811
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0005056
  issn: 1367-4811
  databaseCode: OVEED
  dateStart: 20010101
  isFulltext: true
  titleUrlDefault: http://ovidsp.ovid.com/
  providerName: Ovid
– providerCode: PRVASL
  databaseName: Oxford Journals Open Access Collection
  customDbUrl:
  eissn: 1367-4811
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0005056
  issn: 1367-4811
  databaseCode: TOX
  dateStart: 19850101
  isFulltext: true
  titleUrlDefault: https://academic.oup.com/journals/
  providerName: Oxford University Press
– providerCode: PRVASL
  databaseName: Oxford Journals Open Access Collection
  customDbUrl:
  eissn: 1367-4811
  dateEnd: 20220930
  omitProxy: true
  ssIdentifier: ssj0005056
  issn: 1367-4811
  databaseCode: TOX
  dateStart: 19850101
  isFulltext: true
  titleUrlDefault: https://academic.oup.com/journals/
  providerName: Oxford University Press
linkProvider Unpaywall
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=A+multi-array+multi-SNP+genotyping+algorithm+for+Affymetrix+SNP+microarrays&rft.jtitle=Bioinformatics+%28Oxford%2C+England%29&rft.au=Xiao%2C+Yuanyuan&rft.au=Segal%2C+Mark+R&rft.au=Yang%2C+Y+H&rft.au=Yeh%2C+Ru-Fang&rft.date=2007-06-15&rft.issn=1367-4811&rft.eissn=1367-4811&rft.volume=23&rft.issue=12&rft.spage=1459&rft_id=info:doi/10.1093%2Fbioinformatics%2Fbtm131&rft.externalDBID=NO_FULL_TEXT