A multi-array multi-SNP genotyping algorithm for Affymetrix SNP microarrays
Motivation: Modern strategies for mapping disease loci require efficient genotyping of a large number of known polymorphic sites in the genome. The sensitive and high-throughput nature of hybridization-based DNA microarray technology provides an ideal platform for such an application by interrogatin...
        Saved in:
      
    
          | Published in | Bioinformatics Vol. 23; no. 12; pp. 1459 - 1467 | 
|---|---|
| Main Authors | , , , | 
| Format | Journal Article | 
| Language | English | 
| Published | 
        Oxford
          Oxford University Press
    
        15.06.2007
     Oxford Publishing Limited (England)  | 
| Subjects | |
| Online Access | Get full text | 
| ISSN | 1367-4803 1367-4811 1367-4811 1460-2059  | 
| DOI | 10.1093/bioinformatics/btm131 | 
Cover
| Abstract | Motivation: Modern strategies for mapping disease loci require efficient genotyping of a large number of known polymorphic sites in the genome. The sensitive and high-throughput nature of hybridization-based DNA microarray technology provides an ideal platform for such an application by interrogating up to hundreds of thousands of single nucleotide polymorphisms (SNPs) in a single assay. Similar to the development of expression arrays, these genotyping arrays pose many data analytic challenges that are often platform specific. Affymetrix SNP arrays, e.g. use multiple sets of short oligonucleotide probes for each known SNP, and require effective statistical methods to combine these probe intensities in order to generate reliable and accurate genotype calls. Results: We developed an integrated multi-SNP, multi-array genotype calling algorithm for Affymetrix SNP arrays, MAMS, that combines single-array multi-SNP (SAMS) and multi-array, single-SNP (MASS) calls to improve the accuracy of genotype calls, without the need for training data or computation-intensive normalization procedures as in other multi-array methods. The algorithm uses resampling techniques and model-based clustering to derive single array based genotype calls, which are subsequently refined by competitive genotype calls based on (MASS) clustering. The resampling scheme caps computation for single-array analysis and hence is readily scalable, important in view of expanding numbers of SNPs per array. The MASS update is designed to improve calls for atypical SNPs, harboring allele-imbalanced binding affinities, that are difficult to genotype without information from other arrays. Using a publicly available data set of HapMap samples from Affymetrix, and independent calls by alternative genotyping methods from the HapMap project, we show that our approach performs competitively to existing methods. Availability: R functions are available upon request from the authors. Contact: yxiao@itsa.ucsf.edu and rufang@biostat.ucsf.edu Supplementary information: Supplementary data are available at Bioinformatics online. | 
    
|---|---|
| AbstractList | Motivation: Modern strategies for mapping disease loci require efficient genotyping of a large number of known polymorphic sites in the genome. The sensitive and high-throughput nature of hybridization-based DNA microarray technology provides an ideal platform for such an application by interrogating up to hundreds of thousands of single nucleotide polymorphisms (SNPs) in a single assay. Similar to the development of expression arrays, these genotyping arrays pose many data analytic challenges that are often platform specific. Affymetrix SNP arrays, e.g. use multiple sets of short oligonucleotide probes for each known SNP, and require effective statistical methods to combine these probe intensities in order to generate reliable and accurate genotype calls. Results: We developed an integrated multi-SNP, multi-array genotype calling algorithm for Affymetrix SNP arrays, MAMS, that combines single-array multi-SNP (SAMS) and multi-array, single-SNP (MASS) calls to improve the accuracy of genotype calls, without the need for training data or computation-intensive normalization procedures as in other multi-array methods. The algorithm uses resampling techniques and model-based clustering to derive single array based genotype calls, which are subsequently refined by competitive genotype calls based on (MASS) clustering. The resampling scheme caps computation for single-array analysis and hence is readily scalable, important in view of expanding numbers of SNPs per array. The MASS update is designed to improve calls for atypical SNPs, harboring allele-imbalanced binding affinities, that are difficult to genotype without information from other arrays. Using a publicly available data set of HapMap samples from Affymetrix, and independent calls by alternative genotyping methods from the HapMap project, we show that our approach performs competitively to existing methods. Availability: R functions are available upon request from the authors. Contact: yxiao@itsa.ucsf.edu and rufang@biostat.ucsf.eduSupplementary information: Supplementary data are available at Bioinformatics online. Motivation: Modern strategies for mapping disease loci require efficient genotyping of a large number of known polymorphic sites in the genome. The sensitive and high-throughput nature of hybridization-based DNA microarray technology provides an ideal platform for such an application by interrogating up to hundreds of thousands of single nucleotide polymorphisms (SNPs) in a single assay. Similar to the development of expression arrays, these genotyping arrays pose many data analytic challenges that are often platform specific. Affymetrix SNP arrays, e.g. use multiple sets of short oligonucleotide probes for each known SNP, and require effective statistical methods to combine these probe intensities in order to generate reliable and accurate genotype calls. Results: We developed an integrated multi-SNP, multi-array genotype calling algorithm for Affymetrix SNP arrays, MAMS, that combines single-array multi-SNP (SAMS) and multi-array, single-SNP (MASS) calls to improve the accuracy of genotype calls, without the need for training data or computation-intensive normalization procedures as in other multi-array methods. The algorithm uses resampling techniques and model-based clustering to derive single array based genotype calls, which are subsequently refined by competitive genotype calls based on (MASS) clustering. The resampling scheme caps computation for single-array analysis and hence is readily scalable, important in view of expanding numbers of SNPs per array. The MASS update is designed to improve calls for atypical SNPs, harboring allele-imbalanced binding affinities, that are difficult to genotype without information from other arrays. Using a publicly available data set of HapMap samples from Affymetrix, and independent calls by alternative genotyping methods from the HapMap project, we show that our approach performs competitively to existing methods. Availability: R functions are available upon request from the authors. Contact: yxiao@itsa.ucsf.edu and rufang@biostat.ucsf.edu Supplementary information: Supplementary data are available at Bioinformatics online. Modern strategies for mapping disease loci require efficient genotyping of a large number of known polymorphic sites in the genome. The sensitive and high-throughput nature of hybridization-based DNA microarray technology provides an ideal platform for such an application by interrogating up to hundreds of thousands of single nucleotide polymorphisms (SNPs) in a single assay. Similar to the development of expression arrays, these genotyping arrays pose many data analytic challenges that are often platform specific. Affymetrix SNP arrays, e.g. use multiple sets of short oligonucleotide probes for each known SNP, and require effective statistical methods to combine these probe intensities in order to generate reliable and accurate genotype calls.MOTIVATIONModern strategies for mapping disease loci require efficient genotyping of a large number of known polymorphic sites in the genome. The sensitive and high-throughput nature of hybridization-based DNA microarray technology provides an ideal platform for such an application by interrogating up to hundreds of thousands of single nucleotide polymorphisms (SNPs) in a single assay. Similar to the development of expression arrays, these genotyping arrays pose many data analytic challenges that are often platform specific. Affymetrix SNP arrays, e.g. use multiple sets of short oligonucleotide probes for each known SNP, and require effective statistical methods to combine these probe intensities in order to generate reliable and accurate genotype calls.We developed an integrated multi-SNP, multi-array genotype calling algorithm for Affymetrix SNP arrays, MAMS, that combines single-array multi-SNP (SAMS) and multi-array, single-SNP (MASS) calls to improve the accuracy of genotype calls, without the need for training data or computation-intensive normalization procedures as in other multi-array methods. The algorithm uses resampling techniques and model-based clustering to derive single array based genotype calls, which are subsequently refined by competitive genotype calls based on (MASS) clustering. The resampling scheme caps computation for single-array analysis and hence is readily scalable, important in view of expanding numbers of SNPs per array. The MASS update is designed to improve calls for atypical SNPs, harboring allele-imbalanced binding affinities, that are difficult to genotype without information from other arrays. Using a publicly available data set of HapMap samples from Affymetrix, and independent calls by alternative genotyping methods from the HapMap project, we show that our approach performs competitively to existing methods.RESULTSWe developed an integrated multi-SNP, multi-array genotype calling algorithm for Affymetrix SNP arrays, MAMS, that combines single-array multi-SNP (SAMS) and multi-array, single-SNP (MASS) calls to improve the accuracy of genotype calls, without the need for training data or computation-intensive normalization procedures as in other multi-array methods. The algorithm uses resampling techniques and model-based clustering to derive single array based genotype calls, which are subsequently refined by competitive genotype calls based on (MASS) clustering. The resampling scheme caps computation for single-array analysis and hence is readily scalable, important in view of expanding numbers of SNPs per array. The MASS update is designed to improve calls for atypical SNPs, harboring allele-imbalanced binding affinities, that are difficult to genotype without information from other arrays. Using a publicly available data set of HapMap samples from Affymetrix, and independent calls by alternative genotyping methods from the HapMap project, we show that our approach performs competitively to existing methods.R functions are available upon request from the authors.AVAILABILITYR functions are available upon request from the authors. Motivation: Modern strategies for mapping disease loci require efficient genotyping of a large number of known polymorphic sites in the genome. The sensitive and high-throughput nature of hybridization-based DNA microarray technology provides an ideal platform for such an application by interrogating up to hundreds of thousands of single nucleotide polymorphisms (SNPs) in a single assay. Similar to the development of expression arrays, these genotyping arrays pose many data analytic challenges that are often platform specific. Affymetrix SNP arrays, e.g. use multiple sets of short oligonucleotide probes for each known SNP, and require effective statistical methods to combine these probe intensities in order to generate reliable and accurate genotype calls. Results: We developed an integrated multi-SNP, multi-array genotype calling algorithm for Affymetrix SNP arrays, MAMS, that combines single-array multi-SNP (SAMS) and multi-array, single-SNP (MASS) calls to improve the accuracy of genotype calls, without the need for training data or computation-intensive normalization procedures as in other multi-array methods. The algorithm uses resampling techniques and model-based clustering to derive single array based genotype calls, which are subsequently refined by competitive genotype calls based on (MASS) clustering. The resampling scheme caps computation for single-array analysis and hence is readily scalable, important in view of expanding numbers of SNPs per array. The MASS update is designed to improve calls for atypical SNPs, harboring allele-imbalanced binding affinities, that are difficult to genotype without information from other arrays. Using a publicly available data set of HapMap samples from Affymetrix, and independent calls by alternative genotyping methods from the HapMap project, we show that our approach performs competitively to existing methods. Availability: R functions are available upon request from the authors. Contact: yxiao@itsa.ucsf.edu and rufang@biostat.ucsf.edu Supplementary information: Supplementary data are available at Bioinformatics online. MOTIVATION: Modern strategies for mapping disease loci require efficient genotyping of a large number of known polymorphic sites in the genome. The sensitive and high-throughput nature of hybridization-based DNA microarray technology provides an ideal platform for such an application by interrogating up to hundreds of thousands of single nucleotide polymorphisms (SNPs) in a single assay. Similar to the development of expression arrays, these genotyping arrays pose many data analytic challenges that are often platform specific. Affymetrix SNP arrays, e.g. use multiple sets of short oligonucleotide probes for each known SNP, and require effective statistical methods to combine these probe intensities in order to generate reliable and accurate genotype calls. RESULTS: We developed an integrated multi-SNP, multi-array genotype calling algorithm for Affymetrix SNP arrays, MAMS, that combines single-array multi-SNP (SAMS) and multi-array, single-SNP (MASS) calls to improve the accuracy of genotype calls, without the need for training data or computation-intensive normalization procedures as in other multi-array methods. The algorithm uses resampling techniques and model-based clustering to derive single array based genotype calls, which are subsequently refined by competitive genotype calls based on (MASS) clustering. The resampling scheme caps computation for single-array analysis and hence is readily scalable, important in view of expanding numbers of SNPs per array. The MASS update is designed to improve calls for atypical SNPs, harboring allele-imbalanced binding affinities, that are difficult to genotype without information from other arrays. Using a publicly available data set of HapMap samples from Affymetrix, and independent calls by alternative genotyping methods from the HapMap project, we show that our approach performs competitively to existing methods. AVAILABILITY: R functions are available upon request from the authors. CONTACT: yxiaotsa.ucsf.edu and rufangiostat.ucsf.edu Supplementary information: Supplementary data are available at Bioinformatics online. Motivation: Modern strategies for mapping disease loci require efficient genotyping of a large number of known polymorphic sites in the genome. The sensitive and high-throughput nature of hybridization-based DNA microarray technology provides an ideal platform for such an application by interrogating up to hundreds of thousands of single nucleotide polymorphisms (SNPs) in a single assay. Similar to the development of expression arrays, these genotyping arrays pose many data analytic challenges that are often platform specific. Affymetrix SNP arrays, e.g. use multiple sets of short oligonucleotide probes for each known SNP, and require effective statistical methods to combine these probe intensities in order to generate reliable and accurate genotype calls. Results: We developed an integrated multi-SNP, multi-array genotype calling algorithm for Affymetrix SNP arrays, MAMS, that combines single-array multi-SNP (SAMS) and multi-array, single-SNP (MASS) calls to improve the accuracy of genotype calls, without the need for training data or computation-intensive normalization procedures as in other multi-array methods. The algorithm uses resampling techniques and model-based clustering to derive single array based genotype calls, which are subsequently refined by competitive genotype calls based on (MASS) clustering. The resampling scheme caps computation for single-array analysis and hence is readily scalable, important in view of expanding numbers of SNPs per array. The MASS update is designed to improve calls for atypical SNPs, harboring allele-imbalanced binding affinities, that are difficult to genotype without information from other arrays. Using a publicly available data set of HapMap samples from Affymetrix, and independent calls by alternative genotyping methods from the HapMap project, we show that our approach performs competitively to existing methods. Availability: R functions are available upon request from the authors. Contact: yxiao@itsa.ucsf.edu and rufang@biostat.ucsf.edu Supplementary information: Supplementary data are available at Bioinformatics online. Modern strategies for mapping disease loci require efficient genotyping of a large number of known polymorphic sites in the genome. The sensitive and high-throughput nature of hybridization-based DNA microarray technology provides an ideal platform for such an application by interrogating up to hundreds of thousands of single nucleotide polymorphisms (SNPs) in a single assay. Similar to the development of expression arrays, these genotyping arrays pose many data analytic challenges that are often platform specific. Affymetrix SNP arrays, e.g. use multiple sets of short oligonucleotide probes for each known SNP, and require effective statistical methods to combine these probe intensities in order to generate reliable and accurate genotype calls. We developed an integrated multi-SNP, multi-array genotype calling algorithm for Affymetrix SNP arrays, MAMS, that combines single-array multi-SNP (SAMS) and multi-array, single-SNP (MASS) calls to improve the accuracy of genotype calls, without the need for training data or computation-intensive normalization procedures as in other multi-array methods. The algorithm uses resampling techniques and model-based clustering to derive single array based genotype calls, which are subsequently refined by competitive genotype calls based on (MASS) clustering. The resampling scheme caps computation for single-array analysis and hence is readily scalable, important in view of expanding numbers of SNPs per array. The MASS update is designed to improve calls for atypical SNPs, harboring allele-imbalanced binding affinities, that are difficult to genotype without information from other arrays. Using a publicly available data set of HapMap samples from Affymetrix, and independent calls by alternative genotyping methods from the HapMap project, we show that our approach performs competitively to existing methods. R functions are available upon request from the authors.  | 
    
| Author | Yang, Y.H. Yeh, Ru-Fang Xiao, Yuanyuan Segal, Mark R.  | 
    
| Author_xml | – sequence: 1 givenname: Yuanyuan surname: Xiao fullname: Xiao, Yuanyuan organization: Department of Epidemiology and Biostatistics, Center for Bioinformatics and Molecular Biostatistics, University of California, 185 Berry Street, Lobby 4, Suite 5700, San Francisco, CA 94107, USA and School of Mathematics and Statistics, University of Sydney, NSW 2006, Australia – sequence: 2 givenname: Mark R. surname: Segal fullname: Segal, Mark R. organization: Department of Epidemiology and Biostatistics, Center for Bioinformatics and Molecular Biostatistics, University of California, 185 Berry Street, Lobby 4, Suite 5700, San Francisco, CA 94107, USA and School of Mathematics and Statistics, University of Sydney, NSW 2006, Australia – sequence: 3 givenname: Y.H. surname: Yang fullname: Yang, Y.H. organization: Department of Epidemiology and Biostatistics, Center for Bioinformatics and Molecular Biostatistics, University of California, 185 Berry Street, Lobby 4, Suite 5700, San Francisco, CA 94107, USA and School of Mathematics and Statistics, University of Sydney, NSW 2006, Australia – sequence: 4 givenname: Ru-Fang surname: Yeh fullname: Yeh, Ru-Fang organization: Department of Epidemiology and Biostatistics, Center for Bioinformatics and Molecular Biostatistics, University of California, 185 Berry Street, Lobby 4, Suite 5700, San Francisco, CA 94107, USA and School of Mathematics and Statistics, University of Sydney, NSW 2006, Australia  | 
    
| BackLink | http://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&idt=18906971$$DView record in Pascal Francis https://www.ncbi.nlm.nih.gov/pubmed/17459966$$D View this record in MEDLINE/PubMed  | 
    
| BookMark | eNqNkUtv1DAURi1URB_wE0AREuxC7fgVi9WoPIpalbdAbCzH4wwuiR1sRzT_Hg-JWtHNdOUr6xxf3-8egj3nnQHgMYIvEBT4uLHeutaHXiWr43GTeoTRPXCAMOMlqRHau64h3geHMV5CCCmk7AHYR5xQIRg7AGeroh-7ZEsVgpqW-vPFh2JjnE_TYN2mUN3GB5t-9kXuV6zadupNCvaq2HK91cH_k-NDcL9VXTSPlvMIfH3z-svJaXn-_u27k9V5qSmtUikQrIjAa2E0xAYqg2tGcAWRytcKc0brdg2pJqJBBhnctJRwSohYI10zgfARYPO7oxvU9Ed1nRyC7VWYJIJym478Px05p5PF57M4BP97NDHJ3kZtuk4548coOeQwf6XeCWJc1aKq4E4QCY5p5jL49BZ46cfgckyZqRmrMSEZerJAY9Ob9c1Uy7oy8GwBVNSqa4Ny2sYbrhaQCb4dlM5c3k2MwbR3DujlLU_blAnvUlC222nD2fbjcOeG5azYmMzVtaTCL8k45lSefv8hL16hj2fs26cc-18bN_E3 | 
    
| CODEN | BOINFP | 
    
| CitedBy_id | crossref_primary_10_1093_nar_gkp559 crossref_primary_10_1093_bioinformatics_btaa295 crossref_primary_10_1093_nar_gkp493 crossref_primary_10_1093_bioinformatics_btp470 crossref_primary_10_1093_bioinformatics_btn632 crossref_primary_10_1038_nrg2344 crossref_primary_10_1142_S0219720011005458 crossref_primary_10_1093_bioinformatics_btm443 crossref_primary_10_1093_bioinformatics_btu107 crossref_primary_10_1016_j_bbcan_2009_03_001 crossref_primary_10_1186_1479_7364_5_4_304 crossref_primary_10_1097_MOL_0b013e3282f5dd77 crossref_primary_10_3389_fgene_2022_963852 crossref_primary_10_1089_cmb_2007_0133 crossref_primary_10_1038_s41576_019_0127_1 crossref_primary_10_1007_s11464_011_0125_x crossref_primary_10_1186_1471_2105_9_S9_S17 crossref_primary_10_1093_biostatistics_kxp045 crossref_primary_10_1016_S1672_0229_08_60014_5 crossref_primary_10_1093_nar_gkq750 crossref_primary_10_1093_bioinformatics_btn147 crossref_primary_10_1186_1471_2105_10_68 crossref_primary_10_1371_journal_pone_0058677 crossref_primary_10_1093_bioinformatics_btn509 crossref_primary_10_3389_fgene_2021_736390  | 
    
| Cites_doi | 10.2307/2532201 10.1093/biostatistics/4.2.249 10.1093/bioinformatics/bti741 10.1093/bioinformatics/bti275 10.1111/j.2517-6161.1977.tb01600.x 10.1016/0377-0427(87)90125-7 10.1371/journal.pcbi.0010065 10.1093/nar/gnj027 10.1158/0008-5472.CAN-05-0465 10.1093/bioinformatics/btl341 10.1093/bioinformatics/btl536 10.1038/ng1416 10.1093/bioinformatics/btg332 10.1198/016214502760047131 10.1038/nature02168  | 
    
| ContentType | Journal Article | 
    
| Copyright | 2007 The Author(s) 2007 2008 INIST-CNRS 2007 The Author(s)  | 
    
| Copyright_xml | – notice: 2007 The Author(s) 2007 – notice: 2008 INIST-CNRS – notice: 2007 The Author(s)  | 
    
| DBID | BSCLL AAYXX CITATION IQODW CGR CUY CVF ECM EIF NPM 7QF 7QO 7QQ 7SC 7SE 7SP 7SR 7TA 7TB 7TM 7TO 7U5 8BQ 8FD F28 FR3 H8D H8G H94 JG9 JQ2 K9. KR7 L7M L~C L~D P64 7X8 ADTOC UNPAY  | 
    
| DOI | 10.1093/bioinformatics/btm131 | 
    
| DatabaseName | Istex CrossRef Pascal-Francis Medline MEDLINE MEDLINE (Ovid) MEDLINE MEDLINE PubMed Aluminium Industry Abstracts Biotechnology Research Abstracts Ceramic Abstracts Computer and Information Systems Abstracts Corrosion Abstracts Electronics & Communications Abstracts Engineered Materials Abstracts Materials Business File Mechanical & Transportation Engineering Abstracts Nucleic Acids Abstracts Oncogenes and Growth Factors Abstracts Solid State and Superconductivity Abstracts METADEX Technology Research Database ANTE: Abstracts in New Technology & Engineering Engineering Research Database Aerospace Database Copper Technical Reference Library AIDS and Cancer Research Abstracts Materials Research Database ProQuest Computer Science Collection ProQuest Health & Medical Complete (Alumni) Civil Engineering Abstracts Advanced Technologies Database with Aerospace Computer and Information Systems Abstracts  Academic Computer and Information Systems Abstracts Professional Biotechnology and BioEngineering Abstracts MEDLINE - Academic Unpaywall for CDI: Periodical Content Unpaywall  | 
    
| DatabaseTitle | CrossRef MEDLINE Medline Complete MEDLINE with Full Text PubMed MEDLINE (Ovid) Materials Research Database Oncogenes and Growth Factors Abstracts Technology Research Database Computer and Information Systems Abstracts – Academic Mechanical & Transportation Engineering Abstracts Nucleic Acids Abstracts ProQuest Computer Science Collection Computer and Information Systems Abstracts ProQuest Health & Medical Complete (Alumni) Materials Business File Aerospace Database Copper Technical Reference Library Engineered Materials Abstracts Biotechnology Research Abstracts AIDS and Cancer Research Abstracts Advanced Technologies Database with Aerospace ANTE: Abstracts in New Technology & Engineering Civil Engineering Abstracts Aluminium Industry Abstracts Electronics & Communications Abstracts Ceramic Abstracts METADEX Biotechnology and BioEngineering Abstracts Computer and Information Systems Abstracts Professional Solid State and Superconductivity Abstracts Engineering Research Database Corrosion Abstracts MEDLINE - Academic  | 
    
| DatabaseTitleList | Computer and Information Systems Abstracts CrossRef MEDLINE - Academic Engineering Research Database Materials Research Database MEDLINE  | 
    
| Database_xml | – sequence: 1 dbid: NPM name: PubMed url: https://proxy.k.utb.cz/login?url=http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed sourceTypes: Index Database – sequence: 2 dbid: EIF name: MEDLINE url: https://proxy.k.utb.cz/login?url=https://www.webofscience.com/wos/medline/basic-search sourceTypes: Index Database – sequence: 3 dbid: UNPAY name: Unpaywall url: https://proxy.k.utb.cz/login?url=https://unpaywall.org/ sourceTypes: Open Access Repository  | 
    
| DeliveryMethod | fulltext_linktorsrc | 
    
| Discipline | Biology | 
    
| EISSN | 1367-4811 1460-2059  | 
    
| EndPage | 1467 | 
    
| ExternalDocumentID | 10.1093/bioinformatics/btm131 1317473271 17459966 18906971 10_1093_bioinformatics_btm131 ark_67375_HXZ_ND1QK6WR_3  | 
    
| Genre | Research Support, Non-U.S. Gov't Journal Article  | 
    
| GroupedDBID | --- -E4 -~X .2P .DC .I3 0R~ 1TH 23N 2WC 4.4 48X 53G 5GY 5WA 70D AAIJN AAIMJ AAJKP AAJQQ AAKPC AAMDB AAMVS AAOGV AAPQZ AAPXW AAUQX AAVAP AAVLN ABEJV ABEUO ABGNP ABIXL ABNGD ABNKS ABPQP ABPTD ABQLI ABWST ABXVV ABZBJ ACGFS ACIWK ACPRK ACUFI ACUKT ACUXJ ACYTK ADBBV ADEYI ADEZT ADFTL ADGKP ADGZP ADHKW ADHZD ADMLS ADOCK ADPDF ADRDM ADRTK ADVEK ADYVW ADZTZ ADZXQ AECKG AEGPL AEJOX AEKKA AEKSI AELWJ AEMDU AENEX AENZO AEPUE AETBJ AEWNT AFFNX AFFZL AFGWE AFIYH AFOFC AFRAH AGINJ AGKEF AGQPQ AGQXC AGSYK AHMBA AHXPO AIJHB AJEEA AJEUX AKHUL AKWXX ALMA_UNASSIGNED_HOLDINGS ALTZX ALUQC AMNDL APIBT APWMN ARIXL ASPBG AVWKF AXUDD AYOIW AZFZN AZVOD BAWUL BAYMD BHONS BQDIO BQUQU BSCLL BSWAC BTQHN C1A C45 CAG CDBKE COF CS3 CZ4 DAKXR DIK DILTD DU5 D~K EBD EBS EE~ EJD EMOBN F5P F9B FEDTE FHSFR FLIZI FLUFQ FOEOM FQBLK GAUVT GJXCC GROUPED_DOAJ GX1 H13 H5~ HAR HVGLF HW0 HZ~ IOX J21 JXSIZ KAQDR KOP KQ8 KSI KSN M-Z MK~ ML0 N9A NGC NLBLG NMDNZ NOMLY NTWIH NU- NVLIB O0~ O9- OAWHX ODMLO OJQWA OK1 OVD OVEED P2P PAFKI PB- PEELM PQQKQ Q1. Q5Y R44 RD5 RNS ROL RPM RUSNO RW1 RXO SV3 TEORI TJP TLC TOX TR2 W8F WOQ X7H YAYTL YKOAZ YXANX ZKX ~91 ~KM AASNB ABQTQ ADRIX AFXEN BCRHZ M49 RIG ROX AAYXX CITATION .-4 .GJ ABEFU AI. AQDSO ATTQO ELUNK IQODW O~Y RNI RZF RZO VH1 ZGI CGR CUY CVF ECM EIF NPM 7QF 7QO 7QQ 7SC 7SE 7SP 7SR 7TA 7TB 7TM 7TO 7U5 8BQ 8FD F28 FR3 H8D H8G H94 JG9 JQ2 K9. KR7 L7M L~C L~D P64 482 ABJNI ROZ TN5 WH7 7X8 ADTOC UNPAY  | 
    
| ID | FETCH-LOGICAL-c552t-9102493d9ec03e0ae38643201a249a37658fd05c49b1e1e3bf5475449d1c86913 | 
    
| IEDL.DBID | UNPAY | 
    
| ISSN | 1367-4803 1367-4811  | 
    
| IngestDate | Tue Aug 19 18:54:10 EDT 2025 Fri Sep 05 07:29:51 EDT 2025 Fri Sep 05 13:39:57 EDT 2025 Tue Oct 07 09:22:36 EDT 2025 Fri Oct 03 11:10:26 EDT 2025 Wed Feb 19 01:43:46 EST 2025 Mon Jul 21 09:15:17 EDT 2025 Thu Apr 24 23:04:56 EDT 2025 Wed Oct 01 04:04:41 EDT 2025 Wed Aug 28 03:24:14 EDT 2024 Sat Sep 20 11:02:14 EDT 2025  | 
    
| IsDoiOpenAccess | true | 
    
| IsOpenAccess | true | 
    
| IsPeerReviewed | true | 
    
| IsScholarly | true | 
    
| Issue | 12 | 
    
| Keywords | Measurement Allelic imbalance DNA chip Genotype Gene expression Microarray Algorithm Clusterin Original document Computer program Single nucleotide polymorphism Bioinformatics Comparative study  | 
    
| Language | English | 
    
| License | CC BY 4.0 | 
    
| LinkModel | DirectLink | 
    
| MergedId | FETCHMERGED-LOGICAL-c552t-9102493d9ec03e0ae38643201a249a37658fd05c49b1e1e3bf5475449d1c86913 | 
    
| Notes | ark:/67375/HXZ-ND1QK6WR-3 istex:E91771B98284F9F3DD801AB42B23C30AD4C5F7A0 To whom correspondence should be addressed. Associate Editor: Chris Stoeckert ObjectType-Article-2 SourceType-Scholarly Journals-1 ObjectType-Feature-1 content type line 14 ObjectType-Article-1 ObjectType-Feature-2 content type line 23  | 
    
| OpenAccessLink | https://proxy.k.utb.cz/login?url=https://academic.oup.com/bioinformatics/article-pdf/23/12/1459/16857913/btm131.pdf | 
    
| PMID | 17459966 | 
    
| PQID | 198668344 | 
    
| PQPubID | 36124 | 
    
| PageCount | 9 | 
    
| ParticipantIDs | unpaywall_primary_10_1093_bioinformatics_btm131 proquest_miscellaneous_70706438 proquest_miscellaneous_33289220 proquest_miscellaneous_19735203 proquest_journals_198668344 pubmed_primary_17459966 pascalfrancis_primary_18906971 crossref_primary_10_1093_bioinformatics_btm131 crossref_citationtrail_10_1093_bioinformatics_btm131 oup_primary_10_1093_bioinformatics_btm131 istex_primary_ark_67375_HXZ_ND1QK6WR_3  | 
    
| ProviderPackageCode | CITATION AAYXX  | 
    
| PublicationCentury | 2000 | 
    
| PublicationDate | 2007-06-15 | 
    
| PublicationDateYYYYMMDD | 2007-06-15 | 
    
| PublicationDate_xml | – month: 06 year: 2007 text: 2007-06-15 day: 15  | 
    
| PublicationDecade | 2000 | 
    
| PublicationPlace | Oxford | 
    
| PublicationPlace_xml | – name: Oxford – name: England  | 
    
| PublicationTitle | Bioinformatics | 
    
| PublicationTitleAlternate | Bioinformatics | 
    
| PublicationYear | 2007 | 
    
| Publisher | Oxford University Press Oxford Publishing Limited (England)  | 
    
| Publisher_xml | – name: Oxford University Press – name: Oxford Publishing Limited (England)  | 
    
| References | Carvalho (2023041105083042600_) 2006 Rabbee (2023041105083042600_) 2006; 22 Liu (2023041105083042600_) 2003; 19 Iafrate (2023041105083042600_) 2004; 39 Rousseeuw (2023041105083042600_) 1987; 20 Nicolae (2023041105083042600_) 2006; 22 Irizarry (2023041105083042600_) 2003; 4 Fraley (2023041105083042600_) 2002; 97 Meaburn (2023041105083042600_) 2006; 34 Huber (2023041105083042600_) 2002; 1 Banfield (2023041105083042600_) 1993; 49 Dempster (2023041105083042600_) 1977; 39 Hua (2023041105083042600_) 2006; 23 Di (2023041105083042600_) 2005; 21 Affymetrix (2023041105083042600_) 2006 The International HapMap Consortium (2023041105083042600_) 2003; 426 Nannya (2023041105083042600_) 2005; 65 LaFramboise (2023041105083042600_) 2005; 1  | 
    
| References_xml | – volume: 1 start-page: 1 year: 2002 ident: 2023041105083042600_ article-title: Variance stabilization applied to microarray data calibration and to the quantification of differetial expression publication-title: Bioinformatics – volume-title: Technical report. year: 2006 ident: 2023041105083042600_ article-title: BRLMM: an improved genotype calling method for the genechip human mapping 500 k array set – volume: 49 start-page: 803 year: 1993 ident: 2023041105083042600_ article-title: Model-based gaussian and non-gaussian clustering publication-title: Biometrics doi: 10.2307/2532201 – volume: 4 start-page: 249 year: 2003 ident: 2023041105083042600_ article-title: Exploration, normalization, and summaries of high density oligonucleotide array probe level data publication-title: Biostatistics doi: 10.1093/biostatistics/4.2.249 – volume: 22 start-page: 7 year: 2006 ident: 2023041105083042600_ article-title: A genotype calling algorithm for affymetrix SNP arrays publication-title: Bioinformatics doi: 10.1093/bioinformatics/bti741 – volume: 21 start-page: 1958 year: 2005 ident: 2023041105083042600_ article-title: Dynamic model based algorithms for screening and genotyping over 100 k SNPs on oligonucleotide microarrays publication-title: Bioinformatics doi: 10.1093/bioinformatics/bti275 – volume: 39 start-page: 1 year: 1977 ident: 2023041105083042600_ article-title: Maximum likelihood from incomplete data via EM algorithm (with discussion) publication-title: J. R. Stat. Soc. B doi: 10.1111/j.2517-6161.1977.tb01600.x – volume: 20 start-page: 53 year: 1987 ident: 2023041105083042600_ article-title: Silhouettes: a graphical aid to the interpretation and validation of cluster analysis publication-title: J. Comput. Appl. Math doi: 10.1016/0377-0427(87)90125-7 – volume-title: Technical report. year: 2006 ident: 2023041105083042600_ article-title: Exploration, normalization, and genotype calls of high density oligonucleotide SNP array data – volume: 1 start-page: e65 year: 2005 ident: 2023041105083042600_ article-title: Allele-specific amplification in cancer revealed by SNP array analysis publication-title: PLoS Comput. Biol doi: 10.1371/journal.pcbi.0010065 – volume: 34 start-page: e28 year: 2006 ident: 2023041105083042600_ article-title: Genotyping pooled dna using 100 k SNP microarrays: A step towards genomewide association scans publication-title: Nucleic Acids Res doi: 10.1093/nar/gnj027 – volume: 65 start-page: 6071 year: 2005 ident: 2023041105083042600_ article-title: A robust algorithm for copy number detection for high-density oligonucleotide single nucleotide polymorphism genotyping arrays publication-title: Cancer Res doi: 10.1158/0008-5472.CAN-05-0465 – volume: 22 start-page: 1942 year: 2006 ident: 2023041105083042600_ article-title: GEL: a novel genotype calling algorithm using empirical likelihood publication-title: Bioinformatics doi: 10.1093/bioinformatics/btl341 – volume: 23 start-page: 57 year: 2006 ident: 2023041105083042600_ article-title: SNiPer- HD: improved genotype calling accuracy by an expectation-maximization algorithm for high-density SNP arrays publication-title: Bioinformatics doi: 10.1093/bioinformatics/btl536 – volume: 39 start-page: 949 year: 2004 ident: 2023041105083042600_ article-title: Detection of large-scale variation in the human genome publication-title: Nat. Genet doi: 10.1038/ng1416 – volume: 19 start-page: 2397 year: 2003 ident: 2023041105083042600_ article-title: Algorithms for large-scale genotyping microarrays publication-title: Bioinformatics doi: 10.1093/bioinformatics/btg332 – volume: 97 start-page: 611 year: 2002 ident: 2023041105083042600_ article-title: Model-based clustering, discriminant analysis, and density estimation publication-title: JASA doi: 10.1198/016214502760047131 – volume: 426 start-page: 789 year: 2003 ident: 2023041105083042600_ article-title: The international hapmap project publication-title: Nature doi: 10.1038/nature02168  | 
    
| SSID | ssj0005056 ssj0051444  | 
    
| Score | 2.0555067 | 
    
| Snippet | Motivation: Modern strategies for mapping disease loci require efficient genotyping of a large number of known polymorphic sites in the genome. The sensitive... Modern strategies for mapping disease loci require efficient genotyping of a large number of known polymorphic sites in the genome. The sensitive and... MOTIVATION: Modern strategies for mapping disease loci require efficient genotyping of a large number of known polymorphic sites in the genome. The sensitive...  | 
    
| SourceID | unpaywall proquest pubmed pascalfrancis crossref oup istex  | 
    
| SourceType | Open Access Repository Aggregation Database Index Database Enrichment Source Publisher  | 
    
| StartPage | 1459 | 
    
| SubjectTerms | Algorithms Alleles Bioinformatics Biological and medical sciences Cluster Analysis Computational Biology - methods Fundamental and applied biological sciences. Psychology General aspects Genotype Haplotypes Humans Mathematics in biology. Statistical analysis. Models. Metrology. Data processing in biology (general aspects) Models, Genetic Models, Statistical Oligonucleotide Array Sequence Analysis Polymorphism, Single Nucleotide Statistical methods  | 
    
| Title | A multi-array multi-SNP genotyping algorithm for Affymetrix SNP microarrays | 
    
| URI | https://api.istex.fr/ark:/67375/HXZ-ND1QK6WR-3/fulltext.pdf https://www.ncbi.nlm.nih.gov/pubmed/17459966 https://www.proquest.com/docview/198668344 https://www.proquest.com/docview/19735203 https://www.proquest.com/docview/33289220 https://www.proquest.com/docview/70706438 https://academic.oup.com/bioinformatics/article-pdf/23/12/1459/16857913/btm131.pdf  | 
    
| UnpaywallVersion | publishedVersion | 
    
| Volume | 23 | 
    
| hasFullText | 1 | 
    
| inHoldings | 1 | 
    
| isFullTextHit | |
| isPrint | |
| journalDatabaseRights | – providerCode: PRVAFT databaseName: Open Access Digital Library customDbUrl: eissn: 1367-4811 dateEnd: 99991231 omitProxy: true ssIdentifier: ssj0005056 issn: 1367-4811 databaseCode: KQ8 dateStart: 19960101 isFulltext: true titleUrlDefault: http://grweb.coalliance.org/oadl/oadl.html providerName: Colorado Alliance of Research Libraries – providerCode: PRVEBS databaseName: Inspec with Full Text customDbUrl: eissn: 1367-4811 dateEnd: 99991231 omitProxy: false ssIdentifier: ssj0005056 issn: 1367-4811 databaseCode: ADMLS dateStart: 19980101 isFulltext: true titleUrlDefault: https://www.ebsco.com/products/research-databases/inspec-full-text providerName: EBSCOhost – providerCode: PRVBFR databaseName: Free Medical Journals customDbUrl: eissn: 1367-4811 dateEnd: 20241102 omitProxy: true ssIdentifier: ssj0005056 issn: 1367-4811 databaseCode: DIK dateStart: 19960101 isFulltext: true titleUrlDefault: http://www.freemedicaljournals.com providerName: Flying Publisher – providerCode: PRVFQY databaseName: GFMER Free Medical Journals customDbUrl: eissn: 1367-4811 dateEnd: 20241102 omitProxy: true ssIdentifier: ssj0005056 issn: 1367-4811 databaseCode: GX1 dateStart: 19960101 isFulltext: true titleUrlDefault: http://www.gfmer.ch/Medical_journals/Free_medical.php providerName: Geneva Foundation for Medical Education and Research – providerCode: PRVAQN databaseName: PubMed Central customDbUrl: eissn: 1367-4811 dateEnd: 99991231 omitProxy: true ssIdentifier: ssj0005056 issn: 1367-4811 databaseCode: RPM dateStart: 20070101 isFulltext: true titleUrlDefault: https://www.ncbi.nlm.nih.gov/pmc/ providerName: National Library of Medicine – providerCode: PRVOVD databaseName: Journals@Ovid LWW All Open Access Journal Collection Rolling customDbUrl: eissn: 1367-4811 dateEnd: 99991231 omitProxy: true ssIdentifier: ssj0005056 issn: 1367-4811 databaseCode: OVEED dateStart: 20010101 isFulltext: true titleUrlDefault: http://ovidsp.ovid.com/ providerName: Ovid – providerCode: PRVASL databaseName: Oxford Journals Open Access Collection customDbUrl: eissn: 1367-4811 dateEnd: 99991231 omitProxy: true ssIdentifier: ssj0005056 issn: 1367-4811 databaseCode: TOX dateStart: 19850101 isFulltext: true titleUrlDefault: https://academic.oup.com/journals/ providerName: Oxford University Press – providerCode: PRVASL databaseName: Oxford Journals Open Access Collection customDbUrl: eissn: 1367-4811 dateEnd: 20220930 omitProxy: true ssIdentifier: ssj0005056 issn: 1367-4811 databaseCode: TOX dateStart: 19850101 isFulltext: true titleUrlDefault: https://academic.oup.com/journals/ providerName: Oxford University Press  | 
    
| link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwrV3db9MwED9trRAgxPdHGJQ8ICQe0sRx7CSPFTBVTJQxVtHxEjmJvU1r0ypNxcJfzzkfHZ2YGBJvUXKXxOfL-Wfl7ncArx3hcnSjxEociRsUHgdWTKSw0lAQqZSKeUWk_WnEh2Pv44RNtuCgrYURTVZ4vy1piE_nDYWopi22G3tai1TZLrWJaxOPhTbhAfNDQu24mBGK0SdV29DlDPF5B7rj0f7gqC7A8i0vqNolN8eEtGU9Ib38rPpeGwtWV9v-vC2Gu7MQS7Skqltg_Amj3oabq2whyh9iOv1t3dq9B8t2xHW6yll_VcT95OclMsj_a5L7cLeBueagVnoAWzJ7CDfqxpflI9gbmFUmoyXyXJTN8dfRvqkpY4tS13CZYno8z0-Lk5mJL2EOlCpnuvvXuanlZjqPsFJePobx7ofDd0OraetgJYy5BYZXTVNI01AmDpWOkDRAWIRAROBpgQGPBSpF9_FC9BoiaayYp2n6wpQkAccRPYFONs_kMzAR2ykZOIonjHsp7vQThDepk3Cl24fEqQFeO3dR0nCe69Yb06j-906jTVtGta0M6K_VFjXpx98U3lSOsZYW-ZnOmPNZNJx8j0bvyZc9_u0goga8xVm87k17G_51oRWEDg99FNhpHS5qIs8yImHAuR69Aa_WVzFk6P9AIpPzlRbxEXY79GoJSnEf7rrO1RI-LhU4a4EBT2tPv3g739OUP9wAe-361xvw83_W2IFbbf4mYS-gU-Qr-RJBYhH3YPvw86TXfPu_APoxa8k | 
    
| linkProvider | Unpaywall | 
    
| linkToUnpaywall | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwrV3db9MwED-NVggQ4vsjDEYeEBIPaeI4dpzHCpgqJqoxqCi8RE5iw7Q2rdJULPz1nPPR0YmJIfEWJXdJfL6cf1bufgfwwpM-RzdKndRTuEHhiXASoqSTRZIorXXCayLt92M-mgTvpmy6A0ddLYxss8IHXUlDcrxoKUQNbbHb2tNZZtr1qUt8lwQscgkXLIwIdZNyTihGn0xfgT5niM970J-MD4dfmgKs0AlE3S65PSakK-uJ6PlnNffaWrD6xvanXTHczaVcoSV10wLjTxj1Blxb50tZ_ZCz2W_r1v5tWHUjbtJVTgbrMhmkP8-RQf5fk9yBWy3MtYeN0l3YUfk9uNo0vqzuw8HQrjMZHVkUsmqPP44PbUMZW1amhsuWs2-L4rj8PrfxJeyh1tXcdP86tY3c3OQR1sqrBzDZf_vp9chp2zo4KWN-ieHV0BTSLFKpR5UnFRUIixCISDwtMeAxoTN0nyBCryGKJpoFhqYvykgqOI7oIfTyRa4eg43YTivhaZ4yHmS4008R3mReyrVpH5JkFgTd3MVpy3luWm_M4ubfO423bRk3trJgsFFbNqQff1N4WTvGRloWJyZjLmTxaPo1Hr8hHw7456OYWvAKZ_GyN93b8q8zLRF5PApRYLdzuLiNPKuYRIJzM3oLnm-uYsgw_4FkrhZrIxIi7PboxRKU4j7c972LJUJcKnDWhAWPGk8_e7swMJQ_3AJ34_qXG_CTf9bYhetd_iZhT6FXFmv1DEFimey1X_0veJNqrQ | 
    
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=A+multi-array+multi-SNP+genotyping+algorithm+for+Affymetrix+SNP+microarrays&rft.jtitle=Bioinformatics+%28Oxford%2C+England%29&rft.au=Xiao%2C+Yuanyuan&rft.au=Segal%2C+Mark+R&rft.au=Yang%2C+Y+H&rft.au=Yeh%2C+Ru-Fang&rft.date=2007-06-15&rft.issn=1367-4811&rft.eissn=1367-4811&rft.volume=23&rft.issue=12&rft.spage=1459&rft_id=info:doi/10.1093%2Fbioinformatics%2Fbtm131&rft.externalDBID=NO_FULL_TEXT | 
    
| thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1367-4803&client=summon | 
    
| thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1367-4803&client=summon | 
    
| thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1367-4803&client=summon |