pFind–Alioth: A novel unrestricted database search algorithm to improve the interpretation of high-resolution MS/MS data

Bibliographic Details
Published in Journal of Proteomics, Vol. 125, pp. 89-97
Main Authors Chi, Hao; He, Kun; Yang, Bing; Chen, Zhen; Sun, Rui-Xiang; Fan, Sheng-Bo; Zhang, Kun; Liu, Chao; Yuan, Zuo-Fei; Wang, Quan-Hui; Liu, Si-Qi; Dong, Meng-Qiu; He, Si-Min
Format Journal Article
Language English
Published Netherlands, 1 July 2015
ISSN 1874-3919
EISSN 1876-7737
DOI 10.1016/j.jprot.2015.05.009

Abstract Database search is the dominant approach in high-throughput proteomic analysis. However, the interpretation rate of MS/MS spectra is very low in such a restricted mode, mainly because of unexpected modifications and irregular digestion types. In this study, we developed a new algorithm called Alioth, to be integrated into the pFind search engine, for fast and accurate unrestricted database search on high-resolution MS/MS data. An ion index is constructed for both peptide precursors and fragment ions, which allows arbitrary digestions and a single site of any modification or mutation to be searched efficiently. A new re-ranking algorithm is used to distinguish correct peptide-spectrum matches from random ones. The algorithm was tested on several HCD datasets, and the interpretation rate of MS/MS spectra using Alioth is as high as 60%-80%. Peptides from semi- and non-specific digestions, as well as those with unexpected modifications or mutations, can be effectively identified using Alioth and confidently validated using other search engines. The average processing speed of Alioth is 5-10 times faster than that of some other unrestricted search engines and is comparable to or even faster than that of the restricted search algorithms tested.
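
Illustrative sketch (not taken from the paper): the abstract mentions an ion index over peptide precursors and fragment ions. The Python below shows one generic way a fragment-ion index can support an open search, assuming singly charged b and y ions, a fixed bin width, and a tiny residue-mass table. All names here (`build_index`, `query`, `mass_delta`, `BIN_WIDTH`) are hypothetical and are not pFind or Alioth identifiers; the real index layout and scoring are not described in this record.

```python
from collections import defaultdict

# Monoisotopic residue masses in Da for a small subset of amino acids.
RESIDUE_MASS = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276,
    "V": 99.06841, "T": 101.04768, "L": 113.08406, "K": 128.09496,
    "E": 129.04259, "R": 156.10111,
}
WATER = 18.01056
PROTON = 1.00728
BIN_WIDTH = 0.02  # assumed fragment tolerance in Da, for illustration only


def peptide_mass(seq):
    """Neutral monoisotopic mass of an unmodified peptide."""
    return sum(RESIDUE_MASS[a] for a in seq) + WATER


def b_ions(seq):
    """Singly charged b-ion m/z values (N-terminal fragments)."""
    out, prefix = [], 0.0
    for a in seq[:-1]:
        prefix += RESIDUE_MASS[a]
        out.append(prefix + PROTON)
    return out


def y_ions(seq):
    """Singly charged y-ion m/z values (C-terminal fragments)."""
    out, suffix = [], 0.0
    for a in reversed(seq[1:]):
        suffix += RESIDUE_MASS[a]
        out.append(suffix + WATER + PROTON)
    return out


def to_bin(mz):
    """Map an m/z value to an integer bin of width BIN_WIDTH."""
    return int(round(mz / BIN_WIDTH))


def build_index(peptides):
    """Inverted index: fragment m/z bin -> set of peptide ids."""
    index = defaultdict(set)
    for pid, seq in enumerate(peptides):
        for mz in b_ions(seq) + y_ions(seq):
            index[to_bin(mz)].add(pid)
    return index


def query(index, peaks):
    """Count matched peaks per peptide, checking neighbouring bins."""
    counts = defaultdict(int)
    for mz in peaks:
        b = to_bin(mz)
        hits = set()
        for nb in (b - 1, b, b + 1):
            hits |= index.get(nb, set())
        for pid in hits:
            counts[pid] += 1
    # Best-matching peptides first; ties broken by peptide id.
    return sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))


def mass_delta(observed_mass, seq):
    """Precursor mass difference, read as one unknown modification."""
    return observed_mass - peptide_mass(seq)
```

In this sketch a peptide modified on its first residue still produces unshifted y ions, so it can be retrieved from the index without knowing the modification in advance, and the precursor mass difference then suggests the mass of the single unknown modification. For example, querying with `y_ions("PEPTK")` against an index built from `["PEPTK", "GASVR", "LEAK"]` ranks `PEPTK` first.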
Copyright Copyright © 2015 Elsevier B.V. All rights reserved.
Keywords High resolution MS/MS; Unrestricted database search; Ion index; In-depth interpretation
PMID 25979774
Subject Terms Algorithms; data collection; Databases, Protein; ions; Mass Spectrometry; mutation; peptides; proteomics; Sequence Analysis, Protein - methods; tandem mass spectrometry