iFeature: a Python package and web server for features extraction and selection from protein and peptide sequences

Abstract Summary Structural and physiochemical descriptors extracted from sequence data have been widely used to represent sequences and predict structural, functional, expression and interaction profiles of proteins and peptides as well as DNAs/RNAs. Here, we present iFeature, a versatile Python-ba...

Full description

Saved in:
Bibliographic Details
Published inBioinformatics Vol. 34; no. 14; pp. 2499 - 2502
Main Authors Chen, Zhen, Zhao, Pei, Li, Fuyi, Leier, André, Marquez-Lago, Tatiana T, Wang, Yanan, Webb, Geoffrey I, Smith, A Ian, Daly, Roger J, Chou, Kuo-Chen, Song, Jiangning
Format Journal Article
LanguageEnglish
Published England Oxford University Press 15.07.2018
Subjects
Online AccessGet full text
ISSN1367-4803
1367-4811
1460-2059
1367-4811
DOI10.1093/bioinformatics/bty140

Cover

Abstract Abstract Summary Structural and physiochemical descriptors extracted from sequence data have been widely used to represent sequences and predict structural, functional, expression and interaction profiles of proteins and peptides as well as DNAs/RNAs. Here, we present iFeature, a versatile Python-based toolkit for generating various numerical feature representation schemes for both protein and peptide sequences. iFeature is capable of calculating and extracting a comprehensive spectrum of 18 major sequence encoding schemes that encompass 53 different types of feature descriptors. It also allows users to extract specific amino acid properties from the AAindex database. Furthermore, iFeature integrates 12 different types of commonly used feature clustering, selection and dimensionality reduction algorithms, greatly facilitating training, analysis and benchmarking of machine-learning models. The functionality of iFeature is made freely available via an online web server and a stand-alone toolkit. Availability and implementation http://iFeature.erc.monash.edu/; https://github.com/Superzchen/iFeature/. Supplementary information Supplementary data are available at Bioinformatics online.
AbstractList Structural and physiochemical descriptors extracted from sequence data have been widely used to represent sequences and predict structural, functional, expression and interaction profiles of proteins and peptides as well as DNAs/RNAs. Here, we present iFeature, a versatile Python-based toolkit for generating various numerical feature representation schemes for both protein and peptide sequences. iFeature is capable of calculating and extracting a comprehensive spectrum of 18 major sequence encoding schemes that encompass 53 different types of feature descriptors. It also allows users to extract specific amino acid properties from the AAindex database. Furthermore, iFeature integrates 12 different types of commonly used feature clustering, selection and dimensionality reduction algorithms, greatly facilitating training, analysis and benchmarking of machine-learning models. The functionality of iFeature is made freely available via an online web server and a stand-alone toolkit.SummaryStructural and physiochemical descriptors extracted from sequence data have been widely used to represent sequences and predict structural, functional, expression and interaction profiles of proteins and peptides as well as DNAs/RNAs. Here, we present iFeature, a versatile Python-based toolkit for generating various numerical feature representation schemes for both protein and peptide sequences. iFeature is capable of calculating and extracting a comprehensive spectrum of 18 major sequence encoding schemes that encompass 53 different types of feature descriptors. It also allows users to extract specific amino acid properties from the AAindex database. Furthermore, iFeature integrates 12 different types of commonly used feature clustering, selection and dimensionality reduction algorithms, greatly facilitating training, analysis and benchmarking of machine-learning models. The functionality of iFeature is made freely available via an online web server and a stand-alone toolkit.http://iFeature.erc.monash.edu/; https://github.com/Superzchen/iFeature/.Availability and implementationhttp://iFeature.erc.monash.edu/; https://github.com/Superzchen/iFeature/.Supplementary data are available at Bioinformatics online.Supplementary informationSupplementary data are available at Bioinformatics online.
Abstract Summary Structural and physiochemical descriptors extracted from sequence data have been widely used to represent sequences and predict structural, functional, expression and interaction profiles of proteins and peptides as well as DNAs/RNAs. Here, we present iFeature, a versatile Python-based toolkit for generating various numerical feature representation schemes for both protein and peptide sequences. iFeature is capable of calculating and extracting a comprehensive spectrum of 18 major sequence encoding schemes that encompass 53 different types of feature descriptors. It also allows users to extract specific amino acid properties from the AAindex database. Furthermore, iFeature integrates 12 different types of commonly used feature clustering, selection and dimensionality reduction algorithms, greatly facilitating training, analysis and benchmarking of machine-learning models. The functionality of iFeature is made freely available via an online web server and a stand-alone toolkit. Availability and implementation http://iFeature.erc.monash.edu/; https://github.com/Superzchen/iFeature/. Supplementary information Supplementary data are available at Bioinformatics online.
Structural and physiochemical descriptors extracted from sequence data have been widely used to represent sequences and predict structural, functional, expression and interaction profiles of proteins and peptides as well as DNAs/RNAs. Here, we present iFeature, a versatile Python-based toolkit for generating various numerical feature representation schemes for both protein and peptide sequences. iFeature is capable of calculating and extracting a comprehensive spectrum of 18 major sequence encoding schemes that encompass 53 different types of feature descriptors. It also allows users to extract specific amino acid properties from the AAindex database. Furthermore, iFeature integrates 12 different types of commonly used feature clustering, selection and dimensionality reduction algorithms, greatly facilitating training, analysis and benchmarking of machine-learning models. The functionality of iFeature is made freely available via an online web server and a stand-alone toolkit. http://iFeature.erc.monash.edu/; https://github.com/Superzchen/iFeature/. Supplementary data are available at Bioinformatics online.
Author Chou, Kuo-Chen
Li, Fuyi
Wang, Yanan
Webb, Geoffrey I
Song, Jiangning
Chen, Zhen
Smith, A Ian
Daly, Roger J
Leier, André
Marquez-Lago, Tatiana T
Zhao, Pei
AuthorAffiliation 8 Gordon Life Science Institute, Boston, MA, USA
9 Center for Informational Biology, School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu, China
1 School of Basic Medical Science, Qingdao University, 38 Dengzhou Road, Qingdao, China
3 Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Melbourne, VIC, Australia
5 Department of Cell, Developmental and Integrative Biology, School of Medicine, University of Alabama at Birmingham, AL, USA
2 State Key Laboratory of Cotton Biology, Institute of Cotton Research of Chinese Academy of Agricultural Sciences (CAAS), Anyang, China
6 Institute of Image Processing and Pattern Recognition, Shanghai Jiao Tong University, Shanghai, China
7 Monash Centre for Data Science, Faculty of Information Technology, Monash University, Melbourne, VIC, Australia
4 Department of Genetics, School of Medicine, University of Alabama at Birmingham, AL, USA
AuthorAffiliation_xml – name: 3 Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Melbourne, VIC, Australia
– name: 5 Department of Cell, Developmental and Integrative Biology, School of Medicine, University of Alabama at Birmingham, AL, USA
– name: 9 Center for Informational Biology, School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu, China
– name: 1 School of Basic Medical Science, Qingdao University, 38 Dengzhou Road, Qingdao, China
– name: 7 Monash Centre for Data Science, Faculty of Information Technology, Monash University, Melbourne, VIC, Australia
– name: 4 Department of Genetics, School of Medicine, University of Alabama at Birmingham, AL, USA
– name: 6 Institute of Image Processing and Pattern Recognition, Shanghai Jiao Tong University, Shanghai, China
– name: 2 State Key Laboratory of Cotton Biology, Institute of Cotton Research of Chinese Academy of Agricultural Sciences (CAAS), Anyang, China
– name: 8 Gordon Life Science Institute, Boston, MA, USA
Author_xml – sequence: 1
  givenname: Zhen
  surname: Chen
  fullname: Chen, Zhen
  organization: School of Basic Medical Science, Qingdao University, 38 Dengzhou Road, Qingdao, China
– sequence: 2
  givenname: Pei
  surname: Zhao
  fullname: Zhao, Pei
  organization: State Key Laboratory of Cotton Biology, Institute of Cotton Research of Chinese Academy of Agricultural Sciences (CAAS), Anyang, China
– sequence: 3
  givenname: Fuyi
  surname: Li
  fullname: Li, Fuyi
  email: kcchou@gordonlifescience.org
  organization: Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Melbourne, VIC, Australia
– sequence: 4
  givenname: André
  surname: Leier
  fullname: Leier, André
  organization: Department of Genetics, School of Medicine, University of Alabama at Birmingham, AL, USA
– sequence: 5
  givenname: Tatiana T
  surname: Marquez-Lago
  fullname: Marquez-Lago, Tatiana T
  organization: Department of Genetics, School of Medicine, University of Alabama at Birmingham, AL, USA
– sequence: 6
  givenname: Yanan
  surname: Wang
  fullname: Wang, Yanan
  organization: Institute of Image Processing and Pattern Recognition, Shanghai Jiao Tong University, Shanghai, China
– sequence: 7
  givenname: Geoffrey I
  surname: Webb
  fullname: Webb, Geoffrey I
  organization: Monash Centre for Data Science, Faculty of Information Technology, Monash University, Melbourne, VIC, Australia
– sequence: 8
  givenname: A Ian
  surname: Smith
  fullname: Smith, A Ian
  organization: Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Melbourne, VIC, Australia
– sequence: 9
  givenname: Roger J
  surname: Daly
  fullname: Daly, Roger J
  email: roger.daly@monash.edu
  organization: Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Melbourne, VIC, Australia
– sequence: 10
  givenname: Kuo-Chen
  surname: Chou
  fullname: Chou, Kuo-Chen
  email: kcchou@gordonlifescience.org
  organization: Gordon Life Science Institute, Boston, MA, USA
– sequence: 11
  givenname: Jiangning
  orcidid: 0000-0001-8031-9086
  surname: Song
  fullname: Song, Jiangning
  email: jiangning.song@monash.edu
  organization: Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Melbourne, VIC, Australia
BackLink https://www.ncbi.nlm.nih.gov/pubmed/29528364$$D View this record in MEDLINE/PubMed
BookMark eNqNkUtv1TAQhS1URB_wE0BZsgm148dNQEJCFQWkSrCAtTV2xq0h1w6203L_Pb5KeZQNXdnW-c4czfExOQgxICFPGX3B6MBPjY8-uJi2ULzNp6bsmKAPyBETirYdlcNBvXO1aUVP-SE5zvkrpZIJIR6Rw26QXc-VOCLJnyOUJeHLBppPu3IVQzOD_QaX2EAYmxs0TcZ0jampYY1b4dzgj5LAFl_xPZZxwvXlUtw2c4oF_SrNOBc_YkW-Lxgs5sfkoYMp45Pb84R8OX_7-ex9e_Hx3YezNxetlZKVduBUKdZ3lnMEGKRgFB2MUsDAjREbWjWjhDMAsGHIXU_dMBqDnbPOKslPiFrnLmGG3Q1Mk56T30LaaUb1vkR9t0S9lliNr1fjvJgtjhZD3fWPOYLXd5Xgr_RlvNZKyX5D98nPbwekWJfORW99tjhNEDAuWXeUcUaF6oeKPvs763fIrx-qwKsVsCnmnNBp6wvsq67RfvrvKvIf930roKsvLvM9LT8BTc3XoQ
CitedBy_id crossref_primary_10_1186_s12885_024_12609_8
crossref_primary_10_1016_j_ab_2024_115523
crossref_primary_10_1016_j_ygeno_2019_05_024
crossref_primary_10_1186_s12859_024_05726_5
crossref_primary_10_1021_acs_jcim_9b00877
crossref_primary_10_1093_bib_bbab376
crossref_primary_10_1007_s11517_021_02436_5
crossref_primary_10_1093_bib_bbad433
crossref_primary_10_1016_j_omtn_2023_04_015
crossref_primary_10_1093_bib_bbaa284
crossref_primary_10_1016_j_compbiomed_2022_105704
crossref_primary_10_3389_fpls_2023_1077301
crossref_primary_10_4155_fdd_2023_0004
crossref_primary_10_1016_j_compbiomed_2022_105700
crossref_primary_10_1016_j_neucom_2025_129637
crossref_primary_10_1093_bib_bbab008
crossref_primary_10_1016_j_chemolab_2020_103999
crossref_primary_10_1093_bib_bbac216
crossref_primary_10_1016_j_isci_2022_104883
crossref_primary_10_1093_bib_bbab005
crossref_primary_10_3390_biom12101529
crossref_primary_10_1007_s00204_024_03888_y
crossref_primary_10_1002_pro_4480
crossref_primary_10_1021_acs_jcim_9b00629
crossref_primary_10_1093_bib_bbaa394
crossref_primary_10_1360_SSV_2022_0074
crossref_primary_10_1142_S2737416524400052
crossref_primary_10_3390_ijms222313128
crossref_primary_10_3390_ijms222313124
crossref_primary_10_1142_S0219720020500183
crossref_primary_10_1038_s41467_022_30741_6
crossref_primary_10_2174_0929866526666191112150636
crossref_primary_10_1016_j_ab_2024_115546
crossref_primary_10_3389_fbioe_2020_584807
crossref_primary_10_1093_bib_bbac125
crossref_primary_10_3389_fgene_2021_680117
crossref_primary_10_2174_1574893615999200503030350
crossref_primary_10_1109_ACCESS_2023_3279498
crossref_primary_10_1016_j_crbiot_2021_12_002
crossref_primary_10_1016_j_gene_2021_145643
crossref_primary_10_1021_acs_jproteome_0c00276
crossref_primary_10_1038_s41598_022_08555_9
crossref_primary_10_3390_cells11172646
crossref_primary_10_1093_bioinformatics_bty628
crossref_primary_10_1109_ACCESS_2020_3010944
crossref_primary_10_1002_mlf2_12125
crossref_primary_10_3390_biology9100325
crossref_primary_10_1021_acs_jcim_2c01034
crossref_primary_10_1016_j_rineng_2024_102878
crossref_primary_10_1186_s12859_024_05714_9
crossref_primary_10_2174_1568026619666191018100141
crossref_primary_10_1016_j_compbiomed_2022_105717
crossref_primary_10_1093_bib_bbac476
crossref_primary_10_1002_pmic_202400044
crossref_primary_10_1016_j_ab_2020_113592
crossref_primary_10_1093_bib_bbab263
crossref_primary_10_1016_j_omtn_2020_06_004
crossref_primary_10_3390_a17060247
crossref_primary_10_1016_j_ymeth_2023_07_002
crossref_primary_10_1093_bib_bbac108
crossref_primary_10_1142_S0219720020500080
crossref_primary_10_1016_j_ymeth_2023_07_003
crossref_primary_10_1177_11779322241263671
crossref_primary_10_3389_fcell_2020_594587
crossref_primary_10_1016_j_ymeth_2021_12_001
crossref_primary_10_1093_bioinformatics_bty522
crossref_primary_10_1016_j_chemolab_2022_104496
crossref_primary_10_1007_s12539_021_00464_1
crossref_primary_10_1002_advs_202404049
crossref_primary_10_1016_j_jtbi_2018_11_012
crossref_primary_10_1016_j_jtbi_2018_11_010
crossref_primary_10_1021_acsomega_3c00286
crossref_primary_10_1016_j_csbj_2022_08_053
crossref_primary_10_1016_j_ijbiomac_2022_12_315
crossref_primary_10_1016_j_compbiomed_2022_105868
crossref_primary_10_3389_fbioe_2020_00892
crossref_primary_10_1016_j_compbiomed_2022_105984
crossref_primary_10_1038_s41598_021_81063_4
crossref_primary_10_1038_s41598_020_67384_w
crossref_primary_10_1021_acsomega_1c03132
crossref_primary_10_1093_bib_bbae716
crossref_primary_10_7717_peerj_11906
crossref_primary_10_1002_pro_4442
crossref_primary_10_1016_j_genrep_2023_101876
crossref_primary_10_1016_j_compbiomed_2024_109200
crossref_primary_10_1093_bioinformatics_btz721
crossref_primary_10_1186_s12859_022_04727_6
crossref_primary_10_17537_2023_18_113
crossref_primary_10_3389_fgene_2020_569100
crossref_primary_10_1371_journal_pone_0288173
crossref_primary_10_1007_s41109_022_00468_w
crossref_primary_10_1177_11779322211020315
crossref_primary_10_1109_ACCESS_2022_3146395
crossref_primary_10_1007_s12539_023_00552_4
crossref_primary_10_1007_s12275_022_2044_9
crossref_primary_10_1093_bib_bbab434
crossref_primary_10_1093_nar_gkz383
crossref_primary_10_1093_bib_bby089
crossref_primary_10_1016_j_chemolab_2022_104490
crossref_primary_10_1093_bib_bbz177
crossref_primary_10_3934_mbe_2023954
crossref_primary_10_1016_j_ymeth_2021_09_008
crossref_primary_10_1016_j_ab_2022_115020
crossref_primary_10_1109_TCBB_2023_3238370
crossref_primary_10_1080_07391102_2024_2329298
crossref_primary_10_3390_cells9020353
crossref_primary_10_1016_j_eswa_2021_114876
crossref_primary_10_1016_j_compbiomed_2022_105767
crossref_primary_10_1093_bib_bbz041
crossref_primary_10_1093_bib_bbaa018
crossref_primary_10_1093_bib_bbaa016
crossref_primary_10_1093_bib_bbab227
crossref_primary_10_1007_s00500_020_05363_z
crossref_primary_10_1007_s13755_025_00342_w
crossref_primary_10_1016_j_eswa_2024_124087
crossref_primary_10_1093_bib_bby077
crossref_primary_10_3390_app12073631
crossref_primary_10_1016_j_ymeth_2024_01_005
crossref_primary_10_4236_ns_2020_127036
crossref_primary_10_3390_ijms24021146
crossref_primary_10_1016_j_ijbiomac_2024_138880
crossref_primary_10_2174_0929867328666211005140625
crossref_primary_10_1093_bib_bbab461
crossref_primary_10_1093_bib_bbaa128
crossref_primary_10_1093_bib_bbac428
crossref_primary_10_1016_j_cbi_2021_109533
crossref_primary_10_1177_1176934320919707
crossref_primary_10_1093_bib_bbaa367
crossref_primary_10_1186_s12859_019_2700_1
crossref_primary_10_1093_bib_bbz156
crossref_primary_10_1038_s41598_021_85629_0
crossref_primary_10_1007_s11704_020_9504_3
crossref_primary_10_2174_1574893616666210727161003
crossref_primary_10_1093_bioinformatics_btad506
crossref_primary_10_3390_math8020169
crossref_primary_10_1021_acs_jcim_3c01189
crossref_primary_10_1021_acs_jcim_3c02035
crossref_primary_10_1016_j_csbj_2021_08_013
crossref_primary_10_3389_fbioe_2020_00730
crossref_primary_10_1007_s00521_020_04792_z
crossref_primary_10_1109_ACCESS_2019_2952621
crossref_primary_10_3390_molecules28031148
crossref_primary_10_1002_aisy_202300426
crossref_primary_10_1016_j_ygeno_2022_110454
crossref_primary_10_1016_j_jtbi_2019_04_025
crossref_primary_10_1016_j_tifs_2024_104578
crossref_primary_10_1093_bib_bbaa312
crossref_primary_10_1042_BST20211240
crossref_primary_10_4236_vp_2021_71002
crossref_primary_10_1016_j_future_2025_107819
crossref_primary_10_1016_j_cie_2022_108124
crossref_primary_10_1109_TCBB_2021_3057128
crossref_primary_10_1021_acs_jcim_0c00489
crossref_primary_10_1186_s12859_023_05463_1
crossref_primary_10_2174_0115748936299646240625092734
crossref_primary_10_1016_j_jtbi_2018_09_005
crossref_primary_10_1016_j_jtbi_2018_10_021
crossref_primary_10_1007_s12539_021_00429_4
crossref_primary_10_1109_TCBB_2024_3378470
crossref_primary_10_1007_s40995_021_01134_z
crossref_primary_10_1038_s41598_020_75029_1
crossref_primary_10_1093_bib_bbaa423
crossref_primary_10_1016_j_csbj_2023_02_051
crossref_primary_10_1021_acspolymersau_3c00025
crossref_primary_10_1093_bib_bbaa301
crossref_primary_10_3389_fmicb_2020_567863
crossref_primary_10_1002_psc_3490
crossref_primary_10_3390_electronics13061059
crossref_primary_10_1093_bib_bbaa308
crossref_primary_10_1007_s12539_024_00673_4
crossref_primary_10_2174_0929866527666201103145635
crossref_primary_10_1371_journal_pone_0267106
crossref_primary_10_1016_j_compbiomed_2024_109013
crossref_primary_10_1093_bib_bbab310
crossref_primary_10_1093_bioinformatics_btaa492
crossref_primary_10_1093_bib_bbab551
crossref_primary_10_1093_bioinformatics_btz432
crossref_primary_10_1016_j_compbiomed_2023_106935
crossref_primary_10_3389_fbioe_2021_647113
crossref_primary_10_1186_s12918_018_0665_8
crossref_primary_10_3389_fcell_2020_580217
crossref_primary_10_1016_j_isci_2024_110718
crossref_primary_10_1093_bib_bbz123
crossref_primary_10_1109_TCBB_2024_3425644
crossref_primary_10_3390_math12091330
crossref_primary_10_1186_s12859_022_04756_1
crossref_primary_10_1093_bioinformatics_bty458
crossref_primary_10_3390_life12020307
crossref_primary_10_1021_acs_jcim_4c00295
crossref_primary_10_1016_j_chemolab_2023_105019
crossref_primary_10_3390_genes11091023
crossref_primary_10_1016_j_mib_2019_03_004
crossref_primary_10_1021_acs_jcim_0c01115
crossref_primary_10_1016_j_csbj_2024_06_030
crossref_primary_10_1038_s41467_022_29661_2
crossref_primary_10_1007_s11831_023_09933_w
crossref_primary_10_1016_j_compbiomed_2023_107915
crossref_primary_10_1007_s11042_024_19251_3
crossref_primary_10_1016_j_ab_2020_113747
crossref_primary_10_1093_bib_bbac627
crossref_primary_10_1016_j_ygeno_2021_07_004
crossref_primary_10_1093_bib_bbab414
crossref_primary_10_1080_10408398_2023_2240396
crossref_primary_10_1093_bib_bbz112
crossref_primary_10_1093_nargab_lqab039
crossref_primary_10_1093_bioinformatics_btac458
crossref_primary_10_1021_acs_jcim_3c01020
crossref_primary_10_1016_j_neucom_2019_09_070
crossref_primary_10_1016_j_compbiomed_2020_103964
crossref_primary_10_1093_bioinformatics_btad421
crossref_primary_10_3389_fgene_2022_1036862
crossref_primary_10_1016_j_isci_2022_104967
crossref_primary_10_1021_acs_jproteome_3c00364
crossref_primary_10_1038_s41598_022_08173_5
crossref_primary_10_3390_ijms21197281
crossref_primary_10_1021_acsomega_1c02569
crossref_primary_10_1093_bioinformatics_btaa160
crossref_primary_10_1016_j_ymeth_2022_10_001
crossref_primary_10_2174_1386207323666201204140438
crossref_primary_10_1093_nar_gkac351
crossref_primary_10_1080_07391102_2024_2323141
crossref_primary_10_1016_j_compbiomed_2024_108079
crossref_primary_10_1016_j_compbiomed_2024_109048
crossref_primary_10_1038_s41598_020_77173_0
crossref_primary_10_1093_bioinformatics_btad671
crossref_primary_10_1038_s41598_024_62419_y
crossref_primary_10_3389_fmicb_2024_1335310
crossref_primary_10_1016_j_compbiomed_2022_105577
crossref_primary_10_1016_j_compbiomed_2023_106844
crossref_primary_10_1186_s12859_022_05070_6
crossref_primary_10_1007_s11030_024_10937_2
crossref_primary_10_1007_s11704_020_9507_0
crossref_primary_10_1016_j_chemolab_2019_06_003
crossref_primary_10_3389_fcell_2023_1149535
crossref_primary_10_1007_s11432_024_4171_9
crossref_primary_10_1186_s12915_023_01510_8
crossref_primary_10_1049_cje_2021_06_003
crossref_primary_10_3390_ijms252211866
crossref_primary_10_3389_fgene_2022_877409
crossref_primary_10_1016_j_compbiomed_2023_106974
crossref_primary_10_1093_bib_bbaa415
crossref_primary_10_1016_j_knosys_2022_108191
crossref_primary_10_7717_peerj_cs_2660
crossref_primary_10_1111_jfpp_15602
crossref_primary_10_1093_bfgp_elab017
crossref_primary_10_1002_minf_202000212
crossref_primary_10_1093_bib_bby107
crossref_primary_10_1016_j_isci_2023_108006
crossref_primary_10_1016_j_csbj_2022_01_019
crossref_primary_10_1007_s00726_023_03322_0
crossref_primary_10_3390_biom14091083
crossref_primary_10_2174_1381612825666191129164042
crossref_primary_10_1016_j_ijbiomac_2023_124228
crossref_primary_10_1093_bib_bby104
crossref_primary_10_1186_s13321_023_00689_w
crossref_primary_10_1016_j_isci_2023_108250
crossref_primary_10_1093_bioinformatics_btad108
crossref_primary_10_3390_info14120636
crossref_primary_10_1016_j_knosys_2018_10_007
crossref_primary_10_1093_bioinformatics_btz016
crossref_primary_10_1016_j_plantsci_2019_03_020
crossref_primary_10_1093_bioinformatics_btac374
crossref_primary_10_1109_JBHI_2023_3333828
crossref_primary_10_1109_JBHI_2024_3483316
crossref_primary_10_1016_j_compbiomed_2023_107848
crossref_primary_10_1016_j_gene_2019_06_010
crossref_primary_10_1016_j_bspc_2022_103856
crossref_primary_10_1016_j_jmb_2024_168717
crossref_primary_10_1016_j_knosys_2023_111354
crossref_primary_10_1186_s12859_022_04677_z
crossref_primary_10_1093_bfgp_elad016
crossref_primary_10_1016_j_chemolab_2019_103876
crossref_primary_10_1016_j_ygeno_2019_08_009
crossref_primary_10_1038_s41591_024_03240_y
crossref_primary_10_1016_j_foodchem_2022_134935
crossref_primary_10_3389_fgene_2022_842127
crossref_primary_10_1021_acs_jcim_1c00181
crossref_primary_10_1093_nargab_lqad087
crossref_primary_10_1016_j_ijbiomac_2020_06_234
crossref_primary_10_1016_j_ijbiomac_2024_135741
crossref_primary_10_1109_ACCESS_2020_2982410
crossref_primary_10_1016_j_biosystems_2024_105177
crossref_primary_10_3934_mbe_2021297
crossref_primary_10_1007_s11030_022_10489_3
crossref_primary_10_2174_1574893615999200601122328
crossref_primary_10_3389_fgene_2022_845747
crossref_primary_10_1109_ACCESS_2020_3013409
crossref_primary_10_1007_s40588_025_00240_5
crossref_primary_10_1016_j_heliyon_2024_e32951
crossref_primary_10_3389_fmicb_2018_02571
crossref_primary_10_1016_j_chemolab_2020_104175
crossref_primary_10_3390_molecules28186680
crossref_primary_10_1002_minf_202000006
crossref_primary_10_2174_1386207322666191203141102
crossref_primary_10_1016_j_ijbiomac_2022_10_009
crossref_primary_10_1007_s10989_019_09893_5
crossref_primary_10_1016_j_fbio_2024_105495
crossref_primary_10_3389_fgene_2021_793800
crossref_primary_10_1038_s41598_022_11897_z
crossref_primary_10_1038_s41598_024_82208_x
crossref_primary_10_1109_ACCESS_2020_2991477
crossref_primary_10_1371_journal_pone_0220842
crossref_primary_10_1007_s12539_024_00666_3
crossref_primary_10_1099_mgen_0_000483
crossref_primary_10_1016_j_sbi_2021_07_012
crossref_primary_10_1016_j_compbiomed_2022_106368
crossref_primary_10_1371_journal_pcbi_1009682
crossref_primary_10_1109_ACCESS_2020_2992672
crossref_primary_10_1021_acs_jafc_0c04624
crossref_primary_10_1109_ACCESS_2019_2949415
crossref_primary_10_31829_2692_4242_biogen2019_2_1__102
crossref_primary_10_1016_j_heliyon_2021_e07933
crossref_primary_10_1016_j_compbiolchem_2019_107147
crossref_primary_10_3389_fmicb_2022_790063
crossref_primary_10_1093_bib_bbad076
crossref_primary_10_1371_journal_pcbi_1010974
crossref_primary_10_1016_j_compbiomed_2023_106798
crossref_primary_10_1109_TCBB_2020_3017386
crossref_primary_10_1109_ACCESS_2020_2994206
crossref_primary_10_1093_bib_bbad063
crossref_primary_10_1093_bib_bbac094
crossref_primary_10_1016_j_ijbiomac_2023_128998
crossref_primary_10_1021_acsomega_3c08676
crossref_primary_10_1002_aic_17781
crossref_primary_10_1007_s00521_021_06674_4
crossref_primary_10_1186_s12859_022_04938_x
crossref_primary_10_1093_bioinformatics_btz165
crossref_primary_10_1016_j_rineng_2024_103348
crossref_primary_10_3389_fgene_2021_698477
crossref_primary_10_1093_bib_bbae270
crossref_primary_10_1016_j_csbj_2025_03_005
crossref_primary_10_1016_j_eswa_2022_117760
crossref_primary_10_4236_ns_2020_123013
crossref_primary_10_1016_j_xcrp_2023_101600
crossref_primary_10_3389_fmicb_2020_580382
crossref_primary_10_1186_s12915_024_02064_z
crossref_primary_10_1016_j_ab_2019_113507
crossref_primary_10_3390_ijms231710116
crossref_primary_10_1016_j_compbiomed_2023_107545
crossref_primary_10_1007_s11704_022_2559_6
crossref_primary_10_1016_j_vascn_2022_107191
crossref_primary_10_1016_j_compbiomed_2023_107662
crossref_primary_10_1109_TCBB_2019_2957758
crossref_primary_10_1186_s40168_020_00993_9
crossref_primary_10_1016_j_ygeno_2018_08_007
crossref_primary_10_1002_psc_3666
crossref_primary_10_1021_acsomega_2c04305
crossref_primary_10_1080_19420889_2022_2143101
crossref_primary_10_1016_j_compbiomed_2020_103899
crossref_primary_10_1080_1062936X_2022_2160011
crossref_primary_10_3389_fgene_2022_935989
crossref_primary_10_1021_acs_jcim_8b00749
crossref_primary_10_1109_TCBB_2020_2966450
crossref_primary_10_1016_j_csbj_2023_05_022
crossref_primary_10_1021_acs_jcim_2c00799
crossref_primary_10_3389_fbioe_2020_645520
crossref_primary_10_1002_med_21658
crossref_primary_10_1093_bib_bbae484
crossref_primary_10_1099_mgen_0_000611
crossref_primary_10_1016_j_csbj_2022_06_002
crossref_primary_10_1016_j_csbj_2025_01_020
crossref_primary_10_1021_acs_jcim_4c00501
crossref_primary_10_1093_bib_bbad399
crossref_primary_10_1186_s12967_021_03084_x
crossref_primary_10_2298_CSIS240327039D
crossref_primary_10_1109_TCBB_2022_3204365
crossref_primary_10_1038_s42003_022_03997_3
crossref_primary_10_1016_j_ejmech_2023_115500
crossref_primary_10_1002_bit_28037
crossref_primary_10_1093_bib_bbab083
crossref_primary_10_1186_s13321_025_00961_1
crossref_primary_10_2174_1573406415666181217114710
crossref_primary_10_1093_bib_bbae599
crossref_primary_10_1016_j_compbiolchem_2021_107489
crossref_primary_10_34133_research_0011
crossref_primary_10_3390_e23060643
crossref_primary_10_1016_j_vaccine_2024_126204
crossref_primary_10_2174_1574893615999200601122840
crossref_primary_10_1093_bib_bbac173
crossref_primary_10_1016_j_bspc_2021_102630
crossref_primary_10_3389_fbioe_2020_00183
crossref_primary_10_1021_acs_jcim_4c01718
crossref_primary_10_1093_bioadv_vbac080
crossref_primary_10_1038_s41598_024_63461_6
crossref_primary_10_1021_acs_jcim_4c00625
crossref_primary_10_1093_bib_bbae469
crossref_primary_10_1186_s13040_019_0196_x
crossref_primary_10_3389_fgene_2022_885929
crossref_primary_10_1038_s44222_024_00152_x
crossref_primary_10_1109_JBHI_2024_3425716
crossref_primary_10_1007_s10989_019_09906_3
crossref_primary_10_1039_D4CS00196F
crossref_primary_10_1093_database_baad094
crossref_primary_10_1111_1758_2229_13068
crossref_primary_10_1007_s11033_018_4391_5
crossref_primary_10_1080_07391102_2024_2318482
crossref_primary_10_1109_ACCESS_2020_3015792
crossref_primary_10_1155_2020_9235920
crossref_primary_10_1007_s12539_022_00520_4
crossref_primary_10_1016_j_compbiolchem_2021_107494
crossref_primary_10_1016_j_slast_2024_100152
crossref_primary_10_3390_ijms24031878
crossref_primary_10_1016_j_asoc_2023_110064
crossref_primary_10_2174_0929867326666190507082559
crossref_primary_10_1016_j_chemolab_2020_104102
crossref_primary_10_34133_research_0016
crossref_primary_10_1186_s12859_019_3178_6
crossref_primary_10_3389_fgene_2024_1464976
crossref_primary_10_1016_j_ygeno_2018_05_017
crossref_primary_10_1109_ACCESS_2023_3246927
crossref_primary_10_3389_fcimb_2022_882995
crossref_primary_10_1021_acsomega_2c04465
crossref_primary_10_1093_bib_bbad288
crossref_primary_10_1016_j_dib_2019_105002
crossref_primary_10_1016_j_imu_2020_100318
crossref_primary_10_1016_j_compbiomed_2024_108737
crossref_primary_10_1186_s12864_023_09834_z
crossref_primary_10_1016_j_imu_2024_101532
crossref_primary_10_1093_bioadv_vbae165
crossref_primary_10_1016_j_gene_2018_09_006
crossref_primary_10_1016_j_csbj_2022_06_045
crossref_primary_10_2174_0115748936286848240108074303
crossref_primary_10_1155_2021_6664362
crossref_primary_10_2174_1574893616666211108094205
crossref_primary_10_1186_s12859_024_06004_0
crossref_primary_10_1109_ACCESS_2023_3321100
crossref_primary_10_1093_bib_bbad353
crossref_primary_10_3389_fgene_2022_1007618
crossref_primary_10_1155_2020_8894478
crossref_primary_10_3390_ijms23063044
crossref_primary_10_1128_msystems_00345_23
crossref_primary_10_1016_j_ijbiomac_2020_11_111
crossref_primary_10_1016_j_csbj_2024_09_015
crossref_primary_10_1186_s13321_023_00735_7
crossref_primary_10_1109_TCBB_2024_3489614
crossref_primary_10_1016_j_compbiomed_2023_107242
crossref_primary_10_1016_j_bbcan_2023_188999
crossref_primary_10_1016_j_ab_2020_114069
crossref_primary_10_1371_journal_pone_0307176
crossref_primary_10_1016_j_chemolab_2021_104428
crossref_primary_10_1016_j_cmpb_2024_108176
crossref_primary_10_2174_1573406415666181218102517
crossref_primary_10_1016_j_csbj_2022_12_044
crossref_primary_10_1186_s12859_025_06079_3
crossref_primary_10_2174_1389202921666200214125102
crossref_primary_10_1007_s00044_024_03260_w
crossref_primary_10_1093_abt_tbad007
crossref_primary_10_1093_database_baac085
crossref_primary_10_1016_j_compbiomed_2019_04_040
crossref_primary_10_1093_bib_bbac040
crossref_primary_10_1007_s10822_020_00323_z
crossref_primary_10_1109_ACCESS_2020_2966592
crossref_primary_10_1016_j_compbiomed_2024_108408
crossref_primary_10_1016_j_jmb_2024_168853
crossref_primary_10_1093_bib_bbad372
crossref_primary_10_1186_s12859_020_03574_7
crossref_primary_10_1016_j_synbio_2024_02_006
crossref_primary_10_1109_ACCESS_2020_2999394
crossref_primary_10_3389_fgene_2024_1498884
crossref_primary_10_1016_j_compbiomed_2023_107386
crossref_primary_10_1021_acs_jproteome_0c00871
crossref_primary_10_1109_TCBBIO_2024_3496503
crossref_primary_10_1016_j_compbiomed_2024_108534
crossref_primary_10_1016_j_ijbiomac_2024_131960
crossref_primary_10_1109_ACCESS_2020_2972922
crossref_primary_10_1016_j_chemolab_2021_104326
crossref_primary_10_1093_bib_bbaa099
crossref_primary_10_1016_j_compbiomed_2022_105911
crossref_primary_10_1093_bib_bbac031
crossref_primary_10_2174_1385272823666190718145613
crossref_primary_10_1016_j_ygeno_2018_12_001
crossref_primary_10_1007_s12539_023_00595_7
crossref_primary_10_1089_cbr_2024_0066
crossref_primary_10_1089_cmb_2022_0241
crossref_primary_10_1016_j_ab_2018_09_002
Cites_doi 10.1074/jbc.M401932200
10.1016/j.bbapap.2013.04.006
10.1093/bioinformatics/btv042
10.1038/nprot.2007.494
10.1186/1471-2105-9-310
10.1016/j.ab.2007.10.012
10.1093/nar/25.17.3389
10.1038/nrg3920
10.1006/bbrc.2000.3815
10.1093/nar/gkl305
10.1002/ajpa.20250
10.1093/bioinformatics/bth466
10.1016/j.bbrc.2004.06.073
10.1073/pnas.0607879104
10.1016/j.ab.2012.03.015
10.1093/bioinformatics/btq267
10.1093/bib/bbk007
10.1371/journal.pone.0017331
10.2174/1573406413666170515120507
10.1093/bioinformatics/btq043
10.1093/bioinformatics/btw564
10.1093/bioinformatics/btu624
10.1093/nar/gkr284
10.1016/j.jtbi.2010.12.024
10.1002/(SICI)1097-0134(19990601)35:4<401::AID-PROT3>3.0.CO;2-K
10.1371/journal.pcbi.1000636
10.1515/9781400874668
10.1089/omi.2015.0095
10.1073/pnas.92.19.8700
10.1093/bioinformatics/btt196
10.1016/S0006-3495(94)80782-9
10.1002/prot.1035
10.3390/ijms15033495
10.1093/bioinformatics/btt072
ContentType Journal Article
Copyright The Author(s) 2018. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com 2018
Copyright_xml – notice: The Author(s) 2018. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com 2018
DBID AAYXX
CITATION
CGR
CUY
CVF
ECM
EIF
NPM
7X8
5PM
ADTOC
UNPAY
DOI 10.1093/bioinformatics/bty140
DatabaseName CrossRef
Medline
MEDLINE
MEDLINE (Ovid)
MEDLINE
MEDLINE
PubMed
MEDLINE - Academic
PubMed Central (Full Participant titles)
Unpaywall for CDI: Periodical Content
Unpaywall
DatabaseTitle CrossRef
MEDLINE
Medline Complete
MEDLINE with Full Text
PubMed
MEDLINE (Ovid)
MEDLINE - Academic
DatabaseTitleList MEDLINE - Academic

MEDLINE
Database_xml – sequence: 1
  dbid: NPM
  name: PubMed
  url: https://proxy.k.utb.cz/login?url=http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed
  sourceTypes: Index Database
– sequence: 2
  dbid: EIF
  name: MEDLINE
  url: https://proxy.k.utb.cz/login?url=https://www.webofscience.com/wos/medline/basic-search
  sourceTypes: Index Database
– sequence: 3
  dbid: UNPAY
  name: Unpaywall
  url: https://proxy.k.utb.cz/login?url=https://unpaywall.org/
  sourceTypes: Open Access Repository
DeliveryMethod fulltext_linktorsrc
Discipline Biology
EISSN 1460-2059
1367-4811
EndPage 2502
ExternalDocumentID 10.1093/bioinformatics/bty140
PMC6658705
29528364
10_1093_bioinformatics_bty140
Genre Research Support, Non-U.S. Gov't
Journal Article
Research Support, N.I.H., Extramural
GrantInformation_xml – fundername: Major Inter-Disciplinary Research
– fundername: IDR
– fundername: National Institute of Allergy and Infectious Diseases of the National Institutes of Health
  grantid: R01 AI111965
– fundername: National Health and Medical Research Council of Australia
– fundername: National Natural Science Foundation of China
  funderid: 10.13039/501100001809
– fundername: NSFC
  grantid: 31701142
  funderid: 10.13039/501100001809
– fundername: ARC
  grantid: LP110200333; DP120104460
  funderid: 10.13039/100000163
– fundername: NHMRC
  grantid: APP1058540
  funderid: 10.13039/501100000925
– fundername: Grant Awarded by Monash University
– fundername: UAB School of Medicine
  funderid: 10.13039/100011221
– fundername: NIAID NIH HHS
  grantid: R01 AI111965
– fundername: ;
– fundername: ;
  grantid: APP1058540
– fundername: ; ;
  grantid: 31701142
– fundername: ; ;
  grantid: R01 AI111965
– fundername: ; ;
  grantid: LP110200333; DP120104460
– fundername: ; ;
GroupedDBID -~X
.2P
5GY
AAMVS
ABJNI
ABPTD
ACGFS
ADZXQ
ALMA_UNASSIGNED_HOLDINGS
F5P
HW0
Q5Y
RD5
ROZ
TLC
TN5
TOX
WH7
---
-E4
.DC
.I3
0R~
23N
2WC
4.4
48X
53G
5WA
70D
AAIJN
AAIMJ
AAJKP
AAJQQ
AAKPC
AAMDB
AAOGV
AAPQZ
AAPXW
AAUQX
AAVAP
AAVLN
AAYXX
ABEJV
ABEUO
ABGNP
ABIXL
ABNKS
ABPQP
ABQLI
ABWST
ABXVV
ABZBJ
ACIWK
ACPRK
ACUFI
ACUXJ
ACYTK
ADBBV
ADEYI
ADEZT
ADFTL
ADGKP
ADGZP
ADHKW
ADHZD
ADMLS
ADOCK
ADPDF
ADRDM
ADRTK
ADVEK
ADYVW
ADZTZ
AECKG
AEGPL
AEJOX
AEKKA
AEKSI
AELWJ
AEMDU
AENEX
AENZO
AEPUE
AETBJ
AEWNT
AFFZL
AFGWE
AFIYH
AFOFC
AFRAH
AGINJ
AGKEF
AGQXC
AGSYK
AHMBA
AHXPO
AIJHB
AJEEA
AJEUX
AKHUL
AKWXX
ALTZX
ALUQC
AMNDL
APIBT
APWMN
ARIXL
ASPBG
AVWKF
AXUDD
AYOIW
AZVOD
BAWUL
BAYMD
BHONS
BQDIO
BQUQU
BSWAC
BTQHN
C45
CDBKE
CITATION
CS3
CZ4
DAKXR
DIK
DILTD
DU5
D~K
EBD
EBS
EE~
EJD
EMOBN
F9B
FEDTE
FHSFR
FLIZI
FLUFQ
FOEOM
FQBLK
GAUVT
GJXCC
GROUPED_DOAJ
GX1
H13
H5~
HAR
HZ~
IOX
J21
JXSIZ
KAQDR
KOP
KQ8
KSI
KSN
M-Z
MK~
ML0
N9A
NGC
NLBLG
NMDNZ
NOMLY
NU-
O9-
OAWHX
ODMLO
OJQWA
OK1
OVD
OVEED
P2P
PAFKI
PEELM
PQQKQ
Q1.
R44
RNS
ROL
RPM
RUSNO
RW1
RXO
SV3
TEORI
TJP
TR2
W8F
WOQ
X7H
YAYTL
YKOAZ
YXANX
ZKX
~91
~KM
CGR
CUY
CVF
ECM
EIF
NPM
7X8
5PM
.-4
.GJ
1TH
ABEFU
ABNGD
ACUKT
ADTOC
AFFNX
AGQPQ
AI.
AQDSO
ATTQO
AZFZN
C1A
CAG
COF
ELUNK
HVGLF
NTWIH
NVLIB
O0~
O~Y
PB-
RNI
RZF
RZO
UNPAY
VH1
ZGI
ID FETCH-LOGICAL-c551t-93066182c33eaa95410efad54a93bb470182b64fbaaa71e3f80f9dbbe2fcfc653
IEDL.DBID UNPAY
ISSN 1367-4803
1367-4811
IngestDate Sun Oct 26 04:14:18 EDT 2025
Thu Aug 21 18:45:40 EDT 2025
Fri Jul 11 16:23:56 EDT 2025
Mon Jul 21 05:51:32 EDT 2025
Tue Jul 01 03:27:24 EDT 2025
Thu Apr 24 23:11:08 EDT 2025
Wed Apr 02 07:03:18 EDT 2025
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Issue 14
Language English
License This article is published and distributed under the terms of the Oxford University Press, Standard Journals Publication Model (https://academic.oup.com/journals/pages/about_us/legal/notices)
https://academic.oup.com/journals/pages/about_us/legal/notices
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c551t-93066182c33eaa95410efad54a93bb470182b64fbaaa71e3f80f9dbbe2fcfc653
Notes ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
The authors wish it to be known that, in their opinion, Zhen Chen and Pei Zhao authors should be regarded as Joint First Authors.
ORCID 0000-0001-8031-9086
OpenAccessLink https://proxy.k.utb.cz/login?url=https://academic.oup.com/bioinformatics/article-pdf/34/14/2499/29153563/bty140.pdf
PMID 29528364
PQID 2013104689
PQPubID 23479
PageCount 4
ParticipantIDs unpaywall_primary_10_1093_bioinformatics_bty140
pubmedcentral_primary_oai_pubmedcentral_nih_gov_6658705
proquest_miscellaneous_2013104689
pubmed_primary_29528364
crossref_citationtrail_10_1093_bioinformatics_bty140
crossref_primary_10_1093_bioinformatics_bty140
oup_primary_10_1093_bioinformatics_bty140
ProviderPackageCode CITATION
AAYXX
PublicationCentury 2000
PublicationDate 2018-07-15
PublicationDateYYYYMMDD 2018-07-15
PublicationDate_xml – month: 07
  year: 2018
  text: 2018-07-15
  day: 15
PublicationDecade 2010
PublicationPlace England
PublicationPlace_xml – name: England
PublicationTitle Bioinformatics
PublicationTitleAlternate Bioinformatics
PublicationYear 2018
Publisher Oxford University Press
Publisher_xml – name: Oxford University Press
References Cao (2023012713012398600_bty140-B5) 2013; 29
Bellman (2023012713012398600_bty140-B3) 1961
Chen (2023012713012398600_bty140-B8) 2013; 1834
Xiao (2023012713012398600_bty140-B35) 2015; 31
Chen (2023012713012398600_bty140-B7) 2013; 29
Du (2023012713012398600_bty140-B16) 2012; 425
Zuo (2023012713012398600_bty140-B36) 2017; 33
Chou (2023012713012398600_bty140-B9) 2000; 278
Li (2023012713012398600_bty140-B23) 2006; 34
Sokal (2023012713012398600_bty140-B32) 2006; 129
Song (2023012713012398600_bty140-B33) 2010; 26
Barkan (2023012713012398600_bty140-B2) 2010; 26
Chou (2023012713012398600_bty140-B11) 2005; 21
Schneider (2023012713012398600_bty140-B29) 1994; 66
Libbrecht (2023012713012398600_bty140-B24) 2015; 16
Shen (2023012713012398600_bty140-B30) 2007; 104
Lee (2023012713012398600_bty140-B22) 2011; 6
Shen (2023012713012398600_bty140-B31) 2008; 373
Chou (2023012713012398600_bty140-B15) 1978; 47
Dubchak (2023012713012398600_bty140-B19) 1999; 35
Altschul (2023012713012398600_bty140-B1) 1997; 25
Kawashima (2023012713012398600_bty140-B20) 2008; 36 (Database issue)
Chou (2023012713012398600_bty140-B10) 2001; 43
Liu (2023012713012398600_bty140-B25) 2017; 13
Rottig (2023012713012398600_bty140-B27) 2010; 6
Larranaga (2023012713012398600_bty140-B21) 2006; 7
Tung (2023012713012398600_bty140-B34) 2008; 9
Du (2023012713012398600_bty140-B17) 2014; 15
Chou (2023012713012398600_bty140-B12) 2011; 273
Bhasin (2023012713012398600_bty140-B4) 2004; 279
Chou (2023012713012398600_bty140-B14) 2008; 3
Dubchak (2023012713012398600_bty140-B18) 1995; 92
Saravanan (2023012713012398600_bty140-B28) 2015; 19
Chou (2023012713012398600_bty140-B13) 2004; 320
Cao (2023012713012398600_bty140-B6) 2015; 31
Rao (2023012713012398600_bty140-B26) 2011; 39
References_xml – volume: 279
  start-page: 23262
  year: 2004
  ident: 2023012713012398600_bty140-B4
  article-title: Classification of nuclear receptors based on amino acid composition and dipeptide composition
  publication-title: J. Biol. Chem
  doi: 10.1074/jbc.M401932200
– volume: 1834
  start-page: 1461
  year: 2013
  ident: 2023012713012398600_bty140-B8
  article-title: hCKSAAP_UbSite: improved prediction of human ubiquitination sites by exploiting amino acid pattern and properties
  publication-title: Biochim. Biophys. Acta
  doi: 10.1016/j.bbapap.2013.04.006
– volume: 31
  start-page: 1857
  year: 2015
  ident: 2023012713012398600_bty140-B35
  article-title: protr/ProtrWeb: r package and web server for generating various numerical representation schemes of protein sequences
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/btv042
– volume: 3
  start-page: 153
  year: 2008
  ident: 2023012713012398600_bty140-B14
  article-title: Cell-PLoc: a package of Web servers for predicting subcellular localization of proteins in various organisms
  publication-title: Nat. Protoc
  doi: 10.1038/nprot.2007.494
– volume: 9
  start-page: 310
  year: 2008
  ident: 2023012713012398600_bty140-B34
  article-title: Computational identification of ubiquitylation sites from protein sequences
  publication-title: BMC Bioinformatics
  doi: 10.1186/1471-2105-9-310
– volume: 373
  start-page: 386
  year: 2008
  ident: 2023012713012398600_bty140-B31
  article-title: PseAAC: a flexible web server for generating various kinds of protein pseudo amino acid composition
  publication-title: Anal. Biochem
  doi: 10.1016/j.ab.2007.10.012
– volume: 25
  start-page: 3389
  year: 1997
  ident: 2023012713012398600_bty140-B1
  article-title: Gapped BLAST and PSI-BLAST: a new generation of protein database search programs
  publication-title: Nucleic Acids Res
  doi: 10.1093/nar/25.17.3389
– volume: 47
  start-page: 45
  year: 1978
  ident: 2023012713012398600_bty140-B15
  article-title: Prediction of the secondary structure of proteins from their amino acid sequence
  publication-title: Adv. Enzymol. Relat. Areas Mol. Biol
– volume: 16
  start-page: 321
  year: 2015
  ident: 2023012713012398600_bty140-B24
  article-title: Machine learning applications in genetics and genomics
  publication-title: Nat. Rev. Genet
  doi: 10.1038/nrg3920
– volume: 278
  start-page: 477
  year: 2000
  ident: 2023012713012398600_bty140-B9
  article-title: Prediction of protein subcellular locations by incorporating quasi-sequence-order effect
  publication-title: Biochem. Biophys. Res. Commun
  doi: 10.1006/bbrc.2000.3815
– volume: 34
  start-page: W32
  year: 2006
  ident: 2023012713012398600_bty140-B23
  article-title: PROFEAT: a web server for computing structural and physicochemical features of proteins and peptides from amino acid sequence
  publication-title: Nucleic Acids Res
  doi: 10.1093/nar/gkl305
– volume: 129
  start-page: 121
  year: 2006
  ident: 2023012713012398600_bty140-B32
  article-title: Population structure inferred by local spatial autocorrelation: an example from an Amerindian tribal population
  publication-title: Am. J. Phys. Anthropol
  doi: 10.1002/ajpa.20250
– volume: 21
  start-page: 10
  year: 2005
  ident: 2023012713012398600_bty140-B11
  article-title: Using amphiphilic pseudo amino acid composition to predict enzyme subfamily classes
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/bth466
– volume: 320
  start-page: 1236
  year: 2004
  ident: 2023012713012398600_bty140-B13
  article-title: Prediction of protein subcellular locations by GO-FunD-PseAA predictor
  publication-title: Biochem. Biophys. Res. Commun
  doi: 10.1016/j.bbrc.2004.06.073
– volume: 104
  start-page: 4337
  year: 2007
  ident: 2023012713012398600_bty140-B30
  article-title: Predicting protein-protein interactions based only on sequences information
  publication-title: Proc. Natl. Acad. Sci. USA
  doi: 10.1073/pnas.0607879104
– volume: 425
  start-page: 117
  year: 2012
  ident: 2023012713012398600_bty140-B16
  article-title: PseAAC-Builder: a cross-platform stand-alone program for generating various special Chou’s pseudo-amino acid compositions
  publication-title: Anal. Biochem
  doi: 10.1016/j.ab.2012.03.015
– volume: 26
  start-page: 1714
  year: 2010
  ident: 2023012713012398600_bty140-B2
  article-title: Prediction of protease substrates using sequence and structure features
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/btq267
– volume: 7
  start-page: 86
  year: 2006
  ident: 2023012713012398600_bty140-B21
  article-title: Machine learning in bioinformatics
  publication-title: Brief. Bioinform
  doi: 10.1093/bib/bbk007
– volume: 36 (Database issue)
  start-page: D202
  year: 2008
  ident: 2023012713012398600_bty140-B20
  article-title: AAindex: amino acid index database, progress report 2008
  publication-title: Nucleic Acids Res
– volume: 6
  start-page: e17331
  year: 2011
  ident: 2023012713012398600_bty140-B22
  article-title: Incorporating distant sequence features and radial basis function networks to identify ubiquitin conjugation sites
  publication-title: PLoS One
  doi: 10.1371/journal.pone.0017331
– volume: 13
  start-page: 552
  year: 2017
  ident: 2023012713012398600_bty140-B25
  article-title: iPGK-PseAAC: identify lysine phosphoglycerylation sites in proteins by incorporating four different tiers of amino acid pairwise coupling information into the general PseAAC
  publication-title: Med. Chem
  doi: 10.2174/1573406413666170515120507
– volume: 26
  start-page: 752
  year: 2010
  ident: 2023012713012398600_bty140-B33
  article-title: Cascleave: towards more accurate prediction of caspase substrate cleavage sites
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/btq043
– volume: 33
  start-page: 122
  year: 2017
  ident: 2023012713012398600_bty140-B36
  article-title: PseKRAAC: a flexible web server for generating pseudo K-tuple reduced amino acids composition
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/btw564
– volume: 31
  start-page: 279
  year: 2015
  ident: 2023012713012398600_bty140-B6
  article-title: Rcpi: r /Bioconductor package to generate various descriptors of proteins, compounds and their interactions
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/btu624
– volume: 39
  start-page: W385
  year: 2011
  ident: 2023012713012398600_bty140-B26
  article-title: Update of PROFEAT: a web server for computing structural and physicochemical features of proteins and peptides from amino acid sequence
  publication-title: Nucleic Acids Res
  doi: 10.1093/nar/gkr284
– volume: 273
  start-page: 236
  year: 2011
  ident: 2023012713012398600_bty140-B12
  article-title: Some remarks on protein attribute prediction and pseudo amino acid composition
  publication-title: J. Theor. Biol
  doi: 10.1016/j.jtbi.2010.12.024
– volume: 35
  start-page: 401
  year: 1999
  ident: 2023012713012398600_bty140-B19
  article-title: Recognition of a protein fold in the context of the Structural Classification of Proteins (SCOP) classification
  publication-title: Proteins
  doi: 10.1002/(SICI)1097-0134(19990601)35:4<401::AID-PROT3>3.0.CO;2-K
– volume: 6
  start-page: e1000636
  year: 2010
  ident: 2023012713012398600_bty140-B27
  article-title: Combining structure and sequence information allows automated prediction of substrate specificities within enzyme families
  publication-title: PLoS Comput. Biol
  doi: 10.1371/journal.pcbi.1000636
– volume-title: Adaptive Control Processes: A Guided Tour
  year: 1961
  ident: 2023012713012398600_bty140-B3
  doi: 10.1515/9781400874668
– volume: 19
  start-page: 648
  year: 2015
  ident: 2023012713012398600_bty140-B28
  article-title: Harnessing computational biology for exact linear B-cell epitope prediction: a novel amino acid composition-based feature descriptor
  publication-title: Omics
  doi: 10.1089/omi.2015.0095
– volume: 92
  start-page: 8700
  year: 1995
  ident: 2023012713012398600_bty140-B18
  article-title: Prediction of protein folding class using global description of amino acid sequence
  publication-title: Proc. Natl. Acad. Sci. USA
  doi: 10.1073/pnas.92.19.8700
– volume: 29
  start-page: 1614
  year: 2013
  ident: 2023012713012398600_bty140-B7
  article-title: Incorporating key position and amino acid residue features to identify general and species-specific Ubiquitin conjugation sites
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/btt196
– volume: 66
  start-page: 335
  year: 1994
  ident: 2023012713012398600_bty140-B29
  article-title: The rational design of amino acid sequences by artificial neural networks and simulated molecular evolution: de novo design of an idealized leader peptidase cleavage site
  publication-title: Biophys. J
  doi: 10.1016/S0006-3495(94)80782-9
– volume: 43
  start-page: 246
  year: 2001
  ident: 2023012713012398600_bty140-B10
  article-title: Prediction of protein cellular attributes using pseudo-amino acid composition
  publication-title: Proteins
  doi: 10.1002/prot.1035
– volume: 15
  start-page: 3495
  year: 2014
  ident: 2023012713012398600_bty140-B17
  article-title: PseAAC-General: fast building various modes of general form of Chou’s pseudo-amino acid composition for large-scale protein datasets
  publication-title: Int. J. Mol. Sci
  doi: 10.3390/ijms15033495
– volume: 29
  start-page: 960
  year: 2013
  ident: 2023012713012398600_bty140-B5
  article-title: propy: a tool to generate various modes of Chou’s PseAAC
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/btt072
SSID ssj0051444
ssj0005056
Score 2.6934931
Snippet Abstract Summary Structural and physiochemical descriptors extracted from sequence data have been widely used to represent sequences and predict structural,...
Structural and physiochemical descriptors extracted from sequence data have been widely used to represent sequences and predict structural, functional,...
SourceID unpaywall
pubmedcentral
proquest
pubmed
crossref
oup
SourceType Open Access Repository
Aggregation Database
Index Database
Enrichment Source
Publisher
StartPage 2499
SubjectTerms Applications Notes
Machine Learning
Molecular Sequence Annotation
Peptides - chemistry
Peptides - metabolism
Peptides - physiology
Protein Conformation
Proteins - chemistry
Proteins - metabolism
Proteins - physiology
Sequence Analysis, Protein - methods
Software
Title iFeature: a Python package and web server for features extraction and selection from protein and peptide sequences
URI https://www.ncbi.nlm.nih.gov/pubmed/29528364
https://www.proquest.com/docview/2013104689
https://pubmed.ncbi.nlm.nih.gov/PMC6658705
https://academic.oup.com/bioinformatics/article-pdf/34/14/2499/29153563/bty140.pdf
UnpaywallVersion publishedVersion
Volume 34
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
journalDatabaseRights – providerCode: PRVAFT
  databaseName: Open Access Digital Library
  customDbUrl:
  eissn: 1460-2059
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0005056
  issn: 1367-4811
  databaseCode: KQ8
  dateStart: 19960101
  isFulltext: true
  titleUrlDefault: http://grweb.coalliance.org/oadl/oadl.html
  providerName: Colorado Alliance of Research Libraries
– providerCode: PRVEBS
  databaseName: Inspec with Full Text
  customDbUrl:
  eissn: 1460-2059
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0005056
  issn: 1367-4811
  databaseCode: ADMLS
  dateStart: 19980101
  isFulltext: true
  titleUrlDefault: https://www.ebsco.com/products/research-databases/inspec-full-text
  providerName: EBSCOhost
– providerCode: PRVBFR
  databaseName: Free Medical Journals
  customDbUrl:
  eissn: 1460-2059
  dateEnd: 20241101
  omitProxy: true
  ssIdentifier: ssj0005056
  issn: 1367-4811
  databaseCode: DIK
  dateStart: 19960101
  isFulltext: true
  titleUrlDefault: http://www.freemedicaljournals.com
  providerName: Flying Publisher
– providerCode: PRVFQY
  databaseName: GFMER Free Medical Journals
  customDbUrl:
  eissn: 1460-2059
  dateEnd: 20241101
  omitProxy: true
  ssIdentifier: ssj0005056
  issn: 1367-4811
  databaseCode: GX1
  dateStart: 19960101
  isFulltext: true
  titleUrlDefault: http://www.gfmer.ch/Medical_journals/Free_medical.php
  providerName: Geneva Foundation for Medical Education and Research
– providerCode: PRVAQN
  databaseName: PubMed Central
  customDbUrl:
  eissn: 1460-2059
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0005056
  issn: 1367-4811
  databaseCode: RPM
  dateStart: 20070101
  isFulltext: true
  titleUrlDefault: https://www.ncbi.nlm.nih.gov/pmc/
  providerName: National Library of Medicine
– providerCode: PRVOVD
  databaseName: Journals@Ovid LWW All Open Access Journal Collection Rolling
  customDbUrl:
  eissn: 1460-2059
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0005056
  issn: 1367-4811
  databaseCode: OVEED
  dateStart: 20010101
  isFulltext: true
  titleUrlDefault: http://ovidsp.ovid.com/
  providerName: Ovid
– providerCode: PRVASL
  databaseName: Oxford Journals Open Access Collection
  customDbUrl:
  eissn: 1460-2059
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0005056
  issn: 1367-4811
  databaseCode: TOX
  dateStart: 19850101
  isFulltext: true
  titleUrlDefault: https://academic.oup.com/journals/
  providerName: Oxford University Press
– providerCode: PRVASL
  databaseName: Oxford Journals Open Access Collection
  customDbUrl:
  eissn: 1460-2059
  dateEnd: 20220930
  omitProxy: true
  ssIdentifier: ssj0005056
  issn: 1367-4811
  databaseCode: TOX
  dateStart: 19850101
  isFulltext: true
  titleUrlDefault: https://academic.oup.com/journals/
  providerName: Oxford University Press
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwrV1Ja9wwFH4kE0p76b5Ml6BCLz14vEiWrd5CaRoKTUPJwPRkJEsiQwbP4BkTJr--T5ZtMumhKfRibGuxJGv5nvTe9wA-5DQ3cWJ5IIRlAROI4ZTMRBCVwnGpWBybzt75-yk_mbJvs3S2Bz97WxjZaYVPepMGNV92FKKOtjjs2jNYaRtSFsYsRPlBhInAkZtyGqrNFkWGCQbvwwFPEZ-P4GB6enb0yxtgZQHLW3fJ3X0c92Y9gt7-ls9rZ8HaMYK7gUX_VKm831Qrub2Si8WN9er4Eaz7mno1lctJs1GT8voWCeT_bYrH8LCDt-TIJ3oCe6Z6Cve8w8vtM6jnDnI2tflEJDnbOt4CgkL7JU5qRFaa4JxO3DaxqQmWgFgfeU1wBam9BUYbbd0673FPzjqGtFwTcx-0clo62pBBRfw5TI-_nH8-CTqvD0GJ6G0TCBRiOEo9JaVGSpGyODJW6pRJQZViWYRhijOrpJRZbKjNIyu0UiaxpS15Sl_AqFpW5hUQmuUcAYlEHKmY1kJm2uILbXWamEToMbD-1xZlR4nuPHMsCn80T4vdJi98k45hMiRbeU6QvyX4iP_wrnHf972rwJHujm9kZZbNukgcNVLEeC7G8NL3tiHLRDiSHs7GkO30wyGCYxHfDanmFy2bOEcMmkXpGMKhx96tpK__OcUbeICVyN3WeJy-hdGmbsw7xHQbdQj7X2cxXs9_zA67AfsbcMxXSw
linkProvider Unpaywall
linkToUnpaywall http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwrV1La9wwEB7SDaW99N3GfaFCLz34KVm2egulIRQaQulCejKSJdEli3fxrgnbX9-RZS9xemgKvdnWw3rrG2nmG4D3JS1NmlkeCmFZyARiOCULESa1cFwqFuems3f-esZP5-zLRX5xAN9GWxg5aIVHo0mDWqwGClFHWxwP7RmutY0pi1MWo_wg4kzgzM05jdV2hyJDhMF34JDniM9ncDg_Oz_-4Q2wipCVvbvk4TlNR7MeQW_-y-c12bAmRnDXsOifKpX3umYtd1dyuby2X508hM1YU6-mchl1WxXVv26QQP7fpngEDwZ4S459osdwYJoncNc7vNw9hXbhIGfXmo9EkvOd4y0gKLRf4qJGZKMJrunEHROblmAJiPWRNwR3kNZbYPTRNr3zHvfmrGNIzzWx8EFrp6WjDdmriD-D-cnn759Ow8HrQ1gjetuGAoUYjlJPTamRUuQsTYyVOmdSUKVYkWCY4swqKWWRGmrLxAqtlMlsbWue0-cwa1aNOQJCi5IjIJGIIxXTWshCW_ygrc4zkwkdABu7tqoHSnTnmWNZ-at5Wk2bvPJNGkC0T7b2nCB_S_AB-_C2cd-No6vCme6ub2RjVt2myhw1UsJ4KQJ44UfbPstMOJIezgIoJuNwH8GxiE9DmsXPnk2cIwYtkjyAeD9ib1fSl_-c4hXcx0qU7mg8zV_DbNt25g1iuq16O0zS32qyVT4
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=iFeature%3A+a+Python+package+and+web+server+for+features+extraction+and+selection+from+protein+and+peptide+sequences&rft.jtitle=Bioinformatics&rft.au=Chen%2C+Zhen&rft.au=Zhao%2C+Pei&rft.au=Li%2C+Fuyi&rft.au=Leier%2C+Andr%C3%A9&rft.date=2018-07-15&rft.pub=Oxford+University+Press&rft.issn=1367-4803&rft.eissn=1460-2059&rft.volume=34&rft.issue=14&rft.spage=2499&rft.epage=2502&rft_id=info:doi/10.1093%2Fbioinformatics%2Fbty140&rft.externalDocID=10.1093%2Fbioinformatics%2Fbty140
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1367-4803&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1367-4803&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1367-4803&client=summon