A novel substitution matrix fitted to the compositional bias in Mollicutes improves the prediction of homologous relationships

Background Substitution matrices are key parameters for the alignment of two protein sequences, and consequently for most comparative genomics studies. The composition of biological sequences can vary importantly between species and groups of species, and classical matrices such as those in the BLOS...

Full description

Saved in:
Bibliographic Details
Published inBMC bioinformatics Vol. 12; no. 1; p. 457
Main Authors Lemaitre, Claire, Barré, Aurélien, Citti, Christine, Tardy, Florence, Thiaucourt, François, Sirand-Pugnet, Pascal, Thébault, Patricia
Format Journal Article
LanguageEnglish
Published London BioMed Central 24.11.2011
BioMed Central Ltd
Springer Nature B.V
BMC
Subjects
Online AccessGet full text
ISSN1471-2105
1471-2105
DOI10.1186/1471-2105-12-457

Cover

Abstract Background Substitution matrices are key parameters for the alignment of two protein sequences, and consequently for most comparative genomics studies. The composition of biological sequences can vary importantly between species and groups of species, and classical matrices such as those in the BLOSUM series fail to accurately estimate alignment scores and statistical significance with sequences sharing marked compositional biases. Results We present a general and simple methodology to build matrices that are especially fitted to the compositional bias of proteins. Our approach is inspired from the one used to build the BLOSUM matrices and is based on learning substitution and amino acid frequencies on real sequences with the corresponding compositional bias. We applied it to the large scale comparison of Mollicute AT-rich genomes. The new matrix, MOLLI60, was used to predict pairwise orthology relationships, as well as homolog families among 24 Mollicute genomes. We show that this new matrix enables to better discriminate between true and false orthologs and improves the clustering of homologous proteins, with respect to the use of the classical matrix BLOSUM62. Conclusions We show in this paper that well-fitted matrices can improve the predictions of orthologous and homologous relationships among proteins with a similar compositional bias. With the ever-increasing number of sequenced genomes, our approach could prove valuable in numerous comparative studies focusing on atypical genomes.
AbstractList Abstract Background: Substitution matrices are key parameters for the alignment of two protein sequences, and consequently for most comparative genomics studies. The composition of biological sequences can vary importantly between species and groups of species, and classical matrices such as those in the BLOSUM series fail to accurately estimate alignment scores and statistical significance with sequences sharing marked compositional biases. Results: We present a general and simple methodology to build matrices that are especially fitted to the compositional bias of proteins. Our approach is inspired from the one used to build the BLOSUM matrices and is based on learning substitution and amino acid frequencies on real sequences with the corresponding compositional bias. We applied it to the large scale comparison of Mollicute AT-rich genomes. The new matrix, MOLLI60, was used to predict pairwise orthology relationships, as well as homolog families among 24 Mollicute genomes. We show that this new matrix enables to better discriminate between true and false orthologs and improves the clustering of homologous proteins, with respect to the use of the classical matrix BLOSUM62. Conclusions: We show in this paper that well-fitted matrices can improve the predictions of orthologous and homologous relationships among proteins with a similar compositional bias. With the ever-increasing number of sequenced genomes, our approach could prove valuable in numerous comparative studies focusing on atypical genomes.
Substitution matrices are key parameters for the alignment of two protein sequences, and consequently for most comparative genomics studies. The composition of biological sequences can vary importantly between species and groups of species, and classical matrices such as those in the BLOSUM series fail to accurately estimate alignment scores and statistical significance with sequences sharing marked compositional biases. We present a general and simple methodology to build matrices that are especially fitted to the compositional bias of proteins. Our approach is inspired from the one used to build the BLOSUM matrices and is based on learning substitution and amino acid frequencies on real sequences with the corresponding compositional bias. We applied it to the large scale comparison of Mollicute AT-rich genomes. The new matrix, MOLLI60, was used to predict pairwise orthology relationships, as well as homolog families among 24 Mollicute genomes. We show that this new matrix enables to better discriminate between true and false orthologs and improves the clustering of homologous proteins, with respect to the use of the classical matrix BLOSUM62. We show in this paper that well-fitted matrices can improve the predictions of orthologous and homologous relationships among proteins with a similar compositional bias. With the ever-increasing number of sequenced genomes, our approach could prove valuable in numerous comparative studies focusing on atypical genomes.
Substitution matrices are key parameters for the alignment of two protein sequences, and consequently for most comparative genomics studies. The composition of biological sequences can vary importantly between species and groups of species, and classical matrices such as those in the BLOSUM series fail to accurately estimate alignment scores and statistical significance with sequences sharing marked compositional biases. We present a general and simple methodology to build matrices that are especially fitted to the compositional bias of proteins. Our approach is inspired from the one used to build the BLOSUM matrices and is based on learning substitution and amino acid frequencies on real sequences with the corresponding compositional bias. We applied it to the large scale comparison of Mollicute AT-rich genomes. The new matrix, MOLLI60, was used to predict pairwise orthology relationships, as well as homolog families among 24 Mollicute genomes. We show that this new matrix enables to better discriminate between true and false orthologs and improves the clustering of homologous proteins, with respect to the use of the classical matrix BLOSUM62. We show in this paper that well-fitted matrices can improve the predictions of orthologous and homologous relationships among proteins with a similar compositional bias. With the ever-increasing number of sequenced genomes, our approach could prove valuable in numerous comparative studies focusing on atypical genomes.
Background Substitution matrices are key parameters for the alignment of two protein sequences, and consequently for most comparative genomics studies. The composition of biological sequences can vary importantly between species and groups of species, and classical matrices such as those in the BLOSUM series fail to accurately estimate alignment scores and statistical significance with sequences sharing marked compositional biases. Results We present a general and simple methodology to build matrices that are especially fitted to the compositional bias of proteins. Our approach is inspired from the one used to build the BLOSUM matrices and is based on learning substitution and amino acid frequencies on real sequences with the corresponding compositional bias. We applied it to the large scale comparison of Mollicute AT-rich genomes. The new matrix, MOLLI60, was used to predict pairwise orthology relationships, as well as homolog families among 24 Mollicute genomes. We show that this new matrix enables to better discriminate between true and false orthologs and improves the clustering of homologous proteins, with respect to the use of the classical matrix BLOSUM62. Conclusions We show in this paper that well-fitted matrices can improve the predictions of orthologous and homologous relationships among proteins with a similar compositional bias. With the ever-increasing number of sequenced genomes, our approach could prove valuable in numerous comparative studies focusing on atypical genomes.
Background Substitution matrices are key parameters for the alignment of two protein sequences, and consequently for most comparative genomics studies. The composition of biological sequences can vary importantly between species and groups of species, and classical matrices such as those in the BLOSUM series fail to accurately estimate alignment scores and statistical significance with sequences sharing marked compositional biases. Results We present a general and simple methodology to build matrices that are especially fitted to the compositional bias of proteins. Our approach is inspired from the one used to build the BLOSUM matrices and is based on learning substitution and amino acid frequencies on real sequences with the corresponding compositional bias. We applied it to the large scale comparison of Mollicute AT-rich genomes. The new matrix, MOLLI60, was used to predict pairwise orthology relationships, as well as homolog families among 24 Mollicute genomes. We show that this new matrix enables to better discriminate between true and false orthologs and improves the clustering of homologous proteins, with respect to the use of the classical matrix BLOSUM62. Conclusions We show in this paper that well-fitted matrices can improve the predictions of orthologous and homologous relationships among proteins with a similar compositional bias. With the ever-increasing number of sequenced genomes, our approach could prove valuable in numerous comparative studies focusing on atypical genomes.
Substitution matrices are key parameters for the alignment of two protein sequences, and consequently for most comparative genomics studies. The composition of biological sequences can vary importantly between species and groups of species, and classical matrices such as those in the BLOSUM series fail to accurately estimate alignment scores and statistical significance with sequences sharing marked compositional biases.BACKGROUNDSubstitution matrices are key parameters for the alignment of two protein sequences, and consequently for most comparative genomics studies. The composition of biological sequences can vary importantly between species and groups of species, and classical matrices such as those in the BLOSUM series fail to accurately estimate alignment scores and statistical significance with sequences sharing marked compositional biases.We present a general and simple methodology to build matrices that are especially fitted to the compositional bias of proteins. Our approach is inspired from the one used to build the BLOSUM matrices and is based on learning substitution and amino acid frequencies on real sequences with the corresponding compositional bias. We applied it to the large scale comparison of Mollicute AT-rich genomes. The new matrix, MOLLI60, was used to predict pairwise orthology relationships, as well as homolog families among 24 Mollicute genomes. We show that this new matrix enables to better discriminate between true and false orthologs and improves the clustering of homologous proteins, with respect to the use of the classical matrix BLOSUM62.RESULTSWe present a general and simple methodology to build matrices that are especially fitted to the compositional bias of proteins. Our approach is inspired from the one used to build the BLOSUM matrices and is based on learning substitution and amino acid frequencies on real sequences with the corresponding compositional bias. We applied it to the large scale comparison of Mollicute AT-rich genomes. The new matrix, MOLLI60, was used to predict pairwise orthology relationships, as well as homolog families among 24 Mollicute genomes. We show that this new matrix enables to better discriminate between true and false orthologs and improves the clustering of homologous proteins, with respect to the use of the classical matrix BLOSUM62.We show in this paper that well-fitted matrices can improve the predictions of orthologous and homologous relationships among proteins with a similar compositional bias. With the ever-increasing number of sequenced genomes, our approach could prove valuable in numerous comparative studies focusing on atypical genomes.CONCLUSIONSWe show in this paper that well-fitted matrices can improve the predictions of orthologous and homologous relationships among proteins with a similar compositional bias. With the ever-increasing number of sequenced genomes, our approach could prove valuable in numerous comparative studies focusing on atypical genomes.
ArticleNumber 457
Audience Academic
Author Sirand-Pugnet, Pascal
Thiaucourt, François
Lemaitre, Claire
Barré, Aurélien
Citti, Christine
Tardy, Florence
Thébault, Patricia
AuthorAffiliation 3 Université de Toulouse, ENVT, UMR 1225, F-31076 Toulouse, France
8 INRA, UMR 1332, 71, avenue Edouard Bourlaux, F-33140 Villenave d'Ornon, France
9 Université de Bordeaux, Laboratoire Bordelais de Recherche en Informatique, UMR 5800, F-33405 Talence, France
1 Université de Bordeaux, Centre de Bioinformatique et Génomique Fonctionnelle Bordeaux, F-33000 Bordeaux, France
5 Anses, Lyon Laboratory, UMR Mycoplasmoses of Ruminants, 31 Avenue Tony Garnier F-69364 Lyon cedex 07, France
4 INRA, UMR 1225, F-31076 Toulouse, France
7 Université de Bordeaux, UMR 1332, 71, avenue Edouard Bourlaux, F-33140 Villenave d'Ornon, France
2 Equipe SYMBIOSE, INRIA Rennes Bretagne Atlantique, Campus de Beaulieu, F-35042 Rennes, France
6 CIRAD, UMR CMAEE, Campus de Baillarguet, F-34398 Montpellier, France
AuthorAffiliation_xml – name: 9 Université de Bordeaux, Laboratoire Bordelais de Recherche en Informatique, UMR 5800, F-33405 Talence, France
– name: 6 CIRAD, UMR CMAEE, Campus de Baillarguet, F-34398 Montpellier, France
– name: 3 Université de Toulouse, ENVT, UMR 1225, F-31076 Toulouse, France
– name: 4 INRA, UMR 1225, F-31076 Toulouse, France
– name: 8 INRA, UMR 1332, 71, avenue Edouard Bourlaux, F-33140 Villenave d'Ornon, France
– name: 5 Anses, Lyon Laboratory, UMR Mycoplasmoses of Ruminants, 31 Avenue Tony Garnier F-69364 Lyon cedex 07, France
– name: 2 Equipe SYMBIOSE, INRIA Rennes Bretagne Atlantique, Campus de Beaulieu, F-35042 Rennes, France
– name: 1 Université de Bordeaux, Centre de Bioinformatique et Génomique Fonctionnelle Bordeaux, F-33000 Bordeaux, France
– name: 7 Université de Bordeaux, UMR 1332, 71, avenue Edouard Bourlaux, F-33140 Villenave d'Ornon, France
Author_xml – sequence: 1
  givenname: Claire
  surname: Lemaitre
  fullname: Lemaitre, Claire
  email: claire.lemaitre@inria.fr
  organization: Centre de Bioinformatique et Génomique Fonctionnelle Bordeaux, Université de Bordeaux, Equipe SYMBIOSE, INRIA Rennes Bretagne Atlantique
– sequence: 2
  givenname: Aurélien
  surname: Barré
  fullname: Barré, Aurélien
  organization: Centre de Bioinformatique et Génomique Fonctionnelle Bordeaux, Université de Bordeaux
– sequence: 3
  givenname: Christine
  surname: Citti
  fullname: Citti, Christine
  organization: Université de Toulouse, ENVT, UMR 1225, INRA, UMR 1225
– sequence: 4
  givenname: Florence
  surname: Tardy
  fullname: Tardy, Florence
  organization: Anses, Lyon Laboratory, UMR Mycoplasmoses of Ruminants
– sequence: 5
  givenname: François
  surname: Thiaucourt
  fullname: Thiaucourt, François
  organization: CIRAD, UMR CMAEE
– sequence: 6
  givenname: Pascal
  surname: Sirand-Pugnet
  fullname: Sirand-Pugnet, Pascal
  organization: Université de Bordeaux, UMR 1332, INRA, UMR 1332
– sequence: 7
  givenname: Patricia
  surname: Thébault
  fullname: Thébault, Patricia
  organization: Centre de Bioinformatique et Génomique Fonctionnelle Bordeaux, Université de Bordeaux, Laboratoire Bordelais de Recherche en Informatique, UMR 5800, Université de Bordeaux
BackLink https://www.ncbi.nlm.nih.gov/pubmed/22115330$$D View this record in MEDLINE/PubMed
https://inria.hal.science/hal-00784414$$DView record in HAL
BookMark eNqNkkuP0zAUhSM0iHnAnhWKYIFm0cF2nNdmpGoETKUiJB5r68aP1pUTB9spMxt-O05byrQChLJIcvOdk6tzfJ6cdLaTSfIcoyuMq-INpiWeEIzyCSYTmpePkrP96OTB82ly7v0KIVxWKH-SnBKCcZ5l6Cz5MU07u5Ym9UPjgw5D0LZLWwhO36VKhyBFGmwaljLltu2t1yMAJm00-FR36QdrjOZDkPGt7V308hu6d1JovnGzKl3a1hq7sINPnTQwjv1S9_5p8liB8fLZ7n6RfH339svN7WT-8f3sZjqf8KLIwyQnpIJayhwaiquaQ6YILjBUWNRU1ryuKlSKOlOQq7xuSoI4LSlHQjUKRIOyi2S29RUWVqx3ugV3zyxothlYt2DgguZGMqFKIURGIBMFlahuCCaSCwQAigvOoxfeeg1dD_ffwZi9IUZs7IWNwbMxeIYJi71EzfVW0w9NKwWXXXBgDhY5_NLpJVvYNcsIrapqNLjcGiyPZLfTORtnCJUVpZiucWRf737m7LdB-sBa7bk0BjoZG2A1ziilWUki-fKIXNnBxXo3EI6WZR2hV1toATEe3SkbN-SjJZuSMq9rUpAsUld_oOIlZKt5PLZKx_mB4PJAEJkg78ICBu_Z7POnQ_bFw_j2Afw6xxEotgB31nsnFeM6bI5Z3EKbfxWDjoT_0eWufx_RbiHd79D-qvkJ0FglTA
CitedBy_id crossref_primary_10_1186_s12859_015_0639_4
crossref_primary_10_1093_bioinformatics_btt694
crossref_primary_10_1016_j_vetmic_2013_10_025
crossref_primary_10_1098_rsos_220939
crossref_primary_10_1186_s12859_020_03616_0
crossref_primary_10_3389_fbinf_2023_1227193
crossref_primary_10_1093_gbe_evu020
crossref_primary_10_1098_rsos_150196
crossref_primary_10_1111_syen_12217
Cites_doi 10.1093/nar/gkn635
10.1093/oxfordjournals.molbev.a026257
10.1093/nar/29.2.351
10.1016/0888-7543(91)90071-L
10.1093/bioinformatics/18.11.1500
10.1186/1471-2105-12-77
10.1038/nature01097
10.1371/journal.pgen.1000677
10.1016/j.resmic.2007.09.007
10.1093/nar/gkh114
10.1093/nar/19.23.6565
10.1186/1471-2105-9-236
10.1093/nar/gkp985
10.1146/annurev.genet.39.073003.114725
10.1073/pnas.85.8.2444
10.1016/j.patrec.2005.10.010
10.1016/j.crvi.2005.02.002
10.1186/gb-2008-9-10-235
10.1128/EC.5.4.628-637.2006
10.1371/journal.pcbi.1000262
10.1093/bioinformatics/bti070
10.1016/0022-2836(81)90087-5
10.1016/j.tig.2008.08.009
10.1002/prot.340170108
10.1006/jmbi.2000.4042
10.1093/nar/25.17.3389
ContentType Journal Article
Copyright Lemaitre et al; licensee BioMed Central Ltd. 2011
2011 Lemaitre et al; licensee BioMed Central Ltd.
COPYRIGHT 2011 BioMed Central Ltd.
2011 Lemaitre et al; licensee BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Distributed under a Creative Commons Attribution 4.0 International License
Copyright ©2011 Lemaitre et al; licensee BioMed Central Ltd. 2011 Lemaitre et al; licensee BioMed Central Ltd.
Copyright_xml – notice: Lemaitre et al; licensee BioMed Central Ltd. 2011
– notice: 2011 Lemaitre et al; licensee BioMed Central Ltd.
– notice: COPYRIGHT 2011 BioMed Central Ltd.
– notice: 2011 Lemaitre et al; licensee BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
– notice: Distributed under a Creative Commons Attribution 4.0 International License
– notice: Copyright ©2011 Lemaitre et al; licensee BioMed Central Ltd. 2011 Lemaitre et al; licensee BioMed Central Ltd.
DBID C6C
AAYXX
CITATION
CGR
CUY
CVF
ECM
EIF
NPM
ISR
3V.
7QO
7SC
7X7
7XB
88E
8AL
8AO
8FD
8FE
8FG
8FH
8FI
8FJ
8FK
ABUWG
AEUYN
AFKRA
ARAPS
AZQEC
BBNVY
BENPR
BGLVJ
BHPHI
CCPQU
DWQXO
FR3
FYUFA
GHDGH
GNUQQ
HCIFZ
JQ2
K7-
K9.
L7M
LK8
L~C
L~D
M0N
M0S
M1P
M7P
P5Z
P62
P64
PHGZM
PHGZT
PIMPY
PJZUB
PKEHL
PPXIY
PQEST
PQGLB
PQQKQ
PQUKI
PRINS
Q9U
7X8
1XC
VOOES
5PM
ADTOC
UNPAY
DOA
DOI 10.1186/1471-2105-12-457
DatabaseName Springer Nature OA Free Journals
CrossRef
Medline
MEDLINE
MEDLINE (Ovid)
MEDLINE
MEDLINE
PubMed
Gale In Context: Science
ProQuest Central (Corporate)
Biotechnology Research Abstracts
Computer and Information Systems Abstracts
Health & Medical Collection
ProQuest Central (purchase pre-March 2016)
Medical Database (Alumni Edition)
Computing Database (Alumni Edition)
ProQuest Pharma Collection
Technology Research Database
ProQuest SciTech Collection
ProQuest Technology Collection
ProQuest Natural Science Journals
Hospital Premium Collection
Hospital Premium Collection (Alumni Edition)
ProQuest Central (Alumni) (purchase pre-March 2016)
ProQuest Central (Alumni)
ProQuest One Sustainability
ProQuest Central
Advanced Technologies & Computer Science Collection
ProQuest Central Essentials - QC
Biological Science Collection
ProQuest Central
Technology Collection (via ProQuest SciTech Premium Collection)
Natural Science Collection
ProQuest One
ProQuest Central
Engineering Research Database
Health Research Premium Collection
Health Research Premium Collection (Alumni)
ProQuest Central Student
SciTech Premium Collection
ProQuest Computer Science Collection
Computer Science Database (Proquest)
ProQuest Health & Medical Complete (Alumni)
Advanced Technologies Database with Aerospace
Biological Sciences
Computer and Information Systems Abstracts – Academic
Computer and Information Systems Abstracts Professional
Computing Database
ProQuest Health & Medical Collection
Medical Database
Biological Science Database
Advanced Technologies & Aerospace Database
ProQuest Advanced Technologies & Aerospace Collection
Biotechnology and BioEngineering Abstracts
ProQuest Central Premium
ProQuest One Academic (New)
ProQuest Publicly Available Content Database
ProQuest Health & Medical Research Collection
ProQuest One Academic Middle East (New)
ProQuest One Health & Nursing
ProQuest One Academic Eastern Edition (DO NOT USE)
ProQuest One Applied & Life Sciences
ProQuest One Academic
ProQuest One Academic UKI Edition
ProQuest Central China
ProQuest Central Basic
MEDLINE - Academic
Hyper Article en Ligne (HAL)
Hyper Article en Ligne (HAL) (Open Access)
PubMed Central (Full Participant titles)
Unpaywall for CDI: Periodical Content
Unpaywall
DOAJ Directory of Open Access Journals
DatabaseTitle CrossRef
MEDLINE
Medline Complete
MEDLINE with Full Text
PubMed
MEDLINE (Ovid)
Publicly Available Content Database
Computer Science Database
ProQuest Central Student
ProQuest Advanced Technologies & Aerospace Collection
ProQuest Central Essentials
ProQuest Computer Science Collection
Computer and Information Systems Abstracts
SciTech Premium Collection
ProQuest Central China
ProQuest One Applied & Life Sciences
ProQuest One Sustainability
Health Research Premium Collection
Natural Science Collection
Health & Medical Research Collection
Biological Science Collection
ProQuest Central (New)
ProQuest Medical Library (Alumni)
Advanced Technologies & Aerospace Collection
ProQuest Biological Science Collection
ProQuest One Academic Eastern Edition
ProQuest Hospital Collection
ProQuest Technology Collection
Health Research Premium Collection (Alumni)
Biological Science Database
ProQuest Hospital Collection (Alumni)
Biotechnology and BioEngineering Abstracts
ProQuest Health & Medical Complete
ProQuest One Academic UKI Edition
Engineering Research Database
ProQuest One Academic
ProQuest One Academic (New)
Technology Collection
Technology Research Database
Computer and Information Systems Abstracts – Academic
ProQuest One Academic Middle East (New)
ProQuest Health & Medical Complete (Alumni)
ProQuest Central (Alumni Edition)
ProQuest One Community College
ProQuest One Health & Nursing
ProQuest Natural Science Collection
ProQuest Pharma Collection
ProQuest Central
ProQuest Health & Medical Research Collection
Biotechnology Research Abstracts
Health and Medicine Complete (Alumni Edition)
ProQuest Central Korea
Advanced Technologies Database with Aerospace
ProQuest Computing
ProQuest Central Basic
ProQuest Computing (Alumni Edition)
ProQuest SciTech Collection
Computer and Information Systems Abstracts Professional
Advanced Technologies & Aerospace Database
ProQuest Medical Library
ProQuest Central (Alumni)
MEDLINE - Academic
DatabaseTitleList Publicly Available Content Database

MEDLINE





MEDLINE - Academic
Database_xml – sequence: 1
  dbid: C6C
  name: Springer Nature OA Free Journals
  url: http://www.springeropen.com/
  sourceTypes: Publisher
– sequence: 2
  dbid: DOA
  name: DOAJ Directory of Open Access Journals
  url: https://www.doaj.org/
  sourceTypes: Open Website
– sequence: 3
  dbid: NPM
  name: PubMed
  url: https://proxy.k.utb.cz/login?url=http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed
  sourceTypes: Index Database
– sequence: 4
  dbid: EIF
  name: MEDLINE
  url: https://proxy.k.utb.cz/login?url=https://www.webofscience.com/wos/medline/basic-search
  sourceTypes: Index Database
– sequence: 5
  dbid: UNPAY
  name: Unpaywall
  url: https://proxy.k.utb.cz/login?url=https://unpaywall.org/
  sourceTypes: Open Access Repository
– sequence: 6
  dbid: 8FG
  name: ProQuest Technology Collection
  url: https://search.proquest.com/technologycollection1
  sourceTypes: Aggregation Database
DeliveryMethod fulltext_linktorsrc
Discipline Biology
Computer Science
EISSN 1471-2105
EndPage 457
ExternalDocumentID oai_doaj_org_article_df7ddd32a3d64e09b212ecd0aaafcdcc
10.1186/1471-2105-12-457
PMC3248887
oai:HAL:hal-00784414v1
2550013641
A275992623
22115330
10_1186_1471_2105_12_457
Genre Research Support, Non-U.S. Gov't
Journal Article
GeographicLocations France
GeographicLocations_xml – name: France
GroupedDBID ---
0R~
23N
2VQ
2WC
4.4
53G
5VS
6J9
7X7
88E
8AO
8FE
8FG
8FH
8FI
8FJ
AAFWJ
AAJSJ
AAKPC
AASML
ABDBF
ABUWG
ACGFO
ACGFS
ACIHN
ACIWK
ACPRK
ACUHS
ADBBV
ADMLS
ADRAZ
ADUKV
AEAQA
AENEX
AEUYN
AFKRA
AFPKN
AFRAH
AHBYD
AHMBA
AHSBF
AHYZX
ALMA_UNASSIGNED_HOLDINGS
AMKLP
AMTXH
AOIJS
ARAPS
AZQEC
BAPOH
BAWUL
BBNVY
BCNDV
BENPR
BFQNJ
BGLVJ
BHPHI
BMC
BPHCQ
BVXVI
C1A
C6C
CCPQU
CS3
DIK
DU5
DWQXO
E3Z
EAD
EAP
EAS
EBD
EBLON
EBS
EJD
EMB
EMK
EMOBN
ESX
F5P
FYUFA
GNUQQ
GROUPED_DOAJ
GX1
H13
HCIFZ
HMCUK
HYE
IAO
ICD
IHR
INH
INR
IPNFZ
ISR
ITC
K6V
K7-
KQ8
LK8
M1P
M48
M7P
MK~
ML0
M~E
O5R
O5S
OK1
OVT
P2P
P62
PGMZT
PHGZM
PHGZT
PIMPY
PJZUB
PPXIY
PQGLB
PQQKQ
PROAC
PSQYO
PUEGO
RBZ
RIG
RNS
ROL
RPM
RSV
SBL
SOJ
SV3
TR2
TUS
UKHRP
W2D
WOQ
WOW
XH6
XSB
AAYXX
CITATION
ALIPV
CGR
CUY
CVF
ECM
EIF
NPM
3V.
7QO
7SC
7XB
8AL
8FD
8FK
FR3
JQ2
K9.
L7M
L~C
L~D
M0N
P64
PKEHL
PQEST
PQUKI
PRINS
Q9U
7X8
123
1XC
VOOES
5PM
ADTOC
UNPAY
ID FETCH-LOGICAL-c665t-5228a9ee5ab4189ca3f2161a81d94e9c98807d93fa5f59b720c474c0dfbfadb03
IEDL.DBID M48
ISSN 1471-2105
IngestDate Tue Oct 14 19:08:13 EDT 2025
Sun Oct 26 04:12:40 EDT 2025
Tue Sep 30 16:55:58 EDT 2025
Tue Oct 14 20:37:00 EDT 2025
Fri Sep 05 10:03:30 EDT 2025
Mon Oct 06 18:30:41 EDT 2025
Mon Oct 20 21:57:55 EDT 2025
Mon Oct 20 16:35:50 EDT 2025
Thu Oct 16 14:15:54 EDT 2025
Mon Jul 21 06:02:21 EDT 2025
Thu Apr 24 23:06:46 EDT 2025
Wed Oct 01 04:15:20 EDT 2025
Sat Sep 06 07:27:16 EDT 2025
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Issue 1
Keywords Domain Architecture
Pairwise Alignment
Core Genome
Substitution Matrix
Amino Acid Frequency
mollicutes
mathematical and computational biology
biochemistry and molecular biology
biotechnology and applied microbiology
orthologous predictions
substitution matrix
Language English
License http://creativecommons.org/licenses/by/2.0
2011 Lemaitre et al; licensee BioMed Central Ltd.
Distributed under a Creative Commons Attribution 4.0 International License: http://creativecommons.org/licenses/by/4.0
This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
cc-by
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c665t-5228a9ee5ab4189ca3f2161a81d94e9c98807d93fa5f59b720c474c0dfbfadb03
Notes ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
content type line 23
ORCID 0000-0003-2276-4573
0000-0003-4754-6003
0000-0003-3968-4801
0000-0003-2613-0762
0000-0001-8675-170X
OpenAccessLink http://journals.scholarsportal.info/openUrl.xqy?doi=10.1186/1471-2105-12-457
PMID 22115330
PQID 913107879
PQPubID 44065
ParticipantIDs doaj_primary_oai_doaj_org_article_df7ddd32a3d64e09b212ecd0aaafcdcc
unpaywall_primary_10_1186_1471_2105_12_457
pubmedcentral_primary_oai_pubmedcentral_nih_gov_3248887
hal_primary_oai_HAL_hal_00784414v1
proquest_miscellaneous_913444372
proquest_journals_913107879
gale_infotracmisc_A275992623
gale_infotracacademiconefile_A275992623
gale_incontextgauss_ISR_A275992623
pubmed_primary_22115330
crossref_citationtrail_10_1186_1471_2105_12_457
crossref_primary_10_1186_1471_2105_12_457
springer_journals_10_1186_1471_2105_12_457
ProviderPackageCode CITATION
AAYXX
PublicationCentury 2000
PublicationDate 2011-11-24
PublicationDateYYYYMMDD 2011-11-24
PublicationDate_xml – month: 11
  year: 2011
  text: 2011-11-24
  day: 24
PublicationDecade 2010
PublicationPlace London
PublicationPlace_xml – name: London
– name: England
PublicationTitle BMC bioinformatics
PublicationTitleAbbrev BMC Bioinformatics
PublicationTitleAlternate BMC Bioinformatics
PublicationYear 2011
Publisher BioMed Central
BioMed Central Ltd
Springer Nature B.V
BMC
Publisher_xml – name: BioMed Central
– name: BioMed Central Ltd
– name: Springer Nature B.V
– name: BMC
References GA Singer (4981_CR12) 2000; 17
WR Pearson (4981_CR18) 1991; 11
M Dayhoff (4981_CR2) 1978
JT Reese (4981_CR17) 2002; 18
RD Finn (4981_CR29) 2010; 38
SF Altschul (4981_CR20) 2001; 29
Y Yu (4981_CR4) 2003
A Barré (4981_CR11) 2004; 32
A Kuzniar (4981_CR23) 2008; 24
JE Coronado (4981_CR5) 2006; 5
P Sirand-Pugnet (4981_CR10) 2007; 158
S Henikoff (4981_CR16) 1993; 17
S Henikoff (4981_CR15) 1991; 19
EV Koonin (4981_CR21) 2005; 39
SF Altschul (4981_CR19) 1997; 25
AM Altenhoff (4981_CR24) 2009; 5
T Gabaldón (4981_CR22) 2008; 9
C Notredame (4981_CR13) 2000; 302
K Brick (4981_CR6) 2008; 9
MJ Gardner (4981_CR3) 2002; 419
T Fawcett (4981_CR14) 2006; 27
U Paila (4981_CR7) 2008; 36
Y Yu (4981_CR8) 2005; 21
X Robin (4981_CR28) 2011; 12
O Bastien (4981_CR9) 2005; 328
WR Pearson (4981_CR26) 1988; 85
TF Smith (4981_CR27) 1981; 147
S Henikoff (4981_CR1) 1992
S Pereyre (4981_CR25) 2009; 5
19816563 - PLoS Genet. 2009 Oct;5(10):e1000677
7265238 - J Mol Biol. 1981 Mar 25;147(1):195-7
16607010 - Eukaryot Cell. 2006 Apr;5(4):628-37
18485187 - BMC Bioinformatics. 2008;9:236
11070046 - Mol Biol Evol. 2000 Nov;17(11):1581-8
14663142 - Proc Natl Acad Sci U S A. 2003 Dec 23;100(26):15688-93
11139604 - Nucleic Acids Res. 2001 Jan 15;29(2):351-61
21414208 - BMC Bioinformatics. 2011;12:77
16285863 - Annu Rev Genet. 2005;39:309-38
10964570 - J Mol Biol. 2000 Sep 8;302(1):205-17
18983710 - Genome Biol. 2008;9(10):235
8234244 - Proteins. 1993 Sep;17(1):49-61
19148271 - PLoS Comput Biol. 2009 Jan;5(1):e1000262
19920124 - Nucleic Acids Res. 2010 Jan;38(Database issue):D211-22
18819722 - Trends Genet. 2008 Nov;24(11):539-51
3162770 - Proc Natl Acad Sci U S A. 1988 Apr;85(8):2444-8
9254694 - Nucleic Acids Res. 1997 Sep 1;25(17):3389-402
14681420 - Nucleic Acids Res. 2004 Jan 1;32(Database issue):D307-10
1774068 - Genomics. 1991 Nov;11(3):635-50
1754394 - Nucleic Acids Res. 1991 Dec 11;19(23):6565-72
12368864 - Nature. 2002 Oct 3;419(6906):498-511
15948633 - C R Biol. 2005 May;328(5):445-53
15509610 - Bioinformatics. 2005 Apr 1;21(7):902-11
18023150 - Res Microbiol. 2007 Dec;158(10):754-66
1438297 - Proc Natl Acad Sci U S A. 1992 Nov 15;89(22):10915-9
12424122 - Bioinformatics. 2002 Nov;18(11):1500-7
18948281 - Nucleic Acids Res. 2008 Dec;36(21):6664-75
References_xml – volume: 36
  start-page: 6664
  issue: 21
  year: 2008
  ident: 4981_CR7
  publication-title: Nucleic Acids Research
  doi: 10.1093/nar/gkn635
– volume: 17
  start-page: 1581
  issue: 11
  year: 2000
  ident: 4981_CR12
  publication-title: Mol Biol Evol
  doi: 10.1093/oxfordjournals.molbev.a026257
– volume: 29
  start-page: 351
  issue: 2
  year: 2001
  ident: 4981_CR20
  publication-title: Nucleic Acids Res
  doi: 10.1093/nar/29.2.351
– volume: 11
  start-page: 635
  issue: 3
  year: 1991
  ident: 4981_CR18
  publication-title: Genomics
  doi: 10.1016/0888-7543(91)90071-L
– volume: 18
  start-page: 1500
  issue: 11
  year: 2002
  ident: 4981_CR17
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/18.11.1500
– volume: 12
  start-page: 77
  year: 2011
  ident: 4981_CR28
  publication-title: BMC Bioinformatics
  doi: 10.1186/1471-2105-12-77
– volume: 419
  start-page: 498
  issue: 6906
  year: 2002
  ident: 4981_CR3
  publication-title: Nature
  doi: 10.1038/nature01097
– volume: 5
  start-page: e1000677
  issue: 10
  year: 2009
  ident: 4981_CR25
  publication-title: PLoS Genet
  doi: 10.1371/journal.pgen.1000677
– volume: 158
  start-page: 754
  issue: 10
  year: 2007
  ident: 4981_CR10
  publication-title: Res Microbiol
  doi: 10.1016/j.resmic.2007.09.007
– volume: 32
  start-page: D307
  issue: Database issue
  year: 2004
  ident: 4981_CR11
  publication-title: Nucleic Acids Res
  doi: 10.1093/nar/gkh114
– volume: 19
  start-page: 6565
  issue: 23
  year: 1991
  ident: 4981_CR15
  publication-title: Nucleic Acids Research
  doi: 10.1093/nar/19.23.6565
– volume: 9
  start-page: 236
  year: 2008
  ident: 4981_CR6
  publication-title: BMC Bioinformatics
  doi: 10.1186/1471-2105-9-236
– volume: 38
  start-page: D211
  issue: Database issue
  year: 2010
  ident: 4981_CR29
  publication-title: Nucleic Acids Res
  doi: 10.1093/nar/gkp985
– volume: 39
  start-page: 309
  year: 2005
  ident: 4981_CR21
  publication-title: Annu Rev Genet
  doi: 10.1146/annurev.genet.39.073003.114725
– volume: 85
  start-page: 2444
  issue: 8
  year: 1988
  ident: 4981_CR26
  publication-title: Proc Natl Acad Sci USA
  doi: 10.1073/pnas.85.8.2444
– volume: 27
  start-page: 861
  issue: 8
  year: 2006
  ident: 4981_CR14
  publication-title: Pattern Recognition Letters
  doi: 10.1016/j.patrec.2005.10.010
– start-page: 15688
  volume-title: The compositional adjustment of amino acid substitution matrices
  year: 2003
  ident: 4981_CR4
– volume: 328
  start-page: 445
  issue: 5
  year: 2005
  ident: 4981_CR9
  publication-title: C R Biol
  doi: 10.1016/j.crvi.2005.02.002
– volume: 9
  start-page: 235
  issue: 10
  year: 2008
  ident: 4981_CR22
  publication-title: Genome Biol
  doi: 10.1186/gb-2008-9-10-235
– start-page: 10915
  volume-title: Amino acid substitution matrices from protein blocks
  year: 1992
  ident: 4981_CR1
– volume: 5
  start-page: 628
  issue: 4
  year: 2006
  ident: 4981_CR5
  publication-title: Eukaryotic Cell
  doi: 10.1128/EC.5.4.628-637.2006
– volume: 5
  start-page: e1000262
  year: 2009
  ident: 4981_CR24
  publication-title: PLoS Comput Biol
  doi: 10.1371/journal.pcbi.1000262
– start-page: 345
  volume-title: A model of evolutionary change in proteins in Atlas of Protein Sequence and Structure
  year: 1978
  ident: 4981_CR2
– volume: 21
  start-page: 902
  issue: 7
  year: 2005
  ident: 4981_CR8
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/bti070
– volume: 147
  start-page: 195
  year: 1981
  ident: 4981_CR27
  publication-title: J Mol Biol
  doi: 10.1016/0022-2836(81)90087-5
– volume: 24
  start-page: 539
  issue: 11
  year: 2008
  ident: 4981_CR23
  publication-title: Trends Genet
  doi: 10.1016/j.tig.2008.08.009
– volume: 17
  start-page: 49
  year: 1993
  ident: 4981_CR16
  publication-title: Proteins
  doi: 10.1002/prot.340170108
– volume: 302
  start-page: 205
  year: 2000
  ident: 4981_CR13
  publication-title: J Mol Biol
  doi: 10.1006/jmbi.2000.4042
– volume: 25
  start-page: 3389
  issue: 17
  year: 1997
  ident: 4981_CR19
  publication-title: Nucleic Acids Res
  doi: 10.1093/nar/25.17.3389
– reference: 18485187 - BMC Bioinformatics. 2008;9:236
– reference: 18948281 - Nucleic Acids Res. 2008 Dec;36(21):6664-75
– reference: 10964570 - J Mol Biol. 2000 Sep 8;302(1):205-17
– reference: 12424122 - Bioinformatics. 2002 Nov;18(11):1500-7
– reference: 14681420 - Nucleic Acids Res. 2004 Jan 1;32(Database issue):D307-10
– reference: 21414208 - BMC Bioinformatics. 2011;12:77
– reference: 15948633 - C R Biol. 2005 May;328(5):445-53
– reference: 19920124 - Nucleic Acids Res. 2010 Jan;38(Database issue):D211-22
– reference: 15509610 - Bioinformatics. 2005 Apr 1;21(7):902-11
– reference: 1774068 - Genomics. 1991 Nov;11(3):635-50
– reference: 19148271 - PLoS Comput Biol. 2009 Jan;5(1):e1000262
– reference: 18819722 - Trends Genet. 2008 Nov;24(11):539-51
– reference: 8234244 - Proteins. 1993 Sep;17(1):49-61
– reference: 18983710 - Genome Biol. 2008;9(10):235
– reference: 11139604 - Nucleic Acids Res. 2001 Jan 15;29(2):351-61
– reference: 12368864 - Nature. 2002 Oct 3;419(6906):498-511
– reference: 16285863 - Annu Rev Genet. 2005;39:309-38
– reference: 18023150 - Res Microbiol. 2007 Dec;158(10):754-66
– reference: 14663142 - Proc Natl Acad Sci U S A. 2003 Dec 23;100(26):15688-93
– reference: 7265238 - J Mol Biol. 1981 Mar 25;147(1):195-7
– reference: 9254694 - Nucleic Acids Res. 1997 Sep 1;25(17):3389-402
– reference: 1438297 - Proc Natl Acad Sci U S A. 1992 Nov 15;89(22):10915-9
– reference: 19816563 - PLoS Genet. 2009 Oct;5(10):e1000677
– reference: 3162770 - Proc Natl Acad Sci U S A. 1988 Apr;85(8):2444-8
– reference: 11070046 - Mol Biol Evol. 2000 Nov;17(11):1581-8
– reference: 1754394 - Nucleic Acids Res. 1991 Dec 11;19(23):6565-72
– reference: 16607010 - Eukaryot Cell. 2006 Apr;5(4):628-37
SSID ssj0017805
Score 2.0596986
Snippet Background Substitution matrices are key parameters for the alignment of two protein sequences, and consequently for most comparative genomics studies. The...
Substitution matrices are key parameters for the alignment of two protein sequences, and consequently for most comparative genomics studies. The composition of...
Background Substitution matrices are key parameters for the alignment of two protein sequences, and consequently for most comparative genomics studies. The...
Abstract Background: Substitution matrices are key parameters for the alignment of two protein sequences, and consequently for most comparative genomics...
Abstract Background Substitution matrices are key parameters for the alignment of two protein sequences, and consequently for most comparative genomics...
SourceID doaj
unpaywall
pubmedcentral
hal
proquest
gale
pubmed
crossref
springer
SourceType Open Website
Open Access Repository
Aggregation Database
Index Database
Enrichment Source
Publisher
StartPage 457
SubjectTerms Algorithms
Amino acids
Bacterial Proteins - genetics
Biochemistry, Molecular Biology
Bioinformatics
Biomedical and Life Sciences
Comparative genomics
Comparative studies
Computational Biology/Bioinformatics
Computer Appl. in Life Sciences
Computer Science
Evolutionary biology
Genetic aspects
Genomes
Genomics
Genomics - methods
Life Sciences
Microarrays
Mycoplasmatales
Physiological aspects
Probability
Proteins
Quantitative Methods
Research Article
Sequence Homology, Amino Acid
Software
Studies
Tenericutes - classification
Tenericutes - genetics
SummonAdditionalLinks – databaseName: DOAJ Directory of Open Access Journals
  dbid: DOA
  link: http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwrV3di9QwEA96IOqD-G29U8IhiAdl2zRtmsdVPFbxfFAP7i0kaeoW9tpl2z29F_92Z_rllkPvRehDSaZp2plMfulMfyHklXDaMaZT3yXW-JwL5kseaF_k1kWhcyZpeWZPPieLU_7xLD7b2eoLc8I6euDuxc2yXGRZFjEdZQl3gTTga53NAq11bjNr0fsGqRwWU338AJn62_-KROjDoiYeApRpMhvLMCmB47S0MyG1vP2jd765xOTIq8jzagLlGEW9S25vy7W-_KFXq52J6vg-udcjTDrvnuwBueHKh-RWt-fk5SPya07L6sKtaA0uo80TAM3Qc2Tq_0nzogEESpuKAi6kmG7e53RBg6bQNS1KeoI03nYLEJUW7RcJOEHp9QZjPm1rVU6X1TnesNrWdDPk2y2Ldf2YnB6___Zu4febMPg2SeIGFqos1dK5WBseptLqKGeAEjXgXMmdtBIcgMhklOs4j6URLLBccBtkucl1ZoLoCdkrq9I9I9RwpMuLuIk1shIKI2Ri4WKmYx5mceSR2aAJZXuGctwoY6XalUqaKNSdQt2pkCnQnUfejFesO3aOf8i-ReWOcsir3RaAtane2tR11uaRQzQNhcwZJabmfNfbulYfvn5RcyZiieyL8Byve6G8gv5b3f_pAG8BybYmkgcTSRjadlJ9CBY46fFi_klhGUI7QLL8IvTI_mCgqvc_tZIhwHbwxdIjdKzF1jGlrnSgfBThHKO2HnnaWfN4I8ZCXAYEHhETO5_0ZFpTFsuWmxzweQrzlkeOhhHxp1N_18zROGauVePz_6HGfXKnjQ3AwfgB2Ws2W_cCwGVjXrZ-5DfdinWK
  priority: 102
  providerName: Directory of Open Access Journals
– databaseName: ProQuest Central
  dbid: BENPR
  link: http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwhV3da9RAEF_aK6I-iN_GVlmKIBbCJZtNcvsgcpWWKl6RaqFvy36lF7gm5-Wu2hf_dmdySWwoVriHIztJNpnZ2d_uTH5DyJvUKceYGvkuMdrnPGW-4IHy08y4KHROJzXP7OQ4OTrln8_isw0yab-FwbTK1ifWjtqWBvfIhyIEIALWJT7Mf_hYNAqDq20FDdVUVrDva4axTbLFkBhrQLb2D46_nnRhBSTwb2OVo2QYgmf2QS7G_ASOM9S1uamm8O8c9eYU8yRvgtCbuZRdQPU-ubsq5urqp5rNrs1Zhw_JgwZs0vHaOh6RDVc8JnfW5SevnpDfY1qUl25GK_AedcoAKIleIGn_L5rlSwCjdFlSgIgUM8-b9C64oM5VRfOCTpDR26wArdK83pyAPyg9X2D4p75amdFpeYE3LFcVXbSpd9N8Xj0lp4cH3z8e-U09Bt8kSbyENSsbKeFcrDQPR8KoKGMAGBVAXsGdMAJ8QWpFlKk4i4VOWWB4yk1gM50pq4PoGRkUZeFeEKo5MudFXMcKCQpTnYrEwMlMxTy0ceSRYasJaRqycqyZMZP1omWUSNSdRN3JkEnQnUfedWfM10Qdt8juo3I7OaTYrg-Ui3PZjFhps9RaGzEV2YS7QGgwKWdsoJTKjDXGI7toGhJJNArM0jlXq6qSn76dyDFLY4FEjPAcbxuhrIT-G9V89ABvAXm3epI7PUkY5abXvAsW2Ovx0fiLxGOI8gDU8svQI9utgcrGFVWyGzgeoV0rXh2z6woHykcRzjGA65Hna2vubsRYiCuCwCNpz857Pem3FPm0pikHqD6CKcwje-2I-Nupf2tmrxsz_1Xjy1sfd5vcq_f_4cf4DhksFyv3CgDkUr9u3MIfXw9udA
  priority: 102
  providerName: ProQuest
– databaseName: Springer Nature OA Free Journals
  dbid: C6C
  link: http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1Lb9QwELZoEQIOiDehBVkVEqJS1MRx4vi4VFQLohyASr1ZtmN3I22T1WZT6IXfzkw2GxqVh5D2sIrHjpMZjz97Jp8JeSWcdozpPHSZNSHngoWSRzoU3rokds5kHc_s8adsesI_nKan_X4HfgtzNX4f59lBDM4zhGVJiikEPBVb5CZMUVkXls0Oh3gBMvNvgpC_qTWadDpu_sEDb80wAfI6uryeJDlESu-S22210Jff9Hx-ZTI6uk_u9SiSTtZqf0BuuOohubU-V_LyEfkxoVV94ea0AbfQ5QLA26fnyMb_nfpyBSiTrmoK2I9iSnmftwUNmlI3tKzoMVJ12xZgKC27XQf4g9KLJcZ1utZqT2f1Od6wbhu63OTUzcpF85icHL37ejgN-4MWQptl6QoWoyzX0rlUGx7n0urEM0CCGrCs5E5aCYNcFDLxOvWpNIJFlgtuo8IbrwsTJU_IdlVX7hmhhiMlXsJNqpF5UBghMwuVmU55XKRJQA42mlC2ZyHHwzDmqluN5JlC3SnUnYqZAt0F5M1QY7Fm4PiL7FtU7iCH3NndBTAp1Q9FVXhRFEXCdFJk3EXSwOztbBFprb0trA3IHpqGQnaMCtNvznTbNOr9l89qwkQqkWERnuN1L-Rr6L_V_dcM8BaQUGskuTuShOFrR8V7YIGjHk8nHxVeQ_gGaJVfxAHZ2Rio6n1Mo2QM0Bz8rQwIHUqxdUybqxwoH0U4x8hsQJ6urXm4EYOVP2YWB0SM7HzUk3FJVc46_nHA4DnMTQHZ34yIX536s2b2hzHzTzU-_5-Wd8idbp8ffozvku3VsnUvACiuzMvOR_wEkfVfzA
  priority: 102
  providerName: Springer Nature
– databaseName: Unpaywall
  dbid: UNPAY
  link: http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwrV1Lb9QwELb6EAIOPAuEFmRVSIhKaRPHSdbHBVEtiFYVsFI5WbbjdCO2ybLZLZQDv52ZvGgoDyEh5RDFY8exPeNv4vFnQp7EVlnG1MC1kdEu5zFzBfeUG6fGBr61Oqp4Zg8Oo9GYvz4Oj1fIUbsXRp8anRUNaSgSFe9e3IY-rXc54CkKdr43S9Ja6QfRng9G1gX3JcRQAx7Gq2Q9CgGdr5H18eHR8EO1yagRaVcrf5GtNztVJP6dqV6dYKTkZRh6OZqyW1K9Tq4u85k6_6ym0wuz1v5N8qn93jpY5ePucqF3zdefqCD_Z4PcIjcaiEuH9Zi8TVZsfodcqQ-9PL9Lvg1pXpzZKS3BZlWBCjA06CkeFfCFptkCIDBdFBSAKcV49yaoDArUmSppltMD5BE3S8DINKt-icANSs_muOhUlVakdFKc4guLZUnnbcDfJJuVG2S8__L9i5HbnALhmigKF-Aps4ES1oZKc38gjApSBjBVAdAW3AojwALFiQhSFaah0DHzDI-58ZJUpyrRXnCPrOVFbh8Qqjny9QVchwppEWMdi8hAZqZC7idh4JC9tvelaSjS8aSOqaxcpUEksVUltqr0mYRWdcizLsespgf5g-xzHFCdHBJ7Vw-K-Yls7IRM0jhJkoCpIIm49YQGaGFN4imlUpMY45BtHI4SqTtyjA06UcuylK_evZVDFocC6R_hO542QmkB9Teq2WoBrYBsXz3JrZ4k2BbTS96GUd-r8Wj4RuIzxJYApfmZ75DNVilkYwBLKXzwG2AyEA6hXSqWjjF9uYXORxHOcdnYIfdrDepexJiPfojnkLinW72a9FPybFKRo4ODMICJ0yE7rRb-qNTve2an09O_duPDfxHeJNeqRQi4GN8ia4v50j4CFLvQjxvD9B3gYpee
  priority: 102
  providerName: Unpaywall
Title A novel substitution matrix fitted to the compositional bias in Mollicutes improves the prediction of homologous relationships
URI https://link.springer.com/article/10.1186/1471-2105-12-457
https://www.ncbi.nlm.nih.gov/pubmed/22115330
https://www.proquest.com/docview/913107879
https://www.proquest.com/docview/913444372
https://inria.hal.science/hal-00784414
https://pubmed.ncbi.nlm.nih.gov/PMC3248887
https://bmcbioinformatics.biomedcentral.com/counter/pdf/10.1186/1471-2105-12-457
https://doaj.org/article/df7ddd32a3d64e09b212ecd0aaafcdcc
UnpaywallVersion publishedVersion
Volume 12
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
journalDatabaseRights – providerCode: PRVADU
  databaseName: BioMed Central Open Access Free
  customDbUrl:
  eissn: 1471-2105
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0017805
  issn: 1471-2105
  databaseCode: RBZ
  dateStart: 20000101
  isFulltext: true
  titleUrlDefault: https://www.biomedcentral.com/search/
  providerName: BioMedCentral
– providerCode: PRVAFT
  databaseName: Open Access Digital Library
  customDbUrl:
  eissn: 1471-2105
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0017805
  issn: 1471-2105
  databaseCode: KQ8
  dateStart: 20000101
  isFulltext: true
  titleUrlDefault: http://grweb.coalliance.org/oadl/oadl.html
  providerName: Colorado Alliance of Research Libraries
– providerCode: PRVAFT
  databaseName: Open Access Digital Library
  customDbUrl:
  eissn: 1471-2105
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0017805
  issn: 1471-2105
  databaseCode: KQ8
  dateStart: 20000701
  isFulltext: true
  titleUrlDefault: http://grweb.coalliance.org/oadl/oadl.html
  providerName: Colorado Alliance of Research Libraries
– providerCode: PRVAON
  databaseName: DOAJ Directory of Open Access Journals
  customDbUrl:
  eissn: 1471-2105
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0017805
  issn: 1471-2105
  databaseCode: DOA
  dateStart: 20000101
  isFulltext: true
  titleUrlDefault: https://www.doaj.org/
  providerName: Directory of Open Access Journals
– providerCode: PRVEBS
  databaseName: Academic Search Ultimate
  customDbUrl: https://search.ebscohost.com/login.aspx?authtype=ip,shib&custid=s3936755&profile=ehost&defaultdb=asn
  eissn: 1471-2105
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0017805
  issn: 1471-2105
  databaseCode: ABDBF
  dateStart: 20000101
  isFulltext: true
  titleUrlDefault: https://search.ebscohost.com/direct.asp?db=asn
  providerName: EBSCOhost
– providerCode: PRVEBS
  databaseName: Inspec with Full Text
  customDbUrl:
  eissn: 1471-2105
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0017805
  issn: 1471-2105
  databaseCode: ADMLS
  dateStart: 20000101
  isFulltext: true
  titleUrlDefault: https://www.ebsco.com/products/research-databases/inspec-full-text
  providerName: EBSCOhost
– providerCode: PRVBFR
  databaseName: Free Medical Journals
  customDbUrl:
  eissn: 1471-2105
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0017805
  issn: 1471-2105
  databaseCode: DIK
  dateStart: 20000101
  isFulltext: true
  titleUrlDefault: http://www.freemedicaljournals.com
  providerName: Flying Publisher
– providerCode: PRVFQY
  databaseName: GFMER Free Medical Journals
  customDbUrl:
  eissn: 1471-2105
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0017805
  issn: 1471-2105
  databaseCode: GX1
  dateStart: 0
  isFulltext: true
  titleUrlDefault: http://www.gfmer.ch/Medical_journals/Free_medical.php
  providerName: Geneva Foundation for Medical Education and Research
– providerCode: PRVHPJ
  databaseName: ROAD: Directory of Open Access Scholarly Resources
  customDbUrl:
  eissn: 1471-2105
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0017805
  issn: 1471-2105
  databaseCode: M~E
  dateStart: 20000101
  isFulltext: true
  titleUrlDefault: https://road.issn.org
  providerName: ISSN International Centre
– providerCode: PRVAQN
  databaseName: PubMed Central
  customDbUrl:
  eissn: 1471-2105
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0017805
  issn: 1471-2105
  databaseCode: RPM
  dateStart: 20000101
  isFulltext: true
  titleUrlDefault: https://www.ncbi.nlm.nih.gov/pmc/
  providerName: National Library of Medicine
– providerCode: PRVPQU
  databaseName: Health & Medical Collection
  customDbUrl:
  eissn: 1471-2105
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0017805
  issn: 1471-2105
  databaseCode: 7X7
  dateStart: 20090101
  isFulltext: true
  titleUrlDefault: https://search.proquest.com/healthcomplete
  providerName: ProQuest
– providerCode: PRVPQU
  databaseName: ProQuest Central
  customDbUrl: http://www.proquest.com/pqcentral?accountid=15518
  eissn: 1471-2105
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0017805
  issn: 1471-2105
  databaseCode: BENPR
  dateStart: 20090101
  isFulltext: true
  titleUrlDefault: https://www.proquest.com/central
  providerName: ProQuest
– providerCode: PRVPQU
  databaseName: ProQuest Technology Collection
  customDbUrl:
  eissn: 1471-2105
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0017805
  issn: 1471-2105
  databaseCode: 8FG
  dateStart: 20090101
  isFulltext: true
  titleUrlDefault: https://search.proquest.com/technologycollection1
  providerName: ProQuest
– providerCode: PRVFZP
  databaseName: Scholars Portal Journals: Open Access
  customDbUrl:
  eissn: 1471-2105
  dateEnd: 20250131
  omitProxy: true
  ssIdentifier: ssj0017805
  issn: 1471-2105
  databaseCode: M48
  dateStart: 20000701
  isFulltext: true
  titleUrlDefault: http://journals.scholarsportal.info
  providerName: Scholars Portal
– providerCode: PRVAVX
  databaseName: HAS SpringerNature Open Access 2022
  customDbUrl:
  eissn: 1471-2105
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0017805
  issn: 1471-2105
  databaseCode: AAJSJ
  dateStart: 20001201
  isFulltext: true
  titleUrlDefault: https://www.springernature.com
  providerName: Springer Nature
– providerCode: PRVAVX
  databaseName: Springer Nature OA Free Journals
  customDbUrl:
  eissn: 1471-2105
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0017805
  issn: 1471-2105
  databaseCode: C6C
  dateStart: 20000112
  isFulltext: true
  titleUrlDefault: http://www.springeropen.com/
  providerName: Springer Nature
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3ra9swEBdry9j2YY_u5bULogzGCl5jW7asD2O4oVlX1lLaBbJPQpblJpDaWZx07Zf97btzbHem3QNCHtJZVnQPnazT7wh5w40yrqtC2wQ6thnjri1YV9k81cZzjImDEmf28CjYH7CDoT-8Ph5dDWBx69IO80kNZpP3l9-vPoLCfygVPgx2HDCwNixdfAwzYD5fIWswTwlM5HDIrvcUEL2_PGtUUdeblre0gBDBsDLCyMvWfFXC-jfGe2WEsZM3HdOb8ZXNJusDcm-RTdXVDzWZ_DaP9R-Th5UDSqOlxDwhd0y2Tu4uU1JerZNHdZoHWmn9U_Izoll-YSa0ACNTRhYAL-k5Yvtf0nQ8B5-VznMKniTFAPUqCgzuEY9VQccZPUTgbw2Nwq_yGQZ8QerpDHeJytbylI7yc-xDvijorI7QG42nxTMy6O997e3bVdoGWweBP4elrRsqYYyvYuaEQisvdcGvVOAZC2aEFmAyeCK8VPmpL2LudjXjTHeTNE5VEne952Q1yzPzktCYIcCex2JfIY4hj7kINFzsKp85ie9ZZKdmjtQVpjmm1pjIcm0TBhI5K5Gz0nElcNYi75orpks8j7_Q7iK_GzpE4i4L8tmZrBRbJilPksRzlZcEzHRFDL6A0UlXKZXqRGuLbKG0SMTayDCY50wtikJ-Pj2Rkct9gXiN8D_eVkRpDv3XqjobAaOA8Fwtys0WJRgD3areAqFs9Xg_-iKxDJ1B8H3ZhWORjVpmZa1wUjjg6IP1FhahTS22jkF4mQHmIwljuM9rkRdLAW9uVGuLRXhL9Fs9addk41GJZg4efQgznUW2ayW57tSfObPdqNE_2fjqf8Zkg9wvdwvg5bJNsjqfLcxrcDfncYes8CGH97D_qUPWoujg9AA-d_eOjk-gtBf0OuWDnE5pbaBmcHQcffsFqrZ-5A
linkProvider Scholars Portal
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV3db9MwELe2ITR4QHwTNsCaQIhJURPHSeoHhMrH1LJuD7BJffMcx1kjdUlp2o2-8B_xP3KXJmHRxHia1IcqvjhO7nz3u9zljpDXoVGGMdW1TaAjm_OQ2YI7yg4TbTzXmCgo68weHAb9Y_515I_WyO_6WxhMq6x1Yqmo41zjO_KOcAGIgHSJD9MfNjaNwuBq3UFjJRX7ZnkBHlvxfvAZ2PuGsb0vR5_6dtVUwNZB4M_B8WJdJYzxVcTdrtDKSxigHgW4TXAjtACBDmPhJcpPfBGFzNE85NqJkyhRceR4MO86ucU9UCWwfcJR49-52B6gjoR2g44Let8Gj8rH7AeO9u-S5SsbBDRmYH2MWZhXIe7VTM0mXHuXbC6yqVpeqMnkkkXcu0_uVVCW9lay94Csmewhub1qbrl8RH71aJafmwktQDeVCQkgAvQMWwL8pEk6B6hL5zkFAEoxr71KHoMJo1QVNM3oAdYL1wvAwjQtX33AH6SezjC4VM6WJ3Scn-EF80VBZ3Vi3zidFo_J8Y0w5gnZyPLMPCM04liXz-ORr7D8YRiFItBwMlM-d2Pfs0in5oTUVSl07MgxkaVL1A0k8k4i76TLJPDOIu-aM6arMiDX0H5E5jZ0WMC7PJDPTmWlD2SchHEce0x5ccCNIyKAEEbHjlIq0bHWFtlB0ZBYoiPDHKBTtSgKOfj-TfZY6Ass8wj38bYiSnJYv1bVJxXwFLCqV4tyu0UJOkS3hndAAlsr7veGEo8hhgTIzM9di2zVAiorRVfIZltahDajODvm7mUGmI8knGN42CJPV9LcXIgxF_0NxyJhS85bK2mPZOm4LIIOjkAXDKRFdusd8XdR_-bMbrNn_svG59fe7iuy2T86GMrh4HB_i9wpIw3wY3ybbMxnC_MCoOo8elkqCEpObloj_QGulqVn
linkToPdf http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1bb9MwFLbYELcHxGVA2ABrQkJMipo4Thw_lkK1wTYhYNLeLMdx1kpdUjXpYC_8ds7JDaJxEVIfqvjYcXsu_pxz8pmQl8Jqy5iOXRuZxOVcMFdyT7siMzbwrU2immf26DjaP-HvT8PT9oFb2VW7dynJ5p0GZGnKq9EyzRoXj6ORDyHVhc1KiIUFPBQb5DqHtQ1PMJhEkz6LgHz9XWryN70GS1HN2N_H5Y0ZlkVexZxXSyf7_OkdcmudL_XlV71Y_LJETe-Ruy22pOPGGO6TazZ_QG40p01ePiTfxzQvLuyClhAs6goB0Ak9R47-bzSbV4A9aVVQQIQUC83bai4YMJnrks5zeoQE3mYN4JTO62cR8AWllyvM9tSjFRmdFed4w2Jd0lVXaTebL8stcjJ992Wy77bHL7gmisIKtqgs1tLaUCfcj6XRQcYAH2pAuJJbaSS4vkhlkOkwC2UimGe44MZLsyTTaeIFj8hmXuT2CaEJR6K8gCehRj5CkQgZGejMdMj9NAwcMuo0oUzLTY5HZCxUvUeJI4W6U6g75TMFunPI677HsuHl-IvsG1RuL4eM2vWFYnWmWgdVaSbSNA2YDtKIW08msKZbk3pa68ykxjhkF01DIWdGjkU5Z3pdlurg8yc1ZiKUyLsIv-NVK5QVMH-j23cc4F9Amq2B5M5AEpzaDJp3wQIHM94fHyq8hqAOMCy_8B2y3RmoaiNPqaQPgB2isHQI7VtxdCymyy0oH0U4x3ytQx431tzfiDEfNwCeQ8TAzgczGbbk81nNSg7IPIYVyyF7nUf8nNSfNbPX-8w_1fj0f0Z-QW5-fDtVhwfHH7bJ7ToRAB_Gd8hmtVrbZ4Akq-R5HS5-AFNNawI
linkToUnpaywall http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwrV1Lb9QwELb6EAIOPAuEFmRVSIhKaRPHSdbHBVEtiFYVsFI5WbbjdCO2ybLZLZQDv52ZvGgoDyEh5RDFY8exPeNv4vFnQp7EVlnG1MC1kdEu5zFzBfeUG6fGBr61Oqp4Zg8Oo9GYvz4Oj1fIUbsXRp8anRUNaSgSFe9e3IY-rXc54CkKdr43S9Ja6QfRng9G1gX3JcRQAx7Gq2Q9CgGdr5H18eHR8EO1yagRaVcrf5GtNztVJP6dqV6dYKTkZRh6OZqyW1K9Tq4u85k6_6ym0wuz1v5N8qn93jpY5ePucqF3zdefqCD_Z4PcIjcaiEuH9Zi8TVZsfodcqQ-9PL9Lvg1pXpzZKS3BZlWBCjA06CkeFfCFptkCIDBdFBSAKcV49yaoDArUmSppltMD5BE3S8DINKt-icANSs_muOhUlVakdFKc4guLZUnnbcDfJJuVG2S8__L9i5HbnALhmigKF-Aps4ES1oZKc38gjApSBjBVAdAW3AojwALFiQhSFaah0DHzDI-58ZJUpyrRXnCPrOVFbh8Qqjny9QVchwppEWMdi8hAZqZC7idh4JC9tvelaSjS8aSOqaxcpUEksVUltqr0mYRWdcizLsespgf5g-xzHFCdHBJ7Vw-K-Yls7IRM0jhJkoCpIIm49YQGaGFN4imlUpMY45BtHI4SqTtyjA06UcuylK_evZVDFocC6R_hO542QmkB9Teq2WoBrYBsXz3JrZ4k2BbTS96GUd-r8Wj4RuIzxJYApfmZ75DNVilkYwBLKXzwG2AyEA6hXSqWjjF9uYXORxHOcdnYIfdrDepexJiPfojnkLinW72a9FPybFKRo4ODMICJ0yE7rRb-qNTve2an09O_duPDfxHeJNeqRQi4GN8ia4v50j4CFLvQjxvD9B3gYpee
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=A+novel+substitution+matrix+fitted+to+the+compositional+bias+in+Mollicutes+improves+the+prediction+of+homologous+relationships&rft.jtitle=BMC+bioinformatics&rft.au=Lemaitre%2C+Claire&rft.au=Barr%C3%A9%2C+Aur%C3%A9lien&rft.au=Citti%2C+Christine&rft.au=Tardy%2C+Florence&rft.date=2011-11-24&rft.pub=BioMed+Central&rft.issn=1471-2105&rft.eissn=1471-2105&rft.volume=12&rft.issue=1&rft_id=info:doi/10.1186%2F1471-2105-12-457&rft_id=info%3Apmid%2F22115330&rft.externalDBID=HAS_PDF_LINK&rft.externalDocID=oai%3AHAL%3Ahal-00784414v1
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1471-2105&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1471-2105&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1471-2105&client=summon