Meta-Analysis of RNA-Seq and Microarray Expression Data to Identify Genes Effective in Sheep Muscle Growth and Development

Introduction: Among different sheep breeds in the world, the Texel breed is known as a meaty and muscular breed. Skeletal muscle growth is a step-by-step and exponential process from differentiation, development and maturation, which is regulated by gene networks and cell signaling pathways, and sev...

Full description

Saved in:
Bibliographic Details
Published inPizhūhishhā-yi ʻulūm-i dāmī-i Īrān Vol. 15; no. 4; pp. 555 - 567
Main Authors Fahime Mohammadi, Mojtaba Tahmoorespur, Ali Javadmanesh
Format Journal Article
LanguagePersian
Published Ferdowsi University of Mashhad 01.12.2023
Subjects
Online AccessGet full text
ISSN2008-3106
2423-4001
DOI10.22067/ijasr.2023.79909.1111

Cover

Abstract Introduction: Among different sheep breeds in the world, the Texel breed is known as a meaty and muscular breed. Skeletal muscle growth is a step-by-step and exponential process from differentiation, development and maturation, which is regulated by gene networks and cell signaling pathways, and several genes and factors are involved in the process of muscle fiber formation and their growth and hypertrophy (Badday Betti et al. 2022). The study of gene expression is done with several methods, and this gene expression information is used in breeding programs as a tool to improve phenotypic choices. Databases are a large source of expression data that can be used by bioinformatics methods to integrate heterogeneous data from different studies and platforms. In this study, by integrating the microarray and RNA-Seq data available in the database belonging to the muscle tissue of Texel breed sheep, the transcriptomic profile of the muscle was compared at two ages of embryonic and adult.Materials and Methods: Microarray data related to longissimus dorsi muscle tissue with three replicates d-70 embryos from GEO database with accession number GSE23563 and RNA-Seq data related to muscle tissue from six samples with two replicates from adult individuals from ArrayExpress database were selected. Limma, Biobase and GEOquery software packages were used to calculate the expression values of the microarray data related to the embryonic age  in the R environment, and Tuxedo, HTSeq and DESeq2 packages were used in the Linux and R environment to calculate the expression values of the RNA-Seq data (Kamali et al. 2022; Sahraei et al. 2019). Then two types of expression values were integrated and to eliminate non-biological effects, the batch effects were also removed. Next, differential genes were identified with the limma software package. In order to identify the relationship between the identified differential genes, the gene network was drawn between them by software of Cytoscape version 3.7.1 and String 1.5.1 program. next, due to the vastness of the gene network, each network was clustered with MCODE 1.6.1 and CytoCluster 2.1.0 programs (ClusterOne algorithm) and significant clusters (P-value < 0.05) were identified (Saedi et al. 2022). In order to better understand the ontology and function of the identified differential genes, the Gene Ontology of the genes was investigated using software of Cytoscape version 3.7.1 and ClueGO 2.5.9 and CluePedia 1.5.9 programs. After receiving the Gene Ontology results, significant Gene Ontology terms (P-Value < 0.05) related to functional groups were identified. Finally, the selected genes (Adj P-Value < 0.05) were identified and introduced in these two age groups.Results and Discussion: After quality control, correcting and normalizing the microarray data, the GPL10778 platform annotation file with 1042520 Probe ID was used to calculate their expression values. After relevant analyzes of 9289 Probe ID identified related to the data of this study, 7918 Gene Symbol was identified finally. After quality control, trimming and normalizing the RNA-Seq data in total, the number of Ensembl_Genes based on which the reading values were calculated by HTSeq was 27056. After removing IDs that had zero readings in all 6 samples, 10855 IDs remained. Then, these 10855 Ensembl ID were merged with the annotation file to obtain Gene Symbol, and finally 9417 common genes were identified between the six samples of adult age. The results of differential expression analysis showed that there were significant differences in the expression of 62 genes (37 increased and 25 decreased) in the muscle tissue between adult and embryonic age. By creating a gene network between differential genes, 15 selected genes were identified, including MYH1, ACTN3, CASQ1, TMOD4, FBP2, SLC2A4, MX1, COX4I1, SOD2, MFN2, UQCRB, UCP3, PRKAB2, PHKG2, PPP1R3C. The function of these genes has been proven in cell proliferation, protein synthesis, myofibril formation, and lipid metabolism. Differential gene enrichment analysis revealed some biological processes such as Vasculogenesis, positive regulation of ossification, positive regulation of muscle tissue development, regulation of muscle contraction, contractile fiber part, calcium signaling, calcineurin-NFAT signaling cascade and regulation of receptor signaling pathway via JAK-STAT, the molecular function of regulating cation channel activity and the cellular components of the contractile fiber.Conclusion: This study in addition to confirming the accuracy of the integration method of two types of heterogeneous data, provided a general view of the transcriptomic differences of Texel sheep muscle tissue at two important age points to be a useful source for biological investigations of genes related to muscle growth and development in sheep.
AbstractList Introduction: Among different sheep breeds in the world, the Texel breed is known as a meaty and muscular breed. Skeletal muscle growth is a step-by-step and exponential process from differentiation, development and maturation, which is regulated by gene networks and cell signaling pathways, and several genes and factors are involved in the process of muscle fiber formation and their growth and hypertrophy (Badday Betti et al. 2022). The study of gene expression is done with several methods, and this gene expression information is used in breeding programs as a tool to improve phenotypic choices. Databases are a large source of expression data that can be used by bioinformatics methods to integrate heterogeneous data from different studies and platforms. In this study, by integrating the microarray and RNA-Seq data available in the database belonging to the muscle tissue of Texel breed sheep, the transcriptomic profile of the muscle was compared at two ages of embryonic and adult.Materials and Methods: Microarray data related to longissimus dorsi muscle tissue with three replicates d-70 embryos from GEO database with accession number GSE23563 and RNA-Seq data related to muscle tissue from six samples with two replicates from adult individuals from ArrayExpress database were selected. Limma, Biobase and GEOquery software packages were used to calculate the expression values of the microarray data related to the embryonic age  in the R environment, and Tuxedo, HTSeq and DESeq2 packages were used in the Linux and R environment to calculate the expression values of the RNA-Seq data (Kamali et al. 2022; Sahraei et al. 2019). Then two types of expression values were integrated and to eliminate non-biological effects, the batch effects were also removed. Next, differential genes were identified with the limma software package. In order to identify the relationship between the identified differential genes, the gene network was drawn between them by software of Cytoscape version 3.7.1 and String 1.5.1 program. next, due to the vastness of the gene network, each network was clustered with MCODE 1.6.1 and CytoCluster 2.1.0 programs (ClusterOne algorithm) and significant clusters (P-value < 0.05) were identified (Saedi et al. 2022). In order to better understand the ontology and function of the identified differential genes, the Gene Ontology of the genes was investigated using software of Cytoscape version 3.7.1 and ClueGO 2.5.9 and CluePedia 1.5.9 programs. After receiving the Gene Ontology results, significant Gene Ontology terms (P-Value < 0.05) related to functional groups were identified. Finally, the selected genes (Adj P-Value < 0.05) were identified and introduced in these two age groups.Results and Discussion: After quality control, correcting and normalizing the microarray data, the GPL10778 platform annotation file with 1042520 Probe ID was used to calculate their expression values. After relevant analyzes of 9289 Probe ID identified related to the data of this study, 7918 Gene Symbol was identified finally. After quality control, trimming and normalizing the RNA-Seq data in total, the number of Ensembl_Genes based on which the reading values were calculated by HTSeq was 27056. After removing IDs that had zero readings in all 6 samples, 10855 IDs remained. Then, these 10855 Ensembl ID were merged with the annotation file to obtain Gene Symbol, and finally 9417 common genes were identified between the six samples of adult age. The results of differential expression analysis showed that there were significant differences in the expression of 62 genes (37 increased and 25 decreased) in the muscle tissue between adult and embryonic age. By creating a gene network between differential genes, 15 selected genes were identified, including MYH1, ACTN3, CASQ1, TMOD4, FBP2, SLC2A4, MX1, COX4I1, SOD2, MFN2, UQCRB, UCP3, PRKAB2, PHKG2, PPP1R3C. The function of these genes has been proven in cell proliferation, protein synthesis, myofibril formation, and lipid metabolism. Differential gene enrichment analysis revealed some biological processes such as Vasculogenesis, positive regulation of ossification, positive regulation of muscle tissue development, regulation of muscle contraction, contractile fiber part, calcium signaling, calcineurin-NFAT signaling cascade and regulation of receptor signaling pathway via JAK-STAT, the molecular function of regulating cation channel activity and the cellular components of the contractile fiber.Conclusion: This study in addition to confirming the accuracy of the integration method of two types of heterogeneous data, provided a general view of the transcriptomic differences of Texel sheep muscle tissue at two important age points to be a useful source for biological investigations of genes related to muscle growth and development in sheep.
Author Mojtaba Tahmoorespur
Fahime Mohammadi
Ali Javadmanesh
Author_xml – sequence: 1
  fullname: Fahime Mohammadi
  organization: Department of Animal Science, Faculty of Agriculture, Ferdowsi University of Mashhad, Mashhad, Iran
– sequence: 2
  fullname: Mojtaba Tahmoorespur
  organization: Department of Animal Science, Faculty of Agriculture, Ferdowsi University of Mashhad, Mashhad, Iran
– sequence: 3
  fullname: Ali Javadmanesh
  organization: Department of Animal Science, Faculty of Agriculture, Ferdowsi University of Mashhad, Mashhad, Iran
BookMark eNqtjEFOwzAURC1UJAr0CsgXSPh23CZZVrSULsKCso8-yTd1ldrBNoVweqKKIzCbGY303jWbWGeJsTsBqZSwyO_NAYNPJcgszcsSylSMuWBTqWSWKAAxGTdAkWQCFldsFoJ5AyVKoeZZMWU_FUVMlha7IZjAneYvz8tkRx8cbcsr03iH3uPA19-9pxF2lq8wIo-Ob1uy0eiBb8hS4GutqYnmRNxYvtsT9bz6DE1HfOPdV9yfjSs6Uef640jeskuNXaDZX9-w7eP69eEpaR0e6t6bI_qhdmjq8-H8e40-mlFYY64VliS1gkYByqLIBOUIuoVi0czz7D9dv5VCckE
ContentType Journal Article
DBID DOA
DOI 10.22067/ijasr.2023.79909.1111
DatabaseName Directory of Open Access Journals (no login required)
DatabaseTitleList
Database_xml – sequence: 1
  dbid: DOA
  name: DOAJ Directory of Open Access Journals
  url: https://www.doaj.org/
  sourceTypes: Open Website
DeliveryMethod fulltext_linktorsrc
Discipline Zoology
EISSN 2423-4001
EndPage 567
ExternalDocumentID oai_doaj_org_article_a7f4a9e2f40c40a28831e7a0fd086c57
GroupedDBID 5VS
ADBBV
ALMA_UNASSIGNED_HOLDINGS
BCNDV
GROUPED_DOAJ
ID FETCH-doaj_primary_oai_doaj_org_article_a7f4a9e2f40c40a28831e7a0fd086c573
IEDL.DBID DOA
ISSN 2008-3106
IngestDate Fri Oct 03 12:47:54 EDT 2025
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Issue 4
Language Persian
LinkModel DirectLink
MergedId FETCHMERGED-doaj_primary_oai_doaj_org_article_a7f4a9e2f40c40a28831e7a0fd086c573
OpenAccessLink https://doaj.org/article/a7f4a9e2f40c40a28831e7a0fd086c57
ParticipantIDs doaj_primary_oai_doaj_org_article_a7f4a9e2f40c40a28831e7a0fd086c57
PublicationCentury 2000
PublicationDate 2023-12-01
PublicationDateYYYYMMDD 2023-12-01
PublicationDate_xml – month: 12
  year: 2023
  text: 2023-12-01
  day: 01
PublicationDecade 2020
PublicationTitle Pizhūhishhā-yi ʻulūm-i dāmī-i Īrān
PublicationYear 2023
Publisher Ferdowsi University of Mashhad
Publisher_xml – name: Ferdowsi University of Mashhad
SSID ssib041914538
ssib043502489
ssib050736746
ssib051329982
ssj0001862418
Score 4.4840646
Snippet Introduction: Among different sheep breeds in the world, the Texel breed is known as a meaty and muscular breed. Skeletal muscle growth is a step-by-step and...
SourceID doaj
SourceType Open Website
StartPage 555
SubjectTerms data integration
differential gene expression
gene network
gene ontology
Title Meta-Analysis of RNA-Seq and Microarray Expression Data to Identify Genes Effective in Sheep Muscle Growth and Development
URI https://doaj.org/article/a7f4a9e2f40c40a28831e7a0fd086c57
Volume 15
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
journalDatabaseRights – providerCode: PRVAON
  databaseName: DOAJ Directory of Open Access Journals
  customDbUrl:
  eissn: 2423-4001
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0001862418
  issn: 2008-3106
  databaseCode: DOA
  dateStart: 20090101
  isFulltext: true
  titleUrlDefault: https://www.doaj.org/
  providerName: Directory of Open Access Journals
– providerCode: PRVHPJ
  databaseName: ROAD: Directory of Open Access Scholarly Resources
  customDbUrl:
  eissn: 2423-4001
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssib050736746
  issn: 2008-3106
  databaseCode: M~E
  dateStart: 20150101
  isFulltext: true
  titleUrlDefault: https://road.issn.org
  providerName: ISSN International Centre
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwrZ09T8MwEIYtVAmJBfEpvnUDa2iaOEkzFmgpSOlAQapYootjq2VISptKlIHfjs8OUCYGWDJksJ3c6e61dfeYsXM35yiyPHSCLHQdrnOQg5mHjlCilXlCeb6iRuFkEPYf-d0oGK1c9UU1YRYPbH9cEyPFMZae4q7gLtLluC0ZoatyLcZFYPrI3Xa8spnSnsSJWhZ8e5bWBMTu-krcWgT5YfS9EQrouvW4pryY0xnqmzCHg6Y-QGug0LYXe4Q7b06ecU4wUc-_iHQ4j03g-UH9N-mpt8U2a10JHfs922xN4Q5bfyrNqfkue0tkhc4ngQRKBfeDjjOUL4BFDglV5eFshkvovtaVsQVcY4VQlWB7edUSCFE9B8s71kESJgUMx1JOIVnM9aRwo7f01diMuFKLtMdue92Hq75Dy06nlm6REm_avNBWSGsrpL9Zwd9njaIs5AEDkceynWEeBELxCBHbxJgJ3axFcikXh-zy7_Md_ccgx2yDTGiLU05Yo5ot5KmWGFV2ZrxJP5P37gcOg83p
linkProvider Directory of Open Access Journals
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Meta-Analysis+of+RNA-Seq+and+Microarray+Expression+Data+to+Identify+Genes+Effective+in+Sheep+Muscle+Growth+and+Development&rft.jtitle=Pizh%C5%ABhishh%C4%81-yi+%CA%BBul%C5%ABm-i+d%C4%81m%C4%AB-i+%C4%AAr%C4%81n&rft.au=Fahime+Mohammadi&rft.au=Mojtaba+Tahmoorespur&rft.au=Ali+Javadmanesh&rft.date=2023-12-01&rft.pub=Ferdowsi+University+of+Mashhad&rft.issn=2008-3106&rft.eissn=2423-4001&rft.volume=15&rft.issue=4&rft.spage=555&rft.epage=567&rft_id=info:doi/10.22067%2Fijasr.2023.79909.1111&rft.externalDBID=DOA&rft.externalDocID=oai_doaj_org_article_a7f4a9e2f40c40a28831e7a0fd086c57
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=2008-3106&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=2008-3106&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=2008-3106&client=summon