Monuments in Mongolian Writing: An Experience of Creating a Parallel Corpus

Bibliographic Details
Published in: Историческая информатика (Historical Informatics), no. 2, pp. 1-10
Main Authors: Debenova, Zinaida Antsiferovna; Tsipilova, Snezhana Sergeevna; Tsyrenova, Nomin' Dondokovna
Format: Journal Article
Language: English
Published: 01.02.2025
ISSN: 2585-7797
DOI: 10.7256/2585-7797.2025.2.73930

Abstract This article presents the results of work on creating a parallel corpus of Buryat sources in Mongolian script. The project is supported by the Russian Science Foundation and draws on archival materials from the Center for Eastern Manuscripts and Xylographs (CVRK) of the IMBT SB RAS. The subject of the research is the process of creating the corpus database and the specifics of compiling it, particularly the selection of materials. The corpus under development currently includes the following documents from the archival collections of the CVRK IMBT SB RAS: historical texts ("A Brief Outline of the History of Khori-Mongolian Buryats" and "On the History of the Zugalai Region"); an official document, "Protocol of the All-Buryat Assembly in Chita in 1917"; an ethnographic work, "Narrative of Samdan Noyon"; a medical work, "Notes of Tibetan Doctor Donduba Munkuyev"; and a work of Buddhist didactic literature, "Subhashita," translated by Galsan-Jimba Tuguldur. General scientific and source-study methods were applied to the analysis of handwritten, printed, and xylographic texts in Mongolian script. The authors examine the selection of materials, their transliteration and translation, and both substantive aspects (thematic, lexical) and technical aspects (typos, pagination, numerals). The parallel Russian-language version is being created by the research group. The authors emphasize the significance of the parallel corpus as a resource for further research in Buryat linguistics, translation studies, and cultural studies, as well as its role in promoting Old Mongolian script among the general public and preserving the intangible heritage of the Baikal region. The corpus is a unique database for further research in various fields of science.
The texts considered will serve as a basis for developing machine translation algorithms, and the work conducted at this stage will help future developers create more effective ones. Creating a specialized database that is open not only to researchers but also to educators, professional translators, and anyone with a scientific or cultural interest in written heritage appears promising.
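As an illustration only, one entry of such a parallel corpus could be modelled as a record that keeps the Mongolian-script text, its transliteration, and the Russian translation side by side, together with the page it comes from. The sketch below is not the project's actual schema: the class `AlignedSegment`, its field names, the helper `find_segments`, the page numbers, and the placeholder strings are all hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class AlignedSegment:
    """One aligned unit of a parallel corpus (hypothetical schema)."""
    source_title: str      # title of the source document
    page: int              # page of the source the segment comes from
    original: str          # text in Mongolian script
    transliteration: str   # Latin transliteration of the original
    translation_ru: str    # parallel Russian translation


def find_segments(corpus, query):
    """Return segments whose transliteration or translation contains query."""
    q = query.casefold()
    return [
        seg for seg in corpus
        if q in seg.transliteration.casefold()
        or q in seg.translation_ru.casefold()
    ]


# Placeholder content only; no real corpus text is reproduced here.
corpus = [
    AlignedSegment("Subhashita", 1,
                   "<Mongolian-script text 1>",
                   "<transliteration 1>",
                   "<Russian translation 1>"),
    AlignedSegment("Narrative of Samdan Noyon", 3,
                   "<Mongolian-script text 2>",
                   "<transliteration 2>",
                   "<Russian translation 2>"),
]

hits = find_segments(corpus, "translation 2")
for seg in hits:
    print(seg.source_title, seg.page)
```

Keeping all three layers in one record means a search on either the transliteration or the translation returns the matching original text and its page reference along with it.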