Monuments in Mongolian Writing: An Experience of Creating a Parallel Corpus
This article highlights the results of the work on creating a parallel corpus of Buryat sources in Mongolian script. The project is being carried out with the support of the Russian Science Foundation, based on the archival materials from the Center for Eastern Manuscripts and Xylographs of the IMBT...
Saved in:
| Published in | Историческая информатика no. 2; pp. 1 - 10 |
|---|---|
| Main Authors | , , |
| Format | Journal Article |
| Language | English |
| Published |
01.02.2025
|
| Online Access | Get full text |
| ISSN | 2585-7797 2585-7797 |
| DOI | 10.7256/2585-7797.2025.2.73930 |
Cover
| Abstract | This article highlights the results of the work on creating a parallel corpus of Buryat sources in Mongolian script. The project is being carried out with the support of the Russian Science Foundation, based on the archival materials from the Center for Eastern Manuscripts and Xylographs of the IMBT SB RAS. The subject of the research is the process of creating a database for the corpus, the specifics of compiling it, particularly the selection of materials. Currently, the developing corpus includes the following documents from the archival funds of the CVRK IMBT SB RAS: texts of historical content—"A Brief Outline of the History of Khori-Mongolian Buryats," "On the History of the Zugalai Region"; an official document "Protocol of the All-Buryat Assembly in Chita in 1917"; an ethnographic composition "Narrative of Samdan Noyon," a medical work "Notes of Tibetan Doctor Donduba Munkuyev"; a work of Buddhist didactic literature "Subhashita" translated by Galsan-Jimba Tuguldur. General scientific and source study methods were applied to the analysis of handwritten, printed, and xylographic texts in Mongolian script. The processes of material selection, their transliteration and translation, as well as substantive (thematic, lexical) and technical aspects (typos, pagination, numerals) were examined. The parallel Russian-language version is being created by the research group. The authors emphasize the significance of creating a parallel corpus as a resource for further research in the field of Buryat linguistics, translation studies, and cultural studies, as well as its role in promoting Old Mongolian script among the general public and preserving the intangible heritage of the Baikal region. The corpus represents a unique database for further research in various fields of science, etc. The texts considered will serve as a basis for the development of machine translation algorithms, and the work being conducted at this stage will help future developers create more effective algorithms. The creation of a specialized database that is open not only to researchers but also to representatives of the educational sector, professional translators, and anyone showing a scientific or cultural interest in written heritage appears promising. |
|---|---|
| AbstractList | This article highlights the results of the work on creating a parallel corpus of Buryat sources in Mongolian script. The project is being carried out with the support of the Russian Science Foundation, based on the archival materials from the Center for Eastern Manuscripts and Xylographs of the IMBT SB RAS. The subject of the research is the process of creating a database for the corpus, the specifics of compiling it, particularly the selection of materials. Currently, the developing corpus includes the following documents from the archival funds of the CVRK IMBT SB RAS: texts of historical content—"A Brief Outline of the History of Khori-Mongolian Buryats," "On the History of the Zugalai Region"; an official document "Protocol of the All-Buryat Assembly in Chita in 1917"; an ethnographic composition "Narrative of Samdan Noyon," a medical work "Notes of Tibetan Doctor Donduba Munkuyev"; a work of Buddhist didactic literature "Subhashita" translated by Galsan-Jimba Tuguldur. General scientific and source study methods were applied to the analysis of handwritten, printed, and xylographic texts in Mongolian script. The processes of material selection, their transliteration and translation, as well as substantive (thematic, lexical) and technical aspects (typos, pagination, numerals) were examined. The parallel Russian-language version is being created by the research group. The authors emphasize the significance of creating a parallel corpus as a resource for further research in the field of Buryat linguistics, translation studies, and cultural studies, as well as its role in promoting Old Mongolian script among the general public and preserving the intangible heritage of the Baikal region. The corpus represents a unique database for further research in various fields of science, etc. The texts considered will serve as a basis for the development of machine translation algorithms, and the work being conducted at this stage will help future developers create more effective algorithms. The creation of a specialized database that is open not only to researchers but also to representatives of the educational sector, professional translators, and anyone showing a scientific or cultural interest in written heritage appears promising. |
| Author | Tsyrenova, Nomin' Dondokovna TSipilova, Snezhana Sergeevna Debenova, Zinaida Antsiferovna |
| Author_xml | – sequence: 1 givenname: Zinaida Antsiferovna surname: Debenova fullname: Debenova, Zinaida Antsiferovna – sequence: 2 givenname: Snezhana Sergeevna surname: TSipilova fullname: TSipilova, Snezhana Sergeevna – sequence: 3 givenname: Nomin' Dondokovna surname: Tsyrenova fullname: Tsyrenova, Nomin' Dondokovna |
| BookMark | eNqNkF1LwzAUhoNMcM79BckfaE3T5su7UaYTJ3ox8DKcpckIdGlJNnT_3taJeOnV-Xh5D-d9rtEkdMEidFuQXFDG7yiTLBNCiZwSynKai1KV5AJNf4XJn_4KzVPyW1JVoqKlVFP0_NKF496GQ8I-4GHYda2HgN-jP_iwu8eLgJefvY3eBmNx53AdLYwSBvwGEdrWtrjuYn9MN-jSQZvs_KfO0OZhualX2fr18alerDOjGMl4aStmAYRTVFEHkjGmZFNIRwhQJ7eMm6bhynFTyoYzVykqnIVha7gRqpwhcT57DD2cPoYPdB_9HuJJF0SPVPQYWI-B9UhFU_1NZXDys9PELqVo3X-NX46RZ_Q |
| Cites_doi | 10.30853/phil20230006 10.21209/1996-7853-2020-15-3-153-160 10.28995/2073-0101-2020-4-1255-1266 10.31554/2222-9175-2023-49-97-103 10.22162/2619-0990-2022-61-4-740-750 |
| ContentType | Journal Article |
| DBID | AAYXX CITATION ADTOC UNPAY |
| DOI | 10.7256/2585-7797.2025.2.73930 |
| DatabaseName | CrossRef Unpaywall for CDI: Periodical Content Unpaywall |
| DatabaseTitle | CrossRef |
| DatabaseTitleList | CrossRef |
| Database_xml | – sequence: 1 dbid: UNPAY name: Unpaywall url: https://proxy.k.utb.cz/login?url=https://unpaywall.org/ sourceTypes: Open Access Repository |
| DeliveryMethod | fulltext_linktorsrc |
| EISSN | 2585-7797 |
| EndPage | 10 |
| ExternalDocumentID | 10.7256/2585-7797.2025.2.73930 10_7256_2585_7797_2025_2_73930 |
| GroupedDBID | AAYXX ALMA_UNASSIGNED_HOLDINGS CITATION M~E ADTOC UNPAY |
| ID | FETCH-LOGICAL-c950-63e45eaa7f9292fa855598d18f00a2f8b56cdd69f6c38d65f4927fea6cdc6c793 |
| IEDL.DBID | UNPAY |
| ISSN | 2585-7797 |
| IngestDate | Tue Aug 19 23:28:43 EDT 2025 Wed Oct 01 06:27:01 EDT 2025 |
| IsDoiOpenAccess | true |
| IsOpenAccess | true |
| IsPeerReviewed | false |
| IsScholarly | false |
| Issue | 2 |
| Language | English |
| License | cc-by-nc |
| LinkModel | DirectLink |
| MergedId | FETCHMERGED-LOGICAL-c950-63e45eaa7f9292fa855598d18f00a2f8b56cdd69f6c38d65f4927fea6cdc6c793 |
| OpenAccessLink | https://proxy.k.utb.cz/login?url=https://doi.org/10.7256/2585-7797.2025.2.73930 |
| PageCount | 10 |
| ParticipantIDs | unpaywall_primary_10_7256_2585_7797_2025_2_73930 crossref_primary_10_7256_2585_7797_2025_2_73930 |
| ProviderPackageCode | CITATION AAYXX |
| PublicationCentury | 2000 |
| PublicationDate | 2025-2-00 |
| PublicationDateYYYYMMDD | 2025-02-01 |
| PublicationDate_xml | – month: 02 year: 2025 text: 2025-2-00 |
| PublicationDecade | 2020 |
| PublicationTitle | Историческая информатика |
| PublicationYear | 2025 |
| References | ref13 ref12 ref15 ref14 ref11 ref10 ref2 ref1 ref17 ref16 ref8 ref7 ref9 ref4 ref3 ref6 ref5 |
| References_xml | – ident: ref4 doi: 10.30853/phil20230006 – ident: ref13 – ident: ref1 – ident: ref2 – ident: ref3 – ident: ref5 – ident: ref6 – ident: ref7 – ident: ref9 doi: 10.21209/1996-7853-2020-15-3-153-160 – ident: ref15 doi: 10.28995/2073-0101-2020-4-1255-1266 – ident: ref11 doi: 10.31554/2222-9175-2023-49-97-103 – ident: ref17 doi: 10.22162/2619-0990-2022-61-4-740-750 – ident: ref8 – ident: ref16 – ident: ref10 – ident: ref12 – ident: ref14 |
| SSID | ssib044742389 ssib032177147 |
| Score | 1.9008486 |
| Snippet | This article highlights the results of the work on creating a parallel corpus of Buryat sources in Mongolian script. The project is being carried out with the... |
| SourceID | unpaywall crossref |
| SourceType | Open Access Repository Index Database |
| StartPage | 1 |
| Title | Monuments in Mongolian Writing: An Experience of Creating a Parallel Corpus |
| URI | https://doi.org/10.7256/2585-7797.2025.2.73930 |
| UnpaywallVersion | publishedVersion |
| hasFullText | 1 |
| inHoldings | 1 |
| isFullTextHit | |
| isPrint | |
| journalDatabaseRights | – providerCode: PRVHPJ databaseName: ROAD customDbUrl: eissn: 2585-7797 dateEnd: 99991231 omitProxy: true ssIdentifier: ssib044742389 issn: 2585-7797 databaseCode: M~E dateStart: 20170101 isFulltext: true titleUrlDefault: https://road.issn.org providerName: ISSN International Centre |
| link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV3PT8IwFG4UDp78ETVilPTgdWPr1m7zRgiEaEAOEPG0tGtLjKQQZDF68G_3dRv-Ohg8rulbmm99ed_r-r6H0JWnCdOx9B0_VrETqkw7QgfK4YkQVm5O8EJ4fjBk_Ul4M6XTKlG0tTDf_t9HEI1bBOgsMMAkglSOUJe4VsENUvQ6o8C9a6g-GY7aD7aD3GZiWQb8h_GPCLSXmyV_feHz-bew0jtAd5sFlbdJntx8Ldzs7ZdW4_YrPkT7FcPE7XJLHKEdZY7RLfhuXlSz4UeD4WG2sOcb-N5qGpnZNW4b_KV6jBcadwo2aWaY4xFf2Y4rc2xlj_PnEzTudcedvlP1UXCyhEJyGKiQKs4jDVSIaB5TK8ou_Vh7Hic6FpRlUrJEsyyIJaM6TEikFYfRjGXgv6eoZhZGnSEM3s6DQMBrJTi7JwQXGgIazA609GnUQK0NtOmyVMtIIcuwuKQWl9TiklpcUpIWuDSQ9_kFtjQ5_7_JBaqtV7m6BP6wFk20O3jvNqut8wG-xrwO |
| linkProvider | Unpaywall |
| linkToUnpaywall | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV3PS8MwFA6yHTz5AxUnKjl4bdemTdp6G8MxFOcOG85Tyc8hjmzMFdG_3pe20-lB5rEhr4SvebzvpXnfQ-gqMISZVIVemOrUi7U0njCR9ngmhJObE7wUnr8fsP44vp3QSZ0oulqYjf_3CUTjNgE6CwwwSyCVI9QnvlNwgxS9yShw7wZqjgfDzpPrILeeWJUB_2H8IwLtFnbB39_4bLYRVnr76GG9oOo2yYtfrIQvP35pNW6_4gO0VzNM3Km2xCHa0fYI3YHvFmU1G362GB6mc3e-gR-dppGdXuOOxd-qx3hucLdkk3aKOR7ypeu4MsNO9rh4PUaj3s2o2_fqPgqezCgkh5GOqeY8MUCFiOEpdaLsKkxNEHBiUkGZVIplhskoVYyaOCOJ0RxGJZPgvyeoYedWnyIM3s6jSMBrFTh7IAQXBgIazI6MCmnSQu01tPmiUsvIIctwuOQOl9zhkjtccpKXuLRQ8PUFtjQ5-7_JOWqsloW-AP6wEpf1pvkEWwy63Q |
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Monuments+in+Mongolian+Writing%3A+An+Experience+of+Creating+a+Parallel+Corpus&rft.jtitle=%D0%98%D1%81%D1%82%D0%BE%D1%80%D0%B8%D1%87%D0%B5%D1%81%D0%BA%D0%B0%D1%8F+%D0%B8%D0%BD%D1%84%D0%BE%D1%80%D0%BC%D0%B0%D1%82%D0%B8%D0%BA%D0%B0&rft.au=Debenova%2C+Zinaida+Antsiferovna&rft.au=TSipilova%2C+Snezhana+Sergeevna&rft.au=Tsyrenova%2C+Nomin%27+Dondokovna&rft.date=2025-02-01&rft.issn=2585-7797&rft.eissn=2585-7797&rft.issue=2&rft.spage=1&rft.epage=10&rft_id=info:doi/10.7256%2F2585-7797.2025.2.73930&rft.externalDBID=n%2Fa&rft.externalDocID=10_7256_2585_7797_2025_2_73930 |
| thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=2585-7797&client=summon |
| thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=2585-7797&client=summon |
| thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=2585-7797&client=summon |