The world through the eyes of an educated person in Minusinsk of the late XIX - early XX centuries: distribution of the frequency of geographical names in the books of the Minusinsk Public Library
The subject of the study is the corpus of children's literature from the collection of the Minusinsk Public Library of the late XIX – early XX century, consisting of 121 works written between 1719 and 1905. These texts are a significant source for studying the formation of geographical percepti...
Saved in:
| Published in | Историческая информатика no. 1; pp. 174 - 189 |
|---|---|
| Main Authors | , |
| Format | Journal Article |
| Language | English |
| Published |
01.01.2025
|
| Online Access | Get full text |
| ISSN | 2585-7797 2585-7797 |
| DOI | 10.7256/2585-7797.2025.1.72586 |
Cover
| Abstract | The subject of the study is the corpus of children's literature from the collection of the Minusinsk Public Library of the late XIX – early XX century, consisting of 121 works written between 1719 and 1905. These texts are a significant source for studying the formation of geographical perception among residents of a provincial Siberian city through fiction. Special attention is paid to the analysis of geographical names (toponyms) found in texts in order to identify their frequency and geographical distribution. This allows us to reconstruct the picture of the world presented in the books of that time and understand how it was perceived by the children's audience, forming their idea of countries, cities and cultural centers. The research is aimed at studying the role of children's literature as a cultural tool that reflects and forms geographical representations, as well as at identifying methodological challenges and limitations when working with historical buildings. The methodological basis includes bringing pre-reform texts to a machine-readable form using digitization tools and geoparsing to automatically identify geographical entities. The Spacy library was used for the analysis, followed by manual verification and correction of the data. The results of the study include the identification of 668 cities and 97 countries represented in the texts, as well as the construction of a cartographic visualization of the frequency distribution of mentions. The analysis revealed an uneven distribution of geographical names in various texts, where mentions of Russia, Poland and England prevail among countries, and Kiev, Moscow and St. Petersburg among cities. The scope of the results includes research in the field of digital humanities, library science and historical and cultural studies. The novelty of the work lies in the use of modern geoparsing methods for processing Russian-language texts of pre-reform spelling and in the analysis of the previously unexplored literature corpus of the Minusinsk Library. The conclusions emphasize the importance of text mapping for understanding the formation of geographical perception and the need for further development of NER tools for complex corpora. Despite the limitations, the research contributes to the development of NLP methods for historical texts. |
|---|---|
| AbstractList | The subject of the study is the corpus of children's literature from the collection of the Minusinsk Public Library of the late XIX – early XX century, consisting of 121 works written between 1719 and 1905. These texts are a significant source for studying the formation of geographical perception among residents of a provincial Siberian city through fiction. Special attention is paid to the analysis of geographical names (toponyms) found in texts in order to identify their frequency and geographical distribution. This allows us to reconstruct the picture of the world presented in the books of that time and understand how it was perceived by the children's audience, forming their idea of countries, cities and cultural centers. The research is aimed at studying the role of children's literature as a cultural tool that reflects and forms geographical representations, as well as at identifying methodological challenges and limitations when working with historical buildings. The methodological basis includes bringing pre-reform texts to a machine-readable form using digitization tools and geoparsing to automatically identify geographical entities. The Spacy library was used for the analysis, followed by manual verification and correction of the data. The results of the study include the identification of 668 cities and 97 countries represented in the texts, as well as the construction of a cartographic visualization of the frequency distribution of mentions. The analysis revealed an uneven distribution of geographical names in various texts, where mentions of Russia, Poland and England prevail among countries, and Kiev, Moscow and St. Petersburg among cities. The scope of the results includes research in the field of digital humanities, library science and historical and cultural studies. The novelty of the work lies in the use of modern geoparsing methods for processing Russian-language texts of pre-reform spelling and in the analysis of the previously unexplored literature corpus of the Minusinsk Library. The conclusions emphasize the importance of text mapping for understanding the formation of geographical perception and the need for further development of NER tools for complex corpora. Despite the limitations, the research contributes to the development of NLP methods for historical texts. |
| Author | Kizhner, Inna Aleksandrovna Mekhovskii, Vadim Aleksandrovich |
| Author_xml | – sequence: 1 givenname: Vadim Aleksandrovich surname: Mekhovskii fullname: Mekhovskii, Vadim Aleksandrovich – sequence: 2 givenname: Inna Aleksandrovna surname: Kizhner fullname: Kizhner, Inna Aleksandrovna |
| BookMark | eNqNkctOwzAQRS0EEq_-ApofSEncJHbYIcRLKoJFF9lFjjNurKZ2sROh_B8fRkx5LVnNaHzPXGvuKTk01iAhF0k8ZzTLL2nGs4ixgs1pTLN5EqY8PyAnPw-Hf_pjMvNe13GaspQueHFC3lctwpt1XQN96-ywbqeKgCN6sAqEAWwGKXpsYIfOWwPawJM2g9fGb4IkyLtJAOVjCRGgcN0IZQkSTT84jf4KGu17p-uh1xP_hSiHrwMaOYbBGu3aiV2rpejAiO1kPtkEWW3txn8zv74vQ91pCUtdO-HGc3KkROdx9lXPyOrudnXzEC2f7x9vrpeRLLI8ShKRL3LJGaOomoylikvKs1osaskFxZzFTR7XSAVTjPO0aVAWXBUxU5jFVC3OCNuvHcxOjG-i66qd09vpA1USVyGOKly6CpeuQhxVUn3GMZH5npTOeu9Q_Rf8AEsklFA |
| Cites_doi | 10.3366/ijhac.2019.0229 10.1093/llc/fqac014 |
| ContentType | Journal Article |
| DBID | AAYXX CITATION ADTOC UNPAY |
| DOI | 10.7256/2585-7797.2025.1.72586 |
| DatabaseName | CrossRef Unpaywall for CDI: Periodical Content Unpaywall |
| DatabaseTitle | CrossRef |
| DatabaseTitleList | CrossRef |
| Database_xml | – sequence: 1 dbid: UNPAY name: Unpaywall url: https://proxy.k.utb.cz/login?url=https://unpaywall.org/ sourceTypes: Open Access Repository |
| DeliveryMethod | fulltext_linktorsrc |
| EISSN | 2585-7797 |
| EndPage | 189 |
| ExternalDocumentID | 10.7256/2585-7797.2025.1.72586 10_7256_2585_7797_2025_1_72586 |
| GroupedDBID | AAYXX ALMA_UNASSIGNED_HOLDINGS CITATION M~E ADTOC UNPAY |
| ID | FETCH-LOGICAL-c956-11a636c8772efd574f8c285ba3bc8a2e670d60be2a7f7884ddec98f907fe502f3 |
| IEDL.DBID | UNPAY |
| ISSN | 2585-7797 |
| IngestDate | Tue Aug 19 23:54:11 EDT 2025 Wed Oct 01 08:30:32 EDT 2025 |
| IsDoiOpenAccess | true |
| IsOpenAccess | true |
| IsPeerReviewed | false |
| IsScholarly | false |
| Issue | 1 |
| Language | English |
| License | cc-by-nc |
| LinkModel | DirectLink |
| MergedId | FETCHMERGED-LOGICAL-c956-11a636c8772efd574f8c285ba3bc8a2e670d60be2a7f7884ddec98f907fe502f3 |
| OpenAccessLink | https://proxy.k.utb.cz/login?url=https://doi.org/10.7256/2585-7797.2025.1.72586 |
| PageCount | 16 |
| ParticipantIDs | unpaywall_primary_10_7256_2585_7797_2025_1_72586 crossref_primary_10_7256_2585_7797_2025_1_72586 |
| ProviderPackageCode | CITATION AAYXX |
| PublicationCentury | 2000 |
| PublicationDate | 2025-1-00 |
| PublicationDateYYYYMMDD | 2025-01-01 |
| PublicationDate_xml | – month: 01 year: 2025 text: 2025-1-00 |
| PublicationDecade | 2020 |
| PublicationTitle | Историческая информатика |
| PublicationYear | 2025 |
| References | ref8 ref7 ref9 ref4 ref3 ref6 ref11 ref5 ref10 ref2 ref1 |
| References_xml | – ident: ref1 – ident: ref4 – ident: ref7 doi: 10.3366/ijhac.2019.0229 – ident: ref2 – ident: ref3 – ident: ref5 – ident: ref6 – ident: ref9 – ident: ref8 – ident: ref10 doi: 10.1093/llc/fqac014 – ident: ref11 |
| SSID | ssib044742389 ssib032177147 |
| Score | 1.8973504 |
| Snippet | The subject of the study is the corpus of children's literature from the collection of the Minusinsk Public Library of the late XIX – early XX century,... |
| SourceID | unpaywall crossref |
| SourceType | Open Access Repository Index Database |
| StartPage | 174 |
| Title | The world through the eyes of an educated person in Minusinsk of the late XIX - early XX centuries: distribution of the frequency of geographical names in the books of the Minusinsk Public Library |
| URI | https://doi.org/10.7256/2585-7797.2025.1.72586 |
| UnpaywallVersion | publishedVersion |
| hasFullText | 1 |
| inHoldings | 1 |
| isFullTextHit | |
| isPrint | |
| journalDatabaseRights | – providerCode: PRVHPJ databaseName: ROAD: Directory of Open Access Scholarly Resources customDbUrl: eissn: 2585-7797 dateEnd: 99991231 omitProxy: true ssIdentifier: ssib044742389 issn: 2585-7797 databaseCode: M~E dateStart: 20170101 isFulltext: true titleUrlDefault: https://road.issn.org providerName: ISSN International Centre |
| link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1LS8NAEF60Hjz5QEVFyxy8pqbJbrL1JmJRoepBIZ7CPkUssfSB1IO_zh_mTB6-DqKnwDLDLnwT5ptk9hvGDvCNcdLGJkg5VwFPvQh0KkwQY7KxMnFSlePbBpfJ2S2_yERWF4p0F-bL__sUs_FhhHQWGWAvxVIuEp0urcpkkS0lArl3iy3dXl4f39EEucawugb8i_O3DLQ8K0Zq_qyGwy9ppb_KrpoDVd0kj53ZVHfMyw-txr-feI2t1AwTjquQWGcLrthgbxgOUKqjQj2ZB58O3NxN4MmDKqCScHUWRiUHh4cCBg8FtcVPHsmEzIdoANl5BgE4EkaGLANTZi2st4_AkghvPT-rcfHjqld7Tgv31cR16v4YQkH9ubQNmRHXnzQ-n_tWHxWhvlyxyW76pzcnZ0E9wCEwpbxhVyVxYiQSeOetSLmXJpJCq1gbqSKXpKFNQu0ilXqsxDnGjelJj-W6dyKMfLzFWsVT4bYZRNZqLL54pLXh2tqedoJbFapubKziYocdNpjmo0qmI8fyhgDJCZCcAMkJkLybl4DssPAD-j-67P7fZY-1puOZ20fiMtVttjh4PW3XMfsO1yLoxw |
| linkProvider | Unpaywall |
| linkToUnpaywall | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1LS8NAEF60PXjygYqKyhy8pqbJbrL1JqKo4OPQQjyFfYpYYukDqb_PH-ZMHlo9iJ4Cywy78E2Yb5LZbxg7wjfGSRubIOVcBTz1ItCpMEGMycbKxElVjm-7uU0uB_w6E1ldKNJdmIX_9ylm4-MI6SwywF6KpVwkOl1alckyaycCuXeLtQe396cPNEGuMayuAf_i_C0DrcyKkZq_quFwIa1crLG75kBVN8lzZzbVHfP2Q6vx7ydeZ6s1w4TTKiQ22JIrNtk7hgOU6qhQT-bBpwM3dxN48aAKqCRcnYVRycHhqYCbp4La4ifPZELmQzSA7CqDABwJI0OWgSmzFtbbJ2BJhLeen9W4-HHVqz2nhcdq4jp1fwyhoP5c2obMiOtPGp-vfauPilBfrthi_Yvz_tllUA9wCEwpb9hVSZwYiQTeeStS7qWJpNAq1kaqyCVpaJNQu0ilHitxjnFjetJjue6dCCMfb7NW8VK4HQaRtRqLLx5pbbi2tqed4FaFqhsbq7jYZccNpvmokunIsbwhQHICJCdAcgIk7-YlILss_IT-jy57_3fZZ63peOYOkLhM9WEdrR9nA-eW |
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=The+world+through+the+eyes+of+an+educated+person+in+Minusinsk+of+the+late+XIX+-+early+XX+centuries%3A+distribution+of+the+frequency+of+geographical+names+in+the+books+of+the+Minusinsk+Public+Library&rft.jtitle=%D0%98%D1%81%D1%82%D0%BE%D1%80%D0%B8%D1%87%D0%B5%D1%81%D0%BA%D0%B0%D1%8F+%D0%B8%D0%BD%D1%84%D0%BE%D1%80%D0%BC%D0%B0%D1%82%D0%B8%D0%BA%D0%B0&rft.au=Mekhovskii%2C+Vadim+Aleksandrovich&rft.au=Kizhner%2C+Inna+Aleksandrovna&rft.date=2025-01-01&rft.issn=2585-7797&rft.eissn=2585-7797&rft.issue=1&rft.spage=174&rft.epage=189&rft_id=info:doi/10.7256%2F2585-7797.2025.1.72586&rft.externalDBID=n%2Fa&rft.externalDocID=10_7256_2585_7797_2025_1_72586 |
| thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=2585-7797&client=summon |
| thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=2585-7797&client=summon |
| thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=2585-7797&client=summon |