The world through the eyes of an educated person in Minusinsk of the late XIX - early XX centuries: distribution of the frequency of geographical names in the books of the Minusinsk Public Library

The subject of the study is the corpus of children's literature from the collection of the Minusinsk Public Library of the late XIX – early XX century, consisting of 121 works written between 1719 and 1905. These texts are a significant source for studying the formation of geographical percepti...

Full description

Saved in:
Bibliographic Details
Published inИсторическая информатика no. 1; pp. 174 - 189
Main Authors Mekhovskii, Vadim Aleksandrovich, Kizhner, Inna Aleksandrovna
Format Journal Article
LanguageEnglish
Published 01.01.2025
Online AccessGet full text
ISSN2585-7797
2585-7797
DOI10.7256/2585-7797.2025.1.72586

Cover

Abstract The subject of the study is the corpus of children's literature from the collection of the Minusinsk Public Library of the late XIX – early XX century, consisting of 121 works written between 1719 and 1905. These texts are a significant source for studying the formation of geographical perception among residents of a provincial Siberian city through fiction. Special attention is paid to the analysis of geographical names (toponyms) found in texts in order to identify their frequency and geographical distribution. This allows us to reconstruct the picture of the world presented in the books of that time and understand how it was perceived by the children's audience, forming their idea of countries, cities and cultural centers. The research is aimed at studying the role of children's literature as a cultural tool that reflects and forms geographical representations, as well as at identifying methodological challenges and limitations when working with historical buildings. The methodological basis includes bringing pre-reform texts to a machine-readable form using digitization tools and geoparsing to automatically identify geographical entities. The Spacy library was used for the analysis, followed by manual verification and correction of the data. The results of the study include the identification of 668 cities and 97 countries represented in the texts, as well as the construction of a cartographic visualization of the frequency distribution of mentions. The analysis revealed an uneven distribution of geographical names in various texts, where mentions of Russia, Poland and England prevail among countries, and Kiev, Moscow and St. Petersburg among cities. The scope of the results includes research in the field of digital humanities, library science and historical and cultural studies. The novelty of the work lies in the use of modern geoparsing methods for processing Russian-language texts of pre-reform spelling and in the analysis of the previously unexplored literature corpus of the Minusinsk Library. The conclusions emphasize the importance of text mapping for understanding the formation of geographical perception and the need for further development of NER tools for complex corpora. Despite the limitations, the research contributes to the development of NLP methods for historical texts.
AbstractList The subject of the study is the corpus of children's literature from the collection of the Minusinsk Public Library of the late XIX – early XX century, consisting of 121 works written between 1719 and 1905. These texts are a significant source for studying the formation of geographical perception among residents of a provincial Siberian city through fiction. Special attention is paid to the analysis of geographical names (toponyms) found in texts in order to identify their frequency and geographical distribution. This allows us to reconstruct the picture of the world presented in the books of that time and understand how it was perceived by the children's audience, forming their idea of countries, cities and cultural centers. The research is aimed at studying the role of children's literature as a cultural tool that reflects and forms geographical representations, as well as at identifying methodological challenges and limitations when working with historical buildings. The methodological basis includes bringing pre-reform texts to a machine-readable form using digitization tools and geoparsing to automatically identify geographical entities. The Spacy library was used for the analysis, followed by manual verification and correction of the data. The results of the study include the identification of 668 cities and 97 countries represented in the texts, as well as the construction of a cartographic visualization of the frequency distribution of mentions. The analysis revealed an uneven distribution of geographical names in various texts, where mentions of Russia, Poland and England prevail among countries, and Kiev, Moscow and St. Petersburg among cities. The scope of the results includes research in the field of digital humanities, library science and historical and cultural studies. The novelty of the work lies in the use of modern geoparsing methods for processing Russian-language texts of pre-reform spelling and in the analysis of the previously unexplored literature corpus of the Minusinsk Library. The conclusions emphasize the importance of text mapping for understanding the formation of geographical perception and the need for further development of NER tools for complex corpora. Despite the limitations, the research contributes to the development of NLP methods for historical texts.
Author Kizhner, Inna Aleksandrovna
Mekhovskii, Vadim Aleksandrovich
Author_xml – sequence: 1
  givenname: Vadim Aleksandrovich
  surname: Mekhovskii
  fullname: Mekhovskii, Vadim Aleksandrovich
– sequence: 2
  givenname: Inna Aleksandrovna
  surname: Kizhner
  fullname: Kizhner, Inna Aleksandrovna
BookMark eNqNkctOwzAQRS0EEq_-ApofSEncJHbYIcRLKoJFF9lFjjNurKZ2sROh_B8fRkx5LVnNaHzPXGvuKTk01iAhF0k8ZzTLL2nGs4ixgs1pTLN5EqY8PyAnPw-Hf_pjMvNe13GaspQueHFC3lctwpt1XQN96-ywbqeKgCN6sAqEAWwGKXpsYIfOWwPawJM2g9fGb4IkyLtJAOVjCRGgcN0IZQkSTT84jf4KGu17p-uh1xP_hSiHrwMaOYbBGu3aiV2rpejAiO1kPtkEWW3txn8zv74vQ91pCUtdO-HGc3KkROdx9lXPyOrudnXzEC2f7x9vrpeRLLI8ShKRL3LJGaOomoylikvKs1osaskFxZzFTR7XSAVTjPO0aVAWXBUxU5jFVC3OCNuvHcxOjG-i66qd09vpA1USVyGOKly6CpeuQhxVUn3GMZH5npTOeu9Q_Rf8AEsklFA
Cites_doi 10.3366/ijhac.2019.0229
10.1093/llc/fqac014
ContentType Journal Article
DBID AAYXX
CITATION
ADTOC
UNPAY
DOI 10.7256/2585-7797.2025.1.72586
DatabaseName CrossRef
Unpaywall for CDI: Periodical Content
Unpaywall
DatabaseTitle CrossRef
DatabaseTitleList CrossRef
Database_xml – sequence: 1
  dbid: UNPAY
  name: Unpaywall
  url: https://proxy.k.utb.cz/login?url=https://unpaywall.org/
  sourceTypes: Open Access Repository
DeliveryMethod fulltext_linktorsrc
EISSN 2585-7797
EndPage 189
ExternalDocumentID 10.7256/2585-7797.2025.1.72586
10_7256_2585_7797_2025_1_72586
GroupedDBID AAYXX
ALMA_UNASSIGNED_HOLDINGS
CITATION
M~E
ADTOC
UNPAY
ID FETCH-LOGICAL-c956-11a636c8772efd574f8c285ba3bc8a2e670d60be2a7f7884ddec98f907fe502f3
IEDL.DBID UNPAY
ISSN 2585-7797
IngestDate Tue Aug 19 23:54:11 EDT 2025
Wed Oct 01 08:30:32 EDT 2025
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed false
IsScholarly false
Issue 1
Language English
License cc-by-nc
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c956-11a636c8772efd574f8c285ba3bc8a2e670d60be2a7f7884ddec98f907fe502f3
OpenAccessLink https://proxy.k.utb.cz/login?url=https://doi.org/10.7256/2585-7797.2025.1.72586
PageCount 16
ParticipantIDs unpaywall_primary_10_7256_2585_7797_2025_1_72586
crossref_primary_10_7256_2585_7797_2025_1_72586
ProviderPackageCode CITATION
AAYXX
PublicationCentury 2000
PublicationDate 2025-1-00
PublicationDateYYYYMMDD 2025-01-01
PublicationDate_xml – month: 01
  year: 2025
  text: 2025-1-00
PublicationDecade 2020
PublicationTitle Историческая информатика
PublicationYear 2025
References ref8
ref7
ref9
ref4
ref3
ref6
ref11
ref5
ref10
ref2
ref1
References_xml – ident: ref1
– ident: ref4
– ident: ref7
  doi: 10.3366/ijhac.2019.0229
– ident: ref2
– ident: ref3
– ident: ref5
– ident: ref6
– ident: ref9
– ident: ref8
– ident: ref10
  doi: 10.1093/llc/fqac014
– ident: ref11
SSID ssib044742389
ssib032177147
Score 1.8973504
Snippet The subject of the study is the corpus of children's literature from the collection of the Minusinsk Public Library of the late XIX – early XX century,...
SourceID unpaywall
crossref
SourceType Open Access Repository
Index Database
StartPage 174
Title The world through the eyes of an educated person in Minusinsk of the late XIX - early XX centuries: distribution of the frequency of geographical names in the books of the Minusinsk Public Library
URI https://doi.org/10.7256/2585-7797.2025.1.72586
UnpaywallVersion publishedVersion
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
journalDatabaseRights – providerCode: PRVHPJ
  databaseName: ROAD: Directory of Open Access Scholarly Resources
  customDbUrl:
  eissn: 2585-7797
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssib044742389
  issn: 2585-7797
  databaseCode: M~E
  dateStart: 20170101
  isFulltext: true
  titleUrlDefault: https://road.issn.org
  providerName: ISSN International Centre
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1LS8NAEF60Hjz5QEVFyxy8pqbJbrL1JmJRoepBIZ7CPkUssfSB1IO_zh_mTB6-DqKnwDLDLnwT5ptk9hvGDvCNcdLGJkg5VwFPvQh0KkwQY7KxMnFSlePbBpfJ2S2_yERWF4p0F-bL__sUs_FhhHQWGWAvxVIuEp0urcpkkS0lArl3iy3dXl4f39EEucawugb8i_O3DLQ8K0Zq_qyGwy9ppb_KrpoDVd0kj53ZVHfMyw-txr-feI2t1AwTjquQWGcLrthgbxgOUKqjQj2ZB58O3NxN4MmDKqCScHUWRiUHh4cCBg8FtcVPHsmEzIdoANl5BgE4EkaGLANTZi2st4_AkghvPT-rcfHjqld7Tgv31cR16v4YQkH9ubQNmRHXnzQ-n_tWHxWhvlyxyW76pzcnZ0E9wCEwpbxhVyVxYiQSeOetSLmXJpJCq1gbqSKXpKFNQu0ilXqsxDnGjelJj-W6dyKMfLzFWsVT4bYZRNZqLL54pLXh2tqedoJbFapubKziYocdNpjmo0qmI8fyhgDJCZCcAMkJkLybl4DssPAD-j-67P7fZY-1puOZ20fiMtVttjh4PW3XMfsO1yLoxw
linkProvider Unpaywall
linkToUnpaywall http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1LS8NAEF60PXjygYqKyhy8pqbJbrL1JqKo4OPQQjyFfYpYYukDqb_PH-ZMHlo9iJ4Cywy78E2Yb5LZbxg7wjfGSRubIOVcBTz1ItCpMEGMycbKxElVjm-7uU0uB_w6E1ldKNJdmIX_9ylm4-MI6SwywF6KpVwkOl1alckyaycCuXeLtQe396cPNEGuMayuAf_i_C0DrcyKkZq_quFwIa1crLG75kBVN8lzZzbVHfP2Q6vx7ydeZ6s1w4TTKiQ22JIrNtk7hgOU6qhQT-bBpwM3dxN48aAKqCRcnYVRycHhqYCbp4La4ifPZELmQzSA7CqDABwJI0OWgSmzFtbbJ2BJhLeen9W4-HHVqz2nhcdq4jp1fwyhoP5c2obMiOtPGp-vfauPilBfrthi_Yvz_tllUA9wCEwpb9hVSZwYiQTeeStS7qWJpNAq1kaqyCVpaJNQu0ilHitxjnFjetJjue6dCCMfb7NW8VK4HQaRtRqLLx5pbbi2tqed4FaFqhsbq7jYZccNpvmokunIsbwhQHICJCdAcgIk7-YlILss_IT-jy57_3fZZ63peOYOkLhM9WEdrR9nA-eW
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=The+world+through+the+eyes+of+an+educated+person+in+Minusinsk+of+the+late+XIX+-+early+XX+centuries%3A+distribution+of+the+frequency+of+geographical+names+in+the+books+of+the+Minusinsk+Public+Library&rft.jtitle=%D0%98%D1%81%D1%82%D0%BE%D1%80%D0%B8%D1%87%D0%B5%D1%81%D0%BA%D0%B0%D1%8F+%D0%B8%D0%BD%D1%84%D0%BE%D1%80%D0%BC%D0%B0%D1%82%D0%B8%D0%BA%D0%B0&rft.au=Mekhovskii%2C+Vadim+Aleksandrovich&rft.au=Kizhner%2C+Inna+Aleksandrovna&rft.date=2025-01-01&rft.issn=2585-7797&rft.eissn=2585-7797&rft.issue=1&rft.spage=174&rft.epage=189&rft_id=info:doi/10.7256%2F2585-7797.2025.1.72586&rft.externalDBID=n%2Fa&rft.externalDocID=10_7256_2585_7797_2025_1_72586
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=2585-7797&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=2585-7797&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=2585-7797&client=summon