Analyzing Multilingual French and Russian Text using NLTK, spaCy, and Stanza
This lesson covers tokenization, part-of-speech tagging, and lemmatization, as well as automatic language detection, for non-English and multilingual text. You’ll learn how to use the Python packages NLTK, spaCy, and Stanza to analyze a multilingual Russian and French text.
Saved in:
| Published in | The programming historian Vol. 13; no. 13 |
|---|---|
| Main Author | |
| Format | Journal Article |
| Language | English |
| Published |
ProgHist Ltd
13.11.2024
Editorial Board of the Programming Historian |
| Subjects | |
| Online Access | Get full text |
| ISSN | 2397-2068 2397-2068 |
| DOI | 10.46430/phen0121 |
Cover
| Summary: | This lesson covers tokenization, part-of-speech tagging, and lemmatization, as well as automatic language detection, for non-English and multilingual text. You’ll learn how to use the Python packages NLTK, spaCy, and Stanza to analyze a multilingual Russian and French text. |
|---|---|
| Bibliography: | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 |
| ISSN: | 2397-2068 2397-2068 |
| DOI: | 10.46430/phen0121 |