Analyzing Multilingual French and Russian Text using NLTK, spaCy, and Stanza

This lesson covers tokenization, part-of-speech tagging, and lemmatization, as well as automatic language detection, for non-English and multilingual text. You’ll learn how to use the Python packages NLTK, spaCy, and Stanza to analyze a multilingual Russian and French text.

Saved in:
Bibliographic Details
Published inThe programming historian Vol. 13; no. 13
Main Author Goodale, Ian
Format Journal Article
LanguageEnglish
Published ProgHist Ltd 13.11.2024
Editorial Board of the Programming Historian
Subjects
Online AccessGet full text
ISSN2397-2068
2397-2068
DOI10.46430/phen0121

Cover

More Information
Summary:This lesson covers tokenization, part-of-speech tagging, and lemmatization, as well as automatic language detection, for non-English and multilingual text. You’ll learn how to use the Python packages NLTK, spaCy, and Stanza to analyze a multilingual Russian and French text.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ISSN:2397-2068
2397-2068
DOI:10.46430/phen0121