Analyzing Multilingual French and Russian Text using NLTK, spaCy, and Stanza

This lesson covers tokenization, part-of-speech tagging, and lemmatization, as well as automatic language detection, for non-English and multilingual text. You’ll learn how to use the Python packages NLTK, spaCy, and Stanza to analyze a multilingual Russian and French text.

Saved in:

Bibliographic Details
Published in	The programming historian Vol. 13; no. 13
Main Author	Goodale, Ian
Format	Journal Article
Language	English
Published	ProgHist Ltd 13.11.2024 Editorial Board of the Programming Historian
Subjects	Historians Language Libraries Multilingualism Natural language Natural language processing Python Sentiment analysis Speech Text analysis
Online Access	Get full text
ISSN	2397-2068 2397-2068
DOI	10.46430/phen0121

Cover

More Information
Summary:	This lesson covers tokenization, part-of-speech tagging, and lemmatization, as well as automatic language detection, for non-English and multilingual text. You’ll learn how to use the Python packages NLTK, spaCy, and Stanza to analyze a multilingual Russian and French text.
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14
ISSN:	2397-2068 2397-2068
DOI:	10.46430/phen0121