Corpus Analysis with spaCy

This lesson demonstrates how to use the Python library spaCy for analysis of large collections of texts. This lesson details the process of using spaCy to enrich a corpus via lemmatization, part-of-speech tagging, dependency parsing, and named entity recognition. Readers will learn how the linguisti...

Full description

Saved in:

Bibliographic Details
Published in	The programming historian Vol. 12; no. 12
Main Author	Kane, Megan S.
Format	Journal Article
Language	English
Published	ProgHist Ltd 02.11.2023 Editorial Board of the Programming Historian
Subjects	Algorithms Annotations Biology Creative writing Datasets Humanities Hypotheses Language Learning Linguistics Metadata Natural language processing Product reviews Researchers Speech Student writing Tagging Texts United States > US Michigan
Online Access	Get full text
ISSN	2397-2068 2397-2068
DOI	10.46430/phen0113

Cover

More Information
Summary:	This lesson demonstrates how to use the Python library spaCy for analysis of large collections of texts. This lesson details the process of using spaCy to enrich a corpus via lemmatization, part-of-speech tagging, dependency parsing, and named entity recognition. Readers will learn how the linguistic annotations produced by spaCy can be analyzed to help researchers explore meaningful trends in language patterns across a set of texts.
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14
ISSN:	2397-2068 2397-2068
DOI:	10.46430/phen0113