Corpus Analysis with spaCy

This lesson demonstrates how to use the Python library spaCy for analysis of large collections of texts. This lesson details the process of using spaCy to enrich a corpus via lemmatization, part-of-speech tagging, dependency parsing, and named entity recognition. Readers will learn how the linguisti...

Full description

Saved in:
Bibliographic Details
Published inThe programming historian Vol. 12; no. 12
Main Author Kane, Megan S.
Format Journal Article
LanguageEnglish
Published ProgHist Ltd 02.11.2023
Editorial Board of the Programming Historian
Subjects
Online AccessGet full text
ISSN2397-2068
2397-2068
DOI10.46430/phen0113

Cover

More Information
Summary:This lesson demonstrates how to use the Python library spaCy for analysis of large collections of texts. This lesson details the process of using spaCy to enrich a corpus via lemmatization, part-of-speech tagging, dependency parsing, and named entity recognition. Readers will learn how the linguistic annotations produced by spaCy can be analyzed to help researchers explore meaningful trends in language patterns across a set of texts.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ISSN:2397-2068
2397-2068
DOI:10.46430/phen0113