Nested Named Entity Recognition as Holistic Structure Parsing
As a fundamental natural language processing task and one of core knowledge extraction techniques, named entity recognition (NER) is widely used to extract information from texts for downstream tasks. Nested NER is a branch of NER in which the named entities (NEs) are nested with each other. However...
Saved in:
| Main Authors | , , |
|---|---|
| Format | Journal Article |
| Language | English |
| Published |
17.04.2022
|
| Subjects | |
| Online Access | Get full text |
| DOI | 10.48550/arxiv.2204.08006 |
Cover
| Abstract | As a fundamental natural language processing task and one of core knowledge
extraction techniques, named entity recognition (NER) is widely used to extract
information from texts for downstream tasks. Nested NER is a branch of NER in
which the named entities (NEs) are nested with each other. However, most of the
previous studies on nested NER usually apply linear structure to model the
nested NEs which are actually accommodated in a hierarchical structure. Thus in
order to address this mismatch, this work models the full nested NEs in a
sentence as a holistic structure, then we propose a holistic structure parsing
algorithm to disclose the entire NEs once for all. Besides, there is no
research on applying corpus-level information to NER currently. To make up for
the loss of this information, we introduce Point-wise Mutual Information (PMI)
and other frequency features from corpus-aware statistics for even better
performance by holistic modeling from sentence-level to corpus-level.
Experiments show that our model yields promising results on widely-used
benchmarks which approach or even achieve state-of-the-art. Further empirical
studies show that our proposed corpus-aware features can substantially improve
NER domain adaptation, which demonstrates the surprising advantage of our
proposed corpus-level holistic structure modeling. |
|---|---|
| AbstractList | As a fundamental natural language processing task and one of core knowledge
extraction techniques, named entity recognition (NER) is widely used to extract
information from texts for downstream tasks. Nested NER is a branch of NER in
which the named entities (NEs) are nested with each other. However, most of the
previous studies on nested NER usually apply linear structure to model the
nested NEs which are actually accommodated in a hierarchical structure. Thus in
order to address this mismatch, this work models the full nested NEs in a
sentence as a holistic structure, then we propose a holistic structure parsing
algorithm to disclose the entire NEs once for all. Besides, there is no
research on applying corpus-level information to NER currently. To make up for
the loss of this information, we introduce Point-wise Mutual Information (PMI)
and other frequency features from corpus-aware statistics for even better
performance by holistic modeling from sentence-level to corpus-level.
Experiments show that our model yields promising results on widely-used
benchmarks which approach or even achieve state-of-the-art. Further empirical
studies show that our proposed corpus-aware features can substantially improve
NER domain adaptation, which demonstrates the surprising advantage of our
proposed corpus-level holistic structure modeling. |
| Author | Li, Zuchao Zhao, Hai Yang, Yifei |
| Author_xml | – sequence: 1 givenname: Yifei surname: Yang fullname: Yang, Yifei – sequence: 2 givenname: Zuchao surname: Li fullname: Li, Zuchao – sequence: 3 givenname: Hai surname: Zhao fullname: Zhao, Hai |
| BackLink | https://doi.org/10.48550/arXiv.2204.08006$$DView paper in arXiv |
| BookMark | eNrjYmDJy89LZWCQNDTQM7EwNTXQTyyqyCzTMzIyMNEzsDAwMONksPVLLS5JTVHwS8wFkq55JZkllQpBqcn56XmZJZn5eQqJxQoe-TmZxSWZyQrBJUWlySWlRakKAYlFxZl56TwMrGmJOcWpvFCam0HezTXE2UMXbFF8QVFmbmJRZTzIwniwhcaEVQAAbEw3Tw |
| ContentType | Journal Article |
| Copyright | http://creativecommons.org/licenses/by/4.0 |
| Copyright_xml | – notice: http://creativecommons.org/licenses/by/4.0 |
| DBID | AKY GOX |
| DOI | 10.48550/arxiv.2204.08006 |
| DatabaseName | arXiv Computer Science arXiv.org |
| DatabaseTitleList | |
| Database_xml | – sequence: 1 dbid: GOX name: arXiv.org url: http://arxiv.org/find sourceTypes: Open Access Repository |
| DeliveryMethod | fulltext_linktorsrc |
| ExternalDocumentID | 2204_08006 |
| GroupedDBID | AKY GOX |
| ID | FETCH-arxiv_primary_2204_080063 |
| IEDL.DBID | GOX |
| IngestDate | Wed Jul 23 02:01:45 EDT 2025 |
| IsDoiOpenAccess | true |
| IsOpenAccess | true |
| IsPeerReviewed | false |
| IsScholarly | false |
| Language | English |
| LinkModel | DirectLink |
| MergedId | FETCHMERGED-arxiv_primary_2204_080063 |
| OpenAccessLink | https://arxiv.org/abs/2204.08006 |
| ParticipantIDs | arxiv_primary_2204_08006 |
| PublicationCentury | 2000 |
| PublicationDate | 2022-04-17 |
| PublicationDateYYYYMMDD | 2022-04-17 |
| PublicationDate_xml | – month: 04 year: 2022 text: 2022-04-17 day: 17 |
| PublicationDecade | 2020 |
| PublicationYear | 2022 |
| Score | 3.5895073 |
| SecondaryResourceType | preprint |
| Snippet | As a fundamental natural language processing task and one of core knowledge
extraction techniques, named entity recognition (NER) is widely used to extract... |
| SourceID | arxiv |
| SourceType | Open Access Repository |
| SubjectTerms | Computer Science - Computation and Language |
| Title | Nested Named Entity Recognition as Holistic Structure Parsing |
| URI | https://arxiv.org/abs/2204.08006 |
| hasFullText | 1 |
| inHoldings | 1 |
| isFullTextHit | |
| isPrint | |
| link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwY2BQSTM0TEm2AOY0E4vUVF0TQ4tE3USjVFPdVCMjc8NU0Hkn4HMLfP3MPEJNvCJMI5gYFGB7YRKLKjLLIOcDJxXrGxmBjiG1AJ-pzQxsKIA28_pHQCYnwUdxQdUj1AHbmGAhpErCTZCBH9q6U3CERIcQA1NqnggDaBIG2K5T8EsE1jwKrqB9sZUKQbCFO_l5ConFCh75OeATkxWCwee5lhalKgQkgvvxogzybq4hzh66YAvjCyCnQ8SD3BIPdouxGAMLsA-fKsGgAOxFGaQaJqUagw6GTkwEtuLTks0Mks1Sgdwkw6QUSQYJXKZI4ZaSZuAyAq3GBx09aC7DwAJ0YaossI4sSZIDBxQAWztqqg |
| linkProvider | Cornell University |
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Nested+Named+Entity+Recognition+as+Holistic+Structure+Parsing&rft.au=Yang%2C+Yifei&rft.au=Li%2C+Zuchao&rft.au=Zhao%2C+Hai&rft.date=2022-04-17&rft_id=info:doi/10.48550%2Farxiv.2204.08006&rft.externalDocID=2204_08006 |