A Grammatical Error Correction Model for English Essay Words in Colleges Using Natural Language Processing

Natural language processing technology is a theory and approach for exploring and developing successful human-computer communication. With the rapid growth of computer science and technology, statistical learning methods have become an important research area in artificial intelligence and semantic...

Full description

Saved in:
Bibliographic Details
Published inMobile information systems Vol. 2022; pp. 1 - 9
Main Author Long, Juan
Format Journal Article
LanguageEnglish
Published Amsterdam Hindawi 13.07.2022
John Wiley & Sons, Inc
Subjects
Online AccessGet full text
ISSN1574-017X
1875-905X
1875-905X
DOI10.1155/2022/1881369

Cover

Abstract Natural language processing technology is a theory and approach for exploring and developing successful human-computer communication. With the rapid growth of computer science and technology, statistical learning methods have become an important research area in artificial intelligence and semantic search. If there are errors in the semantic units (words and sentences), it will affect future text analysis and semantic understanding, eventually affecting the whole application system performance. As a result, intelligent word and grammatical error detection and correction in English text are a significant and difficult aspect of natural language processing. Therefore, this paper examines the phenomena of word spelling and grammatical errors in undergraduate English essays and balances the mathematical-statistical models and technology solutions involved in intelligent error correction. The research findings of this study are represented in two aspects. (1) In nonword mistakes, four sorts of errors are studied: insertion, loss, replacement, and exchange between letters. It focuses on nonword mistakes and varied word forms (such as English abbreviations, hyphenated compound terms, and proper nouns) produced by word pronunciation difficulties. This paper utilizes the nonword check information to recommend an optimum combination prediction method based on the suggested candidate list for actual word errors, and the genuine word repair model is trained. This approach is 83.78% accurate when used with actual words with spelling errors in the context. (2) It verifies and corrects sentence grammar using context information from the text training set, as well as grammatical rules and statistical models. In addition, it has investigated singular and plural inconsistency, word confusion, subject, and predicate inconsistency, and modal (auxiliary) verb errors. It includes sentence boundary disambiguation, word part-of-speech tagging, named entity identification, and context information extraction. The software for checking and fixing sentence grammatical mistakes presented in this article works on English texts with difficulty levels 4 and 6. Furthermore, this work obtains a clause correctness rate of 99.70%, and the system’s average corrective accuracy rate for four-level and six-level essays is more than 80%.
AbstractList Natural language processing technology is a theory and approach for exploring and developing successful human-computer communication. With the rapid growth of computer science and technology, statistical learning methods have become an important research area in artificial intelligence and semantic search. If there are errors in the semantic units (words and sentences), it will affect future text analysis and semantic understanding, eventually affecting the whole application system performance. As a result, intelligent word and grammatical error detection and correction in English text are a significant and difficult aspect of natural language processing. Therefore, this paper examines the phenomena of word spelling and grammatical errors in undergraduate English essays and balances the mathematical-statistical models and technology solutions involved in intelligent error correction. The research findings of this study are represented in two aspects. (1) In nonword mistakes, four sorts of errors are studied: insertion, loss, replacement, and exchange between letters. It focuses on nonword mistakes and varied word forms (such as English abbreviations, hyphenated compound terms, and proper nouns) produced by word pronunciation difficulties. This paper utilizes the nonword check information to recommend an optimum combination prediction method based on the suggested candidate list for actual word errors, and the genuine word repair model is trained. This approach is 83.78% accurate when used with actual words with spelling errors in the context. (2) It verifies and corrects sentence grammar using context information from the text training set, as well as grammatical rules and statistical models. In addition, it has investigated singular and plural inconsistency, word confusion, subject, and predicate inconsistency, and modal (auxiliary) verb errors. It includes sentence boundary disambiguation, word part-of-speech tagging, named entity identification, and context information extraction. The software for checking and fixing sentence grammatical mistakes presented in this article works on English texts with difficulty levels 4 and 6. Furthermore, this work obtains a clause correctness rate of 99.70%, and the system’s average corrective accuracy rate for four-level and six-level essays is more than 80%.
Author Long, Juan
Author_xml – sequence: 1
  givenname: Juan
  orcidid: 0000-0002-0287-0830
  surname: Long
  fullname: Long, Juan
  organization: School of HumanitiesHunan City UniversityYiyangHunanChinahncu.net
BookMark eNqFkF1LwzAUhoMouE3v_AEBL7UuadavyzHmFObHhcPdldM07TKyZCYtY__elO5KUK9OOHny5uUZonNttEDohpIHSqNoHJIwHNM0pSzOztCApkkUZCRan_tzlEwCQpP1JRo6tyUkJixKBmg7xQsLux00koPCc2uNxTNjreCNNBq_mFIoXPnlXNdKug2eOwdH_Gls6bDUnlVK1MLhlZO6xq_QtNYHLUHXLdQCv1vDhevurtBFBcqJ69McodXj_GP2FCzfFs-z6TLgjCVNwCtGizIBQlNKIYyzuIiiDICXHGhZTmKSMJHERTWhJasKIjxNiWBECO8ACjZCQZ_b6j0cD6BUvrdyB_aYU5J3ovJOVH4S5fnbnt9b89UK1-Rb01rtK-b-d2-JZCH1VNhT3BrnrKhyLhvoHDUWpPot-v7Ho3-a3PX4RuoSDvJv-htmpZUH
CitedBy_id crossref_primary_10_59652_jetm_v3i1_404
Cites_doi 10.1023/a:1011424425034
10.1088/1742-6596/1235/1/012059
10.1136/jamia.1994.95236146
10.1145/354324.354348
10.1145/363958.363994
10.1155/2022/9246966
10.48550/arXiv.cmp-lg/9607024
10.1145/129875.129882
10.32674/jis.v6i4.321
10.1007/s00521-017-2884-0
10.1016/s0167-9236(03)00096-4
10.3115/1599081.1599103
10.1155/2021/7058723
10.1155/2022/2709255
ContentType Journal Article
Copyright Copyright © 2022 Juan Long.
Copyright © 2022 Juan Long. This is an open access article distributed under the Creative Commons Attribution License (the “License”), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. https://creativecommons.org/licenses/by/4.0
Copyright_xml – notice: Copyright © 2022 Juan Long.
– notice: Copyright © 2022 Juan Long. This is an open access article distributed under the Creative Commons Attribution License (the “License”), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. https://creativecommons.org/licenses/by/4.0
DBID RHU
RHW
RHX
AAYXX
CITATION
7SC
7SP
8FD
JQ2
L7M
L~C
L~D
ADTOC
UNPAY
DOI 10.1155/2022/1881369
DatabaseName Hindawi Publishing Complete
Hindawi Publishing Subscription Journals
Hindawi Publishing Open Access
CrossRef
Computer and Information Systems Abstracts
Electronics & Communications Abstracts
Technology Research Database
ProQuest Computer Science Collection
Advanced Technologies Database with Aerospace
Computer and Information Systems Abstracts – Academic
Computer and Information Systems Abstracts Professional
Unpaywall for CDI: Periodical Content
Unpaywall
DatabaseTitle CrossRef
Technology Research Database
Computer and Information Systems Abstracts – Academic
Electronics & Communications Abstracts
ProQuest Computer Science Collection
Computer and Information Systems Abstracts
Advanced Technologies Database with Aerospace
Computer and Information Systems Abstracts Professional
DatabaseTitleList Technology Research Database

CrossRef
Database_xml – sequence: 1
  dbid: RHX
  name: Hindawi Publishing Open Access
  url: http://www.hindawi.com/journals/
  sourceTypes: Publisher
– sequence: 2
  dbid: UNPAY
  name: Unpaywall
  url: https://proxy.k.utb.cz/login?url=https://unpaywall.org/
  sourceTypes: Open Access Repository
DeliveryMethod fulltext_linktorsrc
Discipline Computer Science
EISSN 1875-905X
Editor Yahya, Abid
Editor_xml – sequence: 1
  givenname: Abid
  surname: Yahya
  fullname: Yahya, Abid
EndPage 9
ExternalDocumentID 10.1155/2022/1881369
10_1155_2022_1881369
GrantInformation_xml – fundername: Hunan Social Science Achievement Evaluation Committee
  grantid: XSP22YBC531
GroupedDBID -CS
-CY
.4S
.DC
0R~
4.4
5VS
AAFWJ
AAJEY
ABHFT
ABJNI
ACGFO
ACGFS
ADBBV
AEGXH
AENEX
AIAGR
ALMA_UNASSIGNED_HOLDINGS
ARCSS
ASPBG
AVWKF
BCNDV
EBS
EDO
GROUPED_DOAJ
HZ~
I-F
IAO
IHR
IOS
KQ8
KZ1
LMP
MIO
MV1
NGNOM
O9-
OK1
P2P
RHU
RHW
RHX
TUS
24P
AAMMB
AAYXX
ACCMX
AEFGJ
AGXDD
AIDQK
AIDYY
CITATION
H13
7SC
7SP
8FD
JQ2
L7M
L~C
L~D
ABUBZ
ACPQW
ADTOC
AFRHK
AGIAB
CAG
COF
EJD
FEDTE
IL9
IPNFZ
MET
RIG
UNPAY
ID FETCH-LOGICAL-c337t-cf31bd7a01811a2696b559aacdca1dd46073e76bf41d3fb0e1bd10e30ee155ab3
IEDL.DBID RHX
ISSN 1574-017X
1875-905X
IngestDate Sun Oct 26 03:46:59 EDT 2025
Fri Jul 25 09:32:35 EDT 2025
Thu Apr 24 23:02:55 EDT 2025
Wed Oct 01 01:58:58 EDT 2025
Sun Jun 02 19:22:33 EDT 2024
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Language English
License This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
https://creativecommons.org/licenses/by/4.0
cc-by
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c337t-cf31bd7a01811a2696b559aacdca1dd46073e76bf41d3fb0e1bd10e30ee155ab3
Notes ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ORCID 0000-0002-0287-0830
OpenAccessLink https://dx.doi.org/10.1155/2022/1881369
PQID 2693570921
PQPubID 2048814
PageCount 9
ParticipantIDs unpaywall_primary_10_1155_2022_1881369
proquest_journals_2693570921
crossref_citationtrail_10_1155_2022_1881369
crossref_primary_10_1155_2022_1881369
hindawi_primary_10_1155_2022_1881369
ProviderPackageCode CITATION
AAYXX
PublicationCentury 2000
PublicationDate 2022-07-13
PublicationDateYYYYMMDD 2022-07-13
PublicationDate_xml – month: 07
  year: 2022
  text: 2022-07-13
  day: 13
PublicationDecade 2020
PublicationPlace Amsterdam
PublicationPlace_xml – name: Amsterdam
PublicationTitle Mobile information systems
PublicationYear 2022
Publisher Hindawi
John Wiley & Sons, Inc
Publisher_xml – name: Hindawi
– name: John Wiley & Sons, Inc
References P. Ratanaworabhan (20)
22
13
14
15
17
S. Verberne (9) 2002
19
H. G. Kim (21) 2012; 28
H. L. Liang (12) 2008
1
2
P. Ye (18)
3
4
Y. Guo (23) 2006; 40
Y. S. Zhang (11) 2006; 6
J. Lee (16)
6
D. Jurafsky (5) 2010
7
8
E. S. Atwell (24) 1987; 12
10
References_xml – ident: 4
  doi: 10.1023/a:1011424425034
– ident: 3
  doi: 10.1088/1742-6596/1235/1/012059
– ident: 6
  doi: 10.1136/jamia.1994.95236146
– start-page: 9
  volume-title: Spell Checkers and Correctors: A Unified Treatment
  year: 2008
  ident: 12
– start-page: 1978
  ident: 16
  article-title: Automatic Grammar Correction for Second-Language Learners
– volume: 6
  start-page: 8
  issue: 5
  year: 2006
  ident: 11
  article-title: Summary of text automatic proofreading technology
  publication-title: Application Research of Computers
– ident: 15
  doi: 10.1145/354324.354348
– volume: 12
  start-page: 120
  year: 1987
  ident: 24
  article-title: Dealing with ill-formed English text the computational analysis of English
  publication-title: A Corpus-Based Approach
– ident: 13
  doi: 10.1145/363958.363994
– ident: 7
  doi: 10.1155/2022/9246966
– start-page: 241
  ident: 18
  article-title: MELB-YB: preposition sense disambiguation using rich semantic features
– volume-title: Context-sensitive Spell Checking Based on Word Trigram Probabilities
  year: 2002
  ident: 9
– ident: 14
  doi: 10.48550/arXiv.cmp-lg/9607024
– volume-title: Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition
  year: 2010
  ident: 5
– ident: 10
  doi: 10.1145/129875.129882
– ident: 22
  doi: 10.32674/jis.v6i4.321
– ident: 1
  doi: 10.1007/s00521-017-2884-0
– ident: 8
  doi: 10.1016/s0167-9236(03)00096-4
– volume: 28
  start-page: 911
  issue: 5
  year: 2012
  ident: 21
  article-title: Efficient detection of malicious web pages using high-interaction client honeypots
  publication-title: Journal of Information Science and Engineering
– ident: 17
  doi: 10.3115/1599081.1599103
– start-page: 169
  ident: 20
  article-title: Nozzle: a defense against heap-spraying code injection attacks
– ident: 2
  doi: 10.1155/2021/7058723
– ident: 19
  doi: 10.1155/2022/2709255
– volume: 40
  start-page: 117
  year: 2006
  ident: 23
  article-title: The hegemony of English as a global language: reclaiming local knowledge and culture in China
  publication-title: Convergence
SSID ssj0060357
ssib050733852
Score 2.2416587
Snippet Natural language processing technology is a theory and approach for exploring and developing successful human-computer communication. With the rapid growth of...
SourceID unpaywall
proquest
crossref
hindawi
SourceType Open Access Repository
Aggregation Database
Enrichment Source
Index Database
Publisher
StartPage 1
SubjectTerms Abbreviations
Accuracy
Algorithms
Artificial intelligence
Computers
Context
English language
Error correction
Error correction & detection
Error detection
Essays
Grammar
Information retrieval
Language
Mathematical models
Microbalances
Natural language processing
Search engines
Semantics
Sentences
Software
Speech
Spelling
Statistical methods
Statistical models
Words (language)
Writing
SummonAdditionalLinks – databaseName: Unpaywall
  dbid: UNPAY
  link: http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV3dS8MwED90Ivritzi_yIP6Ip3NkvQDfBkyFdHhg8P5ICVpUvyY3Wg3ZP71Jl0qTvAD31p6pE3umvzuuPsdwJ6MiecmmDpa4cKhCXWdgFHp8DAgVAU0CYs45FXLO2_Tiw7rTMFxWQsjDUV8j8u89mB80tfHYre265of6bmPjL9eP8JBgIkX1voymYYZj2kkXoGZduu6cVdQpPomucLvmGsNyZ3QZZ0y752xiSEmTqRZ-9oJwDk3TPt89Mq73U9nz-ki3JdfPU45ea4NB6IWv30hdPzvtJZgwYJS1Bhb0TJMqXQFFsuGD8j-_6vw1EBnGX8pWF61fDPLehk6Mf09iuoIZBqrdZGGwchWB6NmnvMRutUebo4eU2TDFDkqMhVQixesH-jSBk2RLVvQz9agfdq8OTl3bLMGJybEHzhxQrCQPjcEYJjXvdAT2lnhPJYxx1JST-8lyvdEQrEkiXCVlsauIq5SWiFckHWopL1UbQCKfUo1sFCEuabsVQhZV6EIFNYjYZb4VTgsFRbFlsncNNToRoVHw1hkljKyS1mF_Q_p_pjB4xu5PaubX8S2S8OISv1FerqE-W5Yx1U4-DCWH8fZ_KvgFsybWxNPxmQbKoNsqHY0EBqIXWvv7wm4Ajs
  priority: 102
  providerName: Unpaywall
Title A Grammatical Error Correction Model for English Essay Words in Colleges Using Natural Language Processing
URI https://dx.doi.org/10.1155/2022/1881369
https://www.proquest.com/docview/2693570921
https://downloads.hindawi.com/journals/misy/2022/1881369.pdf
UnpaywallVersion publishedVersion
Volume 2022
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
journalDatabaseRights – providerCode: PRVAFT
  databaseName: Colorado Digital library
  customDbUrl:
  eissn: 1875-905X
  dateEnd: 20240530
  omitProxy: true
  ssIdentifier: ssj0060357
  issn: 1875-905X
  databaseCode: KQ8
  dateStart: 20050101
  isFulltext: true
  titleUrlDefault: http://grweb.coalliance.org/oadl/oadl.html
  providerName: Colorado Alliance of Research Libraries
– providerCode: PRVHPJ
  databaseName: ROAD: Directory of Open Access Scholarly Resources
  customDbUrl:
  eissn: 1875-905X
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssib050733852
  issn: 1574-017X
  databaseCode: M~E
  dateStart: 20050101
  isFulltext: true
  titleUrlDefault: https://road.issn.org
  providerName: ISSN International Centre
– providerCode: PRVWIB
  databaseName: Wiley Online Library Open Access
  customDbUrl:
  eissn: 1875-905X
  dateEnd: 99991231
  omitProxy: true
  ssIdentifier: ssj0060357
  issn: 1875-905X
  databaseCode: 24P
  dateStart: 20050101
  isFulltext: true
  titleUrlDefault: https://authorservices.wiley.com/open-science/open-access/browse-journals.html
  providerName: Wiley-Blackwell
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1LS8NAEB6sInrxLVZr2UP1IsFsd7NJjkWqxUdRsVhPYTfZYKWmJWkR_7272434wMcxZJiQmdnMIzPfADSSmDA3xdRRChcOTanrBB5NHB4GhMqApqGpQ151WadHz_te34IkFd9_4Stvp9Pz5jEOAkxYWIFKwHTn1m2nX5qNp_cOmhHg2QeYucQAfGLP1w0Wfr_sd__C65MnWnzUKfDL4FOguTTNxvz1hQ-HH3zO6Rqs2GARtWbaXYc5mW3AarmIAdlzuQlPLXSW82eDvqro23k-ytGJ3rthphaQXng2RCo8RXZqF7WLgr-ie5V5FmiQIVs-KJDpIEBdbtA40KUtZiI7TqDubUHvtH130nHsEgUnJsSfOHFKsEh8roG5MG-ykAmVRHAeJzHHSUKZkp70mUgpTkgqXKmosSuJK6USGBdkG-azUSZ3AMU-pcrhS-K5ehxViKQpQxFIrDhhL_WrcFQKNIotwrhedDGMTKbheZEWf2TFX4WDd-rxDFnjB7qG1c0fZLVScZE9hkWkXlfZghs2cRUO35X5K5_d_z1uD5b1pa7yYlKD-Uk-lfsqPJmIOlQuboK6MdE6LPS6162HN8vo2yc
linkProvider Hindawi Publishing
linkToUnpaywall http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV3dS8MwED90Ivritzi_yIP6Ip3NkvQDfBkyFdHhg8P5ICVpUvyY3Wg3ZP71Jl0qTvAD31p6pE3umvzuuPsdwJ6MiecmmDpa4cKhCXWdgFHp8DAgVAU0CYs45FXLO2_Tiw7rTMFxWQsjDUV8j8u89mB80tfHYre265of6bmPjL9eP8JBgIkX1voymYYZj2kkXoGZduu6cVdQpPomucLvmGsNyZ3QZZ0y752xiSEmTqRZ-9oJwDk3TPt89Mq73U9nz-ki3JdfPU45ea4NB6IWv30hdPzvtJZgwYJS1Bhb0TJMqXQFFsuGD8j-_6vw1EBnGX8pWF61fDPLehk6Mf09iuoIZBqrdZGGwchWB6NmnvMRutUebo4eU2TDFDkqMhVQixesH-jSBk2RLVvQz9agfdq8OTl3bLMGJybEHzhxQrCQPjcEYJjXvdAT2lnhPJYxx1JST-8lyvdEQrEkiXCVlsauIq5SWiFckHWopL1UbQCKfUo1sFCEuabsVQhZV6EIFNYjYZb4VTgsFRbFlsncNNToRoVHw1hkljKyS1mF_Q_p_pjB4xu5PaubX8S2S8OISv1FerqE-W5Yx1U4-DCWH8fZ_KvgFsybWxNPxsE2VAbZUO1oIDQQu9be3wEKJgJA
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=A+Grammatical+Error+Correction+Model+for+English+Essay+Words+in+Colleges+Using+Natural+Language+Processing&rft.jtitle=Mobile+information+systems&rft.au=Long%2C+Juan&rft.date=2022-07-13&rft.issn=1574-017X&rft.eissn=1875-905X&rft.volume=2022&rft.spage=1&rft.epage=9&rft_id=info:doi/10.1155%2F2022%2F1881369&rft.externalDBID=n%2Fa&rft.externalDocID=10_1155_2022_1881369
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1574-017X&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1574-017X&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1574-017X&client=summon