A Grammatical Error Correction Model for English Essay Words in Colleges Using Natural Language Processing
| Published in | Mobile information systems, Vol. 2022, pp. 1-9 |
|---|---|
| Main Author | Long, Juan |
| Format | Journal Article |
| Language | English |
| Published | Amsterdam: Hindawi; John Wiley & Sons, Inc, 13.07.2022 |
| ISSN | 1574-017X, 1875-905X |
| DOI | 10.1155/2022/1881369 |
| Abstract | Natural language processing (NLP) is a body of theory and methods for achieving effective human-computer communication. With the rapid growth of computer science and technology, statistical learning methods have become an important research area in artificial intelligence and semantic search. Errors in semantic units (words and sentences) affect subsequent text analysis and semantic understanding, and eventually degrade the performance of the whole application system. As a result, intelligent detection and correction of word and grammatical errors in English text is a significant and difficult problem in natural language processing. This paper therefore examines word spelling and grammatical errors in undergraduate English essays and weighs the mathematical-statistical models and technical solutions involved in intelligent error correction. The research findings are presented in two parts. (1) For nonword errors, four types are studied: insertion, loss, replacement, and exchange between letters. The work focuses on nonword errors and on varied word forms (such as English abbreviations, hyphenated compound terms, and proper nouns) that arise from word pronunciation difficulties. Using the nonword check information, the paper proposes an optimum combination prediction method based on the suggested candidate list for real-word errors, and trains a real-word correction model. This approach achieves 83.78% accuracy on real words with spelling errors in context. (2) Sentence grammar is checked and corrected using context information from the text training set, together with grammatical rules and statistical models. The errors investigated are singular-plural inconsistency, word confusion, subject-predicate inconsistency, and modal (auxiliary) verb errors. The processing includes sentence boundary disambiguation, part-of-speech tagging, named entity identification, and context information extraction. The software presented in this article for checking and fixing sentence-level grammatical mistakes works on English texts at difficulty levels 4 and 6. The work obtains a clause correctness rate of 99.70%, and the system's average correction accuracy for level-4 and level-6 essays is more than 80%. |
|---|---|
| Author | Long, Juan |
| Author_xml | – sequence: 1 givenname: Juan surname: Long fullname: Long, Juan orcidid: 0000-0002-0287-0830 organization: School of Humanities, Hunan City University, Yiyang, Hunan, China (hncu.net) |
| CitedBy_id | crossref_primary_10_59652_jetm_v3i1_404 |
| ContentType | Journal Article |
| Copyright | Copyright © 2022 Juan Long. This is an open access article distributed under the Creative Commons Attribution License (the “License”), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. https://creativecommons.org/licenses/by/4.0 |
| Discipline | Computer Science |
| EISSN | 1875-905X |
| Editor | Yahya, Abid |
| EndPage | 9 |
| GrantInformation_xml | – fundername: Hunan Social Science Achievement Evaluation Committee grantid: XSP22YBC531 |
| ISSN | 1574-017X 1875-905X |
| IsDoiOpenAccess | true |
| IsOpenAccess | true |
| IsPeerReviewed | true |
| IsScholarly | true |
| Language | English |
| License | This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. https://creativecommons.org/licenses/by/4.0 cc-by |
| ORCID | 0000-0002-0287-0830 |
| OpenAccessLink | https://dx.doi.org/10.1155/2022/1881369 |
| PageCount | 9 |
| PublicationDate | 2022-07-13 |
| PublicationPlace | Amsterdam |
| PublicationTitle | Mobile information systems |
| PublicationYear | 2022 |
| Publisher | Hindawi John Wiley & Sons, Inc |
| StartPage | 1 |
| SubjectTerms | Abbreviations; Accuracy; Algorithms; Artificial intelligence; Computers; Context; English language; Error correction; Error correction & detection; Error detection; Essays; Grammar; Information retrieval; Language; Mathematical models; Microbalances; Natural language processing; Search engines; Semantics; Sentences; Software; Speech; Spelling; Statistical methods; Statistical models; Words (language); Writing |
| Title | A Grammatical Error Correction Model for English Essay Words in Colleges Using Natural Language Processing |
| URI | https://dx.doi.org/10.1155/2022/1881369 https://www.proquest.com/docview/2693570921 https://downloads.hindawi.com/journals/misy/2022/1881369.pdf |
| UnpaywallVersion | publishedVersion |
| Volume | 2022 |
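
The four nonword error types named in the abstract (insertion, loss, replacement, and exchange between letters) match the single-character edits commonly used to generate spelling-correction candidates. The sketch below is only an illustration of that general idea and is not the model described in the article: the function names `single_edit_candidates` and `suggest`, the toy vocabulary, and the restriction to lowercase ASCII letters are all assumptions made for the example.

```python
import string

# Assumption: only lowercase ASCII letters are considered.
ALPHABET = string.ascii_lowercase


def single_edit_candidates(word):
    """Return every string that is exactly one single-character edit from `word`.

    Each edit reverses one of the four nonword error types.
    """
    splits = [(word[:i], word[i:]) for i in range(len(word) + 1)]
    # Reverse an insertion error: drop one letter.
    deletions = {left + right[1:] for left, right in splits if right}
    # Reverse a loss error: put one letter back.
    insertions = {left + c + right for left, right in splits for c in ALPHABET}
    # Reverse a replacement error: change one letter.
    replacements = {
        left + c + right[1:] for left, right in splits if right for c in ALPHABET
    }
    # Reverse an exchange error: swap two adjacent letters.
    exchanges = {
        left + right[1] + right[0] + right[2:]
        for left, right in splits
        if len(right) > 1
    }
    return deletions | insertions | replacements | exchanges


def suggest(word, vocabulary):
    """Return known words one edit away from `word`, or `[word]` if it is known."""
    word = word.lower()
    if word in vocabulary:
        return [word]
    return sorted(single_edit_candidates(word) & vocabulary)


# Hypothetical toy vocabulary, for illustration only.
VOCABULARY = {"essay", "grammar", "receive", "their", "there"}

if __name__ == "__main__":
    for typo in ["esay", "grammer", "recieve", "theirr"]:
        print(typo, "->", suggest(typo, VOCABULARY))
```

A candidate list of this kind only addresses nonword errors. Real-word errors (a correctly spelled but wrong word, such as "their" for "there") require context, which is why the abstract describes a separate context-based model for them.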