Comparative Analysis of Reddit Posts and ChatGPT-Generated Texts’ Linguistic Features: A Short Report on Artificial Intelligence’s Imitative Capabilities

Bibliographic Details
Published in International Journal of Multidisciplinary: Applied Business and Education Research Vol. 5; no. 9; pp. 3475-3481
Main Authors Arcenal, Erika Kristine E., Capistrano, Licca Pauleen V., De Guzman, Marielle Jessie D., Forrosuelo, Micaela Isabel M., Miranda, Janeson M.
Format Journal Article
Language English
Published 23.09.2024
ISSN 2774-5368
DOI 10.11594/ijmaber.05.09.06

Abstract In recent years, the rapid rise of artificial intelligence (AI), particularly generative AI, has drastically altered many fields of human activity, raising questions about how generative AI can imitate human language. Because generative AI is a new and controversial phenomenon, there is an urgent need to examine closely how its linguistic outputs mimic human language produced in natural contexts. In this short report, we therefore discuss the similarities and differences observed between the linguistic features of spouse-appreciation posts on the subreddit r/Marriage and ChatGPT-4 outputs. These results are an offshoot of our genre analysis of the two data sets. Our analysis revealed that ChatGPT-4-generated texts contain impeccable grammar, while the Reddit appreciatory posts contain grammatical errors, such as faulty subject-verb agreement, improper punctuation, and erroneous capitalization; that ChatGPT-4-generated texts have more complex syntactic structures; that the Reddit data set uses more internet jargon, slang, and profanity and appears unpredictable and arbitrary in text length; and that ChatGPT-4 outputs appear to overuse emojis while underusing emoticons, and tend to use these digital linguistic elements without regard to context. In light of these results, we claim that although AI-generated texts can mimic human language, they do so only at a surface level, and closer inspection can uncover distinct variations. We recommend that future studies use more comprehensive and more varied data sets and continue to employ comparative and contrastive linguistic analysis to further investigate AI’s imitative capabilities.
License https://creativecommons.org/licenses/by/4.0