Comparative Analysis of Reddit Posts and ChatGPT-Generated Texts’ Linguistic Features: A Short Report on Artificial Intelligence’s Imitative Capabilities

In recent years, the unprecedented explosion of artificial intelligence (AI), particularly generative AI, has dramatically and drastically altered many human fields, posing queries about how generative AI can imitate human language. Given the newness of generative AI as a controversial phenomenon, t...

Full description

Saved in:

Bibliographic Details
Published in	International Journal of Multidisciplinary: Applied Business and Education Research Vol. 5; no. 9; pp. 3475 - 3481
Main Authors	Arcenal, Erika Kristine E., Capistrano, Licca Pauleen V., De Guzman, Marielle Jessie D., Forrosuelo, Micaela Isabel M., Miranda, Janeson M.
Format	Journal Article
Language	English
Published	23.09.2024
Online Access	Get full text
ISSN	2774-5368 2774-5368
DOI	10.11594/ijmaber.05.09.06

Cover

Abstract	In recent years, the unprecedented explosion of artificial intelligence (AI), particularly generative AI, has dramatically and drastically altered many human fields, posing queries about how generative AI can imitate human language. Given the newness of generative AI as a controversial phenomenon, there is an urgency to closely examine how its linguistic outputs could mimic human language produced in natural contexts. Hence, in this short report, we discuss the observed similarities and differences in the linguistic features of the subreddit r/Marriage spouse appreciatory posts and ChatGPT-4 outputs. These results were the offshoot of our genre analysis on these two linguistic data sets. Our analysis revealed that ChatGPT-4 generated texts contain impeccable grammar, while the Reddit appreciatory posts have grammatical discrepancies, such as errors in subject-verb agreement, improper punctuation marks, and erroneous capitalization; ChatGPT-4 generated texts have more complex syntactical structure; Reddit dataset utilized more internet jargon, slang, and profanities and seems to be unpredictable and arbitrary in terms of textual length; and ChatGPT-4 outputs appear to overuse emojis while underuse emoticons and tend to use these digital linguistic elements without regard to their proper contexts. In light of these results, we claim that AI-generated texts, although they can mimic human language, this is on a mere surface level, and a closer inspection could uncover distinct variations. We recommend that future studies use more comprehensive and different datasets and continuously employ comparative and contrastive linguistic analysis to further investigate AI’s imitative capabilities.
AbstractList	In recent years, the unprecedented explosion of artificial intelligence (AI), particularly generative AI, has dramatically and drastically altered many human fields, posing queries about how generative AI can imitate human language. Given the newness of generative AI as a controversial phenomenon, there is an urgency to closely examine how its linguistic outputs could mimic human language produced in natural contexts. Hence, in this short report, we discuss the observed similarities and differences in the linguistic features of the subreddit r/Marriage spouse appreciatory posts and ChatGPT-4 outputs. These results were the offshoot of our genre analysis on these two linguistic data sets. Our analysis revealed that ChatGPT-4 generated texts contain impeccable grammar, while the Reddit appreciatory posts have grammatical discrepancies, such as errors in subject-verb agreement, improper punctuation marks, and erroneous capitalization; ChatGPT-4 generated texts have more complex syntactical structure; Reddit dataset utilized more internet jargon, slang, and profanities and seems to be unpredictable and arbitrary in terms of textual length; and ChatGPT-4 outputs appear to overuse emojis while underuse emoticons and tend to use these digital linguistic elements without regard to their proper contexts. In light of these results, we claim that AI-generated texts, although they can mimic human language, this is on a mere surface level, and a closer inspection could uncover distinct variations. We recommend that future studies use more comprehensive and different datasets and continuously employ comparative and contrastive linguistic analysis to further investigate AI’s imitative capabilities.
Author	Capistrano, Licca Pauleen V. Miranda, Janeson M. Arcenal, Erika Kristine E. Forrosuelo, Micaela Isabel M. De Guzman, Marielle Jessie D.
Author_xml	– sequence: 1 givenname: Erika Kristine E. surname: Arcenal fullname: Arcenal, Erika Kristine E. – sequence: 2 givenname: Licca Pauleen V. surname: Capistrano fullname: Capistrano, Licca Pauleen V. – sequence: 3 givenname: Marielle Jessie D. surname: De Guzman fullname: De Guzman, Marielle Jessie D. – sequence: 4 givenname: Micaela Isabel M. surname: Forrosuelo fullname: Forrosuelo, Micaela Isabel M. – sequence: 5 givenname: Janeson M. surname: Miranda fullname: Miranda, Janeson M.
BookMark	eNpNkE1OwzAUhC1UJErpAdj5Aim2E-eHXRTRUqkSFWQfOclL-1DiRLaL6I5rsOBynIRAu2A1M9Lo02iuyUT3Ggi55WzBuUyCO3ztVAlmweSCJQsWXpCpiKLAk34YT_75KzK3FksmuS8iGUZT8pX13aCMcvgGNNWqPVq0tG_oM9Q1OrrtrbNU6Zpme-VW29xbgYaxDzXN4d3Z749PukG9O6B1WNElKHcwYO9pSl_2vXEjaPiVXtPUOGywQtXStXbQtrgDXcFIsHTdoTuNyNSgSmzRIdgbctmo1sL8rDOSLx_y7NHbPK3WWbrxqliGXtTEpRQySuKYxzxhwmfSb0TNykaIMQSVAB77DfAwqEseAGdKJmGkBISSRZU_I_yErUxvrYGmGAx2yhwLzoq_h4vzwwWTBUsKFvo_l2d2Hw
ContentType	Journal Article
DBID	AAYXX CITATION
DOI	10.11594/ijmaber.05.09.06
DatabaseName	CrossRef
DatabaseTitle	CrossRef
DatabaseTitleList	CrossRef
DeliveryMethod	fulltext_linktorsrc
EISSN	2774-5368
EndPage	3481
ExternalDocumentID	10_11594_ijmaber_05_09_06
GroupedDBID	AAYXX ABDBF ALMA_UNASSIGNED_HOLDINGS CITATION
ID	FETCH-LOGICAL-c856-7f8b52579881819023053f2d0bf222304c2e183fe164db14e10a5967a2e6507c3
ISSN	2774-5368
IngestDate	Tue Jul 01 03:21:46 EDT 2025
IsDoiOpenAccess	false
IsOpenAccess	true
IsPeerReviewed	false
IsScholarly	true
Issue	9
Language	English
License	https://creativecommons.org/licenses/by/4.0
LinkModel	OpenURL
MergedId	FETCHMERGED-LOGICAL-c856-7f8b52579881819023053f2d0bf222304c2e183fe164db14e10a5967a2e6507c3
OpenAccessLink	https://doi.org/10.11594/ijmaber.05.09.06
PageCount	7
ParticipantIDs	crossref_primary_10_11594_ijmaber_05_09_06
ProviderPackageCode	CITATION AAYXX
PublicationCentury	2000
PublicationDate	2024-09-23
PublicationDateYYYYMMDD	2024-09-23
PublicationDate_xml	– month: 09 year: 2024 text: 2024-09-23 day: 23
PublicationDecade	2020
PublicationTitle	International Journal of Multidisciplinary: Applied Business and Education Research
PublicationYear	2024
SSID	ssib051327567
Score	2.2726483
Snippet	In recent years, the unprecedented explosion of artificial intelligence (AI), particularly generative AI, has dramatically and drastically altered many human...
SourceID	crossref
SourceType	Index Database
StartPage	3475
Title	Comparative Analysis of Reddit Posts and ChatGPT-Generated Texts’ Linguistic Features: A Short Report on Artificial Intelligence’s Imitative Capabilities
Volume	5
hasFullText	1
inHoldings	1
isFullTextHit
isPrint
journalDatabaseRights	– providerCode: PRVEBS databaseName: Academic Search Ultimate - eBooks customDbUrl: https://search.ebscohost.com/login.aspx?authtype=ip,shib&custid=s3936755&profile=ehost&defaultdb=asn eissn: 2774-5368 dateEnd: 99991231 omitProxy: true ssIdentifier: ssib051327567 issn: 2774-5368 databaseCode: ABDBF dateStart: 20220301 isFulltext: true titleUrlDefault: https://search.ebscohost.com/direct.asp?db=asn providerName: EBSCOhost
link	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV3LbtQwFLXKdMMGgQDxKvKCFVGGJH4k7m4609JWKkIioO4iJ3Foy3SmmsemK36DBT_Hl3D9SOKWQaJsolEUWc7cE99j-_hchN5IQpiUpA5FxssQ8m0SSpFVertQNLzkXNZGIPuBH36mx6fsdOte7qmW1qtyWF1vPFfyP1GFexBXfUr2DpHtGoUb8BviC1eIMFz_KcZjz7r7prtIXZ-vTCFea8E8PpOr9x_z0JpMa5KZw6C8bJUOQk_Nv66NZ3OgSeF6YaVyo-DTGfBzR9P1xsJoYcRF1qKjd_PsWloGR5fG9hu6NIZEbLS3rVDxopfN31yFdJTYnAb2zwmbPjie3En09Qt1upROOtgjt1IzU8VAj_HfpBvGgEvvD_sNlyvjF2zKjsO7A76MRFIL1r50T030B3TtVohP9JrCdKqCY60bVsGke0zXN5kDeO0e1gmAXk1lcLSUpRYND_1llYRqDYg9-WxH3wR4cciIrfkzVBvuufTBvK9EeKmAUFsSxtEKfeB5c8piggLQzi8uoV8LYyMrhtEGe_BbabsTU5ppHDRSuCaKiBWRKLQP_XaScp4M0PZob7J30I6zLCba9F87CXRv5Db8dTvvbnfFo2we98ofogcOIXhkv4BHaEvNHqOfHvpxi348b7BFPzboxwAW_Af6sUH_r-8_cI973OJ-F4-wQT22qMfzGe5Rj33UQwtL3OEd-3h_gvKD_Xx8GLpiI2GVMR6mTVZqY2CRAYMFkgwzc0aapI7KRjPoiFaJguzXqJjTuoypiiPJBE9lomCOk1bkKRrM5jP1DGGgxCStVEwFqWgd86ymQlKuqpTEimXyOXrb_p3FlbWUKf4awxd3efglut9D-RUarBZrtQOceVW-dhD4DYkEyag
linkProvider	EBSCOhost
openUrl	ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Comparative+Analysis+of+Reddit+Posts+and+ChatGPT-Generated+Texts%E2%80%99+Linguistic+Features%3A+A+Short+Report+on+Artificial+Intelligence%E2%80%99s+Imitative+Capabilities&rft.jtitle=International+Journal+of+Multidisciplinary%3A+Applied+Business+and+Education+Research&rft.au=Arcenal%2C+Erika+Kristine+E.&rft.au=Capistrano%2C+Licca+Pauleen+V.&rft.au=De+Guzman%2C+Marielle+Jessie+D.&rft.au=Forrosuelo%2C+Micaela+Isabel+M.&rft.date=2024-09-23&rft.issn=2774-5368&rft.eissn=2774-5368&rft.volume=5&rft.issue=9&rft.spage=3475&rft.epage=3481&rft_id=info:doi/10.11594%2Fijmaber.05.09.06&rft.externalDBID=n%2Fa&rft.externalDocID=10_11594_ijmaber_05_09_06
thumbnail_l	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=2774-5368&client=summon
thumbnail_m	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=2774-5368&client=summon
thumbnail_s	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=2774-5368&client=summon