On Analyzing COVID-19-related Hate Speech Using BERT Attention

Bibliographic Details
Published in	2020 19th IEEE International Conference on Machine Learning and Applications (ICMLA), pp. 669-676
Main Authors	Vishwamitra, Nishant; Hu, Ruijia Roger; Luo, Feng; Cheng, Long; Costello, Matthew; Yang, Yin
Format	Conference Proceeding
Language	English
Published	IEEE 01.12.2020
Subjects	Analytical models; BERT; Bit error rate; Blogs; COVID-19; explanation; hate-speech; online-hate; Predictive models; Social networking (online); Training data; Twitter
DOI	10.1109/ICMLA51294.2020.00111

Abstract	The emergence of COVID-19 has engendered a new wave of online hate speech on social media platforms such as Twitter. Its widespread effects range from acts of cyber-harassment towards certain ethnic communities (e.g., the Asian community) to the targeting of older people belonging to age groups correlated with higher mortality rates (infamously termed "Boomer Remover"). Thus, there is an urgent need for timely mitigation of this new wave of online hate speech. In this work, we aim to discover the hate-related keywords linked to COVID-19 in hateful tweets posted on Twitter so that users posting such keywords can be asked to reconsider posting them. We first collect a new dataset of tweets targeting older people and supplement it with a dataset targeting the Asian community. Then, we develop an approach that analyzes the datasets using the attention mechanism of BERT (a transformer-based model) and discover 186 novel keywords targeting the Asian community and 100 keywords targeting older people. Based on our study, we then propose a control mechanism wherein a user can be asked to reconsider using certain sensitive words identified by our approach. We further perform an exploratory analysis of the BERT attention mechanism and find that the highest-impact, long-distance attentions are learned in the earlier or later layers of the model, depending on the underlying data distribution. Our study indicates that the BERT model in some cases uses a hate keyword and an associated group or individual to make predictions, a finding that is in line with existing hate-speech research, which suggests that hate speech is often aimed at certain groups or individuals.
Author_xml	
– sequence: 1; givenname: Nishant; surname: Vishwamitra; fullname: Vishwamitra, Nishant; email: nvishwa@g.clemson.edu; organization: Clemson University, Clemson, USA
– sequence: 2; givenname: Ruijia Roger; surname: Hu; fullname: Hu, Ruijia Roger; email: roger.rj.hu@gmail.com; organization: Clemson University, Clemson, USA
– sequence: 3; givenname: Feng; surname: Luo; fullname: Luo, Feng; email: luofeng@clemson.edu; organization: Clemson University, Clemson, USA
– sequence: 4; givenname: Long; surname: Cheng; fullname: Cheng, Long; email: lcheng2@clemson.edu; organization: Clemson University, Clemson, USA
– sequence: 5; givenname: Matthew; surname: Costello; fullname: Costello, Matthew; email: mjcoste@clemson.edu; organization: Clemson University, Clemson, USA
– sequence: 6; givenname: Yin; surname: Yang; fullname: Yang, Yin; email: yin5@clemson.edu; organization: Clemson University, Clemson, USA
CODEN	IEEPAD
EISBN	1728184703 9781728184708
Genre	orig-research
GrantInformation_xml	– fundername: National Science Foundation funderid: 10.13039/100000001
IsPeerReviewed	false
IsScholarly	false
PageCount	8
URI	https://ieeexplore.ieee.org/document/9356255