On Analyzing COVID-19-related Hate Speech Using BERT Attention

The emergence of COVID-19 has engendered a new wave of online hate speech in social media platforms such as Twitter. Its widespread effects range from acts of cyber-harassment towards certain ethnic communities (e.g., the Asian community), to targeting older people belonging to age groups correlated...

Bibliographic Details
Published in 2020 19th IEEE International Conference on Machine Learning and Applications (ICMLA), pp. 669-676
Main Authors Vishwamitra, Nishant; Hu, Ruijia Roger; Luo, Feng; Cheng, Long; Costello, Matthew; Yang, Yin
Format Conference Proceeding
Language English
Published IEEE 01.12.2020
DOI 10.1109/ICMLA51294.2020.00111

Abstract The emergence of COVID-19 has engendered a new wave of online hate speech on social media platforms such as Twitter. Its effects range from cyber-harassment of certain ethnic communities (e.g., the Asian community) to the targeting of older people in age groups correlated with higher mortality rates (infamously termed "Boomer Remover"). Thus, there is an urgent need for timely mitigation of this new wave of online hate speech. In this work, we aim to discover the hate-related keywords linked to COVID-19 in hateful tweets posted on Twitter so that users posting such keywords can be asked to reconsider posting them. We first collect a new dataset of tweets targeting older people and supplement it with a dataset targeting the Asian community. Then, we develop an approach that analyzes the datasets using the attention mechanism of BERT (a transformer-based model) and discover 186 novel keywords targeting the Asian community and 100 keywords targeting older people. Based on our study, we then propose a control mechanism wherein a user can be asked to reconsider using certain sensitive words identified by our approach. We further perform an exploratory analysis of the BERT attention mechanism and find that the highest-impact, long-distance attentions are learned in the earlier or later layers of the model, depending on the underlying data distribution. Our study indicates that the BERT model in some cases uses a hate keyword and an associated group or individual to make predictions, a finding that is in line with existing hate-speech research, which suggests that hate speech is often aimed at certain groups or individuals.
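The abstract does not state how attention weights are converted into keywords, so the following is only a minimal sketch of one plausible scoring rule, not the authors' method. It assumes attention is available as nested lists indexed [layer][head][query][key], that position 0 is the [CLS] token, and that averaging the attention [CLS] pays to each token is a reasonable proxy for that token's importance. The function name, the placeholder tokens, and the toy weights are all hypothetical.

```python
from typing import List, Tuple

# attention[layer][head][query][key]: weights of the kind a BERT-style
# model can expose. The values used below are made up for illustration.
Attention = List[List[List[List[float]]]]

SPECIAL_TOKENS = {"[CLS]", "[SEP]", "[PAD]"}


def rank_tokens_by_cls_attention(
    tokens: List[str], attention: Attention
) -> List[Tuple[str, float]]:
    """Rank tokens by the mean attention that [CLS] (position 0) pays to them.

    Assumption: averaging over all layers and heads is used here only as a
    simple illustrative scoring rule.
    """
    totals = [0.0] * len(tokens)
    n_maps = 0
    for layer in attention:
        for head in layer:
            cls_row = head[0]  # attention from [CLS] to every position
            for key_pos, weight in enumerate(cls_row):
                totals[key_pos] += weight
            n_maps += 1
    scored = [
        (tok, totals[i] / n_maps)
        for i, tok in enumerate(tokens)
        if tok not in SPECIAL_TOKENS
    ]
    # Highest mean attention first: candidates for manual keyword review.
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Toy example: 1 layer, 2 heads, 4 tokens (hypothetical weights).
tokens = ["[CLS]", "some", "keyword", "[SEP]"]
uniform = [0.25, 0.25, 0.25, 0.25]
toy_attention = [[
    [[0.1, 0.2, 0.6, 0.1], uniform, uniform, uniform],
    [[0.1, 0.1, 0.7, 0.1], uniform, uniform, uniform],
]]
ranking = rank_tokens_by_cls_attention(tokens, toy_attention)
```

In practice the top-ranked tokens would still need human review before being treated as sensitive keywords, since high attention alone does not establish that a word is hateful.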
Author Yang, Yin
Cheng, Long
Costello, Matthew
Vishwamitra, Nishant
Luo, Feng
Hu, Ruijia Roger
Author_xml – sequence: 1
  givenname: Nishant
  surname: Vishwamitra
  fullname: Vishwamitra, Nishant
  email: nvishwa@g.clemson.edu
  organization: Clemson University,Clemson,USA
– sequence: 2
  givenname: Ruijia Roger
  surname: Hu
  fullname: Hu, Ruijia Roger
  email: roger.rj.hu@gmail.com
  organization: Clemson University,Clemson,USA
– sequence: 3
  givenname: Feng
  surname: Luo
  fullname: Luo, Feng
  email: luofeng@clemson.edu
  organization: Clemson University,Clemson,USA
– sequence: 4
  givenname: Long
  surname: Cheng
  fullname: Cheng, Long
  email: lcheng2@clemson.edu
  organization: Clemson University,Clemson,USA
– sequence: 5
  givenname: Matthew
  surname: Costello
  fullname: Costello, Matthew
  email: mjcoste@clemson.edu
  organization: Clemson University,Clemson,USA
– sequence: 6
  givenname: Yin
  surname: Yang
  fullname: Yang, Yin
  email: yin5@clemson.edu
  organization: Clemson University,Clemson,USA
CODEN IEEPAD
ContentType Conference Proceeding
DBID 6IE
6IL
CBEJK
RIE
RIL
DOI 10.1109/ICMLA51294.2020.00111
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Xplore POP ALL
IEEE Xplore All Conference Proceedings
IEEE Electronic Library (IEL)
IEEE Proceedings Order Plans (POP All) 1998-Present
DatabaseTitleList
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE Electronic Library (IEL)
  url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
EISBN 1728184703
9781728184708
EndPage 676
ExternalDocumentID 9356255
Genre orig-research
GrantInformation_xml – fundername: National Science Foundation
  funderid: 10.13039/100000001
GroupedDBID 6IE
6IL
CBEJK
RIE
RIL
ID FETCH-LOGICAL-i118t-e7251affe91de5f06cfd40a9b7a93d5934bc1bb2bf3cf149d2184c7b2d41da403
IEDL.DBID RIE
IngestDate Thu Jun 29 18:38:18 EDT 2023
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-i118t-e7251affe91de5f06cfd40a9b7a93d5934bc1bb2bf3cf149d2184c7b2d41da403
PageCount 8
ParticipantIDs ieee_primary_9356255
PublicationCentury 2000
PublicationDate 2020-Dec.
PublicationDateYYYYMMDD 2020-12-01
PublicationDate_xml – month: 12
  year: 2020
  text: 2020-Dec.
PublicationDecade 2020
PublicationTitle 2020 19th IEEE International Conference on Machine Learning and Applications (ICMLA)
PublicationTitleAbbrev ICMLA
PublicationYear 2020
Publisher IEEE
Publisher_xml – name: IEEE
Score 1.9150791
SourceID ieee
SourceType Publisher
StartPage 669
SubjectTerms Analytical models
BERT
Bit error rate
Blogs
COVID-19
explanation
hate-speech
online-hate
Predictive models
Social networking (online)
Training data
Twitter
Title On Analyzing COVID-19-related Hate Speech Using BERT Attention
URI https://ieeexplore.ieee.org/document/9356255