On Analyzing COVID-19-related Hate Speech Using BERT Attention
The emergence of COVID-19 has engendered a new wave of online hate speech in social media platforms such as Twitter. Its widespread effects range from acts of cyber-harassment towards certain ethnic communities (e.g., the Asian community), to targeting older people belonging to age groups correlated...
Saved in:
Published in | 2020 19th IEEE International Conference on Machine Learning and Applications (ICMLA) pp. 669 - 676 |
---|---|
Main Authors | , , , , , |
Format | Conference Proceeding |
Language | English |
Published |
IEEE
01.12.2020
|
Subjects | |
Online Access | Get full text |
DOI | 10.1109/ICMLA51294.2020.00111 |
Cover
Abstract | The emergence of COVID-19 has engendered a new wave of online hate speech in social media platforms such as Twitter. Its widespread effects range from acts of cyber-harassment towards certain ethnic communities (e.g., the Asian community), to targeting older people belonging to age groups correlated with higher mortality rates (termed infamously as "Boomer Remover"). Thus, an urgent need arises for a timely mitigation of this new wave of online hate speech. In this work, we aim to discover the hate-related keywords linked to COVID-19 in hateful tweets posted on Twitter so that users posting such keywords can be asked to reconsider posting them. We first collect a new dataset of tweets targeting older people supplementing with a dataset targeting the Asian community. Then, we develop an approach to analyze the datasets with BERT (a transformer-based model) attention mechanism and discover 186 novel keywords targeting the Asian community and 100 keywords targeting older people. Based on our study, we then propose a control mechanism wherein a user can be asked to reconsider using certain sensitive words identified by our approach. We further perform an exploratory analysis of BERT attention mechanism and find that the most high-impact, long distance attentions are learned in the earlier or later layers of the model depending on the underlying data distribution. Our study indicates that the BERT model in some cases uses a hate keyword and an associated group or individual to make predictions, a finding that is inline with existing hate-speech research, which suggests that hate-speech is often aimed at certain groups or individuals. |
---|---|
AbstractList | The emergence of COVID-19 has engendered a new wave of online hate speech in social media platforms such as Twitter. Its widespread effects range from acts of cyber-harassment towards certain ethnic communities (e.g., the Asian community), to targeting older people belonging to age groups correlated with higher mortality rates (termed infamously as "Boomer Remover"). Thus, an urgent need arises for a timely mitigation of this new wave of online hate speech. In this work, we aim to discover the hate-related keywords linked to COVID-19 in hateful tweets posted on Twitter so that users posting such keywords can be asked to reconsider posting them. We first collect a new dataset of tweets targeting older people supplementing with a dataset targeting the Asian community. Then, we develop an approach to analyze the datasets with BERT (a transformer-based model) attention mechanism and discover 186 novel keywords targeting the Asian community and 100 keywords targeting older people. Based on our study, we then propose a control mechanism wherein a user can be asked to reconsider using certain sensitive words identified by our approach. We further perform an exploratory analysis of BERT attention mechanism and find that the most high-impact, long distance attentions are learned in the earlier or later layers of the model depending on the underlying data distribution. Our study indicates that the BERT model in some cases uses a hate keyword and an associated group or individual to make predictions, a finding that is inline with existing hate-speech research, which suggests that hate-speech is often aimed at certain groups or individuals. |
Author | Yang, Yin Cheng, Long Costello, Matthew Vishwamitra, Nishant Luo, Feng Hu, Ruijia Roger |
Author_xml | – sequence: 1 givenname: Nishant surname: Vishwamitra fullname: Vishwamitra, Nishant email: nvishwa@g.clemson.edu organization: Clemson University,Clemson,USA – sequence: 2 givenname: Ruijia Roger surname: Hu fullname: Hu, Ruijia Roger email: roger.rj.hu@gmail.com organization: Clemson University,Clemson,USA – sequence: 3 givenname: Feng surname: Luo fullname: Luo, Feng email: luofeng@clemson.edu organization: Clemson University,Clemson,USA – sequence: 4 givenname: Long surname: Cheng fullname: Cheng, Long email: lcheng2@clemson.edu organization: Clemson University,Clemson,USA – sequence: 5 givenname: Matthew surname: Costello fullname: Costello, Matthew email: mjcoste@clemson.edu organization: Clemson University,Clemson,USA – sequence: 6 givenname: Yin surname: Yang fullname: Yang, Yin email: yin5@clemson.edu organization: Clemson University,Clemson,USA |
BookMark | eNotjstKw0AYRkfQha0-gQjzAonzz6WT2QgxVhuIBLR1W-byjwbitKSzqU9vRTff2RwO34ycp11CQm6BlQDM3LXNS1cr4EaWnHFWMgYAZ2QGmldQSc3EJbnvE62THY_fQ_qgTf_ePhZgiglHmzHQ1Wnp2x7Rf9LN4Vd5WL6uaZ0zpjzs0hW5iHY84PU_52TztFw3q6Lrn9um7ooBoMoFaq7AxogGAqrIFj4Gyaxx2hoRlBHSeXCOuyh8BGkCP_3z2vEgIVjJxJzc_HUHRNzup-HLTsetEWrBlRI_Pw9Etg |
CODEN | IEEPAD |
ContentType | Conference Proceeding |
DBID | 6IE 6IL CBEJK RIE RIL |
DOI | 10.1109/ICMLA51294.2020.00111 |
DatabaseName | IEEE Electronic Library (IEL) Conference Proceedings IEEE Xplore POP ALL IEEE Xplore All Conference Proceedings IEEE Electronic Library (IEL) IEEE Proceedings Order Plans (POP All) 1998-Present |
DatabaseTitleList | |
Database_xml | – sequence: 1 dbid: RIE name: IEEE Electronic Library (IEL) url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/ sourceTypes: Publisher |
DeliveryMethod | fulltext_linktorsrc |
EISBN | 1728184703 9781728184708 |
EndPage | 676 |
ExternalDocumentID | 9356255 |
Genre | orig-research |
GrantInformation_xml | – fundername: National Science Foundation funderid: 10.13039/100000001 |
GroupedDBID | 6IE 6IL CBEJK RIE RIL |
ID | FETCH-LOGICAL-i118t-e7251affe91de5f06cfd40a9b7a93d5934bc1bb2bf3cf149d2184c7b2d41da403 |
IEDL.DBID | RIE |
IngestDate | Thu Jun 29 18:38:18 EDT 2023 |
IsPeerReviewed | false |
IsScholarly | false |
Language | English |
LinkModel | DirectLink |
MergedId | FETCHMERGED-LOGICAL-i118t-e7251affe91de5f06cfd40a9b7a93d5934bc1bb2bf3cf149d2184c7b2d41da403 |
PageCount | 8 |
ParticipantIDs | ieee_primary_9356255 |
PublicationCentury | 2000 |
PublicationDate | 2020-Dec. |
PublicationDateYYYYMMDD | 2020-12-01 |
PublicationDate_xml | – month: 12 year: 2020 text: 2020-Dec. |
PublicationDecade | 2020 |
PublicationTitle | 2020 19th IEEE International Conference on Machine Learning and Applications (ICMLA) |
PublicationTitleAbbrev | ICMLA |
PublicationYear | 2020 |
Publisher | IEEE |
Publisher_xml | – name: IEEE |
Score | 1.9150791 |
Snippet | The emergence of COVID-19 has engendered a new wave of online hate speech in social media platforms such as Twitter. Its widespread effects range from acts of... |
SourceID | ieee |
SourceType | Publisher |
StartPage | 669 |
SubjectTerms | Analytical models BERT Bit error rate Blogs COVID-19 explanation hate-speech online-hate Predictive models Social networking (online) Training data |
Title | On Analyzing COVID-19-related Hate Speech Using BERT Attention |
URI | https://ieeexplore.ieee.org/document/9356255 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1LSwMxEA5tT55UWvFNDh5Nu9lNdjcXodaWVqwVbaW3kscERdgW2V76602yVVE8eAlhCOQJ30zyfROELiJlbM65JiZPJWG5siR3uEcSnhpu4hyMDWyL-3Q4Y7dzPq-hyy8tDAAE8hm0fTW85ZulXvurso5IvLvO66jujlml1dqKcmgkOqPe-K7r8ctflcRReGOgPz5NCZgx2EXjz94qqshbe12qtt78SsT43-Hsoda3Og8_fOHOPqpB0URXkwKHBCMbZ8O9yfPohlBBglQFDB66Ej-tAPQLDiwBfN1_nOJuWVZ0xxaaDfrT3pBs_0Ygry4kKAlkzjGR1oKgBriNUm0Ni6RQmRSJ4SJhSlOlYmUTbV0UZHwopzMVG0aNZFFygBrFsoBDhFmmMya1FOAaWmplJiMGGSRKOGfF0CPU9HNfrKr0F4vttI__Np-gHb_6FePjFDXK9zWcOdwu1XnYsA8ZLJkF |
linkProvider | IEEE |
linkToHtml | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1JSwMxFA61HvSk0oq7OXg07SzJzOQi1Noy1S6irfRWsrygCNMi00t_vZlMrSgevIQQAkl4h-8t3_eC0JUntUkYU0QnkSA0kYYkFvdIyCLNdJCANo5tMYzSCb2fsmkFXW-0MADgyGfQKKaulq_nalmkypo8LNx1toW2mY0qklKttZbl-B5v9tqDfqtAsCJZEniuyuD_-DbFoUZ3Dw2-zivJIu-NZS4bavWrFeN_L7SP6t_6PPy4QZ4DVIGshm5GGXYtRlZ2DbdHL7074nPixCqgcWpH_LwAUK_Y8QTwbedpjFt5XhIe62jS7YzbKVn_jkDebFCQE4itayKMAe5rYMaLlNHUE1zGgoea8ZBK5UsZSBMqY-MgXQRzKpaBpr4W1AsPUTWbZ3CEMI1VTIUSHOxG4xsRC49CDKHk1l3R_jGqFW-fLcoGGLP1s0_-Xr5EO-l40J_1e8OHU7RbWKLkf5yhav6xhHOL4rm8cMb7BFpmnFg |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2020+19th+IEEE+International+Conference+on+Machine+Learning+and+Applications+%28ICMLA%29&rft.atitle=On+Analyzing+COVID-19-related+Hate+Speech+Using+BERT+Attention&rft.au=Vishwamitra%2C+Nishant&rft.au=Hu%2C+Ruijia+Roger&rft.au=Luo%2C+Feng&rft.au=Cheng%2C+Long&rft.date=2020-12-01&rft.pub=IEEE&rft.spage=669&rft.epage=676&rft_id=info:doi/10.1109%2FICMLA51294.2020.00111&rft.externalDocID=9356255 |