Detecting Rumors Transformed from Hong Kong Copypasta

Yin Chun Fung, Lap Kei Lee, Kwok Tai Chui, Ian Cheuk Yin Lee, Morris Tsz On Chan, Jake Ka Lok Cheung, Marco Kwan Long Lam, Nga In Wu, Markus Lu

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution > peer-review

Abstract

A copypasta is a piece of text that is repeatedly copied and pasted in online forums and social networking sites (SNSs), usually for a humorous or mocking purpose. In recent years, copypasta has also been used to spread rumors and false information, which not only damages the reputation of individuals or organizations but also misleads many netizens. This paper presents a tool for Hong Kong netizens to detect text messages that are copypasta or their variants (created by transforming an existing copypasta with new subjects and events). We exploit the Encyclopedia of Virtual Communities in Hong Kong (EVCHK), which contains a database of 315 commonly occurring copypasta in Hong Kong, and a convolutional neural network (CNN) model to determine whether a text message is a copypasta or a variant of one, with an accuracy of around 98%. We also present a prototype of a Google Chrome browser extension that provides a user-friendly interface for netizens to check a selected text message for copypasta and their variants directly (e.g., in an online forum or SNS). The tool can show the source of the corresponding copypasta and, if the message is a variant, highlight the differences. In a survey, users agreed that our tool can effectively help them identify copypasta and hence help stop the spread of this kind of online rumor.
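As an illustration of the "show the source and highlight the differences" step described in the abstract, the following is a minimal sketch only. It does not reproduce the paper's CNN model or the EVCHK database: it substitutes a plain sequence-similarity score from Python's standard `difflib` module, and the function names (`closest_template`, `highlight_differences`), the `[[...]]` marker, and the English sample texts are all hypothetical. Tokens are split on whitespace here; Chinese text would likely need character-level comparison or a word segmenter instead.

```python
from difflib import SequenceMatcher


def closest_template(message, templates, tokenize=str.split):
    """Return (template, similarity) for the known copypasta closest to `message`.

    Similarity is difflib's ratio over tokens, a stand-in for a trained classifier.
    """
    msg_tokens = tokenize(message)
    scored = [
        (t, SequenceMatcher(None, tokenize(t), msg_tokens).ratio())
        for t in templates
    ]
    return max(scored, key=lambda pair: pair[1])


def highlight_differences(source, variant, tokenize=str.split):
    """Return `variant` with tokens that differ from `source` wrapped in [[...]]."""
    src_tokens, var_tokens = tokenize(source), tokenize(variant)
    matcher = SequenceMatcher(None, src_tokens, var_tokens)
    out = []
    for tag, _i1, _i2, j1, j2 in matcher.get_opcodes():
        if tag == "equal":
            out.extend(var_tokens[j1:j2])
        elif tag in ("replace", "insert"):
            out.append("[[" + " ".join(var_tokens[j1:j2]) + "]]")
        # "delete": tokens present only in the source; nothing to mark in the variant
    return " ".join(out)


# Hypothetical sample data, not taken from EVCHK.
templates = [
    "I saw Alice steal a bike at the station",
    "Free pizza for everyone who shares this post",
]
message = "I saw Bob steal a car at the station"

source, score = closest_template(message, templates)
print(source)                                  # I saw Alice steal a bike at the station
print(highlight_differences(source, message))  # I saw [[Bob]] steal a [[car]] at the station
```

A real detector would replace `closest_template` with a learned model and a decision threshold; the diff step is independent of how the source copypasta is found.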

Original language: English
Title of host publication: International Conference on Cyber Security, Privacy and Networking, ICSPN 2022
Editors: Nadia Nedjah, Gregorio Martínez Pérez, B.B. Gupta
Publisher: Springer Science and Business Media Deutschland GmbH
Pages: 11-23
Number of pages: 13
ISBN (Print): 9783031220173
Publication status: Published - 2023
Event: International Conference on Cyber Security, Privacy and Networking, ICSPN 2022 - Virtual, Online
Duration: 9 Sept 2021 → 11 Sept 2021

Publication series

Name: Lecture Notes in Networks and Systems
Volume: 599 LNNS
ISSN (Print): 2367-3370
ISSN (Electronic): 2367-3389

Conference

Conference: International Conference on Cyber Security, Privacy and Networking, ICSPN 2022
City: Virtual, Online
Period: 9/09/21 → 11/09/21

Keywords

  • Copypasta
  • Natural language processing
  • Rumor detection
