Preprocessing Technique for Malicious Comments Detection Considering the Form of Comments Used in the Online Community


KIPS Transactions on Computer and Communication Systems, Vol. 12, No. 3, pp. 103-110, Mar. 2023
https://doi.org/10.3745/KTCCS.2023.12.3.103,   PDF Download:
Keywords: Natural Languege Processing, Preprocessing, Deep Learning, Malicious Comment
Abstract

With the spread of the Internet, anonymous communities emerged along with the activation of communities for communication between people, and many users are doing harm to others, such as posting aggressive posts and leaving comments using anonymity. In the past, administrators directly checked posts and comments, then deleted and blocked them, but as the number of community users increased, they reached a level that managers could not continue to monitor. Initially, word filtering techniques were used to prevent malicious writing from being posted in a form that could not post or comment if a specific word was included, but they avoided filtering in a bypassed form, such as using similar words. As a way to solve this problem, deep learning was used to monitor posts posted by users in real-time, but recently, the community uses words that can only be understood by the community or from a human perspective, not from a general Korean word. There are various types and forms of characters, making it difficult to learn everything in the artificial intelligence model. Therefore, in this paper, we proposes a preprocessing technique in which each character of a sentence is imaged using a CNN model that learns the consonants, vowel and spacing images of Korean word and converts characters that can only be understood from a human perspective into characters predicted by the CNN model. As a result of the experiment, it was confirmed that the performance of the LSTM, BiLSTM and CNN-BiLSTM models increased by 3.2%, 3.3%, and 4.88%, respectively, through the proposed preprocessing technique.


Statistics
Show / Hide Statistics

Statistics (Cumulative Counts from September 1st, 2017)
Multiple requests among the same browser session are counted as one view.
If you mouse over a chart, the values of data points will be shown.


Cite this article
[IEEE Style]
K. H. Soo and K. M. Hui, "Preprocessing Technique for Malicious Comments Detection Considering the Form of Comments Used in the Online Community," KIPS Transactions on Computer and Communication Systems, vol. 12, no. 3, pp. 103-110, 2023. DOI: https://doi.org/10.3745/KTCCS.2023.12.3.103.

[ACM Style]
Kim Hae Soo and Kim Mi Hui. 2023. Preprocessing Technique for Malicious Comments Detection Considering the Form of Comments Used in the Online Community. KIPS Transactions on Computer and Communication Systems, 12, 3, (2023), 103-110. DOI: https://doi.org/10.3745/KTCCS.2023.12.3.103.