Offensive Language Detection Using Soft Voting Ensemble Model

dc.contributor.authorFieri, Brillian
dc.contributor.authorSuhartono, Derwin
dc.coverage.issue1cs
dc.coverage.volume29cs
dc.date.accessioned2024-01-11T08:34:34Z
dc.date.available2024-01-11T08:34:34Z
dc.date.issued2023-06-30cs
dc.description.abstractOffensive language is one of the problems that have become increasingly severe along with the rise of the internet and social media usage. This language can be used to attack a person or specific groups. Automatic moderation, such as the usage of machine learning, can help detect and filter this particular language for someone who needs it. This study focuses on improving the performance of the soft voting classifier to detect offensive language by experimenting with the combinations of the soft voting estimators. The model was applied to a Twitter dataset that was augmented using several augmentation techniques. The features were extracted using Term Frequency-Inverse Document Frequency, sentiment analysis, and GloVe embedding. In this study, there were two types of soft voting models: machine learning-based, with the estimators of Random Forest, Decision Tree, Logistic Regression, Naïve Bayes, and AdaBoost as the best combination, and deep learning-based, with the best estimator combination of Convolutional Neural Network, Bidirectional Long Short-Term Memory, and Bidirectional Gated Recurrent Unit. The results of this study show that the soft voting classifier was better in performance compared to classic machine learning and deep learning models on both original and augmented datasets.en
dc.formattextcs
dc.format.extent1-6cs
dc.format.mimetypeapplication/pdfen
dc.identifier.citationMendel. 2023 vol. 29, č. 1, s. 1-6. ISSN 1803-3814cs
dc.identifier.doi10.13164/mendel.2023.1.001en
dc.identifier.issn2571-3701
dc.identifier.issn1803-3814
dc.identifier.urihttps://hdl.handle.net/11012/244235
dc.language.isoencs
dc.publisherInstitute of Automation and Computer Science, Brno University of Technologycs
dc.relation.ispartofMendelcs
dc.relation.urihttps://mendel-journal.org/index.php/mendel/article/view/211cs
dc.rightsCreative Commons Attribution-NonCommercial-ShareAlike 4.0 International licenseen
dc.rights.accessopenAccessen
dc.rights.urihttp://creativecommons.org/licenses/by-nc-sa/4.0en
dc.subjectOffensive Languageen
dc.subjectText Classificationen
dc.subjectVoting Classifieren
dc.subjectEnsemble Modelen
dc.titleOffensive Language Detection Using Soft Voting Ensemble Modelen
dc.type.driverarticleen
dc.type.statusPeer-revieweden
dc.type.versionpublishedVersionen
eprints.affiliatedInstitution.facultyFakulta strojního inženýrstvícs
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
211-Article Text-568-2-10-20230630.pdf
Size:
1.9 MB
Format:
Adobe Portable Document Format
Description:
Collections