Online Disinformation Detection through Text Analysis: A Comparative Study of Supervised Models with Hyperparameter Optimization

Viviane Kaseka Katadi *

Faculty of Computer Science, Notre Dame University of Kasayi (U.KA), Kananga, Democratic Republic of Congo.

Gilbert Ngoyi Maloba

Faculty of Computer Science, Notre Dame University of Lomami (UNILO), Lomami, Democratic Republic of Congo.

Jean-Marie Ibanga Mbayo

Faculty of Computer Science, Notre Dame University of Lomami (UNILO), Lomami, Democratic Republic of Congo.

Pélagie Mpembe Mukonkole

Faculty of Computer Science, Notre Dame University of Lomami (UNILO), Lomami, Democratic Republic of Congo.

Clarisse Ngoyi Tshite

Faculty of Computer Science, Notre Dame University of Lomami (UNILO), Lomami, Democratic Republic of Congo.

Mardochée Kalala Mpolesha

Department of Computer Science, Luiza Rural Development Institute (ISDR-LUIZA), Kananga, Democratic Republic of Congo.

Pierre Kafunda Katalay

Faculty of Science and Technology, University of Kinshasa (UNIKIN), Kinshasa, Democratic Republic of Congo.

*Author to whom correspondence should be addressed.


Abstract

The rapid spread of disinformation on social media poses a major challenge in the digital age, with significant impacts on public opinion and decision-making. In this context, this study proposes a machine learning-based approach for the automatic detection of online disinformation. A comparative analysis is conducted on several supervised learning models, including logistic regression, support vector machines (SVMs), random forests, and gradient boosting. The experiment is based on a real-world dataset of textual content from digital platforms, preprocessed using TF-IDF. Furthermore, hyperparameter optimization, primarily using Grid Search, is implemented to improve model performance. The results obtained reveal very high performance for all models, with accuracy values ​​exceeding 98% and areas under the ROC curve (AUC) close to 1. The Gradient Boosting model stands out as the best performer, offering an excellent balance between accuracy and generalization capabilities, while the Random Forest model, although exhibiting a perfect AUC, shows potential signs of overfitting. This study highlights the effectiveness of machine learning methods for disinformation detection and underscores the importance of hyperparameter optimization in improving model performance. It also opens up interesting avenues for integrating more advanced techniques, including deep learning and multimodal analysis, into disinformation countermeasures systems. The models were evaluated using data separation into training and test sets, allowing for a reliable estimation of their performance. The results show that hyperparameter optimization significantly improves the performance of classical models. However, certain limitations related to the diversity of data sources and methodological choices must be taken into account.

Graphical Summary

mceclip0.png

 

Keywords: Disinformation, machine learning, hyperparameters;, TF-IDF, supervised learning, fake-news detection


How to Cite

Katadi, Viviane Kaseka, Gilbert Ngoyi Maloba, Jean-Marie Ibanga Mbayo, Pélagie Mpembe Mukonkole, Clarisse Ngoyi Tshite, Mardochée Kalala Mpolesha, and Pierre Kafunda Katalay. 2026. “Online Disinformation Detection through Text Analysis: A Comparative Study of Supervised Models With Hyperparameter Optimization”. Asian Journal of Research in Computer Science 19 (4):55-72. https://doi.org/10.9734/ajrcos/2026/v19i4849.

Downloads

Download data is not yet available.