Development of a Machine Learning-Based Model In Detecting Fake News Analyzing Techniques For Accurate Content Verification

Main Article Content

Esther Kinkosi Tomba
Lawrence Mwenda Muriira
Timothy Anondo

Abstract

Information through social media and other news outlets made detecting fake news crucial for individuals. The Pew Research Centre conducted surveys in the U.S.A to examine how adults consume news via social media, aiming to understand the behaviours and demographics of those relying on such platforms. This study addressed a critical gap in traditional fake news detection methods, which mainly used manual approaches and lacked advanced machine learning or AI techniques. Traditional methods are insufficient to handle the complexity, and contextual manipulation, where accurate information is presented misleadingly. To overcome these limitations, the study developed a ML Based model for detecting fake news, by analysing article content, and identifying patterns of misinformation. It employed advanced natural language processing techniques and supervised learning algorithms such as Decision Trees with 99.67% of accuracy, Logistic Regression with 99.13%, and Random Forest with 99.15%. Methods like Tokenization and TF-IDF were used to train the model using the ISO Fake news dataset. This dataset included real news from Reuters.com and fake news from unreliable sources flagged by PolitiFact and Wikipedia. Additional labelled datasets like LIAR and FakeNewsNet, along with newly gathered data, were used to supplement the training. Model performance was assessed using accuracy, precision, recall and F1-Score, all achieving 99.67%, demonstrating superior detection capabilities. The research contributed to ML by advancing NLP Techniques and improving fake news detection models. The study recommends future researchers, engineers and all those involved in developing machine learning systems to enhance further effectiveness should expand datasets and including diverse languages, applying deep learning models like RNN, CNN, and Transformers, (e.g., BERT, ROBERTa) for better contextual analysis, and establishing benchmarks using real-world case studies.    

Article Details

How to Cite
Tomba, E. K., Muriira, L. M., & Anondo, T. (2025). Development of a Machine Learning-Based Model In Detecting Fake News: Analyzing Techniques For Accurate Content Verification. International Journal of Professional Practice, 13(2), 28–41. https://doi.org/10.71274/ijpp.v13i2.569
Section
Browse Articles in this Issue

References

Ahuja, N., & Kumar, S. (2020). S-HAN: Hierarchical Attention Networks with Stacked Gated Recurrent Unit for Fake News Detection. 2022 10th International Conference on Reliability, Infocom Technologies and Optimization (Trends and Future Directions) (ICRITO), 873–877. https://doi.org/10.1109/icrito48877.2020.9197914

Albahr, A., & Albahar, M. (2020). An Empirical Comparison of Fake News Detection using different Machine Learning Algorithms. International Journal of Advanced Computer Science and Applications, 11(9). https://doi.org/10.14569/ijacsa.2020.0110917

Al-Furaiji, R. H., & Abdulkader, H. (2024). Comparison of the performance of six machine learning algorithms for fake news. EAI Endorsed Transactions on AI and Robotics, 3. https://doi.org/10.4108/airo.4153

Aslam, N., Khan, I. U., Alotaibi, F. S., Aldaej, L. A., & Aldubaikil, A. K. (2021). Fake Detect: A deep learning ensemble model for fake news detection. Complexity, 2021(1). https://doi.org/10.1155/2021/5557784

Bahad, P., Saxena, P., & Kamal, R. (2019). Fake News Detection using Bi-directional LSTM-Recurrent Neural Network. Procedia Computer Science, 165, 74–82. https://doi.org/10.1016/j.procs.2020.01.072

Birunda, S. S., & Devi, R. K. (2021b). A Novel Score-Based Multi-Source Fake News Detection using Gradient Boosting Algorithm. International Conference on Artificial Intelligence and Smart Systems (ICAIS), 406–414. https://doi.org/10.1109/icais50930.2021.9395896

Cikambasi, C. L., Muriira, L. M., & Murungi, R. M. (2024). Deep Learning Network Intrusion Detection with the Conv1d-Lstm Model: Integrating CNN and LSTM For Superior Performance. International Journal of Professional Practice, 12(4), 41–49. https://doi.org/10.71274/ijpp.v12i4.475

Daud, S., Ullah, M., Rehman, A., Saba, T., Damaševičius, R., & Sattar, A. (2023). Topic Classification of online news articles using optimized machine learning models. Computers, 12(1), 16. https://doi.org/10.3390/computers12010016

Faustini, P. H. A., & Covões, T. F. (2020). Fake news detection in multiple platforms and languages. Expert Systems with Applications, 158, 113503. https://doi.org/10.1016/j.eswa.2020.113503

Gereme, F., Zhu, W., Ayall, T., & Alemu, D. (2021). Combating fake news in “Low-Resource” languages: amharic fake news detection accompanied by resource crafting. Information, 12(1), 20. https://doi.org/10.3390/info12010020

Goldani, M. H., Momtazi, S., & Safabakhsh, R. (2020). Detecting fake news with capsule neural networks. Applied Soft Computing, 101, 106991. https://doi.org/10.1016/j.asoc.2020.106991

Jain, A., & Kasbe, A. (2018). Fake news detection. 2020 IEEE International Students’ Conference on Electrical,Electronics and Computer Science (SCEECS). https://doi.org/10.1109/sceecs.2018.8546944

Kaliyar, R. K., Goswami, A., & Narang, P. (2021). EchoFakeD: improving fake news detection in social media with an efficient deep neural network. Neural Computing and Applications, 33(14), 8597–8613. https://doi.org/10.1007/s00521-020-05611-1

Konkobo, P. M., Zhang, R., Huang, S., Minoungou, T. T., Ouedraogo, J. A., & Li, L. (2020). A Deep Learning Model for Early Detection of Fake News on Social Media. 7th International Conference on Behavioural and Social Computing (BESC), 1–6. https://doi.org/10.1109/besc51023.2020.9348311

Lyu, S., & Lo, D. C. (2020). Fake news detection by decision tree. SoutheastCon. https://doi.org/10.1109/southeastcon44009.2020.9249688

Mugdha, S. B. S., Ferdous, S. M., & Fahmin, A. (2020). Evaluating Machine Learning Algorithms For Bengali Fake News Detection. International Conference on Computer and Information Technology, 1–6. https://doi.org/10.1109/iccit51783.2020.9392662

Nasir, J. A., Khan, O. S., & Varlamis, I. (2021). Fake news detection: A hybrid CNN-RNN based deep learning approach. International Journal of Information Management Data Insights, 1(1), 100007. https://doi.org/10.1016/j.jjimei.2020.100007

Ozbay, F. A., & Alatas, B. (2019). A novel approach for detection of fake news on social media using metaheuristic optimization algorithms. Elektronika Ir Elektrotechnika, 25(4), 62–67. https://doi.org/10.5755/j01.eie.25.4.23972

Pandey, S., Prabhakaran, S., Reddy, N. V. S., & Acharya, D. (2022). Fake News Detection from Online media using Machine learning Classifiers. Journal of Physics Conference Series, 2161(1), 012027. https://doi.org/10.1088/1742-6596/2161/1/012027

Pardamean, A., & Pardede, H. F. (2021). Tuned bidirectional encoder representations from transformers for fake news detection. Indonesian Journal of Electrical Engineering and Computer Science, 22(3), 1667. https://doi.org/10.11591/ijeecs.v22.i3.pp1667-1671

Sahoo, S. R., & Gupta, B. (2020). Multiple features based approach for automatic fake news detection on social networks using deep learning. Applied Soft Computing, 100, 106983. https://doi.org/10.1016/j.asoc.2020.106983

Song, C., Ning, N., Zhang, Y., & Wu, B. (2020). A multimodal fake news detection model based on crossmodal attention residual and multichannel convolutional neural networks. Information Processing & Management, 58(1), 102437. https://doi.org/10.1016/j.ipm.2020.102437

Pisner, D. A., & Schnyer, D. M. (2019). Support vector machine. In Machine learning (pp. 101–121). https://doi.org/10.1016/b978-0-12-815739-8.00006-7

Su, J., Cardie, C., & Nakov, P. (2024). Adapting fake news detection to the era of large language models. Findings of the Association for Computational Linguistics: NAACL 2022. https://doi.org/10.18653/v1/2024.findings-naacl.95

Taha, M. A., Jabar, H. D. A., & Mohammed, W. K. (2024b). Fake news detection model basing on machine learning algorithms. Baghdad Science Journal, 21(8), 2771. https://doi.org/10.21123/bsj.2024.8710

Thakar, H., & Bhatt, B. (2024). Fake news detection: recent trends and challenges. Social Network Analysis and Mining, 14(1). https://doi.org/10.1007/s13278-024-01344-4

Verma, P. K., Agrawal, P., Amorim, I., & Prodan, R. (2021). WELFAKE: Word embedding over Linguistic features for Fake News Detection. IEEE Transactions on Computational Social Systems, 8(4), 881–893. https://doi.org/10.1109/tcss.2021.3068519

Villela, H. F., Corrêa, F., De Araújo Nery Ribeiro, J. S., Rabelo, A., & Carvalho, D. B. F. (2023). Fake news detection: a systematic literature review of machine learning algorithms and datasets. Journal on Interactive Systems, 14(1), 47–58. https://doi.org/10.5753/jis.2023.3020

Wang, W. Y. (2017). “Liar, Liar Pants on Fire”: A New Benchmark Dataset for Fake News Detection. Annual Meeting of the Association for Computational Linguistics. https://doi.org/10.18653/v1/p17-2067

Zahednejad, B., Ke, L., & Li, J. (2020c). A novel Machine Learning-Based approach for security analysis of authentication and key agreement protocols. Security and Communication Networks, 2020, 1–15. https://doi.org/10.1155/2020/8848389