Image Fake News Prediction Based on Random Forest and Gradient-boosting Methods

DOI:

https://doi.org/10.36371/port.2023.4.6

Authors

  • Saadi. M. Saadi Iraqi Commission for Computers and Informatics, Informatics Institute of Postgraduate Studies, Baghdad, Iraq.
  • Waleed A. Mahmoud Al-Jawher College of Engineering, Uruk University, Baghdad, Iraq.

The internet technology of today makes it challenging to spread false information, particularly through photos, including fake news. In this study, fake news is identified and predicted using photos that have been altered or misrepresented. Effective detection systems are crucial because of the proliferation of false information that images might spread due to the use of image modification tools and social media. This paper provides a thorough analysis of fake news based on images. Among the main research areas are machine learning for classification models and image data embedding (feature extraction). Our novel methodology forecasts fake news in the form of altered or misleading photographs by using Random Forest and gradient-boosting algorithms to detect visual alterations such as picture editing and image synthesis. This research leverages massive image datasets from news channels and social media to train and assess predictive algorithms.  Our results demonstrate that our method has strong recall and precision in identifying image-based fake news. We also discuss practical applications and real-time detection, such building tools to combat misinformation on social media and in news organizations. At 0.968 with 0.997, Gradient Boosting performs better than Random Forest. 

Keywords:

: Image Fake News, Random Forest, Feature Extraction, gradient-boosting, Support feature Extraction.

[1] Sharma, S., Sharma, D.K. (2020). Fake News Detection: A long way to go. In: 4th International Conference on Information Systems and Computer Networks (ISCON)2019. pp 816–821, https://doi.org/10.1109/ISCON47742.2019.9036221

[2] Stoll, J. (2020) Reading fake news about the Coronavirus in Norway 2020. https://www.statista.com/statistics/1108710/read ing-fake-news-about-the-coronavirus-in-norway-by-source/ (2020). Accessed on May 2, 2020.

[3] CIGI-Ipsos Global Survey on Internet Security and Trust (2019). https://www.cigionline.org/internet-survey-2019. Accessed on 15 January 2021.

[4] Fazio, L. (Feb 2020). Curbing fake news: Here’s why visuals are the most potent form of misinformation from https://scroll.in/ article/953395/curbing-fake-news-heres-why-visuals-are-themost-potent-form-of-misinformation last accessed on 2021/1/2.

[5] [5] Adobe blog https://blogs.adobe.com/creative/files/2015/12/ Adobe-State-of-Content-Report.pdf.

[6] McCarthy N (2020) Report: Facebook Poses A Major Threat To Public Health. https://www.statista.com/chart/22660/health-mis information-on-facebook/ Accessed on September 3, 2020.

[7] Singh, B., Sharma, D.K. (2021) Image Forgery Over Social Media Platforms - A Deep Learning Approach for Its Detection and Localization. In: 15th International Conference on Computing for Sustainable Global Development—Bharti Vidyapeeth, Delhi.

[8] Johnston P, Elyan E, Jayne C (2019) Video tampering localization using features learned from authentic Content. Neural Comput Appl 32:12243–12257

[9] Vishwakarma DK, Varshney D, Yadav A (2019) Detection and veracity analysis of fake news via scrapping and authenticating the web search. Cognit Syst Res 58:217–229.

[10] Mangal, D., & Sharma, D.K. (2020) Fake News Detection with Integration of Embedded Text Cues and Image Features. In: 8th International Conference on Reliability, Infocom Technologies and Optimization (Trends and Future Directions) (ICRITO) (2020), pp 68–72

[11] Nakamura, K., Levy, S., & Wang, W.Y. (2019). r/faked it: A new multimodal benchmark dataset for fine-grained fake news detection. arXiv:1911.03854

[12] Meel, P., & Vishwakarma, D.K. (2021). HAN, image captioning, and forensics ensemble multimodal fake news detection. Information Sciences, 567, 23–41. https://doi.org/10.1016/j.ins.2021.03.037.

[13] data set available on the website(News and Commercial Detection (kaggle.com)

[14] Forrest N. Iandola, Song Han, Matthew W. Moskewicz, Khalid Ashraf, William J. Dally, Kurt Keutzer,’ SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and <0.5MB model size”, Computer Vision and Pattern Recognition,(2026). https://doi.org/10.48550/arXiv.1602.07360

[15] N. N. A. Sjarif, N. F. M. Azmi, S. Chuprat, H. M. Sarkan, Y. Yahya, S. M. Sam, SMS Spam message detection using term frequency-inverse document frequency and random forest algorithm, Proc. Comput. Sci., 161 (2019) 509-515. https://doi.org/10.1016/j.procs.2019.11.150.

[16] Chen, T., Guestrin, C. 2016. XGBoost: A Scalable Tree Boosting System. KDD '16: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining August 2016, Pages 785–794 https://doi.org/10.1145/2939672.2939785

[17] M. Sarnovský, V. Maslej-Krešňáková, and N. Hrabovská, "Annotated dataset for the fake news classification in Slovak language," in 2020 18th International Conference on Emerging eLearning Technologies and Applications (ICETA), 2020, pp. 574-579: IEEE. DOI: 10.1109/ICETA51985.2020.9379254.

[18] Al-Mejibli, I. S., Abd, D. H., Alwan, J. K., & Rabash, A. J. (2018). Performance Evaluation of Kernels in Support Vector Machine. 2018 1st Annual International Conference on Information and Sciences (AiCIS). doi:10.1109/aicis.2018.00029

[19] Abd, Dhafar Hamed; Alwan, Jwan K.; Ibrahim, Mohamed; Naeem, Mohammad B. (2017), The utilization of machine learning approaches for medical data classification and personal care system management for sickle cell disease, IEEE 2017 Annual Conference on New Trends in Information & Communications Technology Applications (NTICT), 213–218. doi:10.1109/ntict.2017.7976147.

[20] Ibraheem Al-Jadir, Waleed A Mahmoud “A Grey Wolf Optimizer Feature Selection method and its Effect on the Performance of Document Classification Problem” journal port science research, Vol. 4, Issue 2, PP. 125-131, 2021..

[21] Waleed A Mahmoud, MR Shaker “3D Ear Print Authentication using 3D Radon Transform” proceeding of 2nd International Conference on Information & Communication Technologies, Pages 1052-1056, 2006.

[22] RA Dihin, E AlShemmary, W Al-Jawher “Diabetic Retinopathy Classification Using Swin Transformer with Multi Wavelet” Journal of Kufa for Mathematics and Computer 10 (2), 167-172, 2023.

[23] Rasha. A. Dihin,2Ebtesam N. AlShemmary, 3Waleed Ameen Al Jawher “Implementation Of The Swin Transformer and Its Application In Image Classification” Journal Port Science Research 6 (4), 318-331.

[24] Ali Akram Abdul-Kareem, Waleed Ameen Mahmoud Al-Jawher, “Image Encryption Algorithm Based on Arnold Transform and Chaos Theory in the Multi-wavelet Domain”, International Journal of Computers and Applications, Vol. 45, Issue 4, pp. 306-322, 2023.

[25] Sarah H Awad Waleed A Mahmoud Al-Jawher, “Precise Classification of Brain Magnetic Resonance Imaging (MRIs) using Gray Wolf Optimization (GWO)” HSOA Journal of Brain & Neuroscience Research. Vol 6, issue 1, pp 100021, 2022

[26] .Ali Akram Abdul-Kareem, W. A. Mahmoud Al-Jawher, Zahraa A Hasan, Suha M. Hadi,” Time Domain Speech Scrambler Based on Particle Swarm Optimization” International Journal for Engineering and Information Sciences, Vol. 18, Issue 1, PP. 161-166, 2023.

[27] Zahraa A Hasan, Suha M Hadi, Waleed A Mahmoud, “Speech scrambler with multiwavelet, Arnold Transform and particle swarm optimization” Journal Pollack Periodica, Volume 18, Issue 3, Pages 125-131, 2023.

[28] Ali Akram Abdul-Kareem, Waleed Ameen Mahmoud Al-Jawher, “WAM 3D discrete chaotic map for secure communication applications” International Journal of Innovative Computing, Volume 13, Issue 1-2, Pages 45-54, 2022.

[29] Ali Akram Abdul-Kareem, Waleed Ameen Mahmoud Al-Jawher” Hybrid image encryption algorithm based on compressive sensing, gray wolf optimization, and chaos” Journal of Electronic Imaging, Volume 32, Issue 4, Pages 043038-043038, 2023.

[30] Waleed A. Mahmud Al-Jawhar “Feature Combination & Mapping Using Multiwavelet Transform” Al-Rafidain Univ. College journal, Issue 19, Pages: 13-34, 2006.

[31] Waleed Ameen Mahmoud “A Smart Single Matrix Realization of Fast Walidlet Transform” Journal International Journal of Research and Reviews in Computer Science, Volume 2, Issue, 1, Pages 144-151, 2011.

[32] Hamid M Hasan, Waleed A. Mahmoud Al- Jawher, Majid A Alwan “3-d face recognition using improved 3d mixed transform” Journal International Journal of Biometrics and Bioinformatics (IJBB), Volume 6, Issue 1, Pages 278-290, 2012.

[33] AHM Al-Heladi, W. A. Mahmoud, HA Hali, AF Fadhel “Multispectral Image Fusion using Walidlet Transform” Advances in Modelling and Analysis B, vol 52, issue 1-2, pp. 1-20, 2009.

[34] Walid A Mahmoud, Majed E Alneby, Wael H Zayer “2D-multiwavelet transform 2D-two activation function wavelet network-based face recognition” J. Appl. Sci. Res, vol. 6, issue 8, 1019-1028, 2010.

[35] Waleed A Mahmoud, MR Shaker “3D Ear Print Authentication using 3D Radon Transform” proceeding of 2nd International Conference on Information & Communication Technologies, Pages 1052-1056, 2006.

[36] Waleed A Mahmoud, Afrah Loay Mohammed Rasheed “3D Image Denoising by Using 3D Multiwavelet” AL-Mustansiriya J. Sci, vol 21, issue 7, pp. 108-136, 2010.

[37] Maryam I Mousa Al-Khuzaay, Waleed A Mahmoud Al-Jawher, “New Proposed Mixed Transforms: CAW and FAW and Their Application in Medical Image Classification” International Journal of Innovative Computing, Volume 13, Issue 1-2, Pages 15-21, 2022.

[38] Hamid M Hasan, Waleed Ameen Mahmoud Al-Jawher and Majid A Alwan, “3-d face recognition using improved 3d mixed transform” International Journal of Biometrics and Bioinformatics (IJBB), Volume 6, Issue 1, Pages 278, 2012.

[39] WA Mahmoud, RA Jassim, “ Image Denoising Using Hybrid Transforms” Engineering and Technology Journal, Volume 25, Issue 5, Pages 669-682, 2007.

[40] Abbas Hasan Kattoush, Waleed Ameen Mahmoud Al-Jawher, Sulaiman M Abbas, Ali Tweij Shaheen “A N-Radon Based OFDM Trasceivers Design and Performance Simulation Over Different Channel Models” Journal of Wireless Personal Communications, Volume 58, Pages 695-711, 2011.

[41] Waleed A Mahmoud, Ali A Ali, & Saad N Abdul Majed “Wavelet Based Multi Carrier Code Division Multiple Access” proceeding of 5th International Multi-Conference on Systems, Signals and Devices, pages 1-6, 2008.

[42] W. A. Mahmoud & I.K. Ibraheem “Image Denoising Using Stationary Wavelet Transform” Signals, Inf. Patt. Proc. & Class., vol 46, issue 4, pp. 1-18, 2003.

[43] Saleem MR Taha, Walid A Mahmood “New techniques for Daubechies wavelets and multiwavelets implementation using quantum computing” Journal Facta universitatis-series: Electronics and Energetics, Vol. 26, Issue 2, pp. 145-156, 2013.

[44] Ali Akram Abdul-Kareem, Waleed Ameen Mahmoud Al-Jawher, “Image Encryption Algorithm Based on Arnold Transform and Chaos Theory in the Multi-wavelet Domain”, International Journal of Computers and Applications, Vol. 45, Issue 4, pp. 306-322, 2023.

[45] Waleed Ameen Mahmoud Al-Jawher, “ Computation of Wavelet and Multiwavelet Transforms Using Fast Fourier Transform” Journal Port Science Research, Volume 4, Issue 2, Pages 111-117, 2021.

[46] Mutaz Abdulwahab and Hadeel Al-Taai Waleed Ameen Mahmoud, “New Fast Method for Computing Wavelet Coefficients from 1D up to 3D” 1st Int. Conference on Digital Communication and computer Application, pp. 313-323, 2007.

[47] RA Dihin, E AlShemmary, W Al-Jawher “Implementation Of The Swin Transformer and Its Application In Image Classification” Journal Port Science Research 6 (4), 318-331, 2023.

[48] KN Kadhim, SMR Taha, WA Mahmoud “A new method for filtering and segmentation of the ECG signal” Proceedings of the Annual International Conference of the IEEE Engineering

[49] AH Kattoush, W Al-Jawhar, A Ghodayyah “Multiwavelet computed Radon-based OFDM transceiver designed and simulation under different channel conditions” Journal of Information & Computing Science 5 (2), 133-145.

[50] AA Abdul-Kareem, WAM Al-Jawher “URUK 4D DISCRETE CHAOTIC MAP FOR SECURE COMMUNICATION APPLICATIONS” Journal Port Science Research 5 (3), 131-142.

[51] SM Salih, WA Mahmoud “Modified OFDM model based on sub-sliding window technique” Computer and Information Science 4 (1), 100>

Saadi, S. M. ., & Al-Jawher, W. A. M. . (2023). Image Fake News Prediction Based on Random Forest and Gradient-boosting Methods. Journal Port Science Research, 6(4), 357–364. https://doi.org/10.36371/port.2023.4.6

Downloads

Download data is not yet available.

Most read articles by the same author(s)

1 2 > >>