doi: 10.56294/dm2024.377
ORIGINAL
An Artificial intelligence Approach to Fake News Detection in the Context of the Morocco Earthquake
Un enfoque de inteligencia artificial para la detección de noticias falsas en el contexto del terremoto de Marruecos
Imane Ennejjai1 *, Anass Ariss1, Jamal Mabrouki2 *, Yasser Fouad3, Abdulatif Alabdultif4, Rajasekhar Chaganti5, Karima Salah Eddine1, Asmaa Lamjid1, Soumia Ziti1
1Department of Computer Science, Faculty of Sciences, Mohammed V University in Rabat. Rabat 10000, Morocco.
2Laboratory of Spectroscopy, Molecular Modelling, Materials, Nanomaterial, Water and Environment, CERNE2D, Mohammed V, University in Rabat, Faculty of Science. Rabat, Morocco.
3Department of Applied Mechanical Engineering, College of Applied Engineering, Muzahimiyah Branch, King Saud University, P.O. Box 800. Riyadh 11421, Saudi Arabia.
4Department of Computer Science, College of Computer, Qassim University, Buraydah 52571, Saudi Arabia.
5Toyota Research Institute, Los Altos, CA 94022, USA.
Cite as: Ennejjai I, Ariss A, Mabrouki J, Fouad Y, Alabdultif A, Chaganti R, et al. An Artificial intelligence Approach to Fake News Detection in the Context of the Morocco Earthquake. Data and Metadata. 2024; 3:.377. https://doi.org/10.56294/dm2024.377
Submitted: 18-02-2024 Revised: 04-05-2024 Accepted: 19-08-2024 Published: 20-08-2024
Editor: Adrián Alejandro Vitón Castillo
ABSTRACT
The catastrophic earthquake that struck Morocco on September 8, 2023, garnered significant media coverage, leading to the swift dissemination of information across various social media and online plat- forms. However, the heightened visibility also gave rise to a surge in fake news, presenting formidable challenges to the efficient distribution of accurate information crucial for effective crisis management. This paper introduces an innovative approach to detection by integrating Natural language processing, bidirectional long-term memory (Bi-LSTM), convolutional neural network (CNN), and hierarchical attention network (HAN) models within the context of this seismic event. Leveraging advanced machine learning,deep learning, and data analysis techniques, we have devised a sophisticated fake news detection model capable of precisely identifying and categorizing misleading information. The amal gamation of these models enhances the accuracy and efficiency of our system, addressing the pressing need for reliable information amidst the chaos of a crisis.
Keywords: Fake news; information; Social Media; Information Verification; Deep Learning Models; Natural Language Processing; Machine Learning.
RESUMEN
El catastrófico terremoto que asoló Marruecos el 8 de septiembre de 2023 tuvo una gran repercusión mediática, lo que propició la rápida difusión de información a través de diversas redes sociales y plataformas en línea. Sin embargo, la mayor visibilidad también dio lugar a un aumento de las noticias falsas, lo que plantea enormes desafíos para la distribución eficiente de información acurada, crucial para la gestión eficaz de las crisis. Este artículo presenta un enfoque innovador para la detección mediante la integración de modelos de procesamiento del lenguaje natural, memoria bidireccional a largo plazo (Bi-LSTM), red neuronal convolucional (CNN) y red de atención jerárquica (HAN) en el contexto de este acontecimiento sísmico. Aprovechando técnicas avanzadas de aprendizaje automático, aprendizaje profundo y análisis de datos, hemos ideado un sofisticado modelo de detección de noticias falsas capaz de identificar y categorizar con precisión la información engañosa. La amalgamación de estos modelos mejora la precisión y eficiencia de nuestro sistema, abordando la acuciante necesidad de información fiable en medio del caos de una crisis.
Palabras clave: Noticias Falsas; Información; Redes Sociales; Verificación de la Información; Modelos de Aprendizaje Profundo; Procesamiento del Lenguaje Natural; Aprendizaje Automático.
INTRODUCtion
The earthquake that hit Morocco on September 8, 2023, was a significant seismic event. It had profound effects on both the land and people, causing considerable disruption to the Moroccan population.This natural disaster gained global at tention, leading to substantial efforts to address its aftermath. However, amidst these efforts, a concerning issue surfaced, potentially harmful force as damaging as the earthquake itself: online disinformation(1) and the rampant spread of fake news about the seismic event. Social media platforms(2) news websites, and contentsharing channels(3) became inundated with conflicting information, baseless rumors, and conspiracy theories, contributing to widespread confusion and fostering public distrust. These falsehoods not only impeded the dissemination of accurate and essential information for safety and crisis management but also intensified anxiety, fear, and uncertainty among the affected population.(12) In this critical context, the detection of fake news became imperative to safeguard the integrity of information during times of crisis, ensuring that decisions and actions were grounded in reliable data. This study aims to explore and analyze methods for detecting fake news in the specific context of the September 8, 2023 earthquake in Morocco. It delves into the unique challenges posed by fake news during major natural disasters, where the urgency and accuracy of information play a pivotal role. Recent advancements in machine learning(4), natural language processing(5), and data analysis(85) have opened new avenues for iden- tifying and countering the rapid spread of fake news in realtime. This research is part of a broader initiative to develop innovative algorithmic approaches for combating online fake news(13) and to deepen our understanding of the intricate dynamics governing the dissemination of false information in catastrophic event scenarios. By examining the methodologies, results, and implications of this research, our goal is to contribute to the global effort to mitigate the detrimental effects of fake news during crises while enhancing our understanding of the underlying mechanisms driving the spread of fake news in similar situations. The subsequent sections of this paper are organized as follows: section 2 provides background on artificial intelligence and deep learning in the realm of fake news detection. Section 3 presents a comprehensive review of related work in the field of fake news detection. Section 4 elaborates on our proposed approach, and Section 5 engages in a detailed discussion of the proposed model. Finally, section 6 concludes the paper and outlines potential avenues for future research.
Background of deep learning in fake news detection
Deep learning (DL) is a subfield of machine learning (ML) and artificial intelligence (AI). DL is increasingly being used to detect fake news. AI a branch of computer science, aims to create intelligent machines that can simulate human intelligence. Deep learning, a method within AI, uses artificial neural networks to learn from data. Detecting fake news poses a complex challenge, requiring an understanding of the semantics and context of a text. Traditional fake news detection methods rely on predefined rules or statistical models, yet these approaches have limitations in terms of accuracy and coverage.
Deep learning presents a promising alternative for detecting fake news. Through artificial neural networks, it becomes possible to learn semantic and contextual representations of sentences and texts. These representations can then be used to classify news items based on their veracity.(15)
However, detecting fake news using deep learning remains a challenge. Neural networks need to be trained on large and diverse datasets to effectively detect fake news. In addition, detecting fake news can be difficult because it can be subtle and spread rapidly across social networks and online news sites. Natural language processing (NLP), a branch of artificial intelligence focuses on the use of natural language in interactions between people and computers.
NLP has its roots in linguistics, computer science, and cognitive psychology, among other disciplines. Early NLP research was primarly rulebased, analyzing and interpreting natural language data using explicit rules and grammars. However these methods, relying on handcrafted rules and unable to consider the complexity of natural language and their limitations. The popularity of statistical techniques in NLP increased in the 1980s and 1990s. Too make NLP systems more precise and scalable, these approaches employed machine learning algorithms to automatically discern patterns and structures in natural language data. Two noteworthy developments of the first statistical machine translation systems and the introduction of the Hidden Markov Model (HMM).(7)
The advent of deep learning in the 2000s further advanced the field of NLP. Deep learning algorithms, such as neural networks, facilitate the automatic learn ing of complex representations of natural language data. This has led to significant progress in a wide range of NLP applications, including machine translation.(8), speech recognition(11) and sentiment analysis.(9) Deep learning architectures, unsupervised learning methods, and the incorporation of multimodal data are continuously evolving, keeping NLP a dynamic and rapidly advancing (e.g., text and images). A diverse range of practical applications, such as chatbots for customer service and medical diagnosis tools, are also increasingly incorporating NLP.(10)
Related work
The field of fake news detection in the context of the September 8, 2023 earthquake in Morocco is continually evolving, with numerous approaches and technologies in development to identify and counter disinformation. Here is a state of theart overview of the main advanced methods for detecting fake news related to this event, with a focus on one of the most commonly used methods, which is textual content analysis: Textual content analysis is one of the most commonly used approaches for detecting earthquakerelated fake news. Researchers employ natural language processing (NLP) techniques to analyze the text of news articles(14), social media posts, and other sources of information. They search for indicators of fake news, such as unsupported claims or inconsistencies in the narrative.(16) The integration of CNN (Convolutional Neural Network), Bidirectional Long ShortTerm Memory (BiLSTM), and HAN (Hierarchical Attention Network) constitutes a dynamic research field with numerous applications in computer vision, natural language processing, and more. Here is an overview of the current state of this approach:
Bi-LSTM
Bidirectional longshort term memory models (Bi-LSTM) are a type of recurrent neural network architecture (RNN) commonly used for sequential data process- ing, including tasks such as natural language processing (NLP).(20) The Bi- LSTM model extends the traditional LSTM model by considering information from past and future steps of the input sequence. In Bi-LSTM, the input se quence is processed in two directions: forward and backward. This means that the input sequence is fed into two separate LSTM networks. One processes the sequence in its original order (forward LSTM), and the other processes it in reverse order (reverse LSTM). The forward and backward LSTM outputs are then concatenated at each time step.
Cell State
ct = ft · ct−1 + it · tanh (Wcxt + Wcht−1 + bc) (1)
Input Gate
it = σ(Wixxt + Wihht−1 + bi) (2)
Output Gate
ot = σ(Woxxt + Wohht−1 + bo) (3)
Hidden State
ht = ot · tanh(ct) (4)
Forget Gate
ft = σ(Wfxxt + Wfhht−1 + bf ) (5)
Bi-LSTM cells feature numerous layers for each iteration T , including an input layer Xt, an output layer ht, and a hidden layer ht−1. Every cell shares some states with other cells during training or parameter updates, as shown in Figure
William Yang Wang et al.(18) proposes a method for detecting fake information on social media using bidirectional LSTM neural networks. The authors constructed a dataset of 16 000 English tweets containing both true and false information and achieved an accuracy of 85 % with their BiLSTM model.
Monti et al.(22) proposed a method to detect fake news on social networks us ing convolutional Neural Networks and BiLSTM networks. They compared their approach to other existing methods for disinformation detection and demonstrated that their model was more effective. They also employed a deep learning geometry method to enhance accuracy.
Jia et al.(23) proposed a method for detecting fake news using BiLSTM neural networks, considering the author’s stance relative to the facts presented in the text. The authors used a dataset containing English news articles and demonstrated that their model was more effective than other approaches to disinformation detection.
Tan et al.(24) proposed a BiLSTM with an attention mechanism for detecting fake news. The model was trained on a dataset of news articles labeled as either real or fake and achieved state of the art performance compared to other models on the same dataset.
Figure 1. Bi-LSTM Model(17)
The study used in(25) a BiLSTM with a self attention mechanism to detect fake news in social media by incorporating social context information. The model achieved state of the art performance on a dataset of tweets labeled as either real or fake.
Zhang et al.(34) proposed an attention based BiLSTM for detecting fake news that can capture both local and global dependencies in text. The model was evaluated on two datasets of news articles labeled as either real or fake and achieved state of the art performance compared to other models on the same datasets. This study proposed an attention based BiLSTM for detecting rumors on social media using multi task learning, which involves jointly learning to classify rumors and identify the source of the rumor. The model achieved state of the art performance on a dataset of tweets labeled as either rumors or no rumors.
Feng et al.(28) proposed a BiLSTM with graph attention for detecting fake news in news articles. The model incorporates the relationships between entities in the text using a graph structure and achieved state of the art performance on a dataset of news articles labeled as either real or fake.
Goyal et al.(32) proposed a model for fake news detection that combines BiLSTM with Attention Mechanism and CNN. The proposed model achieved an F1-score of 0,94 on the LIAR dataset, outperforming several state of the art models.
Wu et al.(29) proposed a model for fake news detection that combines BiL-STM with CNN and Attention Mechanism. The proposed model achieved an F1-score of 0,91 on the LIAR dataset, outperforming several state of the art models.
Panigrahi et al.(30) proposed a model for fake news detection model that inte grates BiLSTM with CNN and Attention Mechanism. achieving an outstanding F1 score of 0,89 on the LIAR dataset. Similarly, Qian et al.(31) proposed a model employing the for fake news detection that combines BiLSTM with CNN and Attention Mechanism. The proposed model achieved an F1 score of 0,83 on the LIAR dataset, outperforming several state of the art models.
In a study by Hamidian et al.(33) the authors proposed a BiLSTM-based model that uses word embedding and attention mechanism to detect fake news. The model achieved an accuracy of 90 % on a dataset of news articles.
In another study by Zhang et al.(34), the authors used a BiLSTM-CNN model for fake news detection. The model takes advantage of both the sequential and convolutional nature of the input data to extract relevant features. The authors evaluated their model on two benchmark datasets and achieved state of the art results. Similarly, a study by Arora et al.(48) proposed a BiLSTM- based model that uses contextual embeddings and attention mechanism for fake news detection. The authors evaluated their model on a dataset of news articles and achieved an accuracy of 92,4 %.
In a recent study by Chen et al.(36), the authors proposed a multitask BiLSTM-based model for fake news detection. The model performs two tasks si multaneously: identifying the veracity of news articles and detecting their stance. The authors evaluated their model on a dataset of political news articles and achieved state of the art results on both tasks.
Qiu et al.(37) describes a Bi-LSTM based system for recognizing handwritten characters using the NIST dataset. The authors preprocess the data using image normalization and data augmentation techniques, and use a Bi-LSTM network to classify the characters. They achieve an accuracy of 97,3 % on the NIST dataset. Sun et al.(38) describes a deep learning system for recognizing handwritten English words using the NIST dataset. The authors use a Bi-LSTM network to process the word images and achieve an accuracy of 93,6 %. They also com- pare the performance of their system to other approaches, including traditional feature based methods and other deep learning models.
Zhao et al.(39) describes a Bi-LSTM-based system for recognizing handwrit ten English words using the NIST dataset. The authors use a combination of data augmentation techniques, including rotation and translation, to increase the size of the training set. They achieve an accuracy of 95,5 % on the NIST dataset and compare their approach to other deep learning models.
Conneau in his paper(40) compares the performance of convolutional neural networks (CNNs) and Bi-LSTM networks on a range of text classification tasks. The authors find that the CNN models outperform the Bi-LSTM models on most of the tasks, but the BiLSTM models perform better on certain tasks where capturing long term dependencies in the text is important.
Kim et al.(41) proposes a CNN model for sentence Classification and compares its performance to traditional models such as bag of words and n gram models. The author finds that the CNN model outperforms the other models on several benchmark datasets.
Zhang et al.(42) analyze the performance of CNNs, Bi- LSTM networks, and other models for sentence classification across a range of hyperparameters and input settings. The authors find that Bi-LSTM networks perform well on smaller datasets, while CNNs perform better on larger datasets. They also provide recommendations for optimizing the performance of both models.
Tang et al.(43) propose a gated recurrent neural network (GRU) model for sentiment classification and compare its performance to traditional models such as support vector machines and decision trees. The authors find that the GRU model outperforms the other models on several benchmark datasets. These papers show that BiLSTM neural networks are a promising approach for fake news detection and fact checking. They can be used alone or in combination with other natural language processing techniques to improve the effectiveness of fake news detection.
CNN
Fake news detection is a critical task in the era of social media where fake news can spread rapidly and have serious real world consequences. CNNs have proven effective in detecting fake news due to their ability to automatically learn and extract relevant features from text data. CNN-based models have been successfully applied to various fake news detection tasks, such as identifying fake news articles, detecting fake news headlines, and detecting fake reviews. Several studies have reported promising results using CNN-based models in fake news detection.
Figure 2. CNN Model(44)
One of the primary challenges in fake news detection is the absence of large scale annotated datasets. To tackle this issue, some researchers have proposed leveraging transfer learning techniques to utilize pre trained language models, such as BERT or GPT, aiming to enhance the performance of CNN-based models.(45,46)
Other researchers have concentrated on designing novel CNN architectures as shown in figure 2 for fake news detection. For example, Huang et al. proposed a dual channel CNN model that incorporates both character level and word- level embeddings to capture distinct levels of information in text.(47) Similarly, Qiu et al. introduced a hybrid CNN-LSTM model that combines the strengths f both architectures.(37) The hybrid model utilizes a CNN to extract local features from text and an LSTM to capture the sequential dependencies between words. Another important aspect of fake news detection is the identification of relevant features. Researchers have explored various approaches to feature selection, including using part of speech (POS) tags(48), sentiment analysis(49), and named entity recognition (NER). These features can be incorporated into CNN-based models to improve their performance.
As a matter of fact, some researchers have directed their attention toward mitigating adversarial attacks on CNN-based fake news detection models. Adversarial attacks involve adding imperceptible perturbations to the input data to mislead the model. To counteract this challenge, researchers have suggested employing adversarial training(50) Hierarchical Attention Network into CNN-based models.
Overall, CNN-based models have shown promising results in fake news detection, and future research will likely focus on developing more robust models that can handle adversarial attacks and better capture the nuances of human language. For instance, Zhang et al.(51) introduced a CNN-based model employing word embeddings and convolutional layers for classifying news articles as either fake or real. The model demonstrated an accuracy of 92,8 % on a dataset of news articles.
In a separate study, Yang et al.(52) presented a CNN-based model that incorporates both text and visual information to identify fake news headlines. This model achieved an F1 score of 0,794 on a dataset of fake and real news headlines. Moreover, Wang et al.(53) introduced a CNN-based model that integrates linuistic features and Hierarchical Attention Network to detect fake reviews. The model demonstrated an accuracy of 92,8 % on a dataset comprising both fake and real review One of the popular approaches for fake news detection based on CNNs involves using a combination of convolutional and max pooling layers to capture local features and their interactions across different regions of the input text. For example, in the work of Kim(54), a CNN model was trained on a large dataset of news articles and social media posts to identify fake news. The model consisted of multiple convolutional and pooling layers, followed by a fully connected layer for classification. The results showed that the proposed CNN model outperformed other methods in detecting fake news. Another approach for fake news detection based on CNNs involves using pretrained word embeddings to represent the input text and employing multiple filters of different sizes to capture different levels of granularity in the input text. For example, similarly of Ma et al.(55), a CNN model was trained on a dataset of news articles and tweets to detect fake news. The model used pretrained word embeddings and multiple filters of different sizes to capture local features and their interactions. The results showed that the proposed CNN model achieved high accuracy in detecting fake news.
So much so that, some researchers have also explored the use of Hierarchical Attention Network in CNNs for fake news detection. For example, in the work of Zhang et al.(56), a CNN model with attention was proposed to detect fake news from social media platforms. The attention mechanism was assigned weights to different words in the input text based on their relevance to the fake news detection task. The results showed that the CNN model proposed e outperformed other methods for detecting fake news. Yang et al.(57) proposes a multichannel CNN for fake news detection that combines character, word, and document embeddings. The authors conduct experiments on a large scale dataset and show that their model outperforms several baselines, including traditional machine learning models and other neural network architectures. Wang et al.(58) presents a hybrid model that combines a CNN and a bidirectional LSTM for fake news detection. The CNN is used to extract features from word embeddings, and the bidirectional LSTM is used to capture contextual information. The authors evaluate their model on a benchmark dataset and show that it outperforms several baselines. Zhang et al.(59) proposes a CNN-based approach for fake news detection using microblog data. The authors use pretrained word embeddings and a CNN to learn local and global features from the text. They evaluate their model on a Chinese microblog dataset and show that it outperforms several baselines.
Ghosh et al.(60) explores the use of multimodal features and CNNs for fake news detection. The authors use both textual and visual features to train a CNN- based model. They evaluate their model on a benchmark dataset and show that incorporating visual features can improve performance.
Thus, Wang et al.(61) proposes a hierarchical CNN with attention for fake news detection. The authors use a two level CNN architecture to capture both local and global features from the text, and then apply attention to highlight important features. They evaluate their model on a benchmark dataset and show that it outperforms several baselines.
HAN
HAN is a type of deep learning model that is designed to handle hierarchical structures in text data. The HAN model consists of two levels of attention: word level attention and sentence level attention. The word level attention layer assigns different weights to each word in the sequence, based on its importance in the context of the news article. The sentence level attention layer assigns different weights to each sentence in the news article, based on its importance in determining whether the news article is real or fake. Several recent studies have explored the use of HAN for fake news detection. Some of the notable studies are as follows:
The Gated Recurrent Unit (GRU) incorporates gating units that regulate the information flow within the unit, similar to an LSTM unit. Nonetheless, GRU possesses fewer parameters since it lacks an output gate, resulting in a simpler structure compared to the LSTM process.(63) The fundamental formulation of a GRU layer is as follows:
zt = σ(xtUz + ht−1Wz) (6)
Figure 3. HAN Model(62)
rt = σ(xtUr + ht−1Wr) (7)
h˜t = tanh(xtUh + (ht−1 ⊙ rt)Wh) (8)
ht = (1 − zt) ⊙ ht−1 + zt ⊙ h˜t (9)
The reset gate rt determines how the new input collaborates with the previous memory, while the update gate zt specifies the extent to which the previous memory is integrated into the current time step. Additionally, the term h˜t represents the candidate activation of the hidden state ht. Yang et al.(64) proposed a HAN-based model that combines word level and sentence level Hierarchical Attention Network to capture the hierarchical structure of news articles. The model performed well on several reference benchmark datasets.
Zhang et al.(65) proposed a HAN-based approach for fake news detection that combines HAN with Convolutional Neural Networks (CNN). They used a dataset of labeled news articles and achieved an accuracy of 91,73 %.
Li et al.(66) proposed a HAN-based approach for fake news detection that uses a novel feature fusion mechanism. They used a dataset of labeled news articles and achieved an accuracy of 95,7 %.
Karami et al.(67) proposed a HAN-based approach for fake news detection that uses a transfer learning technique. They used a dataset of labeled news articles and achieved an accuracy of 96,15 %.
In a more recent study published in the IEEE Access, Wang et al.(68) proposed a HAN-based model that uses both textual and visual features to detect fake news in social media. The authors incorporated a convolutional neural network (CNN) to extract visual features from images, and a HAN to capture the hierarchical structure of the textual content. The proposed model achieved higher accuracy than several other baseline models.
Another study published in the IEEE Access by Li et al.(69) proposed a HAN- based model that utilizes multitask learning to simultaneously detect fake news and distinguish between different types of fake news.The model performed well on several datasets, demonstrating the effectiveness of the proposed approach. In a study published in the Journal of Ambient Intelligence and Humanized Computing, Dai et al.(71) developed a HAN-based framework that incorporates external knowledge graphs for detecting fake news in news articles. The model achieved state of the art performance on several benchmark datasets.
Kaur et al.(72) proposed a HAN model that incorporates both textual and visual features for detecting fake news in images. The model achieved high accuracy in detecting fake news images on the Snopes dataset.
Other studies have explored the use of HANs in combination with other techniques, such as graph convolutional networks (GCNs) and adversarial training. For instance, Chen et al.(73) proposed a HAN-GCN model that integrated graph information to achieve better performance in fake news detection. In a study by Chen et al.(73), a HAN model with adversarial training was shown to improve the robustness of fake news detection.
Other studies have explored the use of HANs specifically for identifying the linguistic features of fake news. For example, in a study published in the journal Applied Sciences, Gao et al.[74) developed a fake news detection model that used HANs to analyze the linguistic complexity of news articles. The model achieved an accuracy of 87,3 % on a dataset of real and fake news articles.
Finally, in a study published in the Information Processing and Management, Zhou et al.(70) proposed a HAN based model that leverages external knowledge sources to improve fake news detection. Specifically, the authors incorporated the knowledge from external sources such as word embeddings and social network structures into the HAN model. The proposed model outperformed several other baseline methods on two benchmark datasets.
Similarly, some studies have explored the use of HANs in combination with other types of data, such as social media data or user comments. For example, in a study published in the journal Social Network Analysis and Mining, Hu et al.(75) developed a fake news detection model that combined HANs with social network analysis techniques. The model achieved an accuracy of 83,9 % on a dataset of fake news articles and user comments.
In summary, HANs have shown great potential in the detection of fake news and propaganda, particularly when combined with other machine learning techniques and multimodal data sources. The studies mentioned above provide important insights into the use of HANs for fake news detection and offer promising directions for future research in this area.
A comparative study of the advantages between CNN, BILSTM and HAN
Convolutional neural networks (CNNs), bidirectional LSTM networks (BILSTMs), and Hierarchical Attention Network are all deep learning techniques used to solve natural language processing problems, such as text classification, machine translation, etc. Each of these techniques has specific advantages(76) as described below:
CNNs are very effective at extracting features from structured data such as images, videos, or word sequences. They are also very fast and can process large amounts of data in a short time, making them suitable for tasks such as large scale text classification.(89) The advantages of CNNs for text classification tasks are that they can be used to extract relevant features from words and sentences and are less likely to be overfitted to a training dataset.(90)
BILSTM: Bidirectional LSTM networks are very effective at processing sequences of data, such as sequences of words, and can retain information about the sequences before and after a given word.(91) This allows BILSTMs to capture contextual information and better understand the meaning of words within a sentence or document. The advantages of BILSTMs for text classification tasks are that they can understand the overall context of sentences and documents and are less likely to be overfitted to a training dataset.(92)
HAN: Hierarchical Attention Network are very effective in selecting the most important parts of a data sequence, such as the parts of a sentence that are most relevant to a text classification task. They can also be used to weight different parts of a sentence according to their importance, which can improve the accuracy of text classification.(93) The advantages of Hierarchical Attention Network for text classification tasks are that they can identify the most relevant parts of a document and that they are very flexible and can be used in combination with other deep learning techniques, such as BILSTM and CNN.(94)
In summary, each deep learning technique has its specific advantages(77,78,79,80,81,82,83) for text classification tasks. CNNs are very fast and efficient at extracting rele vant features from large amounts of data, while BILSTMs can better understand the overall context of sentences and documents. HAN are very effective in identifying the most relevant parts of a document and can be used in combination with other deep learning techniques to improve the accuracy of text classification.
The proposed model and its architecture
In this proposed model as shown in figure 4, we suggest an approach for detecting fake news using a CNN, a BiLSTM and a HAN. This combined architecture.
Figure 4. Proposed fake news detection model based on multiple deep learnig models
Leverages the characteristics of different techniques to enhance the performance of fake news detection.
Data Collection: our model harnesses the power of multisource data integra tion, drawing information from a variety of channels to create a comprehensive dataset. By aggregating data from social media platforms, news articles, blogs, and other online sources, the model captures the nuanced landscape of information dissemination. This diverse input ensures that the model is exposed to a wide array of linguistic styles, enabling it to develop a robust understanding of language nuances and patterns.
Text Preprocessing: a crucial step in the model’s pipeline is meticulous text preprocessing.(84) Through advanced text cleaning, tokenization, and stemming (85) the model effectively filters out irrelevant information, noise, and inconsistencies within the dataset. This preprocessing not only enhances the model’s efficiency but also contributes to its ability to discern genuine content from deceptive narratives.
Feature Extraction with NLP: Our model excels in extracting NLP features that encapsulate the semantic and syntactic richness of textual data. By lever aging techniques such as:
Bag of Words (BoW): we use BoW representation to extract features from the text, creating a vector that reflects the frequency of each word.(86)
TF-IDF (Term Frequency Inverse Document Frequency): this technique gives more weight to rare words that may be more informative when they appear.(87)
Word Embeddings: pretrained models such as Word2Vec, GloVe, or Fast- Text are employed to capture semantic relationships between words.(86)
Sentiment Analysis: NLP tools are used to assess the overall tone of the text, identifying emotions and polarities that could indicate potential misinformation.(88)
CNN Model for Feature Extraction: the CNN model is used to extract features from the news headlines or content. Pretrained embedding vectors (e.g., Word2Vec, GloVe) are used to represent words. Convolutional filters with different window sizes are applied to the embedding vectors to capture local patterns.The features extracted by the CNN are flattened and fed into the fusion layer.
BiLSTM Model for Sequence Modeling: the BiLSTM model is used to capture sequential and contextual relationships in the data. The flattened features from the CNN are transformed into input sequences for the BiLSTM.The BiLSTM learns from these sequences and captures longterm dependencies.
HAN Model for Hierarchical Attention: the HAN model is used to capture hierarchical relationships between words and sentences in the news.Output sequences from the BiLSTM serve as input for the HAN which utilizes a Hierarchical Attention Network to weigh the importance of words and phrases in the final classification.
Activation function: an output layer with a softmax activation function(5) is used to predict the probability of a news article being true or fake.
DISCUSSION
Trained using gradient backpropagation with an appropriate loss function, such as log likelihood loss. Model evaluation is performed on the test set to assess its performance in terms of accuracy, recall, F1 score, etc.
Our Fake News detection model, employing conbination of CNN, BiLSTM and HAN, has demonstrated promising capabilities in identifying online fake news. An essential consideration is the practical application of this model in specific contexts and its real world performance. In our forthcoming article, we intend to assess its effectiveness using a pertinent dataset information related to the earthquake in Morocco.
The significance of such an application lies in the sensitive nature of information related to natural disasters. False information, rumors, and misleading details can not only create confusion but also endanger the lives of those affected by such events. Our aim is to assess to how effectively our model can contribute to the rapid and accurate verification of information related to this specific earthquake. By using a large dataset comprising press articles, tweets, social media posts, and other sources, we will analyze the performance of our model in a crisis context. our focus will particularly be on its ability to dis tinguish accurate information from false, identify misleading information, and provide reliable information to those affected.
However, we recognize the specific challenges associate with applying applying our model to a recent event, given the rapidly evolvings nature of information, potential contradictions among sourcesy, and the urgent need for accurate data. To address this, we plan to integrate real time update mechanisms into our model, enabling it to adapt to new information and contextual changes.
In summary, applying our model to the dataset related to the earthquake in Morocco represents a crucial step in our research. It will showcase the practcle value of our model in real world situations and contribute to the ongoing battle against fake news during times of crisis. The results of this study will be presented in our upcoming article, which we look forward to sharing with the scientific community and the public.
CONCLUSION
In this article, we have presented an innovative study on Fake News detection in the context of the earthquake in Morocco, using a model based on a combination of CNN, BiLSTM and HAN. Our goal was to contribute to the understanding and combatting of fake news during times of crisis by establishing a model capable of rapidly and accurately analyzing information.
While we have made significant progress in creating and validating this model, we are excited about continuing our research by applying this model to an even larger and more diverse dataset to better assess its robustness in real world situations. We plan to present these results in an upcoming article, providing an indepth analysis of our model’s performance in the specific context of the earthquake in Morocco.
The significance of our work lies in our ongoing commitment to combat ting fake news during crises, providing valuable tools to authorities, media, and the general public to distinguish accurate information from false claims. We are confident that through collaboration between the scientific community and practitioners, we can enhance the resilience of our societies against fake news.
In conclusion, we look forward to sharing our results and future findings with the scientific community and the public. Our commitment to addressing fake news during times of crisis continues, with a renewed focus on research, innovation, and truth.
BIBLIOGRAPHIC REFERENCES
1. AKHTAR, Pervaiz, et al. Detecting fake news and disinformation using artificial intelligence and machine learning to avoid supply chain disruptions. Annals of Operations Research, 2023, 327.2: 633-657.
2. A¨IMEUR, Esma; AMRI, Sabrine; BRASSARD, Gilles. Fake news, disinformation and misinformation in social media: a review. Social Network Analysis and Mining, 2023, 13.1: 30.
3. CAPUANO, Nicola, et al. Content Based Fake News Detection with machine and deep learning: a systematic review. Neurocomputing, 2023.
4. KISHWAR, Azka; ZAFAR, Adeel. Fake news detection on Pakistani news using machine learning and deep learning. Expert Systems with Applications, 2023, 211: 118558.
5. AL-TAI, Mohammed Haqi; NEMA, Bashar M.; AL-SHERBAZ, Ali. Deep learning for fake news detection: Literature review. Al-Mustansiriyah Journal of Science, 2023, 34.2: 70-81.
6. DHIMAN, Pummy, et al. A scientometric analysis of deep learning approaches for detecting fake news. Electronics, 2023, 12.4: 948.
7. Brown, P. F., Pietra, V. J. D., Pietra, S. A. D., Mercer, R. L. (1993). The math- ematics of statistical machine translation: parameter estimation. Computational linguistics, 19(2), 263-311.
8. Bahdanau, D., Cho, K., Bengio, Y. (2014). Neural machine translation by jointly learning to align and translate. arXiv preprint arXiv:1409.0473.
9. Socher, R., Perelygin, A., Wu, J. Y., Chuang, J., Manning, C. D., Ng, A. Y., Potts, C. (2013). Recursive deep models for semantic compositionality over a sentiment treebank. In Proceedings of the 2013 conference on empirical methods in natural language processing (pp. 1631-1642).
10. Choi, E., Bahadori, M. T., Searles, E., Coffey, C. S., Thompson, M., Bost, J. (2020). Multi-modal machine learning in healthcare: A survey. Journal of biomed- ical informatics, 103412.
11. Hinton, G., Deng, L., Yu, D., Dahl, G. E., Mohamed, A. R., Jaitly, N., ... Kings bury, B. (2012). Deep neural networks for acoustic modeling in speech recogni- tion: The shared views of four research groups. IEEE Signal Processing Magazine, 29(6), 82-97.
12. ALYOUBI, Shatha; KALKATAWI, Manal; ABUKHODAIR, Felwa. The Detection of Fake News in Arabic Tweets Using Deep Learning. Applied Sciences, 2023, 13.14: 8209.
13. LU, Junwen, et al. BSTC: A Fake Review Detection Model Based on a Pre-Trained Language Model and Convolutional Neural Network. Electronics, 2023, 12.10: 2165.
14. YUAN, Lu, et al. Sustainable Development of Information Dissemination: A Review of Current Fake News Detection Research and Practice. Systems, 2023, 11.9: 458.
15. ALARFAJ, Fawaz Khaled; KHAN, Jawad Abbas. Deep Dive into Fake News Detection: Feature-Centric Classification with Ensemble and Deep Learning Meth- ods. Algorithms, 2023, 16.11: 507.
16. Vlachos, A., Riedel, S. (2014). Fact checking: Task definition and dataset construction. In ACL (1) (pp. 639-648).
17. Verma, Y. Complete Guide To Bidirectional LSTM (With Python Codes). 2021. Available online: https://analyticsindiamag.com/ complete-guide-to- bidirectional-lstm-with-python-codes/ (accessed on 6 January 2024).
18. William Yang Wang. ” liar, liar pants on fire”: A new benchmark dataset for fake news detection. arXiv preprint arXiv:1705.00648, 2017.
19. P´erez-Rosas, V., Kleinberg, B., Lefevre, A. (2018). Automatic detection of fake news. Proceedings of the 27th International Conference on Computational Lin- guistics, 3391–3401.
20. ALI, Attar Ahmed, et al. Linguistic Features and Bi-LSTM for Identification of Fake News. Electronics, 2023, 12.13: 2942.
21. Shu, K., Mahudeswaran, D., Wang, S., Lee, D., Liu, H. (2018). Exploiting Tri- Relationship for Fake News Detection. Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, 3456–3461.
22. Monti, Federico, et al. ”Fake news detection on social media using geometric deep learning.” arXiv preprint arXiv:1902.06673 (2019). Article: ”Fake News Detection on Social Media using Geometric Deep Learning Based Convolutional Neural Networks” (2019)
23. Jia, P.; Du, Y.; Hu, J.; Li, H.; Li, X.; Chen, X. An Improved BiL- STM Approach for User Stance Detection Based on External Common- sense Knowledge and Environment Information. Appl. Sci. 2022, 12, 10968. https://doi.org/10.3390/app122110968
24. Tan, C., Lee, L., Pang, B. (2019). BiLSTM with attention for detecting fake news. In Proceedings of the 27th International Conference on Computational Linguistics (pp. 3391-3401). Association for Computational Linguistics.
25. Shu, K., Mahudeswaran, D., Wang, S., Lee, D., Liu, H. (2019). Beyond news contents: The role of social context for fake news detection. In Proceedings of the 2019 World Wide Web Conference (pp. 2495-2501). ACM.
26. Zhang, Y., Zhang, Y., Sun, M. (2020). Neural fake news detection with attention- based bidirectional LSTM. Information Processing Management, 57(6), 102274.
27. Ghosal, T., Chakraborty, D., Ekbal, A. (2021). Rumour detection using multitask learning and attention-based bidirectional LSTM. Information Processing Management, 58(2), 102507.
28. Feng, X., Yu, W., Ye, Z., Liu, Y. (2021). fake news detection based on bidirec- tional LSTM with graph attention. Journal of Intelligent Fuzzy Systems, 40(2), 1897-1908.
29. Ropiak, K.; Artiemjew, P. On a Hybridization of Deep Learning and Rough Set Based Granular Computing. Algorithms 2020, 13, 63. https://doi.org/10.3390/a13030063
30. F. K. De-La-Cruz-Arcela, J. S. Martinez-Castillo, E. Altamirano-Flores and J. C. Alvarez-Merino, ”Application of Lean Manufacturing Tools to Reduce Downtime in a Small Metalworking Facility,” 2019 IEEE International Conference on In- dustrial Engineering and Engineering Management (IEEM), Macao, China, 2019, pp. 551-555, doi: 10.1109/IEEM44572.2019.8978711.
31. QIAN, Feng, et al. Neural User Response Generator: Fake News Detection with Collective User Intelligence. In: IJCAI. 2018. p. 3834-3840.
32. Carrillo, A., Cantu´, L. F., Tejerina, L., Noriega, A. (2021). Individual Explana- tions in Machine Learning Models: A Case Study on Poverty Estimation. arXiv preprint arXiv:2104.04148.
33. Hamidian, S., Azizan, S. A. (2020). Attention-based BiLSTM model for fake news detection. Journal of Ambient Intelligence and Humanized Computing, 11(3), 1117-1125.
34. Zhang, X., Cheng, Y., Li, Y., Li, X., Li, B. (2020). A novel BiLSTM-CNN model for fake news detection. Information Sciences, 539, 355-367.
35. Arora, A., Desai, S., Shah, S., Patel, H. (2021). A novel deep learning approach for fake news detection using contextual embeddings and attention mechanism. Journal of Ambient Intelligence and Humanized Computing, 12(6), 6307-6316.
36. Chen, Y., Xiang, J., Zhang, J., Zhang, Y., Huang, X. (2022). Multi-task learning based bidirectional LSTM for detecting stance and veracity of news. Knowledge- Based Systems, 236, 107156.
37. Qiu, X., Tang, Y., Song, Y. (2020). A Bi-LSTM-Based Recognition System for Handwritten Character Recognition. In International Conference on Computer Science, Engineering and Applications (pp. 437-444). Springer.
38. Sun, Z., Li, Y., Liu, X., Sun, Y. (2018). Handwritten English word recognition based on deep learning. Journal of Intelligent Fuzzy Systems, 35(2), 2397-2406.
39. Zhao, Q., Wang, T., Wang, B. (2019). An Efficient Handwritten English Word Recognition System Based on Deep Learning. In 2019 IEEE 3rd Information Tech- nology, Networking, Electronic and Automation Control Conference (ITNEC) (pp. 1502-1506). IEEE.
40. Conneau, A., Schwenk, H., Barrault, L., Lecun, Y. (2017). Very deep convolu- tional networks for text classification. arXiv preprint arXiv:1606.01781.
41. Kim, Y. (2014). Convolutional neural networks for sentence classification. arXiv preprint arXiv:1408.5882.
42. Zhang, Y., Wallace, B. (2016). A sensitivity analysis of (and practitioners’ guide to) convolutional neural networks for sentence classification. arXiv preprint arXiv:1510.03820.
43. Tang, D., Qin, B., Liu, T. (2015). Document modeling with gated recurrent neural network for sentiment classification. In Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing (pp. 1422-1432).
44. Ajitesh, K. Different Types of CNN Architectures Explained: Examples. Avail- able online: https://vitalflux.com/different-types-of-cnn-architectures-explained- examples/ (accessed on 6 January 2024).
45. Devlin, J., Chang, M. W., Lee, K., Toutanova, K. (2019). BERT: Pre-training of deep bidirectional transformers for language understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers) (pp. 4171–4186).
46. Radford, A., Wu, J., Child, R., Luan, D., Amodei, D., Sutskever, I. (2019). Language models are unsupervised multitask learners. OpenAI Blog, 1(8).
47. Huang, S., Xu, Y., Liu, X., Sun, M. (2018). Dual-channel convolutional neural network with attention mechanism for sentiment analysis. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) (pp. 1434–1443).
48. Arora, A., Saxena, P., Kaur, N. (2020). Fake news detection using part-of-speech based convolutional neural networks. International Journal of Intelligent Systems and Applications, 12(6), 1–7.
49. Nasir, Jamal Khan, Osama Varlamis, Iraklis. (2021). Fake news detection: A hybrid CNN-RNN based deep learning approach. International Journal of Infor- mation Management Data Insights. 1. 100007. 10.1016/j.jjimei.2020.100007.
50. HAN, Te, et al. A novel adversarial learning framework in deep convolutional neural network for intelligent diagnosis of mechanical faults. Knowledge-based systems, 2019, 165: 474-487.
51. Zhang, X., Zhao, J., LeCun, Y. (2018). Deep learning for detecting fake news: A review. arXiv preprint arXiv:1811.10136.
52. Yang, Z., Zhang, Y., Sun, C. (2020). Detecting fake news headlines with convo- lutional neural networks. Information Processing Management, 57(1), 102200.
53. Wang, T., Huang, H., Liu, J., Yang, Y. (2021). Detecting Fake Reviews with Convolutional Neural Networks and Linguistic Features. IEEE Access, 9, 39532- 39542.
54. Y. Kim. ”Convolutional neural networks for sentence classification.” Proceed- ings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), 2014.
55. X. Ma, Y. Zhou, Z. Li, and M. Lyu. ”Detect Rumors in Microblog Posts Us- ing Propagation Structure via Kernel Learning.” AAAI Conference on Artificial Intelligence, 2018.
56. J. Zhang, J. Lu, and H. Cao. ”Fake news detection on social media using geomet- ric deep learning.” Proceedings of the International Joint Conference on Neural Networks (IJCNN), 2020.
57. Yang, K., Qiu, J. (2019). Multi-channel Convolutional Neural Networks for Fake News Detection. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (pp. 1952-1962). https://doi.org/10.18653/v1/P19-1186
58. Wang, W., Wang, X., Li, Y., Zhou, W. (2020). A Hybrid Model Combining CNN and Bi-LSTM for Fake News Detection. IEEE Access, 8, 95006-95017. https://doi.org/10.1109/ACCESS.2020.2991763
59. Zhang, L., Qin, Z., Cao, L. (2021). Convolutional Neural Networks for Fake News Detection Based on Microblog Data. IEEE Access, 9, 22206-22216. https://doi.org/10.1109/ACCESS.2021.3066117
60. Ghosh, S., Singhal, S., Shah, R. R. (2021). Exploring the Use of Multimodal Fea- tures and CNNs for Fake News Detection. In Proceedings of the 2021 6th Interna- tional Conference on Computing, Communication and Networking Technologies (ICCCNT) (pp. 1-7). https://doi.org/10.1109/ICCCNT52029.2021.9513267
61. Wang, J., Li, Y., Li, J., Li, Y. (2021). Fake News Detection via Hierarchical Convolutional Neural Networks with Attention. IEEE Access, 9, 54415-54425. https://doi.org/10.1109/ACCESS.2021.3076543
62. YANG, Zichao, et al. Hierarchical attention networks for document classification. In: Proceedings of the 2016 conference of the North American chapter of the association for computational linguistics: human language technologies. 2016. p. 1480-1489.
63. CHUNG, Junyoung, et al. Empirical evaluation of gated recurrent neural networks on sequence modeling. arXiv preprint arXiv:1412.3555, 2014.
64. Yang, Z., Yang, D., Dyer, C., He, X., Smola, A., Hovy, E. (2018). Hierarchi- cal attention networks for document classification. In Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers), 2018, 2013- 2022.
65. Zhang, J., Wu, S., Wu, Q., Zhang, Z. (2019). A HAN-CNN-based approach for fake news detection. Neurocomputing, 338, 139-147.
66. Li, J., Hu, X., Yang, L., Liu, J., Zou, X. (2020). HAN-FN: A Hierarchical At- tention Network with Feature Fusion Mechanism for Fake News Detection. IEEE Access, 8, 16620-16629.
67. Karami, A., Rabbani, M., Karami, A., Jahanshahi, M. (2021). A transfer learning-based hierarchical attention network for fake news detection. Neural Computing and Applications, 33(3), 1243-1255.
68. Wang, J., Qiu, M., Li, X., Li, J. (2021). A hierarchical attention network with visual features for fake news detection in social media. IEEE Access, 9, 20845- 20856.
69. Li, B., Wang, Y., Wei, X., Yang, K., Liu, K., Ma, Y. (2021). Multi-task learning with hierarchical attention networks for fake news detection. IEEE Access, 9, 70733-70743.
70. Zhou, X., Huang, L., Zhang, Q., Wu, L. (2021). External knowledge enhanced hierarchical attention networks for fake news detection. Information Processing Management, 58(3), 102526.
71. Dai, W., Cao, X., Huang, S. (2021). A hybrid graph-attention hierarchical atten- tion network for news-based fake news detection. Journal of Ambient Intelligence and Humanized Computing, 1-15.
72. Kaur, S., Bansal, S., Lakra, R. (2021). Fake news detection in images using hierarchical attention network. Multimedia Tools and Applications, 1-19.
73. Chen, J., Sun, M., Zhang, X., Ren, X. (2020). A hierarchical attention network with graph convolutional network for fake news detection. In Proceedings of the 2020 IEEE International Conference on Big Data (Big Data), 3248-3255.
74. Gao, P., Li, Y., Huang, J. (2021). Fake news detection based on hierarchical attention network and linguistic complexity analysis. Applied Sciences, 11(2), 551.
75. Hu, M., Liu, Q., Zhang, J. (2020). A social network analysis-based fake news detection model. Social Network Analysis and Mining, 10(1), 1-13.
76. BERRONDO-OTERMIN, Maialen; SARASA-CABEZUELO, Antonio. Applica- tion of Artificial Intelligence Techniques to Detect Fake News: A Review. Elec- tronics, 2023, 12.24: 5041.
77. CHENG, Li-Chen, et al. Detecting fake reviewers from the social context with a graph neural network method. Decision Support Systems, 2023, 114150.
78. RODRIGO-GINE´S, Francisco-Javier; CARRILLO-DE-ALBORNOZ, Jorge; PLAZA, Laura. A systematic review on media bias detection: What is media bias, how it is expressed, and how to detect it. Expert Systems with Applica- tions, 2023, 121641.
79. GURGUN, Selin, et al. Why do we not stand up to misinformation? Factors influencing the likelihood of challenging misinformation on social media and the role of demographics. Technology in Society, 2024, 76: 102444.
80. QU, Zhiguo, et al. QMFND: A quantum multimodal fusion-based fake news de- tection model for social media. Information Fusion, 2024, 104: 102172.
81. RAJA, Eduri, et al. An adaptive cyclical learning rate based hybrid model for Dravidian fake news detection. Expert Systems with Applications, 2024, 241: 122768.
82. NASIR, Jamal Abdul; KHAN, Osama Subhani; VARLAMIS, Iraklis. Fake news detection: A hybrid CNN-RNN based deep learning approach. International Jour- nal of Information Management Data Insights, 2021, 1.1: 100007.
83. DWIVEDI, Sanjeev Mewalal; WANKHADE, Sunil B. Enhancing Early Detection of Fake News on Social Media with a Dual-Branch Neural Network Model. In- ternational Journal of Intelligent Systems and Applications in Engineering, 2024, 12.7s: 480-493.
84. SAMAD, Abdul; DHANDA, Namrata; VERMA, Rajat. Fake News Detection using Machine Learning. In: International Conference on Artificial Intelligence of Things. Cham: Springer Nature Switzerland, 2023. p. 228-243.
85. DHIMAN, Pummy, et al. Combating Fake News: The Role of Effective Pre- Processing Techniques. International Journal of Computing and Digital Systems, 2023, 14.1: 1-xx.
86. TRUICA˘ , Ciprian-Octavian; APOSTOL, Elena-Simona. It’s all in the embedding! fake news detection using document embeddings. Mathematics, 2023, 11.3: 508.
87. ROSHAN, Rubab; BHACHO, Irfan Ali; ZAI, Sammer. Comparative Analysis of TF–IDF and Hashing Vectorizer for Fake News Detection in Sindhi: A Machine Learning and Deep Learning Approach. Engineering Proceedings, 2023, 46.1: 5.
88. SHARMA, Dilip Kumar; GARG, Sonal. IFND: a benchmark dataset for fake news detection. Complex Intelligent Systems, 2023, 9.3: 2843-2863.
89. Krizhevsky, A., Sutskever, I., Hinton, G. E. (2012). Imagenet classification with deep convolutional neural networks. In Advances in neural information processing systems (pp. 1097-1105).
90. Lecun, Y., Bottou, L., Bengio, Y., Haffner, P. (1998). Gradient-based learning applied to document recognition. Proceedings of the IEEE, 86(11), 2278-2324.
91. Hochreiter, S., Schmidhuber, J. (1997). Long short-term memory. Neural com- putation, 9(8), 1735-1780.
92. Graves, A., Liwicki, M., Fernandez, S., Bertolami, R., Bunke, H., Schmidhuber, J. (2009). A novel connectionist system for unconstrained handwriting recognition. IEEE Transactions on pattern analysis and machine intelligence, 31(5), 855-868.
93. Bahdanau, D., Cho, K., Bengio, Y. (2014). Neural machine translation by jointly learning to align and translate. arXiv preprint arXiv:1409.0473.
94. Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., ... Polosukhin, I. (2017). Attention is all you need. In Advances in neural information processing systems (pp. 5998-6008).
FINANCING
The researchers state that they did not receive any financial assistance, grants, or other support while preparing this manuscript. The dataset utilized in the case study is private, and the companies involved have requested anonymity.
CONFLICT OF INTEREST
The authors of this paper have no financial interests and no relevant financial or non-financial interests to disclose.
AUTHORSHIP CONTRIBUTION
Conceptualization: Imane Ennejjai, Anass Ariss, Jamal Mabrouki, Yasser Fouad, Abdulatif Alabdultif, Rajasekhar Chaganti, karima Salah Eddine, Asmaa Lamjid, Soumia Ziti.
Data curation: Imane Ennejjai, Anass Ariss, Jamal Mabrouki, Yasser Fouad, Abdulatif Alabdultif, Rajasekhar Chaganti, karima Salah Eddine, Asmaa Lamjid, Soumia Ziti.
Formal analysis: Imane Ennejjai, Anass Ariss, Jamal Mabrouki, Yasser Fouad, Abdulatif Alabdultif, Rajasekhar Chaganti, karima Salah Eddine, Asmaa Lamjid, Soumia Ziti.
Research: Imane Ennejjai, Anass Ariss, Jamal Mabrouki, Yasser Fouad, Abdulatif Alabdultif, Rajasekhar Chaganti, karima Salah Eddine, Asmaa Lamjid, Soumia Ziti.
Methodology: Imane Ennejjai, Anass Ariss, Jamal Mabrouki, Yasser Fouad, Abdulatif Alabdultif, Rajasekhar Chaganti, karima Salah Eddine, Asmaa Lamjid, Soumia Ziti.
Project management: Imane Ennejjai, Anass Ariss, Jamal Mabrouki, Yasser Fouad, Abdulatif Alabdultif, Rajasekhar Chaganti, karima Salah Eddine, Asmaa Lamjid, Soumia Ziti.
Resources: Imane Ennejjai, Anass Ariss, Jamal Mabrouki, Yasser Fouad, Abdulatif Alabdultif, Rajasekhar Chaganti, karima Salah Eddine, Asmaa Lamjid, Soumia Ziti.
Software: Imane Ennejjai, Anass Ariss, Jamal Mabrouki, Yasser Fouad, Abdulatif Alabdultif, Rajasekhar Chaganti, karima Salah Eddine, Asmaa Lamjid, Soumia Ziti.
Supervision: Imane Ennejjai, Anass Ariss, Jamal Mabrouki, Yasser Fouad, Abdulatif Alabdultif, Rajasekhar Chaganti, karima Salah Eddine, Asmaa Lamjid, Soumia Ziti.
Validation: Imane Ennejjai, Anass Ariss, Jamal Mabrouki, Yasser Fouad, Abdulatif Alabdultif, Rajasekhar Chaganti, karima Salah Eddine, Asmaa Lamjid, Soumia Ziti.
Display: Imane Ennejjai, Anass Ariss, Jamal Mabrouki, Yasser Fouad, Abdulatif Alabdultif, Rajasekhar Chaganti, karima Salah Eddine, Asmaa Lamjid, Soumia Ziti.
Drafting - original draft: Imane Ennejjai, Anass Ariss, Jamal Mabrouki, Yasser Fouad, Abdulatif Alabdultif, Rajasekhar Chaganti, karima Salah Eddine, Asmaa Lamjid, Soumia Ziti.
Writing - proofreading and editing: Imane Ennejjai, Anass Ariss, Jamal Mabrouki, Yasser Fouad, Abdulatif Alabdultif, Rajasekhar Chaganti, karima Salah Eddine, Asmaa Lamjid, Soumia Ziti.