Spam Email Categorization with NLP and using Federated Deep Learning
conference contribution
posted on 2024-11-03, 14:59authored byIkram Ul Haq, Paul Black, Iqbal GondalIqbal Gondal, Joarder Kamruzzaman, Paul Watters, A Kayes
Emails are the most popular and efficient communication method that
makes them vulnerable to misuse. Federated learning (FL) provides a decentral-
ized machine learning (ML) model, where a central server coordinates clients that
collaboratively train a shared ML model. This paper proposes Federated Phishing
Filtering (FPF) technique based on federated learning, natural language process-
ing, and deep learning. FL for intelligent algorithms fuses trained models of ML
algorithms from multiple sites for collective learning. This approach improves ML
performance by utilizing large collective training data sets across the corporate
client base, resulting in higher phishing email detection accuracy. FPF techniques
preserve email privacy using local feature extraction on client email servers. Thus,
the contents of emails do not need to be transmitted across the network or stored on
third-party servers. We have applied FL and Natural Language Processing (NLP)
for email phishing detection. This technique provides four training modes that per-
form FL without sharing email content. Our research categorizes emails as benign,
spam, and phishing. Empirical evaluations with publicly available datasets show
that accuracy is improved by the use of our Federated Deep Learning model
History
Start page
15
End page
27
Total pages
13
Outlet
Lecture Notes in Computer Science book series (LNAI,volume 13726)