TY - JOUR
T1 - Machine-Learning-Based Spam Mail Detector
AU - Charanarur, Panem
AU - Jain, Harsh
AU - Rao, G. Srinivasa
AU - Samanta, Debabrata
AU - Sengar, Sandeep Singh
AU - Hewage, Chaminda Thushara
N1 - Publisher Copyright:
© 2023, Crown.
PY - 2023/11/8
Y1 - 2023/11/8
N2 - The proliferation of spam emails, a predominant form of online harassment, has elevated the significance of email in daily life. As a consequence, a substantial portion of individuals remain vulnerable to fraudulent activities. Despite Gmail’s “spam mail filtration system,” its effectiveness is not absolute. It occasionally misclassifies legitimate messages, leading to their confinement in the spam folder, or overlooks potentially harmful spam emails. This results in the occurrence of false positives. This research scrutinizes the historical data, cookies, caches, Session Restores, flash artifacts, and super cookies of Internet Explorer, Firefox, and Chrome on the Windows 10 platform. Data was collected through Google, Firefox, and Internet Explorer, operating within the Windows 10 environment. It has been observed that browsers store user behavior data on the host computer’s hard drive. The implications of this study hold substantial value for computer forensics researchers, law enforcement professionals, and digital forensics experts. The study leverages Python, alongside pertinent libraries such as pandas, Numpy, Matplotlib, scikit-learn, and flask, to facilitate its investigation. The experiment result and analysis show KN and NB algorithms have the best accuracy and precision score compared to other Algorithms.
AB - The proliferation of spam emails, a predominant form of online harassment, has elevated the significance of email in daily life. As a consequence, a substantial portion of individuals remain vulnerable to fraudulent activities. Despite Gmail’s “spam mail filtration system,” its effectiveness is not absolute. It occasionally misclassifies legitimate messages, leading to their confinement in the spam folder, or overlooks potentially harmful spam emails. This results in the occurrence of false positives. This research scrutinizes the historical data, cookies, caches, Session Restores, flash artifacts, and super cookies of Internet Explorer, Firefox, and Chrome on the Windows 10 platform. Data was collected through Google, Firefox, and Internet Explorer, operating within the Windows 10 environment. It has been observed that browsers store user behavior data on the host computer’s hard drive. The implications of this study hold substantial value for computer forensics researchers, law enforcement professionals, and digital forensics experts. The study leverages Python, alongside pertinent libraries such as pandas, Numpy, Matplotlib, scikit-learn, and flask, to facilitate its investigation. The experiment result and analysis show KN and NB algorithms have the best accuracy and precision score compared to other Algorithms.
KW - Cookies
KW - E-mail
KW - Inbox
KW - Spam mail
KW - Trap
KW - Trash folder
UR - http://www.scopus.com/inward/record.url?scp=85182416144&partnerID=8YFLogxK
U2 - 10.1007/s42979-023-02330-x
DO - 10.1007/s42979-023-02330-x
M3 - Article
AN - SCOPUS:85182416144
SN - 2662-995X
VL - 4
JO - SN Computer Science
JF - SN Computer Science
IS - 6
M1 - 858
ER -