Institutional Open Archives

Analysis of TF-IDF text mining based machine learning methods for textual spam


Show simple item record

dc.contributor.author Kılıç, İrfan
dc.contributor.author Kaşoğlu, Aytaç
dc.contributor.author Yaman, Orhan
dc.contributor.editor Demir, İdris
dc.contributor.editor İş, Hafzullah
dc.date.accessioned 2024-08-16T12:25:22Z
dc.date.available 2024-08-16T12:25:22Z
dc.date.issued 2024-05-04
dc.identifier.citation Kılıç, İ., Kaşoğlu, A. ve Yaman, O. (2024). Analysis of TF-IDF text mining based machine learning methods for textual spam. Demir, İ. ve İş, H. (Ed.). III. International Informatics Congress 2024 Proceedings Book, 02-04 May 2024. (ss.117-124). Batman: Batman Üniversitesi Yayınevi. tr_TR
dc.identifier.uri http://hdl.handle.net/11508/21057
dc.description.abstract The most popular messaging applications that form the basis of Internet technology are E-mail and SMS applications. The biggest problem with these applications is spam. When comparing audio, video and text spam, text spam is known to be common. It is very important to detect textual spam with high accuracy and to know which category the spam content belongs to. This study focuses on the development of TF-IDF (Term Frequency-Inverse Document Frequency) based machine learning methods in the field of textual spam filtering. The study discussed the prominence of TF-IDF as a technique that plays an important role in the field of text mining. This technique uses the ratio between the frequency of the term in the document and its frequency in all documents to determine the importance of a term in a document. Within the scope of the study, data sets consisting of SMS and E-mail texts were collected and the effect of using Support Vector Machines (SVM), Decision Trees, Random Forests, Naïve Bayes, k-NN and TF-IDF together on spam filtering performance was examined. It has been demonstrated that the developed TF-IDF-based machine learning methods have the potential to obtain more effective results in spam filtering systems. tr_TR
dc.language.iso İngilizce tr_TR
dc.rights info:eu-repo/semantics/openAccess tr_TR
dc.subject Fırat Üniversitesi Kütüphanesi::DOĞA BİLİMLERİ VE MATEMATİK tr_TR
dc.subject.ddc Spam filtering tr_TR
dc.subject.ddc Text mining tr_TR
dc.subject.ddc Machine learning tr_TR
dc.subject.ddc TF-IDF tr_TR
dc.subject.ddc Support vector machine tr_TR
dc.title Analysis of TF-IDF text mining based machine learning methods for textual spam tr_TR
dc.type Bildiri - Yayımlanmış tr_TR
dc.contributor.YOKID 223516 tr_TR
dc.contributor.YOKID 223516 tr_TR
dc.relation.publishinghaddress Batman tr_TR
dc.relation.publishinghouse Batman Üniversitesi Yayınevi tr_TR
dc.identifier.pages 117;124
dc.identifier.bookname III. International Informatics Congress 2024 Proceedings Book, 02-04 May 2024 tr_TR
dc.published.type Uluslararası tr_TR


Files in this item

This item appears in the following Collection(s)

Show simple item record

University of Fırat
23119
Elazığ-Merkez
TURKEY