Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Currently best spam filter algorithm

What is the currently best method to detect spam ? especially on mobile text message. are there any resource or comparison analysis ?

like image 493
ahmy Avatar asked Dec 22 '10 16:12

ahmy


People also ask

What is the best algorithm for spam filtering?

Several machine learning algorithms have been used in spam e-mail filtering, but Naıve Bayes algorithm is particularly popular in commercial and open-source spam filters [2]. This is because of its simplicity, which make them easy to implement and just need short training time or fast evaluation to filter email spam.

Which is the number one tool for filtering spam?

EDITOR'S CHOICE. Trustifi Inbound Shield is our top pick for an anti-spam software tool because it slots into your Google Workspaces and Microsoft 365 accounts to offer protection for your email system that expands beyond spam filtering.

How AI is used in spam filters?

AI spam filters scan each incoming message and label any objectionable content. Its intelligent learning capabilities label warning signs of malware. If a message containing this malicious software is found in your inbox, it's immediately flagged and you're alerted not to touch it.


2 Answers

It's good to look into supervised learning techniques. There've been a number of studies where the Multinomial Naive Bayes Classifier has been used for spam email filtering with a lot of success. If it worked for spam email filtering, then it should work with SMS filtering. What you need is a huge dataset of example spam SMS texts and train the classifier with it.

Also, it may be helpful to look into the Support Vector Machine, which; although less widely used in spam filtering; is a much more powerful technique.

Also, just training the algorithms on raw text may not quite be the best way forward. There was a study by Mehran Sahami from 1998 that found that they achieved superior performance when they took other heuristics into consideration (e.g. was the email sent to a mailing list? was the email sent from a domain name that ended in either ".edu",".com",".org"? did the email contain multiple punctuation marks ("!!!")?, and so forth).

But start off with the Multinomial Naive Bayes Classifier. It's very simple to implement, and it's very easy to use, and from personal experience: it has a very short training time, as well.

like image 131
arkate Avatar answered Sep 30 '22 13:09

arkate


As I understand it most modern spam filtering is a combination of an implementation of Bayes' theorem and some heuristics, e.g. sender blacklists, standards compliance, sending patterns.

The easiest place to implement this in the mobile phone network would probably be at the SMS message centre, since the volume is higher, which makes a lot of the heuristics easier to implement.

like image 29
Flexo Avatar answered Sep 30 '22 13:09

Flexo