Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Is there a free anti spam database?

Wordpress has a spam filtering plugin called Akismet that seems to be able to classify any block of text as spam or not. The only caveat being that you need to go through their interface and their database/algorithm is not open sourced or readily available otherwies.

There are also commercial providers that provide a web accessible API for you to classify the emails, comments or any other text being submitted by users in your web application.

Is there any sort of open source or freely accessible database that can classify a block of text as spam/non-spam?

Edit: Here's a clearer explanation of what I want

Basically I was hoping that there was an extensive database out there with the probabilities of certain phrases being spam. Since (I'm assuming) spammers spam all email addresses equally, by pre-populating my Bayesian spam filter with this database, I could create an application that starts off by capturing most spam without any user training.

like image 786
Praveen Angyan Avatar asked Oct 15 '22 14:10

Praveen Angyan


1 Answers

Probably not exactly what you're looking for, but the MoinMoin Wiki maintainers keep a central list of Wiki spam regular expressions here: http://master.moinmo.in/BadContent

like image 176
RichieHindle Avatar answered Oct 21 '22 09:10

RichieHindle