K'ay. I got an idea that just might work.
So... What is the thing that all these forum "spammers" do? They register, activate and then start spamming. Right?
There's one interesting thing in here. There's seems to be an interesting pattern in the posts they make.
Sounds like, for all these users, they do not make any useful post (we can think of 3 posts, just to be sure) before they start spamming.
We could make a message checker system that would work with the same idea as thunderbird's message evaluator (it uses it to evaluate if an e-mail is junk or not).
Code: Select all
For each post from a user with less than 4 posts post:
var words = break the post into individual words.
var urls = also we look for all urls
var urlImages = also we look for all urls that are images.
Foreach (words as word)
Evaluate the probability that that word is related to spam.
Compare it to the previous one
Compare it to the next one
Associate a number to it. (That number is the probability that that word is related to spam).
Then do the same about the urls.
Then do the same about the image urls.
The objective is to see if it contains words like buy or it's a bit.ly (or something like that)
Then gather the final result. It is a number between 0% and 100% (we can use other scales, if it makes sense)
Using a threshold, we can then separate what to do with that post.
This is kinda resource heavy but it seems to be quite effective. At least with thunderbird, it works like a charm to me!
We can do this in a more accurate and strict way in the form to contact the admins.
We create a form that, while the user is not in a "accepted" state (has, at least, 3 accepted posts, or so) his request to the admins is filtered. Only certain kind of stuff is allowed.
Yeah... bots can go through these protections but they can only pass with content they don't want to insert. All these bots seem to be only interested into placing ads of some sort. Ads have patters. We may gain something in here! This will not prevent spam but it will prevent the kind of spam all these bots want to place in the BB's.
What do you think?