Anyone want to write an "intelligent" mail filter?

new topic     » topic index » view thread      » older message » newer message

Every day I get more annoying SPAM e-mails. Currently it's running about 10 
spams to every valid e-mail.

I'm tired of wading thru them, and I'd rather not download them at all. 
My e-mail client can filter the messages by sender or subject, but most 
spams now are written to get around those filters. 

One thing I notice is that nearly 100% of the spams either contain the 
word "lagos" or long strings of "dictionary" words to confuse the filters:

"indecisive constitute dakar summitry ajax beaver descendent withal 
circumlocution asocial voluble inquire convolution replete hitler 
commendation segregate cognition abstract eject disgustful"

But very few or none of the more common shorter words that would likely 
appear in a valid e-mail: "a, and, or, if, you, we, I, to, for, the, this, 
that....."

We should be able to come up with a routine which would analyze a given 
text string and rank it according to its likelyhood of being a 'meaningful' 
message. Then use that routine in an e-mail client to rank messages and 
only download from the server those which appear to be 'real'. 

Ideas?

Irv



-- 
Windows 98 is *NOT* a virus - viruses are small and efficient.

new topic     » topic index » view thread      » older message » newer message

Search



Quick Links

User menu

Not signed in.

Misc Menu