Anyone want to write an "intelligent" mail filter?
- Posted by Irv Mullins <irvm at ellijay.com> Nov 04, 2003
- 757 views
Every day I get more annoying SPAM e-mails. Currently it's running about 10 spams to every valid e-mail. I'm tired of wading thru them, and I'd rather not download them at all. My e-mail client can filter the messages by sender or subject, but most spams now are written to get around those filters. One thing I notice is that nearly 100% of the spams either contain the word "lagos" or long strings of "dictionary" words to confuse the filters: "indecisive constitute dakar summitry ajax beaver descendent withal circumlocution asocial voluble inquire convolution replete hitler commendation segregate cognition abstract eject disgustful" But very few or none of the more common shorter words that would likely appear in a valid e-mail: "a, and, or, if, you, we, I, to, for, the, this, that....." We should be able to come up with a routine which would analyze a given text string and rank it according to its likelyhood of being a 'meaningful' message. Then use that routine in an e-mail client to rank messages and only download from the server those which appear to be 'real'. Ideas? Irv -- Windows 98 is *NOT* a virus - viruses are small and efficient.