Someone is using the alt.binaries.anime group as a private dumping ground by encrypting both Subject and Author. I would like to be able to filter these out, but it's not obvious how to do it. For example:
Title:
02b936d4d4bcc20983bb4d05 [*/4] - "02b936d4d4bcc20983bb4d05.vol00-08"
Author:
6aaatttl8l2vZvs2 <0d01ikmo9ri3vecb@71lulg7wtj9pwZvc.fh>
This is one of those cases where a human can immediately recognize the pattern (there are no recognizable English words), but it is hard to come up with a simple rule that identifies these posts algorithmically. I noticed that the ratio of letters to digits in these strings is low compared to non-encrypted posts. The most immediate strategy that comes to mind is therefore to compute the ratio of digits to alphabetic letters in the Subject and Author, and kill the post if it is higher than a set threshold (although you may think of something better).
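To make the idea concrete, here is a rough sketch of such a check in Python. It is only an illustration: the function names and the 0.2 threshold are made up, not part of any newsreader, and the threshold would need tuning against real posts. To avoid dividing by zero it measures digits as a share of all letters and digits, rather than the plain digits-to-letters ratio.

```python
def digit_ratio(text: str) -> float:
    """Share of digits among all letters and digits in text (0.0 if there are none)."""
    digits = sum(ch.isdigit() for ch in text)
    letters = sum(ch.isalpha() for ch in text)
    total = digits + letters
    return digits / total if total else 0.0


def looks_encrypted(subject: str, author: str, threshold: float = 0.2) -> bool:
    """Flag a post only when BOTH fields are digit-heavy.

    The 0.2 default is an assumed starting point, not a tested value.
    """
    return digit_ratio(subject) >= threshold and digit_ratio(author) >= threshold


if __name__ == "__main__":
    # The example post from above.
    subject = '02b936d4d4bcc20983bb4d05 [*/4] - "02b936d4d4bcc20983bb4d05.vol00-08"'
    author = "6aaatttl8l2vZvs2 <0d01ikmo9ri3vecb@71lulg7wtj9pwZvc.fh>"
    print(digit_ratio(subject), digit_ratio(author))
    print(looks_encrypted(subject, author))
```

Requiring both fields to exceed the threshold is a deliberate choice in this sketch: ordinary binary posts often have digit-heavy subjects (part numbers, episode numbers, file names), but a normal author line rarely contains many digits.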
Thanks for any help you can give.
Feature Request: Filter for Encrypted Subject and Author pos