To Catch a Thief
by
Pasta
Controlling The Thief
I believe it was the beginning of August when I posted an htaccess file for blocking site rippers (a Cool DFN Link of the Week). It was a learning thread for me. People raised a concern: did I really want to block browsers and spiders from my site? I did, but I also took what people mentioned into consideration and did my own investigating. For starters, I visited the search engines to find out what their bot names were. I took the working htaccess and saved a copy as htaccess.old, which made it easier to do live edits on my htaccess file when I wasn't home.

Basically, you just add rewrite conditions to your htaccess that block a particular user agent. Simple enough? It's not! First and foremost, it takes some up-front research. Go through your stats for the current month and the previous month and look for a trend in occurrences. Show of hands: do you want something called Siphon visiting your site? Search on Siphon and read about it, and you would block it too, wouldn't you? The reason I went on this little mission to gather information is that I didn't want these proggies in my stats, period.

Here is an example of the rewrite condition lines you will be adding to your htaccess file if you want to do some research:
RewriteCond %{HTTP_USER_AGENT} ^.*Siphon.*$ [OR]
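A single condition line does nothing on its own, so here is a minimal sketch of how such lines might sit in a complete htaccess block. It assumes Apache with mod_rewrite enabled and a host that allows rewrite directives in htaccess. Only Siphon comes from this article. ExampleRipper and AnotherExampleBot are made-up placeholder names, to be swapped for whatever turns up in your own stats.

```apache
# Turn on the rewrite engine (requires mod_rewrite).
RewriteEngine On

# Each condition tests the User-Agent header sent by the visitor.
# [OR] chains a condition to the one that follows it.
RewriteCond %{HTTP_USER_AGENT} ^.*Siphon.*$ [OR]
# Placeholder names below are hypothetical; replace them with agents from your own logs.
RewriteCond %{HTTP_USER_AGENT} ^.*ExampleRipper.*$ [OR]
# The last condition in the chain must NOT carry [OR].
RewriteCond %{HTTP_USER_AGENT} ^.*AnotherExampleBot.*$

# If any condition above matched, refuse the request with 403 Forbidden.
RewriteRule ^ - [F]
```

Leaving [OR] on the final condition is an easy mistake when adding or removing lines, and it can make the rule behave in ways you did not intend, so check the last line after every edit. Keeping a known-good copy such as htaccess.old around makes it quick to roll back.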
Pasta's Cool DFN Link Of The Week
How You Doin?
Working Site Ripper Htaccess



