<?xml version="1.0" encoding="utf-8" ?><rss version="2.0" xml:base="https://www.webmaster-forums.net/crss/node/1033060" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title></title>
    <link>https://www.webmaster-forums.net/crss/node/1033060</link>
    <description></description>
    <language>en</language>
          <item>
    <title></title>
    <link>https://www.webmaster-forums.net/server-management/help-incredible-bandwidth-spike#comment-1193499</link>
    <description> &lt;p&gt;It may need a bit of fine-tuning (it can catch a good bot), so keep an eye on your .htaccess file (if you use that method) for one SE cycle.&lt;br /&gt;
Another way, which takes more work, is to use the getout file above but, instead of automatically writing to the .htaccess file, have it just send you an email with the details so you can check them out and add the ban manually if the visitor is bad.&lt;/p&gt;
 </description>
     <pubDate>Thu, 16 Feb 2006 09:15:26 +0000</pubDate>
 <dc:creator>Busy</dc:creator>
 <guid isPermaLink="false">comment 1193499 at https://www.webmaster-forums.net</guid>
  </item>
  <item>
    <title></title>
    <link>https://www.webmaster-forums.net/server-management/help-incredible-bandwidth-spike#comment-1193476</link>
    <description> &lt;p&gt;Thanks a ton for all your help!    I&#039;ll try to get one of those options implemented.&lt;/p&gt;
 </description>
     <pubDate>Wed, 15 Feb 2006 21:44:35 +0000</pubDate>
 <dc:creator>fifeclub</dc:creator>
 <guid isPermaLink="false">comment 1193476 at https://www.webmaster-forums.net</guid>
  </item>
  <item>
    <title></title>
    <link>https://www.webmaster-forums.net/server-management/help-incredible-bandwidth-spike#comment-1193475</link>
    <description> &lt;p&gt;Forgot to mention: these bot traps were originally created for email harvesters, then extended to bots that download your entire site. They won&#039;t stop leeching or hotlinking (forums linking to a single image can have a similar effect to a bad bot).&lt;/p&gt;
 </description>
     <pubDate>Wed, 15 Feb 2006 21:08:18 +0000</pubDate>
 <dc:creator>Busy</dc:creator>
 <guid isPermaLink="false">comment 1193475 at https://www.webmaster-forums.net</guid>
  </item>
  <item>
    <title></title>
    <link>https://www.webmaster-forums.net/server-management/help-incredible-bandwidth-spike#comment-1193474</link>
    <description> &lt;p&gt;The bot traps do help. Say a runaway bot sucks everything off your site: you won&#039;t know about it until the next day. With a bot trap script the bot is banned straight away (depending on how it is set up), and if it is sent away (as opposed to being given 403 pages) it will only get a few pages.&lt;/p&gt;
&lt;p&gt;I didn&#039;t give any links or script as there are so many ways of doing it.&lt;br /&gt;
The most basic version just blocks any bot that views one particular page, ideally called forbotsonly, or keepout, or banlist... but people are nosey and you will catch some of them, so the trick is to mention the file in robots.txt only. That way only bots (or nosey people viewing your robots file) will ever find it. It isn&#039;t foolproof but it does catch a lot.&lt;br /&gt;
There are some advanced scripts out there that only ban for 24-48 hours, or that ban for viewing pages too quickly (tabbed browsing can trigger this).&lt;/p&gt;
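&lt;p&gt;As a rough illustration of the robots.txt trick, here is a minimal Python sketch (it is not taken from any of the scripts linked in this thread). The /keepout/ folder and the bot name are made up. The point is that a bot which honours robots.txt never requests the trap page, so only bots that ignore the file get caught.&lt;/p&gt;

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: the trap folder is mentioned here and nowhere else.
ROBOTS_TXT = """User-agent: *
Disallow: /keepout/
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# A polite bot asks before fetching, so it skips the trap page.
trap_allowed = parser.can_fetch("PoliteBot", "/keepout/getout.php")
home_allowed = parser.can_fetch("PoliteBot", "/index.html")
print(trap_allowed, home_allowed)
```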
&lt;p&gt;Here are a couple of links to help you decide which way you want to go.&lt;br /&gt;
&lt;a href=&quot;http://www.neilgunton.com/spambot_trap/&quot; class=&quot;bb-url&quot;&gt;http://www.neilgunton.com/spambot_trap/&lt;/a&gt;&lt;br /&gt;
&lt;a href=&quot;http://www.kloth.net/internet/bottrap.php&quot; class=&quot;bb-url&quot;&gt;http://www.kloth.net/internet/bottrap.php&lt;/a&gt; &lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;http://www.g-clef.net/drupal/?q=node/3&quot; class=&quot;bb-url&quot;&gt;http://www.g-clef.net/drupal/?q=node/3&lt;/a&gt;   &amp;lt;-- what a bot trap is&lt;br /&gt;
&lt;a href=&quot;http://www.jkcc.com/e-mail.html&quot; class=&quot;bb-url&quot;&gt;http://www.jkcc.com/e-mail.html&lt;/a&gt;   &amp;lt;-- what a loop page or punishment page could look like&lt;/p&gt;
&lt;p&gt;The second link is probably the best one, but if you just want some code, here it is (the page is getout.php): &lt;div class=&quot;codeblock&quot;&gt;&lt;code&gt;&amp;lt;?php&lt;br /&gt;
// getout.php: ban the visiting IP by adding a SetEnvIf rule to the top of .htaccess&lt;br /&gt;
$filename = $_SERVER[&quot;DOCUMENT_ROOT&quot;] . &quot;/.htaccess&quot;;&lt;br /&gt;
// escape the dots so the IP is matched literally in the regular expression&lt;br /&gt;
$content = &quot;SetEnvIf Remote_Addr ^&quot; .&lt;br /&gt;
str_replace(&quot;.&quot;, &quot;\.&quot;, $_SERVER[&quot;REMOTE_ADDR&quot;]) . &quot;$ getout\r\n&quot;;&lt;br /&gt;
// put the new rule first, followed by the existing file contents&lt;br /&gt;
$content .= file_get_contents($filename);&lt;br /&gt;
$handle = fopen($filename, &#039;w+&#039;);&lt;br /&gt;
fwrite($handle, $content, strlen($content));&lt;br /&gt;
fclose($handle);&lt;br /&gt;
// the referer header is optional, so fall back to an empty string&lt;br /&gt;
$referer = isset($_SERVER[&quot;HTTP_REFERER&quot;]) ? $_SERVER[&quot;HTTP_REFERER&quot;] : &quot;&quot;;&lt;br /&gt;
// change youremail@yourdomain and trap@yourdomain.com to your real&lt;br /&gt;
// address and real domain name, leave &#039;trap@&#039; so you know it&#039;s from the spider trap&lt;br /&gt;
mail(&quot;youremail@yourdomain&quot;,&lt;br /&gt;
&quot;Spider Alert!&quot;,&lt;br /&gt;
&quot;The following ip just got banned because it accessed the spider trap.\r\n\r\n&quot; .&lt;br /&gt;
$_SERVER[&quot;REMOTE_ADDR&quot;] . &quot;\r\n&quot; . $_SERVER[&quot;HTTP_USER_AGENT&quot;] . &quot;\r\n&quot; .&lt;br /&gt;
$referer, &quot;From: trap@yourdomain.com&quot;);&lt;br /&gt;
$page = &#039;&#039;;&lt;br /&gt;
// note: some site downloaders will also trigger the script&lt;br /&gt;
$page .= &quot;&amp;lt;h1&amp;gt;You have been permanently blocked from the site&amp;lt;/h1&amp;gt;&quot;;&lt;br /&gt;
$page .= &#039;&amp;lt;p&amp;gt;We don\&#039;t allow site downloads or email spiders &#039; .&lt;br /&gt;
&#039;of any kind, sorry. If you feel this is a mistake, &#039; .&lt;br /&gt;
&#039;please send us an email with your IP address and we\&#039;ll &#039; .&lt;br /&gt;
&#039;remove your IP address from the blocked list.&amp;lt;/p&amp;gt;&#039;;&lt;br /&gt;
/*&lt;br /&gt;
add email constructor here if desired (post 4 in thread)&lt;br /&gt;
*/&lt;br /&gt;
echo $page;&lt;br /&gt;
?&amp;gt;&lt;/code&gt;&lt;/div&gt;&lt;/p&gt;
&lt;p&gt;Or do a search for things like bot trap, auto bot trap, bad bots ...&lt;/p&gt;
 </description>
     <pubDate>Wed, 15 Feb 2006 21:04:48 +0000</pubDate>
 <dc:creator>Busy</dc:creator>
 <guid isPermaLink="false">comment 1193474 at https://www.webmaster-forums.net</guid>
  </item>
  <item>
    <title></title>
    <link>https://www.webmaster-forums.net/server-management/help-incredible-bandwidth-spike#comment-1193461</link>
    <description> &lt;p&gt;I saw a link on somebody&#039;s website a few weeks ago and curiously followed it to &lt;a href=&quot;http://www.spampoison.com/&quot; class=&quot;bb-url&quot;&gt;http://www.spampoison.com/&lt;/a&gt; .  It sounds like the trap you were mentioning, but does it really work (or at least help)?  Of course punishing bad bots won&#039;t actually protect my website so I&#039;m also intrigued by that hidden-link-auto-ban thing you were talking about.  Creating something like that myself is beyond my abilities, but what would something like that be called if I were trying to look for that on hotscripts or someplace?&lt;/p&gt;
&lt;p&gt;Thanks.&lt;/p&gt;
 </description>
     <pubDate>Wed, 15 Feb 2006 14:08:36 +0000</pubDate>
 <dc:creator>fifeclub</dc:creator>
 <guid isPermaLink="false">comment 1193461 at https://www.webmaster-forums.net</guid>
  </item>
  <item>
    <title></title>
    <link>https://www.webmaster-forums.net/server-management/help-incredible-bandwidth-spike#comment-1193443</link>
    <description> &lt;p&gt;Robots.txt is only a suggestion, and it is mainly the major search engines that follow it. A robots.txt file should never be your only way of keeping bots out of folders, files or images. Bad bots hardly ever read the file. Some do fetch it so as not to look bad, but they usually take no notice of it.&lt;/p&gt;
&lt;p&gt;The site gives the UA, but did you check it (and the IP) in your logs? If so, is it correct?&lt;/p&gt;
&lt;p&gt;I know a few people who believe the bot problem is way out of control: every Tom, Fred and Henry is trying to make the next Google. They block all bots except the main ones and use a bot trap.&lt;br /&gt;
The way it works is a hidden link on every page. The page it points to adds the visitor&#039;s IP to the .htaccess file, displays an ugly message, and can sometimes send the bot into a wild loop so it has to suck up thousands of useless pages going nowhere fast. This can suck up a few resources though, so beware.&lt;/p&gt;
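&lt;p&gt;To make the &quot;adds the IP to .htaccess&quot; step concrete, here is a small Python sketch that only builds the line a trap page would write. The rule format follows the getout.php script posted in this thread; the function name is made up. It also assumes the .htaccess file already has a deny rule keyed on the getout variable, since SetEnvIf on its own only labels the request.&lt;/p&gt;

```python
import re


def ban_rule(remote_addr):
    """Return a SetEnvIf line that tags requests from one IP as 'getout'."""
    # re.escape puts a backslash before each dot so the address matches literally.
    return "SetEnvIf Remote_Addr ^" + re.escape(remote_addr) + "$ getout"


print(ban_rule("64.127.124.157"))
```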
 </description>
     <pubDate>Wed, 15 Feb 2006 09:08:37 +0000</pubDate>
 <dc:creator>Busy</dc:creator>
 <guid isPermaLink="false">comment 1193443 at https://www.webmaster-forums.net</guid>
  </item>
  <item>
    <title></title>
    <link>https://www.webmaster-forums.net/server-management/help-incredible-bandwidth-spike#comment-1193420</link>
    <description> &lt;p&gt;My attack came specifically from 64.127.124.157, but their website states that their IP address ranges are: 64.127.124.* and 65.19.150.129 - 65.19.150.255.&lt;/p&gt;
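&lt;p&gt;If you want to check addresses from your logs against those two ranges, something like this Python sketch should do it. The ranges are the ones quoted above; the function and constant names are made up for the example.&lt;/p&gt;

```python
import ipaddress

# Ranges as stated in the post: 64.127.124.* and 65.19.150.129 - 65.19.150.255.
FIRST_BLOCK = ipaddress.ip_network("64.127.124.0/24")
SECOND_START = int(ipaddress.ip_address("65.19.150.129"))
SECOND_END = int(ipaddress.ip_address("65.19.150.255"))


def in_listed_ranges(ip_text):
    """Return True if the address falls inside either stated range."""
    ip = ipaddress.ip_address(ip_text)
    if ip in FIRST_BLOCK:
        return True
    return int(ip) in range(SECOND_START, SECOND_END + 1)


print(in_listed_ranges("64.127.124.157"))
print(in_listed_ranges("65.19.150.128"))
```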
 </description>
     <pubDate>Wed, 15 Feb 2006 03:08:45 +0000</pubDate>
 <dc:creator>fifeclub</dc:creator>
 <guid isPermaLink="false">comment 1193420 at https://www.webmaster-forums.net</guid>
  </item>
  <item>
    <title></title>
    <link>https://www.webmaster-forums.net/server-management/help-incredible-bandwidth-spike#comment-1193403</link>
    <description> &lt;p&gt;Can you share those IPs?&lt;/p&gt;
&lt;p&gt;I, for one, would be grateful.&lt;/p&gt;
&lt;p&gt;It would be nice to never have that happen to me.&lt;/p&gt;
&lt;p&gt;Thanks!&lt;/p&gt;
 </description>
     <pubDate>Tue, 14 Feb 2006 23:29:08 +0000</pubDate>
 <dc:creator>Michael Allison</dc:creator>
 <guid isPermaLink="false">comment 1193403 at https://www.webmaster-forums.net</guid>
  </item>
  <item>
    <title></title>
    <link>https://www.webmaster-forums.net/server-management/help-incredible-bandwidth-spike#comment-1193402</link>
    <description> &lt;p&gt;I found an IP address in my stats that gobbled up over &lt;strong&gt;56,000&lt;/strong&gt; page views in just one night.  Other than my message board, my entire site only has a few dozen pages total.  I searched for that IP address and connected it to some evil search spider named &quot;&lt;a href=&quot;http://www.omni-explorer.com/&quot; class=&quot;bb-url&quot;&gt;omni-explorer&lt;/a&gt;&quot;.  Curse them!  &lt;/p&gt;
&lt;p&gt;Two bits of information that enrage me even more than the fact that they&#039;ve forced me to shut down most of my website in order to stay afloat until the end of the month... 1) my properly written robots.txt clearly states that no robots are allowed to index my forums (which should have stopped the bot after a few dozen page views) and 2) despite this over-indulgent attack, their bot is not listed as a spider that visited my website (so they&#039;re trying to avoid detection, like a bull in a china shop).  &lt;/p&gt;
&lt;p&gt;Sorry, but I am so F****** p***** off!  &lt;img src=&quot;https://www.webmaster-forums.net/misc/smileys/at-wits-end.png&quot; title=&quot;Mad&quot; alt=&quot;Mad&quot; class=&quot;smiley-content&quot; /&gt;  I&#039;ve now banned their IPs from my domain but it&#039;s too little, too late.&lt;/p&gt;
 </description>
     <pubDate>Tue, 14 Feb 2006 23:15:11 +0000</pubDate>
 <dc:creator>fifeclub</dc:creator>
 <guid isPermaLink="false">comment 1193402 at https://www.webmaster-forums.net</guid>
  </item>
  <item>
    <title></title>
    <link>https://www.webmaster-forums.net/server-management/help-incredible-bandwidth-spike#comment-1193283</link>
    <description> &lt;p&gt;Use your raw logs; web stats (of any brand) are useless for things like this.&lt;br /&gt;
Your log files will be big, so start with the error logs. If it&#039;s bots, the stupid ones will be easy to spot (they may try to find root files in folders). Then, in your log files, strip out every occurrence of your own domain name (first with www., then without it) and search for http and/or www in what is left. You&#039;ll probably find it is bots and forums hotlinking to files.&lt;/p&gt;
&lt;p&gt;Block all repeat offenders via htaccess.&lt;/p&gt;
&lt;p&gt;It&#039;s a horrible, time-consuming job but it needs to be done.&lt;br /&gt;
Good luck.&lt;/p&gt;
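&lt;p&gt;For a first pass over the raw logs, a short Python sketch like the one below can count requests per IP so the heavy hitters stand out. It assumes the common Apache log layout where the client address is the first field on each line; the sample lines are invented for illustration and the function name is made up.&lt;/p&gt;

```python
from collections import Counter


def top_clients(log_lines, limit=5):
    """Count requests per client IP and return the busiest ones first."""
    counts = Counter()
    for line in log_lines:
        fields = line.split()
        if fields:
            # Assumption: the client address is the first field on the line.
            counts[fields[0]] += 1
    return counts.most_common(limit)


# Invented sample lines; with real data, pass the open log file instead.
sample = [
    '64.127.124.157 - - [14/Feb/2006:02:10:01 +0000] "GET /forum/1 HTTP/1.1" 200 5120',
    '64.127.124.157 - - [14/Feb/2006:02:10:02 +0000] "GET /forum/2 HTTP/1.1" 200 4096',
    '10.0.0.5 - - [14/Feb/2006:02:11:00 +0000] "GET /index.html HTTP/1.1" 200 1024',
]
print(top_clients(sample))
```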
 </description>
     <pubDate>Mon, 13 Feb 2006 20:21:51 +0000</pubDate>
 <dc:creator>Busy</dc:creator>
 <guid isPermaLink="false">comment 1193283 at https://www.webmaster-forums.net</guid>
  </item>
  </channel>
</rss>
