Blocking Geo2Web
Written by Chad on May 6th, 2009
So, like so many other GIS-related blogs, I am being scraped by geo2web.com. I have blocked them from accessing my site, and I am debating removing feedburner as the RSS source and going back to running the feeds from here. If you are an RSS subscriber, I would suggest updating your feed address just in case.
I would suggest everyone that gets scraped by geo2web do something similar to send them a message.
Here is what I am using in my .htaccess file:
# Blocking geo2web theft
Order Allow,Deny
Allow from all
# Flag requests from 67.225.114.x through 67.225.125.x
SetEnvIf Remote_Addr "^67\.225\.1(1[4-9]|2[0-5])\." getout
Deny from env=getout
Deny from gudzondns.net
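Before relying on a pattern like this, it can be worth checking which addresses it actually catches. The following is only a minimal sketch in Python, assuming Python's `re` module treats this particular expression the same way Apache's regex engine does (it uses nothing engine-specific). The function name `is_blocked` and the sample addresses are made up for illustration.

```python
import re

# Same expression as in the SetEnvIf line, with plain ASCII quotes and a real pipe.
BLOCK_PATTERN = re.compile(r"^67\.225\.1(1[4-9]|2[0-5])\.")


def is_blocked(remote_addr: str) -> bool:
    """Return True if the address matches the pattern (hypothetical helper)."""
    return BLOCK_PATTERN.search(remote_addr) is not None


def blocked_third_octets() -> list:
    """List every third octet in 67.225.x.1 that the pattern matches."""
    return [octet for octet in range(256) if is_blocked(f"67.225.{octet}.1")]


if __name__ == "__main__":
    # Illustrative addresses only, not taken from any real log.
    for addr in ("67.225.114.1", "67.225.125.200", "67.225.113.9", "10.0.0.1"):
        print(addr, is_blocked(addr))
    print(blocked_third_octets())
```

If the list of matched third octets is not exactly the range you meant to block, the character classes in the pattern are the place to look.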
6
PM
Thanks for the tip – I’ve added it to my htaccess file.
Got a good laugh when I went to geo2web.com and saw your post above at the top.
6
PM
Went to check out the geo2web site that you refer to and this entry was first on the page. I don’t know if that means this doesn’t work or not.
7
AM
Thanks for letting us know. I’ve updated to your new RSS feed. It’s a pity that sites like Geo2Web exploit information sources.
7
AM
This post appears on geo2web, probably from Planet Geospatial
7
AM
He should not be getting anything from planetgs anymore; he has the feed addresses now. I have not yet killed the feedburner feed, so he is probably scraping that feed.
Hopefully, once I kill feedburner, no more posts will show up there.
7
AM
You should put something in the feed about Geo2Web being a scraper and leave that in feedburner, so it continually shows up at that site and visitors there will know what kind of blog it really is.
7
AM
James said something along the same lines last night. Probably worth doing anyway.
I’ll run both for now until I finally decide what to do.
18
PM
Hello, we just got our list of top mapping blogs scraped by geo2web and mapperz also…
We may follow your advice, thanks anyway