GoFuckYourself.com - Adult Webmaster Forum

GoFuckYourself.com - Adult Webmaster Forum (https://gfy.com/index.php)
-   Fucking Around & Business Discussion (https://gfy.com/forumdisplay.php?f=26)
-   -   Warning with this technique (Google) (https://gfy.com/showthread.php?t=1052351)

roganoli 01-04-2012 03:24 PM

Warning with this technique (Google)
 
I'm still an inexperienced webmaster...

I was having good placements on Google for my new site... He was even making good sales!

Suddenly something happened. My website disappeared from search! Today I found out what caused it... My site was totally copied by the proxy service Google! See the link below:

https://www.google.com/#q=%22appspot...1280& bih=663

I believe this was done intentionally by another webmaster and I believe I have solved this problem using my htaccess!

But be warned for other amateurs like me! :thumbsup

Klen 01-04-2012 03:29 PM

Had same happening to me,totally copied site to three different domains.Which is why i have now in my settings protection against all kind of downloaders.

roganoli 01-04-2012 03:31 PM

I put it on my htaccess:

RewriteCond %{HTTP_USER_AGENT} AppEngine [NC]
RewriteRule .* - [F]

Let me know if there is better solution...

baryl 01-04-2012 03:32 PM

Wait..what? Can someone explain what this is?

Klen 01-04-2012 03:34 PM

Search for article like this:
http://www.htaccess-guide.com/blocki...-and-bad-bots/
Had one with huge list of those downloaders,but dont know what was exact url.

roganoli 01-04-2012 03:35 PM

Quote:

Originally Posted by baryl (Post 18671087)
Wait..what? Can someone explain what this is?

Sorry if my English is not helpful...

baryl 01-04-2012 03:39 PM

It's fine. Thanks for pointing that out and sorry about your site.

RayBonga 01-04-2012 04:13 PM

Quote:

Originally Posted by roganoli (Post 18671085)
I put it on my htaccess:

RewriteCond %{HTTP_USER_AGENT} AppEngine [NC]
RewriteRule .* - [F]

Let me know if there is better solution...

Thanks for the warning :thumbsup What does this code do?

brassmonkey 01-04-2012 04:37 PM

damn sorry to hear that

redwhiteandblue 01-04-2012 04:44 PM

Quote:

Originally Posted by RayBonga (Post 18671154)
Thanks for the warning :thumbsup What does this code do?

It blocks access to any client which presents a user agent with the string "AppEngine" in it. But there are a whole load of others you need to block too, and that won't even stop all scrapers. I'd love to know the UA updowner.com is using and block that fucker.

medicatedghost 01-04-2012 05:47 PM

check out google.com/safebrowsing/diagnostic?site=updowner.com/
interesting analysis

jakeuk 01-04-2012 06:01 PM

The mod rewrite you suggested offer some protection. When I had similar problem a few years ago, here are other things I did in addition to the mod_rewrite:

1. Get strong links to your site so that your site is at least stronger than the copycat.
2. Rewrite some of the contents of your site so that its not a duplicate any more but a different version of the one that was copied.
3. Update you site regularly so that the copycat cannot keep up.
4. use a more aggressive exclusion robots.txt file, allowing ONLY the big 3 crawlers.

rowan 01-04-2012 06:31 PM

Quote:

Originally Posted by redwhiteandblue (Post 18671228)
I'd love to know the UA updowner.com is using and block that fucker.

Any half decent scraper probably uses a common UA, and possibly a random selection of common UAs. I've seen loads of multiple pages from the same IP use a different UA each time.


All times are GMT -7. The time now is 09:55 AM.

Powered by vBulletin® Version 3.8.8
Copyright ©2000 - 2025, vBulletin Solutions, Inc.
©2000-, AI Media Network Inc123