FutureQuest, Inc. FutureQuest, Inc. FutureQuest, Inc.

FutureQuest, Inc.
Go Back   FutureQuest Community > General Site Owner Support (All may read/respond) > Open Discussions
User Name
Password  Lost PW

Reply
 
Thread Tools Search this Thread Display Modes
Old 05-13-2007, 05:50 AM   Postid: 158066
phppete
Registered User
 
phppete's Avatar

Forum Notability:
238 pts: Ambassador of Goodwill
[Post Feedback]
 
Join Date: May 2003
Posts: 1,489
Yahoo spider crashing mysql on non FQ site

We seem to be experiencing Yahoos spider triggering a mysql crash on a non FQ server. It is always one particular site and always the user agent (compatible; Yahoo! Slurp; http://help.yahoo.com/help/us/ysearch/slurp)

We don't have the luxury of separate mysql servers or SRC on this server so I was thinking of detecting Yahoo! in the user agent and adding usleep() for 1/2 second. Would 1/2 second be too long or too little to slow Yahoos spider? Would the spider just give up and not index the site?

Or would it be better to do as instructed here :

http://help.yahoo.com/help/us/ysearc.../slurp-03.html

Ideally we don't want to mess up SE rankings.
phppete is offline   Reply With Quote
Old 05-13-2007, 11:14 AM   Postid: 158071
kennylucius
Registered User

Forum Notability:
237 pts: Ambassador of Goodwill
[Post Feedback]
 
Join Date: Jun 2005
Posts: 140
Re: Yahoo spider crashing mysql on non FQ site

It never occurred to me that you could slow a spider that way. I assumed from G's spastic crawling that they sent for pages on an inhuman schedule regardless of the response they receive (at least in the short term).

Yahoo has always been well-behaved, and is supposed to honor the "crawl-delay" directive. Have they been ignoring that?
__________________
Kenny Lucius
kennylucius is offline   Reply With Quote
Old 05-14-2007, 12:36 PM   Postid: 158076
SneakyDave
Fond of TAZ
 
SneakyDave's Avatar

Forum Notability:
93 pts: Helpful Contributor
[Post Feedback]
 
Join Date: Feb 1999
Posts: 918
Re: Yahoo spider crashing mysql on non FQ site

Yahoo's slurp has been going crazy on my sites lately too. 50 or 60 instances of it crawling around at any given time. I just wish there was a way to limit the number of visits from it, rather than block it completely.
SneakyDave is offline   Reply With Quote
Old 05-14-2007, 12:41 PM   Postid: 158077
phppete
Registered User
 
phppete's Avatar

Forum Notability:
238 pts: Ambassador of Goodwill
[Post Feedback]
 
Join Date: May 2003
Posts: 1,489
Re: Yahoo spider crashing mysql on non FQ site

Quote:
Originally Posted by SneakyDave View Post
Yahoo's slurp has been going crazy on my sites lately too. 50 or 60 instances of it crawling around at any given time. I just wish there was a way to limit the number of visits from it, rather than block it completely.
There is http://help.yahoo.com/help/us/ysearc.../slurp-03.html

My original question though is which is better, slow it down using usleep() or doing as instructed in the info. FQ already have SRC (Spider Rate Control) but the site in my original post is not on FQ.

If FQ didn't have SRC we would probably see lots more downtime, crashes, high server loads.
phppete is offline   Reply With Quote
Reply


Currently Active Users Viewing This Thread: 1 (0 members and 1 visitors)
 
Thread Tools Search this Thread
Search this Thread:

Advanced Search
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

vB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Forum Jump


All times are GMT -4. The time now is 08:31 AM.


Running on vBulletin®
Copyright © 2000 - 2013, Jelsoft Enterprises Ltd.
Hosted & Administrated by FutureQuest, Inc.
Images & content copyright © 1998-2013 FutureQuest, Inc.
FutureQuest, Inc.