FutureQuest, Inc. FutureQuest, Inc. FutureQuest, Inc.

FutureQuest, Inc.
Go Back   FutureQuest Community > General Site Owner Support (All may read/respond) > General Coding/Development
User Name
Password  Lost PW

Reply
 
Thread Tools Search this Thread Display Modes
Old 08-04-2003, 11:06 AM   Postid: 93031
krisleech
Registered User

Forum Notability:
10 pts: User-friendly
[Post Feedback]
 
Join Date: Mar 2002
Location: Nottingham, UK.
Posts: 323
robots.txt not working.

Can anyone tell me why my robots.txt does not work. http://www.phizzie.com/robots.txt

In robots.txt is:

PHP Code:
User-agent: *

Disallowcopyright.htm
Disallow
privacy.htm
Disallow
small-business-links.htm 
But http://www.phizzie.com/small-business-links.htm has a Pagerank. When the spiders should be visiting it.

Any suggestions as to why. Im sure the syntax in the robots.txt is correct.
__________________
Website Solutions, Advice, Articles and News @ www.phizzie.com. Based in UK.
krisleech is offline   Reply With Quote
Old 08-04-2003, 07:16 PM   Postid: 93079
MichaelC
Registered User

Forum Notability:
0 pts:
[Post Feedback]
 
Join Date: Mar 2002
Location: Seattle
Posts: 387
First off, many bots will ignore your robots.txt file (for example, every known address-harvesting bot). Others will read it and explicitly go where you told them not to go.

To make doubly sure the honorable bots aren't logging your pages to search engines, you might consider using the robots meta tag on those pages. The robotstxt.org site has lots of useful information about bots.

MC
MichaelC is offline   Reply With Quote
Old 08-05-2003, 04:02 AM   Postid: 93093
krisleech
Registered User

Forum Notability:
10 pts: User-friendly
[Post Feedback]
 
Join Date: Mar 2002
Location: Nottingham, UK.
Posts: 323
robots.txt and google

Thank michael, will take a look and try that out.
I would have thought googles robots would obey the robots.txt, as i am mainly concerned about google. I know googles bots have been on at least one of the pages in the robots.txt.


cheers kris.
__________________
Website Solutions, Advice, Articles and News @ www.phizzie.com. Based in UK.
krisleech is offline   Reply With Quote
Old 08-05-2003, 10:39 AM   Postid: 93112
frankc
Site Owner
 
frankc's Avatar

Forum Notability:
181 pts: Ambassador of Goodwill
[Post Feedback]
 
Join Date: Mar 1999
Location: northeastern Illinois
Posts: 1,003
Re: robots.txt and google

Quote:
Originally posted by krisleech:
....I know googles bots have been on at least one of the pages in the robots.txt.
Well, I'd bet many bad 'bots spoof their name, so it may not have been Google.
__________________
Frank
:: There is more to life than increasing its speed. (Gandhi) ::
:: Be careful of your thoughts, they may become words at any moment. (Iara Gassen) ::
:: "Perfectionism is self abuse of the highest order." (Anne Wilson Schaef) ::
:: "Life is change; how it differs from the rocks." (Jefferson Airplane) ::
:: "Everyone thinks of changing the world, but no one thinks of changing himself." (Leo Tolstoy, 1828-1910) ::
frankc is offline   Reply With Quote
Old 08-05-2003, 11:40 AM   Postid: 93114
krisleech
Registered User

Forum Notability:
10 pts: User-friendly
[Post Feedback]
 
Join Date: Mar 2002
Location: Nottingham, UK.
Posts: 323
Google bot for sure.

I know the real google bot has been as the pages actually have a pagerank of zero. If google bot had not been the page rank would be "google has not ranked this page", or something along those lines.
__________________
Website Solutions, Advice, Articles and News @ www.phizzie.com. Based in UK.
krisleech is offline   Reply With Quote
Old 08-05-2003, 12:22 PM   Postid: 93117
Jarrod
Site Owner
 
Jarrod's Avatar

Forum Notability:
334 pts: An Honor To Be Around
[Post Feedback]
 
Join Date: Jan 2003
Location: London, England
Posts: 347
Quote:
I know the real google bot has been as the pages actually have a pagerank of zero. If google bot had not been the page rank would be "google has not ranked this page", or something along those lines.
I'm not sure about this. Using Google Toolbar V2 I've just had a look at the page ranking of the CNC pages for my web site. They also shows up as a page rank of zero, yet google (or any bot for that matter) has never been near the pages as they are all protected by a .htaccess file.
Jarrod is offline   Reply With Quote
Old 08-05-2003, 04:09 PM   Postid: 93128
krisleech
Registered User

Forum Notability:
10 pts: User-friendly
[Post Feedback]
 
Join Date: Mar 2002
Location: Nottingham, UK.
Posts: 323
Yer, ur right jarrod ive just checked the CNC and it gives 0 out of 10.
I guess the only certain way to know if google bot is going to those pages is to serach google for a phrase or word that is on one of those page that is not in any other webpage on the web. And if the webpages comes up as a result then google bot has been on the webpage, it must have to put in googles database.

worth a try anyway,

Kris.
__________________
Website Solutions, Advice, Articles and News @ www.phizzie.com. Based in UK.
krisleech is offline   Reply With Quote
Reply


Currently Active Users Viewing This Thread: 1 (0 members and 1 visitors)
 
Thread Tools Search this Thread
Search this Thread:

Advanced Search
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

vB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Forum Jump


All times are GMT -4. The time now is 12:34 PM.


Running on vBulletin®
Copyright © 2000 - 2013, Jelsoft Enterprises Ltd.
Hosted & Administrated by FutureQuest, Inc.
Images & content copyright © 1998-2013 FutureQuest, Inc.
FutureQuest, Inc.