Website Babble Webmaster Forums  
  #1 (permalink)  
Old 06-04-2008, 08:07 PM
nikkii30's Avatar
Regular Babbler
 
Join Date: Oct 2007
Location: Los Angeles
Posts: 37
Question Do I need robots.txt file.

I read somewhere that one of the most fundamental steps when optimizing a website is writing a robots.txt file. Do I really need to write this in my meta tags???? and if how should i write it?
__________________
www.beautyallnatural.com
Nature Knows Best
Reply With Quote
  #2 (permalink)  
Old 06-04-2008, 09:35 PM
lisa's Avatar
Administrator
 
Join Date: Apr 2007
Location: Dallas, TX
Posts: 3,434
Default

You don't have to have one but it's good for telling the engines to ignore certain directories. For example, a cgi-bin is usually a folder that contains scripts so there is no "indexable" (I don't think that's a word content.) So the engine bots would ignore that directory. You would include a line in the robots.txt file that tells the engines to ignore that folder.

I've never used one because there are some downsides. This article is good
http://www.robotstxt.org/robotstxt.html
__________________
Don't put the cart before the horse.
Plan your website, then create it.
~~~~~~~~~~~~~~~~~~~~~~~~~

Recent Blog Posts...

Make Money as a YouTube Partner
Don't Discount Amazon's Affiliate Program
Finally! AdSense Ads for Feedburner!
Reply With Quote
  #3 (permalink)  
Old 06-12-2008, 05:15 PM
Junior Babbler
 
Join Date: May 2008
Posts: 8
Default

A robots.txt file can be very useful in some cases. For example, you might have a website with a user control panel in the admin folder. You should disallow indexing of that area. Again, you might also have a public file in that area that you may want indexed, you can then just allow that one page (don't know why you would, but this is just an example).

On many of the servers I code on, I might have php test code I am working on in a folder called testcode. I just disallow indexing of that folder in robots.txt.

I have known of programmers who keep .pdf files, that are created on the fly in a contact form to be sent as an attachment, in a public temp folder that is never cleaned (shoddy programming, but I guess we don't all think alike). Well, .pdf files can easily be indexed and their contents made aware. Google, if you noticed, has a "read as html" link feature for .pdf documents. You wouldn't want that to happen, especially on potential customer correspondence. So just disallow that temp folder.

(You also need to be careful as to what is placed on your server. Not all spiders/crawlers respect the robots.txt file and they index everything.)
Reply With Quote
Reply

Thread Tools
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are On
Pingbacks are On
Refbacks are On



All times are GMT -6. The time now is 12:46 PM.



WB Sponsors

Profit Lance Review

affiliate network

Work At Home Jobs

Expert PPC Coaching



 Subscribe to the Website Babble Feeds

2 Create a Website Homepage | 2 Create a Website Blog


Powered by vBulletin® Version 3.7.2
Copyright ©2000 - 2008, Jelsoft Enterprises Ltd.
Search Engine Optimization by vBSEO 3.1.0