Fetch robots.txt for a Site
The Chilkat Spider library is robots.txt compliant. It automatically fetches a site's robots.txt file and adheres to it. It will not download pages denied by robots.txt. Pages excluded by robots.txt will not appear in the Spider's "unspidered" list. This example shows how to explicitly download and review the robots.txt for a given site.

// The Chilkat Spider component/library is free.
Chilkat.Spider spider = new Chilkat.Spider();
spider.Initialize("www.chilkatsoft.com");

string robotsText;
robotsText = spider.FetchRobotsText();

textBox1.Text += robotsText + "\r\n";
textBox1.Refresh();
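To review the downloaded text programmatically rather than by eye, the rules can be read out of the string. The following is a minimal sketch, not Chilkat's own implementation: `RobotsSketch`, `DisallowedPrefixes`, and `IsDisallowed` are hypothetical names introduced here, and the sample text is made up for illustration. It assumes plain prefix matching for the `User-agent: *` group only and ignores `Allow` lines and wildcard patterns.

```csharp
using System;
using System.Collections.Generic;

// Hypothetical helper, not part of the Chilkat API: a simplified reader for
// robots.txt text such as the string returned by FetchRobotsText().
public static class RobotsSketch
{
    // Collects the Disallow path prefixes that apply to "User-agent: *".
    public static List<string> DisallowedPrefixes(string robotsText)
    {
        var prefixes = new List<string>();
        bool inWildcardGroup = false;
        bool lastWasUserAgent = false;

        foreach (string rawLine in robotsText.Split('\n'))
        {
            // Drop comments and surrounding whitespace (including '\r').
            string line = rawLine;
            int hash = line.IndexOf('#');
            if (hash >= 0) line = line.Substring(0, hash);
            line = line.Trim();

            int colon = line.IndexOf(':');
            if (colon < 0) continue; // blank or malformed line

            string field = line.Substring(0, colon).Trim().ToLowerInvariant();
            string value = line.Substring(colon + 1).Trim();

            if (field == "user-agent")
            {
                // Consecutive User-agent lines share one group of rules.
                bool isWildcard = (value == "*");
                inWildcardGroup = lastWasUserAgent
                    ? (inWildcardGroup || isWildcard)
                    : isWildcard;
                lastWasUserAgent = true;
                continue;
            }

            lastWasUserAgent = false;
            // An empty Disallow value means "nothing is disallowed".
            if (field == "disallow" && inWildcardGroup && value.Length > 0)
            {
                prefixes.Add(value);
            }
        }
        return prefixes;
    }

    // True if the path starts with any collected Disallow prefix.
    public static bool IsDisallowed(string path, List<string> prefixes)
    {
        foreach (string prefix in prefixes)
        {
            if (path.StartsWith(prefix, StringComparison.Ordinal)) return true;
        }
        return false;
    }

    public static void Main()
    {
        // Sample text for illustration only, not the real chilkatsoft.com file.
        string sample = "User-agent: *\r\nDisallow: /private/\r\n# a comment\r\nDisallow:\r\n";
        List<string> prefixes = DisallowedPrefixes(sample);

        Console.WriteLine(IsDisallowed("/private/page.html", prefixes)); // True
        Console.WriteLine(IsDisallowed("/index.html", prefixes));        // False
    }
}
```

In the example above, the string returned by `spider.FetchRobotsText()` could be passed to `DisallowedPrefixes` in place of the sample text. The Spider applies robots.txt on its own, so a helper like this is only useful for inspecting the rules.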
© 2000-2012 Chilkat Software, Inc. All Rights Reserved.