VB.NET Examples

Avoiding Outbound Links Matching Patterns

The spider accumulates outbound links as it crawls. Your program may add any number of "avoid patterns": an outbound link that matches at least one of these wildcard patterns (using "*") is not added to the list of outbound links.

Download Chilkat .NET for 2.0 Framework

Download Chilkat .NET for 1.0 / 1.1 Framework

'  The Chilkat Spider component/library is free.
Dim spider As New Chilkat.Spider()

'  First, we'll get the outbound links for a page in the
'  Google directory.  Then we'll add some avoid patterns
'  and then re-fetch, to see it work...

spider.Initialize("directory.google.com")
spider.AddUnspidered("http://directory.google.com/Top/Recreation/Food/Cheese/")

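Dim success As Boolean
success = spider.CrawlNext()
If Not success Then
    '  The crawl failed; LastErrorText explains why.
    TextBox1.Text = TextBox1.Text & spider.LastErrorText & vbCrLf
End If

'  Display the outbound links
Dim i As Integer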
For i = 0 To spider.NumOutboundLinks - 1
    TextBox1.Text = TextBox1.Text & spider.GetOutboundLink(i) & vbCrLf
Next

'  The output:
'  http://www.cheese.com/
'  http://www.cheesediaries.com/
'  http://www.WisDairy.com/
'  http://www.newenglandcheese.com
'  http://www.ilovecheese.com
'  http://www.cheesefromspain.com
'  http://www.realcaliforniacheese.com/
'  http://www.frencheese.co.uk/
'  http://www.cheesesociety.org/
'  http://www.specialcheese.com/queso.htm
'  http://www.franceway.com/cheese/intro.htm
'  http://www.foodsubs.com/Chesfirm.html
'  http://www.cheeseboard.co.uk/
'  http://www.thecheeseweb.com/
'  http://www.vtcheese.com/
'  http://www.coldbacon.com/cheese.html
'  http://www.norwegiancheeses.co.uk/
'  http://www.reluctantgourmet.com/cheese.htm
'  http://www.lancewood.co.za/
'  http://www.switzerlandcheese.ca
'  http://www.frenchcheese.dk/
'  http://www.dolcevita.com/cuisine/cheese/cheese.htm
'  http://cheeseisland.net/
'  http://www.cheestrings.ca/
'  http://www.dreamcheese.co.uk
'  http://hgic.clemson.edu/factsheets/HGIC3506.htm
'  http://www.epicurious.com/cooking/how_to/food_dictionary/entry?id=1815
'  http://www.mousetrapcheese.co.uk
'  http://taquitos.net/yum/gc.shtml
'  http://www.greek-recipe.com/static/greek-cheese
'  http://www.park.org/Netherlands/pavilions/food_and_markets/cheese/introduction.html
'  http://www.dairyfarmers.org/engl/recipes/4_1.asp
'  http://www.prairieridgecheese.com/wischeesguid.html
'  http://dmoz.org/cgi-bin/add.cgi?where=Recreation/Food/Cheese
'  http://dmoz.org/about.html
'  http://dmoz.org/cgi-bin/apply.cgi?where=Recreation/Food/Cheese


'  Do it again, but this time with avoid patterns.
spider.Initialize("directory.google.com")
spider.AddUnspidered("http://directory.google.com/Top/Recreation/Food/Cheese/")

'  Add some avoid patterns:
spider.AddAvoidOutboundLinkPattern("*dmoz.org*")
spider.AddAvoidOutboundLinkPattern("*?id=*")
spider.AddAvoidOutboundLinkPattern("*.co.uk*")
success = spider.CrawlNext()
If Not success Then
    TextBox1.Text = TextBox1.Text & spider.LastErrorText & vbCrLf
End If

TextBox1.Text = TextBox1.Text & "-----------------------" & vbCrLf

'  Display the outbound links
For i = 0 To spider.NumOutboundLinks - 1
    TextBox1.Text = TextBox1.Text & spider.GetOutboundLink(i) & vbCrLf
Next

'  Output:
'  http://www.cheese.com/
'  http://www.cheesediaries.com/
'  http://www.WisDairy.com/
'  http://www.newenglandcheese.com
'  http://www.ilovecheese.com
'  http://www.cheesefromspain.com
'  http://www.realcaliforniacheese.com/
'  http://www.cheesesociety.org/
'  http://www.specialcheese.com/queso.htm
'  http://www.franceway.com/cheese/intro.htm
'  http://www.foodsubs.com/Chesfirm.html
'  http://www.thecheeseweb.com/
'  http://www.vtcheese.com/
'  http://www.coldbacon.com/cheese.html
'  http://www.reluctantgourmet.com/cheese.htm
'  http://www.lancewood.co.za/
'  http://www.switzerlandcheese.ca
'  http://www.frenchcheese.dk/
'  http://www.dolcevita.com/cuisine/cheese/cheese.htm
'  http://cheeseisland.net/
'  http://www.cheestrings.ca/
'  http://hgic.clemson.edu/factsheets/HGIC3506.htm
'  http://taquitos.net/yum/gc.shtml
'  http://www.greek-recipe.com/static/greek-cheese
'  http://www.park.org/Netherlands/pavilions/food_and_markets/cheese/introduction.html
'  http://www.dairyfarmers.org/engl/recipes/4_1.asp
'  http://www.prairieridgecheese.com/wischeesguid.html
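
To preview which links a set of patterns would filter without crawling again, the matching can be approximated locally. The sketch below is not Chilkat code: MatchesWildcard and ShouldAvoid are hypothetical helpers, and it assumes that "*" is the only wildcard character and that matching is case-sensitive. The library's actual matching rules may differ.

```vbnet
Imports System
Imports System.Text.RegularExpressions

Module AvoidPatternSketch

    '  Hypothetical helper (not part of Chilkat): True if url matches a
    '  pattern in which "*" stands for zero or more characters.
    '  Assumption: every other character, including "?", is literal.
    Function MatchesWildcard(ByVal url As String, ByVal pattern As String) As Boolean
        '  Escape regex metacharacters, then turn each escaped "*" into ".*"
        Dim rx As String = "^" & Regex.Escape(pattern).Replace("\*", ".*") & "$"
        Return Regex.IsMatch(url, rx)
    End Function

    '  Hypothetical helper: True if url matches at least one pattern.
    Function ShouldAvoid(ByVal url As String, ByVal patterns As String()) As Boolean
        For Each p As String In patterns
            If MatchesWildcard(url, p) Then
                Return True
            End If
        Next
        Return False
    End Function

    Sub Main()
        '  The same patterns passed to AddAvoidOutboundLinkPattern above
        Dim patterns As String() = {"*dmoz.org*", "*?id=*", "*.co.uk*"}

        '  A few links taken from the first output listing
        Dim urls As String() = { _
            "http://www.cheese.com/", _
            "http://www.cheeseboard.co.uk/", _
            "http://www.lancewood.co.za/", _
            "http://www.epicurious.com/cooking/how_to/food_dictionary/entry?id=1815", _
            "http://dmoz.org/about.html"}

        For Each u As String In urls
            If ShouldAvoid(u, patterns) Then
                Console.WriteLine("avoid: " & u)
            Else
                Console.WriteLine("keep:  " & u)
            End If
        Next
    End Sub

End Module
```

Under these assumptions the sketch keeps the cheese.com and lancewood.co.za links and flags the other three, which agrees with the second output listing above.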

Need a specific example? Send a request to support@chilkatsoft.com

© 2000-2007 Chilkat Software, Inc. All Rights Reserved.
