Chilkat HOME ASP Visual Basic VB.NET C# Visual C++ C MFC Delphi FoxPro Java Perl PHP Python Ruby SQL Server VBScript
|
Extract all HTML Objects from a Web PageDemonstrates how to download a Web page (at a URL) and extract all HTML objects. Eg. images, links, CSS files, JavaScript files, etc. require 'chilkat' mht = Chilkat::CkMht.new() success = mht.UnlockComponent("30-day trial") if (success != true) print "Mht component unlock failed" + "\n" exit end # Download a URL into an in-memory MHT web archive contained # in a string variable: mhtDoc = mht.getMHT("http://www.gopackaging.com/") # On failure, the mhtDoc will be a zero-length string. # Check the LastErrorText property for error information. # Now extract the HTML and embedded objects: unpackDir = "c:/temp/" htmlFilename = "gopackaging.html" partsSubdir = "objects" # Extract to c:/temp/gopackaging.html. # images and other embedded objects are placed in # c:/temp/objects. Directories are automatically # created if they don't already exist. success = mht.UnpackMHTString(mhtDoc,unpackDir,htmlFilename,partsSubdir) if (success != true) print mht.lastErrorText() + "\n" else print "Unpacked!" + "\n" end |
Need a specific example? Send a request to support@chilkatsoft.com
© 2000-2008 Chilkat Software, Inc. All Rights Reserved.