Extract all HTML Objects from a Web Page

Demonstrates how to download a Web page from a URL and extract all of its HTML objects, such as images, links, CSS files, and JavaScript files.

    use chilkat;

    $mht = new chilkat::CkMht();

    $success = $mht->UnlockComponent("30-day trial");
    if ($success != 1) {
        print "Mht component unlock failed" . "\n";
        exit;
    }

    # Download a URL into an in-memory MHT web archive contained
    # in a string variable:
    $mhtDoc = $mht->getMHT("http://www.gopackaging.com/");
    # On failure, the mhtDoc will be a zero-length string.
    # Check the LastErrorText property for error information.

    # Now extract the HTML and embedded objects:
    $unpackDir = "c:/temp/";
    $htmlFilename = "gopackaging.html";
    $partsSubdir = "objects";

    # Extract to c:/temp/gopackaging.html.
    # Images and other embedded objects are placed in
    # c:/temp/objects. Directories are automatically
    # created if they don't already exist.
    $success = $mht->UnpackMHTString($mhtDoc,$unpackDir,$htmlFilename,$partsSubdir);
    if ($success != 1) {
        print $mht->lastErrorText() . "\n";
    }
    else {
        print "Unpacked!" . "\n";
    }
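After a successful unpack, it can be useful to see which embedded objects were actually written to the parts subdirectory. The following is a minimal sketch that uses only core Perl modules and makes no Chilkat calls. The helper name list_unpacked_objects is hypothetical (it is not part of the Chilkat API), and the sketch assumes the objects are stored as plain files directly inside the parts subdirectory, as the comments in the example above describe.

```perl
use strict;
use warnings;
use File::Spec;

# Hypothetical helper (not part of Chilkat): return the sorted names of the
# regular files found directly inside "$unpack_dir/$parts_subdir".
sub list_unpacked_objects {
    my ($unpack_dir, $parts_subdir) = @_;
    my $parts_dir = File::Spec->catdir($unpack_dir, $parts_subdir);

    # If the directory cannot be opened, treat it as "no objects extracted".
    opendir(my $dh, $parts_dir) or return ();

    # Keep regular files only; this also skips "." and ".." and subdirectories.
    my @files = grep { -f File::Spec->catfile($parts_dir, $_) } readdir($dh);
    closedir($dh);

    @files = sort @files;
    return @files;
}

# Same directory values as in the example above.
print "$_\n" for list_unpacked_objects("c:/temp/", "objects");
```

The helper returns an empty list when the directory is missing, so it is safe to call even if the unpack step failed.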
© 2000-2007 Chilkat Software, Inc. All Rights Reserved.