Chilkat
HOME
Android™
ASP
Visual Basic
VB.NET
C#
iOS (IPhone)
Objective-C
C++
C
MFC
Delphi
FoxPro
Java
Perl
PHP Extension
PHP ActiveX
Python
PowerShell
Ruby
SQL Server
VBScript
Extract all HTML Objects from a Web PageDemonstrates how to download a Web page (at a URL) and extract all HTML objects. Eg. images, links, CSS files, JavaScript files, etc. Downloads: MS Windows Visual C/C++ Libraries Linux/CentOS C/C++ Libraries MAC OS X C/C++ Libraries Solaris C/C++ Libraries C++ Builder Libraries #include <C_CkMht.h> void ChilkatSample(void) { HCkMht mht; BOOL success; const char * mhtDoc; const char * unpackDir; const char * htmlFilename; const char * partsSubdir; mht = CkMht_Create(); success = CkMht_UnlockComponent(mht,"Anything for 30-day trial"); if (success != TRUE) { printf("%s\n",CkMht_lastErrorText(mht)); return; } // Download a URL into an in-memory MHT web archive contained // in a string variable: mhtDoc = CkMht_getMHT(mht,"http://www.gopackaging.com/"); if (mhtDoc == 0 ) { printf("%s\n",CkMht_lastErrorText(mht)); return; } // Now extract the HTML and embedded objects: unpackDir = "/Users/chilkat/temp/"; htmlFilename = "gopackaging.html"; partsSubdir = "objects"; // Extract to /Users/chilkat/temp/gopackaging.html. // images and other embedded objects are placed in // /Users/chilkat/temp/objects. Directories are automatically // created if they don't already exist. success = CkMht_UnpackMHTString(mht,mhtDoc,unpackDir,htmlFilename,partsSubdir); if (success != TRUE) { printf("%s\n",CkMht_lastErrorText(mht)); } else { printf("Unpacked!\n"); } CkMht_Dispose(mht); } |
Need a specific example? Send a request to support@chilkatsoft.com
© 2000-2010 Chilkat Software, Inc. All Rights Reserved.