Python Examples

ChilkatHOMEASPVisual BasicVB.NETC#Visual C++CMFCDelphiFoxProJavaPerlPHPPythonRubySQL ServerVBScript

Python Examples

Quick Start
Unicode
Byte Array
Bz2
Certificates
CSV
Email
Encryption
FTP
HTML-to-XML
HTTP
IMAP
MHT
MIME
POP3
RSA
S/MIME
Signatures
Socket / SSL
SFTP
SMTP
Spider
SSH Key
SSH
SSH Tunnel
Tar
HTTP Upload
XML
XMP
Zip

More Examples...
String
Email Object
FileAccess
RSS
Atom
Self-Extractor
Service
PPMD
Deflate
DH Key Exchange
DSA

Unreleased...
Bzip2
LZW
Icon

 

 

 

 

 

 

 

Extract all HTML Objects from a Web Page

Demonstrates how to download a Web page (at a URL) and extract all HTML objects. Eg. images, links, CSS files, JavaScript files, etc.

Download Chilkat Python Library

import sys
import chilkat

mht = chilkat.CkMht()

success = mht.UnlockComponent("30-day trial")
if (success != True):
    print "Mht component unlock failed"
    sys.exit()

# Download a URL into an in-memory MHT web archive contained
# in a string variable:

mhtDoc = mht.getMHT("http://www.gopackaging.com/")
# On failure, the mhtDoc will be a zero-length string.
# Check the LastErrorText property for error information.

# Now extract the HTML and embedded objects:
unpackDir = "c:/temp/"
htmlFilename = "gopackaging.html"
partsSubdir = "objects"
# Extract to c:/temp/gopackaging.html.
# images and other embedded objects are placed in
# c:/temp/objects.  Directories are automatically
# created if they don't already exist.
success = mht.UnpackMHTString(mhtDoc,unpackDir,htmlFilename,partsSubdir)
if (success != True):
    print mht.lastErrorText()
else:
    print "Unpacked!"


 

Need a specific example? Send a request to support@chilkatsoft.com

© 2000-2008 Chilkat Software, Inc. All Rights Reserved.