Chilkat
HOME
Android™
ASP
Visual Basic
VB.NET
C#
iOS (IPhone)
Objective-C
C++
C
MFC
Delphi
FoxPro
Java
Perl
PHP Extension
PHP ActiveX
Python
PowerShell
Ruby
SQL Server
VBScript
Dropping HTML Tag Types
Perl script showing how to drop specific HTML tag types during an HTML to XML conversion.
# file: DropTagType.pl use chilkat; # Demonstrates how specific HTML tags can be selected to be dropped # during the HTML to XML conversion process. $htmlConv = new chilkat::CkHtmlToXml(); $success = $htmlConv->UnlockComponent("anything for 30-day trial"); if (! $success) { print "component is locked!\n"; exit; } $html = "<html><body><span>This <b>is</b> a <i>test</i><hr></span></body></html>"; # First, call UndropTextFormattingTags to prevent the text formatting tags # from being dropped by default. $htmlConv->UndropTextFormattingTags(); # We'll want to drop <hr>, <i>, and <span> tags: $htmlConv->DropTagType("hr"); $htmlConv->DropTagType("i"); $htmlConv->DropTagType("span"); # To convert, set the HTML and get the XML: $htmlConv->put_Html($html); $xml = $htmlConv->xml(); print $xml . "\n"; # The output is this: # # <?xml version="1.0" encoding="utf-8" ?> # # <root> # <html> # <body> # <text>This </text> # <b> # <text>is</text> # </b> # <text>a test</text> # </body> # </html> # </root> |
© 2000-2010 Chilkat Software, Inc. All Rights Reserved.