Chilkat
HOME
Android™
ASP
Visual Basic
VB.NET
C#
iOS (IPhone)
Objective-C
C++
C
MFC
Delphi
FoxPro
Java
Perl
PHP Extension
PHP ActiveX
Python
PowerShell
Ruby
SQL Server
VBScript
Text Formatting Tags
Perl script showing how HTML text formatting tags are dropped by default during the HTML to XML conversion process. Text formatting tags may be kept by calling UndropTextFormattingTags.
# file: TextFormattingTags.pl use chilkat; # Demonstrates how HTML text formatting tags are (by default) dropped # during the HTML to XML conversion process. To keep text formatting # tags, call UndropTextFormattingTags $htmlConv = new chilkat::CkHtmlToXml(); $success = $htmlConv->UnlockComponent("anything for 30-day trial"); if (! $success) { print "component is locked!\n"; exit; } $html = "<html><body>This <b>is</b> a <i>test</i></body></html>"; # To convert, set the HTML and get the XML: $htmlConv->put_Html($html); $xml = $htmlConv->xml(); print $xml . "\n"; # The output is this: # # <?xml version="1.0" encoding="utf-8" ?> # # <root> # <html> # <body> # <text>This is a test</text> # </body> # </html> # </root> # # # What happened to the <b> and <i> tags??? # By default, text formatting tags are dropped. # If we call UndropTextFormattingTags, the tags will remain: $htmlConv->UndropTextFormattingTags(); $xml = $htmlConv->xml(); print $xml . "\n"; # We now get this: # # <?xml version="1.0" encoding="utf-8" ?> # # <root> # <html> # <body> # <text>This </text> # <b> # <text>is</text> # </b> # <text>a </text> # <i> # <text>test</text> # </i> # </body> # </html> # </root> |
© 2000-2010 Chilkat Software, Inc. All Rights Reserved.