Sample code for 30+ languages & platforms
Ruby

Scrub Xml

See more XML Examples

Demonstrates the Scrub method for some common XML cleaning needs.

Note: This example requires Chilkat v9.5.0.77 or greater.

Chilkat Ruby Downloads

Ruby
require 'chilkat'

success = false

# The Scrub method has options for trimming whitespace from the ends of attributes and content,
# trimming inside whitespace, and for case conversion for tags and attribute names.

# Trimming inside whitespace is defined as replacing CR, LF, and tab chars with SPACE chars, and 
# removing extra SPACE chars so that at most one SPACE separates non-SPACE chars.

# For example, load the following XML:

s = "<xyz><td class=\" colHoldingPercent col7\" style=\"border-bottom-color: rgb(221, 221, 221);               border-image-outset: 0;  \"> abc\r\n123    456</td></xyz>"

xml = Chilkat::CkXml.new()
success = xml.LoadXml(s)

print xml.getXml() + "\n";

# The XML before scrubbing is:

# <xyz>
#     <td class=" colHoldingPercent col7" style="border-bottom-color: rgb(221, 221, 221);               border-image-outset: 0;  ">abc
# 123    456</td>
# </xyz>

xml.Scrub("AttrTrimEnds,AttrTrimInside,ContentTrimEnds,ContentTrimInside")
print xml.getXml() + "\n";

# The XML after scrubbing is:

# <?xml version="1.0" encoding="utf-8"?>
# <xyz>
#     <td class="colHoldingPercent col7" style="border-bottom-color: rgb(221, 221, 221); border-image-outset: 0;">abc 123 456</td>
# </xyz>