Chilkat Examples


Chilkat2-Python Examples

(Chilkat2-Python) Convert utf-8 Text File to Windows-1252

Demonstrates how to convert a text file from its utf-8 byte representation to windows-1252.

Note: This example requires Chilkat v11.0.0 or greater.

Install the Chilkat2 Python module with pip:

pip3 install chilkat2

Alternatively, download the Python module for Windows, Linux, Alpine Linux, or macOS from the Chilkat2 Python Downloads page.

import sys
import chilkat2

success = False

# Converts a file containing the following to windows-1252:

# <greetings>
#     <message>Hello, world!</message>
#     <message>¡Hola, mundo!</message>
#     <message>Bonjour, le monde!</message>
#     <message>Hallo, Welt!</message>
#     <message>Olá, mundo!</message>
#     <message>Привет, мир!</message>
#     <message>你好，世界！</message>
#     <message>こんにちは、世界！</message>
#     <message>안녕하세요, 세계!</message>
#     <message>😊🌍</message>
# </greetings>

# --------------------------------------------------------------------------------------------------------------------------
# Note:
# Windows-1252 is an 8-bit, single-byte encoding. It can only encode:
# 
#     The basic ASCII set (0x00–0x7F).
#     Some extra printable characters in 0x80–0x9F (curly quotes, €, etc.).
#     The Latin-1 Supplement (0xA0–0xFF).
#     In total: 256 possible byte values (five of them unassigned), covering most Western European languages but nothing outside of Latin script.

# --------------------------------------------------------------------------------------------------------------------------
# Characters in the XML that are representable
# 
#     Hello, world! ✅ (ASCII only)
#     ¡Hola, mundo! ✅ (inverted exclamation mark U+00A1 is in Windows-1252)
#     Bonjour, le monde! ✅
#     Hallo, Welt! ✅
#     Olá, mundo! ✅ (U+00E1 á is in Windows-1252)

# --------------------------------------------------------------------------------------------------------------------------
# Characters that break conversion
# 
#     Russian / Cyrillic: Привет, мир!
#     → These are Cyrillic characters (U+041F … U+0440). Not representable in Windows-1252. Conversion would require replacement (e.g. with ? or XML character references).
#     Chinese: 你好，世界！
#     → CJK ideographs (U+4F60, U+597D, etc.). Not in Windows-1252.
#     Japanese: こんにちは、世界！
#     → Hiragana + CJK. Not in Windows-1252.
#     Korean: 안녕하세요, 세계!
#     → Hangul syllables. Not in Windows-1252.
#     Emoji: 😊🌍
#     → Unicode Supplementary Multilingual Plane (U+1F60A, U+1F30D). Windows-1252 cannot encode any emoji.

bd = chilkat2.BinData()

# Load the utf-8 bytes.
success = bd.LoadFile("qa_data/xml/utf8test.xml")
if not success:
    print(bd.LastErrorText)
    # Exit with a non-zero status so callers can detect the failure.
    sys.exit(1)

# If allOrNone = True, the conversion fails and the contents of the BinData
# are left unchanged if any char is unconvertible.

# If allOrNone = False, non-convertible chars are discarded.
allOrNone = False
fromCharset = "utf-8"
toCharset = "windows-1252"
success = bd.CharsetConvert(fromCharset, toCharset, allOrNone)

# The return value will be False if any utf-8 chars were discarded because they could not be converted.
if not success:
    print("Some utf-8 chars could not be converted to windows-1252")
else:
    print("All utf-8 chars were converted to windows-1252")

success = bd.WriteFile("c:/temp/qa_output/out.xml")
if not success:
    print(bd.LastErrorText)
    sys.exit(1)

# The output file contains the following, where all non-convertible chars were discarded

# <greetings>
#     <message>Hello, world!</message>
#     <message>¡Hola, mundo!</message>
#     <message>Bonjour, le monde!</message>
#     <message>Hallo, Welt!</message>
#     <message>Olá, mundo!</message>
#     <message>, !</message>
#     <message></message>
#     <message></message>
#     <message>, !</message>
#     <message></message>
# </greetings>
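
For comparison, the two allOrNone behaviors described above can be approximated with Python's built-in codecs alone. This is only a sketch and does not use Chilkat: the helper name convert_utf8_to_cp1252 is hypothetical, and it assumes the codec error handlers "strict" and "ignore" correspond roughly to allOrNone = True and allOrNone = False.

```python
from typing import Tuple


def convert_utf8_to_cp1252(data: bytes, all_or_none: bool) -> Tuple[bytes, bool]:
    """Hypothetical helper: return (result_bytes, fully_converted).

    If every character is representable, the windows-1252 bytes are returned.
    Otherwise, with all_or_none=True the input is returned unchanged, and with
    all_or_none=False the unrepresentable characters are dropped.
    """
    text = data.decode("utf-8")
    try:
        # "strict" raises as soon as one character has no windows-1252 byte.
        return text.encode("cp1252", errors="strict"), True
    except UnicodeEncodeError:
        if all_or_none:
            return data, False
        # "ignore" silently discards every unrepresentable character.
        return text.encode("cp1252", errors="ignore"), False


sample = "<message>¡Hola, mundo!</message>\n<message>Привет, мир!</message>\n"
converted, ok = convert_utf8_to_cp1252(sample.encode("utf-8"), all_or_none=False)
print(ok)
print(converted.decode("cp1252"))
```

With all_or_none=False the Cyrillic letters are dropped while the ASCII comma, space, and exclamation mark survive, which matches the ", !" seen in the output file above.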

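The notes in the example mention that unconvertible characters could be replaced with XML character references instead of being discarded. The Chilkat example does not do this. The sketch below uses only Python's standard "xmlcharrefreplace" error handler, and the helper name to_cp1252_with_charrefs is hypothetical.

```python
def to_cp1252_with_charrefs(data: bytes) -> bytes:
    """Hypothetical helper: convert utf-8 bytes to windows-1252 bytes.

    Characters that windows-1252 cannot represent are written as XML numeric
    character references (for example &#1055;) instead of being dropped.
    """
    text = data.decode("utf-8")
    return text.encode("cp1252", errors="xmlcharrefreplace")


line = "<message>Привет, мир!</message>".encode("utf-8")
# The result is pure windows-1252, so it can be decoded and printed safely.
print(to_cp1252_with_charrefs(line).decode("cp1252"))
```

Because an XML parser resolves numeric character references, the original text stays recoverable from the converted file, at the cost of a larger and less readable output.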
 

© 2000-2025 Chilkat Software, Inc. All Rights Reserved.