Diffstat (limited to 'documentation/cxx/tree/manual/index.xhtml')
-rw-r--r-- | documentation/cxx/tree/manual/index.xhtml | 18 |
1 files changed, 16 insertions, 2 deletions
diff --git a/documentation/cxx/tree/manual/index.xhtml b/documentation/cxx/tree/manual/index.xhtml
index d468fe3..91c6154 100644
--- a/documentation/cxx/tree/manual/index.xhtml
+++ b/documentation/cxx/tree/manual/index.xhtml
@@ -226,7 +226,7 @@
 <th>2.1</th><td><a href="#2.1">Preliminary Information</a>
 <table class="toc">
 <tr><th>2.1.1</th><td><a href="#2.1.1">Identifiers</a></td></tr>
-<tr><th>2.1.2</th><td><a href="#2.1.2">Character Type</a></td></tr>
+<tr><th>2.1.2</th><td><a href="#2.1.2">Character Type and Encoding</a></td></tr>
 <tr><th>2.1.3</th><td><a href="#2.1.3">XML Schema Namespace</a></td></tr>
 <tr><th>2.1.4</th><td><a href="#2.1.4">Anonymous Types</a></td></tr>
 </table>
@@ -567,7 +567,7 @@
 CONVENTION section in the <a href="http://www.codesynthesis.com/projects/xsd/documentation/xsd.xhtml">XSD Compiler Command Line Manual</a>.</p>
-<h3><a name="2.1.2">2.1.2 Character Type</a></h3>
+<h3><a name="2.1.2">2.1.2 Character Type and Encoding</a></h3>
 <p>The code that implements the mapping, depending on the
 <code>--char-type</code> option, is generated using either
@@ -577,6 +577,20 @@
 your schemas, for example <code>std::basic_string<C></code>.
 </p>
+<p>Another aspect of the mapping that depends on the character type
+is character encoding. For the <code>char</code> character type
+the default encoding is UTF-8. Other supported encodings are
+ISO-8859-1, Xerces-C++ Local Code Page (LCP), and custom
+encodings; they can be selected with the
+<code>--char-encoding</code> command line option.</p>
+
+<p>For the <code>wchar_t</code> character type the encoding is
+automatically selected between UTF-16 and UTF-32/UCS-4 depending
+on the size of the <code>wchar_t</code> type. On some platforms
+(for example, Windows with Visual C++ and AIX with IBM XL C++)
+<code>wchar_t</code> is 2 bytes long. For these platforms the
+encoding is UTF-16. On other platforms <code>wchar_t</code> is 4 bytes
+long and UTF-32/UCS-4 is used.</p>
 <h3><a name="2.1.3">2.1.3 XML Schema Namespace</a></h3>
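
The wchar_t rule described in the added paragraph (2 bytes means UTF-16, 4 bytes means UTF-32/UCS-4) can be illustrated with a small sketch. The helper names below are hypothetical and are not part of XSD or Xerces-C++; the code only mirrors the size-to-encoding mapping stated in the text and does not show how the generated code actually makes the selection.

```cpp
#include <cstddef>
#include <string>

// Hypothetical helper (an assumption for illustration, not an XSD API):
// names the encoding that the manual text associates with a wchar_t of
// the given size in bytes.
constexpr const char* wide_encoding_for_size(std::size_t size)
{
  return size == 2 ? "UTF-16"
       : size == 4 ? "UTF-32/UCS-4"
       : "unknown";
}

// Encoding expected for wchar_t on the platform that compiles this code.
inline std::string wide_encoding()
{
  return wide_encoding_for_size(sizeof(wchar_t));
}
```

Calling wide_encoding() on a given toolchain reports which of the two cases applies there, since the result depends only on sizeof(wchar_t).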