Remove files that aren't used, including Unihan.zip
diff --git a/ucd/DerivedProperties.html b/ucd/DerivedProperties.html
deleted file mode 100644
index 491680d..0000000
--- a/ucd/DerivedProperties.html
+++ /dev/null
@@ -1,19 +0,0 @@
-<!doctype HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">
-<html>

-

-<head>

-<meta http-equiv="Content-Type" content="text/html; charset=utf-8">

-<meta name="GENERATOR" content="Microsoft FrontPage 4.0">

-<meta name="ProgId" content="FrontPage.Editor.Document">

-<title>Unicode Character Database Documentation</title>

-</head>

-

-<body>

-

-<p>Starting with Version 4.0.0, most of the documentation for the Unicode 

-Character Database, including material formerly in this file, has been 

-consolidated into UCD.html.</p>

-

-</body>

-

-</html>

diff --git a/ucd/NamesList.html b/ucd/NamesList.html
deleted file mode 100644
index 688a6d0..0000000
--- a/ucd/NamesList.html
+++ /dev/null
@@ -1,355 +0,0 @@
-<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN"

-

-       "http://www.w3.org/TR/REC-html40/loose.dtd"> 

-

-<html>

-

-<head>

-<meta http-equiv="Content-Type" content="text/html; charset=utf-8">

-<meta http-equiv="Content-Language" content="en-us">

-<meta name="GENERATOR" content="Microsoft FrontPage 6.0">

-<meta name="ProgId" content="FrontPage.Editor.Document">

-<meta name="keywords" content="unicode, normalization, composition, decomposition">

-<meta name="description" content="Specifies the Unicode Normalization Formats">

-<title>UCD: Unicode NamesList File Format</title>

-<link rel="stylesheet" type="text/css" href="http://www.unicode.org/unicode/reports/reports.css">

-<style type="text/css">

-

-<!--

-

-.foo         {  }

--->

-

-</style>

-</head>

-

-<body bgcolor="#ffffff">

-

-<table class="header">

-  <tr>

-    <td class="icon"><a href="http://www.unicode.org"><img border="0" src="http://www.unicode.org/webscripts/logo60s2.gif" align="middle" alt="[Unicode]" width="34" height="33"></a>&nbsp;&nbsp;<a class="bar" href="http://www.unicode.org/ucd/">Unicode    

-      Character Database</a></td>   

-  </tr>

-  <tr>

-    <td class="gray">&nbsp;</td>

-  </tr>

-</table>

-<div class="body">

-  <h1>Unicode NamesList File Format</h1>   

-  <table class="wide">

-    <tbody>

-      <tr>

-        <td valign="top" width="144">Revision</td>

-        <td valign="top">4.1.0</td>

-      </tr>

-      <tr>

-        <td valign="top" width="144">Authors</td>

-        <td valign="top">Asmus Freytag</td>

-      </tr>

-      <tr>

-        <td valign="top" width="144">Date</td>

-        <td valign="top">2005-3-10</td>

-      </tr>

-      <tr>

-        <td valign="top" width="144">This Version</td>

-        <td valign="top">

-		<a href="http://www.unicode.org/Public/4.1.0/ucd/NamesList.html">http://www.unicode.org/Public/4.1.0/ucd/NamesList.html</a></td>

-      </tr>

-      <tr>

-        <td valign="top" width="144">Previous Version</td>

-        <td valign="top"><a href="http://www.unicode.org/Public/4.0-Update/NamesList-4.0.0.html">http://www.unicode.org/Public/4.0-Update/NamesList-4.0.0.html</a></td>

-      </tr>

-      <tr>

-        <td valign="top" width="144">Latest Version</td>

-        <td valign="top"><a href="http://www.unicode.org/Public/UNIDATA/NamesList.html">http://www.unicode.org/Public/UNIDATA/NamesList.html</a></td>

-      </tr>

-    </tbody>

-  </table>

-  <h3><br>

-  <i>Summary</i></h3>

-  <blockquote>

-    <p>This file describes the format and contents of NamesList.txt</p>

-  </blockquote>

-  <h3><i>Status</i></h3>

-  <blockquote>

-    <p><i>The file and the files described herein are part of the <a href="http://www.unicode.org/ucd/">Unicode 

-    Character Database</a> (UCD) and are governed by the <a href="#Terms of Use">UCD 

-    Terms of Use</a> stated at the end.</i></p>

-  </blockquote>

-  <hr width="50%">

-  <h2>1.0 Introduction</h2>

-  <p>The Unicode name list file NamesList.txt (also NamesList.lst) is a plain  

-  text file used to drive the layout of the character code charts in the Unicode  

-  Standard. The information in this file is a combination of several fields from  

-  the UnicodeData.txt and Blocks.txt files, together with additional annotations  

-  for many characters.&nbsp;</p>

-  <p> This document describes the syntax rules for the file  

-  format, but also gives brief information on how each construct is rendered  

-  when laid out for the book. Some of the syntax elements were used in  

-  preparation of the drafts of the book and may not be present in the final,  

-  released form of the NamesList.txt file.</p> 

-  <p>The same input file can be used to do the draft preparation for ISO/IEC  

-  10646 (referred below as ISO-style). This necessitates the presence of some  

-  information in the name list file that is not needed (and in fact removed  

-  during parsing) for the Unicode code charts.</p>

-  <p>With access to the layout program (<a href="http://www.unicode.org/unibook/">unibook.exe</a>) it is a simple matter of   

-  creating name lists for the purpose of formatting working drafts containing   

-  proposed characters.</p>  

-  <p>The content of the NamesList.txt file is optimized for code chart creation. 

-  Some information that can be inferred by the reader from context has been 

-  suppressed to make the code charts more readable.&nbsp;</p> 

-  <h3>1.1 NamesList File Overview</h3>

-  <p>The *.lst files are plain text files which in their most simple form look 

-  like this</p>

-  <p>@@&lt;tab&gt;0020&lt;tab&gt;BASIC LATIN&lt;tab&gt;007F<br>

-  ; this is a file comment (ignored)<br>

-  0020&lt;tab&gt;SPACE<br>

-  0021&lt;tab&gt;EXCLAMATION MARK<br>

-  0022&lt;tab&gt;QUOTATION MARK<br>

-  . . .<br>

-  007F&lt;tab&gt;DELETE</p>

-  <p>The semicolon (as first character), @ and &lt;tab&gt; characters are used 

-  by the file syntax and must be provided as shown. Hexadecimal digits must be 

-  in UPPER CASE. A double @@ introduces a block header, with the title, and 

-  start and ending code of the block provided as shown.</p>

-  <p>For an ISO-style, minimal name list, only the NAME_LINE and BLOCKHEADER and 

-  their constituent syntax elements are needed.</p>

-  <p>The full syntax with all the options is provided in the following sections.</p>

-  <h3>1.2 NamesList File Structure</h3>

-  <p>This section gives defines the overall file structure</p>

-  <pre><strong>NAMELIST:     TITLE_PAGE* BLOCK* 

-</strong>

-<strong>TITLE_PAGE:   TITLE 

-		| TITLE_PAGE SUBTITLE 

-		| TITLE_PAGE SUBHEADER 

-		| TITLE_PAGE IGNORED_LINE 

-		| TITLE_PAGE EMPTY_LINE

-		| TITLE_PAGE COMMENT_LINE

-		| TITLE_PAGE NOTICE_LINE

-		| TITLE_PAGE PAGEBREAK 

-</strong>

-<strong>BLOCK:	      BLOCKHEADER 

-		| BLOCK CHAR_ENTRY 

-		| BLOCK SUBHEADER 

-		| BLOCK NOTICE_LINE 

-		| BLOCK EMPTY_LINE 

-		| BLOCK IGNORED_LINE 

-		| BLOCK PAGEBREAK

-

-CHAR_ENTRY:   NAME_LINE | RESERVED_LINE

-		| CHAR_ENTRY ALIAS_LINE

-		| CHAR_ENTRY COMMENT_LINE

-		| CHAR_ENTRY CROSS_REF

-		| CHAR_ENTRY DECOMPOSITION

-		| CHAR_ENTRY COMPAT_MAPPING

-		| CHAR_ENTRY IGNORED_LINE

-		| CHAR_ENTRY EMPTY_LINE

-		| CHAR_ENTRY NOTICE

-</strong></pre>

-  <p>In other words:<br>   

-  <br>

-  Neither TITLE nor&nbsp; SUBTITLE may occur after the first BLOCKHEADER.</p>   

-  <p>Only TITLE, SUBTITLE, SUBHEADER, PAGEBREAK, COMMENT_LINE,&nbsp; and    

-  IGNORED_LINE may occur before the first BLOCKHEADER.</p>   

-  <p>Directly following either a NAME_LINE or a RESERVED_LINE an uninterrupted  

-  sequence of the following lines may occur (in any order and repeated as often  

-  as needed): ALIAS_LINE, CROSS_REF, DECOMPOSITION, COMPAT_MAPPING, NOTICE_LINE,  

-  EMPTY_LINE and IGNORED_LINE.</p> 

-  <p>Except for EMPTY_LINE, NOTICE_LINE and IGNORED_LINE, none of these lines may  

-  occur in any other place.</p> 

-  <p><b>Note</b>: A NOTICE_LINE displays differently depending on whether it follows a   

-  header or title or is part of a CHAR_ENTRY.</p>  

-  <h3>1.3 NamesList File Elements</h3>

-  <p>This section provides the details of the syntax for the individual 

-  elements.</p>

-  <pre style="font-size:8pt"><strong>ELEMENT		SYNTAX</strong>	// How rendered

-  

-<strong>NAME_LINE:	CHAR &lt;tab&gt; NAME LF

-</strong>			// the CHAR and the corresponding image are echoed, 

-			// followed by the name as given in LINE

-

-<strong>		CHAR TAB &quot;&lt;&quot; LCNAME &quot;&gt;&quot; LF

-</strong>			// control and non-characters use this form of									

-			// lower case, bracketed pseudo character name</pre>

-  <pre style="font-size:8pt"><strong>		CHAR TAB NAME COMMENT LF

-</strong>			// Names may have a comment, which is stripped off

-			// unless the file is parsed for an ISO style list

-										

-<strong>RESERVED_LINE:	CHAR TAB &lt;reserved&gt;		

-</strong>			// the CHAR is echoed followed by an icon for the

-			// reserved character and a fixed string e.g. &lt;reserved&gt;

-	

-<strong>COMMMENT_LINE:	&lt;tab&gt; &quot;*&quot; SP EXPAND_LINE

-</strong>			// * is replaced by BULLET, output line as comment

-		<strong>&lt;tab&gt; EXPAND_LINE</strong>	

-			// output line as comment

-

-<strong>ALIAS_LINE:	&lt;tab&gt; &quot;=&quot; SP LINE	

-</strong>			// replace = by itself, output line as alias

-

-<strong>CROSS_REF:	&lt;tab&gt; &quot;X&quot; SP EXPAND_LINE	

-</strong>			// X is replaced by a right arrow

-		<strong>&lt;tab&gt; &quot;X&quot; SP &quot;(&quot; LCNAME SP &quot;-&quot; SP CHAR &quot;)&quot;	

-</strong>			// X is replaced by a right arrow

-			// the &quot;(&quot;, &quot;-&quot;, &quot;)&quot; are removed, the

-			// order of CHAR and STRING is reversed

-			// i.e. both inputs result in the same output

-

-<b>FILE_COMMENT:</b>	<b>&quot;;&quot;</b> <b>LINE</b><strong>

-IGNORED_LINE:	&lt;tab&gt; &quot;;&quot; EXPAND_LINE

-EMPTY_LINE:	LF			

-</strong>			// empty and ignored lines as well as 

-			// file comments are ignored

-

-<strong>DECOMPOSITION:	&lt;tab&gt; &quot;:&quot; EXPAND_LINE	

-</strong>			// replace ':' by EQUIV, expand line into 

-			// decomposition 

-

-<strong>COMPAT_MAPPING:	&lt;tab&gt; &quot;#&quot; SP EXPAND_LINE	

-</strong>			// replace '#' by APPROX, output line as mapping 

-

-<strong>NOTICE_LINE:	&quot;@+&quot; &lt;tab&gt; LINE		

-</strong>			// skip '@+', output text as notice

-<strong>		&quot;@+&quot; &lt;tab&gt; * SP LINE	

-</strong>			// skip '@+', output text as notice

-			// &quot;*&quot; expands to a bullet character

-			// Notices following a character code apply to the

-			// character and are indented. Notices not following

-			// a character code apply to the page/block/column 

-			// and are italicized, but not indented

-

-<strong>SUBTITLE:	&quot;@@@+&quot; &lt;tab&gt; LINE	

-</strong>			// skip &quot;@@@+&quot;, output text as subtitle

-

-<strong>SUBHEADER:	&quot;@&quot; &lt;tab&gt; LINE	

-</strong>			// skip '@', output line as text as column header

-

-<strong>BLOCKHEADER:	&quot;@@&quot; &lt;tab&gt; BLOCKSTART &lt;tab&gt; BLOCKNAME &lt;tab&gt; BLOCKEND

-</strong>			// skip &quot;@@&quot;, cause a page break and optional

-			// blank page, then output one or more charts

-			// followed by the list of character names. 

-			// use BLOCKSTART and BLOCKEND to define the 

-			// characters belonging to a block

-			// use blockname in page and table headers</pre>

-  <pre style="font-size:8pt"><b>BLOCKNAME:	LABEL

-		LABEL SP &quot;(&quot; LABEL &quot;)&quot;</b>			

-			// if an alternate label is present it replaces 

-			// the blockname when an ISO-style namelist is

-			// laid out; it is ignored in the Unicode charts

-

-<strong>BLOCKSTART:	CHAR</strong>	// first character position in block

-<strong>BLOCKEND:		CHAR</strong>	// last character position in block

-<strong>PAGE_BREAK:	&quot;@@&quot;</strong>	// insert a column break

-

-<strong>TITLE:		&quot;@@@&quot; &lt;tab&gt; LINE</strong>	

-			// skip &quot;@@@&quot;, output line as text

-			// Title is used in page headers

-

-<strong>EXPAND_LINE:	{CHAR | STRING}+ LF	</strong>

-			// all instances of CHAR *) are replaced by 

-			// CHAR NBSP x NBSP where x is the single Unicode

-			// character corresponding to char

-			// If character is combining, it is replaced with

-			// CHAR NBSP &lt;circ&gt; x NBSP where &lt;circ&gt; is the 

-			// dotted circle</pre>

-  <p><strong>Notes:</strong></p>

-  <ul>

-    <li>Blocks must be aligned on 16-code point boundary and contain an integer  

-      multiple of 16 code point columns. The exception to that rule is for blocks of  

-      ideographs etc. for which no names are listed in the file. Such blocks  

-      must end on the actual last character.</li> 

-    <li>Blocks must be non-overlapping and in ascending order.&nbsp; Namelines    

-      must be in ascending order and follow the block header for the block to    

-      which they belong.</li>   

-    <li>Reserved entries are optional, and will normally be supplied automatically. They  

-      are required whenever followed by ALIAS_LINE, COMMENT_LINE, NOTICE_LINE or CROSS_REF</li> 

-    <li>The French version of the nameslist uses French rules, which allow 

-      apostrophe and accented letters in character names.</li>

-  </ul>

-  <h3><strong>1.4 NamesList File Primitives</strong></h3>

-  <p>The following are the primitives and terminals for the NamesList syntax.</p>

-  <pre style="font-size:8pt"><strong>LINE:		STRING LF

-COMMENT:		&quot;(&quot; LABEL &quot;)&quot;

-		&quot;(&quot; LABEl &quot;)&quot; SP &quot;*&quot;

-</strong>		&quot;*&quot;<strong> </strong>

-<strong>BLOCKNAME:</strong>	&lt;sequence of Latin-1 characters, except &quot;(&quot; and &quot;)&quot;&gt; 

-<strong>NAME</strong>:	  	&lt;sequence of uppercase ASCII letters, digits and hyphen&gt; 

-<b>LCNAME:		</b>&lt;sequence of lowercase ASCII letters, digits and hyphen&gt;

-<strong>STRING</strong>:	  	&lt;sequence of Latin-1 characters, except space and controls&gt; 

-<strong>LABEL</strong>:	  	&lt;sequence of Latin-1 characters, except space, controls, &quot;(&quot; or &quot;)&quot;&gt; 

-<strong>CHAR</strong>:		<strong>X X X X</strong>

-		<strong>| X X X X X</strong>

-		<strong>| X X X X X X</strong>

-<strong>X:	  	&quot;0&quot;|&quot;1&quot;|&quot;2&quot;|&quot;3&quot;|&quot;4&quot;|&quot;5&quot;|&quot;6&quot;|&quot;7&quot;|&quot;8&quot;|&quot;9&quot;|&quot;A&quot;|&quot;B&quot;|&quot;C&quot;|&quot;D&quot;|&quot;E&quot;|&quot;F&quot; 

-&lt;tab&gt;:</strong>	  	&lt;sequence of one or more ASCII tab characters 0x09&gt;	

-<strong>SP</strong>:	  	&lt;ASCII 0x20&gt;

-<strong>LF</strong>:	  	&lt;any sequence of ASCII 0x0A and 0x0D&gt;

-</pre>

-  <p><strong>Notes:</strong>

-  <ul>

-    <li>Special lookahead logic prevents a mention of a 4 digit number for a standard, such  

-      as ISO 9999 from being misinterpreted as ISO CHAR. The hyphen in a character  

-      range CHAR-CHAR is replaced by an EN DASH on output.</li>

-    <li>The final LF in the file must be present</li>

-    <li>A CHAR inside ' or &quot; is expanded, but only its glyph image is    

-      printed,&nbsp; the code value is not echoed.</li>   

-    <li>Single and double straight quotes in an EXPAND_LINE are replaced by curly quotes using   

-      English rules. Smart apostrophes are supported, but nested quotes are not.</li>  

-    <li>The NamesList.txt file is encoded in Latin-1. While the code chart  

-      formatter can accept files in either Latin-1 and little-endian UTF-16,  

-      prefixed with a BOM, the character repertoire for running text (anything  

-      other than CHAR) is effectively restricted to Latin-1 characters.</li> 

-  </ul>

-  <h2>Modifications</h2>

-  <p><b>Version 4.0.0<br>

-  </b>&nbsp;&nbsp;&nbsp; Fix syntax to better reflect restrictions on characters 

-  in character and block names.<br>

-  &nbsp;&nbsp;&nbsp; Better document treatment of comments in block names, plus 

-  French name rules.</p>

-  <p><b>Version 3.2.0<br> 

-  </b>&nbsp;&nbsp;&nbsp; Fixed several broken links, added a left margin,  

-  changed version numbering.</p>

-  <p><b>Version 3.1.0 (2)<br>

-  </b>&nbsp;&nbsp;&nbsp; Use of 4-6 digit hex notation is now supported.</p> 

-  <hr width="50%">

-  <h2>UCD <a name="Terms of Use">Terms of Use</a></h2>

-  <h3><i>Disclaimer</i></h3>

-  <blockquote>

-    <p><i>The Unicode Character Database is provided as is by Unicode, Inc. No 

-    claims are made as to fitness for any particular purpose. No warranties of 

-    any kind are expressed or implied. The recipient agrees to determine 

-    applicability of information provided. If this file has been purchased on 

-    magnetic or optical media from Unicode, Inc., the sole remedy for any claim 

-    will be exchange of defective media within 90 days of receipt.</i></p>

-    <p><i>This disclaimer is applicable for all other data files accompanying 

-    the Unicode Character Database, some of which have been compiled by the 

-    Unicode Consortium, and some of which have been supplied by other sources.</i></p>

-  </blockquote>

-  <h3><i>Limitations on Rights to Redistribute This Data</i></h3>

-  <blockquote>

-    <p><i>Recipient is granted the right to make copies in any form for internal 

-    distribution and to freely use the information supplied in the creation of 

-    products supporting the Unicode<sup>TM</sup> Standard. The files in the 

-    Unicode Character Database can be redistributed to third parties or other 

-    organizations (whether for profit or not) as long as this notice and the 

-    disclaimer notice are retained. Information can be extracted from these 

-    files and used in documentation or programs, as long as there is an 

-    accompanying notice indicating the source.</i></p>

-  </blockquote>

-  <hr width="50%">

-  <div align="center">

-    <center>

-    <table cellspacing="0" cellpadding="0" border="0">

-      <tr>

-        <td><a href="http://www.unicode.org/unicode/copyright.html"><img src="http://www.unicode.org/img/hb_notice.gif" border="0" alt="Access to Copyright and terms of use" width="216" height="50"></a></td>

-      </tr>

-    </table>

-        <script language="Javascript" type="text/javascript" 

-        src="http://www.unicode.org/webscripts/lastModified.js"></script>

-    </center>

-  </div>

-</div>

-

-</body>

-

-</html>

diff --git a/ucd/PropList.html b/ucd/PropList.html
deleted file mode 100644
index 491680d..0000000
--- a/ucd/PropList.html
+++ /dev/null
@@ -1,19 +0,0 @@
-<!doctype HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">
-<html>

-

-<head>

-<meta http-equiv="Content-Type" content="text/html; charset=utf-8">

-<meta name="GENERATOR" content="Microsoft FrontPage 4.0">

-<meta name="ProgId" content="FrontPage.Editor.Document">

-<title>Unicode Character Database Documentation</title>

-</head>

-

-<body>

-

-<p>Starting with Version 4.0.0, most of the documentation for the Unicode 

-Character Database, including material formerly in this file, has been 

-consolidated into UCD.html.</p>

-

-</body>

-

-</html>

diff --git a/ucd/StandardizedVariants.html b/ucd/StandardizedVariants.html
deleted file mode 100644
index 3b7c311..0000000
--- a/ucd/StandardizedVariants.html
+++ /dev/null
@@ -1,556 +0,0 @@
-<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN"

-

-       "http://www.w3.org/TR/REC-html40/loose.dtd"> 

-

-<html>

-

-<head>

-<meta http-equiv="Content-Type" content="text/html; charset=utf-8">

-<meta http-equiv="Content-Language" content="en-us">

-<meta name="GENERATOR" content="Microsoft FrontPage 6.0">

-<meta name="ProgId" content="FrontPage.Editor.Document">

-<meta name="keywords" content="unicode, variant glyphs">

-<meta name="description" content="Describes and displays standardized variant glyphs">

-<title>Standardized Variants</title>

-<link rel="stylesheet" type="text/css" href="http://www.unicode.org/reports/reports.css">

-</head>

-

-<body bgcolor="#ffffff">

-

-<table class="header">

-  <tr>

-    <td class="icon"><a href="http://www.unicode.org"><img align="middle" alt="[Unicode]" border="0" src="http://www.unicode.org/webscripts/logo60s2.gif" width="34" height="33"></a>&nbsp;&nbsp;<a class="bar" href="http://www.unicode.org/ucd/">Unicode 

-      Character Database</a></td>

-  </tr>

-  <tr>

-    <td class="gray">&nbsp;</td>

-  </tr>

-</table>

-<blockquote>

-  <h1>Standardized Variants</h1>

-  <table class="wide">

-    <tbody>

-      <tr>

-        <td valign="top" width="144">Revision</td>

-        <td valign="top">4.1.0</td>

-      </tr>

-      <tr>

-        <td valign="top" width="144">Authors</td>

-        <td valign="top">Members of the Editorial Committee</td>

-      </tr>

-      <tr>

-        <td valign="top" width="144">Date</td>

-        <td valign="top">2005-03-30</td>

-      </tr>

-      <tr>

-        <td valign="top" width="144">This Version</td>

-        <td valign="top">

-		<a href="http://www.unicode.org/Public/4.1.0/ucd/StandardizedVariants.html">http://www.unicode.org/Public/4.1.0/ucd/StandardizedVariants.html</a></td>

-      </tr>

-      <tr>

-        <td valign="top" width="144">Previous Version</td>

-        <td valign="top">

-		<a href="http://www.unicode.org/Public/4.0-Update/StandardizedVariants-4.0.0.html">

-		http://www.unicode.org/Public/4.0-Update/StandardizedVariants-4.0.0.html</a></td>

-      </tr>

-      <tr>

-        <td valign="top" width="144">Latest Version</td>

-        <td valign="top"><a href="http://www.unicode.org/Public/UNIDATA/StandardizedVariants.html">http://www.unicode.org/Public/UNIDATA/StandardizedVariants.html</a></td>

-      </tr>

-    </tbody>

-  </table>

-  <h3><br>

-  <i>Summary</i></h3>

-  <blockquote>

-    <p>This file provides a visual display of the standard variant sequences 

-    derived from StandardizedVariants.txt.</p>

-  </blockquote>

-  <h3><i>Status</i></h3>

-  <blockquote>

-    <p><i>This file and the files described herein are part of the Unicode 

-	Character Database and are governed by the terms of use at

-	<a href="http://www.unicode.org/terms_of_use.html">

-	http://www.unicode.org/terms_of_use.html</a>.</i> </p>

-  </blockquote>

-  <hr width="50%">

-  <h2>Introduction</h2>

-  <p>The tables here <i>exhaustively</i> lists the valid, registered 

-  combinations of base character plus variation indicator. All combinations not 

-  listed in StandardizedVariants.txt are unspecified and are reserved for future 

-  standardization; no conformant process may interpret them as standardized 

-  variants. Variation selectors and their use are described in The Unicode 

-  Standard.</p>

-  <p>These mathematical variants are all produced with the addition of Variation 

-  Selector 1 (VS1 or U+FE00) to mathematical operator base characters. There is 

-  no variation according to context. The Mongolian variants use the Mongolian 

-  Variant Selectors, and may vary according to context. That is, if a contextual 

-  shape is not listed below, then the variation sequence has an unmodified 

-  appearance. At this time no Han variants exist.</p>

-  <blockquote>

-    <p><a name="fonts"><b>Note: </b></a>The glyphs used to show the variations 

-    are often derived from different physical fonts than the representative 

-    glyphs in the standard. They may therefore exhibit minor differences in 

-    size, proportion, or weight <i>unrelated</i> to the intentional difference 

-    in feature that is the defining element of the variation. Such minor 

-    differences should be ignored. Likewise, in some cases the existing 

-    representative fonts may not yet contain newly encoded characters and hence 

-    some representative glyphs shown in these tables may have a slightly 

-    different style than others.</p>

-  </blockquote>

-  <p><table><tr><th>Rep Glyph</th><th>Character Sequence</th><th>Context</th><th width='10%'>Alt Glyph</th><th>Description of variant appearance</th></tr><tr><td><img alt='U+2229' src='http://www.unicode.org/cgi-bin/refglyph?24-2229'></td>

-<td>2229 FE00</td>

-<td></td>

-<td><img alt='U+2229+U+FE00/' src='http://www.unicode.org/cgi-bin/varglyph?24-2229-FE00'></td>

-<td>INTERSECTION  with serifs</td>

-</tr><tr><td><img alt='U+222A' src='http://www.unicode.org/cgi-bin/refglyph?24-222A'></td>

-<td>222A FE00</td>

-<td></td>

-<td><img alt='U+222A+U+FE00/' src='http://www.unicode.org/cgi-bin/varglyph?24-222A-FE00'></td>

-<td>UNION  with serifs</td>

-</tr><tr><td><img alt='U+2268' src='http://www.unicode.org/cgi-bin/refglyph?24-2268'></td>

-<td>2268 FE00</td>

-<td></td>

-<td><img alt='U+2268+U+FE00/' src='http://www.unicode.org/cgi-bin/varglyph?24-2268-FE00'></td>

-<td>LESS-THAN BUT NOT EQUAL TO  with vertical stroke</td>

-</tr><tr><td><img alt='U+2269' src='http://www.unicode.org/cgi-bin/refglyph?24-2269'></td>

-<td>2269 FE00</td>

-<td></td>

-<td><img alt='U+2269+U+FE00/' src='http://www.unicode.org/cgi-bin/varglyph?24-2269-FE00'></td>

-<td>GREATER-THAN BUT NOT EQUAL TO  with vertical stroke</td>

-</tr><tr><td><img alt='U+2272' src='http://www.unicode.org/cgi-bin/refglyph?24-2272'></td>

-<td>2272 FE00</td>

-<td></td>

-<td><img alt='U+2272+U+FE00/' src='http://www.unicode.org/cgi-bin/varglyph?24-2272-FE00'></td>

-<td>LESS-THAN OR EQUIVALENT TO  following the slant of the lower leg</td>

-</tr><tr><td><img alt='U+2273' src='http://www.unicode.org/cgi-bin/refglyph?24-2273'></td>

-<td>2273 FE00</td>

-<td></td>

-<td><img alt='U+2273+U+FE00/' src='http://www.unicode.org/cgi-bin/varglyph?24-2273-FE00'></td>

-<td>GREATER-THAN OR EQUIVALENT TO  following the slant of the lower leg</td>

-</tr><tr><td><img alt='U+228A' src='http://www.unicode.org/cgi-bin/refglyph?24-228A'></td>

-<td>228A FE00</td>

-<td></td>

-<td><img alt='U+228A+U+FE00/' src='http://www.unicode.org/cgi-bin/varglyph?24-228A-FE00'></td>

-<td>SUBSET OF WITH NOT EQUAL TO  with stroke through bottom members</td>

-</tr><tr><td><img alt='U+228B' src='http://www.unicode.org/cgi-bin/refglyph?24-228B'></td>

-<td>228B FE00</td>

-<td></td>

-<td><img alt='U+228B+U+FE00/' src='http://www.unicode.org/cgi-bin/varglyph?24-228B-FE00'></td>

-<td>SUPERSET OF WITH NOT EQUAL TO  with stroke through bottom members</td>

-</tr><tr><td><img alt='U+2293' src='http://www.unicode.org/cgi-bin/refglyph?24-2293'></td>

-<td>2293 FE00</td>

-<td></td>

-<td><img alt='U+2293+U+FE00/' src='http://www.unicode.org/cgi-bin/varglyph?24-2293-FE00'></td>

-<td>SQUARE CAP  with serifs</td>

-</tr><tr><td><img alt='U+2294' src='http://www.unicode.org/cgi-bin/refglyph?24-2294'></td>

-<td>2294 FE00</td>

-<td></td>

-<td><img alt='U+2294+U+FE00/' src='http://www.unicode.org/cgi-bin/varglyph?24-2294-FE00'></td>

-<td>SQUARE CUP  with serifs</td>

-</tr><tr><td><img alt='U+2295' src='http://www.unicode.org/cgi-bin/refglyph?24-2295'></td>

-<td>2295 FE00</td>

-<td></td>

-<td><img alt='U+2295+U+FE00/' src='http://www.unicode.org/cgi-bin/varglyph?24-2295-FE00'></td>

-<td>CIRCLED PLUS  with white rim</td>

-</tr><tr><td><img alt='U+2297' src='http://www.unicode.org/cgi-bin/refglyph?24-2297'></td>

-<td>2297 FE00</td>

-<td></td>

-<td><img alt='U+2297+U+FE00/' src='http://www.unicode.org/cgi-bin/varglyph?24-2297-FE00'></td>

-<td>CIRCLED TIMES  with white rim</td>

-</tr><tr><td><img alt='U+229C' src='http://www.unicode.org/cgi-bin/refglyph?24-229C'></td>

-<td>229C FE00</td>

-<td></td>

-<td><img alt='U+229C+U+FE00/' src='http://www.unicode.org/cgi-bin/varglyph?24-229C-FE00'></td>

-<td>CIRCLED EQUALS  with equal sign touching the circle</td>

-</tr><tr><td><img alt='U+22DA' src='http://www.unicode.org/cgi-bin/refglyph?24-22DA'></td>

-<td>22DA FE00</td>

-<td></td>

-<td><img alt='U+22DA+U+FE00/' src='http://www.unicode.org/cgi-bin/varglyph?24-22DA-FE00'></td>

-<td>LESS-THAN EQUAL TO OR GREATER-THAN  with slanted equal</td>

-</tr><tr><td><img alt='U+22DB' src='http://www.unicode.org/cgi-bin/refglyph?24-22DB'></td>

-<td>22DB FE00</td>

-<td></td>

-<td><img alt='U+22DB+U+FE00/' src='http://www.unicode.org/cgi-bin/varglyph?24-22DB-FE00'></td>

-<td>GREATER-THAN EQUAL TO OR LESS-THAN  with slanted equal</td>

-</tr><tr><td><img alt='U+2A3C' src='http://www.unicode.org/cgi-bin/refglyph?24-2A3C'></td>

-<td>2A3C FE00</td>

-<td></td>

-<td><img alt='U+2A3C+U+FE00/' src='http://www.unicode.org/cgi-bin/varglyph?24-2A3C-FE00'></td>

-<td>INTERIOR PRODUCT  tall variant with narrow foot</td>

-</tr><tr><td><img alt='U+2A3D' src='http://www.unicode.org/cgi-bin/refglyph?24-2A3D'></td>

-<td>2A3D FE00</td>

-<td></td>

-<td><img alt='U+2A3D+U+FE00/' src='http://www.unicode.org/cgi-bin/varglyph?24-2A3D-FE00'></td>

-<td>RIGHTHAND INTERIOR PRODUCT  tall variant with narrow foot</td>

-</tr><tr><td><img alt='U+2A9D' src='http://www.unicode.org/cgi-bin/refglyph?24-2A9D'></td>

-<td>2A9D FE00</td>

-<td></td>

-<td><img alt='U+2A9D+U+FE00/' src='http://www.unicode.org/cgi-bin/varglyph?24-2A9D-FE00'></td>

-<td>SIMILAR OR LESS-THAN  with similar following the slant of the upper leg</td>

-</tr><tr><td><img alt='U+2A9E' src='http://www.unicode.org/cgi-bin/refglyph?24-2A9E'></td>

-<td>2A9E FE00</td>

-<td></td>

-<td><img alt='U+2A9E+U+FE00/' src='http://www.unicode.org/cgi-bin/varglyph?24-2A9E-FE00'></td>

-<td>SIMILAR OR GREATER-THAN  with similar following the slant of the upper leg</td>

-</tr><tr><td><img alt='U+2AAC' src='http://www.unicode.org/cgi-bin/refglyph?24-2AAC'></td>

-<td>2AAC FE00</td>

-<td></td>

-<td><img alt='U+2AAC+U+FE00/' src='http://www.unicode.org/cgi-bin/varglyph?24-2AAC-FE00'></td>

-<td>SMALLER THAN OR EQUAL TO  with slanted equal</td>

-</tr><tr><td><img alt='U+2AAD' src='http://www.unicode.org/cgi-bin/refglyph?24-2AAD'></td>

-<td>2AAD FE00</td>

-<td></td>

-<td><img alt='U+2AAD+U+FE00/' src='http://www.unicode.org/cgi-bin/varglyph?24-2AAD-FE00'></td>

-<td>LARGER THAN OR EQUAL TO  with slanted equal</td>

-</tr><tr><td><img alt='U+2ACB' src='http://www.unicode.org/cgi-bin/refglyph?24-2ACB'></td>

-<td>2ACB FE00</td>

-<td></td>

-<td><img alt='U+2ACB+U+FE00/' src='http://www.unicode.org/cgi-bin/varglyph?24-2ACB-FE00'></td>

-<td>SUBSET OF ABOVE NOT EQUAL TO  with stroke through bottom members</td>

-</tr><tr><td><img alt='U+2ACC' src='http://www.unicode.org/cgi-bin/refglyph?24-2ACC'></td>

-<td>2ACC FE00</td>

-<td></td>

-<td><img alt='U+2ACC+U+FE00/' src='http://www.unicode.org/cgi-bin/varglyph?24-2ACC-FE00'></td>

-<td>SUPERSET OF ABOVE NOT EQUAL TO  with stroke through bottom members</td>

-</tr><tr><td><img alt='U+1820' src='http://www.unicode.org/cgi-bin/refglyph?24-1820'></td>

-<td>1820 180B</td>

-<td>isolate<br>medial<br>final</td>

-<td><img alt='U+1820+U+180B/isolate' src='http://www.unicode.org/cgi-bin/varglyph?24-1820-180B-isol'> <img alt='U+1820+U+180B/medial' src='http://www.unicode.org/cgi-bin/varglyph?24-1820-180B-medi'> <img alt='U+1820+U+180B/final' src='http://www.unicode.org/cgi-bin/varglyph?24-1820-180B-fina'></td>

-<td>MONGOLIAN LETTER A  second form</td>

-</tr><tr><td><img alt='U+1820' src='http://www.unicode.org/cgi-bin/refglyph?24-1820'></td>

-<td>1820 180C</td>

-<td>medial</td>

-<td><img alt='U+1820+U+180C/medial' src='http://www.unicode.org/cgi-bin/varglyph?24-1820-180C-medi'></td>

-<td>MONGOLIAN LETTER A  third form</td>

-</tr><tr><td><img alt='U+1821' src='http://www.unicode.org/cgi-bin/refglyph?24-1821'></td>

-<td>1821 180B</td>

-<td>initial<br>final</td>

-<td><img alt='U+1821+U+180B/initial' src='http://www.unicode.org/cgi-bin/varglyph?24-1821-180B-init'> <img alt='U+1821+U+180B/final' src='http://www.unicode.org/cgi-bin/varglyph?24-1821-180B-fina'></td>

-<td>MONGOLIAN LETTER E  second form</td>

-</tr><tr><td><img alt='U+1822' src='http://www.unicode.org/cgi-bin/refglyph?24-1822'></td>

-<td>1822 180B</td>

-<td>medial</td>

-<td><img alt='U+1822+U+180B/medial' src='http://www.unicode.org/cgi-bin/varglyph?24-1822-180B-medi'></td>

-<td>MONGOLIAN LETTER I  second form</td>

-</tr><tr><td><img alt='U+1823' src='http://www.unicode.org/cgi-bin/refglyph?24-1823'></td>

-<td>1823 180B</td>

-<td>medial<br>final</td>

-<td><img alt='U+1823+U+180B/medial' src='http://www.unicode.org/cgi-bin/varglyph?24-1823-180B-medi'> <img alt='U+1823+U+180B/final' src='http://www.unicode.org/cgi-bin/varglyph?24-1823-180B-fina'></td>

-<td>MONGOLIAN LETTER O  second form</td>

-</tr><tr><td><img alt='U+1824' src='http://www.unicode.org/cgi-bin/refglyph?24-1824'></td>

-<td>1824 180B</td>

-<td>medial</td>

-<td><img alt='U+1824+U+180B/medial' src='http://www.unicode.org/cgi-bin/varglyph?24-1824-180B-medi'></td>

-<td>MONGOLIAN LETTER U  second form</td>

-</tr><tr><td><img alt='U+1825' src='http://www.unicode.org/cgi-bin/refglyph?24-1825'></td>

-<td>1825 180B</td>

-<td>medial<br>final</td>

-<td><img alt='U+1825+U+180B/medial' src='http://www.unicode.org/cgi-bin/varglyph?24-1825-180B-medi'> <img alt='U+1825+U+180B/final' src='http://www.unicode.org/cgi-bin/varglyph?24-1825-180B-fina'></td>

-<td>MONGOLIAN LETTER OE  second form</td>

-</tr><tr><td><img alt='U+1825' src='http://www.unicode.org/cgi-bin/refglyph?24-1825'></td>

-<td>1825 180C</td>

-<td>medial</td>

-<td><img alt='U+1825+U+180C/medial' src='http://www.unicode.org/cgi-bin/varglyph?24-1825-180C-medi'></td>

-<td>MONGOLIAN LETTER OE  third form</td>

-</tr><tr><td><img alt='U+1826' src='http://www.unicode.org/cgi-bin/refglyph?24-1826'></td>

-<td>1826 180B</td>

-<td>isolate<br>medial<br>final</td>

-<td><img alt='U+1826+U+180B/isolate' src='http://www.unicode.org/cgi-bin/varglyph?24-1826-180B-isol'> <img alt='U+1826+U+180B/medial' src='http://www.unicode.org/cgi-bin/varglyph?24-1826-180B-medi'> <img alt='U+1826+U+180B/final' src='http://www.unicode.org/cgi-bin/varglyph?24-1826-180B-fina'></td>

-<td>MONGOLIAN LETTER UE  second form</td>

-</tr><tr><td><img alt='U+1826' src='http://www.unicode.org/cgi-bin/refglyph?24-1826'></td>

-<td>1826 180C</td>

-<td>medial</td>

-<td><img alt='U+1826+U+180C/medial' src='http://www.unicode.org/cgi-bin/varglyph?24-1826-180C-medi'></td>

-<td>MONGOLIAN LETTER UE  third form</td>

-</tr><tr><td><img alt='U+1828' src='http://www.unicode.org/cgi-bin/refglyph?24-1828'></td>

-<td>1828 180B</td>

-<td>initial<br>medial</td>

-<td><img alt='U+1828+U+180B/initial' src='http://www.unicode.org/cgi-bin/varglyph?24-1828-180B-init'> <img alt='U+1828+U+180B/medial' src='http://www.unicode.org/cgi-bin/varglyph?24-1828-180B-medi'></td>

-<td>MONGOLIAN LETTER NA  second form</td>

-</tr><tr><td><img alt='U+1828' src='http://www.unicode.org/cgi-bin/refglyph?24-1828'></td>

-<td>1828 180C</td>

-<td>medial</td>

-<td><img alt='U+1828+U+180C/medial' src='http://www.unicode.org/cgi-bin/varglyph?24-1828-180C-medi'></td>

-<td>MONGOLIAN LETTER NA  third form</td>

-</tr><tr><td><img alt='U+1828' src='http://www.unicode.org/cgi-bin/refglyph?24-1828'></td>

-<td>1828 180D</td>

-<td>medial</td>

-<td><img alt='U+1828+U+180D/medial' src='http://www.unicode.org/cgi-bin/varglyph?24-1828-180D-medi'></td>

-<td>MONGOLIAN LETTER NA  separate form</td>

-</tr><tr><td><img alt='U+182A' src='http://www.unicode.org/cgi-bin/refglyph?24-182A'></td>

-<td>182A 180B</td>

-<td>final</td>

-<td><img alt='U+182A+U+180B/final' src='http://www.unicode.org/cgi-bin/varglyph?24-182A-180B-fina'></td>

-<td>MONGOLIAN LETTER BA  alternative form</td>

-</tr><tr><td><img alt='U+182C' src='http://www.unicode.org/cgi-bin/refglyph?24-182C'></td>

-<td>182C 180B</td>

-<td>initial<br>medial</td>

-<td><img alt='U+182C+U+180B/initial' src='http://www.unicode.org/cgi-bin/varglyph?24-182C-180B-init'> <img alt='U+182C+U+180B/medial' src='http://www.unicode.org/cgi-bin/varglyph?24-182C-180B-medi'></td>

-<td>MONGOLIAN LETTER QA  second form</td>

-</tr><tr><td><img alt='U+182C' src='http://www.unicode.org/cgi-bin/refglyph?24-182C'></td>

-<td>182C 180B</td>

-<td>isolate</td>

-<td><img alt='U+182C+U+180B/isolate' src='http://www.unicode.org/cgi-bin/varglyph?24-182C-180B-isolfem'></td>

-<td>MONGOLIAN LETTER QA  feminine second form</td>

-</tr><tr><td><img alt='U+182C' src='http://www.unicode.org/cgi-bin/refglyph?24-182C'></td>

-<td>182C 180C</td>

-<td>medial</td>

-<td><img alt='U+182C+U+180C/medial' src='http://www.unicode.org/cgi-bin/varglyph?24-182C-180C-medi'></td>

-<td>MONGOLIAN LETTER QA  third form</td>

-</tr><tr><td><img alt='U+182C' src='http://www.unicode.org/cgi-bin/refglyph?24-182C'></td>

-<td>182C 180D</td>

-<td>medial</td>

-<td><img alt='U+182C+U+180D/medial' src='http://www.unicode.org/cgi-bin/varglyph?24-182C-180D-medi'></td>

-<td>MONGOLIAN LETTER QA  fourth form</td>

-</tr><tr><td><img alt='U+182D' src='http://www.unicode.org/cgi-bin/refglyph?24-182D'></td>

-<td>182D 180B</td>

-<td>initial<br>medial</td>

-<td><img alt='U+182D+U+180B/initial' src='http://www.unicode.org/cgi-bin/varglyph?24-182D-180B-init'> <img alt='U+182D+U+180B/medial' src='http://www.unicode.org/cgi-bin/varglyph?24-182D-180B-medi'></td>

-<td>MONGOLIAN LETTER GA  second form</td>

-</tr><tr><td><img alt='U+182D' src='http://www.unicode.org/cgi-bin/refglyph?24-182D'></td>

-<td>182D 180B</td>

-<td>final</td>

-<td><img alt='U+182D+U+180B/final' src='http://www.unicode.org/cgi-bin/varglyph?24-182D-180B-finafem'></td>

-<td>MONGOLIAN LETTER GA  feminine form</td>

-</tr><tr><td><img alt='U+182D' src='http://www.unicode.org/cgi-bin/refglyph?24-182D'></td>

-<td>182D 180C</td>

-<td>medial</td>

-<td><img alt='U+182D+U+180C/medial' src='http://www.unicode.org/cgi-bin/varglyph?24-182D-180C-medi'></td>

-<td>MONGOLIAN LETTER GA  third form</td>

-</tr><tr><td><img alt='U+182D' src='http://www.unicode.org/cgi-bin/refglyph?24-182D'></td>

-<td>182D 180D</td>

-<td>medial</td>

-<td><img alt='U+182D+U+180D/medial' src='http://www.unicode.org/cgi-bin/varglyph?24-182D-180D-medifem'></td>

-<td>MONGOLIAN LETTER GA  feminine form</td>

-</tr><tr><td><img alt='U+1830' src='http://www.unicode.org/cgi-bin/refglyph?24-1830'></td>

-<td>1830 180B</td>

-<td>final</td>

-<td><img alt='U+1830+U+180B/final' src='http://www.unicode.org/cgi-bin/varglyph?24-1830-180B-fina'></td>

-<td>MONGOLIAN LETTER SA  second form</td>

-</tr><tr><td><img alt='U+1830' src='http://www.unicode.org/cgi-bin/refglyph?24-1830'></td>

-<td>1830 180C</td>

-<td>final</td>

-<td><img alt='U+1830+U+180C/final' src='http://www.unicode.org/cgi-bin/varglyph?24-1830-180C-fina'></td>

-<td>MONGOLIAN LETTER SA  third form</td>

-</tr><tr><td><img alt='U+1832' src='http://www.unicode.org/cgi-bin/refglyph?24-1832'></td>

-<td>1832 180B</td>

-<td>medial</td>

-<td><img alt='U+1832+U+180B/medial' src='http://www.unicode.org/cgi-bin/varglyph?24-1832-180B-medi'></td>

-<td>MONGOLIAN LETTER TA  second form</td>

-</tr><tr><td><img alt='U+1833' src='http://www.unicode.org/cgi-bin/refglyph?24-1833'></td>

-<td>1833 180B</td>

-<td>initial<br>medial<br>final</td>

-<td><img alt='U+1833+U+180B/initial' src='http://www.unicode.org/cgi-bin/varglyph?24-1833-180B-init'> <img alt='U+1833+U+180B/medial' src='http://www.unicode.org/cgi-bin/varglyph?24-1833-180B-medi'> <img alt='U+1833+U+180B/final' src='http://www.unicode.org/cgi-bin/varglyph?24-1833-180B-fina'></td>

-<td>MONGOLIAN LETTER DA  second form</td>

-</tr><tr><td><img alt='U+1835' src='http://www.unicode.org/cgi-bin/refglyph?24-1835'></td>

-<td>1835 180B</td>

-<td>medial</td>

-<td><img alt='U+1835+U+180B/medial' src='http://www.unicode.org/cgi-bin/varglyph?24-1835-180B-medi'></td>

-<td>MONGOLIAN LETTER JA  second form</td>

-</tr><tr><td><img alt='U+1836' src='http://www.unicode.org/cgi-bin/refglyph?24-1836'></td>

-<td>1836 180B</td>

-<td>initial<br>medial</td>

-<td><img alt='U+1836+U+180B/initial' src='http://www.unicode.org/cgi-bin/varglyph?24-1836-180B-init'> <img alt='U+1836+U+180B/medial' src='http://www.unicode.org/cgi-bin/varglyph?24-1836-180B-medi'></td>

-<td>MONGOLIAN LETTER YA  second form</td>

-</tr><tr><td><img alt='U+1836' src='http://www.unicode.org/cgi-bin/refglyph?24-1836'></td>

-<td>1836 180C</td>

-<td>medial</td>

-<td><img alt='U+1836+U+180C/medial' src='http://www.unicode.org/cgi-bin/varglyph?24-1836-180C-medi'></td>

-<td>MONGOLIAN LETTER YA  third form</td>

-</tr><tr><td><img alt='U+1838' src='http://www.unicode.org/cgi-bin/refglyph?24-1838'></td>

-<td>1838 180B</td>

-<td>final</td>

-<td><img alt='U+1838+U+180B/final' src='http://www.unicode.org/cgi-bin/varglyph?24-1838-180B-fina'></td>

-<td>MONGOLIAN LETTER WA  second form</td>

-</tr><tr><td><img alt='U+1844' src='http://www.unicode.org/cgi-bin/refglyph?24-1844'></td>

-<td>1844 180B</td>

-<td>medial</td>

-<td><img alt='U+1844+U+180B/medial' src='http://www.unicode.org/cgi-bin/varglyph?24-1844-180B-medi'></td>

-<td>MONGOLIAN LETTER TODO E  second form</td>

-</tr><tr><td><img alt='U+1845' src='http://www.unicode.org/cgi-bin/refglyph?24-1845'></td>

-<td>1845 180B</td>

-<td>medial</td>

-<td><img alt='U+1845+U+180B/medial' src='http://www.unicode.org/cgi-bin/varglyph?24-1845-180B-medi'></td>

-<td>MONGOLIAN LETTER TODO I  second form</td>

-</tr><tr><td><img alt='U+1846' src='http://www.unicode.org/cgi-bin/refglyph?24-1846'></td>

-<td>1846 180B</td>

-<td>medial</td>

-<td><img alt='U+1846+U+180B/medial' src='http://www.unicode.org/cgi-bin/varglyph?24-1846-180B-medi'></td>

-<td>MONGOLIAN LETTER TODO O  second form</td>

-</tr><tr><td><img alt='U+1847' src='http://www.unicode.org/cgi-bin/refglyph?24-1847'></td>

-<td>1847 180B</td>

-<td>isolate<br>medial<br>final</td>

-<td><img alt='U+1847+U+180B/isolate' src='http://www.unicode.org/cgi-bin/varglyph?24-1847-180B-isol'> <img alt='U+1847+U+180B/medial' src='http://www.unicode.org/cgi-bin/varglyph?24-1847-180B-medi'> <img alt='U+1847+U+180B/final' src='http://www.unicode.org/cgi-bin/varglyph?24-1847-180B-fina'></td>

-<td>MONGOLIAN LETTER TODO U  second form</td>

-</tr><tr><td><img alt='U+1847' src='http://www.unicode.org/cgi-bin/refglyph?24-1847'></td>

-<td>1847 180C</td>

-<td>medial</td>

-<td><img alt='U+1847+U+180C/medial' src='http://www.unicode.org/cgi-bin/varglyph?24-1847-180C-medi'></td>

-<td>MONGOLIAN LETTER TODO U  third form</td>

-</tr><tr><td><img alt='U+1848' src='http://www.unicode.org/cgi-bin/refglyph?24-1848'></td>

-<td>1848 180B</td>

-<td>medial</td>

-<td><img alt='U+1848+U+180B/medial' src='http://www.unicode.org/cgi-bin/varglyph?24-1848-180B-medi'></td>

-<td>MONGOLIAN LETTER TODO OE  second form</td>

-</tr><tr><td><img alt='U+1849' src='http://www.unicode.org/cgi-bin/refglyph?24-1849'></td>

-<td>1849 180B</td>

-<td>isolate<br>medial</td>

-<td><img alt='U+1849+U+180B/isolate' src='http://www.unicode.org/cgi-bin/varglyph?24-1849-180B-isol'> <img alt='U+1849+U+180B/medial' src='http://www.unicode.org/cgi-bin/varglyph?24-1849-180B-medi'></td>

-<td>MONGOLIAN LETTER TODO UE  second form</td>

-</tr><tr><td><img alt='U+184D' src='http://www.unicode.org/cgi-bin/refglyph?24-184D'></td>

-<td>184D 180B</td>

-<td>initial<br>medial</td>

-<td><img alt='U+184D+U+180B/initial' src='http://www.unicode.org/cgi-bin/varglyph?24-184D-180B-initfem'> <img alt='U+184D+U+180B/medial' src='http://www.unicode.org/cgi-bin/varglyph?24-184D-180B-medifem'></td>

-<td>MONGOLIAN LETTER TODO QA  feminine form</td>

-</tr><tr><td><img alt='U+184E' src='http://www.unicode.org/cgi-bin/refglyph?24-184E'></td>

-<td>184E 180B</td>

-<td>medial</td>

-<td><img alt='U+184E+U+180B/medial' src='http://www.unicode.org/cgi-bin/varglyph?24-184E-180B-medi'></td>

-<td>MONGOLIAN LETTER TODO GA  second form</td>

-</tr><tr><td><img alt='U+185D' src='http://www.unicode.org/cgi-bin/refglyph?24-185D'></td>

-<td>185D 180B</td>

-<td>medial<br>final</td>

-<td><img alt='U+185D+U+180B/medial' src='http://www.unicode.org/cgi-bin/varglyph?24-185D-180B-medi'> <img alt='U+185D+U+180B/final' src='http://www.unicode.org/cgi-bin/varglyph?24-185D-180B-fina'></td>

-<td>MONGOLIAN LETTER SIBE E  second form</td>

-</tr><tr><td><img alt='U+185E' src='http://www.unicode.org/cgi-bin/refglyph?24-185E'></td>

-<td>185E 180B</td>

-<td>medial<br>final</td>

-<td><img alt='U+185E+U+180B/medial' src='http://www.unicode.org/cgi-bin/varglyph?24-185E-180B-medi'> <img alt='U+185E+U+180B/final' src='http://www.unicode.org/cgi-bin/varglyph?24-185E-180B-fina'></td>

-<td>MONGOLIAN LETTER SIBE I  second form</td>

-</tr><tr><td><img alt='U+185E' src='http://www.unicode.org/cgi-bin/refglyph?24-185E'></td>

-<td>185E 180C</td>

-<td>medial<br>final</td>

-<td><img alt='U+185E+U+180C/medial' src='http://www.unicode.org/cgi-bin/varglyph?24-185E-180C-medi'> <img alt='U+185E+U+180C/final' src='http://www.unicode.org/cgi-bin/varglyph?24-185E-180C-fina'></td>

-<td>MONGOLIAN LETTER SIBE I  third form</td>

-</tr><tr><td><img alt='U+1860' src='http://www.unicode.org/cgi-bin/refglyph?24-1860'></td>

-<td>1860 180B</td>

-<td>medial<br>final</td>

-<td><img alt='U+1860+U+180B/medial' src='http://www.unicode.org/cgi-bin/varglyph?24-1860-180B-medi'> <img alt='U+1860+U+180B/final' src='http://www.unicode.org/cgi-bin/varglyph?24-1860-180B-fina'></td>

-<td>MONGOLIAN LETTER SIBE UE  second form</td>

-</tr><tr><td><img alt='U+1863' src='http://www.unicode.org/cgi-bin/refglyph?24-1863'></td>

-<td>1863 180B</td>

-<td>medial</td>

-<td><img alt='U+1863+U+180B/medial' src='http://www.unicode.org/cgi-bin/varglyph?24-1863-180B-medi'></td>

-<td>MONGOLIAN LETTER SIBE KA  second form</td>

-</tr><tr><td><img alt='U+1868' src='http://www.unicode.org/cgi-bin/refglyph?24-1868'></td>

-<td>1868 180B</td>

-<td>initial<br>medial</td>

-<td><img alt='U+1868+U+180B/initial' src='http://www.unicode.org/cgi-bin/varglyph?24-1868-180B-init'> <img alt='U+1868+U+180B/medial' src='http://www.unicode.org/cgi-bin/varglyph?24-1868-180B-medi'></td>

-<td>MONGOLIAN LETTER SIBE TA  second form</td>

-</tr><tr><td><img alt='U+1868' src='http://www.unicode.org/cgi-bin/refglyph?24-1868'></td>

-<td>1868 180C</td>

-<td>medial</td>

-<td><img alt='U+1868+U+180C/medial' src='http://www.unicode.org/cgi-bin/varglyph?24-1868-180C-medi'></td>

-<td>MONGOLIAN LETTER SIBE TA  third form</td>

-</tr><tr><td><img alt='U+1869' src='http://www.unicode.org/cgi-bin/refglyph?24-1869'></td>

-<td>1869 180B</td>

-<td>initial<br>medial</td>

-<td><img alt='U+1869+U+180B/initial' src='http://www.unicode.org/cgi-bin/varglyph?24-1869-180B-init'> <img alt='U+1869+U+180B/medial' src='http://www.unicode.org/cgi-bin/varglyph?24-1869-180B-medi'></td>

-<td>MONGOLIAN LETTER SIBE DA  second form</td>

-</tr><tr><td><img alt='U+186F' src='http://www.unicode.org/cgi-bin/refglyph?24-186F'></td>

-<td>186F 180B</td>

-<td>initial<br>medial</td>

-<td><img alt='U+186F+U+180B/initial' src='http://www.unicode.org/cgi-bin/varglyph?24-186F-180B-init'> <img alt='U+186F+U+180B/medial' src='http://www.unicode.org/cgi-bin/varglyph?24-186F-180B-medi'></td>

-<td>MONGOLIAN LETTER SIBE ZA  second form</td>

-</tr><tr><td><img alt='U+1873' src='http://www.unicode.org/cgi-bin/refglyph?24-1873'></td>

-<td>1873 180B</td>

-<td>medial<br>final</td>

-<td><img alt='U+1873+U+180B/medial' src='http://www.unicode.org/cgi-bin/varglyph?24-1873-180B-medi'> <img alt='U+1873+U+180B/final' src='http://www.unicode.org/cgi-bin/varglyph?24-1873-180B-fina'></td>

-<td>MONGOLIAN LETTER MANCHU I  second form</td>

-</tr><tr><td><img alt='U+1873' src='http://www.unicode.org/cgi-bin/refglyph?24-1873'></td>

-<td>1873 180C</td>

-<td>medial<br>final</td>

-<td><img alt='U+1873+U+180C/medial' src='http://www.unicode.org/cgi-bin/varglyph?24-1873-180C-medi'> <img alt='U+1873+U+180C/final' src='http://www.unicode.org/cgi-bin/varglyph?24-1873-180C-fina'></td>

-<td>MONGOLIAN LETTER MANCHU I  third form</td>

-</tr><tr><td><img alt='U+1873' src='http://www.unicode.org/cgi-bin/refglyph?24-1873'></td>

-<td>1873 180D</td>

-<td>medial</td>

-<td><img alt='U+1873+U+180D/medial' src='http://www.unicode.org/cgi-bin/varglyph?24-1873-180D-medi'></td>

-<td>MONGOLIAN LETTER MANCHU I  fourth form</td>

-</tr><tr><td><img alt='U+1874' src='http://www.unicode.org/cgi-bin/refglyph?24-1874'></td>

-<td>1874 180B</td>

-<td>medial</td>

-<td><img alt='U+1874+U+180B/medial' src='http://www.unicode.org/cgi-bin/varglyph?24-1874-180B-medi'></td>

-<td>MONGOLIAN LETTER MANCHU KA  second form</td>

-</tr><tr><td><img alt='U+1874' src='http://www.unicode.org/cgi-bin/refglyph?24-1874'></td>

-<td>1874 180B</td>

-<td>final</td>

-<td><img alt='U+1874+U+180B/final' src='http://www.unicode.org/cgi-bin/varglyph?24-1874-180B-finafem'></td>

-<td>MONGOLIAN LETTER MANCHU KA  feminine first final form</td>

-</tr><tr><td><img alt='U+1874' src='http://www.unicode.org/cgi-bin/refglyph?24-1874'></td>

-<td>1874 180C</td>

-<td>medial</td>

-<td><img alt='U+1874+U+180C/medial' src='http://www.unicode.org/cgi-bin/varglyph?24-1874-180C-medifem'></td>

-<td>MONGOLIAN LETTER MANCHU KA  feminine first medial form</td>

-</tr><tr><td><img alt='U+1874' src='http://www.unicode.org/cgi-bin/refglyph?24-1874'></td>

-<td>1874 180C</td>

-<td>final</td>

-<td><img alt='U+1874+U+180C/final' src='http://www.unicode.org/cgi-bin/varglyph?24-1874-180C-finafem'></td>

-<td>MONGOLIAN LETTER MANCHU KA  feminine second final form</td>

-</tr><tr><td><img alt='U+1874' src='http://www.unicode.org/cgi-bin/refglyph?24-1874'></td>

-<td>1874 180D</td>

-<td>medial</td>

-<td><img alt='U+1874+U+180D/medial' src='http://www.unicode.org/cgi-bin/varglyph?24-1874-180D-medifem'></td>

-<td>MONGOLIAN LETTER MANCHU KA  feminine second medial form</td>

-</tr><tr><td><img alt='U+1876' src='http://www.unicode.org/cgi-bin/refglyph?24-1876'></td>

-<td>1876 180B</td>

-<td>initial<br>medial</td>

-<td><img alt='U+1876+U+180B/initial' src='http://www.unicode.org/cgi-bin/varglyph?24-1876-180B-init'> <img alt='U+1876+U+180B/medial' src='http://www.unicode.org/cgi-bin/varglyph?24-1876-180B-medi'></td>

-<td>MONGOLIAN LETTER MANCHU FA  second form</td>

-</tr><tr><td><img alt='U+1880' src='http://www.unicode.org/cgi-bin/refglyph?24-1880'></td>

-<td>1880 180B</td>

-<td></td>

-<td><img alt='U+1880+U+180B/' src='http://www.unicode.org/cgi-bin/varglyph?24-1880-180B'></td>

-<td>MONGOLIAN LETTER ALI GALI ANUSVARA ONE  second form</td>

-</tr><tr><td><img alt='U+1881' src='http://www.unicode.org/cgi-bin/refglyph?24-1881'></td>

-<td>1881 180B</td>

-<td></td>

-<td><img alt='U+1881+U+180B/' src='http://www.unicode.org/cgi-bin/varglyph?24-1881-180B'></td>

-<td>MONGOLIAN LETTER ALI GALI VISARGA ONE  second form</td>

-</tr><tr><td><img alt='U+1887' src='http://www.unicode.org/cgi-bin/refglyph?24-1887'></td>

-<td>1887 180B</td>

-<td>isolate<br>final</td>

-<td><img alt='U+1887+U+180B/isolate' src='http://www.unicode.org/cgi-bin/varglyph?24-1887-180B-isol'> <img alt='U+1887+U+180B/final' src='http://www.unicode.org/cgi-bin/varglyph?24-1887-180B-fina'></td>

-<td>MONGOLIAN LETTER ALI GALI A  second form</td>

-</tr><tr><td><img alt='U+1887' src='http://www.unicode.org/cgi-bin/refglyph?24-1887'></td>

-<td>1887 180C</td>

-<td>final</td>

-<td><img alt='U+1887+U+180C/final' src='http://www.unicode.org/cgi-bin/varglyph?24-1887-180C-fina'></td>

-<td>MONGOLIAN LETTER ALI GALI A  third form</td>

-</tr><tr><td><img alt='U+1887' src='http://www.unicode.org/cgi-bin/refglyph?24-1887'></td>

-<td>1887 180D</td>

-<td>final</td>

-<td><img alt='U+1887+U+180D/final' src='http://www.unicode.org/cgi-bin/varglyph?24-1887-180D-fina'></td>

-<td>MONGOLIAN LETTER ALI GALI A  fourth form</td>

-</tr><tr><td><img alt='U+1888' src='http://www.unicode.org/cgi-bin/refglyph?24-1888'></td>

-<td>1888 180B</td>

-<td>final</td>

-<td><img alt='U+1888+U+180B/final' src='http://www.unicode.org/cgi-bin/varglyph?24-1888-180B-fina'></td>

-<td>MONGOLIAN LETTER ALI GALI I  second form</td>

-</tr><tr><td><img alt='U+188A' src='http://www.unicode.org/cgi-bin/refglyph?24-188A'></td>

-<td>188A 180B</td>

-<td>initial<br>medial</td>

-<td><img alt='U+188A+U+180B/initial' src='http://www.unicode.org/cgi-bin/varglyph?24-188A-180B-init'> <img alt='U+188A+U+180B/medial' src='http://www.unicode.org/cgi-bin/varglyph?24-188A-180B-medi'></td>

-<td>MONGOLIAN LETTER ALI GALI NGA  second form</td>

-</tr></table></p>

-  <hr width="50%">

-  <h2>UCD <a name="Terms of Use">Terms of Use</a></h2>

-  <p>For terms of use, see <i>

-	<a href="http://www.unicode.org/terms_of_use.html">

-	http://www.unicode.org/terms_of_use.html</a>.</i> </p>

-  <hr width="50%">

-  <div align="center">

-    <center>

-    <table cellspacing="0" cellpadding="0" border="0">

-      <tr>

-        <td><a href="http://www.unicode.org/unicode/copyright.html"><img src="http://www.unicode.org/img/hb_notice.gif" border="0" alt="Access to Copyright and terms of use" width="216" height="50"></a></td>

-      </tr>

-    </table>

-    <script language="Javascript" type="text/javascript" src="http://www.unicode.org/webscripts/lastModified.js"></script>

-    </center>

-  </div>

-</blockquote>

-

-</body>

-

-</html>

diff --git a/ucd/UCD.html b/ucd/UCD.html
deleted file mode 100644
index 971927e..0000000
--- a/ucd/UCD.html
+++ /dev/null
@@ -1,2537 +0,0 @@
-<!doctype HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN" "http://www.w3.org/TR/REC-html40/loose.dtd">

-<html>

-

-<head>

-<meta http-equiv="Content-Type" content="text/html; charset=utf-8">

-<meta http-equiv="Content-Language" content="en-us">

-<meta name="GENERATOR" content="Microsoft FrontPage 6.0">

-<meta name="ProgId" content="FrontPage.Editor.Document">

-<title>Unicode Character Database</title>

-<link rel="stylesheet" type="text/css" href="http://www.unicode.org/reports/reports.css">

-<style type="text/css">

-<!--

-th           { background-color: #CCFFCC }

--->

-</style>

-</head>

-

-<body bgcolor="#ffffff">

-

-<table class="header" width="100%">

-  <tr>

-    <td class="icon"><a href="http://www.unicode.org">

-    <img align="middle" alt="[Unicode]" border="0" src="http://www.unicode.org/webscripts/logo60s2.gif" width="34" height="33"></a>&nbsp;&nbsp;<a class="bar" href="http://www.unicode.org/ucd/">Unicode 

-    Character Database</a></td>

-  </tr>

-  <tr>

-    <td class="gray">&nbsp;</td>

-  </tr>

-</table>

-<div class="body">

-  <h1>UNICODE CHARACTER DATABASE</h1>

-  <table class="wide" border="1">

-    <tr>

-      <td valign="TOP" width="144">Revision</td>

-      <td valign="TOP"><span>4.1.0</span></td>

-    </tr>

-    <tr>

-      <td valign="TOP" width="144">Authors</td>

-      <td valign="TOP">Mark Davis and Ken Whistler</td>

-    </tr>

-    <tr>

-      <td valign="TOP" width="144">Date</td>

-      <td valign="TOP"><span>2005-03-</span>30</td>

-    </tr>

-    <tr>

-      <td valign="TOP" width="144">This Version</td>

-      <td valign="TOP"><span><a href="http://www.unicode.org/Public/4.1.0/ucd/UCD.html">

-      http://www.unicode.org/Public/4.1.0/ucd/UCD.html</a></span></td>

-    </tr>

-    <tr>

-      <td valign="TOP" width="144">Previous Version</td>

-      <td valign="TOP"><span><a href="http://www.unicode.org/Public/4.0-Update1/UCD-4.0.1.html">

-      http://www.unicode.org/Public/4.0-Update1/UCD-4.0.1.html</a></span></td>

-    </tr>

-    <tr>

-      <td valign="TOP" width="144">Latest Version</td>

-      <td valign="TOP"><a href="http://www.unicode.org/Public/UNIDATA/UCD.html">

-      http://www.unicode.org/Public/UNIDATA/UCD.html</a></td>

-    </tr>

-  </table>

-  <h3><br>

-  S<i>ummary</i></h3>

-  <blockquote>

-    <p><i>This document describes the format and content of the Unicode Character Database (UCD)</i></p>

-  </blockquote>

-  <h3><i>Status</i></h3>

-  <blockquote>

-    <p><i>This file and the files described herein are part of the Unicode Character Database and 

-    are governed by the terms of use at <a href="http://www.unicode.org/terms_of_use.html">

-    http://www.unicode.org/terms_of_use.html</a>.</i></p>

-    <p><i>The <a href="#References">References</a> provide related information that is useful in 

-    understanding this document.</i></p>

-    <p><i><b>Warning: </b>the information in this file does not completely describe the use and 

-    interpretation of Unicode character properties and behavior. It must be used in conjunction with 

-    the data in the other files in the Unicode Character Database, and relies on the notation and 

-    definitions supplied in <a href="http://www.unicode.org/standard/standard.html">The Unicode 

-    Standard</a>. All chapter references are to Version 4.0.0 of the standard unless otherwise 

-    indicated.</i></p>

-  </blockquote>

-  <h2>Contents</h2>

-  <ul>

-    <li><a href="#Introduction">Introduction</a></li>

-    <li><a href="#Conformance">Conformance</a></li>

-    <li><a href="#UCD_File_Format">UCD File Format</a></li>

-    <li><a href="#UCD_Files">UCD Files</a></li>

-    <li><a href="#Properties">Properties</a></li>

-    <li><a href="#Property_and_Property_Value_Matching">Property and Property Value Matching</a></li>

-    <li><a href="#Property_Values">Property Values</a>

-    <ul>

-      <li><a href="#General_Category_Values">General Category Values</a></li>

-      <li><a href="#Bidi_Class_Values">Bidi Class Values</a></li>

-      <li><a href="#Character_Decomposition_Mappings">Character Decomposition Mapping</a></li>

-      <li><a href="#Canonical_Combining_Class_Values">Canonical Combining Classes</a></li>

-      <li><a href="#Decompositions_and_Normalization">Decompositions and Normalization</a></li>

-      <li><a href="#Case_Mappings">Case Mappings</a></li>

-    </ul>

-    </li>

-    <li><a href="#Unihan_Tags">Unihan Tags</a></li>

-    <li><a href="#Other_UCD_Files">Other UCD Files</a></li>

-    <li><a href="#Derived_Extracted_Properties">Derived Extracted Properties</a></li>

-    <li><a href="#Property_Invariants">Property Invariants</a></li>

-    <li><a href="#References">References</a></li>

-    <li><a href="#Modification_History">Modification History</a></li>

-    <li><a href="#UCD_Terms">UCD Terms of Use</a></li>

-  </ul>

-  <h2><a name="Introduction">Introduction</a></h2>

-  <p>The Unicode Character Database (UCD) is a set of files that define the Unicode character 

-  properties and internal mappings. This document describes the properties and files that are part 

-  of The Unicode Standard, Version <span>4.1.0 [<a href="#U4.1.0">U4.1.0</a>]</span>. For a 

-  description of the changes in this version, see <a href="#Modification_History">Modification 

-  History</a>.</p>

-  <p><span>The file structure for the UCD has changed in version 4.1.0. From this point on, the 

-  successive versions of the UCD are complete versions, so that so that users of the standard do not 

-  need to assemble the correct version of each file from different update directories for previous 

-  versions in order to have a complete set of files for a version. Each version is in a directory of 

-  the following form:</span></p>

-  <p><span><a href="http://www.unicode.org/Public/4.1.0/ucd/">

-  http://www.unicode.org/Public/4.1.0/ucd/</a></span></p>

-  <p><span>Within this directory the structure is the same as in previous versions, with two 

-  changes:</span></p>

-  <ul>

-    <li><span>The file names are unversioned in the final release (although<br>

-    they may be versioned during beta review of the UCD data). This allows people using the files to 

-    not worry about removing the release versions from the individual files, and allows the html 

-    files in the release to link to specific files.</span></li>

-    <li><span>An auxiliary directory has been added. In 4.1.0 it contains properties associated with 

-    UAX #29: Text Boundaries [<a href="#Breaks">Breaks</a>].</span></li>

-  </ul>

-  <h2><a name="Conformance">Conformance</a></h2>

-  <p>For information on the meaning and application of the terms <i>normative, informative, </i>and<i> 

-  provisional</i>, see Section 3.5, &quot;Properties&quot; in the Unicode Standard, Version 4.0.</p>

-  <h2><a name="UCD_File_Format">UCD File Format</a></h2>

-  <p>Files in the UCD use the following format, unless otherwise specified.</p>

-  <ul>

-    <li>Each line of data consists of fields separated by semicolons. The fields are numbered 

-    starting with zero. Code points are expressed as hexadecimal numbers with four to six digits. 

-    They are written without &quot;U+&quot;. Within a sequence of code points, spaces are used for separation. 

-    Leading and trailing spaces within a field are not significant.</li>

-  </ul>

-  <ul>

-    <li>The first field (0) of each line in the Unicode Character Database files represents a code 

-    point or range. The remaining fields (1..n) are properties associated with that code point.</li>

-  </ul>

-  <ul>

-    <li>A range of code points is specified by the form &quot;X..Y&quot;. Each code point from X to Y has the 

-    associated property value. For example (from <a href="Blocks.txt">Blocks.txt</a>):

-    <blockquote>

-      <pre>0000..007F; Basic Latin

-0080..00FF; Latin-1 Supplement</pre>

-    </blockquote>

-    </li>

-    <li>Property values may be omitted if they have a &quot;default&quot; value. For string properties, the 

-    default value is the character itself. For others, the default value is listed in a comment. For 

-    example (from <a href="Scripts.txt">Scripts.txt</a>):

-    <blockquote>

-      <pre>#  All code points not explicitly listed for Script

-#  have the value Common (Zyyy).</pre>

-    </blockquote>

-    </li>

-    <li>Where a file contains values for multiple properties, the second field will contain the name 

-    of the property and the third field will contain the property value. For example (from

-    <a href="DerivedNormalizationProps.txt">DerivedNormalizationProps.txt</a>):

-    <blockquote>

-      <pre>03D2  ; FC_NFKC; 03C5           # L&amp;  GREEK UPSILON WITH HOOK SYMBOL

-03D3  ; FC_NFKC; 03CD           # L&amp;  GREEK UPSILON WITH ACUTE AND HOOK SYMBOL

-</pre>

-    </blockquote>

-    </li>

-    <li>For binary properties, the second field given is the name of the applicable property, with 

-    the implied value of the property being &quot;True&quot;. Only the ranges of characters with the binary 

-    property value of True are listed. For example (from <a href="PropList.txt">PropList.txt</a>):

-    <blockquote>

-      <pre>1680       ; White_Space # Zs      OGHAM SPACE MARK

-180E       ; White_Space # Zs      MONGOLIAN VOWEL SEPARATOR

-2000..200A ; White_Space # Zs [11] EN QUAD..HAIR SPACE</pre>

-    </blockquote>

-    </li>

-    <li>For backwards compatibility, in the file <a href="UnicodeData.txt">UnicodeData.txt</a> a 

-    range is specified not by the form &quot;X..Y&quot;, but by their start and end characters. In such cases, 

-    the names of characters in the range are algorithmically derivable. Surrogate code points and 

-    private use characters have no names. See [<a href="#U4.0">U4.0</a>] for more information.</li>

-    <li>Hash marks (&quot;#&quot;) are used to indicate comments: all characters from the hash mark to the end 

-    of the line are comments, and disregarded when parsing data. In many files, the comments on data 

-    lines use a common format.

-    <blockquote>

-      <pre>00BC..00BE ; numeric # No [3] VULGAR FRACTION ONE QUARTER..VULGAR FRACTION THREE QUARTERS</pre>

-    </blockquote>

-    </li>

-    <li>The first part of the comment is generally the UCD general category. The symbol &quot;L&amp;&quot; 

-    indicates characters of type Lu, Ll, or Lt. This is the same as the LC property in 

-    PropertyValueAliases. The code point ranges are calculated so that they all have the same 

-    General Category (or LC). While this results in more ranges than are strictly necessary, it 

-    makes the contents of the ranges clearer. The second part of the comment (in square brackets), 

-    indicates the number of items in a range, if there is one. The third part is the name of the 

-    character in field zero: if it is a range, then the character names for the ends of the range 

-    are separated by &quot;..&quot;.

-    <ul>

-      <li>However, the comments are purely informational, and may change format or be omitted in the 

-      future. They should not be parsed for content.</li>

-    </ul>

-    </li>

-    <li>In the QuickCheck property table, NF* refers to one of NFD, NFC, NFKC, or NFKD.</li>

-    <li>The Unihan data format differs from the standard format, and is described in 

-	<a href="Unihan.html">Unihan.html</a>. That file also describes which properties are informative, which are normative, and 

-    which are provisional.</li>

-    <li>In some cases, segments of a data file are distinguished by a line starting with an &quot;@&quot; sign.</li>

-    <li>The files use UTF-8, with the exception of NamesList.txt, which is 

-	encoded in Latin-1. Unless otherwise noted, non-ASCII characters only 

-    appear in comments.</li>

-  </ul>

-  <h2><a name="UCD_Files">UCD Files</a></h2>

-  <p>The following table describes the format and meaning of each property data file in the UCD. (An 

-  index by property name, rather than file, is found at <a href="#Properties">Properties</a>.) The 

-  first column lists the files and the properties for which they contain data. The second column 

-  indicates the type of the property: String, Numeric, Enumeration (non-binary), Binary, Catalog, or 

-  Miscellaneous. Catalog properties have enumerated values which are expected 

-  to be regularly extended with successive versions of the Unicode Standard. This distinguishes them 

-  from Enumeration properties, whose enumerated values constitute a logical partition space, for 

-  which new values will generally not be added in successive versions of the standard. An example of 

-  a Catalog property is the Block property. Miscellaneous properties do not fit into the other 

-  property categories, and currently include character names, comments about characters, or the Unicode_Radical_Stroke property (a combination of numeric values). The third column indicates the 

-  status (<b>N</b>ormative vs. <b>I</b>nformative), and the fourth column provides a description of 

-  the data.</p>

-  <p>The files with a small number of properties are listed first, followed by the files with a 

-  large number of properties: <a href="#DerivedCoreProperties.txt">DerivedCoreProperties.txt</a>,

-  <a href="#DerivedNormalizationProps.txt">DerivedNormalizationProps.txt</a>,

-  <a href="#Proplist.txt">Proplist.txt</a>, and <a href="#UnicodeData.txt">UnicodeData.txt</a>. For 

-  UnicodeData, the field numbers are supplied in the description. In a number of cases, fields in a 

-  data file only contribute to a UCD property; for example, the name field in

-  <a href="#UnicodeData.txt">UnicodeData.txt</a> does not provide all the values for the Name 

-  property; <a href="#Jamo.txt">Jamo.txt</a> must be used as well.</p>

-  <p>None of these properties should be used without consulting the relevant discussions in the 

-  Unicode Standard.</p>

-  <p>Where a data file does not explicitly list property values for all code points, the code points 

-  are given default property values. These default property values are documented in the data files, 

-  with the exception of <a href="#UnicodeData.txt">UnicodeData.txt</a>. For that case the default 

-  property values are listed below in parentheses after the property name, with (=) indicating the 

-  code point itself.&nbsp; The default property values are also documented in any corresponding 

-  extracted data file.</p>

-  <table>

-    <tr>

-      <th valign="top" align="LEFT" colspan="4"><a name="ArabicShaping.txt">ArabicShaping.txt</a></th>

-    </tr>

-    <tr>

-      <td><a name="Joining_Type">Joining_Type</a><br>

-      <a name="Joining_Group">Joining_Group</a></td>

-      <td>E</td>

-      <td align="center">N</td>

-      <td>Basic Arabic and Syriac character shaping properties, such as initial, medial and final 

-      shapes. See Section 8.2<br>

-      </td>

-    </tr>

-    <tr>

-      <th valign="top" align="LEFT" colspan="4"><a name="BidiMirroring.txt">BidiMirroring.txt</a>&nbsp;</th>

-    </tr>

-    <tr>

-      <td><a name="Bidi_Mirroring_Glyph">Bidi_Mirroring_Glyph</a></td>

-      <td>S</td>

-      <td align="center">I</td>

-      <td>Properties for substituting characters in an implementation of bidirectional mirroring. 

-      See <span>UAX #9: The Bidirectional Algorithm [<a href="#BIDI">BIDI</a>]</span>. Do not 

-      confuse this with the Bidi_Mirrored property.</td>

-    </tr>

-    <tr>

-      <th valign="top" align="LEFT" colspan="4"><a name="Blocks.txt">Blocks.txt</a>&nbsp;</th>

-    </tr>

-    <tr>

-      <td><a name="Block">Block</a></td>

-      <td>C</td>

-      <td align="center">N</td>

-      <td>List of block names, which are arbitrary names for ranges of code points. See Chapter 16.</td>

-    </tr>

-    <tr>

-      <th valign="top" align="LEFT" colspan="4"><a name="CompositionExclusions.txt">

-      CompositionExclusions.txt</a>&nbsp;</th>

-    </tr>

-    <tr>

-      <td><a name="Composition_Exclusion">Composition Exclusion</a></td>

-      <td>B</td>

-      <td align="center">N</td>

-      <td>Properties for normalization. See <span>UAX #15: Unicode Normalization Forms [<a href="#Norm">Norm</a>]</span>. 

-      Unlike other files, CompositionExclusions simply lists the relevant code points.</td>

-    </tr>

-    <tr>

-      <th valign="top" align="LEFT" colspan="4"><a name="CaseFolding.txt">CaseFolding.txt</a>&nbsp;</th>

-    </tr>

-    <tr>

-      <td><a name="Simple_Case_Folding">Simple_Case_Folding</a><br>

-      <a name="Case_Folding">Case_Folding</a></td>

-      <td>S</td>

-      <td align="center">N</td>

-      <td>Mapping from characters to their case-folded forms. This is an informative file containing 

-      normative derived properties.

-      <p><i>Derived from UnicodeData and SpecialCasing.</i></td>

-    </tr>

-    <tr>

-      <th valign="top" align="LEFT" colspan="4"><a name="DerivedAge.txt">DerivedAge.txt</a>&nbsp;</th>

-    </tr>

-    <tr>

-      <td><a name="Age">Age</a></td>

-      <td>C</td>

-      <td align="center">N/I</td>

-      <td>This file shows when various code points were designated/assigned in successive versions 

-      of the Unicode standard.</td>

-    </tr>

-    <tr>

-      <th valign="top" align="LEFT" colspan="4"><a name="EastAsianWidth.txt">EastAsianWidth.txt</a>&nbsp;</th>

-    </tr>

-    <tr>

-      <td><a name="East_Asian_Width">East_Asian_Width</a></td>

-      <td>E</td>

-      <td align="center">I</td>

-      <td>Properties for determining the choice of wide vs. narrow glyphs in East Asian contexts. 

-      Property values are described in <span>UAX #11: East Asian Width [<a href="#Width">Width</a>]</span>.</td>

-    </tr>

-    <tr>

-      <th valign="top" align="LEFT" colspan="4">

-      <p align="LEFT"><a name="HangulSyllableType.txt">HangulSyllableType.txt</a></th>

-    </tr>

-    <tr>

-      <td valign="top"><a name="Hangul_Syllable_Type">Hangul_Syllable_Type</a><br>

-&nbsp;</td>

-      <td valign="top" align="center">E</td>

-      <td valign="top" align="center">N</td>

-      <td valign="top">The values L, V, T, LV, and LVT used in Chapter 3.</td>

-    </tr>

-    <tr>

-      <th valign="top" align="LEFT" colspan="4">

-      <p align="LEFT"><a name="Jamo.txt">Jamo.txt</a></th>

-    </tr>

-    <tr>

-      <td valign="top"><i>used in Name</i><br>

-&nbsp;</td>

-      <td valign="top" align="center">S</td>

-      <td valign="top" align="center">N</td>

-      <td valign="top">The Hangul Syllable names are derived from the Jamo Short Names, as described 

-      in Chapter 3.</td>

-    </tr>

-    <tr>

-      <th valign="top" align="LEFT" colspan="4"><a name="LineBreak.txt">LineBreak.txt</a>&nbsp;</th>

-    </tr>

-    <tr>

-      <td><a name="Line_Break">Line_Break</a></td>

-      <td>E</td>

-      <td align="center">N/I</td>

-      <td>Properties for line breaking. For more information, see <span>UAX #14: Line Breaking 

-      Properties [<a href="#Line">Line</a>].</span></td>

-    </tr>

-    <tr>

-      <th valign="top" align="LEFT" colspan="4">

-      <p align="LEFT"><a name="NormalizationCorrections.txt">NormalizationCorrections.txt</a>&nbsp;</th>

-    </tr>

-    <tr>

-      <td valign="top"><i>used in Decomposition Mappings</i></td>

-      <td valign="top" align="center">S</td>

-      <td valign="top" align="center">N</td>

-      <td valign="top">NormalizationCorrections lists code point differences for <i>

-      <a href="http://www.unicode.org/versions/corrigendum3.html">Normalization Corrigenda</a>. </i>

-      For more information, see <span>UAX #15: Unicode Normalization Forms [<a href="#Norm">Norm</a>]</span>.</td>

-    </tr>

-    <tr>

-      <th valign="top" align="LEFT" colspan="4"><a name="PropertyAliases.txt">PropertyAliases.txt</a></th>

-    </tr>

-    <tr>

-      <td><i>n/a</i></td>

-      <td>S</td>

-      <td align="center">N/I</td>

-      <td>Property names and abbreviations. These names can be used for XML formats of UCD data, for 

-      regular-expression property tests, and other programmatic textual descriptions of Unicode 

-      data.</td>

-    </tr>

-    <tr>

-      <th valign="top" align="LEFT" colspan="4">PropertyValueAliases.txt</th>

-    </tr>

-    <tr>

-      <td><i>n/a</i></td>

-      <td>S</td>

-      <td align="center">N/I</td>

-      <td>Property value names and abbreviations. These names can be used for XML formats of UCD 

-      data, for regular-expression property tests, and other programmatic textual descriptions of 

-      Unicode data.</td>

-    </tr>

-    <tr>

-      <th valign="top" align="LEFT" colspan="4"><a name="Scripts.txt">Scripts.txt</a>&nbsp;</th>

-    </tr>

-    <tr>

-      <td><a name="Script">Script</a></td>

-      <td>C</td>

-      <td align="center">I</td>

-      <td>Default script values for use in regular expressions. For more information, see <span>UAX 

-      #24: Script Names [<a href="#Scripts">Script</a>]</span>.</td>

-    </tr>

-    <tr>

-      <th valign="top" align="LEFT" colspan="4">SpecialCasing.txt</th>

-    </tr>

-    <tr>

-      <td><a name="Uppercase_Mapping">Uppercase_Mapping<br>

-      </a><a name="Lowercase_Mapping">Lowercase_Mapping</a><br>

-      <a name="Titlecase_Mapping">Titlecase_Mapping</a><br>

-      <a name="Special_Case_Condition">Special_Case_Condition</a></td>

-      <td>S</td>

-      <td align="center">I</td>

-      <td>Data for producing (in combination with Unicode Data) the full case mappings.</td>

-    </tr>

-    <tr>

-      <th valign="top" align="LEFT" colspan="4"><a name="Unihan.txt">Unihan.txt</a>&nbsp;(for more 

-      information, see <span><a href="Unihan.html">Unihan.html</a></span>)</th>

-    </tr>

-    <tr>

-      <td><a name="Numeric_Type_Han">Numeric_Type</a><br>

-      <a name="Numeric_Value_Han">Numeric_Value</a></td>

-      <td>E</td>

-      <td align="center">I</td>

-      <td>The characters tagged with <a href="Unihan.html#kPrimaryNumeric">kPrimaryNumeric</a>,

-      <a href="Unihan.html#kAccountingNumeric">kAccountingNumeric</a>, and

-      <a href="Unihan.html#kOtherNumeric">kOtherNumeric</a> are given the Numeric_Type <i>numeric</i>, 

-      and the values indicated.

-      <p>Most characters have these properties based on values from the UnicodeData.txt data file. 

-      See <a href="#Numeric_Type">Numeric_Type</a>.</td>

-    </tr>

-    <tr>

-      <td><a name="Unicode_Radical_Stroke">Unicode_Radical_Stroke</a>

-      <p>&nbsp;</td>

-      <td>S</td>

-      <td align="center">I</td>

-      <td>The Unicode radical stroke count, based on the tag <a href="Unihan.html#kRSUnicode">

-      kRSUnicode</a>.</td>

-    </tr>

-    <tr>

-      <th valign="top" align="LEFT" colspan="4"><a name="DerivedCoreProperties.txt">

-      DerivedCoreProperties.txt</a>&nbsp;</th>

-    </tr>

-    <tr>

-      <td valign="top" align="left"><a name="Alphabetic">Alphabetic</a></td>

-      <td valign="top">B</td>

-      <td valign="top">I</td>

-      <td valign="top">Characters with the Alphabetic property. For more information, see

-      <a href="http://www.unicode.org/uni2book/ch04.pdf">Chapter 4, Character Properties</a>.

-      <p><i>Generated from: <a href="#Other_Alphabetic">Other_Alphabetic</a> + Lu + Ll + Lt + Lm + 

-      Lo + Nl</i></td>

-    </tr>

-    <tr>

-      <td valign="top" align="left"><a name="Default_Ignorable_Code_Point">

-      Default_Ignorable_Code_Point</a></td>

-      <td valign="top">B</td>

-      <td valign="top">N</td>

-      <td valign="top">For programmatic determination of default-ignorable code points. New 

-      characters that should be ignored in processing (unless explicitly supported) will be assigned 

-      in these ranges, permitting programs to correctly handle the default behavior of such 

-      characters when not otherwise supported. For more information, see <span>UAX #29: Text 

-      Boundaries [<a href="#Breaks">Breaks</a>]</span>.

-      <p><i>Generated from <a href="#Other_Default_Ignorable_Code_Point">

-      Other_Default_Ignorable_Code_Point</a> + Cf + Cc + Cs + Noncharacters - White_Space - 

-      Annotation_characters</i></td>

-    </tr>

-    <tr>

-      <td valign="top" align="left"><a name="Lowercase">Lowercase</a></td>

-      <td valign="top">B</td>

-      <td valign="top">I</td>

-      <td valign="top">Characters with the Lowercase property. For more information, see

-      <a href="http://www.unicode.org/uni2book/ch04.pdf">Chapter 4, Character Properties</a>.

-      <p><i>Generated from: <a href="#Other_Lowercase">Other_Lowercase</a> + Ll</i></td>

-    </tr>

-    <tr>

-      <td valign="top" align="left"><a name="Grapheme_Base">Grapheme_Base</a></td>

-      <td valign="top">B</td>

-      <td valign="top">I</td>

-      <td valign="top">For programmatic determination of grapheme cluster boundaries. For more 

-      information, see <span>UAX #29: Text Boundaries [<a href="#Breaks">Breaks</a>]</span>.

-      <p><i>Generated from: [0..10FFFF] - Cc - Cf - Cs - Co - Cn - Zl - Zp -

-      <a href="#Grapheme_Extend">Grapheme_Extend</a></i></td>

-    </tr>

-    <tr>

-      <td valign="top" align="left"><a name="Grapheme_Extend">Grapheme_Extend</a></td>

-      <td valign="top">B</td>

-      <td valign="top">I</td>

-      <td valign="top">For programmatic determination of grapheme cluster boundaries. For more 

-      information, see <span>UAX #29: Text Boundaries [<a href="#Breaks">Breaks</a>]</span>.

-      <p><i>Generated from: <a href="#Other_Grapheme_Extend">Other_Grapheme_Extend</a> + Me + Mn</i></p>

-      <p><b>Note: </b>depending on an application&#39;s interpretation of Co (private use), they may be 

-      either in Grapheme_Base, or in Grapheme_Extend, or in neither.</td>

-    </tr>

-    <tr>

-      <td valign="top" align="left"><a name="ID_Start">ID_Start</a></td>

-      <td valign="top">B</td>

-      <td valign="top">I</td>

-      <td valign="top" rowspan="2"><span>Used to determine programming identifiers, as as described 

-      in UAX #31: Identifier and Pattern Syntax [<a href="#Pattern">Pattern</a>]</span></td>

-    </tr>

-    <tr>

-      <td valign="top" align="left"><a name="ID_Continue">ID_Continue</a></td>

-      <td valign="top">B</td>

-      <td valign="top">I</td>

-    </tr>

-    <tr>

-      <td valign="top" align="left"><a name="Math">Math</a></td>

-      <td valign="top">B</td>

-      <td valign="top">I</td>

-      <td valign="top">Characters with the Math property. For more information, see

-      <a href="http://www.unicode.org/uni2book/ch04.pdf">Chapter 4, Character Properties</a>.

-      <p><i>Generated from: Sm + <a href="#Other_Math">Other_Math</a></i></td>

-    </tr>

-    <tr>

-      <td valign="top" align="left"><a name="Uppercase">Uppercase</a></td>

-      <td valign="top">B</td>

-      <td valign="top">I</td>

-      <td valign="top">Characters with the Uppercase property. For more information, see

-      <a href="http://www.unicode.org/uni2book/ch04.pdf">Chapter 4, Character Properties</a>.

-      <p><i>Generated from: Lu + <a href="#Other_Lowercase">Other_Uppercase</a></i></td>

-    </tr>

-    <tr>

-      <td valign="top" align="left"><a name="XID_Start">XID_Start</a></td>

-      <td valign="top">B</td>

-      <td valign="top">I</td>

-      <td valign="top" rowspan="2"><span>Used to determine programming identifiers, as as described 

-      in UAX #31: Identifier and Pattern Syntax [<a href="#Pattern">Pattern</a>]</span></td>

-    </tr>

-    <tr>

-      <td valign="top" align="left"><a name="XID_Continue">XID_Continue</a></td>

-      <td valign="top">B</td>

-      <td valign="top">I</td>

-    </tr>

-    <tr>

-      <th valign="top" align="LEFT" colspan="4"><a name="DerivedNormalizationProps.txt">

-      DerivedNormalizationProps.txt</a>&nbsp;</th>

-    </tr>

-    <tr>

-      <td valign="top" align="left"><a name="Full_Composition_Exclusion">Full_Composition_Exclusion</a></td>

-      <td valign="top">B</td>

-      <td valign="top">N</td>

-      <td valign="top">Characters that are excluded from composition: those explicitly in 

-      CompositionExclusions.txt, plus:<br>

-      <i>(3) Singleton Decompositions</i><br>

-      <i>(4) Non-Starter Decompositions</i></td>

-    </tr>

-    <tr>

-      <td valign="top" align="left"><a name="Expands_On_NFC">Expands_On_NFC</a><br>

-      <a name="Expands_On_NFD">Expands_On_NFD</a><br>

-      <a name="Expands_On_NFKC">Expands_On_NFKC</a><br>

-      <a name="Expands_On_NFKD">Expands_On_NFKD</a></td>

-      <td valign="top">B</td>

-      <td valign="top">N</td>

-      <td valign="top">Characters that expand to more than one character in the specified 

-      normalization form.</td>

-    </tr>

-    <tr>

-      <td valign="top" align="left"><a name="FC_NFKC_Closure">FC_NFKC_Closure</a></td>

-      <td valign="top">S</td>

-      <td valign="top">N</td>

-      <td valign="top">Characters that require extra mappings for closure under Case Folding plus 

-      Normalization Form KC. Characters marked with this property have a third field with the 

-      mapping in it. Generated with the following, where Fold is the default fold operation (not 

-      Turkic):

-      <pre>b = NFKC(Fold(a));

-c = NFKC(Fold(b));

-if (c != b) add mapping from a to c</pre>

-      </td>

-    </tr>

-    <tr>

-      <td valign="top" align="left"><a name="NFD_Quick_Check">NFD_Quick_Check</a><br>

-      <a name="NFKD_Quick_Check">NFKD_Quick_Check</a><br>

-      <a name="NFC_Quick_Check">NFC_Quick_Check</a><br>

-      <a name="NFKC_Quick_Check">NFKC_Quick_Check</a></td>

-      <td valign="top">E</td>

-      <td valign="top">N</td>

-      <td valign="top">For property values, see <a href="#Decompositions_and_Normalization">

-      Decompositions and Normalization</a>.</td>

-    </tr>

-    <tr>

-      <th valign="top" align="LEFT" colspan="4"><a name="Proplist.txt">Proplist.txt</a>&nbsp;</th>

-    </tr>

-    <tr>

-      <td valign="top" align="left"><a name="ASCII_Hex_Digit">ASCII_Hex_Digit</a></td>

-      <td valign="top">B</td>

-      <td valign="top">N</td>

-      <td valign="top">ASCII characters commonly used for the representation of hexadecimal numbers.</td>

-    </tr>

-    <tr>

-      <td valign="top" align="left"><a name="Bidi_Control">Bidi_Control</a></td>

-      <td valign="top" align="center">B</td>

-      <td valign="top">N</td>

-      <td valign="top">Those format control characters which have specific functions in the 

-      Bidirectional Algorithm.</td>

-    </tr>

-    <tr>

-      <td valign="top" align="left"><a name="Dash">Dash</a></td>

-      <td valign="top" align="center">B</td>

-      <td valign="top">I</td>

-      <td valign="top">Those punctuation characters explicitly called out as dashes in the Unicode 

-      Standard, plus compatibility equivalents to those. Most of these have the Pd General Category, 

-      but some have the Sm General Category because of their use in mathematics.</td>

-    </tr>

-    <tr>

-      <td valign="top" align="left"><a name="Deprecated">Deprecated</a></td>

-      <td valign="top">B</td>

-      <td valign="top">N</td>

-      <td valign="top">For a machine-readable list of deprecated characters. No characters will ever 

-      be removed from the standard, but the usage of deprecated characters is strongly discouraged.</td>

-    </tr>

-    <tr>

-      <td valign="top" align="left"><a name="Diacritic">Diacritic</a></td>

-      <td valign="top" align="center">B</td>

-      <td valign="top">I</td>

-      <td valign="top">Characters that linguistically modify the meaning of another character to 

-      which they apply. Some diacritics are not combining characters, and some combining characters 

-      are not diacritics.</td>

-    </tr>

-    <tr>

-      <td valign="top" align="left"><a name="Extender">Extender</a></td>

-      <td valign="top">B</td>

-      <td valign="top">I</td>

-      <td valign="top">Characters whose principal function is to extend the value or shape of a 

-      preceding alphabetic character. Typical of these are length and iteration marks.</td>

-    </tr>

-    <tr>

-      <td valign="top" align="left"><a name="Grapheme_Link">Grapheme_Link</a></td>

-      <td valign="top" align="center">B</td>

-      <td valign="top">N</td>

-      <td valign="top">Used in determining default grapheme cluster boundaries. For more 

-      information, see <span>UAX #29: Text Boundaries [<a href="#Breaks">Breaks</a>]</span>.</td>

-    </tr>

-    <tr>

-      <td valign="top" align="left"><a name="Hex_Digit">Hex_Digit</a></td>

-      <td valign="top">B</td>

-      <td valign="top">I</td>

-      <td valign="top">Characters commonly used for the representation of hexadecimal numbers, plus 

-      their compatibility equivalents.</td>

-    </tr>

-    <tr>

-      <td valign="top" align="left"><a name="Hyphen">Hyphen</a> (<a href="#Stabilized">Stabilized</a> 

-      as of 3.2)</td>

-      <td valign="top">B</td>

-      <td valign="top">I</td>

-      <td valign="top">Those dashes used to mark connections between pieces of words, plus the 

-      Katakana middle dot. The Katakana middle dot functions like a hyphen, but is shaped like a dot 

-      rather than a dash.</td>

-    </tr>

-    <tr>

-      <td valign="top" align="left"><a name="Ideographic">Ideographic</a></td>

-      <td valign="top">B</td>

-      <td valign="top">I</td>

-      <td valign="top">Characters considered to be CJKV (Chinese, Japanese, Korean, and Vietnamese) 

-      ideographs.</td>

-    </tr>

-    <tr>

-      <td valign="top" align="left"><a name="IDS_Binary_Operator">IDS_Binary_Operator</a></td>

-      <td valign="top">B</td>

-      <td valign="top">N</td>

-      <td valign="top">Used in Ideographic Description Sequences.</td>

-    </tr>

-    <tr>

-      <td valign="top" align="left"><a name="IDS_Trinary_Operator">IDS_Trinary_Operator</a></td>

-      <td valign="top">B</td>

-      <td valign="top">N</td>

-      <td valign="top">Used in Ideographic Description Sequences.</td>

-    </tr>

-    <tr>

-      <td valign="top" align="left"><a name="Join_Control">Join_Control</a></td>

-      <td valign="top">B</td>

-      <td valign="top">N</td>

-      <td valign="top">Those format control characters which have specific functions for control of 

-      cursive joining and ligation.</td>

-    </tr>

-    <tr>

-      <td valign="top" align="left"><a name="Logical_Order_Exception">Logical_Order_Exception</a></td>

-      <td valign="top">B</td>

-      <td valign="top">N</td>

-      <td valign="top">There are a small number of characters that do not use logical order. These 

-      characters require special handling in most processing.</td>

-    </tr>

-    <tr>

-      <td valign="top" align="left"><a name="Noncharacter_Code_Point">Noncharacter_Code_Point</a></td>

-      <td valign="top">B</td>

-      <td valign="top">N</td>

-      <td valign="top">Code points that are permanently reserved for internal 

-		use.</td>

-    </tr>

-    <tr>

-      <td valign="top" align="left"><a name="Other_Alphabetic">Other_Alphabetic</a></td>

-      <td valign="top" align="center">B</td>

-      <td valign="top">I</td>

-      <td valign="top">Used in deriving the Alphabetic property.</td>

-    </tr>

-    <tr>

-      <td valign="top" align="left"><a name="Other_Default_Ignorable_Code_Point">

-      Other_Default_Ignorable_Code_Point</a></td>

-      <td valign="top">B</td>

-      <td valign="top">N</td>

-      <td valign="top">Used in deriving the Default_Ignorable_Code_Point property.</td>

-    </tr>

-    <tr>

-      <td valign="top" align="left"><a name="Other_Grapheme_Extend">Other_Grapheme_Extend</a></td>

-      <td valign="top" align="center">B</td>

-      <td valign="top">N</td>

-      <td valign="top">Used in deriving&nbsp; the Grapheme_Extend property.</td>

-    </tr>

-    <tr>

-      <td valign="top" align="left"><span><a name="Other_ID_Continue">Other_ID_Continue</a></span></td>

-      <td valign="top"><span>B</span></td>

-      <td valign="top"><span>N</span></td>

-      <td valign="top"><span>Used for backwards compatibility of <a href="#ID_Continue">ID_Continue</a></span></td>

-    </tr>

-    <tr>

-      <td valign="top" align="left"><a name="Other_ID_Start">Other_ID_Start</a></td>

-      <td valign="top">B</td>

-      <td valign="top">N</td>

-      <td valign="top">Used for backwards compatibility of <a href="#ID_Start">ID_Start</a></td>

-    </tr>

-    <tr>

-      <td valign="top" align="left"><a name="Other_Lowercase">Other_Lowercase</a></td>

-      <td valign="top">B</td>

-      <td valign="top">I</td>

-      <td valign="top">Used in deriving the Lowercase property.</td>

-    </tr>

-    <tr>

-      <td valign="top" align="left"><a name="Other_Math">Other_Math</a></td>

-      <td valign="top">B</td>

-      <td valign="top">I</td>

-      <td valign="top">Used in deriving&nbsp; the Math property.</td>

-    </tr>

-    <tr>

-      <td valign="top" align="left"><a name="Other_Uppercase">Other_Uppercase</a></td>

-      <td valign="top">B</td>

-      <td valign="top">I</td>

-      <td valign="top">Used in deriving the Uppercase property.</td>

-    </tr>

-    <tr>

-      <td><span><a name="Pattern_Syntax">Pattern_Syntax</a></span></td>

-      <td valign="top"><span>B</span></td>

-      <td valign="top"><span>N</span></td>

-      <td valign="top" rowspan="2"><span>Used for pattern syntax as described in UAX #31: Identifier 

-      and Pattern Syntax [<a href="#Pattern">Pattern</a>].</span></td>

-    </tr>

-    <tr>

-      <td><span><a name="Pattern_White_Space">Pattern_White_Space</a></span></td>

-      <td valign="top"><span>B</span></td>

-      <td valign="top"><span>N</span></td>

-    </tr>

-    <tr>

-      <td valign="top" align="left"><a name="Quotation_Mark">Quotation_Mark</a></td>

-      <td valign="top">B</td>

-      <td valign="top">I</td>

-      <td valign="top">Those punctuation characters that function as quotation marks.</td>

-    </tr>

-    <tr>

-      <td valign="top" align="left"><a name="Radical">Radical</a></td>

-      <td valign="top">B</td>

-      <td valign="top">N</td>

-      <td valign="top">Used in Ideographic Description Sequences.</td>

-    </tr>

-    <tr>

-      <td valign="top" align="left"><a name="Soft_Dotted">Soft_Dotted</a></td>

-      <td valign="top" align="center">B</td>

-      <td valign="top">N</td>

-      <td valign="top">Characters with a &quot;soft dot&quot;, like <i>i</i> or <i>j.</i> An accent placed on 

-      these characters causes the dot to disappear. An explicit <i>dot above</i> can be added where 

-      required, such as in Lithuanian.</td>

-    </tr>

-    <tr>

-      <td valign="top" align="left"><a name="STerm">STerm</a></td>

-      <td valign="top">B</td>

-      <td valign="top">I</td>

-      <td valign="top">Sentence Terminal. Used in <span>UAX #29: Text Boundaries [<a href="#Breaks">Breaks</a>].</span></td>

-    </tr>

-    <tr>

-      <td valign="top" align="left"><a name="Terminal_Punctuation">Terminal_Punctuation</a></td>

-      <td valign="top" align="center">B</td>

-      <td valign="top">I</td>

-      <td valign="top">Those punctuation characters that generally mark the end of textual units.</td>

-    </tr>

-    <tr>

-      <td valign="top" align="left"><a name="Unified_Ideograph">Unified_Ideograph</a></td>

-      <td valign="top">B</td>

-      <td valign="top">N</td>

-      <td valign="top">Used in Ideographic Description Sequences.</td>

-    </tr>

-    <tr>

-      <td valign="top" align="left"><a name="Variation_Selector">Variation_Selector</a></td>

-      <td valign="top">B</td>

-      <td valign="top">N</td>

-      <td valign="top">Indicates all those characters that qualify as Variation Selectors. For 

-      details on the behavior of these characters, see <a href="StandardizedVariants.html">

-      StandardizedVariants.html</a> and

-      <a href="http://www.unicode.org/versions/Unicode4.0.0/ch15.pdf#G19053">15.6 Variation 

-      Selectors</a></td>

-    </tr>

-    <tr>

-      <td valign="top" align="left"><a name="White_Space">White_Space</a></td>

-      <td valign="top">B</td>

-      <td valign="top">N</td>

-      <td valign="top">Those separator characters and control characters which should be treated by 

-      programming languages as &quot;white space&quot; for the purpose of parsing elements.

-      <p><b>Note:</b> ZERO WIDTH SPACE and ZERO WIDTH NO-BREAK SPACE are not included, since their 

-      functions are restricted to line-break control. Their names are unfortunately misleading in 

-      this respect.</p>

-      <p><b>Note: </b>There are other senses of &quot;whitespace&quot; that encompass a different set of 

-      characters.</td>

-    </tr>

-    <tr>

-      <th valign="top" align="LEFT" colspan="4">

-      <p align="LEFT"><a name="UnicodeData.txt">UnicodeData.txt</a>&nbsp;</th>

-    </tr>

-    <tr>

-      <td valign="top"><a name="Name">Name</a>* (&lt;reserved&gt;)</td>

-      <td valign="top" align="center">M</td>

-      <td valign="top" align="center">N</td>

-      <td valign="top">(1) These names match exactly the names published in the code charts of the 

-      Unicode Standard. The Hangul Syllable names are omitted from this file; see Jamo.txt.</td>

-    </tr>

-    <tr>

-      <td valign="top"><a name="General_Category">General_Category</a> (Cn)</td>

-      <td valign="top" align="center">E</td>

-      <td valign="top" align="center">N</td>

-      <td valign="top">(2) This is a useful breakdown into various character types which can be used 

-      as a default categorization in implementations. For the property values, see

-      <a href="#General_Category_Values">General Category Values</a>.</td>

-    </tr>

-    <tr>

-      <td valign="top"><a name="Canonical_Combining_Class">Canonical_Combining_Class</a> (0)</td>

-      <td valign="top" align="center">N</td>

-      <td valign="top" align="center">N</td>

-      <td valign="top">(3) The classes used for the Canonical Ordering Algorithm in the Unicode 

-      Standard. For the property value names associated with different numeric values, see 

-      DerivedCombiningClass.txt and <a href="#Canonical_Combining_Class_Values">Canonical Combining 

-      Class Values</a>.</td>

-    </tr>

-    <tr>

-      <td valign="top"><a name="Bidi_Class">Bidi_Class</a> (L, AL, R)</td>

-      <td valign="top" align="center">E</td>

-      <td valign="top" align="center">N</td>

-      <td valign="top">(4) These are the categories required by the Bidirectional Behavior Algorithm 

-      in the Unicode Standard. For the property values, see <a href="#Bidi_Class_Values">Bidi Class 

-      Values</a>. For more information, see <span>UAX #9: The Bidirectional Algorithm [<a href="#BIDI">BIDI</a>].</span><p>

-      The default property values depend on the code point<span>, and are given in

-      <a href="extracted/DerivedBidiClass.txt">extracted/DerivedBidiClass.txt</a></span></td>

-    </tr>

-    <tr>

-      <td valign="top"><a name="Decomposition_Type">Decomposition_Type</a> (None)<br>

-      <a name="Decomposition_Mapping">Decomposition_Mapping</a> (=)</td>

-      <td valign="top" align="center">E<br>

-      S</td>

-      <td valign="top" align="center">N</td>

-      <td valign="top">(5) This field contains both values, with the type in angle brackets. The 

-      decomposition mappings match exactly the decomposition mappings published with the character 

-      names in the Unicode Standard. For more information, see

-      <a href="#Character_Decomposition_Mappings">Character Decomposition Mappings</a>.</td>

-    </tr>

-    <tr>

-      <td valign="top" rowspan="3"><a name="Numeric_Type">Numeric_Type</a> (None)<br>

-      <a name="Numeric_Value">Numeric_Value</a> (Not a Number)</td>

-      <td valign="top" align="center">E<br>

-      N</td>

-      <td valign="top" align="center">N</td>

-      <td valign="top">(6) If the character has the <i>decimal digit</i> property, as specified in 

-      Chapter 4 of the Unicode Standard, then the value of that digit is represented with an integer 

-      value in fields 6, 7, and 8.</td>

-    </tr>

-    <tr>

-      <td valign="top" align="center">E<br>

-      N</td>

-      <td valign="top" align="center">N</td>

-      <td valign="top">(7) If the character has the <i>digit</i> property, but is not a decimal 

-      digit, then the value of that digit is represented with an integer value in fields 7 and 8. 

-      This covers digits that need special handling, such as the compatibility superscript digits.</td>

-    </tr>

-    <tr>

-      <td valign="top" align="center">E<br>

-      N</td>

-      <td valign="top" align="center">N</td>

-      <td valign="top">(8) If the character has the <i>numeric</i> property, as specified in Chapter 

-      4 of the Unicode Standard, the value of that character is represented with an positive or 

-      negative integer or rational number in this field. This includes fractions as, e.g., &quot;1/5&quot; for 

-      U+2155 VULGAR FRACTION ONE FIFTH.

-      <p>Some characters have these properties based on values from the Unihan data file. See

-      <a href="#Numeric_Type_Han">Numeric_Type, Han</a>.</td>

-    </tr>

-    <tr>

-      <td valign="top"><a name="Bidi_Mirrored">Bidi_Mirrored</a> (N)</td>

-      <td valign="top" align="center">B</td>

-      <td valign="top" align="center">N</td>

-      <td valign="top">(9) If the character has been identified as a &quot;mirrored&quot; character in 

-      bidirectional text, this field has the value &quot;Y&quot;; otherwise &quot;N&quot;. The list of mirrored 

-      characters is also printed in Chapter 4 of the Unicode Standard. <i>Do not confuse this with 

-      the Bidi_Mirroring_Glyph property.</i></td>

-    </tr>

-    <tr>

-      <td valign="top"><a name="Unicode_1_Name">Unicode_1_Name</a> (&lt;none&gt;)</td>

-      <td valign="top" align="center">M</td>

-      <td valign="top" align="center">I</td>

-      <td valign="top">(10) This is the old name as published in Unicode 1.0. This name is only 

-      provided when it is significantly different from the current name for the character. The value 

-      of field 10 for control characters does not always match the Unicode 1.0 names. Instead, field 

-      10 contains ISO 6429 names for control functions, for printing in the code charts.</td>

-    </tr>

-    <tr>

-      <td valign="top"><a name="ISO_Comment">ISO_Comment</a> (&lt;none&gt;)</td>

-      <td valign="top" align="center">M</td>

-      <td valign="top" align="center">I</td>

-      <td valign="top">(11) This is the ISO 10646 comment field. It appears in parentheses in the 

-      10646 names list, or contains an asterisk to mark an Annex P note.</td>

-    </tr>

-    <tr>

-      <td valign="top"><a name="Simple_Uppercase_Mapping">Simple_Uppercase_Mapping</a> (=)</td>

-      <td valign="top" align="center">S</td>

-      <td valign="top" align="center">N</td>

-      <td valign="top">(12) Simple uppercase mapping (single character result). If a character is 

-      part of an alphabet with case distinctions, and has a simple upper case equivalent, then the 

-      upper case equivalent is in this field. See the explanation below on case distinctions. The 

-      simple mappings have a single character result, where the full mappings may have 

-      multi-character results. For more information, see <a href="#Case_Mappings">Case Mappings</a>.

-      <p><i><b>Note: </b>The simple uppercase may be omitted in the data file if the uppercase is 

-      the same as the code point itself</i>.</td>

-    </tr>

-    <tr>

-      <td valign="top"><a name="Simple_Lowercase_Mapping">Simple_Lowercase_Mapping</a> (=)</td>

-      <td valign="top" align="center">S</td>

-      <td valign="top" align="center">N</td>

-      <td valign="top">(13) Simple lowercase mapping (single character result). Similar to Uppercase 

-      mapping.

-      <p><i><b>Note: </b>The simple lowercase may be omitted in the data file if the lowercase is 

-      the same as the code point itself</i>.</td>

-    </tr>

-    <tr>

-      <td valign="top"><a name="Simple_Titlecase_Mapping">Simple_Titlecase_Mapping</a> (=)</td>

-      <td valign="top" align="center">S</td>

-      <td valign="top" align="center">N</td>

-      <td valign="top">(14) Similar to Uppercase mapping (single character result).

-      <p><i><b>Note: </b>The simple titlecase may be omitted in the data file if the titlecase is 

-      the same as the uppercase.</i></td>

-    </tr>

-  </table>

-  <p><b>Note: </b></p>

-  <blockquote>

-    <p><a name="Stabilized"><b>Stabilized</b></a> properties are no longer actively maintained, nor 

-    are they extended as new characters are added.</p>

-  </blockquote>

-  <h2><a name="Properties">Properties</a></h2>

-  <p>The following table lists the properties in the UCD. They are roughly organized into groups 

-  based on the usage of the property (this grouping is purely for convenience, and has no other 

-  implications). The link on each property leads to description in the file index. The contributory 

-  properties (those of the form Other_XXX) are sets of exceptions used to generate properties in

-  <a href="DerivedCoreProperties.txt">DerivedCoreProperties.txt</a>. They are not intended for 

-  general use, such as in APIs that return property values.</p>

-  <table border="1">

-    <tr>

-      <th width="33%">General</th>

-      <th width="33%">Decomposition and Normalization</th>

-      <th width="33%">CJK</th>

-    </tr>

-    <tr>

-      <td><a href="#Name">Name</a></td>

-      <td><a href="#Canonical_Combining_Class">Canonical_Combining_Class</a></td>

-      <td><a href="#Ideographic">Ideographic</a></td>

-    </tr>

-    <tr>

-      <td><a href="#Block">Block</a></td>

-      <td><a href="#Decomposition_Mapping">Decomposition_Mapping</a></td>

-      <td><a href="#Unified_Ideograph">Unified_Ideograph</a></td>

-    </tr>

-    <tr>

-      <td><a href="#Age">Age</a></td>

-      <td><a href="#Composition_Exclusion">Composition_Exclusion</a></td>

-      <td><a href="#Radical">Radical</a></td>

-    </tr>

-    <tr>

-      <td><a href="#General_Category">General_Category</a></td>

-      <td><a href="#Full_Composition_Exclusion">Full_Composition_Exclusion</a></td>

-      <td><a href="#IDS_Binary_Operator">IDS_Binary_Operator</a></td>

-    </tr>

-    <tr>

-      <td><a href="#Script">Script</a></td>

-      <td><a href="#Decomposition_Type">Decomposition_Type</a></td>

-      <td><a href="#IDS_Trinary_Operator">IDS_Trinary_Operator</a></td>

-    </tr>

-    <tr>

-      <td><a href="#White_Space">White_Space</a></td>

-      <td><a href="#FC_NFKC_Closure">FC_NFKC_Closure</a></td>

-      <td><a href="#Unicode_Radical_Stroke">Unicode_Radical_Stroke</a></td>

-    </tr>

-    <tr>

-      <td><a href="#Alphabetic">Alphabetic</a></td>

-      <td><a href="#NFC_Quick_Check">NFC_Quick_Check</a></td>

-      <th>Misc</th>

-    </tr>

-    <tr>

-      <td><a href="#Hangul_Syllable_Type">Hangul_Syllable_Type</a></td>

-      <td><a href="#NFKC_Quick_Check">NFKC_Quick_Check</a></td>

-      <td><a href="#Math">Math</a></td>

-    </tr>

-    <tr>

-      <td><a href="#Noncharacter_Code_Point">Noncharacter_Code_Point</a></td>

-      <td><a href="#NFD_Quick_Check">NFD_Quick_Check</a></td>

-      <td><a href="#Quotation_Mark">Quotation_Mark</a></td>

-    </tr>

-    <tr>

-      <td><a href="#Default_Ignorable_Code_Point">Default_Ignorable_Code_Point</a></td>

-      <td><a href="#NFKD_Quick_Check">NFKD_Quick_Check</a></td>

-      <td><a href="#Dash">Dash</a></td>

-    </tr>

-    <tr>

-      <td><a href="#Deprecated">Deprecated</a></td>

-      <td><a href="#Expands_On_NFC">Expands_On_NFC</a></td>

-      <td><a href="#Hyphen">Hyphen</a></td>

-    </tr>

-    <tr>

-      <td><a href="#Logical_Order_Exception">Logical_Order_Exception</a></td>

-      <td><a href="#Expands_On_NFD">Expands_On_NFD</a></td>

-      <td><a href="#STerm">STerm</a></td>

-    </tr>

-    <tr>

-      <td><a href="#Variation_Selector">Variation_Selector</a></td>

-      <td><a href="#Expands_On_NFKC">Expands_On_NFKC</a></td>

-      <td><a href="#Terminal_Punctuation">Terminal_Punctuation</a></td>

-    </tr>

-    <tr>

-      <th>Case</th>

-      <td><a href="#Expands_On_NFKD">Expands_On_NFKD</a></td>

-      <td><a href="#Diacritic">Diacritic</a></td>

-    </tr>

-    <tr>

-      <td><a href="#Uppercase">Uppercase</a></td>

-      <th>Shaping and Rendering</th>

-      <td><a href="#Extender">Extender</a></td>

-    </tr>

-    <tr>

-      <td><a href="#Lowercase">Lowercase</a></td>

-      <td><a href="#Join_Control">Join_Control</a></td>

-      <td><a href="#Grapheme_Base">Grapheme_Base</a></td>

-    </tr>

-    <tr>

-      <td><a href="#Lowercase_Mapping">Lowercase_Mapping</a></td>

-      <td><a href="#Joining_Group">Joining_Group</a></td>

-      <td><a href="#Grapheme_Extend">Grapheme_Extend</a></td>

-    </tr>

-    <tr>

-      <td><a href="#Titlecase_Mapping">Titlecase_Mapping</a></td>

-      <td><a href="#Joining_Type">Joining_Type</a></td>

-      <td><a href="#Grapheme_Link">Grapheme_Link</a></td>

-    </tr>

-    <tr>

-      <td><a href="#Uppercase_Mapping">Uppercase_Mapping</a></td>

-      <td><a href="#Line_Break">Line_Break</a></td>

-      <td><a href="#Unicode_1_Name">Unicode_1_Name</a></td>

-    </tr>

-    <tr>

-      <td><a href="#Case_Folding">Case_Folding</a></td>

-      <td><span><a href="#Grapheme_Cluster_Break">Grapheme_Cluster_Break</a></span></td>

-      <td><a href="#ISO_Comment">ISO_Comment</a></td>

-    </tr>

-    <tr>

-      <td><a href="#Simple_Lowercase_Mapping">Simple_Lowercase_Mapping</a></td>

-      <td><span><a href="#Sentence_Break">Sentence_Break</a></span></td>

-      <td>&nbsp;</td>

-    </tr>

-    <tr>

-      <td><a href="#Simple_Titlecase_Mapping">Simple_Titlecase_Mapping</a></td>

-      <td><span><a href="#Word_Break">Word_Break</a></span></td>

-      <td>&nbsp;</td>

-    </tr>

-    <tr>

-      <td><a href="#Simple_Uppercase_Mapping">Simple_Uppercase_Mapping</a></td>

-      <td><a href="#East_Asian_Width">East_Asian_Width</a></td>

-      <td>&nbsp;</td>

-    </tr>

-    <tr>

-      <td><a href="#Simple_Case_Folding">Simple_Case_Folding</a></td>

-      <th>Bidi</th>

-      <td>&nbsp;</td>

-    </tr>

-    <tr>

-      <td><a href="#Special_Case_Condition">Special_Case_Condition</a></td>

-      <td><a href="#Bidi_Control">Bidi_Control</a></td>

-      <th><i>Contributory Properties</i></th>

-    </tr>

-    <tr>

-      <td><a href="#Soft_Dotted">Soft_Dotted</a></td>

-      <td><a href="#Bidi_Mirrored">Bidi_Mirrored</a></td>

-      <td><a href="#Other_Alphabetic">Other_Alphabetic</a></td>

-    </tr>

-    <tr>

-      <th>Identifiers</th>

-      <td><a href="#Bidi_Class">Bidi_Class</a></td>

-      <td><a href="#Other_Default_Ignorable_Code_Point">Other_Default_Ignorable_Code_Point</a></td>

-    </tr>

-    <tr>

-      <td><a href="#ID_Continue">ID_Continue</a></td>

-      <td><a href="#Bidi_Mirroring_Glyph">Bidi_Mirroring_Glyph</a></td>

-      <td><a href="#Other_Grapheme_Extend">Other_Grapheme_Extend</a></td>

-    </tr>

-    <tr>

-      <td><a href="#ID_Start">ID_Start</a></td>

-      <th>Numeric</th>

-      <td><a href="#Other_ID_Continue">Other_ID_Start</a></td>

-    </tr>

-    <tr>

-      <td><a href="#XID_Continue">XID_Continue</a></td>

-      <td><a href="#Numeric_Value">Numeric_Value</a></td>

-      <td><span><a href="#Other_ID_Continue">Other_ID_Continue</a></span></td>

-    </tr>

-    <tr>

-      <td><a href="#XID_Start">XID_Start</a></td>

-      <td><a href="#Numeric_Type">Numeric_Type</a></td>

-      <td><a href="#Other_Lowercase">Other_Lowercase</a></td>

-    </tr>

-    <tr>

-      <td><span><a href="#Pattern_Syntax">Pattern_Syntax</a></span></td>

-      <td><a href="#Hex_Digit">Hex_Digit</a></td>

-      <td><a href="#Other_Math">Other_Math</a></td>

-    </tr>

-    <tr>

-      <td><span><a href="#Pattern_White_Space">Pattern_White_Space</a></span></td>

-      <td><a href="#ASCII_Hex_Digit">ASCII_Hex_Digit</a></td>

-      <td><a href="#Other_Uppercase">Other_Uppercase</a></td>

-    </tr>

-  </table>

-  <p>&nbsp;</p>

-  <h2><a name="Property_and_Property_Value_Matching">Property and Property Value Matching</a></h2>

-  <p>Properties and property values may have multiple aliases, such as abbreviated names and longer, 

-  more descriptive names. For example, one can write either Line_Break or LB for the Line Break 

-  property, and either OP or Open_Punctuation for one of its values. When matching property names 

-  and values, it is strongly recommended that all aliases in the UCD be recognized, and that loose 

-  matching should be applied to all property names and property values according to the following:</p>

-  <p><b>Numeric Properties</b></p>

-  <p>For all numeric properties, and properties such as Unicode_Radical_Stroke that are combinations 

-  of numeric values, use the following loose matching rule:</p>

-  <p><i>LM1. Apply numeric equivalences</i></p>

-  <ul>

-    <li>&quot;01.00&quot; is equivalent to &quot;1&quot;.</li>

-    <li>&quot;1.666667&quot; in the UCD is a repeating fraction, and equivalent to 10/6.</li>

-  </ul>

-  <p><b>Character Names</b></p>

-  <p><i>LM2. Ignore case, whitespace, underscore (&#39;_&#39;), and all medial hyphens except the hyphen in 

-  U+1180.</i></p>

-  <ul>

-    <li>&quot;zero-width space&quot; is equivalent to &quot;zero width space&quot; or &quot;zerowidthspace&quot;</li>

-    <li>&quot;character -a&quot; is not equivalent to &quot;character a&quot;</li>

-  </ul>

-  <p><b>Others</b></p>

-  <p>For all property names, property value names, and for property values for Enumerated, Binary, 

-  or Catalog properties, use the following loose matching rule:</p>

-  <p><i>LM3. Ignore case, whitespace, underscore (&#39;_&#39;), and hyphens.</i></p>

-  <ul>

-    <li>&quot;linebreak&quot; is equivalent to &quot;Line_Break&quot; or &quot;Line-break&quot;</li>

-  </ul>

-  <p>Otherwise loose matching should not be done for the property values of String properties, as 

-  case distinctions or other distinctions in those values may be significant.</p>

-  <h2><a name="Property_Values">Property Values</a></h2>

-  <p>The following gives a summary of property values for certain properties. Other property values 

-  are documented in other locations; for example, the line breaking property values are documented 

-  in <span>UAX #14: Line Breaking Properties [<a href="#Line">Line</a>]</span>.</p>

-  <h3><a name="General_Category_Values">General Category Values</a></h3>

-  <p>The values in this field are abbreviations for the following values. For more information, see 

-  the Unicode Standard.</p>

-  <blockquote>

-    <p><b>Note:</b> The Unicode Standard does not assign information to control characters (except 

-    for certain cases). Implementations will generally also assign categories to certain control 

-    characters, notably CR and LF, according to platform conventions. See Section 5.8 &quot;Newline 

-    Guidelines&quot; for more information.</p>

-  </blockquote>

-  <table>

-    <tr>

-      <th>

-      <p align="LEFT">Abbr.</th>

-      <th>

-      <p align="LEFT">Description</th>

-    </tr>

-    <tr>

-      <td align="CENTER">Lu</td>

-      <td>Letter, Uppercase</td>

-    </tr>

-    <tr>

-      <td align="CENTER">Ll</td>

-      <td>Letter, Lowercase</td>

-    </tr>

-    <tr>

-      <td align="CENTER">Lt</td>

-      <td>Letter, Titlecase</td>

-    </tr>

-    <tr>

-      <td align="CENTER">Lm</td>

-      <td>Letter, Modifier</td>

-    </tr>

-    <tr>

-      <td align="CENTER">Lo</td>

-      <td>Letter, Other</td>

-    </tr>

-    <tr>

-      <td align="CENTER">Mn</td>

-      <td>Mark, Nonspacing</td>

-    </tr>

-    <tr>

-      <td align="CENTER">Mc</td>

-      <td>Mark, Spacing Combining</td>

-    </tr>

-    <tr>

-      <td align="CENTER">Me</td>

-      <td>Mark, Enclosing</td>

-    </tr>

-    <tr>

-      <td align="CENTER">Nd</td>

-      <td>Number, Decimal Digit</td>

-    </tr>

-    <tr>

-      <td align="CENTER">Nl</td>

-      <td>Number, Letter</td>

-    </tr>

-    <tr>

-      <td align="CENTER">No</td>

-      <td>Number, Other</td>

-    </tr>

-    <tr>

-      <td align="CENTER">Pc</td>

-      <td>Punctuation, Connector</td>

-    </tr>

-    <tr>

-      <td align="CENTER">Pd</td>

-      <td>Punctuation, Dash</td>

-    </tr>

-    <tr>

-      <td align="CENTER">Ps</td>

-      <td>Punctuation, Open</td>

-    </tr>

-    <tr>

-      <td align="CENTER">Pe</td>

-      <td>Punctuation, Close</td>

-    </tr>

-    <tr>

-      <td align="CENTER">Pi</td>

-      <td>Punctuation, Initial quote (may behave like Ps or Pe depending on usage)</td>

-    </tr>

-    <tr>

-      <td align="CENTER">Pf</td>

-      <td>Punctuation, Final quote (may behave like Ps or Pe depending on usage)</td>

-    </tr>

-    <tr>

-      <td align="CENTER">Po</td>

-      <td>Punctuation, Other</td>

-    </tr>

-    <tr>

-      <td align="CENTER">Sm</td>

-      <td>Symbol, Math</td>

-    </tr>

-    <tr>

-      <td align="CENTER">Sc</td>

-      <td>Symbol, Currency</td>

-    </tr>

-    <tr>

-      <td align="CENTER">Sk</td>

-      <td>Symbol, Modifier</td>

-    </tr>

-    <tr>

-      <td align="CENTER">So</td>

-      <td>Symbol, Other</td>

-    </tr>

-    <tr>

-      <td align="CENTER">Zs</td>

-      <td>Separator, Space</td>

-    </tr>

-    <tr>

-      <td align="CENTER">Zl</td>

-      <td>Separator, Line</td>

-    </tr>

-    <tr>

-      <td align="CENTER">Zp</td>

-      <td>Separator, Paragraph</td>

-    </tr>

-    <tr>

-      <td align="CENTER">Cc</td>

-      <td>Other, Control</td>

-    </tr>

-    <tr>

-      <td align="CENTER">Cf</td>

-      <td>Other, Format</td>

-    </tr>

-    <tr>

-      <td align="CENTER">Cs</td>

-      <td>Other, Surrogate</td>

-    </tr>

-    <tr>

-      <td align="CENTER">Co</td>

-      <td>Other, Private Use</td>

-    </tr>

-    <tr>

-      <td align="CENTER">Cn</td>

-      <td>Other, Not Assigned (no characters in the file have this property)</td>

-    </tr>

-  </table>

-  <blockquote>

-    <p><b>Note:</b> The term &quot;L&amp;&quot; is used to stand for Uppercase, Lowercase or Titlecase letters 

-    (Lu, Ll, or Lt) in comments. The LC value in <a href="PropertyValueAliases.txt">

-    PropertyValueAliases.txt</a> also stands for Uppercase, Lowercase or Titlecase letters.</p>

-  </blockquote>

-  <h3><a name="Bidi_Class_Values">Bidi Class Values</a></h3>

-  <p>Please refer to <span>UAX #9: The Bidirectional Algorithm [<a href="#BIDI">BIDI</a>] </span>for 

-  an explanation of the algorithm for Bidirectional Behavior and an explanation of the significance 

-  of these categories.</p>

-  <table>

-    <tr>

-      <th valign="TOP" align="LEFT">

-      <p align="LEFT">Type</th>

-      <th valign="TOP" align="LEFT">

-      <p align="LEFT">Description</th>

-    </tr>

-    <tr>

-      <td valign="TOP">L</td>

-      <td valign="TOP">Left-to-Right</td>

-    </tr>

-    <tr>

-      <td valign="TOP">LRE</td>

-      <td valign="TOP">Left-to-Right Embedding</td>

-    </tr>

-    <tr>

-      <td valign="TOP">LRO</td>

-      <td valign="TOP">Left-to-Right Override</td>

-    </tr>

-    <tr>

-      <td valign="TOP">R</td>

-      <td valign="TOP">Right-to-Left</td>

-    </tr>

-    <tr>

-      <td valign="TOP">AL</td>

-      <td valign="TOP">Right-to-Left Arabic</td>

-    </tr>

-    <tr>

-      <td valign="TOP">RLE</td>

-      <td valign="TOP">Right-to-Left Embedding</td>

-    </tr>

-    <tr>

-      <td valign="TOP">RLO</td>

-      <td valign="TOP">Right-to-Left Override</td>

-    </tr>

-    <tr>

-      <td valign="TOP">PDF</td>

-      <td valign="TOP">Pop Directional Format</td>

-    </tr>

-    <tr>

-      <td valign="TOP">EN</td>

-      <td valign="TOP">European Number</td>

-    </tr>

-    <tr>

-      <td valign="TOP">ES</td>

-      <td valign="TOP">European Number Separator</td>

-    </tr>

-    <tr>

-      <td valign="TOP">ET</td>

-      <td valign="TOP">European Number Terminator</td>

-    </tr>

-    <tr>

-      <td valign="TOP">AN</td>

-      <td valign="TOP">Arabic Number</td>

-    </tr>

-    <tr>

-      <td valign="TOP">CS</td>

-      <td valign="TOP">Common Number Separator</td>

-    </tr>

-    <tr>

-      <td valign="TOP">NSM</td>

-      <td valign="TOP">Non-Spacing Mark</td>

-    </tr>

-    <tr>

-      <td valign="TOP">BN</td>

-      <td valign="TOP">Boundary Neutral</td>

-    </tr>

-    <tr>

-      <td valign="TOP">B</td>

-      <td valign="TOP">Paragraph Separator</td>

-    </tr>

-    <tr>

-      <td valign="TOP">S</td>

-      <td valign="TOP">Segment Separator</td>

-    </tr>

-    <tr>

-      <td valign="TOP">WS</td>

-      <td valign="TOP">Whitespace</td>

-    </tr>

-    <tr>

-      <td valign="TOP">ON</td>

-      <td valign="TOP">Other Neutrals</td>

-    </tr>

-  </table>

-  <p>&nbsp;</p>

-  <h3><a name="Character_Decomposition_Mappings">Character Decomposition Mapping</a></h3>

-  <p>The tags supplied with certain decomposition mappings generally indicate formatting 

-  information. Where no such tag is given, the mapping is canonical. Conversely, the presence of a 

-  formatting tag also indicates that the mapping is a compatibility mapping and not a canonical 

-  mapping. In the absence of other formatting information in a compatibility mapping, the tag is 

-  used to distinguish it from canonical mappings.</p>

-  <p>In some instances a canonical mapping or a compatibility mapping may consist of a single 

-  character. For a canonical mapping, this indicates that the character is a canonical equivalent of 

-  another single character. For a compatibility mapping, this indicates that the character is a 

-  compatibility equivalent of another single character. The compatibility formatting tags used are:</p>

-  <table>

-    <tr>

-      <th>Tag</th>

-      <th>

-      <p align="LEFT">Description</th>

-    </tr>

-    <tr>

-      <td align="CENTER">&lt;font&gt;&nbsp;&nbsp;</td>

-      <td>A font variant (e.g. a blackletter form).</td>

-    </tr>

-    <tr>

-      <td align="CENTER">&lt;noBreak&gt;&nbsp;&nbsp;</td>

-      <td>A no-break version of a space or hyphen.</td>

-    </tr>

-    <tr>

-      <td align="CENTER">&lt;initial&gt;&nbsp;&nbsp;</td>

-      <td>An initial presentation form (Arabic).</td>

-    </tr>

-    <tr>

-      <td align="CENTER">&lt;medial&gt;&nbsp;&nbsp;</td>

-      <td>A medial presentation form (Arabic).</td>

-    </tr>

-    <tr>

-      <td align="CENTER">&lt;final&gt;&nbsp;&nbsp;</td>

-      <td>A final presentation form (Arabic).</td>

-    </tr>

-    <tr>

-      <td align="CENTER">&lt;isolated&gt;&nbsp;&nbsp;</td>

-      <td>An isolated presentation form (Arabic).</td>

-    </tr>

-    <tr>

-      <td align="CENTER">&lt;circle&gt;&nbsp;&nbsp;</td>

-      <td>An encircled form.</td>

-    </tr>

-    <tr>

-      <td align="CENTER">&lt;super&gt;&nbsp;&nbsp;</td>

-      <td>A superscript form.</td>

-    </tr>

-    <tr>

-      <td align="CENTER">&lt;sub&gt;&nbsp;&nbsp;</td>

-      <td>A subscript form.</td>

-    </tr>

-    <tr>

-      <td align="CENTER">&lt;vertical&gt;&nbsp;&nbsp;</td>

-      <td>A vertical layout presentation form.</td>

-    </tr>

-    <tr>

-      <td align="CENTER">&lt;wide&gt;&nbsp;&nbsp;</td>

-      <td>A wide (or zenkaku) compatibility character.</td>

-    </tr>

-    <tr>

-      <td align="CENTER">&lt;narrow&gt;&nbsp;&nbsp;</td>

-      <td>A narrow (or hankaku) compatibility character.</td>

-    </tr>

-    <tr>

-      <td align="CENTER">&lt;small&gt;&nbsp;&nbsp;</td>

-      <td>A small variant form (CNS compatibility).</td>

-    </tr>

-    <tr>

-      <td align="CENTER">&lt;square&gt;&nbsp;&nbsp;</td>

-      <td>A CJK squared font variant.</td>

-    </tr>

-    <tr>

-      <td align="CENTER">&lt;fraction&gt;&nbsp;&nbsp;</td>

-      <td>A vulgar fraction form.</td>

-    </tr>

-    <tr>

-      <td align="CENTER">&lt;compat&gt;&nbsp;&nbsp;</td>

-      <td>Otherwise unspecified compatibility character.</td>

-    </tr>

-  </table>

-  <p><b>Reminder: </b>There is a difference between decomposition and decomposition mapping. The 

-  decomposition mappings are defined in the UnicodeData, while the decomposition (also termed &quot;full 

-  decomposition&quot;) is defined in Chapter 3 to use those mappings <i>recursively.</i></p>

-  <ul>

-    <li>The canonical decomposition is formed by recursively applying the canonical mappings, then 

-    applying the canonical reordering algorithm.</li>

-    <li>The compatibility decomposition is formed by recursively applying the canonical <em>and</em> 

-    compatibility mappings, then applying the canonical reordering algorithm.</li>

-  </ul>

-  <h3><a name="Canonical_Combining_Class_Values">Canonical Combining Class Values</a></h3>

-  <table>

-    <tr>

-      <th>

-      <p align="LEFT">Value</th>

-      <th>

-      <p align="LEFT">Description</th>

-    </tr>

-    <tr>

-      <td align="RIGHT">0:</td>

-      <td>Spacing, split, enclosing, reordrant, and Tibetan subjoined</td>

-    </tr>

-    <tr>

-      <td align="RIGHT">1:</td>

-      <td>Overlays and interior</td>

-    </tr>

-    <tr>

-      <td align="RIGHT">7:</td>

-      <td>Nuktas</td>

-    </tr>

-    <tr>

-      <td align="RIGHT">8:</td>

-      <td>Hiragana/Katakana voicing marks</td>

-    </tr>

-    <tr>

-      <td align="RIGHT">9:</td>

-      <td>Viramas</td>

-    </tr>

-    <tr>

-      <td align="RIGHT">10:</td>

-      <td>Start of fixed position classes</td>

-    </tr>

-    <tr>

-      <td align="RIGHT">199:</td>

-      <td>End of fixed position classes</td>

-    </tr>

-    <tr>

-      <td align="RIGHT">200:</td>

-      <td>Below left attached</td>

-    </tr>

-    <tr>

-      <td align="RIGHT">202:</td>

-      <td>Below attached</td>

-    </tr>

-    <tr>

-      <td align="RIGHT">204:</td>

-      <td>Below right attached</td>

-    </tr>

-    <tr>

-      <td align="RIGHT">208:</td>

-      <td>Left attached (reordrant around single base character)</td>

-    </tr>

-    <tr>

-      <td align="RIGHT">210:</td>

-      <td>Right attached</td>

-    </tr>

-    <tr>

-      <td align="RIGHT">212:</td>

-      <td>Above left attached</td>

-    </tr>

-    <tr>

-      <td align="RIGHT">214:</td>

-      <td>Above attached</td>

-    </tr>

-    <tr>

-      <td align="RIGHT">216:</td>

-      <td>Above right attached</td>

-    </tr>

-    <tr>

-      <td align="RIGHT">218:</td>

-      <td>Below left</td>

-    </tr>

-    <tr>

-      <td align="RIGHT">220:</td>

-      <td>Below</td>

-    </tr>

-    <tr>

-      <td align="RIGHT">222:</td>

-      <td>Below right</td>

-    </tr>

-    <tr>

-      <td align="RIGHT">224:</td>

-      <td>Left (reordrant around single base character)</td>

-    </tr>

-    <tr>

-      <td align="RIGHT">226:</td>

-      <td>Right</td>

-    </tr>

-    <tr>

-      <td align="RIGHT">228:</td>

-      <td>Above left</td>

-    </tr>

-    <tr>

-      <td align="RIGHT">230:</td>

-      <td>Above</td>

-    </tr>

-    <tr>

-      <td align="RIGHT">232:</td>

-      <td>Above right</td>

-    </tr>

-    <tr>

-      <td align="RIGHT">233:</td>

-      <td>Double below</td>

-    </tr>

-    <tr>

-      <td align="RIGHT">234:</td>

-      <td>Double above</td>

-    </tr>

-    <tr>

-      <td align="RIGHT">240:</td>

-      <td>Below (iota subscript)</td>

-    </tr>

-  </table>

-  <blockquote>

-    <p><strong>Note: </strong>some of the combining classes in this list do not currently have 

-    members but are specified here for completeness.</p>

-  </blockquote>

-  <h3><a name="Decompositions_and_Normalization">Decompositions and Normalization</a></h3>

-  <p>Decomposition is specified in Chapter 3. <span>UAX #15: Unicode Normalization Forms [<a href="#Norm">Norm</a>]

-  </span>specifies the interaction between decomposition and normalization. That report specifies 

-  how the decompositions defined in <a href="UnicodeData.txt">UnicodeData.txt</a> are used to derive 

-  normalized forms of Unicode text.</p>

-  <p>Note that as of the 2.1.9 update of the Unicode Character Database, the decompositions in the

-  <a href="UnicodeData.txt">UnicodeData.txt</a> file can be used to <i>recursively</i> derive the 

-  full decomposition in canonical order, without the need to separately apply canonical reordering. 

-  However, canonical reordering of combining character sequences <b><i>must</i></b> still be applied 

-  in decomposition when normalizing source text which contains any combining marks.</p>

-  <p>The QuickCheck property values are as follows:</p>

-  <div style="spacing:20">

-    <table>

-      <tr>

-        <th>Value</th>

-        <th>Property</th>

-        <th>Description</th>

-      </tr>

-      <tr>

-        <td>No</td>

-        <td>NF*_QC</td>

-        <td>Characters that cannot ever occur in the respective normalization form. See

-        <a href="#Decompositions_and_Normalization">Decompositions and Normalization</a>.</td>

-      </tr>

-      <tr>

-        <td>Maybe</td>

-        <td>NFC_QC, NFKC_QC</td>

-        <td>Characters that may occur in in the respective normalization, depending on the context. 

-        See <a href="#Decompositions_and_Normalization">Decompositions and Normalization</a>.</td>

-      </tr>

-      <tr>

-        <td>Yes</td>

-        <td>n/a</td>

-        <td>All other characters. This is the default value, and is not explicitly listed in the 

-        file.</td>

-      </tr>

-    </table>

-  </div>

-  <p><br>

-  For more information, see Annex&nbsp;8 in <span>UAX #15: Unicode Normalization Forms [<a href="#Norm">Norm</a>].</span></p>

-  <h3><a name="Case_Mappings">Case Mappings</a></h3>

-  <p>There are a number of complications to case mappings that occur once the repertoire of 

-  characters is expanded beyond ASCII. For more information, see Chapter 3 in Unicode 4.0.</p>

-  <p>For compatibility with existing parsers, <a href="UnicodeData.txt">UnicodeData.txt</a> only 

-  contains case mappings for characters where they are one-to-one mappings; it also omits 

-  information about context-sensitive case mappings. Information about these special cases can be 

-  found in a separate data file, <a href="SpecialCasing.txt">SpecialCasing.txt</a>.</p>

-  <h2><a name="Unihan_Tags">Unihan Tags</a></h2>

-  <p>The <a href="#Unihan.txt">Unihan.txt</a> file is described in <a href="Unihan.html">Unihan.html</a>.</p>

-  <h2><a name="Other_UCD_Files">Other UCD Files</a></h2>

-  <p>The following files in the Unicode Character Database are not used directly for Unicode 

-  properties. &nbsp;For more information about these files, see the referenced technical report(s), 

-  files, or section of Unicode Standard.</p>

-  <table>

-    <tr>

-      <th>&quot;.txt&quot; File</th>

-      <th>Description</th>

-      <th align="center">N/I</th>

-      <th>Summary</th>

-    </tr>

-    <tr>

-      <td>Index</td>

-      <td>Chapter 16</td>

-      <td align="center">I</td>

-      <td>Index to Unicode characters, as printed in the Unicode Standard.</td>

-    </tr>

-    <tr>

-      <td>NamesList</td>

-      <td>Chapter 16</td>

-      <td align="center">I</td>

-      <td>This file duplicates some of the material in the UnicodeData file, and adds annotations 

-      used in the character charts.</td>

-    </tr>

-    <tr>

-      <td>NormalizationTest</td>

-      <td>UAX #15</td>

-      <td align="center">N</td>

-      <td>Test file for conformance to Unicode Normalization Forms.<p>See <span>UAX #15: Unicode 

-      Normalization Forms [<a href="#Norm">Norm</a>]</span></td>

-    </tr>

-    <tr>

-      <td>StandardizedVariants</td>

-      <td>Chapter 15</td>

-      <td align="center">N</td>

-      <td>Lists all the standardized variant sequences that have been defined, plus a description of 

-      the desired appearance. <a href="StandardizedVariants.html">StandardizedVariants.html </a>

-      contains this information, plus a sample glyph showing the desired features.</td>

-    </tr>

-  </table>

-  <h2><br>

-  <a name="Derived_Extracted_Properties">Derived Extracted Properties</a></h2>

-  <p>The following files contain other properties of the UCD that are simply separated out, and 

-  listed in range format. These files are provided purely as a reformatting of existing data, with a 

-  certain exceptions listed below. They are all contained in a subdirectory called <i>extracted.</i></p>

-  <table>

-    <tr>

-      <th>Files</th>

-      <th valign="top">N/I</th>

-      <th>Definition and Generation</th>

-    </tr>

-    <tr>

-      <td valign="top">DerivedBidiClass*</td>

-      <td align="center" valign="top">N</td>

-      <td>From UnicodeData.txt, field 4</td>

-    </tr>

-    <tr>

-      <td valign="top">DerivedBinaryProperties*</td>

-      <td align="center" valign="top">N</td>

-      <td>From UnicodeData.txt, field 9. See <a href="#Bidi_Note">Bidi Note</a>.</td>

-    </tr>

-    <tr>

-      <td valign="top">DerivedCombiningClass*</td>

-      <td align="center" valign="top">N</td>

-      <td>From UnicodeData.txt, field 3</td>

-    </tr>

-    <tr>

-      <td valign="top">DerivedDecompositionType*</td>

-      <td align="center" valign="top">*</td>

-      <td>From the &lt;tag&gt; in UnicodeData.txt, field 5. For characters with canonical decomposition 

-      mappings (no tag), the value &quot;canonical&quot; is used.

-      <p>* The value &quot;canonical&quot; is normative; the others are informative.</td>

-    </tr>

-    <tr>

-      <td valign="top">DerivedEastAsianWidth*</td>

-      <td align="center" valign="top">I</td>

-      <td>From EastAsianWidth.txt, field 1</td>

-    </tr>

-    <tr>

-      <td valign="top">DerivedGeneralCategory*</td>

-      <td align="center" valign="top">N</td>

-      <td>From UnicodeData.txt, field 2</td>

-    </tr>

-    <tr>

-      <td valign="top">DerivedJoiningGroup*</td>

-      <td align="center" valign="top">N</td>

-      <td>From ArabicShaping.txt, field 2</td>

-    </tr>

-    <tr>

-      <td valign="top">DerivedJoiningType*</td>

-      <td align="center" valign="top">N</td>

-      <td>From ArabicShaping.txt, field 1</td>

-    </tr>

-    <tr>

-      <td valign="top">DerivedLineBreak*</td>

-      <td align="center" valign="top">*</td>

-      <td>From LineBreak.txt, field 1.

-      <p>* Some values are normative; some are informative. For more information, see <span>UAX #14: 

-      Line Breaking Properties [<a href="#Line">Line</a>]</span>.</td>

-    </tr>

-    <tr>

-      <td valign="top">DerivedNumericType*</td>

-      <td align="center" valign="top">N</td>

-      <td>The property value is based on the contents of UnicodeData.txt, fields 6 through&nbsp;8:<br>

-&nbsp;

-      <div align="center">

-        <center>

-        <table>

-          <tr>

-            <th width="50%">property value</th>

-            <th width="50%">non-empty fields</th>

-          </tr>

-          <tr>

-            <td width="50%">decimal</td>

-            <td width="50%">6, 7, &amp; 8</td>

-          </tr>

-          <tr>

-            <td width="50%">digit</td>

-            <td width="50%">7 &amp; 8</td>

-          </tr>

-          <tr>

-            <td width="50%">numeric</td>

-            <td width="50%">8</td>

-          </tr>

-        </table>

-        </center>

-      </div>

-      </td>

-    </tr>

-    <tr>

-      <td valign="top">DerivedNumericValues*</td>

-      <td align="center" valign="top">N</td>

-      <td><i><b>Non-binary Property</b></i>

-      <p>From UnicodeData.txt, field 8</td>

-    </tr>

-  </table>

-  <blockquote>

-    <p><b><a name="Bidi_Note">Bidi Note</a>:</b> The BidiMirrored property and the BidiMirroring 

-    property are different. The former is a normative property that indicates whether characters are 

-    mirrored in a right-to-left context in the Unicode Bidirectional Algorithm. The latter is an 

-    informative mapping of BidiMirrored characters, where possible, to characters that normally have 

-    the corresponding mirrored glyph.</p>

-  </blockquote>

-  <h2><span><a name="Auxiliary_Property_Files">Auxiliary Property Files</a></span></h2>

-  <p><span>The files in this directory contain auxiliary properties. They consist of the following:</span></p>

-  <table>

-    <tr>

-      <th><span>Property</span></th>

-      <th>&nbsp;</th>

-      <th align="center"><span>N/I</span></th>

-      <th>&nbsp;</th>

-    </tr>

-    <tr>

-      <td><span><a name="Grapheme_Cluster_Break">Grapheme_Cluster_Break</a></span></td>

-      <td><span>E</span></td>

-      <td align="center"><span>I</span></td>

-      <td><span>GraphemeBreakProperty.txt</span><p><span>See UAX #29: Text Boundaries [<a href="#Breaks">Breaks</a>]

-      </span></td>

-    </tr>

-    <tr>

-      <td><span><a name="Sentence_Break">Sentence_Break</a></span></td>

-      <td><span>E</span></td>

-      <td align="center"><span>I</span></td>

-      <td><span>SentenceBreakProperty.txt</span><p><span>See UAX #29: Text Boundaries [<a href="#Breaks">Breaks</a>]</span></td>

-    </tr>

-    <tr>

-      <td><span><a name="Word_Break">Word_Break</a></span></td>

-      <td><span>E</span></td>

-      <td align="center"><span>I</span></td>

-      <td><span>WordBreakProperty.txt</span><p><span>See UAX #29: Text Boundaries [<a href="#Breaks">Breaks</a>]</span></td>

-    </tr>

-  </table>

-  <h2><a name="Property_Invariants">Property Invariants</a></h2>

-  <p>Values in the UCD are subject to correction as errors are found; however, some characteristics 

-  of the properties and files are considered invariants. Applications may wish to take these 

-  invariants into account when choosing how to implement character properties. The most important 

-  invariants are described in <a href="http://www.unicode.org/policies/policies.html">Unicode 

-  Policies</a>. The following lists some additional invariants and more detail on some of the 

-  invariants in Unicode Policies.</p>

-  <h4>UnicodeData Fields</h4>

-  <ul>

-    <li>The number of fields in UnicodeData.txt is fixed.

-    <ul>

-      <li>Any additional information about character properties to be added in the future will 

-      appear in separate data files, rather than being added as an additional field or by 

-      subdivision or reinterpretation of existing fields.</li>

-    </ul>

-    </li>

-    <li>The order of the fields is also fixed.</li>

-  </ul>

-  <h4>Combining Classes</h4>

-  <ul>

-    <li>Combining classes are limited to the values 0 to 255.

-    <ul>

-      <li>In practice, there are far fewer than 256 values used; Unicode 3.0 used 53 values, and 

-      Unicode 4.0 used 54 values total. (For details, see DerivedCombiningClasses.txt in the UCD.) 

-      Implementations may take advantage of this fact for compression, since only the ordering of 

-      the non-zero values matters for the Canonical Ordering Algorithm. In principle, it would be 

-      possible for up to 256 values to be used in the future; however, new combining classes are 

-      added very seldom. There are implementation advantages in restricting the number of classes to 

-      128—for example, the ability to used signed bytes without widening to ints in Java. </li>

-    </ul>

-    </li>

-    <li>All characters other than those of General Category M* have the combining class 0.

-    <ul>

-      <li>Currently, all characters other than those of General Category Mn have the value 0. 

-      However, some characters of General Category Me or Mc may be given non-zero values in the 

-      future.</li>

-      <li>The precise values above the value 0 are not invariant--only the relative ordering of 

-      values is considered fixed. For example, it is not guaranteed in future versions that the 

-      class of U+05B4 will be precisely 14.</li>

-    </ul>

-    </li>

-  </ul>

-  <h4>Decimal Digits</h4>

-  <ul>

-    <li>In Unicode 4.0 and thereafter, the General_Category value <i>Decimal_Number</i> (Nd), and 

-    the Numeric_Type value <i>Decimal</i> (de) are defined to be co-extensive, that is, the set of 

-    character having <i>Nd</i> will always be the same as the set of characters having <i>de</i>.</li>

-  </ul>

-  <h2><a name="References">References</a></h2>

-  <table class="noborder" style="border-collapse: collapse" cellpadding="4" cellspacing="0">

-    <tr>

-      <td valign="top" width="1" class="noborder"><span>[<a name="BIDI">BIDI</a>]</span></td>

-      <td valign="top" class="noborder"><span>UAX #9: The Bidirectional Algorithm<br>

-      Latest version:<br>

-      <a href="http://www.unicode.org/reports/tr9/">http://www.unicode.org/reports/tr9/</a><br>

-      4.1.0 version:<br>

-      <a href="http://www.unicode.org/reports/tr9/tr9-15.html">

-      http://www.unicode.org/reports/tr9/tr9-15.html</a> </span></td>

-    </tr>

-    <tr>

-      <td valign="top" width="1" class="noborder"><span>[<a name="Breaks">Breaks</a>]</span></td>

-      <td valign="top" class="noborder"><span><a href="http://www.unicode.org/reports/tr29/">UAX 

-      #29: Text Boundaries</a><br>

-      Latest Version:<br>

-      <a href="http://www.unicode.org/reports/tr29/">http://www.unicode.org/reports/tr29/</a><br>

-      4.1.0 version:<br>

-      <a href="http://www.unicode.org/reports/tr29/tr29-9.html">http://www.unicode.org/reports/tr29/tr29-9.html</a> </span></td>

-    </tr>

-    <tr>

-      <td valign="top" width="1" class="noborder">[<a name="FAQ">FAQ</a>]</td>

-      <td valign="top" class="noborder">Unicode Frequently Asked Questions<br>

-      <a href="http://www.unicode.org/faq/">http://www.unicode.org/faq/<br>

-      </a><i>For answers to common questions on technical issues.</i></td>

-    </tr>

-    <tr>

-      <td valign="top" width="1" class="noborder">[<a name="Glossary">Glossary</a>]</td>

-      <td valign="top" class="noborder">Unicode Glossary<a href="http://www.unicode.org/glossary/"><br>

-      http://www.unicode.org/glossary/<br>

-      </a><i>For explanations of terminology used in this and other documents.</i></td>

-    </tr>

-    <tr>

-      <td valign="top" width="1" class="noborder"><span>[<a name="Line">Line</a>]</span></td>

-      <td valign="top" class="noborder"><span>UAX #14: Line Breaking Properties<br>

-      Latest Version:<br>

-      <a href="http://www.unicode.org/reports/tr14/">http://www.unicode.org/reports/tr14/</a><br>

-      4.1.0 version:<br>

-      <a href="http://www.unicode.org/reports/tr14/tr14-17.html">

-      http://www.unicode.org/reports/tr14/tr14-17.html</a> </span></td>

-    </tr>

-    <tr>

-      <td valign="top" width="1" class="noborder"><span>[<a name="Norm">Norm</a>]</span></td>

-      <td valign="top" class="noborder"><span>UAX #15: Unicode Normalization Forms<br>

-      Latest Version:<br>

-      <a href="http://www.unicode.org/reports/tr15/">http://www.unicode.org/reports/tr15/</a><br>

-      4.1.0 version:<br>

-      <a href="http://www.unicode.org/reports/tr15/tr15-25.html">

-      http://www.unicode.org/reports/tr15/tr15-25.html</a> </span></td>

-    </tr>

-    <tr>

-      <td valign="top" width="1" class="noborder"><span>[<a name="Pattern">Pattern</a>]</span></td>

-      <td valign="top" class="noborder"><span>UAX #31: Identifier and Pattern Syntax<br>

-      Latest Version:<br>

-      <a href="http://www.unicode.org/reports/tr31/">http://www.unicode.org/reports/tr31/</a><br>

-      4.1.0 version:<br>

-      <a href="http://www.unicode.org/reports/tr31/tr31-5.html">

-      http://www.unicode.org/reports/tr31/tr31-5.html</a> </span></td>

-    </tr>

-    <tr>

-      <td valign="top" width="1" class="noborder">[<a name="Reports">Reports</a>]</td>

-      <td valign="top" class="noborder">Unicode Technical Reports<br>

-      <a href="http://www.unicode.org/reports/">http://www.unicode.org/reports/<br>

-      </a><i>For information on the status and development process for technical reports, and for a 

-      list of technical reports.</i></td>

-    </tr>

-    <tr>

-      <td valign="top" width="1" class="noborder">[<a name="Scripts">Scripts</a>]</td>

-      <td valign="top" class="noborder">UAX #24 Script Names<br>

-      <a href="http://www.unicode.org/reports/tr24/">http://www.unicode.org/reports/tr24/</a><br>

-      4.1.0 version:<br>

-      <a href="http://www.unicode.org/reports/tr24/tr24-7.html">

-      http://www.unicode.org/reports/tr24/tr24-7.htm</a> </td>

-    </tr>

-    <tr>

-      <td valign="top" width="1" class="noborder">[<a name="U4.0">U4.0</a>]</td>

-      <td valign="top" class="noborder">The Unicode Standard Version 4.0<br>

-      <a href="http://www.unicode.org/versions/Unicode4.0.0/">

-      http://www.unicode.org/versions/Unicode4.0.0/</a></td>

-    </tr>

-    <tr>

-      <td valign="top" width="1" class="noborder"><span>[<a name="U4.1.0">U4.1.0</a>]</span></td>

-      <td valign="top" class="noborder"><span>The Unicode Standard Version 4.1.0<br>

-      <a href="http://www.unicode.org/versions/Unicode4.1.0/">

-      http://www.unicode.org/versions/Unicode4.1.0/</a></span></td>

-    </tr>

-    <tr>

-      <td valign="top" width="1" class="noborder">[<a name="Versions">Versions</a>]</td>

-      <td valign="top" class="noborder">Versions of the Unicode Standard<br>

-      <a href="http://www.unicode.org/versions/">http://www.unicode.org/versions/<br>

-      </a><i>For details on the precise contents of each version of the Unicode Standard, and how to 

-      cite them.</i></td>

-    </tr>

-    <tr>

-      <td valign="top" width="1" class="noborder"><span>[<a name="Width">Width</a>]</span></td>

-      <td valign="top" class="noborder"><span>UAX #11: East Asian Width<br>

-      Latest Version:<br>

-      <a href="http://www.unicode.org/reports/tr11/">http://www.unicode.org/reports/tr11/</a><br>

-      4.1.0 version:<br>

-      <a href="http://www.unicode.org/reports/tr11/tr11-14.html">http://www.unicode.org/reports/tr11/tr11-14.html</a></span></td>

-    </tr>

-  </table>

-  <h2><br>

-  <a name="Modification_History">Modification History</a></h2>

-  <p>This section provides a summary of the changes between update versions of the Unicode Standard. 

-  The modifications prior to Unicode 4.0 only listed changes in UnicodeData.txt. From 4.0 onward, 

-  the consolidated modifications include the changes in other files.</p>

-  <h3><a href="http://www.unicode.org/versions/enumeratedversions.html#Unicode_4_1_0">Unicode 4.1.0</a></h3>

-  <p><b>This document:</b></p>

-  <ul>

-    <li><span>Added description of new directory and release structure, including the Auxiliary 

-    files.</span></li>

-    <li><span>Removed exception for field numbering in LineBreak and EastAsianWidth.</span></li>

-    <li><span>Added new properties, and changed some of the documentation of the identifier 

-    properties.</span></li>

-    <li><span>Removed the material that is now to be in Unihan.html</span></li>

-    <li><span>Removed the listing of default BIDI properties, referring now to&nbsp;

-    <a href="extracted/DerivedBidiClass.txt">extracted/DerivedBidiClass.txt</a></span></li>

-    <li>Replaced direct links to UAXes with links to references section<span>.</span></li>

-  </ul>

-  <p><b>Common file changes:</b></p>

-

-<p>

-All remaining files not corrected for Unicode 4.0.1 have

-had their headers updated to explicitly point to

-<a href="http://www.unicode.org/terms_of_use.html">Terms of Use</a>. The headers have also been

-synchronized somewhat to share a more common format for

-file version, date, and pointers to documentation.

-The major exception is UnicodeData.txt, which for legacy

-reasons, has no header.

-</p><p>

-<b>Changes in specific files:</b>

-</p><p>

-In some of the following, reference is made to a Public

-Review Issue (PRI). See

-<a href="http://www.unicode.org/review/resolved-pri.html">http://www.unicode.org/review/resolved-pri.html</a> for more information about those cases.

-</p><p>

-Appropriate data files were updated to include the 1273

-new characters added in Unicode 4.1.</p>

-	<p>

-The description of the Unihan properties was separated out from UCD.html, and 

-extensively revised, and now appears in Unihan.html.</p>

-	<p>

-<span>An auxiliary directory has been added. In 4.1.0 it contains properties associated with 

-    UAX #29: Text Boundaries [<a href="#Breaks">Breaks</a>].</span></p>

-

-<ul><li><b>UnicodeData.txt</b>

-<ul><li>

-  The Bidi_Class of U+202F was changed from bc=WS to bc=CS.

-    See PRI #45.

-</li><li>  

-  The Bidi_Class of U+FF0F was changed from bc=ES to bc=CS.

-    See PRI #44.

-</li><li>  

-  The Bidi_Class of U+2212 MINUS SIGN and 9 other characters

-    similar to either a minus sign or a plus sign were changed

-    to bc=ES. See PRI #57.

-</li><li>      

-  U+30FB KATAKANA MIDDLE DOT and U+FF65 HALFWIDTH KATAKANA MIDDLE DOT

-    were changed from gc=Pc to gc=Po. See PRI #55.

-</li><li>      

-  Case mappings were added for Georgian capitals (Asomtavruli)

-    to map them to the newly added Nuskhuri alphabet.

-</li><li>      

-  U+A015 YI SYLLABLE WU was changed from gc=Lo to gc=Lm.

-</li><li>      

-  9 Ethiopic digits were changed from gc=Nd to gc=No.

-</li><li>    

-  The Numeric_Type of U+1034A GOTHIC LETTER NINE HUNDRED was

-    changed from nt=None to nt=Nu, and it was given a Numeric_Value

-    of 900.

-</li><li>  

-  Uppercase and titlecase mappings were added for U+019A LATIN

-    SMALL LETTER L WITH BAR and U+0294 LATIN LETTER GLOTTAL STOP

-    to map them to newly added capital letters.

-</li></ul>

-<li><b>Unihan.txt</b>

-<ul><li>

-  Extensive additions and corrections were made for this data file.

-    See Unihan.html for the modification history.

-</li></ul></li>

-<li><b>ArabicShaping.txt</b>

-<ul><li>

-  The Joining_Group of U+06C2 ARABIC LETTER HEH GOAL WITH HAMZA ABOVE

-    was changed to jg=Heh_Goal.

-</li></ul>

-<li><b>BidiMirroring.txt</b>

-<ul><li>

-  The Bidi_Mirroring_Glyph value for U+2A2D was corrected.

-</li></ul>

-<li><b>Blocks.txt</b>

-<ul><li>

-  Added 20 new block definitions.

-</li></ul></li>

-<li><b>LineBreak.txt</b>

-<ul><li>

-  The Line_Break property of all conjoining jamos was updated from

-    lb=ID to make use of Hangul-specific Line_Break property values,

-    aligned with the Hangul_Syllable_Type property.

-</li><li>    

-  Many other corrections were made to the Line_Break property of

-    characters, particularly for punctuation marks specific to

-    Runic, Mongolian, Tibetan and various Indic scripts. For details 

-    on these changes, see UAX #14.

-</li></ul></li>

-<li><b>PropertyAliases.txt</b>

-<ul><li>

-  Properties and aliases were added for UAX #29, Text Boundaries:

-    Grapheme_Cluster_Break, Word_Break, and Sentence_Break.

-</li><li>        

-  Properties and aliases were added for: Other_ID_Continue,

-    Pattern_White_Space, and Pattern_Syntax.

-</li><li>        

-  An alias was added for White_Space: "space", for compatibility

-    with POSIX.

-</li></ul></li>

-<li><b>PropertyValueAliases.txt</b>

-<ul><li>

-  Property value aliases were added for all new properties, and

-    for new values added to existing catalog properties (blocks

-    and scripts).

-</li><li>        

-  Property value aliases were added for compatibility with POSIX:

-    "cntrl", "digit", and "punct"

-</li></ul></li>

-<li><b>PropList.txt</b>

-<ul><li>

-  3 new properties were added: Other_ID_Continue, Pattern_White_Space,

-    and Pattern_Syntax.

-</li><li>      

-  U+30A0 KATAKANA-HIRAGANA DOUBLE HYPHEN was given the Dash property.

-</li><li>      

-  U+A015 YI SYLLABLE WU was given the Extender property.

-</li><li>      

-  Golden number runes (U+16EE..U+16F0), Roman numerals (U+2160..U+2183),

-    and U+1034A GOTHIC LETTER NINE HUNDRED were removed from Other_Alphabetic.

-</li><li>        

-  Circled Latin letters (U+24B6..U+24E9) were added to Other_Alphabetic.

-    These changes to Other_Alphabetic were to better align Alphabetic

-    and casing properties. The derived property Alphabetic is now a

-    superset of the derived properties Lowercase and Uppercase,

-    for compatibility with POSIX-style character classes.

-</li><li>        

-  3 musical symbol combining flags (U+1D170..U+1D172) were added

-    to Other_Grapheme_Extend to fix an inconsistency in the data.

-</li><li>        

-  U+200B ZERO WIDTH SPACE was removed from Other_Default_Ignorable_Code_Point.

-</li></ul></li>

-<li><b>Scripts.txt</b>

-<ul><li>

-  8 new Script values were added: Buginese, Coptic, New_Tai_Lue,

-    Glagolitic, Tifinagh, Syloti_Nagri, Old_Persian, and Kharoshthi.

-</li><li>        

-  The Script value Katakana_Or_Hiragana (Hrkt) was removed.

-</li><li>      

-  The Script for the 14 Coptic letters in the Greek and Coptic block

-    were updated to sc=Copt.

-</li><li>       

-  10 characters (punctuation and extenders) shared by Katakana and

-    Hiragana were changed from sc=Hrkt to sc=Zyyy.

-</li></ul></li>

-<li><b>SpecialCasing.txt</b>

-<ul><li>

-  The case mapping contexts defined in this file were updated.

-</li><li>    

-  A number of clarifying changes were made to comments in the header

-    of this data file.

-</li></ul>

-</ul>

-

-  <h3><a href="http://www.unicode.org/versions/enumeratedversions.html#Unicode_4_0_1">Unicode 4.0.1</a></h3>

-  <p><b>This document:</b></p>

-  <ul>

-    <li>Added two new properties</li>

-    <li>Added the property types Catalog and Miscellaneous</li>

-    <li>Described loose matching of property names and values</li>

-    <li>Added to file format</li>

-  </ul>

-  <p><b>Common file changes:</b></p>

-  <p>Some property values have different casing (upper vs. lower) for consistency between the data 

-  files and the PropertyValueAlias file. There are some additional changes in comments:</p>

-  <ul>

-    <li>Nearly all files changed headers to explicitly point to <i>

-    <a href="http://www.unicode.org/terms_of_use.html">Terms of Use</a></i></li>

-    <li>Names for code points without names now have a more uniform style, such as <i>

-    &lt;reserved-1234&gt;</i></li>

-    <li>Where characters with a default value are not listed, that information is indicated in the 

-    total code point counts</li>

-    <li>The full property name and property value name (for enumerated properties) is usually 

-    supplied in a comment</li>

-  </ul>

-  <p><b>Changes in specific files:</b></p>

-  <p>In some of the following, reference is made to a Public Review Issue (PRI). See

-  <a href="http://www.unicode.org/review/resolved-pri.html">

-  http://www.unicode.org/review/resolved-pri.html</a> for more information about those cases.</p>

-  <ul>

-    <li><b>UnicodeData.txt</b><br>

-    <ul>

-      <li>Changed general category of Zero Width Space (U+200B) from Zs to Cf. For background 

-      information, see PRI #21.</li>

-      <li>Bidi Conformance was made much clearer and more rigorous, also resulting in a number of 

-      property changes:<br>

-      <ul>

-        <li>Several Bidi fixes impact number and date formatting with the following characters: +, 

-        -, /</li>

-        <li>Braille symbols were changed to being strong Left-to-right, to reflect usage.</li>

-        <li>A review of BN and Default Ignorable code points resulted in a number of changes: for 

-        details, see PRI #28.</li>

-        <li>Some other bidi tweaks were made for consistency.</li>

-      </ul>

-      </li>

-      <li>While the properties of the Join_Controls have not changed, their role in combining 

-      characters sequences has. For more information, see

-      <a href="http://www.unicode.org/versions/Unicode4.0.1/">

-      http://www.unicode.org/versions/Unicode4.0.1/</a>.</li>

-      <li>Removed an extraneous space at the end of the name field for two characters.</li>

-    </ul>

-    </li>

-    <li><b>Unihan.txt</b>

-    <ul>

-      <li>A major update of the Unihan data file, to bring it up-to-date for Unicode 4.0. (It was 

-      not released in Version 4.0.0, because of the time required to complete and check corrections 

-      to the data file.) This update rolls in fixes for nearly all known errors in the prior version 

-      of the file and adds a very large amount of other informative data. For details, see the 

-      header of that file.</li>

-      <li>Added three new tags: kHanyuPinlu,&nbsp; kGSR, and kIRG_USource.</li>

-      <li>Completed data for kCihaiT, kCowles, kGradeLevel, and kLau</li>

-      <li>The kMandarin field has been corrected and its order restored to a&nbsp;&quot;frequency&quot; order</li>

-    </ul>

-    </li>

-    <li><b>ArabicShaping.txt</b>

-    <ul>

-      <li>Moved one entry into code point order.</li>

-    </ul>

-    </li>

-    <li><b>Blocks.txt</b>

-    <ul>

-      <li>Corrected name of the Cyrillic Supplement block.</li>

-    </ul>

-    </li>

-    <li><b>DerivedCoreProperties.txt</b>

-    <ul>

-      <li>ZWNJ/ZWJ (U+200C..U+200D) now have the <a href="#Grapheme_Extend">Grapheme_Extend</a> 

-      property.</li>

-    </ul>

-    </li>

-    <li><b>DerivedNormalizationProps.txt</b>

-    <ul>

-      <li>While not actually changing the particular values associated with the Quick Check 

-      properties for characters, a revision was made in how the Quick Check properties are expressed 

-      in the file, to bring it more into line with the model for other properties. This resulted in 

-      a significant change in the format of the data file and the explicit separation of Yes, No, 

-      and Maybe values. In addition, the actual aliases for the property changed in the data file.</li>

-    </ul>

-    </li>

-    <li><b>Index.txt</b>

-    <ul>

-      <li>Updated to correspond to the character index published as part of the

-      <a href="http://www.unicode.org/versions/Unicode4.0.0/">Unicode Standard, Version 4.0</a>.</li>

-    </ul>

-    </li>

-    <li><b>LineBreak.txt</b>

-    <ul>

-      <li>Many changes for consistency and to better match best practice in existing line break 

-      implementations; for details, see <a href="http://www.unicode.org/reports/tr14/">UAX #14: Line 

-      Breaking Properties</a></li>

-    </ul>

-    </li>

-    <li><b>PropertyAliases.txt</b>

-    <ul>

-      <li>Addition of some property categories, with the order of property aliases adjusted for 

-      clarity. </li>

-      <li>Addition of alias entries for the new <a href="#STerm">STerm</a> and

-      <a href="#Variation_Selector">Variation_Selector</a> properties.</li>

-    </ul>

-    </li>

-    <li><b>PropertyValueAliases.txt</b>

-    <ul>

-      <li>Addition of specific values and aliases for age. </li>

-      <li>Addition of second alias for the Cyrillic Supplement block. </li>

-      <li>Addition of second alias for the Inseparable value of the Line Break property. </li>

-      <li>Revision of the all the Normalization Quick Check properties, to replace the 

-      pseudo-property &quot;qc&quot; with actual specific properties with explicit enumerated value aliases.

-      </li>

-      <li>Addition of Katakana_Or_Hiragana script alias.</li>

-      <li>Fixed None (so it is used uniformly in first aliases instead of being the only n/a)</li>

-    </ul>

-    </li>

-    <li><b>PropList.txt</b>

-    <ul>

-      <li>Major revision of the <a href="#Other_Math">Other_Math</a> property to align the derived

-      <a href="#Math">Math</a> property with the explanation given in UTR #25. </li>

-      <li>Extension of the list of characters with the <a href="#Soft_Dotted">Soft_Dotted</a> 

-      property. </li>

-      <li>Significant update of the list of characters with the Terminal_Punctuation property. </li>

-      <li>Addition of a new <a href="#STerm">STerm</a> property, to simplify the description used in 

-      UAX #29. </li>

-      <li>Addition of the <a href="#Variation_Selector">Variation_Selector</a> property. </li>

-      <li>Reassignment of the list of characters with the

-      <a href="#Other_Default_Ignorable_Code_Point">Other_Default_Ignorable_Code_Point</a> property, 

-      to enable simpler derivation. </li>

-      <li>Addition of ZWNJ/ZWJ (200C..200D) to <a href="#Other_Grapheme_Extend">

-      Other_Grapheme_Extend</a>.</li>

-    </ul>

-    </li>

-    <li><b>Scripts.txt</b>

-    <ul>

-      <li>Significant revision of script assignments, to assign specific script values to many 

-      characters that previously had the Common script value. </li>

-      <li>Addition of the Katakana_Or_Hiragana script value, with list of characters for it.</li>

-      <li>The Common values are now listed, for comparison.</li>

-    </ul>

-    </li>

-    <li><b>SpecialCasing.txt</b>

-    <ul>

-      <li>Correction of typo in comments.</li>

-    </ul>

-    </li>

-  </ul>

-  <h3><a href="http://www.unicode.org/versions/enumeratedversions.html#Unicode_4_0_0">Unicode 4.0</a></h3>

-  <ul>

-    <li><b>UnicodeData.txt</b>

-    <ul>

-      <li>Decimal Digits

-      <ul>

-        <li>Numeric_Type=decimal digit now aligned with General_Category=Nd</li>

-      </ul>

-      </li>

-      <li>Modifier letters*

-      <ul>

-        <li>The general category of 02B9..02BA, 02C6..02CF changed to general category Lm.</li>

-      </ul>

-      </li>

-    </ul>

-    </li>

-    <li><b>Other Files</b>

-    <ul>

-      <li>New Properties and Values

-      <ul>

-        <li>Hangul_Syllable_Type, Unicode_Radical_Stroke</li>

-        <li>CJK numeric values added.</li>

-        <li>PropertyValueAliases adds block names</li>

-        <li>UCD fallback props more precisely defined, for code points not explicitly in data files</li>

-        <li>Added script value for Braille</li>

-        <li>New line breaking properties: NL, WJ</li>

-      </ul>

-      </li>

-      <li>Khmer

-      <ul>

-        <li>Two Khmer characters are deprecated; four others strongly discouraged.</li>

-      </ul>

-      </li>

-      <li>Special Casing

-      <ul>

-        <li>Fixed for Turkish, Lithuanian</li>

-      </ul>

-      </li>

-      <li>Default Ignorables

-      <ul>

-        <li>Hangul Filler characters</li>

-        <li>Soft-Hyphen, CGJ, ZWS</li>

-        <li>Arabic End of Ayah and Syriac Abbreviation Mark no longer DI (their shaping classes are 

-        also fixed.)</li>

-      </ul>

-      </li>

-      <li>Grapheme_Extend

-      <ul>

-        <li>Removes halfwidth katakana marks, most Mc (except as needed for canonical equivalence)</li>

-      </ul>

-      </li>

-      <li><a href="#Stabilized">Stabilized</a> Properties

-      <ul>

-        <li>The <a href="#Hyphen">Hyphen</a> property is now stabilized.</li>

-      </ul>

-      </li>

-    </ul>

-    </li>

-  </ul>

-  <h3><a href="http://www.unicode.org/versions/enumeratedversions.html#Unicode_3_2_0">Unicode 3.2</a></h3>

-  <p>Modifications made for Version 3.2.0 of UnicodeData.txt include:</p>

-  <blockquote>

-    <ul>

-      <li>Addition of 1016 new entries, to cover new characters encoded in Unicode 3.2.</li>

-      <li>Updated ISO 6429 names for control functions to match the currently published version of 

-      that standard.</li>

-      <li>Changed general category for Mongolian free variation selectors (U+180B..U+180D) from Cf 

-      to Mn.</li>

-      <li>Changed general category for U+0B83 TAMIL SIGN VISARGA (aytham) from Mc to Lo.</li>

-      <li>Changed general category for U+06DD ARABIC END OF AYAH from Me to Cf.</li>

-      <li>Changed general category for U+17D7 KHMER SIGN LEK TOO from Po to Lm.</li>

-      <li>Changed general category for U+17DC KHMER SIGN AVAKRAHASANYA from Po to Lo.</li>

-      <li>Changed canonical decomposition for U+F951 from 96FB to 964B (see <i>

-      <a href="http://www.unicode.org/versions/corrigendum3.html">Corrigendum #3: U+F951 

-      Normalization</a></i>).</li>

-    </ul>

-  </blockquote>

-  <h3><a href="http://www.unicode.org/versions/enumeratedversions.html#Unicode_3_1_1">Unicode 3.1.1</a></h3>

-  <p>Modifications made for Version 3.1.1 of UnicodeData.txt include:</p>

-  <ul>

-    <li>Modification of ISO 10646 annotation regarding Greek tonos, affecting entries for U+0301 and 

-    U+030D.</li>

-  </ul>

-  <h3><a href="http://www.unicode.org/versions/enumeratedversions.html#Unicode_3_1_0">Unicode 3.1</a></h3>

-  <p>Modifications made for Version 3.1.0 of UnicodeData.txt include:</p>

-  <ul>

-    <li>Addition of 2237 new entries, to cover new characters and new ranges of unified Han 

-    characters encoded in Unicode 3.1.</li>

-    <li>Changed General Category value of 16EE..16F0 (Runic golden numbers) from No to Nl.</li>

-  </ul>

-  <h3><a href="http://www.unicode.org/versions/enumeratedversions.html#Unicode_3_0_1">Unicode 3.0.1</a></h3>

-  <p>Modifications made for Version 3.0.1 of UnicodeData.txt include:</p>

-  <ul>

-    <li>Added 5- and 6-digit representation of code points past U+FFFF.</li>

-    <li>Added Private Use range definitions for Planes 15 and 16.</li>

-    <li>Minor additions for the 10646 comment field.</li>

-  </ul>

-  <h3><a href="http://www.unicode.org/versions/enumeratedversions.html#Unicode_3_0_0">Unicode 3.0.0</a></h3>

-  <p>Modifications made for Version 3.0.0 of UnicodeData.txt include many new characters and a 

-  number of property changes. These are summarized in Appendix D of <em>The Unicode Standard, 

-  Version 3.0.</em></p>

-  <h3><a href="http://www.unicode.org/versions/enumeratedversions.html#Unicode_2_1_9">Unicode 2.1.9</a></h3>

-  <p>Modifications made for Version 2.1.9 of UnicodeData.txt include:</p>

-  <ul>

-    <li>Corrected combining class for U+05AE HEBREW ACCENT ZINOR.</li>

-    <li>Corrected combining class for U+20E1 COMBINING LEFT RIGHT ARROW ABOVE</li>

-    <li>Corrected combining class for U+0F35 and U+0F37 to 220.</li>

-    <li>Corrected combining class for U+0F71 to 129.</li>

-    <li>Added a decomposition for U+0F0C TIBETAN MARK DELIMITER TSHEG BSTAR.</li>

-    <li>Added&nbsp; decompositions for several Greek symbol letters: U+03D0..U+03D2, U+03D5, U+03D6, 

-    U+03F0..U+03F2.</li>

-    <li>Removed&nbsp; decompositions from the conjoining jamo block: U+1100..U+11F8.</li>

-    <li>Changes to decomposition mappings for some Tibetan vowels for consistency in normalization. 

-    (U+0F71, U+0F73, U+0F77, U+0F79, U+0F81)</li>

-    <li>Updated the decomposition mappings for several Vietnamese characters with two diacritics 

-    (U+1EAC, U+1EAD, U+1EB6, U+1EB7, U+1EC6, U+1EC7, U+1ED8, U+1ED9), so that the recursive 

-    decomposition can be generated directly in canonically reordered form (not a normative change).</li>

-    <li>Updated the decomposition mappings for several Arabic compatibility characters involving 

-    shadda (U+FC5E..U+FC62, U+FCF2..U+FCF4), and two Latin characters (U+1E1C, U+1E1D), so that the 

-    decompositions are generated directly in canonically reordered form (not a normative change).</li>

-    <li>Changed BIDI category for: U+00A0 NO-BREAK SPACE, U+2007 FIGURE SPACE, U+2028 LINE 

-    SEPARATOR.</li>

-    <li>Changed BIDI category for extenders of General Category Lm: U+3005, U+3021..U+3035, U+FF9E, 

-    U+FF9F.</li>

-    <li>Changed General Category and BIDI category for the Greek numeral signs: U+0374, U+0375.</li>

-    <li>Corrected General Category for U+FFE8 HALFWIDTH FORMS LIGHT VERTICAL.</li>

-    <li>Added Unicode 1.0 names for many Tibetan characters (informative).</li>

-  </ul>

-  <h3><a href="http://www.unicode.org/versions/enumeratedversions.html#Unicode_2_1_8">Unicode 2.1.8</a></h3>

-  <p>Modifications made for Version 2.1.8 of UnicodeData.txt include:</p>

-  <ul>

-    <li>Added combining class 240 for U+0345 COMBINING GREEK YPOGEGRAMMENI so that decompositions 

-    involving iota subscript are derivable directly in canonically reordered form; this also has a 

-    bearing on simplification of casing of polytonic Greek.</li>

-    <li>Changes in decompositions related to Greek tonos. These result from the clarification that 

-    monotonic Greek &quot;tonos&quot; should be equated with U+0301 COMBINING ACUTE, rather than with U+030D 

-    COMBINING VERTICAL LINE ABOVE. (All Greek characters in the Greek block involving &quot;tonos&quot;; some 

-    Greek characters in the polytonic Greek in the 1FXX block.)</li>

-    <li>Changed decompositions involving dialytika tonos. (U+0390, U+03B0)</li>

-    <li>Changed ternary decompositions to binary. (U+0CCB, U+FB2C, U+FB2D) These changes simplify 

-    normalization.</li>

-    <li>Removed canonical decomposition for Latin Candrabindu. (U+0310)</li>

-    <li>Corrected error in canonical decomposition for U+1FF4.</li>

-    <li>Added compatibility decompositions to clarify collation tables. (U+2100, U+2101, U+2105, 

-    U+2106, U+1E9A)</li>

-    <li>A series of general category changes to assist the convergence of the Unicode definition of 

-    identifier with ISO TR 10176:

-    <ul>

-      <li>So &gt; Lo: U+0950, U+0AD0, U+0F00, U+0F88..U+0F8B</li>

-      <li>Po &gt; Lo: U+0E2F, U+0EAF, U+3006</li>

-      <li>Lm &gt; Sk: U+309B, U+309C</li>

-      <li>Po &gt; Pc: U+30FB, U+FF65</li>

-      <li>Ps/Pe &gt; Mn: U+0F3E, U+0F3F</li>

-    </ul>

-    </li>

-    <li>A series of bidi property changes for consistency.

-    <ul>

-      <li>L &gt; ET: U+09F2, U+09F3</li>

-      <li>ON &gt; L: U+3007</li>

-      <li>L &gt; ON: U+0F3A..U+0F3D, U+037E, U+0387</li>

-    </ul>

-    </li>

-    <li>Add case mapping: U+01A6 &lt;-&gt; U+0280</li>

-    <li>Updated symmetric swapping value for guillemets: U+00AB, U+00BB, U+2039, U+203A.</li>

-    <li>Changes to combining class values. Most Indic fixed position class non-spacing marks were 

-    changed to combining class 0. This fixes some inconsistencies in how canonical reordering would 

-    apply to Indic scripts, including Tibetan. Indic interacting top/bottom fixed position classes 

-    were merged into single (non-zero) classes as part of this change. Tibetan subjoined consonants 

-    are changed from combining class 6 to combining class 0. Thai pinthu (U+0E3A) moved to combining 

-    class 9. Moved two Devanagari stress marks into generic above and below combining classes 

-    (U+0951, U+0952).</li>

-    <li>Corrected placement of semicolon near symmetric swapping field. (U+FA0E, etc., scattered 

-    positions to U+FA29)</li>

-  </ul>

-  <h3>Version 2.1.7</h3>

-  <p><i>This version was for internal change tracking only, and never publicly released.</i></p>

-  <h3>Version 2.1.6</h3>

-  <p><i>This version was for internal change tracking only, and never publicly released.</i></p>

-  <h3><a href="http://www.unicode.org/versions/enumeratedversions.html#Unicode_2_1_5">Unicode 2.1.5</a></h3>

-  <p>Modifications made for Version 2.1.5 of UnicodeData.txt include:</p>

-  <ul>

-    <li>Changed decomposition for U+FF9E and U+FF9F so that correct collation weighting will 

-    automatically result from the canonical equivalences.</li>

-    <li>Removed canonical decompositions for U+04D4, U+04D5, U+04D8, U+04D9, U+04E0, U+04E1, U+04E8, 

-    U+04E9 (the implication being that no canonical equivalence is claimed between these 8 

-    characters and similar Latin letters), and updated 4 canonical decompositions for U+04DB, 

-    U+04DC, U+04EA, U+04EB to reflect the implied difference in the base character.</li>

-    <li>Added Pi, and Pf categories and assigned the relevant quotation marks to those categories, 

-    based on the Unicode Technical Corrigendum on Quotation Characters.</li>

-    <li>Updating of many bidi properties, following the advice of the ad hoc committee on bidi, and 

-    to make the bidi properties of compatibility characters more consistent.</li>

-    <li>Changed category of several Tibetan characters: U+0F3E, U+0F3F, U+0F88..U+0F8B to make them 

-    non-combining, reflecting the combined opinion of Tibetan experts.</li>

-    <li>Added case mapping for U+03F2.</li>

-    <li>Corrected case mapping for U+0275.</li>

-    <li>Added titlecase mappings for U+03D0, U+03D1, U+03D5, U+03D6, U+03F0.. U+03F2.</li>

-    <li>Corrected compatibility label for U+2121.</li>

-    <li>Add specific entries for all the CJK compatibility ideographs, U+F900..U+FA2D, so the 

-    canonical decomposition for each (the URO character it is equivalent to) can be carried in the 

-    database.</li>

-  </ul>

-  <h3>Version 2.1.4</h3>

-  <p><i>This version was for internal change tracking only, and never publicly released.</i></p>

-  <h3>Version 2.1.3</h3>

-  <p><i>This version was for internal change tracking only, and never publicly released.</i></p>

-  <h3><a href="http://www.unicode.org/versions/enumeratedversions.html#Unicode_2_1_2">Unicode 2.1.2</a></h3>

-  <p>Modifications made in updating UnicodeData.txt to Version 2.1.2 for the Unicode Standard, 

-  Version 2.1 (from Version 2.0) include:</p>

-  <ul>

-    <li>Added two characters (U+20AC and U+FFFC).</li>

-    <li>Amended bidi properties for U+0026, U+002E, U+0040, U+2007.</li>

-    <li>Corrected case mappings for U+018E, U+019F, U+01DD, U+0258, U+0275, U+03C2, U+1E9B.</li>

-    <li>Changed combining order class for U+0F71.</li>

-    <li>Corrected canonical decompositions for U+0F73, U+1FBE.</li>

-    <li>Changed decomposition for U+FB1F from compatibility to canonical.</li>

-    <li>Added compatibility decompositions for U+FBE8, U+FBE9, U+FBF9..U+FBFB.</li>

-    <li>Corrected compatibility decompositions for U+2469, U+246A, U+3358.</li>

-  </ul>

-  <h3>Version 2.1.1</h3>

-  <p><i>This version was for internal change tracking only, and never publicly released.</i></p>

-  <h3><a href="http://www.unicode.org/versions/enumeratedversions.html#Unicode_2_0_0">Unicode 2.0.0</a></h3>

-  <p>The modifications made in updating UnicodeData.txt for the Unicode Standard, Version 2.0 

-  include:</p>

-  <ul>

-    <li>Fixed decompositions with TONOS to use correct NSM: 030D.</li>

-    <li>Removed old Hangul Syllables; mapping to new characters are in a separate table.</li>

-    <li>Marked compatibility decompositions with additional tags.</li>

-    <li>Changed old tag names for clarity.</li>

-    <li>Revision of decompositions to use first-level decomposition, instead of maximal 

-    decomposition.</li>

-    <li>Correction of all known errors in decompositions from earlier versions.</li>

-    <li>Added control code names (as old Unicode names).</li>

-    <li>Added Hangul Jamo decompositions.</li>

-    <li>Added Number category to match properties list in book.</li>

-    <li>Fixed categories of Koranic Arabic marks.</li>

-    <li>Fixed categories of precomposed characters to match decomposition where possible.</li>

-    <li>Added Hebrew cantillation marks and the Tibetan script.</li>

-    <li>Added place holders for ranges such as CJK Ideographic Area and the Private Use Area.</li>

-    <li>Added categories Me, Sk, Pc, Nl, Cs, Cf, and rectified a number of mistakes in the database.</li>

-  </ul>

-  <h2><i><a name="UCD_Terms">UCD Terms of Use</a></i></h2>

-  <p>For terms of use, see <i>

-	<a href="http://www.unicode.org/terms_of_use.html">http://www.unicode.org/terms_of_use.html</a>.</i></p>

-  <hr width="50%">

-  <div align="center">

-    <center>

-    <table cellspacing="0" cellpadding="0" border="0">

-      <tr>

-        <td><a href="http://www.unicode.org/copyright.html">

-        <img src="http://www.unicode.org/img/hb_notice.gif" border="0" alt="Access to Copyright and terms of use" width="216" height="50"></a></td>

-      </tr>

-    </table>

-    <script language="Javascript" type="text/javascript" src="http://www.unicode.org/webscripts/lastModified.js">

-                </script>

-    </center>

-  </div>

-</div>

-

-</body>

-

-</html>

diff --git a/ucd/UnicodeCharacterDatabase.html b/ucd/UnicodeCharacterDatabase.html
deleted file mode 100644
index 491680d..0000000
--- a/ucd/UnicodeCharacterDatabase.html
+++ /dev/null
@@ -1,19 +0,0 @@
-<!doctype HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">
-<html>

-

-<head>

-<meta http-equiv="Content-Type" content="text/html; charset=utf-8">

-<meta name="GENERATOR" content="Microsoft FrontPage 4.0">

-<meta name="ProgId" content="FrontPage.Editor.Document">

-<title>Unicode Character Database Documentation</title>

-</head>

-

-<body>

-

-<p>Starting with Version 4.0.0, most of the documentation for the Unicode 

-Character Database, including material formerly in this file, has been 

-consolidated into UCD.html.</p>

-

-</body>

-

-</html>

diff --git a/ucd/Unihan.html b/ucd/Unihan.html
deleted file mode 100644
index 4a49cc1..0000000
--- a/ucd/Unihan.html
+++ /dev/null
@@ -1,3368 +0,0 @@
-<!doctype HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN" "http://www.w3.org/TR/REC-html40/loose.dtd">
-<html>
-
-<head>
-<meta http-equiv="Content-Type" content="text/html; charset=utf-8">
-<meta http-equiv="Content-Language" content="en-us">
-<meta name="GENERATOR" content="Microsoft FrontPage 5.0">
-<meta name="ProgId" content="FrontPage.Editor.Document">
-<title>Unicode Han Database</title>
-<link rel="stylesheet" type="text/css" href="http://www.unicode.org/reports/reports.css">
-<style type="text/css">
-<!--
-th           { background-color: #CCFFCC }
--->
-</style>
-</head>
-
-<body bgcolor="#ffffff">
-
-<table class="header" width="100%">
-  <tr>
-    <td class="icon"><a href="http://www.unicode.org">
-    <img align="middle" alt="[Unicode]" border="0" src="http://www.unicode.org/webscripts/logo60s2.gif" width="34" height="33"></a>&nbsp;&nbsp;<a class="bar" href="http://www.unicode.org/ucd/">Unicode Character Database</a></td>
-  </tr>
-  <tr>
-    <td class="gray">&nbsp;</td>
-  </tr>
-</table>
-<div class="body">
-  <h1>UNICODE HAN DATABASE</h1>
-  <table class="wide" border="1">
-    <tr>
-      <td valign="TOP" width="144">Revision</td>
-      <td valign="TOP">4.1.0</td>
-    </tr>
-    <tr>
-      <td valign="TOP" width="144">Authors</td>
-      <td valign="TOP">John Jenkins, Richard Cook</td>
-    </tr>
-    <tr>
-      <td valign="TOP" width="144">Date</td>
-      <td valign="TOP">2005-03-30</td>
-    </tr>
-    <tr>
-      <td valign="TOP" width="144">This Version</td>
-      <td valign="TOP"><a href="http://www.unicode.org/Public/4.1.0/ucd/Unihan.html">
-      http://www.unicode.org/Public/4.1.0/ucd/Unihan.html</a></td>
-    </tr>
-    <tr>
-      <td valign="TOP" width="144">Previous Version</td>
-      <td valign="TOP">(was part of
-      <a href="http://www.unicode.org/Public/4.0-Update1/UCD-4.0.1.html">
-      http://www.unicode.org/Public/4.0-Update1/UCD-4.0.1.html</a>)</td>
-    </tr>
-    <tr>
-      <td valign="TOP" width="144">Latest Version</td>
-      <td valign="TOP"><a href="http://www.unicode.org/Public/UNIDATA/Unihan.html">
-      http://www.unicode.org/Public/UNIDATA/Unihan.html</a></td>
-    </tr>
-  </table>
-  <h3><i>Summary</i></h3>
-  <blockquote>
-    <p><i>This document describes the format and content of the file Unihan.txt in the <a href="http://www.unicode.org/ucd/">Unicode Character Database (UCD)</a></i>.</p>
-  </blockquote>
-  <h3><i>Status</i></h3>
-  <blockquote>
-    <p><i>This file and the files described herein are part of the Unicode Character Database and 
-    are governed by the terms of use at <a href="http://www.unicode.org/terms_of_use.html">
-    http://www.unicode.org/terms_of_use.html</a>.</i></p>
-    <p><i>The <a href="#References">References</a> provide related information that is useful in 
-    understanding this document.</i></p>
-    <p><i><b>Warning: </b>the information in this file does not completely describe the use and 
-    interpretation of Unicode character properties and behavior. It must be used in conjunction with 
-    the data in the other files in the Unicode Character Database, and relies on the notation and 
-    definitions supplied in <a href="http://www.unicode.org/standard/standard.html">The Unicode 
-    Standard</a>. All chapter references are to Version 4.1.0 of the standard unless otherwise 
-    indicated.</i></p>
-  </blockquote>
-
- <h3><i>Contents</i></h3>
-    <ul>
-        <li><a href="#File_Structure">File Structure</a></li>
-        <li><a href="#Unihan_Tags">Unihan Properties</a></li>
-        <li><a href="#Unihan_ABC">Alphabetical listing of Unihan Properties</a></li>
-        <li><a href="#Unihan_Cats">Unihan Properties by Category</a></li>
-        <li><a href="#Unihan_Status">Unihan Properties by Status</a></li>
-        <li><a href="#Unihan_Detail">Unihan Properties in Detail</a></li>
-        <li><a href="#References">References</a></li>
-        <li><a href="#Modification_History">Modification History</a></li>
-        <li><a href="#UCD_Terms">UCD Terms of Use</a></li>
-    </ul>
-
-  <hr>
-  <h2><a name="File_Structure">File Structure</a></h2>
-  <p>Each line (record) of the file Unihan.txt consists of three tab-separated fields. 
-    <ul>
-        <li>Field 1 contains the Unicode code point in the form U+[X]XXXX (that is, there are either four or five hex digits following the U+ prefix).</li> 
-        <li>Field 2 contains a tag name indicating the type or source of information in the third field.</li>
-        <li>Field 3 contains the line&#39;s value (in UTF-8).</li>
-    </ul>
-
-  <p>Ranges of Han code points valid for Field 1 of Unihan.txt are listed in the following table:</p>
-  <table cellpadding="4">
-    <tr>
-      <th>Code point range </th>
-      <th>Block name </th>
-      <th>Release</th>
-    </tr>
-    <tr><td>U+3400..U+4DB5</td>  <td>CJK Unified Ideographs Extension A</td><td>3.0</td></tr>
-    <tr><td>U+4E00..U+9FA5</td>  <td>CJK Unified Ideographs</td>            <td>1.1</td></tr>
-    <tr BGCOLOR="red"><td>U+9FA6..U+9FBB</td>  <td>CJK Unified Ideographs </td>           <td>4.1</td></tr>
-    <tr><td>U+F900..U+FA2D</td>  <td>CJK Compatibility Ideographs</td>      <td>1.1</td></tr>
-    <tr><td>U+FA30..U+FA6A</td>  <td>CJK Compatibility Ideographs</td>      <td>3.2</td></tr>
-    <tr BGCOLOR="red"><td>U+FA70..U+FAD9</td>  <td>CJK Compatibility Ideographs</td>      <td>4.1</td></tr>
-    <tr><td>U+20000..U+2A6D6</td><td>CJK Unified Ideographs Extension B</td><td>3.1</td></tr>
-    <tr><td>U+2F800..U+2FA1D</td><td>CJK Compatibility Supplement</td>      <td>3.1</td></tr>
-  </table>
-
-  <p>Note that CJK characters in the following ranges <b>do not</b> have mapping data in Field 1 of Unihan.txt:</p>
-  <table BGCOLOR="#CCCCCC" cellpadding="1">
-    <tr>
-      <th>Code point range </th>
-      <th>Block name </th>
-      <th>Release</th>
-    </tr>
-    <tr><td>U+2E80..U+2E99</td>  <td>CJK RADICALS SUPPLEMENT</td>                  <td>3.0</td></tr>
-    <tr><td>U+2E9B..U+2EF3</td>  <td>CJK RADICALS SUPPLEMENT</td>                  <td>3.0</td></tr>
-    <tr><td>U+2F00..U+2FD5</td>  <td>KANGXI RADICALS </td>                         <td>3.0</td></tr>
-    <tr><td>U+3000..U+303F</td>  <td>CJK SYMBOLS AND PUNCTUATION</td>              <td>various</td></tr>
-    <tr><td>U+3200..U+3243</td>  <td>ENCLOSED CJK LETTERS AND MONTHS</td>          <td>various</td></tr>
-    <tr><td>U+3250..U+32FE</td>  <td>ENCLOSED CJK LETTERS AND MONTHS</td>          <td>various</td></tr>
-    <tr><td>U+3300..U+33FF</td>  <td>CJK COMPATIBILITY</td>                        <td>various</td></tr>
-  </table>
-
-  <p></p>
-
-  <h2><a name="Unihan_Tags">Unihan Properties</a></h2>
-  <p>Below are lists of the properties (data tags) in Unihan.txt. Information on each of these properties is given in the table below. Only a few Unihan properties correspond to Unicode normative or informative properties: the rest are provisional. For more information on the meanings of the "Normative", "Informative" and "Provisional" Status flags, see definitions D9, D9a, and D9b in Chapter 3 Properties of Unicode 4.0 [<a href="#U4.0">U4.0</a>]. For information on properties and on the general structure of the Unicode Character Database, see <a href="UCD.html">UCD.html</a>.</p> 
-
-
-
-
-
-
-  <hr>
-  <h3><a name="Unihan_ABC">Alphabetical Listing of Unihan Properties</a></h3>
-    <blockquote>
-1:&nbsp;<a href="#kAccountingNumeric">kAccountingNumeric</a>, 2:&nbsp;<a href="#kBigFive">kBigFive</a>, 3:&nbsp;<a href="#kCCCII">kCCCII</a>, 4:&nbsp;<a href="#kCNS1986">kCNS1986</a>, 5:&nbsp;<a href="#kCNS1992">kCNS1992</a>, 6:&nbsp;<a href="#kCangjie">kCangjie</a>, 7:&nbsp;<a href="#kCantonese">kCantonese</a>, 8:&nbsp;<a href="#kCihaiT">kCihaiT</a>, 9:&nbsp;<a href="#kCompatibilityVariant">kCompatibilityVariant</a>, 10:&nbsp;<a href="#kCowles">kCowles</a>, 11:&nbsp;<a href="#kDaeJaweon">kDaeJaweon</a>, 12:&nbsp;<a href="#kDefinition">kDefinition</a>, 13:&nbsp;<a href="#kEACC">kEACC</a>, 14:&nbsp;<a href="#kFenn">kFenn</a>, 15:&nbsp;<a href="#kFennIndex">kFennIndex</a>, 16:&nbsp;<a href="#kFrequency">kFrequency</a>, 17:&nbsp;<a href="#kGB0">kGB0</a>, 18:&nbsp;<a href="#kGB1">kGB1</a>, 19:&nbsp;<a href="#kGB3">kGB3</a>, 20:&nbsp;<a href="#kGB5">kGB5</a>, 21:&nbsp;<a href="#kGB7">kGB7</a>, 22:&nbsp;<a href="#kGB8">kGB8</a>, 23:&nbsp;<a href="#kGSR">kGSR</a>, 24:&nbsp;<a href="#kGradeLevel">kGradeLevel</a>, 25:&nbsp;<a href="#kHDZRadBreak">kHDZRadBreak</a>, 26:&nbsp;<a href="#kHKGlyph">kHKGlyph</a>, 27:&nbsp;<a href="#kHKSCS">kHKSCS</a>, 28:&nbsp;<a href="#kHanYu">kHanYu</a>, 29:&nbsp;<a href="#kHanyuPinlu">kHanyuPinlu</a>, 30:&nbsp;<a href="#kIBMJapan">kIBMJapan</a>, 31:&nbsp;<a href="#kIICore">kIICore</a>, 32:&nbsp;<a href="#kIRGDaeJaweon">kIRGDaeJaweon</a>, 33:&nbsp;<a href="#kIRGDaiKanwaZiten">kIRGDaiKanwaZiten</a>, 34:&nbsp;<a href="#kIRGHanyuDaZidian">kIRGHanyuDaZidian</a>, 35:&nbsp;<a href="#kIRGKangXi">kIRGKangXi</a>, 36:&nbsp;<a href="#kIRG_GSource">kIRG_GSource</a>, 37:&nbsp;<a href="#kIRG_HSource">kIRG_HSource</a>, 38:&nbsp;<a href="#kIRG_JSource">kIRG_JSource</a>, 39:&nbsp;<a href="#kIRG_KPSource">kIRG_KPSource</a>, 40:&nbsp;<a href="#kIRG_KSource">kIRG_KSource</a>, 41:&nbsp;<a href="#kIRG_TSource">kIRG_TSource</a>, 42:&nbsp;<a href="#kIRG_USource">kIRG_USource</a>, 43:&nbsp;<a href="#kIRG_VSource">kIRG_VSource</a>, 44:&nbsp;<a href="#kJIS0213">kJIS0213</a>, 45:&nbsp;<a href="#kJapaneseKun">kJapaneseKun</a>, 46:&nbsp;<a href="#kJapaneseOn">kJapaneseOn</a>, 47:&nbsp;<a href="#kJis0">kJis0</a>, 48:&nbsp;<a href="#kJis1">kJis1</a>, 49:&nbsp;<a href="#kKPS0">kKPS0</a>, 50:&nbsp;<a href="#kKPS1">kKPS1</a>, 51:&nbsp;<a href="#kKSC0">kKSC0</a>, 52:&nbsp;<a href="#kKSC1">kKSC1</a>, 53:&nbsp;<a href="#kKangXi">kKangXi</a>, 54:&nbsp;<a href="#kKarlgren">kKarlgren</a>, 55:&nbsp;<a href="#kKorean">kKorean</a>, 56:&nbsp;<a href="#kLau">kLau</a>, 57:&nbsp;<a href="#kMainlandTelegraph">kMainlandTelegraph</a>, 58:&nbsp;<a href="#kMandarin">kMandarin</a>, 59:&nbsp;<a href="#kMatthews">kMatthews</a>, 60:&nbsp;<a href="#kMeyerWempe">kMeyerWempe</a>, 61:&nbsp;<a href="#kMorohashi">kMorohashi</a>, 62:&nbsp;<a href="#kNelson">kNelson</a>, 63:&nbsp;<a href="#kOtherNumeric">kOtherNumeric</a>, 64:&nbsp;<a href="#kPhonetic">kPhonetic</a>, 65:&nbsp;<a href="#kPrimaryNumeric">kPrimaryNumeric</a>, 66:&nbsp;<a href="#kPseudoGB1">kPseudoGB1</a>, 67:&nbsp;<a href="#kRSAdobe_Japan1_6">kRSAdobe_Japan1_6</a>, 68:&nbsp;<a href="#kRSJapanese">kRSJapanese</a>, 69:&nbsp;<a href="#kRSKanWa">kRSKanWa</a>, 70:&nbsp;<a href="#kRSKangXi">kRSKangXi</a>, 71:&nbsp;<a href="#kRSKorean">kRSKorean</a>, 72:&nbsp;<a href="#kRSUnicode">kRSUnicode</a>, 73:&nbsp;<a href="#kSBGY">kSBGY</a>, 74:&nbsp;<a href="#kSemanticVariant">kSemanticVariant</a>, 75:&nbsp;<a href="#kSimplifiedVariant">kSimplifiedVariant</a>, 76:&nbsp;<a href="#kSpecializedSemanticVariant">kSpecializedSemanticVariant</a>, 77:&nbsp;<a href="#kTaiwanTelegraph">kTaiwanTelegraph</a>, 78:&nbsp;<a href="#kTang">kTang</a>, 79:&nbsp;<a href="#kTotalStrokes">kTotalStrokes</a>, 80:&nbsp;<a href="#kTraditionalVariant">kTraditionalVariant</a>, 81:&nbsp;<a href="#kVietnamese">kVietnamese</a>, 82:&nbsp;<a href="#kXerox">kXerox</a>, 83:&nbsp;<a href="#kZVariant">kZVariant</a>.
-</blockquote>
-
-<hr>
-
-  <h3><a name="Unihan_Cats">Unihan Properties by Category</a></h3>
-<ul>
-<li>Dictionary&nbsp;Indices:&nbsp;<a href="#kCowles">kCowles</a>, <a href="#kDaeJaweon">kDaeJaweon</a>, <a href="#kFennIndex">kFennIndex</a>, <a href="#kGSR">kGSR</a>, <a href="#kHanYu">kHanYu</a>, <a href="#kHanyuPinlu">kHanyuPinlu</a>, <a href="#kIRGDaeJaweon">kIRGDaeJaweon</a>, <a href="#kIRGDaiKanwaZiten">kIRGDaiKanwaZiten</a>, <a href="#kIRGHanyuDaZidian">kIRGHanyuDaZidian</a>, <a href="#kIRGKangXi">kIRGKangXi</a>, <a href="#kKangXi">kKangXi</a>, <a href="#kKarlgren">kKarlgren</a>, <a href="#kLau">kLau</a>, <a href="#kMatthews">kMatthews</a>, <a href="#kMeyerWempe">kMeyerWempe</a>, <a href="#kMorohashi">kMorohashi</a>, <a href="#kNelson">kNelson</a>, <a href="#kSBGY">kSBGY</a>.</li> <li>Dictionary-like&nbsp;Data:&nbsp;<a href="#kCangjie">kCangjie</a>, <a href="#kCantonese">kCantonese</a>, <a href="#kCihaiT">kCihaiT</a>, <a href="#kDefinition">kDefinition</a>, <a href="#kFenn">kFenn</a>, <a href="#kFrequency">kFrequency</a>, <a href="#kGradeLevel">kGradeLevel</a>, <a href="#kHDZRadBreak">kHDZRadBreak</a>, <a href="#kHKGlyph">kHKGlyph</a>, <a href="#kIICore">kIICore</a>, <a href="#kJapaneseKun">kJapaneseKun</a>, <a href="#kJapaneseOn">kJapaneseOn</a>, <a href="#kKorean">kKorean</a>, <a href="#kMandarin">kMandarin</a>, <a href="#kPhonetic">kPhonetic</a>, <a href="#kTang">kTang</a>, <a href="#kTotalStrokes">kTotalStrokes</a>, <a href="#kVietnamese">kVietnamese</a>.</li> <li>IRG&nbsp;Mappings:&nbsp;<a href="#kIRG_GSource">kIRG_GSource</a>, <a href="#kIRG_HSource">kIRG_HSource</a>, <a href="#kIRG_JSource">kIRG_JSource</a>, <a href="#kIRG_KPSource">kIRG_KPSource</a>, <a href="#kIRG_KSource">kIRG_KSource</a>, <a href="#kIRG_TSource">kIRG_TSource</a>, <a href="#kIRG_USource">kIRG_USource</a>, <a href="#kIRG_VSource">kIRG_VSource</a>.</li> <li>Numeric&nbsp;Values:&nbsp;<a href="#kAccountingNumeric">kAccountingNumeric</a>, <a href="#kOtherNumeric">kOtherNumeric</a>, <a href="#kPrimaryNumeric">kPrimaryNumeric</a>.</li> <li>Other&nbsp;Mappings:&nbsp;<a href="#kBigFive">kBigFive</a>, <a href="#kCCCII">kCCCII</a>, <a href="#kCNS1986">kCNS1986</a>, <a href="#kCNS1992">kCNS1992</a>, <a href="#kEACC">kEACC</a>, <a href="#kGB0">kGB0</a>, <a href="#kGB1">kGB1</a>, <a href="#kGB3">kGB3</a>, <a href="#kGB5">kGB5</a>, <a href="#kGB7">kGB7</a>, <a href="#kGB8">kGB8</a>, <a href="#kHKSCS">kHKSCS</a>, <a href="#kIBMJapan">kIBMJapan</a>, <a href="#kJIS0213">kJIS0213</a>, <a href="#kJis0">kJis0</a>, <a href="#kJis1">kJis1</a>, <a href="#kKPS0">kKPS0</a>, <a href="#kKPS1">kKPS1</a>, <a href="#kKSC0">kKSC0</a>, <a href="#kKSC1">kKSC1</a>, <a href="#kMainlandTelegraph">kMainlandTelegraph</a>, <a href="#kPseudoGB1">kPseudoGB1</a>, <a href="#kTaiwanTelegraph">kTaiwanTelegraph</a>, <a href="#kXerox">kXerox</a>.</li> <li>Radical-Stroke&nbsp;Counts:&nbsp;<a href="#kRSAdobe_Japan1_6">kRSAdobe_Japan1_6</a>, <a href="#kRSJapanese">kRSJapanese</a>, <a href="#kRSKanWa">kRSKanWa</a>, <a href="#kRSKangXi">kRSKangXi</a>, <a href="#kRSKorean">kRSKorean</a>, <a href="#kRSUnicode">kRSUnicode</a>.</li> <li>Variants:&nbsp;<a href="#kCompatibilityVariant">kCompatibilityVariant</a>, <a href="#kSemanticVariant">kSemanticVariant</a>, <a href="#kSimplifiedVariant">kSimplifiedVariant</a>, <a href="#kSpecializedSemanticVariant">kSpecializedSemanticVariant</a>, <a href="#kTraditionalVariant">kTraditionalVariant</a>, <a href="#kZVariant">kZVariant</a>.</li> 
-</ul>
-
-<hr>
-
-  <h3><a name="Unihan_Status">Unihan Properties by Status</a></h3>
-<ul>
-<li>Normative:&nbsp;<a href="#kCompatibilityVariant">kCompatibilityVariant</a>, <a href="#kIRG_GSource">kIRG_GSource</a>, <a href="#kIRG_HSource">kIRG_HSource</a>, <a href="#kIRG_JSource">kIRG_JSource</a>, <a href="#kIRG_KPSource">kIRG_KPSource</a>, <a href="#kIRG_KSource">kIRG_KSource</a>, <a href="#kIRG_TSource">kIRG_TSource</a>, <a href="#kIRG_USource">kIRG_USource</a>, <a href="#kIRG_VSource">kIRG_VSource</a>.</li> <li>Informative:&nbsp;<a href="#kAccountingNumeric">kAccountingNumeric</a>, <a href="#kIICore">kIICore</a>, <a href="#kOtherNumeric">kOtherNumeric</a>, <a href="#kPrimaryNumeric">kPrimaryNumeric</a>, <a href="#kRSUnicode">kRSUnicode</a>.</li> <li>Provisional:&nbsp;<a href="#kBigFive">kBigFive</a>, <a href="#kCCCII">kCCCII</a>, <a href="#kCNS1986">kCNS1986</a>, <a href="#kCNS1992">kCNS1992</a>, <a href="#kCangjie">kCangjie</a>, <a href="#kCantonese">kCantonese</a>, <a href="#kCihaiT">kCihaiT</a>, <a href="#kCowles">kCowles</a>, <a href="#kDaeJaweon">kDaeJaweon</a>, <a href="#kDefinition">kDefinition</a>, <a href="#kEACC">kEACC</a>, <a href="#kFenn">kFenn</a>, <a href="#kFennIndex">kFennIndex</a>, <a href="#kFrequency">kFrequency</a>, <a href="#kGB0">kGB0</a>, <a href="#kGB1">kGB1</a>, <a href="#kGB3">kGB3</a>, <a href="#kGB5">kGB5</a>, <a href="#kGB7">kGB7</a>, <a href="#kGB8">kGB8</a>, <a href="#kGSR">kGSR</a>, <a href="#kGradeLevel">kGradeLevel</a>, <a href="#kHDZRadBreak">kHDZRadBreak</a>, <a href="#kHKGlyph">kHKGlyph</a>, <a href="#kHKSCS">kHKSCS</a>, <a href="#kHanYu">kHanYu</a>, <a href="#kHanyuPinlu">kHanyuPinlu</a>, <a href="#kIBMJapan">kIBMJapan</a>, <a href="#kIRGDaeJaweon">kIRGDaeJaweon</a>, <a href="#kIRGDaiKanwaZiten">kIRGDaiKanwaZiten</a>, <a href="#kIRGHanyuDaZidian">kIRGHanyuDaZidian</a>, <a href="#kIRGKangXi">kIRGKangXi</a>, <a href="#kJIS0213">kJIS0213</a>, <a href="#kJapaneseKun">kJapaneseKun</a>, <a href="#kJapaneseOn">kJapaneseOn</a>, <a href="#kJis0">kJis0</a>, <a href="#kJis1">kJis1</a>, <a href="#kKPS0">kKPS0</a>, <a href="#kKPS1">kKPS1</a>, <a href="#kKSC0">kKSC0</a>, <a href="#kKSC1">kKSC1</a>, <a href="#kKangXi">kKangXi</a>, <a href="#kKarlgren">kKarlgren</a>, <a href="#kKorean">kKorean</a>, <a href="#kLau">kLau</a>, <a href="#kMainlandTelegraph">kMainlandTelegraph</a>, <a href="#kMandarin">kMandarin</a>, <a href="#kMatthews">kMatthews</a>, <a href="#kMeyerWempe">kMeyerWempe</a>, <a href="#kMorohashi">kMorohashi</a>, <a href="#kNelson">kNelson</a>, <a href="#kPhonetic">kPhonetic</a>, <a href="#kPseudoGB1">kPseudoGB1</a>, <a href="#kRSAdobe_Japan1_6">kRSAdobe_Japan1_6</a>, <a href="#kRSJapanese">kRSJapanese</a>, <a href="#kRSKanWa">kRSKanWa</a>, <a href="#kRSKangXi">kRSKangXi</a>, <a href="#kRSKorean">kRSKorean</a>, <a href="#kSBGY">kSBGY</a>, <a href="#kSemanticVariant">kSemanticVariant</a>, <a href="#kSimplifiedVariant">kSimplifiedVariant</a>, <a href="#kSpecializedSemanticVariant">kSpecializedSemanticVariant</a>, <a href="#kTaiwanTelegraph">kTaiwanTelegraph</a>, <a href="#kTang">kTang</a>, <a href="#kTotalStrokes">kTotalStrokes</a>, <a href="#kTraditionalVariant">kTraditionalVariant</a>, <a href="#kVietnamese">kVietnamese</a>, <a href="#kXerox">kXerox</a>, <a href="#kZVariant">kZVariant</a>.</li> 
-</ul>
-
-<hr>
-
-  <h2><a name="Unihan_Detail">Unihan Properties in Detail</a></h2>
-  <table>
-
-    <tr>
-        <td BGCOLOR="#FFFF99">Property
-        </td>
-        <th><a name="kAccountingNumeric">kAccountingNumeric</a>
-        </th>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Category
-        </td>
-        <td>Numeric Values 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Status
-        </td>
-        <td>Informative 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Records
-        </td>
-        <td>24 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Description
-        </td>
-        <td> The value of the character when used in the writing of accounting numerals. &#8226; Accounting numerals are used in East Asia to prevent fraud. Because a number like ten (十) is easily turned into one thousand (千) with a stroke of a brush, monetary documents will often use an accounting form of the numeral ten (such as 拾) in their place. &#8226; The three numeric-value fields should have no overlap; that is, characters with a kAccountingNumeric value should not have a kPrimaryNumeric or kOtherNumeric value as well. 
-        </td>
-    </tr>
-
-
-
-    <tr>
-        <td BGCOLOR="#FFFF99">Property
-        </td>
-        <th><a name="kBigFive">kBigFive</a>
-        </th>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Category
-        </td>
-        <td>Other Mappings 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Status
-        </td>
-        <td>Provisional 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Records
-        </td>
-        <td>13063 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Description
-        </td>
-        <td> The Big Five mapping for this character in hex; note that this does not cover any of the Big Five extensions in common use, including the ETEN extensions. 
-        </td>
-    </tr>
-
-
-
-    <tr>
-        <td BGCOLOR="#FFFF99">Property
-        </td>
-        <th><a name="kCCCII">kCCCII</a>
-        </th>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Category
-        </td>
-        <td>Other Mappings 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Status
-        </td>
-        <td>Provisional 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Records
-        </td>
-        <td>19698 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Description
-        </td>
-        <td> The CCCII mapping for this character in hex. 
-        </td>
-    </tr>
-
-
-
-    <tr>
-        <td BGCOLOR="#FFFF99">Property
-        </td>
-        <th><a name="kCNS1986">kCNS1986</a>
-        </th>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Category
-        </td>
-        <td>Other Mappings 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Status
-        </td>
-        <td>Provisional 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Records
-        </td>
-        <td>17258 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Description
-        </td>
-        <td> The CNS 11643-1986 mapping for this character in hex. 
-        </td>
-    </tr>
-
-
-
-    <tr>
-        <td BGCOLOR="#FFFF99">Property
-        </td>
-        <th><a name="kCNS1992">kCNS1992</a>
-        </th>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Category
-        </td>
-        <td>Other Mappings 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Status
-        </td>
-        <td>Provisional 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Records
-        </td>
-        <td>17258 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Description
-        </td>
-        <td> The CNS 11643-1992 mapping for this character in hex. 
-        </td>
-    </tr>
-
-
-
-    <tr>
-        <td BGCOLOR="#FFFF99">Property
-        </td>
-        <th><a name="kCangjie">kCangjie</a>
-        </th>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Category
-        </td>
-        <td>Dictionary-like Data 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Status
-        </td>
-        <td>Provisional 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Records
-        </td>
-        <td>17421 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Description
-        </td>
-        <td> The cangjie input code for the character. This incorporates data from the file cangjie-table.b5 by Christian Wittern. 
-        </td>
-    </tr>
-
-
-
-    <tr>
-        <td BGCOLOR="#FFFF99">Property
-        </td>
-        <th><a name="kCantonese">kCantonese</a>
-        </th>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Category
-        </td>
-        <td>Dictionary-like Data 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Status
-        </td>
-        <td>Provisional 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Records
-        </td>
-        <td>20007 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Description
-        </td>
-        <td> The Cantonese pronunciation(s) for this character using the jyutping romanization. &#8226; A full description of jyutping can be found at &#60;<a href="http://cpct92.cityu.edu.hk/lshk/Jyutping/Jyutping.htm">http://cpct92.cityu.edu.hk/lshk/Jyutping/Jyutping.htm</a>&#62;. The main differences between jyutping and the Yale romanization previously used are: &#8226; 1) Jyutping always uses tone numbers and does not distinguish the high falling and high level tones. &#8226; 2) Jyutping always writes a long a as &#34;aa&#34;. &#8226; 3) Jyutping uses &#34;oe&#34; and &#34;eo&#34; for the Yale &#34;eu&#34; vowel. &#8226; 4) Jyutping uses &#34;c&#34; instead of &#34;ch&#34;, &#34;z&#34; instead of &#34;j&#34;, and &#34;j&#34; instead of &#34;y&#34; as initials. &#8226; 5) A non-null initial is always explicitly written (thus &#34;jyut&#34; in jyutping instead of Yale&#39;s &#34;yut&#34;). &#8226; Cantonese pronunciations are sorted alphabetically, not in order of frequency. &#8226; N.B., the Hong Kong dialect of Cantonese is in the process of dropping initial NG- before non-null finals. Any word with an initial NG- may actually be pronounced without it, depending on the speaker and circumstances. Many words with a null initial may similarly be pronounced with an initial NG-. Similarly, many speakers use an initial L- for words previously pronounced with an initial N-. &#8226; Cantonese data are derived from the following sources: &#8226; Casey, G. Hugh, S.J. Ten Thousand Characters: An Analytic Dictionary. Hong Kong: Kelley and Walsh,1980 (kPhonetic). &#8226; Cheung Kwan-hin and Robert S. Bauer, The Representation of Cantonese with Chinese Characters, Journal of Chinese Linguistics Monograph Series Number 18, 2002. &#8226; Roy T. Cowles, A Pocket Dictionary of Cantonese, Hong Kong: University Press, 1999 (kCowles). &#8226; Sidney Lau, A Practical Cantonese-English Dictionary, Hong Kong: Government Printer, 1977 (kLau). &#8226; Bernard F. Meyer and Theodore F. Wempe, Student&#39;s Cantonese-English Dictionary, Maryknoll, New York: Catholic Foreign Mission Society of America, 1947 (kMeyerWempe). &#8226; 饒秉才, ed. 廣州音字典, Hong Kong: Joint Publishing (H.K.) Co., Ltd., 1989. &#8226; 中華新字典, Hong Kong:中華書局, 1987. &#8226; 黃港生, ed. 商務新詞典, Hong Kong: The Commercial Press, 1991. &#8226; 朗文初級中文詞典, Hong Kong: Longman, 2001. &#8226; The jyutping phrase box from the Linguistic Society of Hong Kong, &#60;<a href="http://cpct92.cityu.edu.hk/lshk/Jyutping/">http://cpct92.cityu.edu.hk/lshk/Jyutping/</a>&#62;. The copyright of the Jyutping phrase box belongs to the Linguistic Society of Hong Kong. We would like to thank the Jyutping Group of the Linguistic Society of Hong Kong for permission to use the electronic file in our research and/or product development. Note that the inclusion of the phrase box in the Unihan database requires that any products developed using the kCantonese field needs to include this acknowledgment. 
-        </td>
-    </tr>
-
-
-
-    <tr>
-        <td BGCOLOR="#FFFF99">Property
-        </td>
-        <th><a name="kCihaiT">kCihaiT</a>
-        </th>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Category
-        </td>
-        <td>Dictionary-like Data 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Status
-        </td>
-        <td>Provisional 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Records
-        </td>
-        <td>13883 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Description
-        </td>
-        <td> The position of this character in the Cihai (辭海) dictionary, single volume edition, published in Hong Kong by the Zhonghua Bookstore, 1983 (reprint of the 1947 edition), ISBN 962-231-005-2. &#8226; The position is indicated by a decimal number. The digits to the left of the decimal are the page number. The first digit after the decimal is the row on the page, and the remaining two digits after the decimal are the position on the row. 
-        </td>
-    </tr>
-
-
-
-    <tr>
-        <td BGCOLOR="#FFFF99">Property
-        </td>
-        <th><a name="kCompatibilityVariant">kCompatibilityVariant</a>
-        </th>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Category
-        </td>
-        <td>Variants 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Status
-        </td>
-        <td>Normative 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Records
-        </td>
-        <td>997 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Description
-        </td>
-        <td> The compatibility decomposition for this ideograph, derived from the UnicodeData.txt file. 
-        </td>
-    </tr>
-
-
-
-    <tr>
-        <td BGCOLOR="#FFFF99">Property
-        </td>
-        <th><a name="kCowles">kCowles</a>
-        </th>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Category
-        </td>
-        <td>Dictionary Indices 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Status
-        </td>
-        <td>Provisional 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Records
-        </td>
-        <td>4821 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Description
-        </td>
-        <td> The index or indices of this character in Roy T. Cowles, A Pocket Dictionary of Cantonese, Hong Kong: University Press, 1999. &#8226; The Cowles indices are numerical, usually integers but occasionally fractional where a character was added after the original indices were determined. Cowles is missing indices 1222 and 4949, and four characters in Cowles are part of Unicode&#39;s &#34;Hangzhou&#34; numeral set: 2964 (〥 U+3025), 3197 (〨 U+3028), 3574 (〣 U+3023), and 4720 (〧 U+3027). &#8226; Approximately 100 characters from Cowles which are not currently encoded are being submitted to the IRG by Unicode for inclusion in future versions of the standard. 
-        </td>
-    </tr>
-
-
-
-    <tr>
-        <td BGCOLOR="#FFFF99">Property
-        </td>
-        <th><a name="kDaeJaweon">kDaeJaweon</a>
-        </th>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Category
-        </td>
-        <td>Dictionary Indices 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Status
-        </td>
-        <td>Provisional 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Records
-        </td>
-        <td>16026 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Description
-        </td>
-        <td> The position of this character in the Dae Jaweon (Korean) dictionary used in the four-dictionary sorting algorithm. The position is in the form &#34;page.position&#34; with the final digit in the position being &#34;0&#34; for characters actually in the dictionary and &#34;1&#34; for characters not found in the dictionary and assigned a &#34;virtual&#34; position in the dictionary. &#8226; Thus, &#34;1187.060&#34; indicates the sixth character on page 1187. A character not in this dictionary but assigned a position between the 6th and 7th characters on page 1187 for sorting purposes would have the code &#34;1187.061&#34; &#8226; The edition used is the first edition, published in Seoul by Samseong Publishing Co., Ltd., 1988. 
-        </td>
-    </tr>
-
-
-
-    <tr>
-        <td BGCOLOR="#FFFF99">Property
-        </td>
-        <th><a name="kDefinition">kDefinition</a>
-        </th>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Category
-        </td>
-        <td>Dictionary-like Data 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Status
-        </td>
-        <td>Provisional 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Records
-        </td>
-        <td>20609 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Description
-        </td>
-        <td> An English definition for this character. Definitions are for modern written Chinese and are usually (but not always) the same as the definition in other Chinese dialects or non-Chinese languages. In some cases, synonyms are indicated. Fuller variant information can be found using the various variant fields. &#8226; Definitions specific to non-Chinese languages or Chinese dialects other than modern Mandarin are marked, e.g., (Cant.) or (J). &#8226; Major definitions are separated by semicolons, and minor definitions by commas. Any valid Unicode character (except for tab, double-quote, and any line break character) may be used within the definition field. 
-        </td>
-    </tr>
-
-
-
-    <tr>
-        <td BGCOLOR="#FFFF99">Property
-        </td>
-        <th><a name="kEACC">kEACC</a>
-        </th>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Category
-        </td>
-        <td>Other Mappings 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Status
-        </td>
-        <td>Provisional 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Records
-        </td>
-        <td>13244 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Description
-        </td>
-        <td> The EACC mapping for this character in hex. 
-        </td>
-    </tr>
-
-
-
-    <tr>
-        <td BGCOLOR="#FFFF99">Property
-        </td>
-        <th><a name="kFenn">kFenn</a>
-        </th>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Category
-        </td>
-        <td>Dictionary-like Data 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Status
-        </td>
-        <td>Provisional 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Records
-        </td>
-        <td>5075 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Description
-        </td>
-        <td> Data on the character from The Five Thousand Dictionary (aka Fenn&#39;s Chinese-English Pocket Dictionary) by Courtenay H. Fenn, Cambridge, Mass.: Harvard University Press, 1979. &#8226; The data here consists of a decimal number followed by a letter A through K, the letter P, or an asterisk. The decimal number gives the Soothill number for the character&#39;s phonetic, and the letter is a rough frequency indication, with A indicating the 500 most common ideographs, B the next five hundred, and so on. &#8226; P is used by Fenn to indicate a rare character included in the dictionary only because it is the phonetic element in other characters. &#8226; An asterisk is used instead of a letter in the final position to indicate a character which belongs to one of Soothill&#39;s phonetic groups but is not found in Fenn&#39;s dictionary. &#8226; Characters which have a frequency letter but no Soothill phonetic group are assigned group 0. 
-        </td>
-    </tr>
-
-
-
-    <tr>
-        <td BGCOLOR="#FFFF99">Property
-        </td>
-        <th><a name="kFennIndex">kFennIndex</a>
-        </th>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Category
-        </td>
-        <td>Dictionary Indices 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Status
-        </td>
-        <td>Provisional 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Records
-        </td>
-        <td>5937 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Description
-        </td>
-        <td> The position of this character in _Fenn&#39;s Chinese-English Pocket Dictionary_ by Courtenay H. Fenn, Cambridge, Mass.: Harvard University Press, 1942. The position is indicated by a three-digit page number followed by a period and a two-digit position on the page. 
-        </td>
-    </tr>
-
-
-
-    <tr>
-        <td BGCOLOR="#FFFF99">Property
-        </td>
-        <th><a name="kFrequency">kFrequency</a>
-        </th>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Category
-        </td>
-        <td>Dictionary-like Data 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Status
-        </td>
-        <td>Provisional 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Records
-        </td>
-        <td>5089 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Description
-        </td>
-        <td> A rough frequency measurement for the character based on analysis of traditional Chinese USENET postings; characters with a kFrequency of 1 are the most common, those with a kFrequency of 2 are less common, and so on, through a kFrequency of 5. 
-        </td>
-    </tr>
-
-
-
-    <tr>
-        <td BGCOLOR="#FFFF99">Property
-        </td>
-        <th><a name="kGB0">kGB0</a>
-        </th>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Category
-        </td>
-        <td>Other Mappings 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Status
-        </td>
-        <td>Provisional 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Records
-        </td>
-        <td>6763 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Description
-        </td>
-        <td> The GB 2312-80 mapping for this character in ku/ten form. 
-        </td>
-    </tr>
-
-
-
-    <tr>
-        <td BGCOLOR="#FFFF99">Property
-        </td>
-        <th><a name="kGB1">kGB1</a>
-        </th>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Category
-        </td>
-        <td>Other Mappings 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Status
-        </td>
-        <td>Provisional 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Records
-        </td>
-        <td>6866 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Description
-        </td>
-        <td> The GB 12345-90 mapping for this character in ku/ten form. 
-        </td>
-    </tr>
-
-
-
-    <tr>
-        <td BGCOLOR="#FFFF99">Property
-        </td>
-        <th><a name="kGB3">kGB3</a>
-        </th>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Category
-        </td>
-        <td>Other Mappings 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Status
-        </td>
-        <td>Provisional 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Records
-        </td>
-        <td>4836 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Description
-        </td>
-        <td> The GB 7589-87 mapping for this character in ku/ten form. 
-        </td>
-    </tr>
-
-
-
-    <tr>
-        <td BGCOLOR="#FFFF99">Property
-        </td>
-        <th><a name="kGB5">kGB5</a>
-        </th>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Category
-        </td>
-        <td>Other Mappings 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Status
-        </td>
-        <td>Provisional 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Records
-        </td>
-        <td>2842 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Description
-        </td>
-        <td> The GB 7590-87 mapping for this character in ku/ten form. 
-        </td>
-    </tr>
-
-
-
-    <tr>
-        <td BGCOLOR="#FFFF99">Property
-        </td>
-        <th><a name="kGB7">kGB7</a>
-        </th>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Category
-        </td>
-        <td>Other Mappings 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Status
-        </td>
-        <td>Provisional 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Records
-        </td>
-        <td>42 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Description
-        </td>
-        <td> The GB 8565-89 mapping for this character in ku/ten form. 
-        </td>
-    </tr>
-
-
-
-    <tr>
-        <td BGCOLOR="#FFFF99">Property
-        </td>
-        <th><a name="kGB8">kGB8</a>
-        </th>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Category
-        </td>
-        <td>Other Mappings 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Status
-        </td>
-        <td>Provisional 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Records
-        </td>
-        <td>785 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Description
-        </td>
-        <td> The GB 8565-89 mapping for this character in ku/ten form .
-        </td>
-    </tr>
-
-
-
-    <tr>
-        <td BGCOLOR="#FFFF99">Property
-        </td>
-        <th><a name="kGSR">kGSR</a>
-        </th>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Category
-        </td>
-        <td>Dictionary Indices 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Status
-        </td>
-        <td>Provisional 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Records
-        </td>
-        <td>7403 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Description
-        </td>
-        <td> The position of this character in Bernhard Karlgren&#39;s Grammata Serica Recensa (1957). &#8226; This dataset contains a total of 7,403 records. References are given in the form DDDDa(&#39;), where &#34;DDDD&#34; is a set number in the range [0001..1260] zero-padded to 4-digits, &#34;a&#34; is a letter in the range [a..z] (excluding &#34;w&#34;), optionally followed by (&#39;) apostrophe. The data from which this mapping table is extracted contains a total of 10,023 references. References to inscriptional forms have been omitted. &#8226; Release notes &#8226; 22-Dec-2003: Initial release. The following 32 references are to unencoded forms: 0059k, 0069y, 0079d, 0275b, 0286a, 0289a, 0289f, 0293a, 0325a, 0389o, 0391h, 0392s, 0468h, 0480a, 0516a, 0526o, 0566g&#39;, 0642y, 0661a, 0739i,0775b, 0837h, 0893r, 0969a, 0969e, 1019e, 1062b, 1112d, 1124l, 1129c&#39;, 1144a, 1144b. In some cases a variant mapping has been substituted in the mapping table, in other cases the reference is omitted. &#8226; Bibliographic information &#8226; Karlgren, Klas Bernhard Johannes 高本漢 (1889–1978): 2000. Grammata Serica Recensa Electronica. Electronic version of GSR, including indices, syllable canon, &#38; images of the original Karlgren (1957) text. Prepared for the STEDT Project by Richard Cook; based in part on work by Tor Ulving &#38; Ferenc Tafferner (see below), used by permission. Berkeley: University of California., &#60;<a href="http://stedt.berkeley.edu/">http://stedt.berkeley.edu/</a>&#62; &#8226; Karlgren 1957. Grammata Serica Recensa. First published in the Bulletin of the Museum of Far Eastern Antiquities (BMFEA) No. 29, Stockholm, Sweden. Reprinted by Elanders Boktrycker Aktiebolag, Kungsbacka, [1972]. Reprinted also by SMC Publishing Inc., Taipei, Taiwan, ROC, [1996]. ISBN: 957-638-269-6. &#8226; Karlgren 1940. Grammata Serica: Script and Phonetics in Chinese and Sino-Japanese 《中日漢字形聲論》Zhong-Ri Hanzi Xingsheng Lun [A study of Sino-Japanese semantic-phonetic compound characters:] BMFEA No. 12. Reprinted, Taipei: Ch&#39;eng-Wen Publishing Company, [1966]. &#8226; Ulving, Tor: 1997. Dictionary of Old and Middle Chinese: Bernhard Karlgren&#39;s Grammata Serica Recensa Alphabetically Arranged. With Ferenc Tafferner. Göteborg, Sweden: Acta Universitatis Gothoburgensis. Orientalia Gothoburgensia, 11. ISBN: 91-7346-294-2. 
-        </td>
-    </tr>
-
-
-
-    <tr>
-        <td BGCOLOR="#FFFF99">Property
-        </td>
-        <th><a name="kGradeLevel">kGradeLevel</a>
-        </th>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Category
-        </td>
-        <td>Dictionary-like Data 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Status
-        </td>
-        <td>Provisional 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Records
-        </td>
-        <td>2631 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Description
-        </td>
-        <td> The primary grade in the Hong Kong school system by which a student is expected to know the character; this data is derived from 朗文初級中文詞典, Hong Kong: Longman, 2001. 
-        </td>
-    </tr>
-
-
-
-    <tr>
-        <td BGCOLOR="#FFFF99">Property
-        </td>
-        <th><a name="kHDZRadBreak">kHDZRadBreak</a>
-        </th>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Category
-        </td>
-        <td>Dictionary-like Data 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Status
-        </td>
-        <td>Provisional 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Records
-        </td>
-        <td>200 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Introduced
-        </td>
-        <td>4.1 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Description
-        </td>
-        <td> Indicates that 《漢語大字典》 Hanyu Da Zidian has a radical break beginning at this character&#39;s position. The field consists of the radical (with its Unicode code point), a colon, and then the Hanyu Da Zidian position as in the kHanyu field. 
-        </td>
-    </tr>
-
-
-
-    <tr>
-        <td BGCOLOR="#FFFF99">Property
-        </td>
-        <th><a name="kHKGlyph">kHKGlyph</a>
-        </th>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Category
-        </td>
-        <td>Dictionary-like Data 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Status
-        </td>
-        <td>Provisional 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Records
-        </td>
-        <td>4825 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Description
-        </td>
-        <td> The index of the character in 常用字字形表 (二零零零年修訂本),香港: 香港教育學院, 2000, ISBN 962-949-040-4. This publication gives the &#34;proper&#34; shapes for 4759 characters as used in the Hong Kong school system. The index is an integer, zero-padded to four digits. 
-        </td>
-    </tr>
-
-
-
-    <tr>
-        <td BGCOLOR="#FFFF99">Property
-        </td>
-        <th><a name="kHKSCS">kHKSCS</a>
-        </th>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Category
-        </td>
-        <td>Other Mappings 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Status
-        </td>
-        <td>Provisional 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Records
-        </td>
-        <td>4375 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Description
-        </td>
-        <td> Mappings to the Big Five extended code points used for the Hong Kong Supplementary Character Set. 
-        </td>
-    </tr>
-
-
-
-    <tr>
-        <td BGCOLOR="#FFFF99">Property
-        </td>
-        <th><a name="kHanYu">kHanYu</a>
-        </th>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Category
-        </td>
-        <td>Dictionary Indices 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Status
-        </td>
-        <td>Provisional 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Records
-        </td>
-        <td>55817 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Description
-        </td>
-        <td> The position of this character in the Hanyu Da Zidian (HDZ) Chinese character dictionary (bibliographic information below). &#8226; The character references are given in the form &#34;ABCDE.XYZ&#34;, in which: &#34;A&#34; is the volume number [1..8]; &#34;BCDE&#34; is the zero-padded page number [0001..4809]; &#34;XY&#34; is the zero-padded number of the character on the page [01..32]; &#34;Z&#34; is &#34;0&#34; for a character actually in the dictionary, and greater than 0 for a character assigned a &#34;virtual&#34; position in the dictionary. For example, 53024.060 indicates an actual HDZ character, the 6th character on Page 3,044 of Volume 5 (i.e. 籉). Note that the Volume 8 &#34;BCDE&#34; references are in the range [0008..0044] inclusive, referring to the pagination of the &#34;Appendix of Addendum&#34; at the end of that volume (beginning after p. 5746). &#8226; The first character assigned a given virtual position has an index ending in 1; the second assigned the same virtual position has an index ending in 2; and so on. &#8226; Release information &#8226; This data set contains a total of 56097 records, 54728 of which are actual HDZ character references (positions are given for all HDZ head entries, including source-internal unifications), and 1369 of which are virtual character positions (see note below). &#8226; All 55817 HDZ references in this data set are unique. Because of IRG source-internal unifications, a given UCS-4 Scalar Value (USV) may have more than one HDZ reference. Source-internal unifications are of two types: (1) unifications of graphical variants; (2) unifications of duplicate head entries. &#8226; The proofing of all references was done primarily on the basis of cross-checks of three versions of the reference data: (1) the original print source; (2) the &#34;kIRGHanyuDaZidian&#34; field of Unihan.txt (release 3.1.1d1); (3) &#34;HDZ.txt&#34;, originally produced and proofed for Academia Sinica&#39;s Institute of Information Technology (Document Processing Laboratory). In addition, the data was checked against the &#34;kHanYu&#34; and &#34;kAlternateHanYu&#34; fields of Unihan.txt (release 3.1.1d1), which the present data set supersedes. &#8226; String value, string length, compound key, field count, and page total validations were all performed. Altogether, 578 omissions/ errors in source (2) were identified/corrected. Any remaining errors will likely relate to virtual positions, or to the ordering of actual characters within a given page. It is unlikely that errors across page breaks remain. Possible future deunifications of source-internal unifications will necessitate update of USV for some references. Under no circumstances should the source-internal unification (duplicate USV) mappings be removed from this data set. &#8226; Note: Source (3) contributed only actual HDZ character references to the proofing process, while source (2) contributed all virtual positions. It seems that the compilers of source (2) usually assigned virtual positions based on stroke count, though occasionally the virtual position brings the virtual character together with the actual HDZ character of which it is a variant, without regard to actual stroke count. &#8226; Bibliographic information for the print source: &#8226; &#60;Hanyu Da Zidian&#62; [&#39;Great Chinese Character Dictionary&#39; (in 8 Volumes)]. XU Zhongshu (Editor in Chief). Wuhan, Hubei Province (PRC): Hubei and Sichuan Dictionary Publishing Collectives, 1986-1990. ISBN: 7-5403-0030-2/H.16. &#8226; 《漢語大字典》。許力以主任,徐中舒主編,(漢語大字典工作委員會)。武漢:四川辭書出版社,湖北辭書出版社,1986-1990. ISBN: 7-5403-0030 2/H.16. 
-        </td>
-    </tr>
-
-
-
-    <tr>
-        <td BGCOLOR="#FFFF99">Property
-        </td>
-        <th><a name="kHanyuPinlu">kHanyuPinlu</a>
-        </th>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Category
-        </td>
-        <td>Dictionary Indices 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Status
-        </td>
-        <td>Provisional 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Records
-        </td>
-        <td>3799 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Description
-        </td>
-        <td> The Pronunciations and Frequencies of this character, based in part on those appearing in 《現代漢語頻率詞典》 &#60;Xiandai Hanyu Pinlu Cidian&#62; (XDHYPLCD) [Modern Standard Beijing Chinese Frequency Dictionary] (complete bibliographic information below). &#8226; Data Format &#8226; This dataset contains a total of 3800 records. Each entry is comprised of two pieces of data. &#8226; The Hanyu Pinyin (HYPY) pronunciation(s) of the character, with numeric tone marks (1-5, where 5 indicates the &#34;neutral tone&#34;) immediately following each alphabetic string. &#8226; Immediately following the numeric tone mark, a numeric string appears in parentheses: e.g. in &#34;a1(392)&#34; the numeric string &#34;392&#34; indicates the sum total of the frequencies of the pronunciations of the character as given in HYPLCD. &#8226; Where more than one pronunciation exists, these are sorted by descending frequency, and the list elements are &#34;comma + space&#34; delimited. &#8226; Release Information &#8226; The XDHYPLCD data here for Modern Standard Chinese (Putonghua) cuts across 4 genres (&#34;News,&#34; &#34;Scientific,&#34; &#34;Colloquial,&#34; and &#34;Literature&#34;), and was derived from a 440799 character corpus. See that text for additional information. &#8226; The 8548 entries (8586 with variant writings) from p. 491-656 of XDHYPLCD were input by hand and proof-read from 1994/08/04 to 1995/03/22 by Richard Cook. &#8226; Current Release Date above reflects date of last proofing. &#8226; HYPY transcription for the data in this release was semiautomated and hand-corrected in 1995, based in part on data provided by Ross Paterson (Department of Computing, Imperial College, London). &#8226; Tom Bishop &#60;<a href="http://www.wenlin.com">http://www.wenlin.com</a>&#62; is also due thanks for early assistance in proof-reading this data. &#8226; The character set used for this digitization of HYPLCD (a &#34;simplified&#34; mainland PRC text) was (Mac OS 7-9) GB 2312-80 (plus 嗐). &#8226; These data were converted to Big5 (plus 腈), and both GB and Big5 versions were separately converted to Unicode 4.0, and then merged, resulting in the 3800 records in the current release. Frequency data for simplified polysyllabic words has been employed to generate both simplified and traditional character frequencies. &#8226; Bibliographic information for the primary print source &#8226; 《現代漢語頻率詞典》,北京語言學院語言教學研究所編著。 &#8226; &#60;Xiandai Hanyu Pinlu Cidian&#62; = XDHYPLCD First edition 1986/6, 2nd printing 1990/4. ISBN 7-5619-0094-5/H.67. 
-        </td>
-    </tr>
-
-
-
-    <tr>
-        <td BGCOLOR="#FFFF99">Property
-        </td>
-        <th><a name="kIBMJapan">kIBMJapan</a>
-        </th>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Category
-        </td>
-        <td>Other Mappings 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Status
-        </td>
-        <td>Provisional 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Records
-        </td>
-        <td>360 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Description
-        </td>
-        <td> The IBM Japanese mapping for this character in hexadecimal. 
-        </td>
-    </tr>
-
-
-
-    <tr>
-        <td BGCOLOR="#FFFF99">Property
-        </td>
-        <th><a name="kIICore">kIICore</a>
-        </th>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Category
-        </td>
-        <td>Dictionary-like Data 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Status
-        </td>
-        <td>Informative 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Records
-        </td>
-        <td>9809 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Description
-        </td>
-        <td> Indicates that a character is in IICore, the IRG-produced minimal set of required ideographs for East Asian use. &#8226; Each individual value in this field is either P (for preliminary, meaning it has been approved by the IRG but not by WG2), or the ISO/IEC 10646 subset identifier for the subset(s) containing this character. 
-        </td>
-    </tr>
-
-
-
-    <tr>
-        <td BGCOLOR="#FFFF99">Property
-        </td>
-        <th><a name="kIRGDaeJaweon">kIRGDaeJaweon</a>
-        </th>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Category
-        </td>
-        <td>Dictionary Indices 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Status
-        </td>
-        <td>Provisional 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Records
-        </td>
-        <td>16024 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Description
-        </td>
-        <td> The position of this character in the Dae Jaweon (Korean) dictionary used in the four-dictionary sorting algorithm. The position is in the form &#34;page.position&#34; with the final digit in the position being &#34;0&#34; for characters actually in the dictionary and &#34;1&#34; for characters not found in the dictionary and assigned a &#34;virtual&#34; position in the dictionary. &#8226; Thus, &#34;1187.060&#34; indicates the sixth character on page 1187. A character not in this dictionary but assigned a position between the 6th and 7th characters on page 1187 for sorting purposes would have the code &#34;1187.061&#34; &#8226; This field represents the official position of the character within the Dae Jaweon dictionary as used by the IRG in the four-dictionary sorting algorithm. &#8226; The edition used is the first edition, published in Seoul by Samseong Publishing Co., Ltd., 1988. 
-        </td>
-    </tr>
-
-
-
-    <tr>
-        <td BGCOLOR="#FFFF99">Property
-        </td>
-        <th><a name="kIRGDaiKanwaZiten">kIRGDaiKanwaZiten</a>
-        </th>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Category
-        </td>
-        <td>Dictionary Indices 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Status
-        </td>
-        <td>Provisional 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Records
-        </td>
-        <td>17864 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Description
-        </td>
-        <td> The index of this character in the Dai Kanwa Ziten, aka Morohashi dictionary (Japanese) used in the four-dictionary sorting algorithm. &#8226; This field represents the official position of the character within the DaiKanwa dictionary as used by the IRG in the four-dictionary sorting algorithm. The edition used is the revised edition, published in Tokyo by Taishuukan Shoten, 1986. 
-        </td>
-    </tr>
-
-
-
-    <tr>
-        <td BGCOLOR="#FFFF99">Property
-        </td>
-        <th><a name="kIRGHanyuDaZidian">kIRGHanyuDaZidian</a>
-        </th>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Category
-        </td>
-        <td>Dictionary Indices 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Status
-        </td>
-        <td>Provisional 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Records
-        </td>
-        <td>55812 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Description
-        </td>
-        <td> The position of this character in the Hanyu Da Zidian (PRC) dictionary used in the four-dictionary sorting algorithm. The position is in the form &#34;volume page.position&#34; with the final digit in the position being &#34;0&#34; for characters actually in the dictionary and &#34;1&#34; for characters not found in the dictionary and assigned a &#34;virtual&#34; position in the dictionary. &#8226; Thus, &#34;32264.080&#34; indicates the eighth character on page 2264 in volume 3. A character not in this dictionary but assigned a position between the 8th and 9th characters on this page for sorting purposes would have the code &#34;32264.081&#34; &#8226; This field represents the official position of the character within the Hanyu Da Zidian dictionary as used by the IRG in the four-dictionary sorting algorithm. &#8226; The edition of the Hanyu Da Zidian used is the first edition, published in Chengdu by Sichuan Cishu Publishing, 1986. 
-        </td>
-    </tr>
-
-
-
-    <tr>
-        <td BGCOLOR="#FFFF99">Property
-        </td>
-        <th><a name="kIRGKangXi">kIRGKangXi</a>
-        </th>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Category
-        </td>
-        <td>Dictionary Indices 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Status
-        </td>
-        <td>Provisional 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Records
-        </td>
-        <td>70205 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Description
-        </td>
-        <td> The position of this character in the KangXi dictionary used in the four-dictionary sorting algorithm. The position is in the form &#34;page.position&#34; with the final digit in the position being &#34;0&#34; for characters actually in the dictionary and &#34;1&#34; for characters not found in the dictionary and assigned a &#34;virtual&#34; position in the dictionary. &#8226; Thus, &#34;1187.060&#34; indicates the sixth character on page 1187. A character not in this dictionary but assigned a position between the 6th and 7th characters on page 1187 for sorting purposes would have the code &#34;1187.061&#34; &#8226; This field represents the official position of the character within the KangXi dictionary as used by the IRG in the four-dictionary sorting algorithm. The edition of the KangXi dictionary used is the 7th edition published by Zhonghua Bookstore in Beijing, 1989. 
-        </td>
-    </tr>
-
-
-
-    <tr>
-        <td BGCOLOR="#FFFF99">Property
-        </td>
-        <th><a name="kIRG_GSource">kIRG_GSource</a>
-        </th>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Category
-        </td>
-        <td>IRG Mappings 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Status
-        </td>
-        <td>Normative 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Records
-        </td>
-        <td>57627 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Description
-        </td>
-        <td> The IRG &#34;G&#34; source mapping for this character in hex. The IRG G source consists of data from the following national standards, publications, and lists from the People&#39;s Republic of China and Singapore. The versions of the standards used are those provided by the PRC to the IRG and may not always reflect published versions of the standards generally available. &#8226; 4K Siku Quanshu &#8226; BK Chinese Encyclopedia &#8226; CH The Ci Hai (PRC edition) &#8226; CY The Ci Yuan &#8226; FZ and FZ_BK Founder Press System &#8226; G0 GB2312-80 &#8226; G1 GB12345-90 with 58 Hong Kong and 92 Korean &#34;Idu&#34; characters &#8226; G3 GB7589-87 unsimplified forms &#8226; G5 GB7590-87 unsimplified forms &#8226; G7 General Purpose Hanzi List for Modern Chinese Language, and General List of Simplified Hanzi &#8226; GS Singapore characters &#8226; G8 GB8685-88 &#8226; GE GB16500-95 &#8226; HC The Hanyu Da Cidian &#8226; HZ The Hanyu Da Zidian &#8226; KX The KangXi dictionary .
-        </td>
-    </tr>
-
-
-
-    <tr>
-        <td BGCOLOR="#FFFF99">Property
-        </td>
-        <th><a name="kIRG_HSource">kIRG_HSource</a>
-        </th>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Category
-        </td>
-        <td>IRG Mappings 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Status
-        </td>
-        <td>Normative 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Records
-        </td>
-        <td>4511 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Description
-        </td>
-        <td> The IRG &#34;H&#34; source mapping for this character in hex. The IRG &#34;H&#34; source consists of data from the Hong Kong Supplementary Characer Set. 
-        </td>
-    </tr>
-
-
-
-    <tr>
-        <td BGCOLOR="#FFFF99">Property
-        </td>
-        <th><a name="kIRG_JSource">kIRG_JSource</a>
-        </th>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Category
-        </td>
-        <td>IRG Mappings 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Status
-        </td>
-        <td>Normative 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Records
-        </td>
-        <td>13684 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Description
-        </td>
-        <td> The IRG &#34;J&#34; source mapping for this character in hex. The IRG J source consists of data from the following national standards and lists from Japan. &#8226; J0 JIS X 0208-1990 &#8226; J1 JIS X 0212-1990 &#8226; J3 JIS X 0213-2000 &#8226; J4 JIS X 0213-2000 .
-        </td>
-    </tr>
-
-
-
-    <tr>
-        <td BGCOLOR="#FFFF99">Property
-        </td>
-        <th><a name="kIRG_KPSource">kIRG_KPSource</a>
-        </th>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Category
-        </td>
-        <td>IRG Mappings 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Status
-        </td>
-        <td>Normative 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Records
-        </td>
-        <td>24122 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Description
-        </td>
-        <td> The IRG &#34;KP&#34; source mapping for this character in hex. The IRG &#34;KP&#34; source consists of data from the following national standards and lists from the Democratic People&#39;s Republic of Korea (North Korea). &#8226; KP0 KPS 9566-97 &#8226; KP1 KPS 10721-2000 .
-        </td>
-    </tr>
-
-
-
-    <tr>
-        <td BGCOLOR="#FFFF99">Property
-        </td>
-        <th><a name="kIRG_KSource">kIRG_KSource</a>
-        </th>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Category
-        </td>
-        <td>IRG Mappings 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Status
-        </td>
-        <td>Normative 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Records
-        </td>
-        <td>17661 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Description
-        </td>
-        <td> The IRG &#34;K&#34; source mapping for this character in hex. The IRG &#34;K&#34; source consists of data from the following national standards and lists from the Republic of Korea (South Korea). &#8226; K0 KS C 5601-1987 &#8226; K1 KS C 5657-1991 &#8226; K2 PKS C 5700-1 1994 &#8226; K3 PKS C 5700-2 1994 &#8226; K4 PKS 5700-3:1998 &#8226; Note that the K4 source is expressed in hexadecimal, but unlike the other sources, it is not organized in row/column. 
-        </td>
-    </tr>
-
-
-
-    <tr>
-        <td BGCOLOR="#FFFF99">Property
-        </td>
-        <th><a name="kIRG_TSource">kIRG_TSource</a>
-        </th>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Category
-        </td>
-        <td>IRG Mappings 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Status
-        </td>
-        <td>Normative 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Records
-        </td>
-        <td>54990 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Description
-        </td>
-        <td> The IRG &#34;T&#34; source mapping for this character in hex. The IRG &#34;T&#34; source consists of data from the following national standards and lists from the Republic of China (Taiwan). &#8226; T1 CNS 11643-1992, plane 1 &#8226; T2 CNS 11643-1992, plane 2 &#8226; T3 CNS 11643-1992, plane 3 (with some additional characters) &#8226; T4 CNS 11643-1992, plane 4 &#8226; T5 CNS 11643-1992, plane 5 &#8226; T6 CNS 11643-1992, plane 6 &#8226; T7 CNS 11643-1992, plane 7 &#8226; TF CNS 11643-1992, plane 15 .
-        </td>
-    </tr>
-
-
-
-    <tr>
-        <td BGCOLOR="#FFFF99">Property
-        </td>
-        <th><a name="kIRG_USource">kIRG_USource</a>
-        </th>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Category
-        </td>
-        <td>IRG Mappings 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Status
-        </td>
-        <td>Normative 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Records
-        </td>
-        <td>34 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Description
-        </td>
-        <td> The IRG &#34;U&#34; source mapping for this character. Currently, the IRG U source is limited to a small number of characters in the CJK Compatibility Ideographs block, where the value is the Unicode code point. 
-        </td>
-    </tr>
-
-
-
-    <tr>
-        <td BGCOLOR="#FFFF99">Property
-        </td>
-        <th><a name="kIRG_VSource">kIRG_VSource</a>
-        </th>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Category
-        </td>
-        <td>IRG Mappings 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Status
-        </td>
-        <td>Normative 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Records
-        </td>
-        <td>9300 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Description
-        </td>
-        <td> The IRG &#34;V&#34; source mapping for this character in hex. The IRG V source consists of data from the following national standards and lists from Vietnam. &#8226; V0 TCVN 5773:1993 &#8226; V1 VHN 01:1998 &#8226; V2 VHN 02:1998 &#8226; V3 TCVN 6056:1995 .
-        </td>
-    </tr>
-
-
-
-    <tr>
-        <td BGCOLOR="#FFFF99">Property
-        </td>
-        <th><a name="kJIS0213">kJIS0213</a>
-        </th>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Category
-        </td>
-        <td>Other Mappings 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Status
-        </td>
-        <td>Provisional 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Records
-        </td>
-        <td>3695 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Description
-        </td>
-        <td> The JIS X 0213-2000 mapping for this character in min,ku,ten form. 
-        </td>
-    </tr>
-
-
-
-    <tr>
-        <td BGCOLOR="#FFFF99">Property
-        </td>
-        <th><a name="kJapaneseKun">kJapaneseKun</a>
-        </th>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Category
-        </td>
-        <td>Dictionary-like Data 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Status
-        </td>
-        <td>Provisional 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Records
-        </td>
-        <td>11291 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Description
-        </td>
-        <td> The Japanese pronunciation(s) of this character. 
-        </td>
-    </tr>
-
-
-
-    <tr>
-        <td BGCOLOR="#FFFF99">Property
-        </td>
-        <th><a name="kJapaneseOn">kJapaneseOn</a>
-        </th>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Category
-        </td>
-        <td>Dictionary-like Data 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Status
-        </td>
-        <td>Provisional 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Records
-        </td>
-        <td>13173 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Description
-        </td>
-        <td> The Sino-Japanese pronunciation(s) of this character. 
-        </td>
-    </tr>
-
-
-
-    <tr>
-        <td BGCOLOR="#FFFF99">Property
-        </td>
-        <th><a name="kJis0">kJis0</a>
-        </th>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Category
-        </td>
-        <td>Other Mappings 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Status
-        </td>
-        <td>Provisional 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Records
-        </td>
-        <td>6356 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Description
-        </td>
-        <td> The JIS X 0208-1990 mapping for this character in ku/ten form. 
-        </td>
-    </tr>
-
-
-
-    <tr>
-        <td BGCOLOR="#FFFF99">Property
-        </td>
-        <th><a name="kJis1">kJis1</a>
-        </th>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Category
-        </td>
-        <td>Other Mappings 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Status
-        </td>
-        <td>Provisional 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Records
-        </td>
-        <td>5801 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Description
-        </td>
-        <td> The JIS X 0212-1990 mapping for this character in ku/ten form. 
-        </td>
-    </tr>
-
-
-
-    <tr>
-        <td BGCOLOR="#FFFF99">Property
-        </td>
-        <th><a name="kKPS0">kKPS0</a>
-        </th>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Category
-        </td>
-        <td>Other Mappings 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Status
-        </td>
-        <td>Provisional 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Records
-        </td>
-        <td>4653 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Description
-        </td>
-        <td> The KPS 9566-97 mapping for this character in hexadecimal form. 
-        </td>
-    </tr>
-
-
-
-    <tr>
-        <td BGCOLOR="#FFFF99">Property
-        </td>
-        <th><a name="kKPS1">kKPS1</a>
-        </th>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Category
-        </td>
-        <td>Other Mappings 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Status
-        </td>
-        <td>Provisional 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Records
-        </td>
-        <td>19301 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Description
-        </td>
-        <td> The KPS 10721-2000 mapping for this character in hexadecimal form. 
-        </td>
-    </tr>
-
-
-
-    <tr>
-        <td BGCOLOR="#FFFF99">Property
-        </td>
-        <th><a name="kKSC0">kKSC0</a>
-        </th>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Category
-        </td>
-        <td>Other Mappings 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Status
-        </td>
-        <td>Provisional 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Records
-        </td>
-        <td>4888 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Description
-        </td>
-        <td> The KS X 1001:1992 (KS C 5601-1989) mapping for this character in ku/ten form. 
-        </td>
-    </tr>
-
-
-
-    <tr>
-        <td BGCOLOR="#FFFF99">Property
-        </td>
-        <th><a name="kKSC1">kKSC1</a>
-        </th>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Category
-        </td>
-        <td>Other Mappings 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Status
-        </td>
-        <td>Provisional 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Records
-        </td>
-        <td>2856 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Description
-        </td>
-        <td> The KS X 1002:1991 (KS C 5657-1991) mapping for this character in ku/ten form. 
-        </td>
-    </tr>
-
-
-
-    <tr>
-        <td BGCOLOR="#FFFF99">Property
-        </td>
-        <th><a name="kKangXi">kKangXi</a>
-        </th>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Category
-        </td>
-        <td>Dictionary Indices 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Status
-        </td>
-        <td>Provisional 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Records
-        </td>
-        <td>20936 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Description
-        </td>
-        <td> The position of this character in the KangXi dictionary used in the four-dictionary sorting algorithm. The position is in the form &#34;page.position&#34; with the final digit in the position being &#34;0&#34; for characters actually in the dictionary and &#34;1&#34; for characters not found in the dictionary and assigned a &#34;virtual&#34; position in the dictionary. &#8226; Thus, &#34;1187.060&#34; indicates the sixth character on page 1187. A character not in this dictionary but assigned a position between the 6th and 7th characters on page 1187 for sorting purposes would have the code &#34;1187.061&#34; &#8226; The edition of the KangXi dictionary used is the 7th edition published by Zhonghua Bookstore in Beijing, 1989. 
-        </td>
-    </tr>
-
-
-
-    <tr>
-        <td BGCOLOR="#FFFF99">Property
-        </td>
-        <th><a name="kKarlgren">kKarlgren</a>
-        </th>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Category
-        </td>
-        <td>Dictionary Indices 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Status
-        </td>
-        <td>Provisional 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Records
-        </td>
-        <td>2560 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Description
-        </td>
-        <td> The index of this character in _Analytic Dictionary of Chinese and Sino-Japanese_ by Bernhard Karlgren, New York: Dover Publications, Inc., 1974. &#8226; If the index is followed by an asterisk (*), then the index is an interpolated one, indicating where the character would be found if it were to have been included in the dictionary. Note that while the index itself is usually an integer, there are some cases where it is an integer followed by an &#34;A&#34;. 
-        </td>
-    </tr>
-
-
-
-    <tr>
-        <td BGCOLOR="#FFFF99">Property
-        </td>
-        <th><a name="kKorean">kKorean</a>
-        </th>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Category
-        </td>
-        <td>Dictionary-like Data 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Status
-        </td>
-        <td>Provisional 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Records
-        </td>
-        <td>9050 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Description
-        </td>
-        <td> The Korean pronunciation(s) of this character, using the Yale romanization system. (See &#60;<a href="http://www.btranslations.com/resources/romanization/korean.asp">http://www.btranslations.com/resources/romanization/korean.asp</a>&#62; for a comparison of the various Korean romanization systems.) .
-        </td>
-    </tr>
-
-
-
-    <tr>
-        <td BGCOLOR="#FFFF99">Property
-        </td>
-        <th><a name="kLau">kLau</a>
-        </th>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Category
-        </td>
-        <td>Dictionary Indices 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Status
-        </td>
-        <td>Provisional 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Records
-        </td>
-        <td>3516 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Description
-        </td>
-        <td> The index of this character in A Practical Cantonese-English Dictionary by Sidney Lau, Hong Kong: The Government Printer, 1977. &#8226; The index consists of an integer. Missing indices indicate unencoded characters which are being submitted to the IRG for inclusion in future versions of the standard. 
-        </td>
-    </tr>
-
-
-
-    <tr>
-        <td BGCOLOR="#FFFF99">Property
-        </td>
-        <th><a name="kMainlandTelegraph">kMainlandTelegraph</a>
-        </th>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Category
-        </td>
-        <td>Other Mappings 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Status
-        </td>
-        <td>Provisional 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Records
-        </td>
-        <td>7085 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Description
-        </td>
-        <td> The PRC telegraph code for this character, derived from &#34;Kanzi denpou koudo henkan-hyou&#34; (&#34;Chinese character telegraph code conversion table&#34;), Lin Jinyi, KDD Engineering and Consulting, Tokyo, 1984. 
-        </td>
-    </tr>
-
-
-
-    <tr>
-        <td BGCOLOR="#FFFF99">Property
-        </td>
-        <th><a name="kMandarin">kMandarin</a>
-        </th>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Category
-        </td>
-        <td>Dictionary-like Data 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Status
-        </td>
-        <td>Provisional 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Records
-        </td>
-        <td>25476 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Description
-        </td>
-        <td> The Mandarin pronunciation(s) for this character in pinyin; Mandarin pronunciations are sorted in order of frequency, not alphabetically. 
-        </td>
-    </tr>
-
-
-
-    <tr>
-        <td BGCOLOR="#FFFF99">Property
-        </td>
-        <th><a name="kMatthews">kMatthews</a>
-        </th>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Category
-        </td>
-        <td>Dictionary Indices 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Status
-        </td>
-        <td>Provisional 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Records
-        </td>
-        <td>8988 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Description
-        </td>
-        <td> The index of this character in Mathews&#39; Chinese-English Dictionary by Robert H. Mathews, Cambrige: Harvard University Press, 1975. &#8226; Note that the field name is kMatthews instead of kMathews to maintain compatibility with earlier versions of this file, where it was inadvertently misspelled. 
-        </td>
-    </tr>
-
-
-
-    <tr>
-        <td BGCOLOR="#FFFF99">Property
-        </td>
-        <th><a name="kMeyerWempe">kMeyerWempe</a>
-        </th>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Category
-        </td>
-        <td>Dictionary Indices 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Status
-        </td>
-        <td>Provisional 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Records
-        </td>
-        <td>7352 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Description
-        </td>
-        <td> The index of this character in the Student&#39;s Cantonese-English Dictionary by Bernard F. Meyer and Theodore F. Wempe (3rd edition, 1947). The index is an integer, optionally followed by a lower-case Latin letter if the listing is in a subsidiary entry and not a main one. In some cases where the character is found in the radical-stroke index, but not in the main body of the dictionary, the integer is followed by an asterisk (e.g., 僥 U+50E5, which is listed as 736* as well as 1185a). 
-        </td>
-    </tr>
-
-
-
-    <tr>
-        <td BGCOLOR="#FFFF99">Property
-        </td>
-        <th><a name="kMorohashi">kMorohashi</a>
-        </th>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Category
-        </td>
-        <td>Dictionary Indices 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Status
-        </td>
-        <td>Provisional 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Records
-        </td>
-        <td>21204 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Description
-        </td>
-        <td> The index of this character in the Dae Kanwa Ziten, aka Morohashi dictionary (Japanese) used in the four-dictionary sorting algorithm. &#8226; The edition used is the revised edition, published in Tokyo by Taishuukan Shoten, 1986. 
-        </td>
-    </tr>
-
-
-
-    <tr>
-        <td BGCOLOR="#FFFF99">Property
-        </td>
-        <th><a name="kNelson">kNelson</a>
-        </th>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Category
-        </td>
-        <td>Dictionary Indices 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Status
-        </td>
-        <td>Provisional 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Records
-        </td>
-        <td>5398 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Description
-        </td>
-        <td> The index of this character in The Modern Reader&#39;s Japanese-English Character Dictionary by Andrew Nathaniel Nelson, Rutland, Vermont: Charles E. Tuttle Company, 1974. 
-        </td>
-    </tr>
-
-
-
-    <tr>
-        <td BGCOLOR="#FFFF99">Property
-        </td>
-        <th><a name="kOtherNumeric">kOtherNumeric</a>
-        </th>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Category
-        </td>
-        <td>Numeric Values 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Status
-        </td>
-        <td>Informative 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Records
-        </td>
-        <td>30 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Description
-        </td>
-        <td> The numeric value for the character in certain unusual, specialized contexts. &#8226; The three numeric-value fields should have no overlap; that is, characters with a kOtherNumeric value should not have a kAccountingNumeric or kPrimaryNumeric value as well. 
-        </td>
-    </tr>
-
-
-
-    <tr>
-        <td BGCOLOR="#FFFF99">Property
-        </td>
-        <th><a name="kPhonetic">kPhonetic</a>
-        </th>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Category
-        </td>
-        <td>Dictionary-like Data 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Status
-        </td>
-        <td>Provisional 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Records
-        </td>
-        <td>11462 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Description
-        </td>
-        <td> The phonetic index for the character from Ten Thousand Characters: An Analytic Dictionary by G. Hugh Casey, S.J. Hong Kong: Kelley and Walsh,1980. 
-        </td>
-    </tr>
-
-
-
-    <tr>
-        <td BGCOLOR="#FFFF99">Property
-        </td>
-        <th><a name="kPrimaryNumeric">kPrimaryNumeric</a>
-        </th>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Category
-        </td>
-        <td>Numeric Values 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Status
-        </td>
-        <td>Informative 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Records
-        </td>
-        <td>17 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Description
-        </td>
-        <td> The value of the character when used in the writing of numbers in the standard fashion. &#8226; The three numeric-value fields should have no overlap; that is, characters with a kPrimaryNumeric value should not have a kAccountingNumeric or kOtherNumeric value as well. 
-        </td>
-    </tr>
-
-
-
-    <tr>
-        <td BGCOLOR="#FFFF99">Property
-        </td>
-        <th><a name="kPseudoGB1">kPseudoGB1</a>
-        </th>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Category
-        </td>
-        <td>Other Mappings 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Status
-        </td>
-        <td>Provisional 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Records
-        </td>
-        <td>153 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Description
-        </td>
-        <td> A &#34;GB 12345-90&#34; code point assigned this character for the purposes of including it within Unihan. Pseudo-GB1 codes were used to provide official code points for characters not already in national standards, such as characters used to write Cantonese, and so on. 
-        </td>
-    </tr>
-
-
-
-    <tr>
-        <td BGCOLOR="#FFFF99">Property
-        </td>
-        <th><a name="kRSAdobe_Japan1_6">kRSAdobe_Japan1_6</a>
-        </th>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Category
-        </td>
-        <td>Radical-Stroke Counts 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Status
-        </td>
-        <td>Provisional 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Records
-        </td>
-        <td>13404 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Introduced
-        </td>
-        <td>4.1 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Description
-        </td>
-        <td> Information on the glyphs in Adobe-Japan1-6 as contributed by Adobe. The value consists of a number of space-separated entries. Each entry consists of three pieces of information separated by a plus sign: &#8226; 1) C or V. &#34;C&#34; indicates that the Unicode code point maps directly to the Adobe-Japan1-6 CID that appears after it, and &#34;V&#34; indicates that it is considered a variant form, and thus not directly encoded. &#8226; 2) The Adobe-Japan1-6 CID. &#8226; 3) Radical-stroke data for the indicated Adobe-Japan1-6 CID. The radical-stroke data consists of three pieces separated by periods: the KangXi radical (1-214), the number of strokes in the form the radical takes in the glyph, and the number of strokes in the residue. The standard Unicode radical-stroke form can be obtained by omitting the second value, and the total strokes in the glyph from adding the second and third values. 
-        </td>
-    </tr>
-
-
-
-    <tr>
-        <td BGCOLOR="#FFFF99">Property
-        </td>
-        <th><a name="kRSJapanese">kRSJapanese</a>
-        </th>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Category
-        </td>
-        <td>Radical-Stroke Counts 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Status
-        </td>
-        <td>Provisional 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Records
-        </td>
-        <td>198 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Description
-        </td>
-        <td> A Japanese radical/stroke count for this character in the form &#34;radical.additional strokes&#34;. A &#39; after the radical indicates the simplified version of the given radical. 
-        </td>
-    </tr>
-
-
-
-    <tr>
-        <td BGCOLOR="#FFFF99">Property
-        </td>
-        <th><a name="kRSKanWa">kRSKanWa</a>
-        </th>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Category
-        </td>
-        <td>Radical-Stroke Counts 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Status
-        </td>
-        <td>Provisional 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Records
-        </td>
-        <td>157 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Description
-        </td>
-        <td> A Morohashi radical/stroke count for this character in the form &#34;radical.additional strokes&#34;. A &#39; after the radical indicates the simplified version of the given radical. 
-        </td>
-    </tr>
-
-
-
-    <tr>
-        <td BGCOLOR="#FFFF99">Property
-        </td>
-        <th><a name="kRSKangXi">kRSKangXi</a>
-        </th>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Category
-        </td>
-        <td>Radical-Stroke Counts 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Status
-        </td>
-        <td>Provisional 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Records
-        </td>
-        <td>63696 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Description
-        </td>
-        <td> The KangXi radical/stroke count for this character consistent with the value of the kKangXi field in the form &#34;radical.additional strokes&#34;. A &#39; after the radical indicates the simplified version of the given radical. 
-        </td>
-    </tr>
-
-
-
-    <tr>
-        <td BGCOLOR="#FFFF99">Property
-        </td>
-        <th><a name="kRSKorean">kRSKorean</a>
-        </th>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Category
-        </td>
-        <td>Radical-Stroke Counts 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Status
-        </td>
-        <td>Provisional 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Records
-        </td>
-        <td>20 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Description
-        </td>
-        <td> A Korean radical/stroke count for this character in the form &#34;radical.additional strokes&#34;. A &#39; after the radical indicates the simplified version of the given radical .
-        </td>
-    </tr>
-
-
-
-    <tr>
-        <td BGCOLOR="#FFFF99">Property
-        </td>
-        <th><a name="kRSUnicode">kRSUnicode</a>
-        </th>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Category
-        </td>
-        <td>Radical-Stroke Counts 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Status
-        </td>
-        <td>Informative 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Records
-        </td>
-        <td>71226 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Description
-        </td>
-        <td> A standard radical/stroke count for this character in the form &#34;radical.additional strokes&#34;. A &#39; after the radical indicates the simplified version of the given radical &#8226; This field is used for additional radical-stroke indices where either a character may be reasonably classified under more than one radical, or alternate stroke count algorithms may provide different stroke counts. &#8226; The first value is intended to reflect the same radical as the kRSKangXi field and the stroke count of the glyph used to print the character within the Unicode Standard. 
-        </td>
-    </tr>
-
-
-
-    <tr>
-        <td BGCOLOR="#FFFF99">Property
-        </td>
-        <th><a name="kSBGY">kSBGY</a>
-        </th>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Category
-        </td>
-        <td>Dictionary Indices 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Status
-        </td>
-        <td>Provisional 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Records
-        </td>
-        <td>19574 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Description
-        </td>
-        <td> The position of this character in the Song Ben Guang Yun (SBGY) Medieval Chinese character dictionary (bibliographic and general information below). &#8226; The 25334 character references are given in the form &#34;ABC.XY&#34;, in which: &#34;ABC&#34; is the zero-padded page number [004..546]; &#34;XY&#34; is the zero-padded number of the character on the page [01..73]. For example, 364.38 indicates the 38th character on Page 364 (i.e. 澍). Where a given Unicode Scalar Value (USV) has more than one reference, these are space-delimited. &#8226; - Release information (20031005): &#8226; This release corrects several mappings. &#8226; -- Release information (20020310) -- &#8226; This data set contains a total of 25334 references, for 19572 different hanzi (up from 25330 and 19511 in the previous release). &#8226; This release of the kSBGY data fixes a number of mappings, based on extensive work done since the initial release (compare the initial release counts given below). See the end of this header for additional information. &#8226; -- Initial release information (20020310) -- &#8226; The original data was input under the direction of Prof. LUO Fengzhu at Taiwan Taoyuanxian Yuan Zhi University (see below) using an early version of the Big5- based CDP encoding scheme developed at Academia Sinica. During 2000-2002 this raw data was processed and revised by Richard Cook as follows: the data was converted to Unicode encoding using his revised kHanYu mapping tables (first provided to the Unicode Consortium for the Unihan.txt release 3.1.1d1) and also using several other mapping tables developed specifically for this project; the kSBGY indices were generated based on hand-counts of all page totals; numerous indexing errors were corrected; and the data underwent final proofing. &#8226; -- About the print sources -- &#8226; The SBGY text, which dates to the beginning of the Song Dynasty (c. 1008, edited by 陳彭年 CHEN Pengnian et al.) is an enlargement of an earlier text known as Qie Yun (dated to c. 601, edited by 陸法言 LU Fayan). With 25,330 head entries, this large early lexicon is important in part for the information which it provides for historical Chinese phonology. The GY dictionary employs a Chinese transcription method (known as 反切) to give pronunciations for each of its head entries. In addition, each syllable is also given a brief gloss. &#8226; It must be emphasized that the mapping of a particular SBGY glyph to a single USV may in some cases be merely an approximation or may have required the choice of a &#34;best possible glyph&#34; (out of those available in the Unicode repertoire). This indexing data in conjunction with the print sources will be useful for evaluating the degree of distinctive variation in the character forms appearing in this text, and future proofing of this data may reveal additional Chinese glyphs for IRG encoding. &#8226; -- Bibliographic information on the print sources -- &#8226; 《宋本廣韻》 &#60;&#60;Song Ben Guang Yun&#62;&#62; [&#39;Song Dynasty edition of the Guang Yun Rhyming Dictionary&#39;], edited by 陳彭年 CHEN Pengnian et al. (c. 1008). &#8226; Two modern editions of this work were consulted in building the kSBGY indices: &#8226; 《新校正切宋本廣韻》。台灣黎明文化事業公司 出版,林尹校訂1976 年出版。[This was the edition used by Prof. LUO (台灣桃園縣元智大學中語系羅鳳珠), and in the subsequent revision, conversion, indexing and proofing.] &#8226; 新校互註‧宋本廣韻》。香港中文大學,余迺永 1993, 2000 年出版。ISBN: 962-201-413-5; 7-5326-0685-6. [Textual problems were resolved on the basis of this extensively annotated modern edition of the text.] &#8226; -- Additional Information -- &#8226; For further information on this index data and the databases from which it is excerpted, see: &#8226; Cook, Richard S. 2003. 《說文解字‧電子版》 Shuo Wen Jie Zi - Dianzi Ban: Digital Recension of the Eastern Han Chinese Grammaticon. PhD Dissertation. Department of Linguistics. Berkeley: University of California. 
-        </td>
-    </tr>
-
-
-
-    <tr>
-        <td BGCOLOR="#FFFF99">Property
-        </td>
-        <th><a name="kSemanticVariant">kSemanticVariant</a>
-        </th>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Category
-        </td>
-        <td>Variants 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Status
-        </td>
-        <td>Provisional 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Records
-        </td>
-        <td>3205 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Description
-        </td>
-        <td> The Unicode value for a semantic variant for this character. A semantic variant is an x- or y-variant with similar or identical meaning which can generally be used in place of the indicated character. &#8226; The basic syntax is a Unicode scalar value. It may optionally be followed by additional data. The additional data is separated from the Unicode scalar value by a less-than sign (&#60;), and may be subdivided itself into substrings by commas, each of which may be divided into two pieces by a colon. The additional data consists of a series of field tags for another field in the Unihan database indicating the source of the information. If subdivided, the final piece is a string consisting of the letters T (for tòng, 同 U+540C ) B (for bù, 不 U+4E0D ), or Z (for zhèng, 正 U+6B63 ). &#8226; T is used if the indicated source explicitly indicates the two are the same (e.g., by saying that the one character is &#34;the same as&#34; the other). &#8226; B is used if the source explicitly indicates that the two are used improperly one for the other. &#8226; Z is used if the source explicitly indicates that the given character is the preferred form. Thus, the Hanyu Da Zidian indicates that 刱 U+5231  and 創 U+5275  are semantic variants and that 創 U+5275  is the preferred form. 
-        </td>
-    </tr>
-
-
-
-    <tr>
-        <td BGCOLOR="#FFFF99">Property
-        </td>
-        <th><a name="kSimplifiedVariant">kSimplifiedVariant</a>
-        </th>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Category
-        </td>
-        <td>Variants 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Status
-        </td>
-        <td>Provisional 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Records
-        </td>
-        <td>2714 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Description
-        </td>
-        <td> The Unicode value for the simplified Chinese variant for this character (if any). &#8226; Note that a character can be *both* a traditional Chinese character in its own right *and* the simplified variant for other characters (e.g., 台 U+53F0). &#8226; In such case, the character is listed as its own simplified variant and one of its own traditional variants. This distinguishes this from the case where the character is not the simplified form for any character (e.g., 井 U+4E95). &#8226; Much of the of the data on simplified and traditional variants was supplied by Wenlin &#60;<a href="http://www.wenlin.com">http://www.wenlin.com</a>&#62; .
-        </td>
-    </tr>
-
-
-
-    <tr>
-        <td BGCOLOR="#FFFF99">Property
-        </td>
-        <th><a name="kSpecializedSemanticVariant">kSpecializedSemanticVariant</a>
-        </th>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Category
-        </td>
-        <td>Variants 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Status
-        </td>
-        <td>Provisional 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Records
-        </td>
-        <td>482 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Description
-        </td>
-        <td> The Unicode value for a specialized semantic variant for this character. The syntax is the same as for the kSemanticVariant field. &#8226; A specialized semantic variant is an x- or y-variant with similar or identical meaning only in certain contexts (such as accountants&#39; numerals). 
-        </td>
-    </tr>
-
-
-
-    <tr>
-        <td BGCOLOR="#FFFF99">Property
-        </td>
-        <th><a name="kTaiwanTelegraph">kTaiwanTelegraph</a>
-        </th>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Category
-        </td>
-        <td>Other Mappings 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Status
-        </td>
-        <td>Provisional 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Records
-        </td>
-        <td>9041 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Description
-        </td>
-        <td> The Taiwanese telegraph code for this character, derived from &#34;Kanzi denpou koudo henkan-hyou&#34; (&#34;Chinese character telegraph code conversion table&#34;), Lin Jinyi, KDD Engineering and Consulting, Tokyo, 1984. 
-        </td>
-    </tr>
-
-
-
-    <tr>
-        <td BGCOLOR="#FFFF99">Property
-        </td>
-        <th><a name="kTang">kTang</a>
-        </th>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Category
-        </td>
-        <td>Dictionary-like Data 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Status
-        </td>
-        <td>Provisional 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Records
-        </td>
-        <td>3812 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Description
-        </td>
-        <td> The Tang dynasty pronunciation(s) of this character, derived from or consistent with _T&#39;ang Poetic Vocabulary_ by Hugh M. Stimson, Far Eastern Publications, Yale Univ. 1976. 
-        </td>
-    </tr>
-
-
-
-    <tr>
-        <td BGCOLOR="#FFFF99">Property
-        </td>
-        <th><a name="kTotalStrokes">kTotalStrokes</a>
-        </th>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Category
-        </td>
-        <td>Dictionary-like Data 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Status
-        </td>
-        <td>Provisional 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Records
-        </td>
-        <td>27788 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Description
-        </td>
-        <td> The total number of strokes in the character (including the radical). This value is for the character as drawn in the Unicode charts. 
-        </td>
-    </tr>
-
-
-
-    <tr>
-        <td BGCOLOR="#FFFF99">Property
-        </td>
-        <th><a name="kTraditionalVariant">kTraditionalVariant</a>
-        </th>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Category
-        </td>
-        <td>Variants 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Status
-        </td>
-        <td>Provisional 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Records
-        </td>
-        <td>2632 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Description
-        </td>
-        <td> The Unicode value(s) for the traditional Chinese variant(s) for this character. &#8226; Note that a character can be *both* a traditional Chinese character in its own right *and* the simplified variant for other characters (e.g., . &#8226; In such case, the character is listed as its own simplified variant and one of its own traditional variants. This distinguishes this from the case where the character is not the simplified form for any character (e.g., . &#8226; Much of the of the data on simplified and traditional variants was supplied by Wenlin Institute, Inc. &#60;<a href="http://www.wenlin.com">http://www.wenlin.com</a>&#62;. 
-        </td>
-    </tr>
-
-
-
-    <tr>
-        <td BGCOLOR="#FFFF99">Property
-        </td>
-        <th><a name="kVietnamese">kVietnamese</a>
-        </th>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Category
-        </td>
-        <td>Dictionary-like Data 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Status
-        </td>
-        <td>Provisional 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Records
-        </td>
-        <td>8300 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Description
-        </td>
-        <td> The character&#39;s pronunciation(s) in Quốc ngữ. 
-        </td>
-    </tr>
-
-
-
-    <tr>
-        <td BGCOLOR="#FFFF99">Property
-        </td>
-        <th><a name="kXerox">kXerox</a>
-        </th>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Category
-        </td>
-        <td>Other Mappings 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Status
-        </td>
-        <td>Provisional 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Records
-        </td>
-        <td>9747 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Description
-        </td>
-        <td> The Xerox code for this character. 
-        </td>
-    </tr>
-
-
-
-    <tr>
-        <td BGCOLOR="#FFFF99">Property
-        </td>
-        <th><a name="kZVariant">kZVariant</a>
-        </th>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Category
-        </td>
-        <td>Variants 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Status
-        </td>
-        <td>Provisional 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Records
-        </td>
-        <td>2570 
-        </td>
-    </tr>
-
-    <tr>
-        <td BGCOLOR="#FFFFCC">Description
-        </td>
-        <td> The Unicode value(s) for known z-variants of this character. 
-        </td>
-    </tr>
-
-
-
-  </table>
-
-
-
-
-
-
-  <p></p>
-  <hr>
-  <h2><a name="References">References</a></h2>
-
-  <table class="noborder" style="border-collapse: collapse" cellpadding="4" cellspacing="0">
-    <tr>
-      <td valign="top" width="1" class="noborder">[<a name="FAQ">FAQ</a>]</td>
-      <td valign="top" class="noborder">Unicode Frequently Asked Questions<br>
-      <a href="http://www.unicode.org/faq/">http://www.unicode.org/faq/<br>
-      </a><i>For answers to common questions on technical issues.</i></td>
-    </tr>
-    <tr>
-      <td valign="top" width="1" class="noborder">[<a name="Glossary">Glossary</a>]</td>
-      <td valign="top" class="noborder">Unicode Glossary<a href="http://www.unicode.org/glossary/"><br>
-      http://www.unicode.org/glossary/<br>
-      </a><i>For explanations of terminology used in this and other documents.</i></td>
-    </tr>
-    <tr>
-      <td valign="top" width="1" class="noborder">[<a name="Reports">Reports</a>]</td>
-      <td valign="top" class="noborder">Unicode Technical Reports<br>
-      <a href="http://www.unicode.org/reports/">http://www.unicode.org/reports/<br>
-      </a><i>For information on the status and development process for technical reports, and for a 
-      list of technical reports.</i></td>
-    </tr>
-    <tr>
-      <td valign="top" width="1" class="noborder">[<a name="U4.0">U4.0</a>]</td>
-      <td valign="top" class="noborder">The Unicode Standard Version 4.0<br>
-      <a href="http://www.unicode.org/versions/Unicode4.0.0/">
-      http://www.unicode.org/versions/Unicode4.0.0/</a></td>
-    </tr>
-    <tr>
-      <td valign="top" width="1" class="noborder">[<a name="U4.0.1">U4.0.1</a>]</td>
-      <td valign="top" class="noborder">The Unicode Standard Version 4.0.1<br>
-      <a href="http://www.unicode.org/versions/Unicode4.0.1/">
-      http://www.unicode.org/versions/Unicode4.0.1/</a></td>
-    </tr>
-    <tr>
-      <td valign="top" width="1" class="noborder">[<a name="U4.1.0">U4.1.0</a>]</td>
-      <td valign="top" class="noborder">The Unicode Standard Version 4.1.0<br>
-      <a href="http://www.unicode.org/versions/Unicode4.1.0/">
-      http://www.unicode.org/versions/Unicode4.1.0/</a></td>
-    </tr>
-    <tr>
-      <td valign="top" width="1" class="noborder">[<a name="Versions">Versions</a>]</td>
-      <td valign="top" class="noborder">Versions of the Unicode Standard<br>
-      <a href="http://www.unicode.org/versions/">http://www.unicode.org/versions/<br>
-      </a><i>For details on the precise contents of each version of the Unicode Standard, and how to 
-      cite them.</i></td>
-    </tr>
-  </table>
-
-  <hr>
-  <h2><a name="Modification_History">Modification History</a></h2>
-    <h3>Changes for Version 4.1.0 from Version 4.0.1</h3>
-      <h4>Changes in Unihan.html</h4>
-      <p>The current file Unihan.html was first created in version 4.1.0 by separating out the property table from <a href="UCD.html">UCD.html</a>,
-      and by generating the <a href="#Unihan_Tags">Unihan Properties</a> information from the Unihan.txt header.</p>
-
-      <h4>Data Changes in Unihan.txt</h4>
-
-  <p>Below is a list of the major changes in the Unicode Standard 4.1.0 release of Unihan.txt : </p>
-    <ul>
-        <li>The kPhonetic data was regenerated to include multiple entries for individual characters.</li>
-        <li>Duplicate entries were removed from the kMandarin and kCantonese fields.</li>
-        <li>All fields are now complete.</li>
-        <li>The kRSAdobe_Japan1_6 Radical/Stroke data field was added (with 13404 records), courtesy of Adobe (and Ken Lunde).</li>
-        <li>The kIICore field (with 9809 records) was added.</li>
-        <li>The kFennIndex field (with 5937 records) was added.</li>
-        <li>The kFenn field had substantial new data added (5002 additions).</li>
-        <li>The kHDZRadBreak field (with 200 records) was added.</li>
-        <li>The kRSUnicode field was updated (with 128 new records).</li>
-        <li>The kSBGY and kHanYu fields have been updated.</li>
-        <li>The kAlternateKangXi and kAlternateMorohashi fields were dropped.</li>
-        <li>The syntax of the kSemanticVariant and kSpecializedSemanticVariant fields was extended to include source information.</li>
-        <li>The kSemanticVariant field was augmented (3053 additions).</li>
-        <li>The kSpecializedSemanticVariant field was augmented (444 additions).</li>
-        <li>The Cantonese field has been changed to use jyutping instead of Yale romanization.</li>
-        <li>Preliminary data for new (4.1.0) characters has been added.</li>
-        <li>The various kIRG* fields have had their values resynchronized with data in ISO/IEC 10646.</li>
-        <li>The kIRG_KSource values were synchronized with editorial changes made to ISO/IEC 10646.</li>
-        <li>Numerous other individual corrections and additions were made.</li>
-        <li>The header was restructured and expanded, in preparation for moving the field descriptions into this document.</li>
-        <li>The header now contains information on "Valid UniHan Ranges for this release".</li>
-    </ul>
-
-  <hr>
-  <h2><i><a name="UCD_Terms">UCD Terms of Use</a></i></h2>
-  <p>For terms of use, see <i><a href="http://www.unicode.org/terms_of_use.html">
-  http://www.unicode.org/terms_of_use.html</a>.</i></p>
-  <hr width="50%">
-  <div align="center">
-    <center>
-    <table cellspacing="0" cellpadding="0" border="0">
-      <tr>
-        <td><a href="http://www.unicode.org/copyright.html">
-        <img src="http://www.unicode.org/img/hb_notice.gif" border="0" alt="Access to Copyright and terms of use" width="216" height="50"></a></td>
-      </tr>
-    </table>
-    <script language="Javascript" type="text/javascript" src="http://www.unicode.org/webscripts/lastModified.js">
-                </script>
-    </center>
-  </div>
-</div>
-
-</body>
-
-</html>
diff --git a/ucd/Unihan.zip b/ucd/Unihan.zip
deleted file mode 100644
index d1bf8c4..0000000
--- a/ucd/Unihan.zip
+++ /dev/null
Binary files differ
diff --git a/ucd/auxiliary/GraphemeBreakProperty.txt b/ucd/auxiliary/GraphemeBreakProperty.txt
deleted file mode 100644
index c1eea54..0000000
--- a/ucd/auxiliary/GraphemeBreakProperty.txt
+++ /dev/null
@@ -1,1039 +0,0 @@
-# GraphemeBreakProperty-5.0.0.txt
-# Date: 2006-03-09, 23:14:04 GMT [MD]
-#
-# Unicode Character Database
-# Copyright (c) 1991-2006 Unicode, Inc.
-# For terms of use, see http://www.unicode.org/terms_of_use.html
-# For documentation, see UCD.html
-
-# ================================================
-
-# Property:	Grapheme_Cluster_Break
-
-#  All code points not explicitly listed for Grapheme_Cluster_Break
-#  have the value Other (XX).
-
-# @missing: 0000..10FFFF; Other
-
-# ================================================
-
-000D          ; CR # Cc       <control-000D>
-
-# Total code points: 1
-
-# ================================================
-
-000A          ; LF # Cc       <control-000A>
-
-# Total code points: 1
-
-# ================================================
-
-0000..0009    ; Control # Cc  [10] <control-0000>..<control-0009>
-000B..000C    ; Control # Cc   [2] <control-000B>..<control-000C>
-000E..001F    ; Control # Cc  [18] <control-000E>..<control-001F>
-007F..009F    ; Control # Cc  [33] <control-007F>..<control-009F>
-00AD          ; Control # Cf       SOFT HYPHEN
-0600..0603    ; Control # Cf   [4] ARABIC NUMBER SIGN..ARABIC SIGN SAFHA
-06DD          ; Control # Cf       ARABIC END OF AYAH
-070F          ; Control # Cf       SYRIAC ABBREVIATION MARK
-17B4..17B5    ; Control # Cf   [2] KHMER VOWEL INHERENT AQ..KHMER VOWEL INHERENT AA
-200B          ; Control # Cf       ZERO WIDTH SPACE
-200E..200F    ; Control # Cf   [2] LEFT-TO-RIGHT MARK..RIGHT-TO-LEFT MARK
-2028          ; Control # Zl       LINE SEPARATOR
-2029          ; Control # Zp       PARAGRAPH SEPARATOR
-202A..202E    ; Control # Cf   [5] LEFT-TO-RIGHT EMBEDDING..RIGHT-TO-LEFT OVERRIDE
-2060..2063    ; Control # Cf   [4] WORD JOINER..INVISIBLE SEPARATOR
-206A..206F    ; Control # Cf   [6] INHIBIT SYMMETRIC SWAPPING..NOMINAL DIGIT SHAPES
-FEFF          ; Control # Cf       ZERO WIDTH NO-BREAK SPACE
-FFF9..FFFB    ; Control # Cf   [3] INTERLINEAR ANNOTATION ANCHOR..INTERLINEAR ANNOTATION TERMINATOR
-1D173..1D17A  ; Control # Cf   [8] MUSICAL SYMBOL BEGIN BEAM..MUSICAL SYMBOL END PHRASE
-E0001         ; Control # Cf       LANGUAGE TAG
-E0020..E007F  ; Control # Cf  [96] TAG SPACE..CANCEL TAG
-
-# Total code points: 201
-
-# ================================================
-
-0300..036F    ; Extend # Mn [112] COMBINING GRAVE ACCENT..COMBINING LATIN SMALL LETTER X
-0483..0486    ; Extend # Mn   [4] COMBINING CYRILLIC TITLO..COMBINING CYRILLIC PSILI PNEUMATA
-0488..0489    ; Extend # Me   [2] COMBINING CYRILLIC HUNDRED THOUSANDS SIGN..COMBINING CYRILLIC MILLIONS SIGN
-0591..05BD    ; Extend # Mn  [45] HEBREW ACCENT ETNAHTA..HEBREW POINT METEG
-05BF          ; Extend # Mn       HEBREW POINT RAFE
-05C1..05C2    ; Extend # Mn   [2] HEBREW POINT SHIN DOT..HEBREW POINT SIN DOT
-05C4..05C5    ; Extend # Mn   [2] HEBREW MARK UPPER DOT..HEBREW MARK LOWER DOT
-05C7          ; Extend # Mn       HEBREW POINT QAMATS QATAN
-0610..0615    ; Extend # Mn   [6] ARABIC SIGN SALLALLAHOU ALAYHE WASSALLAM..ARABIC SMALL HIGH TAH
-064B..065E    ; Extend # Mn  [20] ARABIC FATHATAN..ARABIC FATHA WITH TWO DOTS
-0670          ; Extend # Mn       ARABIC LETTER SUPERSCRIPT ALEF
-06D6..06DC    ; Extend # Mn   [7] ARABIC SMALL HIGH LIGATURE SAD WITH LAM WITH ALEF MAKSURA..ARABIC SMALL HIGH SEEN
-06DE          ; Extend # Me       ARABIC START OF RUB EL HIZB
-06DF..06E4    ; Extend # Mn   [6] ARABIC SMALL HIGH ROUNDED ZERO..ARABIC SMALL HIGH MADDA
-06E7..06E8    ; Extend # Mn   [2] ARABIC SMALL HIGH YEH..ARABIC SMALL HIGH NOON
-06EA..06ED    ; Extend # Mn   [4] ARABIC EMPTY CENTRE LOW STOP..ARABIC SMALL LOW MEEM
-0711          ; Extend # Mn       SYRIAC LETTER SUPERSCRIPT ALAPH
-0730..074A    ; Extend # Mn  [27] SYRIAC PTHAHA ABOVE..SYRIAC BARREKH
-07A6..07B0    ; Extend # Mn  [11] THAANA ABAFILI..THAANA SUKUN
-07EB..07F3    ; Extend # Mn   [9] NKO COMBINING SHORT HIGH TONE..NKO COMBINING DOUBLE DOT ABOVE
-0901..0902    ; Extend # Mn   [2] DEVANAGARI SIGN CANDRABINDU..DEVANAGARI SIGN ANUSVARA
-093C          ; Extend # Mn       DEVANAGARI SIGN NUKTA
-0941..0948    ; Extend # Mn   [8] DEVANAGARI VOWEL SIGN U..DEVANAGARI VOWEL SIGN AI
-094D          ; Extend # Mn       DEVANAGARI SIGN VIRAMA
-0951..0954    ; Extend # Mn   [4] DEVANAGARI STRESS SIGN UDATTA..DEVANAGARI ACUTE ACCENT
-0962..0963    ; Extend # Mn   [2] DEVANAGARI VOWEL SIGN VOCALIC L..DEVANAGARI VOWEL SIGN VOCALIC LL
-0981          ; Extend # Mn       BENGALI SIGN CANDRABINDU
-09BC          ; Extend # Mn       BENGALI SIGN NUKTA
-09BE          ; Extend # Mc       BENGALI VOWEL SIGN AA
-09C1..09C4    ; Extend # Mn   [4] BENGALI VOWEL SIGN U..BENGALI VOWEL SIGN VOCALIC RR
-09CD          ; Extend # Mn       BENGALI SIGN VIRAMA
-09D7          ; Extend # Mc       BENGALI AU LENGTH MARK
-09E2..09E3    ; Extend # Mn   [2] BENGALI VOWEL SIGN VOCALIC L..BENGALI VOWEL SIGN VOCALIC LL
-0A01..0A02    ; Extend # Mn   [2] GURMUKHI SIGN ADAK BINDI..GURMUKHI SIGN BINDI
-0A3C          ; Extend # Mn       GURMUKHI SIGN NUKTA
-0A41..0A42    ; Extend # Mn   [2] GURMUKHI VOWEL SIGN U..GURMUKHI VOWEL SIGN UU
-0A47..0A48    ; Extend # Mn   [2] GURMUKHI VOWEL SIGN EE..GURMUKHI VOWEL SIGN AI
-0A4B..0A4D    ; Extend # Mn   [3] GURMUKHI VOWEL SIGN OO..GURMUKHI SIGN VIRAMA
-0A70..0A71    ; Extend # Mn   [2] GURMUKHI TIPPI..GURMUKHI ADDAK
-0A81..0A82    ; Extend # Mn   [2] GUJARATI SIGN CANDRABINDU..GUJARATI SIGN ANUSVARA
-0ABC          ; Extend # Mn       GUJARATI SIGN NUKTA
-0AC1..0AC5    ; Extend # Mn   [5] GUJARATI VOWEL SIGN U..GUJARATI VOWEL SIGN CANDRA E
-0AC7..0AC8    ; Extend # Mn   [2] GUJARATI VOWEL SIGN E..GUJARATI VOWEL SIGN AI
-0ACD          ; Extend # Mn       GUJARATI SIGN VIRAMA
-0AE2..0AE3    ; Extend # Mn   [2] GUJARATI VOWEL SIGN VOCALIC L..GUJARATI VOWEL SIGN VOCALIC LL
-0B01          ; Extend # Mn       ORIYA SIGN CANDRABINDU
-0B3C          ; Extend # Mn       ORIYA SIGN NUKTA
-0B3E          ; Extend # Mc       ORIYA VOWEL SIGN AA
-0B3F          ; Extend # Mn       ORIYA VOWEL SIGN I
-0B41..0B43    ; Extend # Mn   [3] ORIYA VOWEL SIGN U..ORIYA VOWEL SIGN VOCALIC R
-0B4D          ; Extend # Mn       ORIYA SIGN VIRAMA
-0B56          ; Extend # Mn       ORIYA AI LENGTH MARK
-0B57          ; Extend # Mc       ORIYA AU LENGTH MARK
-0B82          ; Extend # Mn       TAMIL SIGN ANUSVARA
-0BBE          ; Extend # Mc       TAMIL VOWEL SIGN AA
-0BC0          ; Extend # Mn       TAMIL VOWEL SIGN II
-0BCD          ; Extend # Mn       TAMIL SIGN VIRAMA
-0BD7          ; Extend # Mc       TAMIL AU LENGTH MARK
-0C3E..0C40    ; Extend # Mn   [3] TELUGU VOWEL SIGN AA..TELUGU VOWEL SIGN II
-0C46..0C48    ; Extend # Mn   [3] TELUGU VOWEL SIGN E..TELUGU VOWEL SIGN AI
-0C4A..0C4D    ; Extend # Mn   [4] TELUGU VOWEL SIGN O..TELUGU SIGN VIRAMA
-0C55..0C56    ; Extend # Mn   [2] TELUGU LENGTH MARK..TELUGU AI LENGTH MARK
-0CBC          ; Extend # Mn       KANNADA SIGN NUKTA
-0CBF          ; Extend # Mn       KANNADA VOWEL SIGN I
-0CC2          ; Extend # Mc       KANNADA VOWEL SIGN UU
-0CC6          ; Extend # Mn       KANNADA VOWEL SIGN E
-0CCC..0CCD    ; Extend # Mn   [2] KANNADA VOWEL SIGN AU..KANNADA SIGN VIRAMA
-0CD5..0CD6    ; Extend # Mc   [2] KANNADA LENGTH MARK..KANNADA AI LENGTH MARK
-0CE2..0CE3    ; Extend # Mn   [2] KANNADA VOWEL SIGN VOCALIC L..KANNADA VOWEL SIGN VOCALIC LL
-0D3E          ; Extend # Mc       MALAYALAM VOWEL SIGN AA
-0D41..0D43    ; Extend # Mn   [3] MALAYALAM VOWEL SIGN U..MALAYALAM VOWEL SIGN VOCALIC R
-0D4D          ; Extend # Mn       MALAYALAM SIGN VIRAMA
-0D57          ; Extend # Mc       MALAYALAM AU LENGTH MARK
-0DCA          ; Extend # Mn       SINHALA SIGN AL-LAKUNA
-0DCF          ; Extend # Mc       SINHALA VOWEL SIGN AELA-PILLA
-0DD2..0DD4    ; Extend # Mn   [3] SINHALA VOWEL SIGN KETTI IS-PILLA..SINHALA VOWEL SIGN KETTI PAA-PILLA
-0DD6          ; Extend # Mn       SINHALA VOWEL SIGN DIGA PAA-PILLA
-0DDF          ; Extend # Mc       SINHALA VOWEL SIGN GAYANUKITTA
-0E31          ; Extend # Mn       THAI CHARACTER MAI HAN-AKAT
-0E34..0E3A    ; Extend # Mn   [7] THAI CHARACTER SARA I..THAI CHARACTER PHINTHU
-0E47..0E4E    ; Extend # Mn   [8] THAI CHARACTER MAITAIKHU..THAI CHARACTER YAMAKKAN
-0EB1          ; Extend # Mn       LAO VOWEL SIGN MAI KAN
-0EB4..0EB9    ; Extend # Mn   [6] LAO VOWEL SIGN I..LAO VOWEL SIGN UU
-0EBB..0EBC    ; Extend # Mn   [2] LAO VOWEL SIGN MAI KON..LAO SEMIVOWEL SIGN LO
-0EC8..0ECD    ; Extend # Mn   [6] LAO TONE MAI EK..LAO NIGGAHITA
-0F18..0F19    ; Extend # Mn   [2] TIBETAN ASTROLOGICAL SIGN -KHYUD PA..TIBETAN ASTROLOGICAL SIGN SDONG TSHUGS
-0F35          ; Extend # Mn       TIBETAN MARK NGAS BZUNG NYI ZLA
-0F37          ; Extend # Mn       TIBETAN MARK NGAS BZUNG SGOR RTAGS
-0F39          ; Extend # Mn       TIBETAN MARK TSA -PHRU
-0F71..0F7E    ; Extend # Mn  [14] TIBETAN VOWEL SIGN AA..TIBETAN SIGN RJES SU NGA RO
-0F80..0F84    ; Extend # Mn   [5] TIBETAN VOWEL SIGN REVERSED I..TIBETAN MARK HALANTA
-0F86..0F87    ; Extend # Mn   [2] TIBETAN SIGN LCI RTAGS..TIBETAN SIGN YANG RTAGS
-0F90..0F97    ; Extend # Mn   [8] TIBETAN SUBJOINED LETTER KA..TIBETAN SUBJOINED LETTER JA
-0F99..0FBC    ; Extend # Mn  [36] TIBETAN SUBJOINED LETTER NYA..TIBETAN SUBJOINED LETTER FIXED-FORM RA
-0FC6          ; Extend # Mn       TIBETAN SYMBOL PADMA GDAN
-102D..1030    ; Extend # Mn   [4] MYANMAR VOWEL SIGN I..MYANMAR VOWEL SIGN UU
-1032          ; Extend # Mn       MYANMAR VOWEL SIGN AI
-1036..1037    ; Extend # Mn   [2] MYANMAR SIGN ANUSVARA..MYANMAR SIGN DOT BELOW
-1039          ; Extend # Mn       MYANMAR SIGN VIRAMA
-1058..1059    ; Extend # Mn   [2] MYANMAR VOWEL SIGN VOCALIC L..MYANMAR VOWEL SIGN VOCALIC LL
-135F          ; Extend # Mn       ETHIOPIC COMBINING GEMINATION MARK
-1712..1714    ; Extend # Mn   [3] TAGALOG VOWEL SIGN I..TAGALOG SIGN VIRAMA
-1732..1734    ; Extend # Mn   [3] HANUNOO VOWEL SIGN I..HANUNOO SIGN PAMUDPOD
-1752..1753    ; Extend # Mn   [2] BUHID VOWEL SIGN I..BUHID VOWEL SIGN U
-1772..1773    ; Extend # Mn   [2] TAGBANWA VOWEL SIGN I..TAGBANWA VOWEL SIGN U
-17B7..17BD    ; Extend # Mn   [7] KHMER VOWEL SIGN I..KHMER VOWEL SIGN UA
-17C6          ; Extend # Mn       KHMER SIGN NIKAHIT
-17C9..17D3    ; Extend # Mn  [11] KHMER SIGN MUUSIKATOAN..KHMER SIGN BATHAMASAT
-17DD          ; Extend # Mn       KHMER SIGN ATTHACAN
-180B..180D    ; Extend # Mn   [3] MONGOLIAN FREE VARIATION SELECTOR ONE..MONGOLIAN FREE VARIATION SELECTOR THREE
-18A9          ; Extend # Mn       MONGOLIAN LETTER ALI GALI DAGALGA
-1920..1922    ; Extend # Mn   [3] LIMBU VOWEL SIGN A..LIMBU VOWEL SIGN U
-1927..1928    ; Extend # Mn   [2] LIMBU VOWEL SIGN E..LIMBU VOWEL SIGN O
-1932          ; Extend # Mn       LIMBU SMALL LETTER ANUSVARA
-1939..193B    ; Extend # Mn   [3] LIMBU SIGN MUKPHRENG..LIMBU SIGN SA-I
-1A17..1A18    ; Extend # Mn   [2] BUGINESE VOWEL SIGN I..BUGINESE VOWEL SIGN U
-1B00..1B03    ; Extend # Mn   [4] BALINESE SIGN ULU RICEM..BALINESE SIGN SURANG
-1B34          ; Extend # Mn       BALINESE SIGN REREKAN
-1B36..1B3A    ; Extend # Mn   [5] BALINESE VOWEL SIGN ULU..BALINESE VOWEL SIGN RA REPA
-1B3C          ; Extend # Mn       BALINESE VOWEL SIGN LA LENGA
-1B42          ; Extend # Mn       BALINESE VOWEL SIGN PEPET
-1B6B..1B73    ; Extend # Mn   [9] BALINESE MUSICAL SYMBOL COMBINING TEGEH..BALINESE MUSICAL SYMBOL COMBINING GONG
-1DC0..1DCA    ; Extend # Mn  [11] COMBINING DOTTED GRAVE ACCENT..COMBINING LATIN SMALL LETTER R BELOW
-1DFE..1DFF    ; Extend # Mn   [2] COMBINING LEFT ARROWHEAD ABOVE..COMBINING RIGHT ARROWHEAD AND DOWN ARROWHEAD BELOW
-200C..200D    ; Extend # Cf   [2] ZERO WIDTH NON-JOINER..ZERO WIDTH JOINER
-20D0..20DC    ; Extend # Mn  [13] COMBINING LEFT HARPOON ABOVE..COMBINING FOUR DOTS ABOVE
-20DD..20E0    ; Extend # Me   [4] COMBINING ENCLOSING CIRCLE..COMBINING ENCLOSING CIRCLE BACKSLASH
-20E1          ; Extend # Mn       COMBINING LEFT RIGHT ARROW ABOVE
-20E2..20E4    ; Extend # Me   [3] COMBINING ENCLOSING SCREEN..COMBINING ENCLOSING UPWARD POINTING TRIANGLE
-20E5..20EF    ; Extend # Mn  [11] COMBINING REVERSE SOLIDUS OVERLAY..COMBINING RIGHT ARROW BELOW
-302A..302F    ; Extend # Mn   [6] IDEOGRAPHIC LEVEL TONE MARK..HANGUL DOUBLE DOT TONE MARK
-3099..309A    ; Extend # Mn   [2] COMBINING KATAKANA-HIRAGANA VOICED SOUND MARK..COMBINING KATAKANA-HIRAGANA SEMI-VOICED SOUND MARK
-A806          ; Extend # Mn       SYLOTI NAGRI SIGN HASANTA
-A80B          ; Extend # Mn       SYLOTI NAGRI SIGN ANUSVARA
-A825..A826    ; Extend # Mn   [2] SYLOTI NAGRI VOWEL SIGN U..SYLOTI NAGRI VOWEL SIGN E
-FB1E          ; Extend # Mn       HEBREW POINT JUDEO-SPANISH VARIKA
-FE00..FE0F    ; Extend # Mn  [16] VARIATION SELECTOR-1..VARIATION SELECTOR-16
-FE20..FE23    ; Extend # Mn   [4] COMBINING LIGATURE LEFT HALF..COMBINING DOUBLE TILDE RIGHT HALF
-10A01..10A03  ; Extend # Mn   [3] KHAROSHTHI VOWEL SIGN I..KHAROSHTHI VOWEL SIGN VOCALIC R
-10A05..10A06  ; Extend # Mn   [2] KHAROSHTHI VOWEL SIGN E..KHAROSHTHI VOWEL SIGN O
-10A0C..10A0F  ; Extend # Mn   [4] KHAROSHTHI VOWEL LENGTH MARK..KHAROSHTHI SIGN VISARGA
-10A38..10A3A  ; Extend # Mn   [3] KHAROSHTHI SIGN BAR ABOVE..KHAROSHTHI SIGN DOT BELOW
-10A3F         ; Extend # Mn       KHAROSHTHI VIRAMA
-1D165         ; Extend # Mc       MUSICAL SYMBOL COMBINING STEM
-1D167..1D169  ; Extend # Mn   [3] MUSICAL SYMBOL COMBINING TREMOLO-1..MUSICAL SYMBOL COMBINING TREMOLO-3
-1D16E..1D172  ; Extend # Mc   [5] MUSICAL SYMBOL COMBINING FLAG-1..MUSICAL SYMBOL COMBINING FLAG-5
-1D17B..1D182  ; Extend # Mn   [8] MUSICAL SYMBOL COMBINING ACCENT..MUSICAL SYMBOL COMBINING LOURE
-1D185..1D18B  ; Extend # Mn   [7] MUSICAL SYMBOL COMBINING DOIT..MUSICAL SYMBOL COMBINING TRIPLE TONGUE
-1D1AA..1D1AD  ; Extend # Mn   [4] MUSICAL SYMBOL COMBINING DOWN BOW..MUSICAL SYMBOL COMBINING SNAP PIZZICATO
-1D242..1D244  ; Extend # Mn   [3] COMBINING GREEK MUSICAL TRISEME..COMBINING GREEK MUSICAL PENTASEME
-E0100..E01EF  ; Extend # Mn [240] VARIATION SELECTOR-17..VARIATION SELECTOR-256
-
-# Total code points: 911
-
-# ================================================
-
-1100..1159    ; L # Lo  [90] HANGUL CHOSEONG KIYEOK..HANGUL CHOSEONG YEORINHIEUH
-115F          ; L # Lo       HANGUL CHOSEONG FILLER
-
-# Total code points: 91
-
-# ================================================
-
-1160..11A2    ; V # Lo  [67] HANGUL JUNGSEONG FILLER..HANGUL JUNGSEONG SSANGARAEA
-
-# Total code points: 67
-
-# ================================================
-
-11A8..11F9    ; T # Lo  [82] HANGUL JONGSEONG KIYEOK..HANGUL JONGSEONG YEORINHIEUH
-
-# Total code points: 82
-
-# ================================================
-
-AC00          ; LV # Lo       HANGUL SYLLABLE GA
-AC1C          ; LV # Lo       HANGUL SYLLABLE GAE
-AC38          ; LV # Lo       HANGUL SYLLABLE GYA
-AC54          ; LV # Lo       HANGUL SYLLABLE GYAE
-AC70          ; LV # Lo       HANGUL SYLLABLE GEO
-AC8C          ; LV # Lo       HANGUL SYLLABLE GE
-ACA8          ; LV # Lo       HANGUL SYLLABLE GYEO
-ACC4          ; LV # Lo       HANGUL SYLLABLE GYE
-ACE0          ; LV # Lo       HANGUL SYLLABLE GO
-ACFC          ; LV # Lo       HANGUL SYLLABLE GWA
-AD18          ; LV # Lo       HANGUL SYLLABLE GWAE
-AD34          ; LV # Lo       HANGUL SYLLABLE GOE
-AD50          ; LV # Lo       HANGUL SYLLABLE GYO
-AD6C          ; LV # Lo       HANGUL SYLLABLE GU
-AD88          ; LV # Lo       HANGUL SYLLABLE GWEO
-ADA4          ; LV # Lo       HANGUL SYLLABLE GWE
-ADC0          ; LV # Lo       HANGUL SYLLABLE GWI
-ADDC          ; LV # Lo       HANGUL SYLLABLE GYU
-ADF8          ; LV # Lo       HANGUL SYLLABLE GEU
-AE14          ; LV # Lo       HANGUL SYLLABLE GYI
-AE30          ; LV # Lo       HANGUL SYLLABLE GI
-AE4C          ; LV # Lo       HANGUL SYLLABLE GGA
-AE68          ; LV # Lo       HANGUL SYLLABLE GGAE
-AE84          ; LV # Lo       HANGUL SYLLABLE GGYA
-AEA0          ; LV # Lo       HANGUL SYLLABLE GGYAE
-AEBC          ; LV # Lo       HANGUL SYLLABLE GGEO
-AED8          ; LV # Lo       HANGUL SYLLABLE GGE
-AEF4          ; LV # Lo       HANGUL SYLLABLE GGYEO
-AF10          ; LV # Lo       HANGUL SYLLABLE GGYE
-AF2C          ; LV # Lo       HANGUL SYLLABLE GGO
-AF48          ; LV # Lo       HANGUL SYLLABLE GGWA
-AF64          ; LV # Lo       HANGUL SYLLABLE GGWAE
-AF80          ; LV # Lo       HANGUL SYLLABLE GGOE
-AF9C          ; LV # Lo       HANGUL SYLLABLE GGYO
-AFB8          ; LV # Lo       HANGUL SYLLABLE GGU
-AFD4          ; LV # Lo       HANGUL SYLLABLE GGWEO
-AFF0          ; LV # Lo       HANGUL SYLLABLE GGWE
-B00C          ; LV # Lo       HANGUL SYLLABLE GGWI
-B028          ; LV # Lo       HANGUL SYLLABLE GGYU
-B044          ; LV # Lo       HANGUL SYLLABLE GGEU
-B060          ; LV # Lo       HANGUL SYLLABLE GGYI
-B07C          ; LV # Lo       HANGUL SYLLABLE GGI
-B098          ; LV # Lo       HANGUL SYLLABLE NA
-B0B4          ; LV # Lo       HANGUL SYLLABLE NAE
-B0D0          ; LV # Lo       HANGUL SYLLABLE NYA
-B0EC          ; LV # Lo       HANGUL SYLLABLE NYAE
-B108          ; LV # Lo       HANGUL SYLLABLE NEO
-B124          ; LV # Lo       HANGUL SYLLABLE NE
-B140          ; LV # Lo       HANGUL SYLLABLE NYEO
-B15C          ; LV # Lo       HANGUL SYLLABLE NYE
-B178          ; LV # Lo       HANGUL SYLLABLE NO
-B194          ; LV # Lo       HANGUL SYLLABLE NWA
-B1B0          ; LV # Lo       HANGUL SYLLABLE NWAE
-B1CC          ; LV # Lo       HANGUL SYLLABLE NOE
-B1E8          ; LV # Lo       HANGUL SYLLABLE NYO
-B204          ; LV # Lo       HANGUL SYLLABLE NU
-B220          ; LV # Lo       HANGUL SYLLABLE NWEO
-B23C          ; LV # Lo       HANGUL SYLLABLE NWE
-B258          ; LV # Lo       HANGUL SYLLABLE NWI
-B274          ; LV # Lo       HANGUL SYLLABLE NYU
-B290          ; LV # Lo       HANGUL SYLLABLE NEU
-B2AC          ; LV # Lo       HANGUL SYLLABLE NYI
-B2C8          ; LV # Lo       HANGUL SYLLABLE NI
-B2E4          ; LV # Lo       HANGUL SYLLABLE DA
-B300          ; LV # Lo       HANGUL SYLLABLE DAE
-B31C          ; LV # Lo       HANGUL SYLLABLE DYA
-B338          ; LV # Lo       HANGUL SYLLABLE DYAE
-B354          ; LV # Lo       HANGUL SYLLABLE DEO
-B370          ; LV # Lo       HANGUL SYLLABLE DE
-B38C          ; LV # Lo       HANGUL SYLLABLE DYEO
-B3A8          ; LV # Lo       HANGUL SYLLABLE DYE
-B3C4          ; LV # Lo       HANGUL SYLLABLE DO
-B3E0          ; LV # Lo       HANGUL SYLLABLE DWA
-B3FC          ; LV # Lo       HANGUL SYLLABLE DWAE
-B418          ; LV # Lo       HANGUL SYLLABLE DOE
-B434          ; LV # Lo       HANGUL SYLLABLE DYO
-B450          ; LV # Lo       HANGUL SYLLABLE DU
-B46C          ; LV # Lo       HANGUL SYLLABLE DWEO
-B488          ; LV # Lo       HANGUL SYLLABLE DWE
-B4A4          ; LV # Lo       HANGUL SYLLABLE DWI
-B4C0          ; LV # Lo       HANGUL SYLLABLE DYU
-B4DC          ; LV # Lo       HANGUL SYLLABLE DEU
-B4F8          ; LV # Lo       HANGUL SYLLABLE DYI
-B514          ; LV # Lo       HANGUL SYLLABLE DI
-B530          ; LV # Lo       HANGUL SYLLABLE DDA
-B54C          ; LV # Lo       HANGUL SYLLABLE DDAE
-B568          ; LV # Lo       HANGUL SYLLABLE DDYA
-B584          ; LV # Lo       HANGUL SYLLABLE DDYAE
-B5A0          ; LV # Lo       HANGUL SYLLABLE DDEO
-B5BC          ; LV # Lo       HANGUL SYLLABLE DDE
-B5D8          ; LV # Lo       HANGUL SYLLABLE DDYEO
-B5F4          ; LV # Lo       HANGUL SYLLABLE DDYE
-B610          ; LV # Lo       HANGUL SYLLABLE DDO
-B62C          ; LV # Lo       HANGUL SYLLABLE DDWA
-B648          ; LV # Lo       HANGUL SYLLABLE DDWAE
-B664          ; LV # Lo       HANGUL SYLLABLE DDOE
-B680          ; LV # Lo       HANGUL SYLLABLE DDYO
-B69C          ; LV # Lo       HANGUL SYLLABLE DDU
-B6B8          ; LV # Lo       HANGUL SYLLABLE DDWEO
-B6D4          ; LV # Lo       HANGUL SYLLABLE DDWE
-B6F0          ; LV # Lo       HANGUL SYLLABLE DDWI
-B70C          ; LV # Lo       HANGUL SYLLABLE DDYU
-B728          ; LV # Lo       HANGUL SYLLABLE DDEU
-B744          ; LV # Lo       HANGUL SYLLABLE DDYI
-B760          ; LV # Lo       HANGUL SYLLABLE DDI
-B77C          ; LV # Lo       HANGUL SYLLABLE RA
-B798          ; LV # Lo       HANGUL SYLLABLE RAE
-B7B4          ; LV # Lo       HANGUL SYLLABLE RYA
-B7D0          ; LV # Lo       HANGUL SYLLABLE RYAE
-B7EC          ; LV # Lo       HANGUL SYLLABLE REO
-B808          ; LV # Lo       HANGUL SYLLABLE RE
-B824          ; LV # Lo       HANGUL SYLLABLE RYEO
-B840          ; LV # Lo       HANGUL SYLLABLE RYE
-B85C          ; LV # Lo       HANGUL SYLLABLE RO
-B878          ; LV # Lo       HANGUL SYLLABLE RWA
-B894          ; LV # Lo       HANGUL SYLLABLE RWAE
-B8B0          ; LV # Lo       HANGUL SYLLABLE ROE
-B8CC          ; LV # Lo       HANGUL SYLLABLE RYO
-B8E8          ; LV # Lo       HANGUL SYLLABLE RU
-B904          ; LV # Lo       HANGUL SYLLABLE RWEO
-B920          ; LV # Lo       HANGUL SYLLABLE RWE
-B93C          ; LV # Lo       HANGUL SYLLABLE RWI
-B958          ; LV # Lo       HANGUL SYLLABLE RYU
-B974          ; LV # Lo       HANGUL SYLLABLE REU
-B990          ; LV # Lo       HANGUL SYLLABLE RYI
-B9AC          ; LV # Lo       HANGUL SYLLABLE RI
-B9C8          ; LV # Lo       HANGUL SYLLABLE MA
-B9E4          ; LV # Lo       HANGUL SYLLABLE MAE
-BA00          ; LV # Lo       HANGUL SYLLABLE MYA
-BA1C          ; LV # Lo       HANGUL SYLLABLE MYAE
-BA38          ; LV # Lo       HANGUL SYLLABLE MEO
-BA54          ; LV # Lo       HANGUL SYLLABLE ME
-BA70          ; LV # Lo       HANGUL SYLLABLE MYEO
-BA8C          ; LV # Lo       HANGUL SYLLABLE MYE
-BAA8          ; LV # Lo       HANGUL SYLLABLE MO
-BAC4          ; LV # Lo       HANGUL SYLLABLE MWA
-BAE0          ; LV # Lo       HANGUL SYLLABLE MWAE
-BAFC          ; LV # Lo       HANGUL SYLLABLE MOE
-BB18          ; LV # Lo       HANGUL SYLLABLE MYO
-BB34          ; LV # Lo       HANGUL SYLLABLE MU
-BB50          ; LV # Lo       HANGUL SYLLABLE MWEO
-BB6C          ; LV # Lo       HANGUL SYLLABLE MWE
-BB88          ; LV # Lo       HANGUL SYLLABLE MWI
-BBA4          ; LV # Lo       HANGUL SYLLABLE MYU
-BBC0          ; LV # Lo       HANGUL SYLLABLE MEU
-BBDC          ; LV # Lo       HANGUL SYLLABLE MYI
-BBF8          ; LV # Lo       HANGUL SYLLABLE MI
-BC14          ; LV # Lo       HANGUL SYLLABLE BA
-BC30          ; LV # Lo       HANGUL SYLLABLE BAE
-BC4C          ; LV # Lo       HANGUL SYLLABLE BYA
-BC68          ; LV # Lo       HANGUL SYLLABLE BYAE
-BC84          ; LV # Lo       HANGUL SYLLABLE BEO
-BCA0          ; LV # Lo       HANGUL SYLLABLE BE
-BCBC          ; LV # Lo       HANGUL SYLLABLE BYEO
-BCD8          ; LV # Lo       HANGUL SYLLABLE BYE
-BCF4          ; LV # Lo       HANGUL SYLLABLE BO
-BD10          ; LV # Lo       HANGUL SYLLABLE BWA
-BD2C          ; LV # Lo       HANGUL SYLLABLE BWAE
-BD48          ; LV # Lo       HANGUL SYLLABLE BOE
-BD64          ; LV # Lo       HANGUL SYLLABLE BYO
-BD80          ; LV # Lo       HANGUL SYLLABLE BU
-BD9C          ; LV # Lo       HANGUL SYLLABLE BWEO
-BDB8          ; LV # Lo       HANGUL SYLLABLE BWE
-BDD4          ; LV # Lo       HANGUL SYLLABLE BWI
-BDF0          ; LV # Lo       HANGUL SYLLABLE BYU
-BE0C          ; LV # Lo       HANGUL SYLLABLE BEU
-BE28          ; LV # Lo       HANGUL SYLLABLE BYI
-BE44          ; LV # Lo       HANGUL SYLLABLE BI
-BE60          ; LV # Lo       HANGUL SYLLABLE BBA
-BE7C          ; LV # Lo       HANGUL SYLLABLE BBAE
-BE98          ; LV # Lo       HANGUL SYLLABLE BBYA
-BEB4          ; LV # Lo       HANGUL SYLLABLE BBYAE
-BED0          ; LV # Lo       HANGUL SYLLABLE BBEO
-BEEC          ; LV # Lo       HANGUL SYLLABLE BBE
-BF08          ; LV # Lo       HANGUL SYLLABLE BBYEO
-BF24          ; LV # Lo       HANGUL SYLLABLE BBYE
-BF40          ; LV # Lo       HANGUL SYLLABLE BBO
-BF5C          ; LV # Lo       HANGUL SYLLABLE BBWA
-BF78          ; LV # Lo       HANGUL SYLLABLE BBWAE
-BF94          ; LV # Lo       HANGUL SYLLABLE BBOE
-BFB0          ; LV # Lo       HANGUL SYLLABLE BBYO
-BFCC          ; LV # Lo       HANGUL SYLLABLE BBU
-BFE8          ; LV # Lo       HANGUL SYLLABLE BBWEO
-C004          ; LV # Lo       HANGUL SYLLABLE BBWE
-C020          ; LV # Lo       HANGUL SYLLABLE BBWI
-C03C          ; LV # Lo       HANGUL SYLLABLE BBYU
-C058          ; LV # Lo       HANGUL SYLLABLE BBEU
-C074          ; LV # Lo       HANGUL SYLLABLE BBYI
-C090          ; LV # Lo       HANGUL SYLLABLE BBI
-C0AC          ; LV # Lo       HANGUL SYLLABLE SA
-C0C8          ; LV # Lo       HANGUL SYLLABLE SAE
-C0E4          ; LV # Lo       HANGUL SYLLABLE SYA
-C100          ; LV # Lo       HANGUL SYLLABLE SYAE
-C11C          ; LV # Lo       HANGUL SYLLABLE SEO
-C138          ; LV # Lo       HANGUL SYLLABLE SE
-C154          ; LV # Lo       HANGUL SYLLABLE SYEO
-C170          ; LV # Lo       HANGUL SYLLABLE SYE
-C18C          ; LV # Lo       HANGUL SYLLABLE SO
-C1A8          ; LV # Lo       HANGUL SYLLABLE SWA
-C1C4          ; LV # Lo       HANGUL SYLLABLE SWAE
-C1E0          ; LV # Lo       HANGUL SYLLABLE SOE
-C1FC          ; LV # Lo       HANGUL SYLLABLE SYO
-C218          ; LV # Lo       HANGUL SYLLABLE SU
-C234          ; LV # Lo       HANGUL SYLLABLE SWEO
-C250          ; LV # Lo       HANGUL SYLLABLE SWE
-C26C          ; LV # Lo       HANGUL SYLLABLE SWI
-C288          ; LV # Lo       HANGUL SYLLABLE SYU
-C2A4          ; LV # Lo       HANGUL SYLLABLE SEU
-C2C0          ; LV # Lo       HANGUL SYLLABLE SYI
-C2DC          ; LV # Lo       HANGUL SYLLABLE SI
-C2F8          ; LV # Lo       HANGUL SYLLABLE SSA
-C314          ; LV # Lo       HANGUL SYLLABLE SSAE
-C330          ; LV # Lo       HANGUL SYLLABLE SSYA
-C34C          ; LV # Lo       HANGUL SYLLABLE SSYAE
-C368          ; LV # Lo       HANGUL SYLLABLE SSEO
-C384          ; LV # Lo       HANGUL SYLLABLE SSE
-C3A0          ; LV # Lo       HANGUL SYLLABLE SSYEO
-C3BC          ; LV # Lo       HANGUL SYLLABLE SSYE
-C3D8          ; LV # Lo       HANGUL SYLLABLE SSO
-C3F4          ; LV # Lo       HANGUL SYLLABLE SSWA
-C410          ; LV # Lo       HANGUL SYLLABLE SSWAE
-C42C          ; LV # Lo       HANGUL SYLLABLE SSOE
-C448          ; LV # Lo       HANGUL SYLLABLE SSYO
-C464          ; LV # Lo       HANGUL SYLLABLE SSU
-C480          ; LV # Lo       HANGUL SYLLABLE SSWEO
-C49C          ; LV # Lo       HANGUL SYLLABLE SSWE
-C4B8          ; LV # Lo       HANGUL SYLLABLE SSWI
-C4D4          ; LV # Lo       HANGUL SYLLABLE SSYU
-C4F0          ; LV # Lo       HANGUL SYLLABLE SSEU
-C50C          ; LV # Lo       HANGUL SYLLABLE SSYI
-C528          ; LV # Lo       HANGUL SYLLABLE SSI
-C544          ; LV # Lo       HANGUL SYLLABLE A
-C560          ; LV # Lo       HANGUL SYLLABLE AE
-C57C          ; LV # Lo       HANGUL SYLLABLE YA
-C598          ; LV # Lo       HANGUL SYLLABLE YAE
-C5B4          ; LV # Lo       HANGUL SYLLABLE EO
-C5D0          ; LV # Lo       HANGUL SYLLABLE E
-C5EC          ; LV # Lo       HANGUL SYLLABLE YEO
-C608          ; LV # Lo       HANGUL SYLLABLE YE
-C624          ; LV # Lo       HANGUL SYLLABLE O
-C640          ; LV # Lo       HANGUL SYLLABLE WA
-C65C          ; LV # Lo       HANGUL SYLLABLE WAE
-C678          ; LV # Lo       HANGUL SYLLABLE OE
-C694          ; LV # Lo       HANGUL SYLLABLE YO
-C6B0          ; LV # Lo       HANGUL SYLLABLE U
-C6CC          ; LV # Lo       HANGUL SYLLABLE WEO
-C6E8          ; LV # Lo       HANGUL SYLLABLE WE
-C704          ; LV # Lo       HANGUL SYLLABLE WI
-C720          ; LV # Lo       HANGUL SYLLABLE YU
-C73C          ; LV # Lo       HANGUL SYLLABLE EU
-C758          ; LV # Lo       HANGUL SYLLABLE YI
-C774          ; LV # Lo       HANGUL SYLLABLE I
-C790          ; LV # Lo       HANGUL SYLLABLE JA
-C7AC          ; LV # Lo       HANGUL SYLLABLE JAE
-C7C8          ; LV # Lo       HANGUL SYLLABLE JYA
-C7E4          ; LV # Lo       HANGUL SYLLABLE JYAE
-C800          ; LV # Lo       HANGUL SYLLABLE JEO
-C81C          ; LV # Lo       HANGUL SYLLABLE JE
-C838          ; LV # Lo       HANGUL SYLLABLE JYEO
-C854          ; LV # Lo       HANGUL SYLLABLE JYE
-C870          ; LV # Lo       HANGUL SYLLABLE JO
-C88C          ; LV # Lo       HANGUL SYLLABLE JWA
-C8A8          ; LV # Lo       HANGUL SYLLABLE JWAE
-C8C4          ; LV # Lo       HANGUL SYLLABLE JOE
-C8E0          ; LV # Lo       HANGUL SYLLABLE JYO
-C8FC          ; LV # Lo       HANGUL SYLLABLE JU
-C918          ; LV # Lo       HANGUL SYLLABLE JWEO
-C934          ; LV # Lo       HANGUL SYLLABLE JWE
-C950          ; LV # Lo       HANGUL SYLLABLE JWI
-C96C          ; LV # Lo       HANGUL SYLLABLE JYU
-C988          ; LV # Lo       HANGUL SYLLABLE JEU
-C9A4          ; LV # Lo       HANGUL SYLLABLE JYI
-C9C0          ; LV # Lo       HANGUL SYLLABLE JI
-C9DC          ; LV # Lo       HANGUL SYLLABLE JJA
-C9F8          ; LV # Lo       HANGUL SYLLABLE JJAE
-CA14          ; LV # Lo       HANGUL SYLLABLE JJYA
-CA30          ; LV # Lo       HANGUL SYLLABLE JJYAE
-CA4C          ; LV # Lo       HANGUL SYLLABLE JJEO
-CA68          ; LV # Lo       HANGUL SYLLABLE JJE
-CA84          ; LV # Lo       HANGUL SYLLABLE JJYEO
-CAA0          ; LV # Lo       HANGUL SYLLABLE JJYE
-CABC          ; LV # Lo       HANGUL SYLLABLE JJO
-CAD8          ; LV # Lo       HANGUL SYLLABLE JJWA
-CAF4          ; LV # Lo       HANGUL SYLLABLE JJWAE
-CB10          ; LV # Lo       HANGUL SYLLABLE JJOE
-CB2C          ; LV # Lo       HANGUL SYLLABLE JJYO
-CB48          ; LV # Lo       HANGUL SYLLABLE JJU
-CB64          ; LV # Lo       HANGUL SYLLABLE JJWEO
-CB80          ; LV # Lo       HANGUL SYLLABLE JJWE
-CB9C          ; LV # Lo       HANGUL SYLLABLE JJWI
-CBB8          ; LV # Lo       HANGUL SYLLABLE JJYU
-CBD4          ; LV # Lo       HANGUL SYLLABLE JJEU
-CBF0          ; LV # Lo       HANGUL SYLLABLE JJYI
-CC0C          ; LV # Lo       HANGUL SYLLABLE JJI
-CC28          ; LV # Lo       HANGUL SYLLABLE CA
-CC44          ; LV # Lo       HANGUL SYLLABLE CAE
-CC60          ; LV # Lo       HANGUL SYLLABLE CYA
-CC7C          ; LV # Lo       HANGUL SYLLABLE CYAE
-CC98          ; LV # Lo       HANGUL SYLLABLE CEO
-CCB4          ; LV # Lo       HANGUL SYLLABLE CE
-CCD0          ; LV # Lo       HANGUL SYLLABLE CYEO
-CCEC          ; LV # Lo       HANGUL SYLLABLE CYE
-CD08          ; LV # Lo       HANGUL SYLLABLE CO
-CD24          ; LV # Lo       HANGUL SYLLABLE CWA
-CD40          ; LV # Lo       HANGUL SYLLABLE CWAE
-CD5C          ; LV # Lo       HANGUL SYLLABLE COE
-CD78          ; LV # Lo       HANGUL SYLLABLE CYO
-CD94          ; LV # Lo       HANGUL SYLLABLE CU
-CDB0          ; LV # Lo       HANGUL SYLLABLE CWEO
-CDCC          ; LV # Lo       HANGUL SYLLABLE CWE
-CDE8          ; LV # Lo       HANGUL SYLLABLE CWI
-CE04          ; LV # Lo       HANGUL SYLLABLE CYU
-CE20          ; LV # Lo       HANGUL SYLLABLE CEU
-CE3C          ; LV # Lo       HANGUL SYLLABLE CYI
-CE58          ; LV # Lo       HANGUL SYLLABLE CI
-CE74          ; LV # Lo       HANGUL SYLLABLE KA
-CE90          ; LV # Lo       HANGUL SYLLABLE KAE
-CEAC          ; LV # Lo       HANGUL SYLLABLE KYA
-CEC8          ; LV # Lo       HANGUL SYLLABLE KYAE
-CEE4          ; LV # Lo       HANGUL SYLLABLE KEO
-CF00          ; LV # Lo       HANGUL SYLLABLE KE
-CF1C          ; LV # Lo       HANGUL SYLLABLE KYEO
-CF38          ; LV # Lo       HANGUL SYLLABLE KYE
-CF54          ; LV # Lo       HANGUL SYLLABLE KO
-CF70          ; LV # Lo       HANGUL SYLLABLE KWA
-CF8C          ; LV # Lo       HANGUL SYLLABLE KWAE
-CFA8          ; LV # Lo       HANGUL SYLLABLE KOE
-CFC4          ; LV # Lo       HANGUL SYLLABLE KYO
-CFE0          ; LV # Lo       HANGUL SYLLABLE KU
-CFFC          ; LV # Lo       HANGUL SYLLABLE KWEO
-D018          ; LV # Lo       HANGUL SYLLABLE KWE
-D034          ; LV # Lo       HANGUL SYLLABLE KWI
-D050          ; LV # Lo       HANGUL SYLLABLE KYU
-D06C          ; LV # Lo       HANGUL SYLLABLE KEU
-D088          ; LV # Lo       HANGUL SYLLABLE KYI
-D0A4          ; LV # Lo       HANGUL SYLLABLE KI
-D0C0          ; LV # Lo       HANGUL SYLLABLE TA
-D0DC          ; LV # Lo       HANGUL SYLLABLE TAE
-D0F8          ; LV # Lo       HANGUL SYLLABLE TYA
-D114          ; LV # Lo       HANGUL SYLLABLE TYAE
-D130          ; LV # Lo       HANGUL SYLLABLE TEO
-D14C          ; LV # Lo       HANGUL SYLLABLE TE
-D168          ; LV # Lo       HANGUL SYLLABLE TYEO
-D184          ; LV # Lo       HANGUL SYLLABLE TYE
-D1A0          ; LV # Lo       HANGUL SYLLABLE TO
-D1BC          ; LV # Lo       HANGUL SYLLABLE TWA
-D1D8          ; LV # Lo       HANGUL SYLLABLE TWAE
-D1F4          ; LV # Lo       HANGUL SYLLABLE TOE
-D210          ; LV # Lo       HANGUL SYLLABLE TYO
-D22C          ; LV # Lo       HANGUL SYLLABLE TU
-D248          ; LV # Lo       HANGUL SYLLABLE TWEO
-D264          ; LV # Lo       HANGUL SYLLABLE TWE
-D280          ; LV # Lo       HANGUL SYLLABLE TWI
-D29C          ; LV # Lo       HANGUL SYLLABLE TYU
-D2B8          ; LV # Lo       HANGUL SYLLABLE TEU
-D2D4          ; LV # Lo       HANGUL SYLLABLE TYI
-D2F0          ; LV # Lo       HANGUL SYLLABLE TI
-D30C          ; LV # Lo       HANGUL SYLLABLE PA
-D328          ; LV # Lo       HANGUL SYLLABLE PAE
-D344          ; LV # Lo       HANGUL SYLLABLE PYA
-D360          ; LV # Lo       HANGUL SYLLABLE PYAE
-D37C          ; LV # Lo       HANGUL SYLLABLE PEO
-D398          ; LV # Lo       HANGUL SYLLABLE PE
-D3B4          ; LV # Lo       HANGUL SYLLABLE PYEO
-D3D0          ; LV # Lo       HANGUL SYLLABLE PYE
-D3EC          ; LV # Lo       HANGUL SYLLABLE PO
-D408          ; LV # Lo       HANGUL SYLLABLE PWA
-D424          ; LV # Lo       HANGUL SYLLABLE PWAE
-D440          ; LV # Lo       HANGUL SYLLABLE POE
-D45C          ; LV # Lo       HANGUL SYLLABLE PYO
-D478          ; LV # Lo       HANGUL SYLLABLE PU
-D494          ; LV # Lo       HANGUL SYLLABLE PWEO
-D4B0          ; LV # Lo       HANGUL SYLLABLE PWE
-D4CC          ; LV # Lo       HANGUL SYLLABLE PWI
-D4E8          ; LV # Lo       HANGUL SYLLABLE PYU
-D504          ; LV # Lo       HANGUL SYLLABLE PEU
-D520          ; LV # Lo       HANGUL SYLLABLE PYI
-D53C          ; LV # Lo       HANGUL SYLLABLE PI
-D558          ; LV # Lo       HANGUL SYLLABLE HA
-D574          ; LV # Lo       HANGUL SYLLABLE HAE
-D590          ; LV # Lo       HANGUL SYLLABLE HYA
-D5AC          ; LV # Lo       HANGUL SYLLABLE HYAE
-D5C8          ; LV # Lo       HANGUL SYLLABLE HEO
-D5E4          ; LV # Lo       HANGUL SYLLABLE HE
-D600          ; LV # Lo       HANGUL SYLLABLE HYEO
-D61C          ; LV # Lo       HANGUL SYLLABLE HYE
-D638          ; LV # Lo       HANGUL SYLLABLE HO
-D654          ; LV # Lo       HANGUL SYLLABLE HWA
-D670          ; LV # Lo       HANGUL SYLLABLE HWAE
-D68C          ; LV # Lo       HANGUL SYLLABLE HOE
-D6A8          ; LV # Lo       HANGUL SYLLABLE HYO
-D6C4          ; LV # Lo       HANGUL SYLLABLE HU
-D6E0          ; LV # Lo       HANGUL SYLLABLE HWEO
-D6FC          ; LV # Lo       HANGUL SYLLABLE HWE
-D718          ; LV # Lo       HANGUL SYLLABLE HWI
-D734          ; LV # Lo       HANGUL SYLLABLE HYU
-D750          ; LV # Lo       HANGUL SYLLABLE HEU
-D76C          ; LV # Lo       HANGUL SYLLABLE HYI
-D788          ; LV # Lo       HANGUL SYLLABLE HI
-
-# Total code points: 399
-
-# ================================================
-
-AC01..AC1B    ; LVT # Lo  [27] HANGUL SYLLABLE GAG..HANGUL SYLLABLE GAH
-AC1D..AC37    ; LVT # Lo  [27] HANGUL SYLLABLE GAEG..HANGUL SYLLABLE GAEH
-AC39..AC53    ; LVT # Lo  [27] HANGUL SYLLABLE GYAG..HANGUL SYLLABLE GYAH
-AC55..AC6F    ; LVT # Lo  [27] HANGUL SYLLABLE GYAEG..HANGUL SYLLABLE GYAEH
-AC71..AC8B    ; LVT # Lo  [27] HANGUL SYLLABLE GEOG..HANGUL SYLLABLE GEOH
-AC8D..ACA7    ; LVT # Lo  [27] HANGUL SYLLABLE GEG..HANGUL SYLLABLE GEH
-ACA9..ACC3    ; LVT # Lo  [27] HANGUL SYLLABLE GYEOG..HANGUL SYLLABLE GYEOH
-ACC5..ACDF    ; LVT # Lo  [27] HANGUL SYLLABLE GYEG..HANGUL SYLLABLE GYEH
-ACE1..ACFB    ; LVT # Lo  [27] HANGUL SYLLABLE GOG..HANGUL SYLLABLE GOH
-ACFD..AD17    ; LVT # Lo  [27] HANGUL SYLLABLE GWAG..HANGUL SYLLABLE GWAH
-AD19..AD33    ; LVT # Lo  [27] HANGUL SYLLABLE GWAEG..HANGUL SYLLABLE GWAEH
-AD35..AD4F    ; LVT # Lo  [27] HANGUL SYLLABLE GOEG..HANGUL SYLLABLE GOEH
-AD51..AD6B    ; LVT # Lo  [27] HANGUL SYLLABLE GYOG..HANGUL SYLLABLE GYOH
-AD6D..AD87    ; LVT # Lo  [27] HANGUL SYLLABLE GUG..HANGUL SYLLABLE GUH
-AD89..ADA3    ; LVT # Lo  [27] HANGUL SYLLABLE GWEOG..HANGUL SYLLABLE GWEOH
-ADA5..ADBF    ; LVT # Lo  [27] HANGUL SYLLABLE GWEG..HANGUL SYLLABLE GWEH
-ADC1..ADDB    ; LVT # Lo  [27] HANGUL SYLLABLE GWIG..HANGUL SYLLABLE GWIH
-ADDD..ADF7    ; LVT # Lo  [27] HANGUL SYLLABLE GYUG..HANGUL SYLLABLE GYUH
-ADF9..AE13    ; LVT # Lo  [27] HANGUL SYLLABLE GEUG..HANGUL SYLLABLE GEUH
-AE15..AE2F    ; LVT # Lo  [27] HANGUL SYLLABLE GYIG..HANGUL SYLLABLE GYIH
-AE31..AE4B    ; LVT # Lo  [27] HANGUL SYLLABLE GIG..HANGUL SYLLABLE GIH
-AE4D..AE67    ; LVT # Lo  [27] HANGUL SYLLABLE GGAG..HANGUL SYLLABLE GGAH
-AE69..AE83    ; LVT # Lo  [27] HANGUL SYLLABLE GGAEG..HANGUL SYLLABLE GGAEH
-AE85..AE9F    ; LVT # Lo  [27] HANGUL SYLLABLE GGYAG..HANGUL SYLLABLE GGYAH
-AEA1..AEBB    ; LVT # Lo  [27] HANGUL SYLLABLE GGYAEG..HANGUL SYLLABLE GGYAEH
-AEBD..AED7    ; LVT # Lo  [27] HANGUL SYLLABLE GGEOG..HANGUL SYLLABLE GGEOH
-AED9..AEF3    ; LVT # Lo  [27] HANGUL SYLLABLE GGEG..HANGUL SYLLABLE GGEH
-AEF5..AF0F    ; LVT # Lo  [27] HANGUL SYLLABLE GGYEOG..HANGUL SYLLABLE GGYEOH
-AF11..AF2B    ; LVT # Lo  [27] HANGUL SYLLABLE GGYEG..HANGUL SYLLABLE GGYEH
-AF2D..AF47    ; LVT # Lo  [27] HANGUL SYLLABLE GGOG..HANGUL SYLLABLE GGOH
-AF49..AF63    ; LVT # Lo  [27] HANGUL SYLLABLE GGWAG..HANGUL SYLLABLE GGWAH
-AF65..AF7F    ; LVT # Lo  [27] HANGUL SYLLABLE GGWAEG..HANGUL SYLLABLE GGWAEH
-AF81..AF9B    ; LVT # Lo  [27] HANGUL SYLLABLE GGOEG..HANGUL SYLLABLE GGOEH
-AF9D..AFB7    ; LVT # Lo  [27] HANGUL SYLLABLE GGYOG..HANGUL SYLLABLE GGYOH
-AFB9..AFD3    ; LVT # Lo  [27] HANGUL SYLLABLE GGUG..HANGUL SYLLABLE GGUH
-AFD5..AFEF    ; LVT # Lo  [27] HANGUL SYLLABLE GGWEOG..HANGUL SYLLABLE GGWEOH
-AFF1..B00B    ; LVT # Lo  [27] HANGUL SYLLABLE GGWEG..HANGUL SYLLABLE GGWEH
-B00D..B027    ; LVT # Lo  [27] HANGUL SYLLABLE GGWIG..HANGUL SYLLABLE GGWIH
-B029..B043    ; LVT # Lo  [27] HANGUL SYLLABLE GGYUG..HANGUL SYLLABLE GGYUH
-B045..B05F    ; LVT # Lo  [27] HANGUL SYLLABLE GGEUG..HANGUL SYLLABLE GGEUH
-B061..B07B    ; LVT # Lo  [27] HANGUL SYLLABLE GGYIG..HANGUL SYLLABLE GGYIH
-B07D..B097    ; LVT # Lo  [27] HANGUL SYLLABLE GGIG..HANGUL SYLLABLE GGIH
-B099..B0B3    ; LVT # Lo  [27] HANGUL SYLLABLE NAG..HANGUL SYLLABLE NAH
-B0B5..B0CF    ; LVT # Lo  [27] HANGUL SYLLABLE NAEG..HANGUL SYLLABLE NAEH
-B0D1..B0EB    ; LVT # Lo  [27] HANGUL SYLLABLE NYAG..HANGUL SYLLABLE NYAH
-B0ED..B107    ; LVT # Lo  [27] HANGUL SYLLABLE NYAEG..HANGUL SYLLABLE NYAEH
-B109..B123    ; LVT # Lo  [27] HANGUL SYLLABLE NEOG..HANGUL SYLLABLE NEOH
-B125..B13F    ; LVT # Lo  [27] HANGUL SYLLABLE NEG..HANGUL SYLLABLE NEH
-B141..B15B    ; LVT # Lo  [27] HANGUL SYLLABLE NYEOG..HANGUL SYLLABLE NYEOH
-B15D..B177    ; LVT # Lo  [27] HANGUL SYLLABLE NYEG..HANGUL SYLLABLE NYEH
-B179..B193    ; LVT # Lo  [27] HANGUL SYLLABLE NOG..HANGUL SYLLABLE NOH
-B195..B1AF    ; LVT # Lo  [27] HANGUL SYLLABLE NWAG..HANGUL SYLLABLE NWAH
-B1B1..B1CB    ; LVT # Lo  [27] HANGUL SYLLABLE NWAEG..HANGUL SYLLABLE NWAEH
-B1CD..B1E7    ; LVT # Lo  [27] HANGUL SYLLABLE NOEG..HANGUL SYLLABLE NOEH
-B1E9..B203    ; LVT # Lo  [27] HANGUL SYLLABLE NYOG..HANGUL SYLLABLE NYOH
-B205..B21F    ; LVT # Lo  [27] HANGUL SYLLABLE NUG..HANGUL SYLLABLE NUH
-B221..B23B    ; LVT # Lo  [27] HANGUL SYLLABLE NWEOG..HANGUL SYLLABLE NWEOH
-B23D..B257    ; LVT # Lo  [27] HANGUL SYLLABLE NWEG..HANGUL SYLLABLE NWEH
-B259..B273    ; LVT # Lo  [27] HANGUL SYLLABLE NWIG..HANGUL SYLLABLE NWIH
-B275..B28F    ; LVT # Lo  [27] HANGUL SYLLABLE NYUG..HANGUL SYLLABLE NYUH
-B291..B2AB    ; LVT # Lo  [27] HANGUL SYLLABLE NEUG..HANGUL SYLLABLE NEUH
-B2AD..B2C7    ; LVT # Lo  [27] HANGUL SYLLABLE NYIG..HANGUL SYLLABLE NYIH
-B2C9..B2E3    ; LVT # Lo  [27] HANGUL SYLLABLE NIG..HANGUL SYLLABLE NIH
-B2E5..B2FF    ; LVT # Lo  [27] HANGUL SYLLABLE DAG..HANGUL SYLLABLE DAH
-B301..B31B    ; LVT # Lo  [27] HANGUL SYLLABLE DAEG..HANGUL SYLLABLE DAEH
-B31D..B337    ; LVT # Lo  [27] HANGUL SYLLABLE DYAG..HANGUL SYLLABLE DYAH
-B339..B353    ; LVT # Lo  [27] HANGUL SYLLABLE DYAEG..HANGUL SYLLABLE DYAEH
-B355..B36F    ; LVT # Lo  [27] HANGUL SYLLABLE DEOG..HANGUL SYLLABLE DEOH
-B371..B38B    ; LVT # Lo  [27] HANGUL SYLLABLE DEG..HANGUL SYLLABLE DEH
-B38D..B3A7    ; LVT # Lo  [27] HANGUL SYLLABLE DYEOG..HANGUL SYLLABLE DYEOH
-B3A9..B3C3    ; LVT # Lo  [27] HANGUL SYLLABLE DYEG..HANGUL SYLLABLE DYEH
-B3C5..B3DF    ; LVT # Lo  [27] HANGUL SYLLABLE DOG..HANGUL SYLLABLE DOH
-B3E1..B3FB    ; LVT # Lo  [27] HANGUL SYLLABLE DWAG..HANGUL SYLLABLE DWAH
-B3FD..B417    ; LVT # Lo  [27] HANGUL SYLLABLE DWAEG..HANGUL SYLLABLE DWAEH
-B419..B433    ; LVT # Lo  [27] HANGUL SYLLABLE DOEG..HANGUL SYLLABLE DOEH
-B435..B44F    ; LVT # Lo  [27] HANGUL SYLLABLE DYOG..HANGUL SYLLABLE DYOH
-B451..B46B    ; LVT # Lo  [27] HANGUL SYLLABLE DUG..HANGUL SYLLABLE DUH
-B46D..B487    ; LVT # Lo  [27] HANGUL SYLLABLE DWEOG..HANGUL SYLLABLE DWEOH
-B489..B4A3    ; LVT # Lo  [27] HANGUL SYLLABLE DWEG..HANGUL SYLLABLE DWEH
-B4A5..B4BF    ; LVT # Lo  [27] HANGUL SYLLABLE DWIG..HANGUL SYLLABLE DWIH
-B4C1..B4DB    ; LVT # Lo  [27] HANGUL SYLLABLE DYUG..HANGUL SYLLABLE DYUH
-B4DD..B4F7    ; LVT # Lo  [27] HANGUL SYLLABLE DEUG..HANGUL SYLLABLE DEUH
-B4F9..B513    ; LVT # Lo  [27] HANGUL SYLLABLE DYIG..HANGUL SYLLABLE DYIH
-B515..B52F    ; LVT # Lo  [27] HANGUL SYLLABLE DIG..HANGUL SYLLABLE DIH
-B531..B54B    ; LVT # Lo  [27] HANGUL SYLLABLE DDAG..HANGUL SYLLABLE DDAH
-B54D..B567    ; LVT # Lo  [27] HANGUL SYLLABLE DDAEG..HANGUL SYLLABLE DDAEH
-B569..B583    ; LVT # Lo  [27] HANGUL SYLLABLE DDYAG..HANGUL SYLLABLE DDYAH
-B585..B59F    ; LVT # Lo  [27] HANGUL SYLLABLE DDYAEG..HANGUL SYLLABLE DDYAEH
-B5A1..B5BB    ; LVT # Lo  [27] HANGUL SYLLABLE DDEOG..HANGUL SYLLABLE DDEOH
-B5BD..B5D7    ; LVT # Lo  [27] HANGUL SYLLABLE DDEG..HANGUL SYLLABLE DDEH
-B5D9..B5F3    ; LVT # Lo  [27] HANGUL SYLLABLE DDYEOG..HANGUL SYLLABLE DDYEOH
-B5F5..B60F    ; LVT # Lo  [27] HANGUL SYLLABLE DDYEG..HANGUL SYLLABLE DDYEH
-B611..B62B    ; LVT # Lo  [27] HANGUL SYLLABLE DDOG..HANGUL SYLLABLE DDOH
-B62D..B647    ; LVT # Lo  [27] HANGUL SYLLABLE DDWAG..HANGUL SYLLABLE DDWAH
-B649..B663    ; LVT # Lo  [27] HANGUL SYLLABLE DDWAEG..HANGUL SYLLABLE DDWAEH
-B665..B67F    ; LVT # Lo  [27] HANGUL SYLLABLE DDOEG..HANGUL SYLLABLE DDOEH
-B681..B69B    ; LVT # Lo  [27] HANGUL SYLLABLE DDYOG..HANGUL SYLLABLE DDYOH
-B69D..B6B7    ; LVT # Lo  [27] HANGUL SYLLABLE DDUG..HANGUL SYLLABLE DDUH
-B6B9..B6D3    ; LVT # Lo  [27] HANGUL SYLLABLE DDWEOG..HANGUL SYLLABLE DDWEOH
-B6D5..B6EF    ; LVT # Lo  [27] HANGUL SYLLABLE DDWEG..HANGUL SYLLABLE DDWEH
-B6F1..B70B    ; LVT # Lo  [27] HANGUL SYLLABLE DDWIG..HANGUL SYLLABLE DDWIH
-B70D..B727    ; LVT # Lo  [27] HANGUL SYLLABLE DDYUG..HANGUL SYLLABLE DDYUH
-B729..B743    ; LVT # Lo  [27] HANGUL SYLLABLE DDEUG..HANGUL SYLLABLE DDEUH
-B745..B75F    ; LVT # Lo  [27] HANGUL SYLLABLE DDYIG..HANGUL SYLLABLE DDYIH
-B761..B77B    ; LVT # Lo  [27] HANGUL SYLLABLE DDIG..HANGUL SYLLABLE DDIH
-B77D..B797    ; LVT # Lo  [27] HANGUL SYLLABLE RAG..HANGUL SYLLABLE RAH
-B799..B7B3    ; LVT # Lo  [27] HANGUL SYLLABLE RAEG..HANGUL SYLLABLE RAEH
-B7B5..B7CF    ; LVT # Lo  [27] HANGUL SYLLABLE RYAG..HANGUL SYLLABLE RYAH
-B7D1..B7EB    ; LVT # Lo  [27] HANGUL SYLLABLE RYAEG..HANGUL SYLLABLE RYAEH
-B7ED..B807    ; LVT # Lo  [27] HANGUL SYLLABLE REOG..HANGUL SYLLABLE REOH
-B809..B823    ; LVT # Lo  [27] HANGUL SYLLABLE REG..HANGUL SYLLABLE REH
-B825..B83F    ; LVT # Lo  [27] HANGUL SYLLABLE RYEOG..HANGUL SYLLABLE RYEOH
-B841..B85B    ; LVT # Lo  [27] HANGUL SYLLABLE RYEG..HANGUL SYLLABLE RYEH
-B85D..B877    ; LVT # Lo  [27] HANGUL SYLLABLE ROG..HANGUL SYLLABLE ROH
-B879..B893    ; LVT # Lo  [27] HANGUL SYLLABLE RWAG..HANGUL SYLLABLE RWAH
-B895..B8AF    ; LVT # Lo  [27] HANGUL SYLLABLE RWAEG..HANGUL SYLLABLE RWAEH
-B8B1..B8CB    ; LVT # Lo  [27] HANGUL SYLLABLE ROEG..HANGUL SYLLABLE ROEH
-B8CD..B8E7    ; LVT # Lo  [27] HANGUL SYLLABLE RYOG..HANGUL SYLLABLE RYOH
-B8E9..B903    ; LVT # Lo  [27] HANGUL SYLLABLE RUG..HANGUL SYLLABLE RUH
-B905..B91F    ; LVT # Lo  [27] HANGUL SYLLABLE RWEOG..HANGUL SYLLABLE RWEOH
-B921..B93B    ; LVT # Lo  [27] HANGUL SYLLABLE RWEG..HANGUL SYLLABLE RWEH
-B93D..B957    ; LVT # Lo  [27] HANGUL SYLLABLE RWIG..HANGUL SYLLABLE RWIH
-B959..B973    ; LVT # Lo  [27] HANGUL SYLLABLE RYUG..HANGUL SYLLABLE RYUH
-B975..B98F    ; LVT # Lo  [27] HANGUL SYLLABLE REUG..HANGUL SYLLABLE REUH
-B991..B9AB    ; LVT # Lo  [27] HANGUL SYLLABLE RYIG..HANGUL SYLLABLE RYIH
-B9AD..B9C7    ; LVT # Lo  [27] HANGUL SYLLABLE RIG..HANGUL SYLLABLE RIH
-B9C9..B9E3    ; LVT # Lo  [27] HANGUL SYLLABLE MAG..HANGUL SYLLABLE MAH
-B9E5..B9FF    ; LVT # Lo  [27] HANGUL SYLLABLE MAEG..HANGUL SYLLABLE MAEH
-BA01..BA1B    ; LVT # Lo  [27] HANGUL SYLLABLE MYAG..HANGUL SYLLABLE MYAH
-BA1D..BA37    ; LVT # Lo  [27] HANGUL SYLLABLE MYAEG..HANGUL SYLLABLE MYAEH
-BA39..BA53    ; LVT # Lo  [27] HANGUL SYLLABLE MEOG..HANGUL SYLLABLE MEOH
-BA55..BA6F    ; LVT # Lo  [27] HANGUL SYLLABLE MEG..HANGUL SYLLABLE MEH
-BA71..BA8B    ; LVT # Lo  [27] HANGUL SYLLABLE MYEOG..HANGUL SYLLABLE MYEOH
-BA8D..BAA7    ; LVT # Lo  [27] HANGUL SYLLABLE MYEG..HANGUL SYLLABLE MYEH
-BAA9..BAC3    ; LVT # Lo  [27] HANGUL SYLLABLE MOG..HANGUL SYLLABLE MOH
-BAC5..BADF    ; LVT # Lo  [27] HANGUL SYLLABLE MWAG..HANGUL SYLLABLE MWAH
-BAE1..BAFB    ; LVT # Lo  [27] HANGUL SYLLABLE MWAEG..HANGUL SYLLABLE MWAEH
-BAFD..BB17    ; LVT # Lo  [27] HANGUL SYLLABLE MOEG..HANGUL SYLLABLE MOEH
-BB19..BB33    ; LVT # Lo  [27] HANGUL SYLLABLE MYOG..HANGUL SYLLABLE MYOH
-BB35..BB4F    ; LVT # Lo  [27] HANGUL SYLLABLE MUG..HANGUL SYLLABLE MUH
-BB51..BB6B    ; LVT # Lo  [27] HANGUL SYLLABLE MWEOG..HANGUL SYLLABLE MWEOH
-BB6D..BB87    ; LVT # Lo  [27] HANGUL SYLLABLE MWEG..HANGUL SYLLABLE MWEH
-BB89..BBA3    ; LVT # Lo  [27] HANGUL SYLLABLE MWIG..HANGUL SYLLABLE MWIH
-BBA5..BBBF    ; LVT # Lo  [27] HANGUL SYLLABLE MYUG..HANGUL SYLLABLE MYUH
-BBC1..BBDB    ; LVT # Lo  [27] HANGUL SYLLABLE MEUG..HANGUL SYLLABLE MEUH
-BBDD..BBF7    ; LVT # Lo  [27] HANGUL SYLLABLE MYIG..HANGUL SYLLABLE MYIH
-BBF9..BC13    ; LVT # Lo  [27] HANGUL SYLLABLE MIG..HANGUL SYLLABLE MIH
-BC15..BC2F    ; LVT # Lo  [27] HANGUL SYLLABLE BAG..HANGUL SYLLABLE BAH
-BC31..BC4B    ; LVT # Lo  [27] HANGUL SYLLABLE BAEG..HANGUL SYLLABLE BAEH
-BC4D..BC67    ; LVT # Lo  [27] HANGUL SYLLABLE BYAG..HANGUL SYLLABLE BYAH
-BC69..BC83    ; LVT # Lo  [27] HANGUL SYLLABLE BYAEG..HANGUL SYLLABLE BYAEH
-BC85..BC9F    ; LVT # Lo  [27] HANGUL SYLLABLE BEOG..HANGUL SYLLABLE BEOH
-BCA1..BCBB    ; LVT # Lo  [27] HANGUL SYLLABLE BEG..HANGUL SYLLABLE BEH
-BCBD..BCD7    ; LVT # Lo  [27] HANGUL SYLLABLE BYEOG..HANGUL SYLLABLE BYEOH
-BCD9..BCF3    ; LVT # Lo  [27] HANGUL SYLLABLE BYEG..HANGUL SYLLABLE BYEH
-BCF5..BD0F    ; LVT # Lo  [27] HANGUL SYLLABLE BOG..HANGUL SYLLABLE BOH
-BD11..BD2B    ; LVT # Lo  [27] HANGUL SYLLABLE BWAG..HANGUL SYLLABLE BWAH
-BD2D..BD47    ; LVT # Lo  [27] HANGUL SYLLABLE BWAEG..HANGUL SYLLABLE BWAEH
-BD49..BD63    ; LVT # Lo  [27] HANGUL SYLLABLE BOEG..HANGUL SYLLABLE BOEH
-BD65..BD7F    ; LVT # Lo  [27] HANGUL SYLLABLE BYOG..HANGUL SYLLABLE BYOH
-BD81..BD9B    ; LVT # Lo  [27] HANGUL SYLLABLE BUG..HANGUL SYLLABLE BUH
-BD9D..BDB7    ; LVT # Lo  [27] HANGUL SYLLABLE BWEOG..HANGUL SYLLABLE BWEOH
-BDB9..BDD3    ; LVT # Lo  [27] HANGUL SYLLABLE BWEG..HANGUL SYLLABLE BWEH
-BDD5..BDEF    ; LVT # Lo  [27] HANGUL SYLLABLE BWIG..HANGUL SYLLABLE BWIH
-BDF1..BE0B    ; LVT # Lo  [27] HANGUL SYLLABLE BYUG..HANGUL SYLLABLE BYUH
-BE0D..BE27    ; LVT # Lo  [27] HANGUL SYLLABLE BEUG..HANGUL SYLLABLE BEUH
-BE29..BE43    ; LVT # Lo  [27] HANGUL SYLLABLE BYIG..HANGUL SYLLABLE BYIH
-BE45..BE5F    ; LVT # Lo  [27] HANGUL SYLLABLE BIG..HANGUL SYLLABLE BIH
-BE61..BE7B    ; LVT # Lo  [27] HANGUL SYLLABLE BBAG..HANGUL SYLLABLE BBAH
-BE7D..BE97    ; LVT # Lo  [27] HANGUL SYLLABLE BBAEG..HANGUL SYLLABLE BBAEH
-BE99..BEB3    ; LVT # Lo  [27] HANGUL SYLLABLE BBYAG..HANGUL SYLLABLE BBYAH
-BEB5..BECF    ; LVT # Lo  [27] HANGUL SYLLABLE BBYAEG..HANGUL SYLLABLE BBYAEH
-BED1..BEEB    ; LVT # Lo  [27] HANGUL SYLLABLE BBEOG..HANGUL SYLLABLE BBEOH
-BEED..BF07    ; LVT # Lo  [27] HANGUL SYLLABLE BBEG..HANGUL SYLLABLE BBEH
-BF09..BF23    ; LVT # Lo  [27] HANGUL SYLLABLE BBYEOG..HANGUL SYLLABLE BBYEOH
-BF25..BF3F    ; LVT # Lo  [27] HANGUL SYLLABLE BBYEG..HANGUL SYLLABLE BBYEH
-BF41..BF5B    ; LVT # Lo  [27] HANGUL SYLLABLE BBOG..HANGUL SYLLABLE BBOH
-BF5D..BF77    ; LVT # Lo  [27] HANGUL SYLLABLE BBWAG..HANGUL SYLLABLE BBWAH
-BF79..BF93    ; LVT # Lo  [27] HANGUL SYLLABLE BBWAEG..HANGUL SYLLABLE BBWAEH
-BF95..BFAF    ; LVT # Lo  [27] HANGUL SYLLABLE BBOEG..HANGUL SYLLABLE BBOEH
-BFB1..BFCB    ; LVT # Lo  [27] HANGUL SYLLABLE BBYOG..HANGUL SYLLABLE BBYOH
-BFCD..BFE7    ; LVT # Lo  [27] HANGUL SYLLABLE BBUG..HANGUL SYLLABLE BBUH
-BFE9..C003    ; LVT # Lo  [27] HANGUL SYLLABLE BBWEOG..HANGUL SYLLABLE BBWEOH
-C005..C01F    ; LVT # Lo  [27] HANGUL SYLLABLE BBWEG..HANGUL SYLLABLE BBWEH
-C021..C03B    ; LVT # Lo  [27] HANGUL SYLLABLE BBWIG..HANGUL SYLLABLE BBWIH
-C03D..C057    ; LVT # Lo  [27] HANGUL SYLLABLE BBYUG..HANGUL SYLLABLE BBYUH
-C059..C073    ; LVT # Lo  [27] HANGUL SYLLABLE BBEUG..HANGUL SYLLABLE BBEUH
-C075..C08F    ; LVT # Lo  [27] HANGUL SYLLABLE BBYIG..HANGUL SYLLABLE BBYIH
-C091..C0AB    ; LVT # Lo  [27] HANGUL SYLLABLE BBIG..HANGUL SYLLABLE BBIH
-C0AD..C0C7    ; LVT # Lo  [27] HANGUL SYLLABLE SAG..HANGUL SYLLABLE SAH
-C0C9..C0E3    ; LVT # Lo  [27] HANGUL SYLLABLE SAEG..HANGUL SYLLABLE SAEH
-C0E5..C0FF    ; LVT # Lo  [27] HANGUL SYLLABLE SYAG..HANGUL SYLLABLE SYAH
-C101..C11B    ; LVT # Lo  [27] HANGUL SYLLABLE SYAEG..HANGUL SYLLABLE SYAEH
-C11D..C137    ; LVT # Lo  [27] HANGUL SYLLABLE SEOG..HANGUL SYLLABLE SEOH
-C139..C153    ; LVT # Lo  [27] HANGUL SYLLABLE SEG..HANGUL SYLLABLE SEH
-C155..C16F    ; LVT # Lo  [27] HANGUL SYLLABLE SYEOG..HANGUL SYLLABLE SYEOH
-C171..C18B    ; LVT # Lo  [27] HANGUL SYLLABLE SYEG..HANGUL SYLLABLE SYEH
-C18D..C1A7    ; LVT # Lo  [27] HANGUL SYLLABLE SOG..HANGUL SYLLABLE SOH
-C1A9..C1C3    ; LVT # Lo  [27] HANGUL SYLLABLE SWAG..HANGUL SYLLABLE SWAH
-C1C5..C1DF    ; LVT # Lo  [27] HANGUL SYLLABLE SWAEG..HANGUL SYLLABLE SWAEH
-C1E1..C1FB    ; LVT # Lo  [27] HANGUL SYLLABLE SOEG..HANGUL SYLLABLE SOEH
-C1FD..C217    ; LVT # Lo  [27] HANGUL SYLLABLE SYOG..HANGUL SYLLABLE SYOH
-C219..C233    ; LVT # Lo  [27] HANGUL SYLLABLE SUG..HANGUL SYLLABLE SUH
-C235..C24F    ; LVT # Lo  [27] HANGUL SYLLABLE SWEOG..HANGUL SYLLABLE SWEOH
-C251..C26B    ; LVT # Lo  [27] HANGUL SYLLABLE SWEG..HANGUL SYLLABLE SWEH
-C26D..C287    ; LVT # Lo  [27] HANGUL SYLLABLE SWIG..HANGUL SYLLABLE SWIH
-C289..C2A3    ; LVT # Lo  [27] HANGUL SYLLABLE SYUG..HANGUL SYLLABLE SYUH
-C2A5..C2BF    ; LVT # Lo  [27] HANGUL SYLLABLE SEUG..HANGUL SYLLABLE SEUH
-C2C1..C2DB    ; LVT # Lo  [27] HANGUL SYLLABLE SYIG..HANGUL SYLLABLE SYIH
-C2DD..C2F7    ; LVT # Lo  [27] HANGUL SYLLABLE SIG..HANGUL SYLLABLE SIH
-C2F9..C313    ; LVT # Lo  [27] HANGUL SYLLABLE SSAG..HANGUL SYLLABLE SSAH
-C315..C32F    ; LVT # Lo  [27] HANGUL SYLLABLE SSAEG..HANGUL SYLLABLE SSAEH
-C331..C34B    ; LVT # Lo  [27] HANGUL SYLLABLE SSYAG..HANGUL SYLLABLE SSYAH
-C34D..C367    ; LVT # Lo  [27] HANGUL SYLLABLE SSYAEG..HANGUL SYLLABLE SSYAEH
-C369..C383    ; LVT # Lo  [27] HANGUL SYLLABLE SSEOG..HANGUL SYLLABLE SSEOH
-C385..C39F    ; LVT # Lo  [27] HANGUL SYLLABLE SSEG..HANGUL SYLLABLE SSEH
-C3A1..C3BB    ; LVT # Lo  [27] HANGUL SYLLABLE SSYEOG..HANGUL SYLLABLE SSYEOH
-C3BD..C3D7    ; LVT # Lo  [27] HANGUL SYLLABLE SSYEG..HANGUL SYLLABLE SSYEH
-C3D9..C3F3    ; LVT # Lo  [27] HANGUL SYLLABLE SSOG..HANGUL SYLLABLE SSOH
-C3F5..C40F    ; LVT # Lo  [27] HANGUL SYLLABLE SSWAG..HANGUL SYLLABLE SSWAH
-C411..C42B    ; LVT # Lo  [27] HANGUL SYLLABLE SSWAEG..HANGUL SYLLABLE SSWAEH
-C42D..C447    ; LVT # Lo  [27] HANGUL SYLLABLE SSOEG..HANGUL SYLLABLE SSOEH
-C449..C463    ; LVT # Lo  [27] HANGUL SYLLABLE SSYOG..HANGUL SYLLABLE SSYOH
-C465..C47F    ; LVT # Lo  [27] HANGUL SYLLABLE SSUG..HANGUL SYLLABLE SSUH
-C481..C49B    ; LVT # Lo  [27] HANGUL SYLLABLE SSWEOG..HANGUL SYLLABLE SSWEOH
-C49D..C4B7    ; LVT # Lo  [27] HANGUL SYLLABLE SSWEG..HANGUL SYLLABLE SSWEH
-C4B9..C4D3    ; LVT # Lo  [27] HANGUL SYLLABLE SSWIG..HANGUL SYLLABLE SSWIH
-C4D5..C4EF    ; LVT # Lo  [27] HANGUL SYLLABLE SSYUG..HANGUL SYLLABLE SSYUH
-C4F1..C50B    ; LVT # Lo  [27] HANGUL SYLLABLE SSEUG..HANGUL SYLLABLE SSEUH
-C50D..C527    ; LVT # Lo  [27] HANGUL SYLLABLE SSYIG..HANGUL SYLLABLE SSYIH
-C529..C543    ; LVT # Lo  [27] HANGUL SYLLABLE SSIG..HANGUL SYLLABLE SSIH
-C545..C55F    ; LVT # Lo  [27] HANGUL SYLLABLE AG..HANGUL SYLLABLE AH
-C561..C57B    ; LVT # Lo  [27] HANGUL SYLLABLE AEG..HANGUL SYLLABLE AEH
-C57D..C597    ; LVT # Lo  [27] HANGUL SYLLABLE YAG..HANGUL SYLLABLE YAH
-C599..C5B3    ; LVT # Lo  [27] HANGUL SYLLABLE YAEG..HANGUL SYLLABLE YAEH
-C5B5..C5CF    ; LVT # Lo  [27] HANGUL SYLLABLE EOG..HANGUL SYLLABLE EOH
-C5D1..C5EB    ; LVT # Lo  [27] HANGUL SYLLABLE EG..HANGUL SYLLABLE EH
-C5ED..C607    ; LVT # Lo  [27] HANGUL SYLLABLE YEOG..HANGUL SYLLABLE YEOH
-C609..C623    ; LVT # Lo  [27] HANGUL SYLLABLE YEG..HANGUL SYLLABLE YEH
-C625..C63F    ; LVT # Lo  [27] HANGUL SYLLABLE OG..HANGUL SYLLABLE OH
-C641..C65B    ; LVT # Lo  [27] HANGUL SYLLABLE WAG..HANGUL SYLLABLE WAH
-C65D..C677    ; LVT # Lo  [27] HANGUL SYLLABLE WAEG..HANGUL SYLLABLE WAEH
-C679..C693    ; LVT # Lo  [27] HANGUL SYLLABLE OEG..HANGUL SYLLABLE OEH
-C695..C6AF    ; LVT # Lo  [27] HANGUL SYLLABLE YOG..HANGUL SYLLABLE YOH
-C6B1..C6CB    ; LVT # Lo  [27] HANGUL SYLLABLE UG..HANGUL SYLLABLE UH
-C6CD..C6E7    ; LVT # Lo  [27] HANGUL SYLLABLE WEOG..HANGUL SYLLABLE WEOH
-C6E9..C703    ; LVT # Lo  [27] HANGUL SYLLABLE WEG..HANGUL SYLLABLE WEH
-C705..C71F    ; LVT # Lo  [27] HANGUL SYLLABLE WIG..HANGUL SYLLABLE WIH
-C721..C73B    ; LVT # Lo  [27] HANGUL SYLLABLE YUG..HANGUL SYLLABLE YUH
-C73D..C757    ; LVT # Lo  [27] HANGUL SYLLABLE EUG..HANGUL SYLLABLE EUH
-C759..C773    ; LVT # Lo  [27] HANGUL SYLLABLE YIG..HANGUL SYLLABLE YIH
-C775..C78F    ; LVT # Lo  [27] HANGUL SYLLABLE IG..HANGUL SYLLABLE IH
-C791..C7AB    ; LVT # Lo  [27] HANGUL SYLLABLE JAG..HANGUL SYLLABLE JAH
-C7AD..C7C7    ; LVT # Lo  [27] HANGUL SYLLABLE JAEG..HANGUL SYLLABLE JAEH
-C7C9..C7E3    ; LVT # Lo  [27] HANGUL SYLLABLE JYAG..HANGUL SYLLABLE JYAH
-C7E5..C7FF    ; LVT # Lo  [27] HANGUL SYLLABLE JYAEG..HANGUL SYLLABLE JYAEH
-C801..C81B    ; LVT # Lo  [27] HANGUL SYLLABLE JEOG..HANGUL SYLLABLE JEOH
-C81D..C837    ; LVT # Lo  [27] HANGUL SYLLABLE JEG..HANGUL SYLLABLE JEH
-C839..C853    ; LVT # Lo  [27] HANGUL SYLLABLE JYEOG..HANGUL SYLLABLE JYEOH
-C855..C86F    ; LVT # Lo  [27] HANGUL SYLLABLE JYEG..HANGUL SYLLABLE JYEH
-C871..C88B    ; LVT # Lo  [27] HANGUL SYLLABLE JOG..HANGUL SYLLABLE JOH
-C88D..C8A7    ; LVT # Lo  [27] HANGUL SYLLABLE JWAG..HANGUL SYLLABLE JWAH
-C8A9..C8C3    ; LVT # Lo  [27] HANGUL SYLLABLE JWAEG..HANGUL SYLLABLE JWAEH
-C8C5..C8DF    ; LVT # Lo  [27] HANGUL SYLLABLE JOEG..HANGUL SYLLABLE JOEH
-C8E1..C8FB    ; LVT # Lo  [27] HANGUL SYLLABLE JYOG..HANGUL SYLLABLE JYOH
-C8FD..C917    ; LVT # Lo  [27] HANGUL SYLLABLE JUG..HANGUL SYLLABLE JUH
-C919..C933    ; LVT # Lo  [27] HANGUL SYLLABLE JWEOG..HANGUL SYLLABLE JWEOH
-C935..C94F    ; LVT # Lo  [27] HANGUL SYLLABLE JWEG..HANGUL SYLLABLE JWEH
-C951..C96B    ; LVT # Lo  [27] HANGUL SYLLABLE JWIG..HANGUL SYLLABLE JWIH
-C96D..C987    ; LVT # Lo  [27] HANGUL SYLLABLE JYUG..HANGUL SYLLABLE JYUH
-C989..C9A3    ; LVT # Lo  [27] HANGUL SYLLABLE JEUG..HANGUL SYLLABLE JEUH
-C9A5..C9BF    ; LVT # Lo  [27] HANGUL SYLLABLE JYIG..HANGUL SYLLABLE JYIH
-C9C1..C9DB    ; LVT # Lo  [27] HANGUL SYLLABLE JIG..HANGUL SYLLABLE JIH
-C9DD..C9F7    ; LVT # Lo  [27] HANGUL SYLLABLE JJAG..HANGUL SYLLABLE JJAH
-C9F9..CA13    ; LVT # Lo  [27] HANGUL SYLLABLE JJAEG..HANGUL SYLLABLE JJAEH
-CA15..CA2F    ; LVT # Lo  [27] HANGUL SYLLABLE JJYAG..HANGUL SYLLABLE JJYAH
-CA31..CA4B    ; LVT # Lo  [27] HANGUL SYLLABLE JJYAEG..HANGUL SYLLABLE JJYAEH
-CA4D..CA67    ; LVT # Lo  [27] HANGUL SYLLABLE JJEOG..HANGUL SYLLABLE JJEOH
-CA69..CA83    ; LVT # Lo  [27] HANGUL SYLLABLE JJEG..HANGUL SYLLABLE JJEH
-CA85..CA9F    ; LVT # Lo  [27] HANGUL SYLLABLE JJYEOG..HANGUL SYLLABLE JJYEOH
-CAA1..CABB    ; LVT # Lo  [27] HANGUL SYLLABLE JJYEG..HANGUL SYLLABLE JJYEH
-CABD..CAD7    ; LVT # Lo  [27] HANGUL SYLLABLE JJOG..HANGUL SYLLABLE JJOH
-CAD9..CAF3    ; LVT # Lo  [27] HANGUL SYLLABLE JJWAG..HANGUL SYLLABLE JJWAH
-CAF5..CB0F    ; LVT # Lo  [27] HANGUL SYLLABLE JJWAEG..HANGUL SYLLABLE JJWAEH
-CB11..CB2B    ; LVT # Lo  [27] HANGUL SYLLABLE JJOEG..HANGUL SYLLABLE JJOEH
-CB2D..CB47    ; LVT # Lo  [27] HANGUL SYLLABLE JJYOG..HANGUL SYLLABLE JJYOH
-CB49..CB63    ; LVT # Lo  [27] HANGUL SYLLABLE JJUG..HANGUL SYLLABLE JJUH
-CB65..CB7F    ; LVT # Lo  [27] HANGUL SYLLABLE JJWEOG..HANGUL SYLLABLE JJWEOH
-CB81..CB9B    ; LVT # Lo  [27] HANGUL SYLLABLE JJWEG..HANGUL SYLLABLE JJWEH
-CB9D..CBB7    ; LVT # Lo  [27] HANGUL SYLLABLE JJWIG..HANGUL SYLLABLE JJWIH
-CBB9..CBD3    ; LVT # Lo  [27] HANGUL SYLLABLE JJYUG..HANGUL SYLLABLE JJYUH
-CBD5..CBEF    ; LVT # Lo  [27] HANGUL SYLLABLE JJEUG..HANGUL SYLLABLE JJEUH
-CBF1..CC0B    ; LVT # Lo  [27] HANGUL SYLLABLE JJYIG..HANGUL SYLLABLE JJYIH
-CC0D..CC27    ; LVT # Lo  [27] HANGUL SYLLABLE JJIG..HANGUL SYLLABLE JJIH
-CC29..CC43    ; LVT # Lo  [27] HANGUL SYLLABLE CAG..HANGUL SYLLABLE CAH
-CC45..CC5F    ; LVT # Lo  [27] HANGUL SYLLABLE CAEG..HANGUL SYLLABLE CAEH
-CC61..CC7B    ; LVT # Lo  [27] HANGUL SYLLABLE CYAG..HANGUL SYLLABLE CYAH
-CC7D..CC97    ; LVT # Lo  [27] HANGUL SYLLABLE CYAEG..HANGUL SYLLABLE CYAEH
-CC99..CCB3    ; LVT # Lo  [27] HANGUL SYLLABLE CEOG..HANGUL SYLLABLE CEOH
-CCB5..CCCF    ; LVT # Lo  [27] HANGUL SYLLABLE CEG..HANGUL SYLLABLE CEH
-CCD1..CCEB    ; LVT # Lo  [27] HANGUL SYLLABLE CYEOG..HANGUL SYLLABLE CYEOH
-CCED..CD07    ; LVT # Lo  [27] HANGUL SYLLABLE CYEG..HANGUL SYLLABLE CYEH
-CD09..CD23    ; LVT # Lo  [27] HANGUL SYLLABLE COG..HANGUL SYLLABLE COH
-CD25..CD3F    ; LVT # Lo  [27] HANGUL SYLLABLE CWAG..HANGUL SYLLABLE CWAH
-CD41..CD5B    ; LVT # Lo  [27] HANGUL SYLLABLE CWAEG..HANGUL SYLLABLE CWAEH
-CD5D..CD77    ; LVT # Lo  [27] HANGUL SYLLABLE COEG..HANGUL SYLLABLE COEH
-CD79..CD93    ; LVT # Lo  [27] HANGUL SYLLABLE CYOG..HANGUL SYLLABLE CYOH
-CD95..CDAF    ; LVT # Lo  [27] HANGUL SYLLABLE CUG..HANGUL SYLLABLE CUH
-CDB1..CDCB    ; LVT # Lo  [27] HANGUL SYLLABLE CWEOG..HANGUL SYLLABLE CWEOH
-CDCD..CDE7    ; LVT # Lo  [27] HANGUL SYLLABLE CWEG..HANGUL SYLLABLE CWEH
-CDE9..CE03    ; LVT # Lo  [27] HANGUL SYLLABLE CWIG..HANGUL SYLLABLE CWIH
-CE05..CE1F    ; LVT # Lo  [27] HANGUL SYLLABLE CYUG..HANGUL SYLLABLE CYUH
-CE21..CE3B    ; LVT # Lo  [27] HANGUL SYLLABLE CEUG..HANGUL SYLLABLE CEUH
-CE3D..CE57    ; LVT # Lo  [27] HANGUL SYLLABLE CYIG..HANGUL SYLLABLE CYIH
-CE59..CE73    ; LVT # Lo  [27] HANGUL SYLLABLE CIG..HANGUL SYLLABLE CIH
-CE75..CE8F    ; LVT # Lo  [27] HANGUL SYLLABLE KAG..HANGUL SYLLABLE KAH
-CE91..CEAB    ; LVT # Lo  [27] HANGUL SYLLABLE KAEG..HANGUL SYLLABLE KAEH
-CEAD..CEC7    ; LVT # Lo  [27] HANGUL SYLLABLE KYAG..HANGUL SYLLABLE KYAH
-CEC9..CEE3    ; LVT # Lo  [27] HANGUL SYLLABLE KYAEG..HANGUL SYLLABLE KYAEH
-CEE5..CEFF    ; LVT # Lo  [27] HANGUL SYLLABLE KEOG..HANGUL SYLLABLE KEOH
-CF01..CF1B    ; LVT # Lo  [27] HANGUL SYLLABLE KEG..HANGUL SYLLABLE KEH
-CF1D..CF37    ; LVT # Lo  [27] HANGUL SYLLABLE KYEOG..HANGUL SYLLABLE KYEOH
-CF39..CF53    ; LVT # Lo  [27] HANGUL SYLLABLE KYEG..HANGUL SYLLABLE KYEH
-CF55..CF6F    ; LVT # Lo  [27] HANGUL SYLLABLE KOG..HANGUL SYLLABLE KOH
-CF71..CF8B    ; LVT # Lo  [27] HANGUL SYLLABLE KWAG..HANGUL SYLLABLE KWAH
-CF8D..CFA7    ; LVT # Lo  [27] HANGUL SYLLABLE KWAEG..HANGUL SYLLABLE KWAEH
-CFA9..CFC3    ; LVT # Lo  [27] HANGUL SYLLABLE KOEG..HANGUL SYLLABLE KOEH
-CFC5..CFDF    ; LVT # Lo  [27] HANGUL SYLLABLE KYOG..HANGUL SYLLABLE KYOH
-CFE1..CFFB    ; LVT # Lo  [27] HANGUL SYLLABLE KUG..HANGUL SYLLABLE KUH
-CFFD..D017    ; LVT # Lo  [27] HANGUL SYLLABLE KWEOG..HANGUL SYLLABLE KWEOH
-D019..D033    ; LVT # Lo  [27] HANGUL SYLLABLE KWEG..HANGUL SYLLABLE KWEH
-D035..D04F    ; LVT # Lo  [27] HANGUL SYLLABLE KWIG..HANGUL SYLLABLE KWIH
-D051..D06B    ; LVT # Lo  [27] HANGUL SYLLABLE KYUG..HANGUL SYLLABLE KYUH
-D06D..D087    ; LVT # Lo  [27] HANGUL SYLLABLE KEUG..HANGUL SYLLABLE KEUH
-D089..D0A3    ; LVT # Lo  [27] HANGUL SYLLABLE KYIG..HANGUL SYLLABLE KYIH
-D0A5..D0BF    ; LVT # Lo  [27] HANGUL SYLLABLE KIG..HANGUL SYLLABLE KIH
-D0C1..D0DB    ; LVT # Lo  [27] HANGUL SYLLABLE TAG..HANGUL SYLLABLE TAH
-D0DD..D0F7    ; LVT # Lo  [27] HANGUL SYLLABLE TAEG..HANGUL SYLLABLE TAEH
-D0F9..D113    ; LVT # Lo  [27] HANGUL SYLLABLE TYAG..HANGUL SYLLABLE TYAH
-D115..D12F    ; LVT # Lo  [27] HANGUL SYLLABLE TYAEG..HANGUL SYLLABLE TYAEH
-D131..D14B    ; LVT # Lo  [27] HANGUL SYLLABLE TEOG..HANGUL SYLLABLE TEOH
-D14D..D167    ; LVT # Lo  [27] HANGUL SYLLABLE TEG..HANGUL SYLLABLE TEH
-D169..D183    ; LVT # Lo  [27] HANGUL SYLLABLE TYEOG..HANGUL SYLLABLE TYEOH
-D185..D19F    ; LVT # Lo  [27] HANGUL SYLLABLE TYEG..HANGUL SYLLABLE TYEH
-D1A1..D1BB    ; LVT # Lo  [27] HANGUL SYLLABLE TOG..HANGUL SYLLABLE TOH
-D1BD..D1D7    ; LVT # Lo  [27] HANGUL SYLLABLE TWAG..HANGUL SYLLABLE TWAH
-D1D9..D1F3    ; LVT # Lo  [27] HANGUL SYLLABLE TWAEG..HANGUL SYLLABLE TWAEH
-D1F5..D20F    ; LVT # Lo  [27] HANGUL SYLLABLE TOEG..HANGUL SYLLABLE TOEH
-D211..D22B    ; LVT # Lo  [27] HANGUL SYLLABLE TYOG..HANGUL SYLLABLE TYOH
-D22D..D247    ; LVT # Lo  [27] HANGUL SYLLABLE TUG..HANGUL SYLLABLE TUH
-D249..D263    ; LVT # Lo  [27] HANGUL SYLLABLE TWEOG..HANGUL SYLLABLE TWEOH
-D265..D27F    ; LVT # Lo  [27] HANGUL SYLLABLE TWEG..HANGUL SYLLABLE TWEH
-D281..D29B    ; LVT # Lo  [27] HANGUL SYLLABLE TWIG..HANGUL SYLLABLE TWIH
-D29D..D2B7    ; LVT # Lo  [27] HANGUL SYLLABLE TYUG..HANGUL SYLLABLE TYUH
-D2B9..D2D3    ; LVT # Lo  [27] HANGUL SYLLABLE TEUG..HANGUL SYLLABLE TEUH
-D2D5..D2EF    ; LVT # Lo  [27] HANGUL SYLLABLE TYIG..HANGUL SYLLABLE TYIH
-D2F1..D30B    ; LVT # Lo  [27] HANGUL SYLLABLE TIG..HANGUL SYLLABLE TIH
-D30D..D327    ; LVT # Lo  [27] HANGUL SYLLABLE PAG..HANGUL SYLLABLE PAH
-D329..D343    ; LVT # Lo  [27] HANGUL SYLLABLE PAEG..HANGUL SYLLABLE PAEH
-D345..D35F    ; LVT # Lo  [27] HANGUL SYLLABLE PYAG..HANGUL SYLLABLE PYAH
-D361..D37B    ; LVT # Lo  [27] HANGUL SYLLABLE PYAEG..HANGUL SYLLABLE PYAEH
-D37D..D397    ; LVT # Lo  [27] HANGUL SYLLABLE PEOG..HANGUL SYLLABLE PEOH
-D399..D3B3    ; LVT # Lo  [27] HANGUL SYLLABLE PEG..HANGUL SYLLABLE PEH
-D3B5..D3CF    ; LVT # Lo  [27] HANGUL SYLLABLE PYEOG..HANGUL SYLLABLE PYEOH
-D3D1..D3EB    ; LVT # Lo  [27] HANGUL SYLLABLE PYEG..HANGUL SYLLABLE PYEH
-D3ED..D407    ; LVT # Lo  [27] HANGUL SYLLABLE POG..HANGUL SYLLABLE POH
-D409..D423    ; LVT # Lo  [27] HANGUL SYLLABLE PWAG..HANGUL SYLLABLE PWAH
-D425..D43F    ; LVT # Lo  [27] HANGUL SYLLABLE PWAEG..HANGUL SYLLABLE PWAEH
-D441..D45B    ; LVT # Lo  [27] HANGUL SYLLABLE POEG..HANGUL SYLLABLE POEH
-D45D..D477    ; LVT # Lo  [27] HANGUL SYLLABLE PYOG..HANGUL SYLLABLE PYOH
-D479..D493    ; LVT # Lo  [27] HANGUL SYLLABLE PUG..HANGUL SYLLABLE PUH
-D495..D4AF    ; LVT # Lo  [27] HANGUL SYLLABLE PWEOG..HANGUL SYLLABLE PWEOH
-D4B1..D4CB    ; LVT # Lo  [27] HANGUL SYLLABLE PWEG..HANGUL SYLLABLE PWEH
-D4CD..D4E7    ; LVT # Lo  [27] HANGUL SYLLABLE PWIG..HANGUL SYLLABLE PWIH
-D4E9..D503    ; LVT # Lo  [27] HANGUL SYLLABLE PYUG..HANGUL SYLLABLE PYUH
-D505..D51F    ; LVT # Lo  [27] HANGUL SYLLABLE PEUG..HANGUL SYLLABLE PEUH
-D521..D53B    ; LVT # Lo  [27] HANGUL SYLLABLE PYIG..HANGUL SYLLABLE PYIH
-D53D..D557    ; LVT # Lo  [27] HANGUL SYLLABLE PIG..HANGUL SYLLABLE PIH
-D559..D573    ; LVT # Lo  [27] HANGUL SYLLABLE HAG..HANGUL SYLLABLE HAH
-D575..D58F    ; LVT # Lo  [27] HANGUL SYLLABLE HAEG..HANGUL SYLLABLE HAEH
-D591..D5AB    ; LVT # Lo  [27] HANGUL SYLLABLE HYAG..HANGUL SYLLABLE HYAH
-D5AD..D5C7    ; LVT # Lo  [27] HANGUL SYLLABLE HYAEG..HANGUL SYLLABLE HYAEH
-D5C9..D5E3    ; LVT # Lo  [27] HANGUL SYLLABLE HEOG..HANGUL SYLLABLE HEOH
-D5E5..D5FF    ; LVT # Lo  [27] HANGUL SYLLABLE HEG..HANGUL SYLLABLE HEH
-D601..D61B    ; LVT # Lo  [27] HANGUL SYLLABLE HYEOG..HANGUL SYLLABLE HYEOH
-D61D..D637    ; LVT # Lo  [27] HANGUL SYLLABLE HYEG..HANGUL SYLLABLE HYEH
-D639..D653    ; LVT # Lo  [27] HANGUL SYLLABLE HOG..HANGUL SYLLABLE HOH
-D655..D66F    ; LVT # Lo  [27] HANGUL SYLLABLE HWAG..HANGUL SYLLABLE HWAH
-D671..D68B    ; LVT # Lo  [27] HANGUL SYLLABLE HWAEG..HANGUL SYLLABLE HWAEH
-D68D..D6A7    ; LVT # Lo  [27] HANGUL SYLLABLE HOEG..HANGUL SYLLABLE HOEH
-D6A9..D6C3    ; LVT # Lo  [27] HANGUL SYLLABLE HYOG..HANGUL SYLLABLE HYOH
-D6C5..D6DF    ; LVT # Lo  [27] HANGUL SYLLABLE HUG..HANGUL SYLLABLE HUH
-D6E1..D6FB    ; LVT # Lo  [27] HANGUL SYLLABLE HWEOG..HANGUL SYLLABLE HWEOH
-D6FD..D717    ; LVT # Lo  [27] HANGUL SYLLABLE HWEG..HANGUL SYLLABLE HWEH
-D719..D733    ; LVT # Lo  [27] HANGUL SYLLABLE HWIG..HANGUL SYLLABLE HWIH
-D735..D74F    ; LVT # Lo  [27] HANGUL SYLLABLE HYUG..HANGUL SYLLABLE HYUH
-D751..D76B    ; LVT # Lo  [27] HANGUL SYLLABLE HEUG..HANGUL SYLLABLE HEUH
-D76D..D787    ; LVT # Lo  [27] HANGUL SYLLABLE HYIG..HANGUL SYLLABLE HYIH
-D789..D7A3    ; LVT # Lo  [27] HANGUL SYLLABLE HIG..HANGUL SYLLABLE HIH
-
-# Total code points: 10773
-
-# EOF
diff --git a/ucd/auxiliary/GraphemeBreakTest.txt b/ucd/auxiliary/GraphemeBreakTest.txt
deleted file mode 100644
index 69096e1..0000000
--- a/ucd/auxiliary/GraphemeBreakTest.txt
+++ /dev/null
@@ -1,123 +0,0 @@
-# GraphemeBreakTest-5.0.0.txt
-# Date: 2006-06-11, 20:09:11 GMT [MD]
-#
-# Unicode Character Database
-# Copyright (c) 1991-2006 Unicode, Inc.
-# For terms of use, see http://www.unicode.org/terms_of_use.html
-# For documentation, see UCD.html
-#
-# Default Grapheme Break Test
-#
-# Format:
-# <string> (# <comment>)? 
-#  <string> contains hex Unicode code points, with 
-#	÷ wherever there is a break opportunity, and 
-#	× wherever there is not.
-#  <comment> the format can change, but currently it shows:
-#	- the sample character name
-#	- (x) the line_break property* for the sample character
-#	- [x] the rule that determines whether there is a break or not
-#
-# These samples may be extended or changed in the future.
-#
-÷ 0020 ÷ 0020 ÷	#  ÷ [0.2] SPACE (Other) ÷ [999.0] SPACE (Other) ÷ [0.3]
-÷ 0020 ÷ 000D ÷	#  ÷ [0.2] SPACE (Other) ÷ [5.0] <CARRIAGE RETURN (CR)> (CR) ÷ [0.3]
-÷ 0020 ÷ 000A ÷	#  ÷ [0.2] SPACE (Other) ÷ [5.0] <LINE FEED (LF)> (LF) ÷ [0.3]
-÷ 0020 ÷ 0001 ÷	#  ÷ [0.2] SPACE (Other) ÷ [5.0] <START OF HEADING> (Control) ÷ [0.3]
-÷ 0020 × 0300 ÷	#  ÷ [0.2] SPACE (Other) × [9.0] COMBINING GRAVE ACCENT (Extend) ÷ [0.3]
-÷ 0020 ÷ 1100 ÷	#  ÷ [0.2] SPACE (Other) ÷ [999.0] HANGUL CHOSEONG KIYEOK (L) ÷ [0.3]
-÷ 0020 ÷ 1160 ÷	#  ÷ [0.2] SPACE (Other) ÷ [999.0] HANGUL JUNGSEONG FILLER (V) ÷ [0.3]
-÷ 0020 ÷ 11A8 ÷	#  ÷ [0.2] SPACE (Other) ÷ [999.0] HANGUL JONGSEONG KIYEOK (T) ÷ [0.3]
-÷ 0020 ÷ AC00 ÷	#  ÷ [0.2] SPACE (Other) ÷ [999.0] HANGUL SYLLABLE GA (LV) ÷ [0.3]
-÷ 0020 ÷ AC01 ÷	#  ÷ [0.2] SPACE (Other) ÷ [999.0] HANGUL SYLLABLE GAG (LVT) ÷ [0.3]
-÷ 000D ÷ 0020 ÷	#  ÷ [0.2] <CARRIAGE RETURN (CR)> (CR) ÷ [4.0] SPACE (Other) ÷ [0.3]
-÷ 000D ÷ 000D ÷	#  ÷ [0.2] <CARRIAGE RETURN (CR)> (CR) ÷ [4.0] <CARRIAGE RETURN (CR)> (CR) ÷ [0.3]
-÷ 000D × 000A ÷	#  ÷ [0.2] <CARRIAGE RETURN (CR)> (CR) × [3.0] <LINE FEED (LF)> (LF) ÷ [0.3]
-÷ 000D ÷ 0001 ÷	#  ÷ [0.2] <CARRIAGE RETURN (CR)> (CR) ÷ [4.0] <START OF HEADING> (Control) ÷ [0.3]
-÷ 000D ÷ 0300 ÷	#  ÷ [0.2] <CARRIAGE RETURN (CR)> (CR) ÷ [4.0] COMBINING GRAVE ACCENT (Extend) ÷ [0.3]
-÷ 000D ÷ 1100 ÷	#  ÷ [0.2] <CARRIAGE RETURN (CR)> (CR) ÷ [4.0] HANGUL CHOSEONG KIYEOK (L) ÷ [0.3]
-÷ 000D ÷ 1160 ÷	#  ÷ [0.2] <CARRIAGE RETURN (CR)> (CR) ÷ [4.0] HANGUL JUNGSEONG FILLER (V) ÷ [0.3]
-÷ 000D ÷ 11A8 ÷	#  ÷ [0.2] <CARRIAGE RETURN (CR)> (CR) ÷ [4.0] HANGUL JONGSEONG KIYEOK (T) ÷ [0.3]
-÷ 000D ÷ AC00 ÷	#  ÷ [0.2] <CARRIAGE RETURN (CR)> (CR) ÷ [4.0] HANGUL SYLLABLE GA (LV) ÷ [0.3]
-÷ 000D ÷ AC01 ÷	#  ÷ [0.2] <CARRIAGE RETURN (CR)> (CR) ÷ [4.0] HANGUL SYLLABLE GAG (LVT) ÷ [0.3]
-÷ 000A ÷ 0020 ÷	#  ÷ [0.2] <LINE FEED (LF)> (LF) ÷ [4.0] SPACE (Other) ÷ [0.3]
-÷ 000A ÷ 000D ÷	#  ÷ [0.2] <LINE FEED (LF)> (LF) ÷ [4.0] <CARRIAGE RETURN (CR)> (CR) ÷ [0.3]
-÷ 000A ÷ 000A ÷	#  ÷ [0.2] <LINE FEED (LF)> (LF) ÷ [4.0] <LINE FEED (LF)> (LF) ÷ [0.3]
-÷ 000A ÷ 0001 ÷	#  ÷ [0.2] <LINE FEED (LF)> (LF) ÷ [4.0] <START OF HEADING> (Control) ÷ [0.3]
-÷ 000A ÷ 0300 ÷	#  ÷ [0.2] <LINE FEED (LF)> (LF) ÷ [4.0] COMBINING GRAVE ACCENT (Extend) ÷ [0.3]
-÷ 000A ÷ 1100 ÷	#  ÷ [0.2] <LINE FEED (LF)> (LF) ÷ [4.0] HANGUL CHOSEONG KIYEOK (L) ÷ [0.3]
-÷ 000A ÷ 1160 ÷	#  ÷ [0.2] <LINE FEED (LF)> (LF) ÷ [4.0] HANGUL JUNGSEONG FILLER (V) ÷ [0.3]
-÷ 000A ÷ 11A8 ÷	#  ÷ [0.2] <LINE FEED (LF)> (LF) ÷ [4.0] HANGUL JONGSEONG KIYEOK (T) ÷ [0.3]
-÷ 000A ÷ AC00 ÷	#  ÷ [0.2] <LINE FEED (LF)> (LF) ÷ [4.0] HANGUL SYLLABLE GA (LV) ÷ [0.3]
-÷ 000A ÷ AC01 ÷	#  ÷ [0.2] <LINE FEED (LF)> (LF) ÷ [4.0] HANGUL SYLLABLE GAG (LVT) ÷ [0.3]
-÷ 0001 ÷ 0020 ÷	#  ÷ [0.2] <START OF HEADING> (Control) ÷ [4.0] SPACE (Other) ÷ [0.3]
-÷ 0001 ÷ 000D ÷	#  ÷ [0.2] <START OF HEADING> (Control) ÷ [4.0] <CARRIAGE RETURN (CR)> (CR) ÷ [0.3]
-÷ 0001 ÷ 000A ÷	#  ÷ [0.2] <START OF HEADING> (Control) ÷ [4.0] <LINE FEED (LF)> (LF) ÷ [0.3]
-÷ 0001 ÷ 0001 ÷	#  ÷ [0.2] <START OF HEADING> (Control) ÷ [4.0] <START OF HEADING> (Control) ÷ [0.3]
-÷ 0001 ÷ 0300 ÷	#  ÷ [0.2] <START OF HEADING> (Control) ÷ [4.0] COMBINING GRAVE ACCENT (Extend) ÷ [0.3]
-÷ 0001 ÷ 1100 ÷	#  ÷ [0.2] <START OF HEADING> (Control) ÷ [4.0] HANGUL CHOSEONG KIYEOK (L) ÷ [0.3]
-÷ 0001 ÷ 1160 ÷	#  ÷ [0.2] <START OF HEADING> (Control) ÷ [4.0] HANGUL JUNGSEONG FILLER (V) ÷ [0.3]
-÷ 0001 ÷ 11A8 ÷	#  ÷ [0.2] <START OF HEADING> (Control) ÷ [4.0] HANGUL JONGSEONG KIYEOK (T) ÷ [0.3]
-÷ 0001 ÷ AC00 ÷	#  ÷ [0.2] <START OF HEADING> (Control) ÷ [4.0] HANGUL SYLLABLE GA (LV) ÷ [0.3]
-÷ 0001 ÷ AC01 ÷	#  ÷ [0.2] <START OF HEADING> (Control) ÷ [4.0] HANGUL SYLLABLE GAG (LVT) ÷ [0.3]
-÷ 0300 ÷ 0020 ÷	#  ÷ [0.2] COMBINING GRAVE ACCENT (Extend) ÷ [999.0] SPACE (Other) ÷ [0.3]
-÷ 0300 ÷ 000D ÷	#  ÷ [0.2] COMBINING GRAVE ACCENT (Extend) ÷ [5.0] <CARRIAGE RETURN (CR)> (CR) ÷ [0.3]
-÷ 0300 ÷ 000A ÷	#  ÷ [0.2] COMBINING GRAVE ACCENT (Extend) ÷ [5.0] <LINE FEED (LF)> (LF) ÷ [0.3]
-÷ 0300 ÷ 0001 ÷	#  ÷ [0.2] COMBINING GRAVE ACCENT (Extend) ÷ [5.0] <START OF HEADING> (Control) ÷ [0.3]
-÷ 0300 × 0300 ÷	#  ÷ [0.2] COMBINING GRAVE ACCENT (Extend) × [9.0] COMBINING GRAVE ACCENT (Extend) ÷ [0.3]
-÷ 0300 ÷ 1100 ÷	#  ÷ [0.2] COMBINING GRAVE ACCENT (Extend) ÷ [999.0] HANGUL CHOSEONG KIYEOK (L) ÷ [0.3]
-÷ 0300 ÷ 1160 ÷	#  ÷ [0.2] COMBINING GRAVE ACCENT (Extend) ÷ [999.0] HANGUL JUNGSEONG FILLER (V) ÷ [0.3]
-÷ 0300 ÷ 11A8 ÷	#  ÷ [0.2] COMBINING GRAVE ACCENT (Extend) ÷ [999.0] HANGUL JONGSEONG KIYEOK (T) ÷ [0.3]
-÷ 0300 ÷ AC00 ÷	#  ÷ [0.2] COMBINING GRAVE ACCENT (Extend) ÷ [999.0] HANGUL SYLLABLE GA (LV) ÷ [0.3]
-÷ 0300 ÷ AC01 ÷	#  ÷ [0.2] COMBINING GRAVE ACCENT (Extend) ÷ [999.0] HANGUL SYLLABLE GAG (LVT) ÷ [0.3]
-÷ 1100 ÷ 0020 ÷	#  ÷ [0.2] HANGUL CHOSEONG KIYEOK (L) ÷ [999.0] SPACE (Other) ÷ [0.3]
-÷ 1100 ÷ 000D ÷	#  ÷ [0.2] HANGUL CHOSEONG KIYEOK (L) ÷ [5.0] <CARRIAGE RETURN (CR)> (CR) ÷ [0.3]
-÷ 1100 ÷ 000A ÷	#  ÷ [0.2] HANGUL CHOSEONG KIYEOK (L) ÷ [5.0] <LINE FEED (LF)> (LF) ÷ [0.3]
-÷ 1100 ÷ 0001 ÷	#  ÷ [0.2] HANGUL CHOSEONG KIYEOK (L) ÷ [5.0] <START OF HEADING> (Control) ÷ [0.3]
-÷ 1100 × 0300 ÷	#  ÷ [0.2] HANGUL CHOSEONG KIYEOK (L) × [9.0] COMBINING GRAVE ACCENT (Extend) ÷ [0.3]
-÷ 1100 × 1100 ÷	#  ÷ [0.2] HANGUL CHOSEONG KIYEOK (L) × [6.0] HANGUL CHOSEONG KIYEOK (L) ÷ [0.3]
-÷ 1100 × 1160 ÷	#  ÷ [0.2] HANGUL CHOSEONG KIYEOK (L) × [6.0] HANGUL JUNGSEONG FILLER (V) ÷ [0.3]
-÷ 1100 ÷ 11A8 ÷	#  ÷ [0.2] HANGUL CHOSEONG KIYEOK (L) ÷ [999.0] HANGUL JONGSEONG KIYEOK (T) ÷ [0.3]
-÷ 1100 × AC00 ÷	#  ÷ [0.2] HANGUL CHOSEONG KIYEOK (L) × [6.0] HANGUL SYLLABLE GA (LV) ÷ [0.3]
-÷ 1100 × AC01 ÷	#  ÷ [0.2] HANGUL CHOSEONG KIYEOK (L) × [6.0] HANGUL SYLLABLE GAG (LVT) ÷ [0.3]
-÷ 1160 ÷ 0020 ÷	#  ÷ [0.2] HANGUL JUNGSEONG FILLER (V) ÷ [999.0] SPACE (Other) ÷ [0.3]
-÷ 1160 ÷ 000D ÷	#  ÷ [0.2] HANGUL JUNGSEONG FILLER (V) ÷ [5.0] <CARRIAGE RETURN (CR)> (CR) ÷ [0.3]
-÷ 1160 ÷ 000A ÷	#  ÷ [0.2] HANGUL JUNGSEONG FILLER (V) ÷ [5.0] <LINE FEED (LF)> (LF) ÷ [0.3]
-÷ 1160 ÷ 0001 ÷	#  ÷ [0.2] HANGUL JUNGSEONG FILLER (V) ÷ [5.0] <START OF HEADING> (Control) ÷ [0.3]
-÷ 1160 × 0300 ÷	#  ÷ [0.2] HANGUL JUNGSEONG FILLER (V) × [9.0] COMBINING GRAVE ACCENT (Extend) ÷ [0.3]
-÷ 1160 ÷ 1100 ÷	#  ÷ [0.2] HANGUL JUNGSEONG FILLER (V) ÷ [999.0] HANGUL CHOSEONG KIYEOK (L) ÷ [0.3]
-÷ 1160 × 1160 ÷	#  ÷ [0.2] HANGUL JUNGSEONG FILLER (V) × [7.0] HANGUL JUNGSEONG FILLER (V) ÷ [0.3]
-÷ 1160 × 11A8 ÷	#  ÷ [0.2] HANGUL JUNGSEONG FILLER (V) × [7.0] HANGUL JONGSEONG KIYEOK (T) ÷ [0.3]
-÷ 1160 ÷ AC00 ÷	#  ÷ [0.2] HANGUL JUNGSEONG FILLER (V) ÷ [999.0] HANGUL SYLLABLE GA (LV) ÷ [0.3]
-÷ 1160 ÷ AC01 ÷	#  ÷ [0.2] HANGUL JUNGSEONG FILLER (V) ÷ [999.0] HANGUL SYLLABLE GAG (LVT) ÷ [0.3]
-÷ 11A8 ÷ 0020 ÷	#  ÷ [0.2] HANGUL JONGSEONG KIYEOK (T) ÷ [999.0] SPACE (Other) ÷ [0.3]
-÷ 11A8 ÷ 000D ÷	#  ÷ [0.2] HANGUL JONGSEONG KIYEOK (T) ÷ [5.0] <CARRIAGE RETURN (CR)> (CR) ÷ [0.3]
-÷ 11A8 ÷ 000A ÷	#  ÷ [0.2] HANGUL JONGSEONG KIYEOK (T) ÷ [5.0] <LINE FEED (LF)> (LF) ÷ [0.3]
-÷ 11A8 ÷ 0001 ÷	#  ÷ [0.2] HANGUL JONGSEONG KIYEOK (T) ÷ [5.0] <START OF HEADING> (Control) ÷ [0.3]
-÷ 11A8 × 0300 ÷	#  ÷ [0.2] HANGUL JONGSEONG KIYEOK (T) × [9.0] COMBINING GRAVE ACCENT (Extend) ÷ [0.3]
-÷ 11A8 ÷ 1100 ÷	#  ÷ [0.2] HANGUL JONGSEONG KIYEOK (T) ÷ [999.0] HANGUL CHOSEONG KIYEOK (L) ÷ [0.3]
-÷ 11A8 ÷ 1160 ÷	#  ÷ [0.2] HANGUL JONGSEONG KIYEOK (T) ÷ [999.0] HANGUL JUNGSEONG FILLER (V) ÷ [0.3]
-÷ 11A8 × 11A8 ÷	#  ÷ [0.2] HANGUL JONGSEONG KIYEOK (T) × [8.0] HANGUL JONGSEONG KIYEOK (T) ÷ [0.3]
-÷ 11A8 ÷ AC00 ÷	#  ÷ [0.2] HANGUL JONGSEONG KIYEOK (T) ÷ [999.0] HANGUL SYLLABLE GA (LV) ÷ [0.3]
-÷ 11A8 ÷ AC01 ÷	#  ÷ [0.2] HANGUL JONGSEONG KIYEOK (T) ÷ [999.0] HANGUL SYLLABLE GAG (LVT) ÷ [0.3]
-÷ AC00 ÷ 0020 ÷	#  ÷ [0.2] HANGUL SYLLABLE GA (LV) ÷ [999.0] SPACE (Other) ÷ [0.3]
-÷ AC00 ÷ 000D ÷	#  ÷ [0.2] HANGUL SYLLABLE GA (LV) ÷ [5.0] <CARRIAGE RETURN (CR)> (CR) ÷ [0.3]
-÷ AC00 ÷ 000A ÷	#  ÷ [0.2] HANGUL SYLLABLE GA (LV) ÷ [5.0] <LINE FEED (LF)> (LF) ÷ [0.3]
-÷ AC00 ÷ 0001 ÷	#  ÷ [0.2] HANGUL SYLLABLE GA (LV) ÷ [5.0] <START OF HEADING> (Control) ÷ [0.3]
-÷ AC00 × 0300 ÷	#  ÷ [0.2] HANGUL SYLLABLE GA (LV) × [9.0] COMBINING GRAVE ACCENT (Extend) ÷ [0.3]
-÷ AC00 ÷ 1100 ÷	#  ÷ [0.2] HANGUL SYLLABLE GA (LV) ÷ [999.0] HANGUL CHOSEONG KIYEOK (L) ÷ [0.3]
-÷ AC00 × 1160 ÷	#  ÷ [0.2] HANGUL SYLLABLE GA (LV) × [7.0] HANGUL JUNGSEONG FILLER (V) ÷ [0.3]
-÷ AC00 × 11A8 ÷	#  ÷ [0.2] HANGUL SYLLABLE GA (LV) × [7.0] HANGUL JONGSEONG KIYEOK (T) ÷ [0.3]
-÷ AC00 ÷ AC00 ÷	#  ÷ [0.2] HANGUL SYLLABLE GA (LV) ÷ [999.0] HANGUL SYLLABLE GA (LV) ÷ [0.3]
-÷ AC00 ÷ AC01 ÷	#  ÷ [0.2] HANGUL SYLLABLE GA (LV) ÷ [999.0] HANGUL SYLLABLE GAG (LVT) ÷ [0.3]
-÷ AC01 ÷ 0020 ÷	#  ÷ [0.2] HANGUL SYLLABLE GAG (LVT) ÷ [999.0] SPACE (Other) ÷ [0.3]
-÷ AC01 ÷ 000D ÷	#  ÷ [0.2] HANGUL SYLLABLE GAG (LVT) ÷ [5.0] <CARRIAGE RETURN (CR)> (CR) ÷ [0.3]
-÷ AC01 ÷ 000A ÷	#  ÷ [0.2] HANGUL SYLLABLE GAG (LVT) ÷ [5.0] <LINE FEED (LF)> (LF) ÷ [0.3]
-÷ AC01 ÷ 0001 ÷	#  ÷ [0.2] HANGUL SYLLABLE GAG (LVT) ÷ [5.0] <START OF HEADING> (Control) ÷ [0.3]
-÷ AC01 × 0300 ÷	#  ÷ [0.2] HANGUL SYLLABLE GAG (LVT) × [9.0] COMBINING GRAVE ACCENT (Extend) ÷ [0.3]
-÷ AC01 ÷ 1100 ÷	#  ÷ [0.2] HANGUL SYLLABLE GAG (LVT) ÷ [999.0] HANGUL CHOSEONG KIYEOK (L) ÷ [0.3]
-÷ AC01 ÷ 1160 ÷	#  ÷ [0.2] HANGUL SYLLABLE GAG (LVT) ÷ [999.0] HANGUL JUNGSEONG FILLER (V) ÷ [0.3]
-÷ AC01 × 11A8 ÷	#  ÷ [0.2] HANGUL SYLLABLE GAG (LVT) × [8.0] HANGUL JONGSEONG KIYEOK (T) ÷ [0.3]
-÷ AC01 ÷ AC00 ÷	#  ÷ [0.2] HANGUL SYLLABLE GAG (LVT) ÷ [999.0] HANGUL SYLLABLE GA (LV) ÷ [0.3]
-÷ AC01 ÷ AC01 ÷	#  ÷ [0.2] HANGUL SYLLABLE GAG (LVT) ÷ [999.0] HANGUL SYLLABLE GAG (LVT) ÷ [0.3]
-# Lines: 100
diff --git a/ucd/auxiliary/SentenceBreakProperty.txt b/ucd/auxiliary/SentenceBreakProperty.txt
deleted file mode 100644
index 3aefc41..0000000
--- a/ucd/auxiliary/SentenceBreakProperty.txt
+++ /dev/null
@@ -1,1664 +0,0 @@
-# SentenceBreakProperty-5.0.0.txt
-# Date: 2006-03-09, 23:14:25 GMT [MD]
-#
-# Unicode Character Database
-# Copyright (c) 1991-2006 Unicode, Inc.
-# For terms of use, see http://www.unicode.org/terms_of_use.html
-# For documentation, see UCD.html
-
-# ================================================
-
-# Property:	Sentence_Break
-
-#  All code points not explicitly listed for Sentence_Break
-#  have the value Other (XX).
-
-# @missing: 0000..10FFFF; Other
-
-# ================================================
-
-000A          ; Sep # Cc       <control-000A>
-000D          ; Sep # Cc       <control-000D>
-0085          ; Sep # Cc       <control-0085>
-2028          ; Sep # Zl       LINE SEPARATOR
-2029          ; Sep # Zp       PARAGRAPH SEPARATOR
-
-# Total code points: 5
-
-# ================================================
-
-00AD          ; Format # Cf       SOFT HYPHEN
-0600..0603    ; Format # Cf   [4] ARABIC NUMBER SIGN..ARABIC SIGN SAFHA
-06DD          ; Format # Cf       ARABIC END OF AYAH
-070F          ; Format # Cf       SYRIAC ABBREVIATION MARK
-17B4..17B5    ; Format # Cf   [2] KHMER VOWEL INHERENT AQ..KHMER VOWEL INHERENT AA
-200B          ; Format # Cf       ZERO WIDTH SPACE
-200E..200F    ; Format # Cf   [2] LEFT-TO-RIGHT MARK..RIGHT-TO-LEFT MARK
-202A..202E    ; Format # Cf   [5] LEFT-TO-RIGHT EMBEDDING..RIGHT-TO-LEFT OVERRIDE
-2060..2063    ; Format # Cf   [4] WORD JOINER..INVISIBLE SEPARATOR
-206A..206F    ; Format # Cf   [6] INHIBIT SYMMETRIC SWAPPING..NOMINAL DIGIT SHAPES
-FEFF          ; Format # Cf       ZERO WIDTH NO-BREAK SPACE
-FFF9..FFFB    ; Format # Cf   [3] INTERLINEAR ANNOTATION ANCHOR..INTERLINEAR ANNOTATION TERMINATOR
-1D173..1D17A  ; Format # Cf   [8] MUSICAL SYMBOL BEGIN BEAM..MUSICAL SYMBOL END PHRASE
-E0001         ; Format # Cf       LANGUAGE TAG
-E0020..E007F  ; Format # Cf  [96] TAG SPACE..CANCEL TAG
-
-# Total code points: 136
-
-# ================================================
-
-0009          ; Sp # Cc       <control-0009>
-000B..000C    ; Sp # Cc   [2] <control-000B>..<control-000C>
-0020          ; Sp # Zs       SPACE
-1680          ; Sp # Zs       OGHAM SPACE MARK
-180E          ; Sp # Zs       MONGOLIAN VOWEL SEPARATOR
-2000..200A    ; Sp # Zs  [11] EN QUAD..HAIR SPACE
-202F          ; Sp # Zs       NARROW NO-BREAK SPACE
-205F          ; Sp # Zs       MEDIUM MATHEMATICAL SPACE
-3000          ; Sp # Zs       IDEOGRAPHIC SPACE
-
-# Total code points: 20
-
-# ================================================
-
-0061..007A    ; Lower # L&  [26] LATIN SMALL LETTER A..LATIN SMALL LETTER Z
-00AA          ; Lower # L&       FEMININE ORDINAL INDICATOR
-00B5          ; Lower # L&       MICRO SIGN
-00BA          ; Lower # L&       MASCULINE ORDINAL INDICATOR
-00DF..00F6    ; Lower # L&  [24] LATIN SMALL LETTER SHARP S..LATIN SMALL LETTER O WITH DIAERESIS
-00F8..00FF    ; Lower # L&   [8] LATIN SMALL LETTER O WITH STROKE..LATIN SMALL LETTER Y WITH DIAERESIS
-0101          ; Lower # L&       LATIN SMALL LETTER A WITH MACRON
-0103          ; Lower # L&       LATIN SMALL LETTER A WITH BREVE
-0105          ; Lower # L&       LATIN SMALL LETTER A WITH OGONEK
-0107          ; Lower # L&       LATIN SMALL LETTER C WITH ACUTE
-0109          ; Lower # L&       LATIN SMALL LETTER C WITH CIRCUMFLEX
-010B          ; Lower # L&       LATIN SMALL LETTER C WITH DOT ABOVE
-010D          ; Lower # L&       LATIN SMALL LETTER C WITH CARON
-010F          ; Lower # L&       LATIN SMALL LETTER D WITH CARON
-0111          ; Lower # L&       LATIN SMALL LETTER D WITH STROKE
-0113          ; Lower # L&       LATIN SMALL LETTER E WITH MACRON
-0115          ; Lower # L&       LATIN SMALL LETTER E WITH BREVE
-0117          ; Lower # L&       LATIN SMALL LETTER E WITH DOT ABOVE
-0119          ; Lower # L&       LATIN SMALL LETTER E WITH OGONEK
-011B          ; Lower # L&       LATIN SMALL LETTER E WITH CARON
-011D          ; Lower # L&       LATIN SMALL LETTER G WITH CIRCUMFLEX
-011F          ; Lower # L&       LATIN SMALL LETTER G WITH BREVE
-0121          ; Lower # L&       LATIN SMALL LETTER G WITH DOT ABOVE
-0123          ; Lower # L&       LATIN SMALL LETTER G WITH CEDILLA
-0125          ; Lower # L&       LATIN SMALL LETTER H WITH CIRCUMFLEX
-0127          ; Lower # L&       LATIN SMALL LETTER H WITH STROKE
-0129          ; Lower # L&       LATIN SMALL LETTER I WITH TILDE
-012B          ; Lower # L&       LATIN SMALL LETTER I WITH MACRON
-012D          ; Lower # L&       LATIN SMALL LETTER I WITH BREVE
-012F          ; Lower # L&       LATIN SMALL LETTER I WITH OGONEK
-0131          ; Lower # L&       LATIN SMALL LETTER DOTLESS I
-0133          ; Lower # L&       LATIN SMALL LIGATURE IJ
-0135          ; Lower # L&       LATIN SMALL LETTER J WITH CIRCUMFLEX
-0137..0138    ; Lower # L&   [2] LATIN SMALL LETTER K WITH CEDILLA..LATIN SMALL LETTER KRA
-013A          ; Lower # L&       LATIN SMALL LETTER L WITH ACUTE
-013C          ; Lower # L&       LATIN SMALL LETTER L WITH CEDILLA
-013E          ; Lower # L&       LATIN SMALL LETTER L WITH CARON
-0140          ; Lower # L&       LATIN SMALL LETTER L WITH MIDDLE DOT
-0142          ; Lower # L&       LATIN SMALL LETTER L WITH STROKE
-0144          ; Lower # L&       LATIN SMALL LETTER N WITH ACUTE
-0146          ; Lower # L&       LATIN SMALL LETTER N WITH CEDILLA
-0148..0149    ; Lower # L&   [2] LATIN SMALL LETTER N WITH CARON..LATIN SMALL LETTER N PRECEDED BY APOSTROPHE
-014B          ; Lower # L&       LATIN SMALL LETTER ENG
-014D          ; Lower # L&       LATIN SMALL LETTER O WITH MACRON
-014F          ; Lower # L&       LATIN SMALL LETTER O WITH BREVE
-0151          ; Lower # L&       LATIN SMALL LETTER O WITH DOUBLE ACUTE
-0153          ; Lower # L&       LATIN SMALL LIGATURE OE
-0155          ; Lower # L&       LATIN SMALL LETTER R WITH ACUTE
-0157          ; Lower # L&       LATIN SMALL LETTER R WITH CEDILLA
-0159          ; Lower # L&       LATIN SMALL LETTER R WITH CARON
-015B          ; Lower # L&       LATIN SMALL LETTER S WITH ACUTE
-015D          ; Lower # L&       LATIN SMALL LETTER S WITH CIRCUMFLEX
-015F          ; Lower # L&       LATIN SMALL LETTER S WITH CEDILLA
-0161          ; Lower # L&       LATIN SMALL LETTER S WITH CARON
-0163          ; Lower # L&       LATIN SMALL LETTER T WITH CEDILLA
-0165          ; Lower # L&       LATIN SMALL LETTER T WITH CARON
-0167          ; Lower # L&       LATIN SMALL LETTER T WITH STROKE
-0169          ; Lower # L&       LATIN SMALL LETTER U WITH TILDE
-016B          ; Lower # L&       LATIN SMALL LETTER U WITH MACRON
-016D          ; Lower # L&       LATIN SMALL LETTER U WITH BREVE
-016F          ; Lower # L&       LATIN SMALL LETTER U WITH RING ABOVE
-0171          ; Lower # L&       LATIN SMALL LETTER U WITH DOUBLE ACUTE
-0173          ; Lower # L&       LATIN SMALL LETTER U WITH OGONEK
-0175          ; Lower # L&       LATIN SMALL LETTER W WITH CIRCUMFLEX
-0177          ; Lower # L&       LATIN SMALL LETTER Y WITH CIRCUMFLEX
-017A          ; Lower # L&       LATIN SMALL LETTER Z WITH ACUTE
-017C          ; Lower # L&       LATIN SMALL LETTER Z WITH DOT ABOVE
-017E..0180    ; Lower # L&   [3] LATIN SMALL LETTER Z WITH CARON..LATIN SMALL LETTER B WITH STROKE
-0183          ; Lower # L&       LATIN SMALL LETTER B WITH TOPBAR
-0185          ; Lower # L&       LATIN SMALL LETTER TONE SIX
-0188          ; Lower # L&       LATIN SMALL LETTER C WITH HOOK
-018C..018D    ; Lower # L&   [2] LATIN SMALL LETTER D WITH TOPBAR..LATIN SMALL LETTER TURNED DELTA
-0192          ; Lower # L&       LATIN SMALL LETTER F WITH HOOK
-0195          ; Lower # L&       LATIN SMALL LETTER HV
-0199..019B    ; Lower # L&   [3] LATIN SMALL LETTER K WITH HOOK..LATIN SMALL LETTER LAMBDA WITH STROKE
-019E          ; Lower # L&       LATIN SMALL LETTER N WITH LONG RIGHT LEG
-01A1          ; Lower # L&       LATIN SMALL LETTER O WITH HORN
-01A3          ; Lower # L&       LATIN SMALL LETTER OI
-01A5          ; Lower # L&       LATIN SMALL LETTER P WITH HOOK
-01A8          ; Lower # L&       LATIN SMALL LETTER TONE TWO
-01AA..01AB    ; Lower # L&   [2] LATIN LETTER REVERSED ESH LOOP..LATIN SMALL LETTER T WITH PALATAL HOOK
-01AD          ; Lower # L&       LATIN SMALL LETTER T WITH HOOK
-01B0          ; Lower # L&       LATIN SMALL LETTER U WITH HORN
-01B4          ; Lower # L&       LATIN SMALL LETTER Y WITH HOOK
-01B6          ; Lower # L&       LATIN SMALL LETTER Z WITH STROKE
-01B9..01BA    ; Lower # L&   [2] LATIN SMALL LETTER EZH REVERSED..LATIN SMALL LETTER EZH WITH TAIL
-01BD..01BF    ; Lower # L&   [3] LATIN SMALL LETTER TONE FIVE..LATIN LETTER WYNN
-01C6          ; Lower # L&       LATIN SMALL LETTER DZ WITH CARON
-01C9          ; Lower # L&       LATIN SMALL LETTER LJ
-01CC          ; Lower # L&       LATIN SMALL LETTER NJ
-01CE          ; Lower # L&       LATIN SMALL LETTER A WITH CARON
-01D0          ; Lower # L&       LATIN SMALL LETTER I WITH CARON
-01D2          ; Lower # L&       LATIN SMALL LETTER O WITH CARON
-01D4          ; Lower # L&       LATIN SMALL LETTER U WITH CARON
-01D6          ; Lower # L&       LATIN SMALL LETTER U WITH DIAERESIS AND MACRON
-01D8          ; Lower # L&       LATIN SMALL LETTER U WITH DIAERESIS AND ACUTE
-01DA          ; Lower # L&       LATIN SMALL LETTER U WITH DIAERESIS AND CARON
-01DC..01DD    ; Lower # L&   [2] LATIN SMALL LETTER U WITH DIAERESIS AND GRAVE..LATIN SMALL LETTER TURNED E
-01DF          ; Lower # L&       LATIN SMALL LETTER A WITH DIAERESIS AND MACRON
-01E1          ; Lower # L&       LATIN SMALL LETTER A WITH DOT ABOVE AND MACRON
-01E3          ; Lower # L&       LATIN SMALL LETTER AE WITH MACRON
-01E5          ; Lower # L&       LATIN SMALL LETTER G WITH STROKE
-01E7          ; Lower # L&       LATIN SMALL LETTER G WITH CARON
-01E9          ; Lower # L&       LATIN SMALL LETTER K WITH CARON
-01EB          ; Lower # L&       LATIN SMALL LETTER O WITH OGONEK
-01ED          ; Lower # L&       LATIN SMALL LETTER O WITH OGONEK AND MACRON
-01EF..01F0    ; Lower # L&   [2] LATIN SMALL LETTER EZH WITH CARON..LATIN SMALL LETTER J WITH CARON
-01F3          ; Lower # L&       LATIN SMALL LETTER DZ
-01F5          ; Lower # L&       LATIN SMALL LETTER G WITH ACUTE
-01F9          ; Lower # L&       LATIN SMALL LETTER N WITH GRAVE
-01FB          ; Lower # L&       LATIN SMALL LETTER A WITH RING ABOVE AND ACUTE
-01FD          ; Lower # L&       LATIN SMALL LETTER AE WITH ACUTE
-01FF          ; Lower # L&       LATIN SMALL LETTER O WITH STROKE AND ACUTE
-0201          ; Lower # L&       LATIN SMALL LETTER A WITH DOUBLE GRAVE
-0203          ; Lower # L&       LATIN SMALL LETTER A WITH INVERTED BREVE
-0205          ; Lower # L&       LATIN SMALL LETTER E WITH DOUBLE GRAVE
-0207          ; Lower # L&       LATIN SMALL LETTER E WITH INVERTED BREVE
-0209          ; Lower # L&       LATIN SMALL LETTER I WITH DOUBLE GRAVE
-020B          ; Lower # L&       LATIN SMALL LETTER I WITH INVERTED BREVE
-020D          ; Lower # L&       LATIN SMALL LETTER O WITH DOUBLE GRAVE
-020F          ; Lower # L&       LATIN SMALL LETTER O WITH INVERTED BREVE
-0211          ; Lower # L&       LATIN SMALL LETTER R WITH DOUBLE GRAVE
-0213          ; Lower # L&       LATIN SMALL LETTER R WITH INVERTED BREVE
-0215          ; Lower # L&       LATIN SMALL LETTER U WITH DOUBLE GRAVE
-0217          ; Lower # L&       LATIN SMALL LETTER U WITH INVERTED BREVE
-0219          ; Lower # L&       LATIN SMALL LETTER S WITH COMMA BELOW
-021B          ; Lower # L&       LATIN SMALL LETTER T WITH COMMA BELOW
-021D          ; Lower # L&       LATIN SMALL LETTER YOGH
-021F          ; Lower # L&       LATIN SMALL LETTER H WITH CARON
-0221          ; Lower # L&       LATIN SMALL LETTER D WITH CURL
-0223          ; Lower # L&       LATIN SMALL LETTER OU
-0225          ; Lower # L&       LATIN SMALL LETTER Z WITH HOOK
-0227          ; Lower # L&       LATIN SMALL LETTER A WITH DOT ABOVE
-0229          ; Lower # L&       LATIN SMALL LETTER E WITH CEDILLA
-022B          ; Lower # L&       LATIN SMALL LETTER O WITH DIAERESIS AND MACRON
-022D          ; Lower # L&       LATIN SMALL LETTER O WITH TILDE AND MACRON
-022F          ; Lower # L&       LATIN SMALL LETTER O WITH DOT ABOVE
-0231          ; Lower # L&       LATIN SMALL LETTER O WITH DOT ABOVE AND MACRON
-0233..0239    ; Lower # L&   [7] LATIN SMALL LETTER Y WITH MACRON..LATIN SMALL LETTER QP DIGRAPH
-023C          ; Lower # L&       LATIN SMALL LETTER C WITH STROKE
-023F..0240    ; Lower # L&   [2] LATIN SMALL LETTER S WITH SWASH TAIL..LATIN SMALL LETTER Z WITH SWASH TAIL
-0242          ; Lower # L&       LATIN SMALL LETTER GLOTTAL STOP
-0247          ; Lower # L&       LATIN SMALL LETTER E WITH STROKE
-0249          ; Lower # L&       LATIN SMALL LETTER J WITH STROKE
-024B          ; Lower # L&       LATIN SMALL LETTER Q WITH HOOK TAIL
-024D          ; Lower # L&       LATIN SMALL LETTER R WITH STROKE
-024F..0293    ; Lower # L&  [69] LATIN SMALL LETTER Y WITH STROKE..LATIN SMALL LETTER EZH WITH CURL
-0295..02AF    ; Lower # L&  [27] LATIN LETTER PHARYNGEAL VOICED FRICATIVE..LATIN SMALL LETTER TURNED H WITH FISHHOOK AND TAIL
-02B0..02B8    ; Lower # Lm   [9] MODIFIER LETTER SMALL H..MODIFIER LETTER SMALL Y
-02C0..02C1    ; Lower # Lm   [2] MODIFIER LETTER GLOTTAL STOP..MODIFIER LETTER REVERSED GLOTTAL STOP
-02E0..02E4    ; Lower # Lm   [5] MODIFIER LETTER SMALL GAMMA..MODIFIER LETTER SMALL REVERSED GLOTTAL STOP
-037A          ; Lower # Lm       GREEK YPOGEGRAMMENI
-037B..037D    ; Lower # L&   [3] GREEK SMALL REVERSED LUNATE SIGMA SYMBOL..GREEK SMALL REVERSED DOTTED LUNATE SIGMA SYMBOL
-0390          ; Lower # L&       GREEK SMALL LETTER IOTA WITH DIALYTIKA AND TONOS
-03AC..03CE    ; Lower # L&  [35] GREEK SMALL LETTER ALPHA WITH TONOS..GREEK SMALL LETTER OMEGA WITH TONOS
-03D0..03D1    ; Lower # L&   [2] GREEK BETA SYMBOL..GREEK THETA SYMBOL
-03D5..03D7    ; Lower # L&   [3] GREEK PHI SYMBOL..GREEK KAI SYMBOL
-03D9          ; Lower # L&       GREEK SMALL LETTER ARCHAIC KOPPA
-03DB          ; Lower # L&       GREEK SMALL LETTER STIGMA
-03DD          ; Lower # L&       GREEK SMALL LETTER DIGAMMA
-03DF          ; Lower # L&       GREEK SMALL LETTER KOPPA
-03E1          ; Lower # L&       GREEK SMALL LETTER SAMPI
-03E3          ; Lower # L&       COPTIC SMALL LETTER SHEI
-03E5          ; Lower # L&       COPTIC SMALL LETTER FEI
-03E7          ; Lower # L&       COPTIC SMALL LETTER KHEI
-03E9          ; Lower # L&       COPTIC SMALL LETTER HORI
-03EB          ; Lower # L&       COPTIC SMALL LETTER GANGIA
-03ED          ; Lower # L&       COPTIC SMALL LETTER SHIMA
-03EF..03F3    ; Lower # L&   [5] COPTIC SMALL LETTER DEI..GREEK LETTER YOT
-03F5          ; Lower # L&       GREEK LUNATE EPSILON SYMBOL
-03F8          ; Lower # L&       GREEK SMALL LETTER SHO
-03FB..03FC    ; Lower # L&   [2] GREEK SMALL LETTER SAN..GREEK RHO WITH STROKE SYMBOL
-0430..045F    ; Lower # L&  [48] CYRILLIC SMALL LETTER A..CYRILLIC SMALL LETTER DZHE
-0461          ; Lower # L&       CYRILLIC SMALL LETTER OMEGA
-0463          ; Lower # L&       CYRILLIC SMALL LETTER YAT
-0465          ; Lower # L&       CYRILLIC SMALL LETTER IOTIFIED E
-0467          ; Lower # L&       CYRILLIC SMALL LETTER LITTLE YUS
-0469          ; Lower # L&       CYRILLIC SMALL LETTER IOTIFIED LITTLE YUS
-046B          ; Lower # L&       CYRILLIC SMALL LETTER BIG YUS
-046D          ; Lower # L&       CYRILLIC SMALL LETTER IOTIFIED BIG YUS
-046F          ; Lower # L&       CYRILLIC SMALL LETTER KSI
-0471          ; Lower # L&       CYRILLIC SMALL LETTER PSI
-0473          ; Lower # L&       CYRILLIC SMALL LETTER FITA
-0475          ; Lower # L&       CYRILLIC SMALL LETTER IZHITSA
-0477          ; Lower # L&       CYRILLIC SMALL LETTER IZHITSA WITH DOUBLE GRAVE ACCENT
-0479          ; Lower # L&       CYRILLIC SMALL LETTER UK
-047B          ; Lower # L&       CYRILLIC SMALL LETTER ROUND OMEGA
-047D          ; Lower # L&       CYRILLIC SMALL LETTER OMEGA WITH TITLO
-047F          ; Lower # L&       CYRILLIC SMALL LETTER OT
-0481          ; Lower # L&       CYRILLIC SMALL LETTER KOPPA
-048B          ; Lower # L&       CYRILLIC SMALL LETTER SHORT I WITH TAIL
-048D          ; Lower # L&       CYRILLIC SMALL LETTER SEMISOFT SIGN
-048F          ; Lower # L&       CYRILLIC SMALL LETTER ER WITH TICK
-0491          ; Lower # L&       CYRILLIC SMALL LETTER GHE WITH UPTURN
-0493          ; Lower # L&       CYRILLIC SMALL LETTER GHE WITH STROKE
-0495          ; Lower # L&       CYRILLIC SMALL LETTER GHE WITH MIDDLE HOOK
-0497          ; Lower # L&       CYRILLIC SMALL LETTER ZHE WITH DESCENDER
-0499          ; Lower # L&       CYRILLIC SMALL LETTER ZE WITH DESCENDER
-049B          ; Lower # L&       CYRILLIC SMALL LETTER KA WITH DESCENDER
-049D          ; Lower # L&       CYRILLIC SMALL LETTER KA WITH VERTICAL STROKE
-049F          ; Lower # L&       CYRILLIC SMALL LETTER KA WITH STROKE
-04A1          ; Lower # L&       CYRILLIC SMALL LETTER BASHKIR KA
-04A3          ; Lower # L&       CYRILLIC SMALL LETTER EN WITH DESCENDER
-04A5          ; Lower # L&       CYRILLIC SMALL LIGATURE EN GHE
-04A7          ; Lower # L&       CYRILLIC SMALL LETTER PE WITH MIDDLE HOOK
-04A9          ; Lower # L&       CYRILLIC SMALL LETTER ABKHASIAN HA
-04AB          ; Lower # L&       CYRILLIC SMALL LETTER ES WITH DESCENDER
-04AD          ; Lower # L&       CYRILLIC SMALL LETTER TE WITH DESCENDER
-04AF          ; Lower # L&       CYRILLIC SMALL LETTER STRAIGHT U
-04B1          ; Lower # L&       CYRILLIC SMALL LETTER STRAIGHT U WITH STROKE
-04B3          ; Lower # L&       CYRILLIC SMALL LETTER HA WITH DESCENDER
-04B5          ; Lower # L&       CYRILLIC SMALL LIGATURE TE TSE
-04B7          ; Lower # L&       CYRILLIC SMALL LETTER CHE WITH DESCENDER
-04B9          ; Lower # L&       CYRILLIC SMALL LETTER CHE WITH VERTICAL STROKE
-04BB          ; Lower # L&       CYRILLIC SMALL LETTER SHHA
-04BD          ; Lower # L&       CYRILLIC SMALL LETTER ABKHASIAN CHE
-04BF          ; Lower # L&       CYRILLIC SMALL LETTER ABKHASIAN CHE WITH DESCENDER
-04C2          ; Lower # L&       CYRILLIC SMALL LETTER ZHE WITH BREVE
-04C4          ; Lower # L&       CYRILLIC SMALL LETTER KA WITH HOOK
-04C6          ; Lower # L&       CYRILLIC SMALL LETTER EL WITH TAIL
-04C8          ; Lower # L&       CYRILLIC SMALL LETTER EN WITH HOOK
-04CA          ; Lower # L&       CYRILLIC SMALL LETTER EN WITH TAIL
-04CC          ; Lower # L&       CYRILLIC SMALL LETTER KHAKASSIAN CHE
-04CE..04CF    ; Lower # L&   [2] CYRILLIC SMALL LETTER EM WITH TAIL..CYRILLIC SMALL LETTER PALOCHKA
-04D1          ; Lower # L&       CYRILLIC SMALL LETTER A WITH BREVE
-04D3          ; Lower # L&       CYRILLIC SMALL LETTER A WITH DIAERESIS
-04D5          ; Lower # L&       CYRILLIC SMALL LIGATURE A IE
-04D7          ; Lower # L&       CYRILLIC SMALL LETTER IE WITH BREVE
-04D9          ; Lower # L&       CYRILLIC SMALL LETTER SCHWA
-04DB          ; Lower # L&       CYRILLIC SMALL LETTER SCHWA WITH DIAERESIS
-04DD          ; Lower # L&       CYRILLIC SMALL LETTER ZHE WITH DIAERESIS
-04DF          ; Lower # L&       CYRILLIC SMALL LETTER ZE WITH DIAERESIS
-04E1          ; Lower # L&       CYRILLIC SMALL LETTER ABKHASIAN DZE
-04E3          ; Lower # L&       CYRILLIC SMALL LETTER I WITH MACRON
-04E5          ; Lower # L&       CYRILLIC SMALL LETTER I WITH DIAERESIS
-04E7          ; Lower # L&       CYRILLIC SMALL LETTER O WITH DIAERESIS
-04E9          ; Lower # L&       CYRILLIC SMALL LETTER BARRED O
-04EB          ; Lower # L&       CYRILLIC SMALL LETTER BARRED O WITH DIAERESIS
-04ED          ; Lower # L&       CYRILLIC SMALL LETTER E WITH DIAERESIS
-04EF          ; Lower # L&       CYRILLIC SMALL LETTER U WITH MACRON
-04F1          ; Lower # L&       CYRILLIC SMALL LETTER U WITH DIAERESIS
-04F3          ; Lower # L&       CYRILLIC SMALL LETTER U WITH DOUBLE ACUTE
-04F5          ; Lower # L&       CYRILLIC SMALL LETTER CHE WITH DIAERESIS
-04F7          ; Lower # L&       CYRILLIC SMALL LETTER GHE WITH DESCENDER
-04F9          ; Lower # L&       CYRILLIC SMALL LETTER YERU WITH DIAERESIS
-04FB          ; Lower # L&       CYRILLIC SMALL LETTER GHE WITH STROKE AND HOOK
-04FD          ; Lower # L&       CYRILLIC SMALL LETTER HA WITH HOOK
-04FF          ; Lower # L&       CYRILLIC SMALL LETTER HA WITH STROKE
-0501          ; Lower # L&       CYRILLIC SMALL LETTER KOMI DE
-0503          ; Lower # L&       CYRILLIC SMALL LETTER KOMI DJE
-0505          ; Lower # L&       CYRILLIC SMALL LETTER KOMI ZJE
-0507          ; Lower # L&       CYRILLIC SMALL LETTER KOMI DZJE
-0509          ; Lower # L&       CYRILLIC SMALL LETTER KOMI LJE
-050B          ; Lower # L&       CYRILLIC SMALL LETTER KOMI NJE
-050D          ; Lower # L&       CYRILLIC SMALL LETTER KOMI SJE
-050F          ; Lower # L&       CYRILLIC SMALL LETTER KOMI TJE
-0511          ; Lower # L&       CYRILLIC SMALL LETTER REVERSED ZE
-0513          ; Lower # L&       CYRILLIC SMALL LETTER EL WITH HOOK
-0561..0587    ; Lower # L&  [39] ARMENIAN SMALL LETTER AYB..ARMENIAN SMALL LIGATURE ECH YIWN
-1D00..1D2B    ; Lower # L&  [44] LATIN LETTER SMALL CAPITAL A..CYRILLIC LETTER SMALL CAPITAL EL
-1D2C..1D61    ; Lower # Lm  [54] MODIFIER LETTER CAPITAL A..MODIFIER LETTER SMALL CHI
-1D62..1D77    ; Lower # L&  [22] LATIN SUBSCRIPT SMALL LETTER I..LATIN SMALL LETTER TURNED G
-1D78          ; Lower # Lm       MODIFIER LETTER CYRILLIC EN
-1D79..1D9A    ; Lower # L&  [34] LATIN SMALL LETTER INSULAR G..LATIN SMALL LETTER EZH WITH RETROFLEX HOOK
-1D9B..1DBF    ; Lower # Lm  [37] MODIFIER LETTER SMALL TURNED ALPHA..MODIFIER LETTER SMALL THETA
-1E01          ; Lower # L&       LATIN SMALL LETTER A WITH RING BELOW
-1E03          ; Lower # L&       LATIN SMALL LETTER B WITH DOT ABOVE
-1E05          ; Lower # L&       LATIN SMALL LETTER B WITH DOT BELOW
-1E07          ; Lower # L&       LATIN SMALL LETTER B WITH LINE BELOW
-1E09          ; Lower # L&       LATIN SMALL LETTER C WITH CEDILLA AND ACUTE
-1E0B          ; Lower # L&       LATIN SMALL LETTER D WITH DOT ABOVE
-1E0D          ; Lower # L&       LATIN SMALL LETTER D WITH DOT BELOW
-1E0F          ; Lower # L&       LATIN SMALL LETTER D WITH LINE BELOW
-1E11          ; Lower # L&       LATIN SMALL LETTER D WITH CEDILLA
-1E13          ; Lower # L&       LATIN SMALL LETTER D WITH CIRCUMFLEX BELOW
-1E15          ; Lower # L&       LATIN SMALL LETTER E WITH MACRON AND GRAVE
-1E17          ; Lower # L&       LATIN SMALL LETTER E WITH MACRON AND ACUTE
-1E19          ; Lower # L&       LATIN SMALL LETTER E WITH CIRCUMFLEX BELOW
-1E1B          ; Lower # L&       LATIN SMALL LETTER E WITH TILDE BELOW
-1E1D          ; Lower # L&       LATIN SMALL LETTER E WITH CEDILLA AND BREVE
-1E1F          ; Lower # L&       LATIN SMALL LETTER F WITH DOT ABOVE
-1E21          ; Lower # L&       LATIN SMALL LETTER G WITH MACRON
-1E23          ; Lower # L&       LATIN SMALL LETTER H WITH DOT ABOVE
-1E25          ; Lower # L&       LATIN SMALL LETTER H WITH DOT BELOW
-1E27          ; Lower # L&       LATIN SMALL LETTER H WITH DIAERESIS
-1E29          ; Lower # L&       LATIN SMALL LETTER H WITH CEDILLA
-1E2B          ; Lower # L&       LATIN SMALL LETTER H WITH BREVE BELOW
-1E2D          ; Lower # L&       LATIN SMALL LETTER I WITH TILDE BELOW
-1E2F          ; Lower # L&       LATIN SMALL LETTER I WITH DIAERESIS AND ACUTE
-1E31          ; Lower # L&       LATIN SMALL LETTER K WITH ACUTE
-1E33          ; Lower # L&       LATIN SMALL LETTER K WITH DOT BELOW
-1E35          ; Lower # L&       LATIN SMALL LETTER K WITH LINE BELOW
-1E37          ; Lower # L&       LATIN SMALL LETTER L WITH DOT BELOW
-1E39          ; Lower # L&       LATIN SMALL LETTER L WITH DOT BELOW AND MACRON
-1E3B          ; Lower # L&       LATIN SMALL LETTER L WITH LINE BELOW
-1E3D          ; Lower # L&       LATIN SMALL LETTER L WITH CIRCUMFLEX BELOW
-1E3F          ; Lower # L&       LATIN SMALL LETTER M WITH ACUTE
-1E41          ; Lower # L&       LATIN SMALL LETTER M WITH DOT ABOVE
-1E43          ; Lower # L&       LATIN SMALL LETTER M WITH DOT BELOW
-1E45          ; Lower # L&       LATIN SMALL LETTER N WITH DOT ABOVE
-1E47          ; Lower # L&       LATIN SMALL LETTER N WITH DOT BELOW
-1E49          ; Lower # L&       LATIN SMALL LETTER N WITH LINE BELOW
-1E4B          ; Lower # L&       LATIN SMALL LETTER N WITH CIRCUMFLEX BELOW
-1E4D          ; Lower # L&       LATIN SMALL LETTER O WITH TILDE AND ACUTE
-1E4F          ; Lower # L&       LATIN SMALL LETTER O WITH TILDE AND DIAERESIS
-1E51          ; Lower # L&       LATIN SMALL LETTER O WITH MACRON AND GRAVE
-1E53          ; Lower # L&       LATIN SMALL LETTER O WITH MACRON AND ACUTE
-1E55          ; Lower # L&       LATIN SMALL LETTER P WITH ACUTE
-1E57          ; Lower # L&       LATIN SMALL LETTER P WITH DOT ABOVE
-1E59          ; Lower # L&       LATIN SMALL LETTER R WITH DOT ABOVE
-1E5B          ; Lower # L&       LATIN SMALL LETTER R WITH DOT BELOW
-1E5D          ; Lower # L&       LATIN SMALL LETTER R WITH DOT BELOW AND MACRON
-1E5F          ; Lower # L&       LATIN SMALL LETTER R WITH LINE BELOW
-1E61          ; Lower # L&       LATIN SMALL LETTER S WITH DOT ABOVE
-1E63          ; Lower # L&       LATIN SMALL LETTER S WITH DOT BELOW
-1E65          ; Lower # L&       LATIN SMALL LETTER S WITH ACUTE AND DOT ABOVE
-1E67          ; Lower # L&       LATIN SMALL LETTER S WITH CARON AND DOT ABOVE
-1E69          ; Lower # L&       LATIN SMALL LETTER S WITH DOT BELOW AND DOT ABOVE
-1E6B          ; Lower # L&       LATIN SMALL LETTER T WITH DOT ABOVE
-1E6D          ; Lower # L&       LATIN SMALL LETTER T WITH DOT BELOW
-1E6F          ; Lower # L&       LATIN SMALL LETTER T WITH LINE BELOW
-1E71          ; Lower # L&       LATIN SMALL LETTER T WITH CIRCUMFLEX BELOW
-1E73          ; Lower # L&       LATIN SMALL LETTER U WITH DIAERESIS BELOW
-1E75          ; Lower # L&       LATIN SMALL LETTER U WITH TILDE BELOW
-1E77          ; Lower # L&       LATIN SMALL LETTER U WITH CIRCUMFLEX BELOW
-1E79          ; Lower # L&       LATIN SMALL LETTER U WITH TILDE AND ACUTE
-1E7B          ; Lower # L&       LATIN SMALL LETTER U WITH MACRON AND DIAERESIS
-1E7D          ; Lower # L&       LATIN SMALL LETTER V WITH TILDE
-1E7F          ; Lower # L&       LATIN SMALL LETTER V WITH DOT BELOW
-1E81          ; Lower # L&       LATIN SMALL LETTER W WITH GRAVE
-1E83          ; Lower # L&       LATIN SMALL LETTER W WITH ACUTE
-1E85          ; Lower # L&       LATIN SMALL LETTER W WITH DIAERESIS
-1E87          ; Lower # L&       LATIN SMALL LETTER W WITH DOT ABOVE
-1E89          ; Lower # L&       LATIN SMALL LETTER W WITH DOT BELOW
-1E8B          ; Lower # L&       LATIN SMALL LETTER X WITH DOT ABOVE
-1E8D          ; Lower # L&       LATIN SMALL LETTER X WITH DIAERESIS
-1E8F          ; Lower # L&       LATIN SMALL LETTER Y WITH DOT ABOVE
-1E91          ; Lower # L&       LATIN SMALL LETTER Z WITH CIRCUMFLEX
-1E93          ; Lower # L&       LATIN SMALL LETTER Z WITH DOT BELOW
-1E95..1E9B    ; Lower # L&   [7] LATIN SMALL LETTER Z WITH LINE BELOW..LATIN SMALL LETTER LONG S WITH DOT ABOVE
-1EA1          ; Lower # L&       LATIN SMALL LETTER A WITH DOT BELOW
-1EA3          ; Lower # L&       LATIN SMALL LETTER A WITH HOOK ABOVE
-1EA5          ; Lower # L&       LATIN SMALL LETTER A WITH CIRCUMFLEX AND ACUTE
-1EA7          ; Lower # L&       LATIN SMALL LETTER A WITH CIRCUMFLEX AND GRAVE
-1EA9          ; Lower # L&       LATIN SMALL LETTER A WITH CIRCUMFLEX AND HOOK ABOVE
-1EAB          ; Lower # L&       LATIN SMALL LETTER A WITH CIRCUMFLEX AND TILDE
-1EAD          ; Lower # L&       LATIN SMALL LETTER A WITH CIRCUMFLEX AND DOT BELOW
-1EAF          ; Lower # L&       LATIN SMALL LETTER A WITH BREVE AND ACUTE
-1EB1          ; Lower # L&       LATIN SMALL LETTER A WITH BREVE AND GRAVE
-1EB3          ; Lower # L&       LATIN SMALL LETTER A WITH BREVE AND HOOK ABOVE
-1EB5          ; Lower # L&       LATIN SMALL LETTER A WITH BREVE AND TILDE
-1EB7          ; Lower # L&       LATIN SMALL LETTER A WITH BREVE AND DOT BELOW
-1EB9          ; Lower # L&       LATIN SMALL LETTER E WITH DOT BELOW
-1EBB          ; Lower # L&       LATIN SMALL LETTER E WITH HOOK ABOVE
-1EBD          ; Lower # L&       LATIN SMALL LETTER E WITH TILDE
-1EBF          ; Lower # L&       LATIN SMALL LETTER E WITH CIRCUMFLEX AND ACUTE
-1EC1          ; Lower # L&       LATIN SMALL LETTER E WITH CIRCUMFLEX AND GRAVE
-1EC3          ; Lower # L&       LATIN SMALL LETTER E WITH CIRCUMFLEX AND HOOK ABOVE
-1EC5          ; Lower # L&       LATIN SMALL LETTER E WITH CIRCUMFLEX AND TILDE
-1EC7          ; Lower # L&       LATIN SMALL LETTER E WITH CIRCUMFLEX AND DOT BELOW
-1EC9          ; Lower # L&       LATIN SMALL LETTER I WITH HOOK ABOVE
-1ECB          ; Lower # L&       LATIN SMALL LETTER I WITH DOT BELOW
-1ECD          ; Lower # L&       LATIN SMALL LETTER O WITH DOT BELOW
-1ECF          ; Lower # L&       LATIN SMALL LETTER O WITH HOOK ABOVE
-1ED1          ; Lower # L&       LATIN SMALL LETTER O WITH CIRCUMFLEX AND ACUTE
-1ED3          ; Lower # L&       LATIN SMALL LETTER O WITH CIRCUMFLEX AND GRAVE
-1ED5          ; Lower # L&       LATIN SMALL LETTER O WITH CIRCUMFLEX AND HOOK ABOVE
-1ED7          ; Lower # L&       LATIN SMALL LETTER O WITH CIRCUMFLEX AND TILDE
-1ED9          ; Lower # L&       LATIN SMALL LETTER O WITH CIRCUMFLEX AND DOT BELOW
-1EDB          ; Lower # L&       LATIN SMALL LETTER O WITH HORN AND ACUTE
-1EDD          ; Lower # L&       LATIN SMALL LETTER O WITH HORN AND GRAVE
-1EDF          ; Lower # L&       LATIN SMALL LETTER O WITH HORN AND HOOK ABOVE
-1EE1          ; Lower # L&       LATIN SMALL LETTER O WITH HORN AND TILDE
-1EE3          ; Lower # L&       LATIN SMALL LETTER O WITH HORN AND DOT BELOW
-1EE5          ; Lower # L&       LATIN SMALL LETTER U WITH DOT BELOW
-1EE7          ; Lower # L&       LATIN SMALL LETTER U WITH HOOK ABOVE
-1EE9          ; Lower # L&       LATIN SMALL LETTER U WITH HORN AND ACUTE
-1EEB          ; Lower # L&       LATIN SMALL LETTER U WITH HORN AND GRAVE
-1EED          ; Lower # L&       LATIN SMALL LETTER U WITH HORN AND HOOK ABOVE
-1EEF          ; Lower # L&       LATIN SMALL LETTER U WITH HORN AND TILDE
-1EF1          ; Lower # L&       LATIN SMALL LETTER U WITH HORN AND DOT BELOW
-1EF3          ; Lower # L&       LATIN SMALL LETTER Y WITH GRAVE
-1EF5          ; Lower # L&       LATIN SMALL LETTER Y WITH DOT BELOW
-1EF7          ; Lower # L&       LATIN SMALL LETTER Y WITH HOOK ABOVE
-1EF9          ; Lower # L&       LATIN SMALL LETTER Y WITH TILDE
-1F00..1F07    ; Lower # L&   [8] GREEK SMALL LETTER ALPHA WITH PSILI..GREEK SMALL LETTER ALPHA WITH DASIA AND PERISPOMENI
-1F10..1F15    ; Lower # L&   [6] GREEK SMALL LETTER EPSILON WITH PSILI..GREEK SMALL LETTER EPSILON WITH DASIA AND OXIA
-1F20..1F27    ; Lower # L&   [8] GREEK SMALL LETTER ETA WITH PSILI..GREEK SMALL LETTER ETA WITH DASIA AND PERISPOMENI
-1F30..1F37    ; Lower # L&   [8] GREEK SMALL LETTER IOTA WITH PSILI..GREEK SMALL LETTER IOTA WITH DASIA AND PERISPOMENI
-1F40..1F45    ; Lower # L&   [6] GREEK SMALL LETTER OMICRON WITH PSILI..GREEK SMALL LETTER OMICRON WITH DASIA AND OXIA
-1F50..1F57    ; Lower # L&   [8] GREEK SMALL LETTER UPSILON WITH PSILI..GREEK SMALL LETTER UPSILON WITH DASIA AND PERISPOMENI
-1F60..1F67    ; Lower # L&   [8] GREEK SMALL LETTER OMEGA WITH PSILI..GREEK SMALL LETTER OMEGA WITH DASIA AND PERISPOMENI
-1F70..1F7D    ; Lower # L&  [14] GREEK SMALL LETTER ALPHA WITH VARIA..GREEK SMALL LETTER OMEGA WITH OXIA
-1F80..1F87    ; Lower # L&   [8] GREEK SMALL LETTER ALPHA WITH PSILI AND YPOGEGRAMMENI..GREEK SMALL LETTER ALPHA WITH DASIA AND PERISPOMENI AND YPOGEGRAMMENI
-1F90..1F97    ; Lower # L&   [8] GREEK SMALL LETTER ETA WITH PSILI AND YPOGEGRAMMENI..GREEK SMALL LETTER ETA WITH DASIA AND PERISPOMENI AND YPOGEGRAMMENI
-1FA0..1FA7    ; Lower # L&   [8] GREEK SMALL LETTER OMEGA WITH PSILI AND YPOGEGRAMMENI..GREEK SMALL LETTER OMEGA WITH DASIA AND PERISPOMENI AND YPOGEGRAMMENI
-1FB0..1FB4    ; Lower # L&   [5] GREEK SMALL LETTER ALPHA WITH VRACHY..GREEK SMALL LETTER ALPHA WITH OXIA AND YPOGEGRAMMENI
-1FB6..1FB7    ; Lower # L&   [2] GREEK SMALL LETTER ALPHA WITH PERISPOMENI..GREEK SMALL LETTER ALPHA WITH PERISPOMENI AND YPOGEGRAMMENI
-1FBE          ; Lower # L&       GREEK PROSGEGRAMMENI
-1FC2..1FC4    ; Lower # L&   [3] GREEK SMALL LETTER ETA WITH VARIA AND YPOGEGRAMMENI..GREEK SMALL LETTER ETA WITH OXIA AND YPOGEGRAMMENI
-1FC6..1FC7    ; Lower # L&   [2] GREEK SMALL LETTER ETA WITH PERISPOMENI..GREEK SMALL LETTER ETA WITH PERISPOMENI AND YPOGEGRAMMENI
-1FD0..1FD3    ; Lower # L&   [4] GREEK SMALL LETTER IOTA WITH VRACHY..GREEK SMALL LETTER IOTA WITH DIALYTIKA AND OXIA
-1FD6..1FD7    ; Lower # L&   [2] GREEK SMALL LETTER IOTA WITH PERISPOMENI..GREEK SMALL LETTER IOTA WITH DIALYTIKA AND PERISPOMENI
-1FE0..1FE7    ; Lower # L&   [8] GREEK SMALL LETTER UPSILON WITH VRACHY..GREEK SMALL LETTER UPSILON WITH DIALYTIKA AND PERISPOMENI
-1FF2..1FF4    ; Lower # L&   [3] GREEK SMALL LETTER OMEGA WITH VARIA AND YPOGEGRAMMENI..GREEK SMALL LETTER OMEGA WITH OXIA AND YPOGEGRAMMENI
-1FF6..1FF7    ; Lower # L&   [2] GREEK SMALL LETTER OMEGA WITH PERISPOMENI..GREEK SMALL LETTER OMEGA WITH PERISPOMENI AND YPOGEGRAMMENI
-2071          ; Lower # L&       SUPERSCRIPT LATIN SMALL LETTER I
-207F          ; Lower # L&       SUPERSCRIPT LATIN SMALL LETTER N
-2090..2094    ; Lower # Lm   [5] LATIN SUBSCRIPT SMALL LETTER A..LATIN SUBSCRIPT SMALL LETTER SCHWA
-210A          ; Lower # L&       SCRIPT SMALL G
-210E..210F    ; Lower # L&   [2] PLANCK CONSTANT..PLANCK CONSTANT OVER TWO PI
-2113          ; Lower # L&       SCRIPT SMALL L
-212F          ; Lower # L&       SCRIPT SMALL E
-2134          ; Lower # L&       SCRIPT SMALL O
-2139          ; Lower # L&       INFORMATION SOURCE
-213C..213D    ; Lower # L&   [2] DOUBLE-STRUCK SMALL PI..DOUBLE-STRUCK SMALL GAMMA
-2146..2149    ; Lower # L&   [4] DOUBLE-STRUCK ITALIC SMALL D..DOUBLE-STRUCK ITALIC SMALL J
-214E          ; Lower # L&       TURNED SMALL F
-2170..217F    ; Lower # Nl  [16] SMALL ROMAN NUMERAL ONE..SMALL ROMAN NUMERAL ONE THOUSAND
-2184          ; Lower # L&       LATIN SMALL LETTER REVERSED C
-24D0..24E9    ; Lower # So  [26] CIRCLED LATIN SMALL LETTER A..CIRCLED LATIN SMALL LETTER Z
-2C30..2C5E    ; Lower # L&  [47] GLAGOLITIC SMALL LETTER AZU..GLAGOLITIC SMALL LETTER LATINATE MYSLITE
-2C61          ; Lower # L&       LATIN SMALL LETTER L WITH DOUBLE BAR
-2C65..2C66    ; Lower # L&   [2] LATIN SMALL LETTER A WITH STROKE..LATIN SMALL LETTER T WITH DIAGONAL STROKE
-2C68          ; Lower # L&       LATIN SMALL LETTER H WITH DESCENDER
-2C6A          ; Lower # L&       LATIN SMALL LETTER K WITH DESCENDER
-2C6C          ; Lower # L&       LATIN SMALL LETTER Z WITH DESCENDER
-2C74          ; Lower # L&       LATIN SMALL LETTER V WITH CURL
-2C76..2C77    ; Lower # L&   [2] LATIN SMALL LETTER HALF H..LATIN SMALL LETTER TAILLESS PHI
-2C81          ; Lower # L&       COPTIC SMALL LETTER ALFA
-2C83          ; Lower # L&       COPTIC SMALL LETTER VIDA
-2C85          ; Lower # L&       COPTIC SMALL LETTER GAMMA
-2C87          ; Lower # L&       COPTIC SMALL LETTER DALDA
-2C89          ; Lower # L&       COPTIC SMALL LETTER EIE
-2C8B          ; Lower # L&       COPTIC SMALL LETTER SOU
-2C8D          ; Lower # L&       COPTIC SMALL LETTER ZATA
-2C8F          ; Lower # L&       COPTIC SMALL LETTER HATE
-2C91          ; Lower # L&       COPTIC SMALL LETTER THETHE
-2C93          ; Lower # L&       COPTIC SMALL LETTER IAUDA
-2C95          ; Lower # L&       COPTIC SMALL LETTER KAPA
-2C97          ; Lower # L&       COPTIC SMALL LETTER LAULA
-2C99          ; Lower # L&       COPTIC SMALL LETTER MI
-2C9B          ; Lower # L&       COPTIC SMALL LETTER NI
-2C9D          ; Lower # L&       COPTIC SMALL LETTER KSI
-2C9F          ; Lower # L&       COPTIC SMALL LETTER O
-2CA1          ; Lower # L&       COPTIC SMALL LETTER PI
-2CA3          ; Lower # L&       COPTIC SMALL LETTER RO
-2CA5          ; Lower # L&       COPTIC SMALL LETTER SIMA
-2CA7          ; Lower # L&       COPTIC SMALL LETTER TAU
-2CA9          ; Lower # L&       COPTIC SMALL LETTER UA
-2CAB          ; Lower # L&       COPTIC SMALL LETTER FI
-2CAD          ; Lower # L&       COPTIC SMALL LETTER KHI
-2CAF          ; Lower # L&       COPTIC SMALL LETTER PSI
-2CB1          ; Lower # L&       COPTIC SMALL LETTER OOU
-2CB3          ; Lower # L&       COPTIC SMALL LETTER DIALECT-P ALEF
-2CB5          ; Lower # L&       COPTIC SMALL LETTER OLD COPTIC AIN
-2CB7          ; Lower # L&       COPTIC SMALL LETTER CRYPTOGRAMMIC EIE
-2CB9          ; Lower # L&       COPTIC SMALL LETTER DIALECT-P KAPA
-2CBB          ; Lower # L&       COPTIC SMALL LETTER DIALECT-P NI
-2CBD          ; Lower # L&       COPTIC SMALL LETTER CRYPTOGRAMMIC NI
-2CBF          ; Lower # L&       COPTIC SMALL LETTER OLD COPTIC OOU
-2CC1          ; Lower # L&       COPTIC SMALL LETTER SAMPI
-2CC3          ; Lower # L&       COPTIC SMALL LETTER CROSSED SHEI
-2CC5          ; Lower # L&       COPTIC SMALL LETTER OLD COPTIC SHEI
-2CC7          ; Lower # L&       COPTIC SMALL LETTER OLD COPTIC ESH
-2CC9          ; Lower # L&       COPTIC SMALL LETTER AKHMIMIC KHEI
-2CCB          ; Lower # L&       COPTIC SMALL LETTER DIALECT-P HORI
-2CCD          ; Lower # L&       COPTIC SMALL LETTER OLD COPTIC HORI
-2CCF          ; Lower # L&       COPTIC SMALL LETTER OLD COPTIC HA
-2CD1          ; Lower # L&       COPTIC SMALL LETTER L-SHAPED HA
-2CD3          ; Lower # L&       COPTIC SMALL LETTER OLD COPTIC HEI
-2CD5          ; Lower # L&       COPTIC SMALL LETTER OLD COPTIC HAT
-2CD7          ; Lower # L&       COPTIC SMALL LETTER OLD COPTIC GANGIA
-2CD9          ; Lower # L&       COPTIC SMALL LETTER OLD COPTIC DJA
-2CDB          ; Lower # L&       COPTIC SMALL LETTER OLD COPTIC SHIMA
-2CDD          ; Lower # L&       COPTIC SMALL LETTER OLD NUBIAN SHIMA
-2CDF          ; Lower # L&       COPTIC SMALL LETTER OLD NUBIAN NGI
-2CE1          ; Lower # L&       COPTIC SMALL LETTER OLD NUBIAN NYI
-2CE3..2CE4    ; Lower # L&   [2] COPTIC SMALL LETTER OLD NUBIAN WAU..COPTIC SYMBOL KAI
-2D00..2D25    ; Lower # L&  [38] GEORGIAN SMALL LETTER AN..GEORGIAN SMALL LETTER HOE
-FB00..FB06    ; Lower # L&   [7] LATIN SMALL LIGATURE FF..LATIN SMALL LIGATURE ST
-FB13..FB17    ; Lower # L&   [5] ARMENIAN SMALL LIGATURE MEN NOW..ARMENIAN SMALL LIGATURE MEN XEH
-FF41..FF5A    ; Lower # L&  [26] FULLWIDTH LATIN SMALL LETTER A..FULLWIDTH LATIN SMALL LETTER Z
-10428..1044F  ; Lower # L&  [40] DESERET SMALL LETTER LONG I..DESERET SMALL LETTER EW
-1D41A..1D433  ; Lower # L&  [26] MATHEMATICAL BOLD SMALL A..MATHEMATICAL BOLD SMALL Z
-1D44E..1D454  ; Lower # L&   [7] MATHEMATICAL ITALIC SMALL A..MATHEMATICAL ITALIC SMALL G
-1D456..1D467  ; Lower # L&  [18] MATHEMATICAL ITALIC SMALL I..MATHEMATICAL ITALIC SMALL Z
-1D482..1D49B  ; Lower # L&  [26] MATHEMATICAL BOLD ITALIC SMALL A..MATHEMATICAL BOLD ITALIC SMALL Z
-1D4B6..1D4B9  ; Lower # L&   [4] MATHEMATICAL SCRIPT SMALL A..MATHEMATICAL SCRIPT SMALL D
-1D4BB         ; Lower # L&       MATHEMATICAL SCRIPT SMALL F
-1D4BD..1D4C3  ; Lower # L&   [7] MATHEMATICAL SCRIPT SMALL H..MATHEMATICAL SCRIPT SMALL N
-1D4C5..1D4CF  ; Lower # L&  [11] MATHEMATICAL SCRIPT SMALL P..MATHEMATICAL SCRIPT SMALL Z
-1D4EA..1D503  ; Lower # L&  [26] MATHEMATICAL BOLD SCRIPT SMALL A..MATHEMATICAL BOLD SCRIPT SMALL Z
-1D51E..1D537  ; Lower # L&  [26] MATHEMATICAL FRAKTUR SMALL A..MATHEMATICAL FRAKTUR SMALL Z
-1D552..1D56B  ; Lower # L&  [26] MATHEMATICAL DOUBLE-STRUCK SMALL A..MATHEMATICAL DOUBLE-STRUCK SMALL Z
-1D586..1D59F  ; Lower # L&  [26] MATHEMATICAL BOLD FRAKTUR SMALL A..MATHEMATICAL BOLD FRAKTUR SMALL Z
-1D5BA..1D5D3  ; Lower # L&  [26] MATHEMATICAL SANS-SERIF SMALL A..MATHEMATICAL SANS-SERIF SMALL Z
-1D5EE..1D607  ; Lower # L&  [26] MATHEMATICAL SANS-SERIF BOLD SMALL A..MATHEMATICAL SANS-SERIF BOLD SMALL Z
-1D622..1D63B  ; Lower # L&  [26] MATHEMATICAL SANS-SERIF ITALIC SMALL A..MATHEMATICAL SANS-SERIF ITALIC SMALL Z
-1D656..1D66F  ; Lower # L&  [26] MATHEMATICAL SANS-SERIF BOLD ITALIC SMALL A..MATHEMATICAL SANS-SERIF BOLD ITALIC SMALL Z
-1D68A..1D6A5  ; Lower # L&  [28] MATHEMATICAL MONOSPACE SMALL A..MATHEMATICAL ITALIC SMALL DOTLESS J
-1D6C2..1D6DA  ; Lower # L&  [25] MATHEMATICAL BOLD SMALL ALPHA..MATHEMATICAL BOLD SMALL OMEGA
-1D6DC..1D6E1  ; Lower # L&   [6] MATHEMATICAL BOLD EPSILON SYMBOL..MATHEMATICAL BOLD PI SYMBOL
-1D6FC..1D714  ; Lower # L&  [25] MATHEMATICAL ITALIC SMALL ALPHA..MATHEMATICAL ITALIC SMALL OMEGA
-1D716..1D71B  ; Lower # L&   [6] MATHEMATICAL ITALIC EPSILON SYMBOL..MATHEMATICAL ITALIC PI SYMBOL
-1D736..1D74E  ; Lower # L&  [25] MATHEMATICAL BOLD ITALIC SMALL ALPHA..MATHEMATICAL BOLD ITALIC SMALL OMEGA
-1D750..1D755  ; Lower # L&   [6] MATHEMATICAL BOLD ITALIC EPSILON SYMBOL..MATHEMATICAL BOLD ITALIC PI SYMBOL
-1D770..1D788  ; Lower # L&  [25] MATHEMATICAL SANS-SERIF BOLD SMALL ALPHA..MATHEMATICAL SANS-SERIF BOLD SMALL OMEGA
-1D78A..1D78F  ; Lower # L&   [6] MATHEMATICAL SANS-SERIF BOLD EPSILON SYMBOL..MATHEMATICAL SANS-SERIF BOLD PI SYMBOL
-1D7AA..1D7C2  ; Lower # L&  [25] MATHEMATICAL SANS-SERIF BOLD ITALIC SMALL ALPHA..MATHEMATICAL SANS-SERIF BOLD ITALIC SMALL OMEGA
-1D7C4..1D7C9  ; Lower # L&   [6] MATHEMATICAL SANS-SERIF BOLD ITALIC EPSILON SYMBOL..MATHEMATICAL SANS-SERIF BOLD ITALIC PI SYMBOL
-1D7CB         ; Lower # L&       MATHEMATICAL BOLD SMALL DIGAMMA
-
-# Total code points: 1790
-
-# ================================================
-
-0041..005A    ; Upper # L&  [26] LATIN CAPITAL LETTER A..LATIN CAPITAL LETTER Z
-00C0..00D6    ; Upper # L&  [23] LATIN CAPITAL LETTER A WITH GRAVE..LATIN CAPITAL LETTER O WITH DIAERESIS
-00D8..00DE    ; Upper # L&   [7] LATIN CAPITAL LETTER O WITH STROKE..LATIN CAPITAL LETTER THORN
-0100          ; Upper # L&       LATIN CAPITAL LETTER A WITH MACRON
-0102          ; Upper # L&       LATIN CAPITAL LETTER A WITH BREVE
-0104          ; Upper # L&       LATIN CAPITAL LETTER A WITH OGONEK
-0106          ; Upper # L&       LATIN CAPITAL LETTER C WITH ACUTE
-0108          ; Upper # L&       LATIN CAPITAL LETTER C WITH CIRCUMFLEX
-010A          ; Upper # L&       LATIN CAPITAL LETTER C WITH DOT ABOVE
-010C          ; Upper # L&       LATIN CAPITAL LETTER C WITH CARON
-010E          ; Upper # L&       LATIN CAPITAL LETTER D WITH CARON
-0110          ; Upper # L&       LATIN CAPITAL LETTER D WITH STROKE
-0112          ; Upper # L&       LATIN CAPITAL LETTER E WITH MACRON
-0114          ; Upper # L&       LATIN CAPITAL LETTER E WITH BREVE
-0116          ; Upper # L&       LATIN CAPITAL LETTER E WITH DOT ABOVE
-0118          ; Upper # L&       LATIN CAPITAL LETTER E WITH OGONEK
-011A          ; Upper # L&       LATIN CAPITAL LETTER E WITH CARON
-011C          ; Upper # L&       LATIN CAPITAL LETTER G WITH CIRCUMFLEX
-011E          ; Upper # L&       LATIN CAPITAL LETTER G WITH BREVE
-0120          ; Upper # L&       LATIN CAPITAL LETTER G WITH DOT ABOVE
-0122          ; Upper # L&       LATIN CAPITAL LETTER G WITH CEDILLA
-0124          ; Upper # L&       LATIN CAPITAL LETTER H WITH CIRCUMFLEX
-0126          ; Upper # L&       LATIN CAPITAL LETTER H WITH STROKE
-0128          ; Upper # L&       LATIN CAPITAL LETTER I WITH TILDE
-012A          ; Upper # L&       LATIN CAPITAL LETTER I WITH MACRON
-012C          ; Upper # L&       LATIN CAPITAL LETTER I WITH BREVE
-012E          ; Upper # L&       LATIN CAPITAL LETTER I WITH OGONEK
-0130          ; Upper # L&       LATIN CAPITAL LETTER I WITH DOT ABOVE
-0132          ; Upper # L&       LATIN CAPITAL LIGATURE IJ
-0134          ; Upper # L&       LATIN CAPITAL LETTER J WITH CIRCUMFLEX
-0136          ; Upper # L&       LATIN CAPITAL LETTER K WITH CEDILLA
-0139          ; Upper # L&       LATIN CAPITAL LETTER L WITH ACUTE
-013B          ; Upper # L&       LATIN CAPITAL LETTER L WITH CEDILLA
-013D          ; Upper # L&       LATIN CAPITAL LETTER L WITH CARON
-013F          ; Upper # L&       LATIN CAPITAL LETTER L WITH MIDDLE DOT
-0141          ; Upper # L&       LATIN CAPITAL LETTER L WITH STROKE
-0143          ; Upper # L&       LATIN CAPITAL LETTER N WITH ACUTE
-0145          ; Upper # L&       LATIN CAPITAL LETTER N WITH CEDILLA
-0147          ; Upper # L&       LATIN CAPITAL LETTER N WITH CARON
-014A          ; Upper # L&       LATIN CAPITAL LETTER ENG
-014C          ; Upper # L&       LATIN CAPITAL LETTER O WITH MACRON
-014E          ; Upper # L&       LATIN CAPITAL LETTER O WITH BREVE
-0150          ; Upper # L&       LATIN CAPITAL LETTER O WITH DOUBLE ACUTE
-0152          ; Upper # L&       LATIN CAPITAL LIGATURE OE
-0154          ; Upper # L&       LATIN CAPITAL LETTER R WITH ACUTE
-0156          ; Upper # L&       LATIN CAPITAL LETTER R WITH CEDILLA
-0158          ; Upper # L&       LATIN CAPITAL LETTER R WITH CARON
-015A          ; Upper # L&       LATIN CAPITAL LETTER S WITH ACUTE
-015C          ; Upper # L&       LATIN CAPITAL LETTER S WITH CIRCUMFLEX
-015E          ; Upper # L&       LATIN CAPITAL LETTER S WITH CEDILLA
-0160          ; Upper # L&       LATIN CAPITAL LETTER S WITH CARON
-0162          ; Upper # L&       LATIN CAPITAL LETTER T WITH CEDILLA
-0164          ; Upper # L&       LATIN CAPITAL LETTER T WITH CARON
-0166          ; Upper # L&       LATIN CAPITAL LETTER T WITH STROKE
-0168          ; Upper # L&       LATIN CAPITAL LETTER U WITH TILDE
-016A          ; Upper # L&       LATIN CAPITAL LETTER U WITH MACRON
-016C          ; Upper # L&       LATIN CAPITAL LETTER U WITH BREVE
-016E          ; Upper # L&       LATIN CAPITAL LETTER U WITH RING ABOVE
-0170          ; Upper # L&       LATIN CAPITAL LETTER U WITH DOUBLE ACUTE
-0172          ; Upper # L&       LATIN CAPITAL LETTER U WITH OGONEK
-0174          ; Upper # L&       LATIN CAPITAL LETTER W WITH CIRCUMFLEX
-0176          ; Upper # L&       LATIN CAPITAL LETTER Y WITH CIRCUMFLEX
-0178..0179    ; Upper # L&   [2] LATIN CAPITAL LETTER Y WITH DIAERESIS..LATIN CAPITAL LETTER Z WITH ACUTE
-017B          ; Upper # L&       LATIN CAPITAL LETTER Z WITH DOT ABOVE
-017D          ; Upper # L&       LATIN CAPITAL LETTER Z WITH CARON
-0181..0182    ; Upper # L&   [2] LATIN CAPITAL LETTER B WITH HOOK..LATIN CAPITAL LETTER B WITH TOPBAR
-0184          ; Upper # L&       LATIN CAPITAL LETTER TONE SIX
-0186..0187    ; Upper # L&   [2] LATIN CAPITAL LETTER OPEN O..LATIN CAPITAL LETTER C WITH HOOK
-0189..018B    ; Upper # L&   [3] LATIN CAPITAL LETTER AFRICAN D..LATIN CAPITAL LETTER D WITH TOPBAR
-018E..0191    ; Upper # L&   [4] LATIN CAPITAL LETTER REVERSED E..LATIN CAPITAL LETTER F WITH HOOK
-0193..0194    ; Upper # L&   [2] LATIN CAPITAL LETTER G WITH HOOK..LATIN CAPITAL LETTER GAMMA
-0196..0198    ; Upper # L&   [3] LATIN CAPITAL LETTER IOTA..LATIN CAPITAL LETTER K WITH HOOK
-019C..019D    ; Upper # L&   [2] LATIN CAPITAL LETTER TURNED M..LATIN CAPITAL LETTER N WITH LEFT HOOK
-019F..01A0    ; Upper # L&   [2] LATIN CAPITAL LETTER O WITH MIDDLE TILDE..LATIN CAPITAL LETTER O WITH HORN
-01A2          ; Upper # L&       LATIN CAPITAL LETTER OI
-01A4          ; Upper # L&       LATIN CAPITAL LETTER P WITH HOOK
-01A6..01A7    ; Upper # L&   [2] LATIN LETTER YR..LATIN CAPITAL LETTER TONE TWO
-01A9          ; Upper # L&       LATIN CAPITAL LETTER ESH
-01AC          ; Upper # L&       LATIN CAPITAL LETTER T WITH HOOK
-01AE..01AF    ; Upper # L&   [2] LATIN CAPITAL LETTER T WITH RETROFLEX HOOK..LATIN CAPITAL LETTER U WITH HORN
-01B1..01B3    ; Upper # L&   [3] LATIN CAPITAL LETTER UPSILON..LATIN CAPITAL LETTER Y WITH HOOK
-01B5          ; Upper # L&       LATIN CAPITAL LETTER Z WITH STROKE
-01B7..01B8    ; Upper # L&   [2] LATIN CAPITAL LETTER EZH..LATIN CAPITAL LETTER EZH REVERSED
-01BC          ; Upper # L&       LATIN CAPITAL LETTER TONE FIVE
-01C4..01C5    ; Upper # L&   [2] LATIN CAPITAL LETTER DZ WITH CARON..LATIN CAPITAL LETTER D WITH SMALL LETTER Z WITH CARON
-01C7..01C8    ; Upper # L&   [2] LATIN CAPITAL LETTER LJ..LATIN CAPITAL LETTER L WITH SMALL LETTER J
-01CA..01CB    ; Upper # L&   [2] LATIN CAPITAL LETTER NJ..LATIN CAPITAL LETTER N WITH SMALL LETTER J
-01CD          ; Upper # L&       LATIN CAPITAL LETTER A WITH CARON
-01CF          ; Upper # L&       LATIN CAPITAL LETTER I WITH CARON
-01D1          ; Upper # L&       LATIN CAPITAL LETTER O WITH CARON
-01D3          ; Upper # L&       LATIN CAPITAL LETTER U WITH CARON
-01D5          ; Upper # L&       LATIN CAPITAL LETTER U WITH DIAERESIS AND MACRON
-01D7          ; Upper # L&       LATIN CAPITAL LETTER U WITH DIAERESIS AND ACUTE
-01D9          ; Upper # L&       LATIN CAPITAL LETTER U WITH DIAERESIS AND CARON
-01DB          ; Upper # L&       LATIN CAPITAL LETTER U WITH DIAERESIS AND GRAVE
-01DE          ; Upper # L&       LATIN CAPITAL LETTER A WITH DIAERESIS AND MACRON
-01E0          ; Upper # L&       LATIN CAPITAL LETTER A WITH DOT ABOVE AND MACRON
-01E2          ; Upper # L&       LATIN CAPITAL LETTER AE WITH MACRON
-01E4          ; Upper # L&       LATIN CAPITAL LETTER G WITH STROKE
-01E6          ; Upper # L&       LATIN CAPITAL LETTER G WITH CARON
-01E8          ; Upper # L&       LATIN CAPITAL LETTER K WITH CARON
-01EA          ; Upper # L&       LATIN CAPITAL LETTER O WITH OGONEK
-01EC          ; Upper # L&       LATIN CAPITAL LETTER O WITH OGONEK AND MACRON
-01EE          ; Upper # L&       LATIN CAPITAL LETTER EZH WITH CARON
-01F1..01F2    ; Upper # L&   [2] LATIN CAPITAL LETTER DZ..LATIN CAPITAL LETTER D WITH SMALL LETTER Z
-01F4          ; Upper # L&       LATIN CAPITAL LETTER G WITH ACUTE
-01F6..01F8    ; Upper # L&   [3] LATIN CAPITAL LETTER HWAIR..LATIN CAPITAL LETTER N WITH GRAVE
-01FA          ; Upper # L&       LATIN CAPITAL LETTER A WITH RING ABOVE AND ACUTE
-01FC          ; Upper # L&       LATIN CAPITAL LETTER AE WITH ACUTE
-01FE          ; Upper # L&       LATIN CAPITAL LETTER O WITH STROKE AND ACUTE
-0200          ; Upper # L&       LATIN CAPITAL LETTER A WITH DOUBLE GRAVE
-0202          ; Upper # L&       LATIN CAPITAL LETTER A WITH INVERTED BREVE
-0204          ; Upper # L&       LATIN CAPITAL LETTER E WITH DOUBLE GRAVE
-0206          ; Upper # L&       LATIN CAPITAL LETTER E WITH INVERTED BREVE
-0208          ; Upper # L&       LATIN CAPITAL LETTER I WITH DOUBLE GRAVE
-020A          ; Upper # L&       LATIN CAPITAL LETTER I WITH INVERTED BREVE
-020C          ; Upper # L&       LATIN CAPITAL LETTER O WITH DOUBLE GRAVE
-020E          ; Upper # L&       LATIN CAPITAL LETTER O WITH INVERTED BREVE
-0210          ; Upper # L&       LATIN CAPITAL LETTER R WITH DOUBLE GRAVE
-0212          ; Upper # L&       LATIN CAPITAL LETTER R WITH INVERTED BREVE
-0214          ; Upper # L&       LATIN CAPITAL LETTER U WITH DOUBLE GRAVE
-0216          ; Upper # L&       LATIN CAPITAL LETTER U WITH INVERTED BREVE
-0218          ; Upper # L&       LATIN CAPITAL LETTER S WITH COMMA BELOW
-021A          ; Upper # L&       LATIN CAPITAL LETTER T WITH COMMA BELOW
-021C          ; Upper # L&       LATIN CAPITAL LETTER YOGH
-021E          ; Upper # L&       LATIN CAPITAL LETTER H WITH CARON
-0220          ; Upper # L&       LATIN CAPITAL LETTER N WITH LONG RIGHT LEG
-0222          ; Upper # L&       LATIN CAPITAL LETTER OU
-0224          ; Upper # L&       LATIN CAPITAL LETTER Z WITH HOOK
-0226          ; Upper # L&       LATIN CAPITAL LETTER A WITH DOT ABOVE
-0228          ; Upper # L&       LATIN CAPITAL LETTER E WITH CEDILLA
-022A          ; Upper # L&       LATIN CAPITAL LETTER O WITH DIAERESIS AND MACRON
-022C          ; Upper # L&       LATIN CAPITAL LETTER O WITH TILDE AND MACRON
-022E          ; Upper # L&       LATIN CAPITAL LETTER O WITH DOT ABOVE
-0230          ; Upper # L&       LATIN CAPITAL LETTER O WITH DOT ABOVE AND MACRON
-0232          ; Upper # L&       LATIN CAPITAL LETTER Y WITH MACRON
-023A..023B    ; Upper # L&   [2] LATIN CAPITAL LETTER A WITH STROKE..LATIN CAPITAL LETTER C WITH STROKE
-023D..023E    ; Upper # L&   [2] LATIN CAPITAL LETTER L WITH BAR..LATIN CAPITAL LETTER T WITH DIAGONAL STROKE
-0241          ; Upper # L&       LATIN CAPITAL LETTER GLOTTAL STOP
-0243..0246    ; Upper # L&   [4] LATIN CAPITAL LETTER B WITH STROKE..LATIN CAPITAL LETTER E WITH STROKE
-0248          ; Upper # L&       LATIN CAPITAL LETTER J WITH STROKE
-024A          ; Upper # L&       LATIN CAPITAL LETTER SMALL Q WITH HOOK TAIL
-024C          ; Upper # L&       LATIN CAPITAL LETTER R WITH STROKE
-024E          ; Upper # L&       LATIN CAPITAL LETTER Y WITH STROKE
-0386          ; Upper # L&       GREEK CAPITAL LETTER ALPHA WITH TONOS
-0388..038A    ; Upper # L&   [3] GREEK CAPITAL LETTER EPSILON WITH TONOS..GREEK CAPITAL LETTER IOTA WITH TONOS
-038C          ; Upper # L&       GREEK CAPITAL LETTER OMICRON WITH TONOS
-038E..038F    ; Upper # L&   [2] GREEK CAPITAL LETTER UPSILON WITH TONOS..GREEK CAPITAL LETTER OMEGA WITH TONOS
-0391..03A1    ; Upper # L&  [17] GREEK CAPITAL LETTER ALPHA..GREEK CAPITAL LETTER RHO
-03A3..03AB    ; Upper # L&   [9] GREEK CAPITAL LETTER SIGMA..GREEK CAPITAL LETTER UPSILON WITH DIALYTIKA
-03D2..03D4    ; Upper # L&   [3] GREEK UPSILON WITH HOOK SYMBOL..GREEK UPSILON WITH DIAERESIS AND HOOK SYMBOL
-03D8          ; Upper # L&       GREEK LETTER ARCHAIC KOPPA
-03DA          ; Upper # L&       GREEK LETTER STIGMA
-03DC          ; Upper # L&       GREEK LETTER DIGAMMA
-03DE          ; Upper # L&       GREEK LETTER KOPPA
-03E0          ; Upper # L&       GREEK LETTER SAMPI
-03E2          ; Upper # L&       COPTIC CAPITAL LETTER SHEI
-03E4          ; Upper # L&       COPTIC CAPITAL LETTER FEI
-03E6          ; Upper # L&       COPTIC CAPITAL LETTER KHEI
-03E8          ; Upper # L&       COPTIC CAPITAL LETTER HORI
-03EA          ; Upper # L&       COPTIC CAPITAL LETTER GANGIA
-03EC          ; Upper # L&       COPTIC CAPITAL LETTER SHIMA
-03EE          ; Upper # L&       COPTIC CAPITAL LETTER DEI
-03F4          ; Upper # L&       GREEK CAPITAL THETA SYMBOL
-03F7          ; Upper # L&       GREEK CAPITAL LETTER SHO
-03F9..03FA    ; Upper # L&   [2] GREEK CAPITAL LUNATE SIGMA SYMBOL..GREEK CAPITAL LETTER SAN
-03FD..042F    ; Upper # L&  [51] GREEK CAPITAL REVERSED LUNATE SIGMA SYMBOL..CYRILLIC CAPITAL LETTER YA
-0460          ; Upper # L&       CYRILLIC CAPITAL LETTER OMEGA
-0462          ; Upper # L&       CYRILLIC CAPITAL LETTER YAT
-0464          ; Upper # L&       CYRILLIC CAPITAL LETTER IOTIFIED E
-0466          ; Upper # L&       CYRILLIC CAPITAL LETTER LITTLE YUS
-0468          ; Upper # L&       CYRILLIC CAPITAL LETTER IOTIFIED LITTLE YUS
-046A          ; Upper # L&       CYRILLIC CAPITAL LETTER BIG YUS
-046C          ; Upper # L&       CYRILLIC CAPITAL LETTER IOTIFIED BIG YUS
-046E          ; Upper # L&       CYRILLIC CAPITAL LETTER KSI
-0470          ; Upper # L&       CYRILLIC CAPITAL LETTER PSI
-0472          ; Upper # L&       CYRILLIC CAPITAL LETTER FITA
-0474          ; Upper # L&       CYRILLIC CAPITAL LETTER IZHITSA
-0476          ; Upper # L&       CYRILLIC CAPITAL LETTER IZHITSA WITH DOUBLE GRAVE ACCENT
-0478          ; Upper # L&       CYRILLIC CAPITAL LETTER UK
-047A          ; Upper # L&       CYRILLIC CAPITAL LETTER ROUND OMEGA
-047C          ; Upper # L&       CYRILLIC CAPITAL LETTER OMEGA WITH TITLO
-047E          ; Upper # L&       CYRILLIC CAPITAL LETTER OT
-0480          ; Upper # L&       CYRILLIC CAPITAL LETTER KOPPA
-048A          ; Upper # L&       CYRILLIC CAPITAL LETTER SHORT I WITH TAIL
-048C          ; Upper # L&       CYRILLIC CAPITAL LETTER SEMISOFT SIGN
-048E          ; Upper # L&       CYRILLIC CAPITAL LETTER ER WITH TICK
-0490          ; Upper # L&       CYRILLIC CAPITAL LETTER GHE WITH UPTURN
-0492          ; Upper # L&       CYRILLIC CAPITAL LETTER GHE WITH STROKE
-0494          ; Upper # L&       CYRILLIC CAPITAL LETTER GHE WITH MIDDLE HOOK
-0496          ; Upper # L&       CYRILLIC CAPITAL LETTER ZHE WITH DESCENDER
-0498          ; Upper # L&       CYRILLIC CAPITAL LETTER ZE WITH DESCENDER
-049A          ; Upper # L&       CYRILLIC CAPITAL LETTER KA WITH DESCENDER
-049C          ; Upper # L&       CYRILLIC CAPITAL LETTER KA WITH VERTICAL STROKE
-049E          ; Upper # L&       CYRILLIC CAPITAL LETTER KA WITH STROKE
-04A0          ; Upper # L&       CYRILLIC CAPITAL LETTER BASHKIR KA
-04A2          ; Upper # L&       CYRILLIC CAPITAL LETTER EN WITH DESCENDER
-04A4          ; Upper # L&       CYRILLIC CAPITAL LIGATURE EN GHE
-04A6          ; Upper # L&       CYRILLIC CAPITAL LETTER PE WITH MIDDLE HOOK
-04A8          ; Upper # L&       CYRILLIC CAPITAL LETTER ABKHASIAN HA
-04AA          ; Upper # L&       CYRILLIC CAPITAL LETTER ES WITH DESCENDER
-04AC          ; Upper # L&       CYRILLIC CAPITAL LETTER TE WITH DESCENDER
-04AE          ; Upper # L&       CYRILLIC CAPITAL LETTER STRAIGHT U
-04B0          ; Upper # L&       CYRILLIC CAPITAL LETTER STRAIGHT U WITH STROKE
-04B2          ; Upper # L&       CYRILLIC CAPITAL LETTER HA WITH DESCENDER
-04B4          ; Upper # L&       CYRILLIC CAPITAL LIGATURE TE TSE
-04B6          ; Upper # L&       CYRILLIC CAPITAL LETTER CHE WITH DESCENDER
-04B8          ; Upper # L&       CYRILLIC CAPITAL LETTER CHE WITH VERTICAL STROKE
-04BA          ; Upper # L&       CYRILLIC CAPITAL LETTER SHHA
-04BC          ; Upper # L&       CYRILLIC CAPITAL LETTER ABKHASIAN CHE
-04BE          ; Upper # L&       CYRILLIC CAPITAL LETTER ABKHASIAN CHE WITH DESCENDER
-04C0..04C1    ; Upper # L&   [2] CYRILLIC LETTER PALOCHKA..CYRILLIC CAPITAL LETTER ZHE WITH BREVE
-04C3          ; Upper # L&       CYRILLIC CAPITAL LETTER KA WITH HOOK
-04C5          ; Upper # L&       CYRILLIC CAPITAL LETTER EL WITH TAIL
-04C7          ; Upper # L&       CYRILLIC CAPITAL LETTER EN WITH HOOK
-04C9          ; Upper # L&       CYRILLIC CAPITAL LETTER EN WITH TAIL
-04CB          ; Upper # L&       CYRILLIC CAPITAL LETTER KHAKASSIAN CHE
-04CD          ; Upper # L&       CYRILLIC CAPITAL LETTER EM WITH TAIL
-04D0          ; Upper # L&       CYRILLIC CAPITAL LETTER A WITH BREVE
-04D2          ; Upper # L&       CYRILLIC CAPITAL LETTER A WITH DIAERESIS
-04D4          ; Upper # L&       CYRILLIC CAPITAL LIGATURE A IE
-04D6          ; Upper # L&       CYRILLIC CAPITAL LETTER IE WITH BREVE
-04D8          ; Upper # L&       CYRILLIC CAPITAL LETTER SCHWA
-04DA          ; Upper # L&       CYRILLIC CAPITAL LETTER SCHWA WITH DIAERESIS
-04DC          ; Upper # L&       CYRILLIC CAPITAL LETTER ZHE WITH DIAERESIS
-04DE          ; Upper # L&       CYRILLIC CAPITAL LETTER ZE WITH DIAERESIS
-04E0          ; Upper # L&       CYRILLIC CAPITAL LETTER ABKHASIAN DZE
-04E2          ; Upper # L&       CYRILLIC CAPITAL LETTER I WITH MACRON
-04E4          ; Upper # L&       CYRILLIC CAPITAL LETTER I WITH DIAERESIS
-04E6          ; Upper # L&       CYRILLIC CAPITAL LETTER O WITH DIAERESIS
-04E8          ; Upper # L&       CYRILLIC CAPITAL LETTER BARRED O
-04EA          ; Upper # L&       CYRILLIC CAPITAL LETTER BARRED O WITH DIAERESIS
-04EC          ; Upper # L&       CYRILLIC CAPITAL LETTER E WITH DIAERESIS
-04EE          ; Upper # L&       CYRILLIC CAPITAL LETTER U WITH MACRON
-04F0          ; Upper # L&       CYRILLIC CAPITAL LETTER U WITH DIAERESIS
-04F2          ; Upper # L&       CYRILLIC CAPITAL LETTER U WITH DOUBLE ACUTE
-04F4          ; Upper # L&       CYRILLIC CAPITAL LETTER CHE WITH DIAERESIS
-04F6          ; Upper # L&       CYRILLIC CAPITAL LETTER GHE WITH DESCENDER
-04F8          ; Upper # L&       CYRILLIC CAPITAL LETTER YERU WITH DIAERESIS
-04FA          ; Upper # L&       CYRILLIC CAPITAL LETTER GHE WITH STROKE AND HOOK
-04FC          ; Upper # L&       CYRILLIC CAPITAL LETTER HA WITH HOOK
-04FE          ; Upper # L&       CYRILLIC CAPITAL LETTER HA WITH STROKE
-0500          ; Upper # L&       CYRILLIC CAPITAL LETTER KOMI DE
-0502          ; Upper # L&       CYRILLIC CAPITAL LETTER KOMI DJE
-0504          ; Upper # L&       CYRILLIC CAPITAL LETTER KOMI ZJE
-0506          ; Upper # L&       CYRILLIC CAPITAL LETTER KOMI DZJE
-0508          ; Upper # L&       CYRILLIC CAPITAL LETTER KOMI LJE
-050A          ; Upper # L&       CYRILLIC CAPITAL LETTER KOMI NJE
-050C          ; Upper # L&       CYRILLIC CAPITAL LETTER KOMI SJE
-050E          ; Upper # L&       CYRILLIC CAPITAL LETTER KOMI TJE
-0510          ; Upper # L&       CYRILLIC CAPITAL LETTER REVERSED ZE
-0512          ; Upper # L&       CYRILLIC CAPITAL LETTER EL WITH HOOK
-0531..0556    ; Upper # L&  [38] ARMENIAN CAPITAL LETTER AYB..ARMENIAN CAPITAL LETTER FEH
-10A0..10C5    ; Upper # L&  [38] GEORGIAN CAPITAL LETTER AN..GEORGIAN CAPITAL LETTER HOE
-1E00          ; Upper # L&       LATIN CAPITAL LETTER A WITH RING BELOW
-1E02          ; Upper # L&       LATIN CAPITAL LETTER B WITH DOT ABOVE
-1E04          ; Upper # L&       LATIN CAPITAL LETTER B WITH DOT BELOW
-1E06          ; Upper # L&       LATIN CAPITAL LETTER B WITH LINE BELOW
-1E08          ; Upper # L&       LATIN CAPITAL LETTER C WITH CEDILLA AND ACUTE
-1E0A          ; Upper # L&       LATIN CAPITAL LETTER D WITH DOT ABOVE
-1E0C          ; Upper # L&       LATIN CAPITAL LETTER D WITH DOT BELOW
-1E0E          ; Upper # L&       LATIN CAPITAL LETTER D WITH LINE BELOW
-1E10          ; Upper # L&       LATIN CAPITAL LETTER D WITH CEDILLA
-1E12          ; Upper # L&       LATIN CAPITAL LETTER D WITH CIRCUMFLEX BELOW
-1E14          ; Upper # L&       LATIN CAPITAL LETTER E WITH MACRON AND GRAVE
-1E16          ; Upper # L&       LATIN CAPITAL LETTER E WITH MACRON AND ACUTE
-1E18          ; Upper # L&       LATIN CAPITAL LETTER E WITH CIRCUMFLEX BELOW
-1E1A          ; Upper # L&       LATIN CAPITAL LETTER E WITH TILDE BELOW
-1E1C          ; Upper # L&       LATIN CAPITAL LETTER E WITH CEDILLA AND BREVE
-1E1E          ; Upper # L&       LATIN CAPITAL LETTER F WITH DOT ABOVE
-1E20          ; Upper # L&       LATIN CAPITAL LETTER G WITH MACRON
-1E22          ; Upper # L&       LATIN CAPITAL LETTER H WITH DOT ABOVE
-1E24          ; Upper # L&       LATIN CAPITAL LETTER H WITH DOT BELOW
-1E26          ; Upper # L&       LATIN CAPITAL LETTER H WITH DIAERESIS
-1E28          ; Upper # L&       LATIN CAPITAL LETTER H WITH CEDILLA
-1E2A          ; Upper # L&       LATIN CAPITAL LETTER H WITH BREVE BELOW
-1E2C          ; Upper # L&       LATIN CAPITAL LETTER I WITH TILDE BELOW
-1E2E          ; Upper # L&       LATIN CAPITAL LETTER I WITH DIAERESIS AND ACUTE
-1E30          ; Upper # L&       LATIN CAPITAL LETTER K WITH ACUTE
-1E32          ; Upper # L&       LATIN CAPITAL LETTER K WITH DOT BELOW
-1E34          ; Upper # L&       LATIN CAPITAL LETTER K WITH LINE BELOW
-1E36          ; Upper # L&       LATIN CAPITAL LETTER L WITH DOT BELOW
-1E38          ; Upper # L&       LATIN CAPITAL LETTER L WITH DOT BELOW AND MACRON
-1E3A          ; Upper # L&       LATIN CAPITAL LETTER L WITH LINE BELOW
-1E3C          ; Upper # L&       LATIN CAPITAL LETTER L WITH CIRCUMFLEX BELOW
-1E3E          ; Upper # L&       LATIN CAPITAL LETTER M WITH ACUTE
-1E40          ; Upper # L&       LATIN CAPITAL LETTER M WITH DOT ABOVE
-1E42          ; Upper # L&       LATIN CAPITAL LETTER M WITH DOT BELOW
-1E44          ; Upper # L&       LATIN CAPITAL LETTER N WITH DOT ABOVE
-1E46          ; Upper # L&       LATIN CAPITAL LETTER N WITH DOT BELOW
-1E48          ; Upper # L&       LATIN CAPITAL LETTER N WITH LINE BELOW
-1E4A          ; Upper # L&       LATIN CAPITAL LETTER N WITH CIRCUMFLEX BELOW
-1E4C          ; Upper # L&       LATIN CAPITAL LETTER O WITH TILDE AND ACUTE
-1E4E          ; Upper # L&       LATIN CAPITAL LETTER O WITH TILDE AND DIAERESIS
-1E50          ; Upper # L&       LATIN CAPITAL LETTER O WITH MACRON AND GRAVE
-1E52          ; Upper # L&       LATIN CAPITAL LETTER O WITH MACRON AND ACUTE
-1E54          ; Upper # L&       LATIN CAPITAL LETTER P WITH ACUTE
-1E56          ; Upper # L&       LATIN CAPITAL LETTER P WITH DOT ABOVE
-1E58          ; Upper # L&       LATIN CAPITAL LETTER R WITH DOT ABOVE
-1E5A          ; Upper # L&       LATIN CAPITAL LETTER R WITH DOT BELOW
-1E5C          ; Upper # L&       LATIN CAPITAL LETTER R WITH DOT BELOW AND MACRON
-1E5E          ; Upper # L&       LATIN CAPITAL LETTER R WITH LINE BELOW
-1E60          ; Upper # L&       LATIN CAPITAL LETTER S WITH DOT ABOVE
-1E62          ; Upper # L&       LATIN CAPITAL LETTER S WITH DOT BELOW
-1E64          ; Upper # L&       LATIN CAPITAL LETTER S WITH ACUTE AND DOT ABOVE
-1E66          ; Upper # L&       LATIN CAPITAL LETTER S WITH CARON AND DOT ABOVE
-1E68          ; Upper # L&       LATIN CAPITAL LETTER S WITH DOT BELOW AND DOT ABOVE
-1E6A          ; Upper # L&       LATIN CAPITAL LETTER T WITH DOT ABOVE
-1E6C          ; Upper # L&       LATIN CAPITAL LETTER T WITH DOT BELOW
-1E6E          ; Upper # L&       LATIN CAPITAL LETTER T WITH LINE BELOW
-1E70          ; Upper # L&       LATIN CAPITAL LETTER T WITH CIRCUMFLEX BELOW
-1E72          ; Upper # L&       LATIN CAPITAL LETTER U WITH DIAERESIS BELOW
-1E74          ; Upper # L&       LATIN CAPITAL LETTER U WITH TILDE BELOW
-1E76          ; Upper # L&       LATIN CAPITAL LETTER U WITH CIRCUMFLEX BELOW
-1E78          ; Upper # L&       LATIN CAPITAL LETTER U WITH TILDE AND ACUTE
-1E7A          ; Upper # L&       LATIN CAPITAL LETTER U WITH MACRON AND DIAERESIS
-1E7C          ; Upper # L&       LATIN CAPITAL LETTER V WITH TILDE
-1E7E          ; Upper # L&       LATIN CAPITAL LETTER V WITH DOT BELOW
-1E80          ; Upper # L&       LATIN CAPITAL LETTER W WITH GRAVE
-1E82          ; Upper # L&       LATIN CAPITAL LETTER W WITH ACUTE
-1E84          ; Upper # L&       LATIN CAPITAL LETTER W WITH DIAERESIS
-1E86          ; Upper # L&       LATIN CAPITAL LETTER W WITH DOT ABOVE
-1E88          ; Upper # L&       LATIN CAPITAL LETTER W WITH DOT BELOW
-1E8A          ; Upper # L&       LATIN CAPITAL LETTER X WITH DOT ABOVE
-1E8C          ; Upper # L&       LATIN CAPITAL LETTER X WITH DIAERESIS
-1E8E          ; Upper # L&       LATIN CAPITAL LETTER Y WITH DOT ABOVE
-1E90          ; Upper # L&       LATIN CAPITAL LETTER Z WITH CIRCUMFLEX
-1E92          ; Upper # L&       LATIN CAPITAL LETTER Z WITH DOT BELOW
-1E94          ; Upper # L&       LATIN CAPITAL LETTER Z WITH LINE BELOW
-1EA0          ; Upper # L&       LATIN CAPITAL LETTER A WITH DOT BELOW
-1EA2          ; Upper # L&       LATIN CAPITAL LETTER A WITH HOOK ABOVE
-1EA4          ; Upper # L&       LATIN CAPITAL LETTER A WITH CIRCUMFLEX AND ACUTE
-1EA6          ; Upper # L&       LATIN CAPITAL LETTER A WITH CIRCUMFLEX AND GRAVE
-1EA8          ; Upper # L&       LATIN CAPITAL LETTER A WITH CIRCUMFLEX AND HOOK ABOVE
-1EAA          ; Upper # L&       LATIN CAPITAL LETTER A WITH CIRCUMFLEX AND TILDE
-1EAC          ; Upper # L&       LATIN CAPITAL LETTER A WITH CIRCUMFLEX AND DOT BELOW
-1EAE          ; Upper # L&       LATIN CAPITAL LETTER A WITH BREVE AND ACUTE
-1EB0          ; Upper # L&       LATIN CAPITAL LETTER A WITH BREVE AND GRAVE
-1EB2          ; Upper # L&       LATIN CAPITAL LETTER A WITH BREVE AND HOOK ABOVE
-1EB4          ; Upper # L&       LATIN CAPITAL LETTER A WITH BREVE AND TILDE
-1EB6          ; Upper # L&       LATIN CAPITAL LETTER A WITH BREVE AND DOT BELOW
-1EB8          ; Upper # L&       LATIN CAPITAL LETTER E WITH DOT BELOW
-1EBA          ; Upper # L&       LATIN CAPITAL LETTER E WITH HOOK ABOVE
-1EBC          ; Upper # L&       LATIN CAPITAL LETTER E WITH TILDE
-1EBE          ; Upper # L&       LATIN CAPITAL LETTER E WITH CIRCUMFLEX AND ACUTE
-1EC0          ; Upper # L&       LATIN CAPITAL LETTER E WITH CIRCUMFLEX AND GRAVE
-1EC2          ; Upper # L&       LATIN CAPITAL LETTER E WITH CIRCUMFLEX AND HOOK ABOVE
-1EC4          ; Upper # L&       LATIN CAPITAL LETTER E WITH CIRCUMFLEX AND TILDE
-1EC6          ; Upper # L&       LATIN CAPITAL LETTER E WITH CIRCUMFLEX AND DOT BELOW
-1EC8          ; Upper # L&       LATIN CAPITAL LETTER I WITH HOOK ABOVE
-1ECA          ; Upper # L&       LATIN CAPITAL LETTER I WITH DOT BELOW
-1ECC          ; Upper # L&       LATIN CAPITAL LETTER O WITH DOT BELOW
-1ECE          ; Upper # L&       LATIN CAPITAL LETTER O WITH HOOK ABOVE
-1ED0          ; Upper # L&       LATIN CAPITAL LETTER O WITH CIRCUMFLEX AND ACUTE
-1ED2          ; Upper # L&       LATIN CAPITAL LETTER O WITH CIRCUMFLEX AND GRAVE
-1ED4          ; Upper # L&       LATIN CAPITAL LETTER O WITH CIRCUMFLEX AND HOOK ABOVE
-1ED6          ; Upper # L&       LATIN CAPITAL LETTER O WITH CIRCUMFLEX AND TILDE
-1ED8          ; Upper # L&       LATIN CAPITAL LETTER O WITH CIRCUMFLEX AND DOT BELOW
-1EDA          ; Upper # L&       LATIN CAPITAL LETTER O WITH HORN AND ACUTE
-1EDC          ; Upper # L&       LATIN CAPITAL LETTER O WITH HORN AND GRAVE
-1EDE          ; Upper # L&       LATIN CAPITAL LETTER O WITH HORN AND HOOK ABOVE
-1EE0          ; Upper # L&       LATIN CAPITAL LETTER O WITH HORN AND TILDE
-1EE2          ; Upper # L&       LATIN CAPITAL LETTER O WITH HORN AND DOT BELOW
-1EE4          ; Upper # L&       LATIN CAPITAL LETTER U WITH DOT BELOW
-1EE6          ; Upper # L&       LATIN CAPITAL LETTER U WITH HOOK ABOVE
-1EE8          ; Upper # L&       LATIN CAPITAL LETTER U WITH HORN AND ACUTE
-1EEA          ; Upper # L&       LATIN CAPITAL LETTER U WITH HORN AND GRAVE
-1EEC          ; Upper # L&       LATIN CAPITAL LETTER U WITH HORN AND HOOK ABOVE
-1EEE          ; Upper # L&       LATIN CAPITAL LETTER U WITH HORN AND TILDE
-1EF0          ; Upper # L&       LATIN CAPITAL LETTER U WITH HORN AND DOT BELOW
-1EF2          ; Upper # L&       LATIN CAPITAL LETTER Y WITH GRAVE
-1EF4          ; Upper # L&       LATIN CAPITAL LETTER Y WITH DOT BELOW
-1EF6          ; Upper # L&       LATIN CAPITAL LETTER Y WITH HOOK ABOVE
-1EF8          ; Upper # L&       LATIN CAPITAL LETTER Y WITH TILDE
-1F08..1F0F    ; Upper # L&   [8] GREEK CAPITAL LETTER ALPHA WITH PSILI..GREEK CAPITAL LETTER ALPHA WITH DASIA AND PERISPOMENI
-1F18..1F1D    ; Upper # L&   [6] GREEK CAPITAL LETTER EPSILON WITH PSILI..GREEK CAPITAL LETTER EPSILON WITH DASIA AND OXIA
-1F28..1F2F    ; Upper # L&   [8] GREEK CAPITAL LETTER ETA WITH PSILI..GREEK CAPITAL LETTER ETA WITH DASIA AND PERISPOMENI
-1F38..1F3F    ; Upper # L&   [8] GREEK CAPITAL LETTER IOTA WITH PSILI..GREEK CAPITAL LETTER IOTA WITH DASIA AND PERISPOMENI
-1F48..1F4D    ; Upper # L&   [6] GREEK CAPITAL LETTER OMICRON WITH PSILI..GREEK CAPITAL LETTER OMICRON WITH DASIA AND OXIA
-1F59          ; Upper # L&       GREEK CAPITAL LETTER UPSILON WITH DASIA
-1F5B          ; Upper # L&       GREEK CAPITAL LETTER UPSILON WITH DASIA AND VARIA
-1F5D          ; Upper # L&       GREEK CAPITAL LETTER UPSILON WITH DASIA AND OXIA
-1F5F          ; Upper # L&       GREEK CAPITAL LETTER UPSILON WITH DASIA AND PERISPOMENI
-1F68..1F6F    ; Upper # L&   [8] GREEK CAPITAL LETTER OMEGA WITH PSILI..GREEK CAPITAL LETTER OMEGA WITH DASIA AND PERISPOMENI
-1F88..1F8F    ; Upper # L&   [8] GREEK CAPITAL LETTER ALPHA WITH PSILI AND PROSGEGRAMMENI..GREEK CAPITAL LETTER ALPHA WITH DASIA AND PERISPOMENI AND PROSGEGRAMMENI
-1F98..1F9F    ; Upper # L&   [8] GREEK CAPITAL LETTER ETA WITH PSILI AND PROSGEGRAMMENI..GREEK CAPITAL LETTER ETA WITH DASIA AND PERISPOMENI AND PROSGEGRAMMENI
-1FA8..1FAF    ; Upper # L&   [8] GREEK CAPITAL LETTER OMEGA WITH PSILI AND PROSGEGRAMMENI..GREEK CAPITAL LETTER OMEGA WITH DASIA AND PERISPOMENI AND PROSGEGRAMMENI
-1FB8..1FBC    ; Upper # L&   [5] GREEK CAPITAL LETTER ALPHA WITH VRACHY..GREEK CAPITAL LETTER ALPHA WITH PROSGEGRAMMENI
-1FC8..1FCC    ; Upper # L&   [5] GREEK CAPITAL LETTER EPSILON WITH VARIA..GREEK CAPITAL LETTER ETA WITH PROSGEGRAMMENI
-1FD8..1FDB    ; Upper # L&   [4] GREEK CAPITAL LETTER IOTA WITH VRACHY..GREEK CAPITAL LETTER IOTA WITH OXIA
-1FE8..1FEC    ; Upper # L&   [5] GREEK CAPITAL LETTER UPSILON WITH VRACHY..GREEK CAPITAL LETTER RHO WITH DASIA
-1FF8..1FFC    ; Upper # L&   [5] GREEK CAPITAL LETTER OMICRON WITH VARIA..GREEK CAPITAL LETTER OMEGA WITH PROSGEGRAMMENI
-2102          ; Upper # L&       DOUBLE-STRUCK CAPITAL C
-2107          ; Upper # L&       EULER CONSTANT
-210B..210D    ; Upper # L&   [3] SCRIPT CAPITAL H..DOUBLE-STRUCK CAPITAL H
-2110..2112    ; Upper # L&   [3] SCRIPT CAPITAL I..SCRIPT CAPITAL L
-2115          ; Upper # L&       DOUBLE-STRUCK CAPITAL N
-2119..211D    ; Upper # L&   [5] DOUBLE-STRUCK CAPITAL P..DOUBLE-STRUCK CAPITAL R
-2124          ; Upper # L&       DOUBLE-STRUCK CAPITAL Z
-2126          ; Upper # L&       OHM SIGN
-2128          ; Upper # L&       BLACK-LETTER CAPITAL Z
-212A..212D    ; Upper # L&   [4] KELVIN SIGN..BLACK-LETTER CAPITAL C
-2130..2133    ; Upper # L&   [4] SCRIPT CAPITAL E..SCRIPT CAPITAL M
-213E..213F    ; Upper # L&   [2] DOUBLE-STRUCK CAPITAL GAMMA..DOUBLE-STRUCK CAPITAL PI
-2145          ; Upper # L&       DOUBLE-STRUCK ITALIC CAPITAL D
-2160..216F    ; Upper # Nl  [16] ROMAN NUMERAL ONE..ROMAN NUMERAL ONE THOUSAND
-2183          ; Upper # L&       ROMAN NUMERAL REVERSED ONE HUNDRED
-24B6..24CF    ; Upper # So  [26] CIRCLED LATIN CAPITAL LETTER A..CIRCLED LATIN CAPITAL LETTER Z
-2C00..2C2E    ; Upper # L&  [47] GLAGOLITIC CAPITAL LETTER AZU..GLAGOLITIC CAPITAL LETTER LATINATE MYSLITE
-2C60          ; Upper # L&       LATIN CAPITAL LETTER L WITH DOUBLE BAR
-2C62..2C64    ; Upper # L&   [3] LATIN CAPITAL LETTER L WITH MIDDLE TILDE..LATIN CAPITAL LETTER R WITH TAIL
-2C67          ; Upper # L&       LATIN CAPITAL LETTER H WITH DESCENDER
-2C69          ; Upper # L&       LATIN CAPITAL LETTER K WITH DESCENDER
-2C6B          ; Upper # L&       LATIN CAPITAL LETTER Z WITH DESCENDER
-2C75          ; Upper # L&       LATIN CAPITAL LETTER HALF H
-2C80          ; Upper # L&       COPTIC CAPITAL LETTER ALFA
-2C82          ; Upper # L&       COPTIC CAPITAL LETTER VIDA
-2C84          ; Upper # L&       COPTIC CAPITAL LETTER GAMMA
-2C86          ; Upper # L&       COPTIC CAPITAL LETTER DALDA
-2C88          ; Upper # L&       COPTIC CAPITAL LETTER EIE
-2C8A          ; Upper # L&       COPTIC CAPITAL LETTER SOU
-2C8C          ; Upper # L&       COPTIC CAPITAL LETTER ZATA
-2C8E          ; Upper # L&       COPTIC CAPITAL LETTER HATE
-2C90          ; Upper # L&       COPTIC CAPITAL LETTER THETHE
-2C92          ; Upper # L&       COPTIC CAPITAL LETTER IAUDA
-2C94          ; Upper # L&       COPTIC CAPITAL LETTER KAPA
-2C96          ; Upper # L&       COPTIC CAPITAL LETTER LAULA
-2C98          ; Upper # L&       COPTIC CAPITAL LETTER MI
-2C9A          ; Upper # L&       COPTIC CAPITAL LETTER NI
-2C9C          ; Upper # L&       COPTIC CAPITAL LETTER KSI
-2C9E          ; Upper # L&       COPTIC CAPITAL LETTER O
-2CA0          ; Upper # L&       COPTIC CAPITAL LETTER PI
-2CA2          ; Upper # L&       COPTIC CAPITAL LETTER RO
-2CA4          ; Upper # L&       COPTIC CAPITAL LETTER SIMA
-2CA6          ; Upper # L&       COPTIC CAPITAL LETTER TAU
-2CA8          ; Upper # L&       COPTIC CAPITAL LETTER UA
-2CAA          ; Upper # L&       COPTIC CAPITAL LETTER FI
-2CAC          ; Upper # L&       COPTIC CAPITAL LETTER KHI
-2CAE          ; Upper # L&       COPTIC CAPITAL LETTER PSI
-2CB0          ; Upper # L&       COPTIC CAPITAL LETTER OOU
-2CB2          ; Upper # L&       COPTIC CAPITAL LETTER DIALECT-P ALEF
-2CB4          ; Upper # L&       COPTIC CAPITAL LETTER OLD COPTIC AIN
-2CB6          ; Upper # L&       COPTIC CAPITAL LETTER CRYPTOGRAMMIC EIE
-2CB8          ; Upper # L&       COPTIC CAPITAL LETTER DIALECT-P KAPA
-2CBA          ; Upper # L&       COPTIC CAPITAL LETTER DIALECT-P NI
-2CBC          ; Upper # L&       COPTIC CAPITAL LETTER CRYPTOGRAMMIC NI
-2CBE          ; Upper # L&       COPTIC CAPITAL LETTER OLD COPTIC OOU
-2CC0          ; Upper # L&       COPTIC CAPITAL LETTER SAMPI
-2CC2          ; Upper # L&       COPTIC CAPITAL LETTER CROSSED SHEI
-2CC4          ; Upper # L&       COPTIC CAPITAL LETTER OLD COPTIC SHEI
-2CC6          ; Upper # L&       COPTIC CAPITAL LETTER OLD COPTIC ESH
-2CC8          ; Upper # L&       COPTIC CAPITAL LETTER AKHMIMIC KHEI
-2CCA          ; Upper # L&       COPTIC CAPITAL LETTER DIALECT-P HORI
-2CCC          ; Upper # L&       COPTIC CAPITAL LETTER OLD COPTIC HORI
-2CCE          ; Upper # L&       COPTIC CAPITAL LETTER OLD COPTIC HA
-2CD0          ; Upper # L&       COPTIC CAPITAL LETTER L-SHAPED HA
-2CD2          ; Upper # L&       COPTIC CAPITAL LETTER OLD COPTIC HEI
-2CD4          ; Upper # L&       COPTIC CAPITAL LETTER OLD COPTIC HAT
-2CD6          ; Upper # L&       COPTIC CAPITAL LETTER OLD COPTIC GANGIA
-2CD8          ; Upper # L&       COPTIC CAPITAL LETTER OLD COPTIC DJA
-2CDA          ; Upper # L&       COPTIC CAPITAL LETTER OLD COPTIC SHIMA
-2CDC          ; Upper # L&       COPTIC CAPITAL LETTER OLD NUBIAN SHIMA
-2CDE          ; Upper # L&       COPTIC CAPITAL LETTER OLD NUBIAN NGI
-2CE0          ; Upper # L&       COPTIC CAPITAL LETTER OLD NUBIAN NYI
-2CE2          ; Upper # L&       COPTIC CAPITAL LETTER OLD NUBIAN WAU
-FF21..FF3A    ; Upper # L&  [26] FULLWIDTH LATIN CAPITAL LETTER A..FULLWIDTH LATIN CAPITAL LETTER Z
-10400..10427  ; Upper # L&  [40] DESERET CAPITAL LETTER LONG I..DESERET CAPITAL LETTER EW
-1D400..1D419  ; Upper # L&  [26] MATHEMATICAL BOLD CAPITAL A..MATHEMATICAL BOLD CAPITAL Z
-1D434..1D44D  ; Upper # L&  [26] MATHEMATICAL ITALIC CAPITAL A..MATHEMATICAL ITALIC CAPITAL Z
-1D468..1D481  ; Upper # L&  [26] MATHEMATICAL BOLD ITALIC CAPITAL A..MATHEMATICAL BOLD ITALIC CAPITAL Z
-1D49C         ; Upper # L&       MATHEMATICAL SCRIPT CAPITAL A
-1D49E..1D49F  ; Upper # L&   [2] MATHEMATICAL SCRIPT CAPITAL C..MATHEMATICAL SCRIPT CAPITAL D
-1D4A2         ; Upper # L&       MATHEMATICAL SCRIPT CAPITAL G
-1D4A5..1D4A6  ; Upper # L&   [2] MATHEMATICAL SCRIPT CAPITAL J..MATHEMATICAL SCRIPT CAPITAL K
-1D4A9..1D4AC  ; Upper # L&   [4] MATHEMATICAL SCRIPT CAPITAL N..MATHEMATICAL SCRIPT CAPITAL Q
-1D4AE..1D4B5  ; Upper # L&   [8] MATHEMATICAL SCRIPT CAPITAL S..MATHEMATICAL SCRIPT CAPITAL Z
-1D4D0..1D4E9  ; Upper # L&  [26] MATHEMATICAL BOLD SCRIPT CAPITAL A..MATHEMATICAL BOLD SCRIPT CAPITAL Z
-1D504..1D505  ; Upper # L&   [2] MATHEMATICAL FRAKTUR CAPITAL A..MATHEMATICAL FRAKTUR CAPITAL B
-1D507..1D50A  ; Upper # L&   [4] MATHEMATICAL FRAKTUR CAPITAL D..MATHEMATICAL FRAKTUR CAPITAL G
-1D50D..1D514  ; Upper # L&   [8] MATHEMATICAL FRAKTUR CAPITAL J..MATHEMATICAL FRAKTUR CAPITAL Q
-1D516..1D51C  ; Upper # L&   [7] MATHEMATICAL FRAKTUR CAPITAL S..MATHEMATICAL FRAKTUR CAPITAL Y
-1D538..1D539  ; Upper # L&   [2] MATHEMATICAL DOUBLE-STRUCK CAPITAL A..MATHEMATICAL DOUBLE-STRUCK CAPITAL B
-1D53B..1D53E  ; Upper # L&   [4] MATHEMATICAL DOUBLE-STRUCK CAPITAL D..MATHEMATICAL DOUBLE-STRUCK CAPITAL G
-1D540..1D544  ; Upper # L&   [5] MATHEMATICAL DOUBLE-STRUCK CAPITAL I..MATHEMATICAL DOUBLE-STRUCK CAPITAL M
-1D546         ; Upper # L&       MATHEMATICAL DOUBLE-STRUCK CAPITAL O
-1D54A..1D550  ; Upper # L&   [7] MATHEMATICAL DOUBLE-STRUCK CAPITAL S..MATHEMATICAL DOUBLE-STRUCK CAPITAL Y
-1D56C..1D585  ; Upper # L&  [26] MATHEMATICAL BOLD FRAKTUR CAPITAL A..MATHEMATICAL BOLD FRAKTUR CAPITAL Z
-1D5A0..1D5B9  ; Upper # L&  [26] MATHEMATICAL SANS-SERIF CAPITAL A..MATHEMATICAL SANS-SERIF CAPITAL Z
-1D5D4..1D5ED  ; Upper # L&  [26] MATHEMATICAL SANS-SERIF BOLD CAPITAL A..MATHEMATICAL SANS-SERIF BOLD CAPITAL Z
-1D608..1D621  ; Upper # L&  [26] MATHEMATICAL SANS-SERIF ITALIC CAPITAL A..MATHEMATICAL SANS-SERIF ITALIC CAPITAL Z
-1D63C..1D655  ; Upper # L&  [26] MATHEMATICAL SANS-SERIF BOLD ITALIC CAPITAL A..MATHEMATICAL SANS-SERIF BOLD ITALIC CAPITAL Z
-1D670..1D689  ; Upper # L&  [26] MATHEMATICAL MONOSPACE CAPITAL A..MATHEMATICAL MONOSPACE CAPITAL Z
-1D6A8..1D6C0  ; Upper # L&  [25] MATHEMATICAL BOLD CAPITAL ALPHA..MATHEMATICAL BOLD CAPITAL OMEGA
-1D6E2..1D6FA  ; Upper # L&  [25] MATHEMATICAL ITALIC CAPITAL ALPHA..MATHEMATICAL ITALIC CAPITAL OMEGA
-1D71C..1D734  ; Upper # L&  [25] MATHEMATICAL BOLD ITALIC CAPITAL ALPHA..MATHEMATICAL BOLD ITALIC CAPITAL OMEGA
-1D756..1D76E  ; Upper # L&  [25] MATHEMATICAL SANS-SERIF BOLD CAPITAL ALPHA..MATHEMATICAL SANS-SERIF BOLD CAPITAL OMEGA
-1D790..1D7A8  ; Upper # L&  [25] MATHEMATICAL SANS-SERIF BOLD ITALIC CAPITAL ALPHA..MATHEMATICAL SANS-SERIF BOLD ITALIC CAPITAL OMEGA
-1D7CA         ; Upper # L&       MATHEMATICAL BOLD CAPITAL DIGAMMA
-
-# Total code points: 1393
-
-# ================================================
-
-00A0          ; OLetter # Zs       NO-BREAK SPACE
-01BB          ; OLetter # Lo       LATIN LETTER TWO WITH STROKE
-01C0..01C3    ; OLetter # Lo   [4] LATIN LETTER DENTAL CLICK..LATIN LETTER RETROFLEX CLICK
-0294          ; OLetter # Lo       LATIN LETTER GLOTTAL STOP
-02B9..02BF    ; OLetter # Lm   [7] MODIFIER LETTER PRIME..MODIFIER LETTER LEFT HALF RING
-02C6..02D1    ; OLetter # Lm  [12] MODIFIER LETTER CIRCUMFLEX ACCENT..MODIFIER LETTER HALF TRIANGULAR COLON
-02EE          ; OLetter # Lm       MODIFIER LETTER DOUBLE APOSTROPHE
-0559          ; OLetter # Lm       ARMENIAN MODIFIER LETTER LEFT HALF RING
-05D0..05EA    ; OLetter # Lo  [27] HEBREW LETTER ALEF..HEBREW LETTER TAV
-05F0..05F2    ; OLetter # Lo   [3] HEBREW LIGATURE YIDDISH DOUBLE VAV..HEBREW LIGATURE YIDDISH DOUBLE YOD
-05F3          ; OLetter # Po       HEBREW PUNCTUATION GERESH
-0621..063A    ; OLetter # Lo  [26] ARABIC LETTER HAMZA..ARABIC LETTER GHAIN
-0640          ; OLetter # Lm       ARABIC TATWEEL
-0641..064A    ; OLetter # Lo  [10] ARABIC LETTER FEH..ARABIC LETTER YEH
-066E..066F    ; OLetter # Lo   [2] ARABIC LETTER DOTLESS BEH..ARABIC LETTER DOTLESS QAF
-0671..06D3    ; OLetter # Lo  [99] ARABIC LETTER ALEF WASLA..ARABIC LETTER YEH BARREE WITH HAMZA ABOVE
-06D5          ; OLetter # Lo       ARABIC LETTER AE
-06E5..06E6    ; OLetter # Lm   [2] ARABIC SMALL WAW..ARABIC SMALL YEH
-06EE..06EF    ; OLetter # Lo   [2] ARABIC LETTER DAL WITH INVERTED V..ARABIC LETTER REH WITH INVERTED V
-06FA..06FC    ; OLetter # Lo   [3] ARABIC LETTER SHEEN WITH DOT BELOW..ARABIC LETTER GHAIN WITH DOT BELOW
-06FF          ; OLetter # Lo       ARABIC LETTER HEH WITH INVERTED V
-0710          ; OLetter # Lo       SYRIAC LETTER ALAPH
-0712..072F    ; OLetter # Lo  [30] SYRIAC LETTER BETH..SYRIAC LETTER PERSIAN DHALATH
-074D..076D    ; OLetter # Lo  [33] SYRIAC LETTER SOGDIAN ZHAIN..ARABIC LETTER SEEN WITH TWO DOTS VERTICALLY ABOVE
-0780..07A5    ; OLetter # Lo  [38] THAANA LETTER HAA..THAANA LETTER WAAVU
-07B1          ; OLetter # Lo       THAANA LETTER NAA
-07CA..07EA    ; OLetter # Lo  [33] NKO LETTER A..NKO LETTER JONA RA
-07F4..07F5    ; OLetter # Lm   [2] NKO HIGH TONE APOSTROPHE..NKO LOW TONE APOSTROPHE
-07FA          ; OLetter # Lm       NKO LAJANYALAN
-0903          ; OLetter # Mc       DEVANAGARI SIGN VISARGA
-0904..0939    ; OLetter # Lo  [54] DEVANAGARI LETTER SHORT A..DEVANAGARI LETTER HA
-093D          ; OLetter # Lo       DEVANAGARI SIGN AVAGRAHA
-093E..0940    ; OLetter # Mc   [3] DEVANAGARI VOWEL SIGN AA..DEVANAGARI VOWEL SIGN II
-0949..094C    ; OLetter # Mc   [4] DEVANAGARI VOWEL SIGN CANDRA O..DEVANAGARI VOWEL SIGN AU
-0950          ; OLetter # Lo       DEVANAGARI OM
-0958..0961    ; OLetter # Lo  [10] DEVANAGARI LETTER QA..DEVANAGARI LETTER VOCALIC LL
-097B..097F    ; OLetter # Lo   [5] DEVANAGARI LETTER GGA..DEVANAGARI LETTER BBA
-0982..0983    ; OLetter # Mc   [2] BENGALI SIGN ANUSVARA..BENGALI SIGN VISARGA
-0985..098C    ; OLetter # Lo   [8] BENGALI LETTER A..BENGALI LETTER VOCALIC L
-098F..0990    ; OLetter # Lo   [2] BENGALI LETTER E..BENGALI LETTER AI
-0993..09A8    ; OLetter # Lo  [22] BENGALI LETTER O..BENGALI LETTER NA
-09AA..09B0    ; OLetter # Lo   [7] BENGALI LETTER PA..BENGALI LETTER RA
-09B2          ; OLetter # Lo       BENGALI LETTER LA
-09B6..09B9    ; OLetter # Lo   [4] BENGALI LETTER SHA..BENGALI LETTER HA
-09BD          ; OLetter # Lo       BENGALI SIGN AVAGRAHA
-09BF..09C0    ; OLetter # Mc   [2] BENGALI VOWEL SIGN I..BENGALI VOWEL SIGN II
-09C7..09C8    ; OLetter # Mc   [2] BENGALI VOWEL SIGN E..BENGALI VOWEL SIGN AI
-09CB..09CC    ; OLetter # Mc   [2] BENGALI VOWEL SIGN O..BENGALI VOWEL SIGN AU
-09CE          ; OLetter # Lo       BENGALI LETTER KHANDA TA
-09DC..09DD    ; OLetter # Lo   [2] BENGALI LETTER RRA..BENGALI LETTER RHA
-09DF..09E1    ; OLetter # Lo   [3] BENGALI LETTER YYA..BENGALI LETTER VOCALIC LL
-09F0..09F1    ; OLetter # Lo   [2] BENGALI LETTER RA WITH MIDDLE DIAGONAL..BENGALI LETTER RA WITH LOWER DIAGONAL
-0A03          ; OLetter # Mc       GURMUKHI SIGN VISARGA
-0A05..0A0A    ; OLetter # Lo   [6] GURMUKHI LETTER A..GURMUKHI LETTER UU
-0A0F..0A10    ; OLetter # Lo   [2] GURMUKHI LETTER EE..GURMUKHI LETTER AI
-0A13..0A28    ; OLetter # Lo  [22] GURMUKHI LETTER OO..GURMUKHI LETTER NA
-0A2A..0A30    ; OLetter # Lo   [7] GURMUKHI LETTER PA..GURMUKHI LETTER RA
-0A32..0A33    ; OLetter # Lo   [2] GURMUKHI LETTER LA..GURMUKHI LETTER LLA
-0A35..0A36    ; OLetter # Lo   [2] GURMUKHI LETTER VA..GURMUKHI LETTER SHA
-0A38..0A39    ; OLetter # Lo   [2] GURMUKHI LETTER SA..GURMUKHI LETTER HA
-0A3E..0A40    ; OLetter # Mc   [3] GURMUKHI VOWEL SIGN AA..GURMUKHI VOWEL SIGN II
-0A59..0A5C    ; OLetter # Lo   [4] GURMUKHI LETTER KHHA..GURMUKHI LETTER RRA
-0A5E          ; OLetter # Lo       GURMUKHI LETTER FA
-0A72..0A74    ; OLetter # Lo   [3] GURMUKHI IRI..GURMUKHI EK ONKAR
-0A83          ; OLetter # Mc       GUJARATI SIGN VISARGA
-0A85..0A8D    ; OLetter # Lo   [9] GUJARATI LETTER A..GUJARATI VOWEL CANDRA E
-0A8F..0A91    ; OLetter # Lo   [3] GUJARATI LETTER E..GUJARATI VOWEL CANDRA O
-0A93..0AA8    ; OLetter # Lo  [22] GUJARATI LETTER O..GUJARATI LETTER NA
-0AAA..0AB0    ; OLetter # Lo   [7] GUJARATI LETTER PA..GUJARATI LETTER RA
-0AB2..0AB3    ; OLetter # Lo   [2] GUJARATI LETTER LA..GUJARATI LETTER LLA
-0AB5..0AB9    ; OLetter # Lo   [5] GUJARATI LETTER VA..GUJARATI LETTER HA
-0ABD          ; OLetter # Lo       GUJARATI SIGN AVAGRAHA
-0ABE..0AC0    ; OLetter # Mc   [3] GUJARATI VOWEL SIGN AA..GUJARATI VOWEL SIGN II
-0AC9          ; OLetter # Mc       GUJARATI VOWEL SIGN CANDRA O
-0ACB..0ACC    ; OLetter # Mc   [2] GUJARATI VOWEL SIGN O..GUJARATI VOWEL SIGN AU
-0AD0          ; OLetter # Lo       GUJARATI OM
-0AE0..0AE1    ; OLetter # Lo   [2] GUJARATI LETTER VOCALIC RR..GUJARATI LETTER VOCALIC LL
-0B02..0B03    ; OLetter # Mc   [2] ORIYA SIGN ANUSVARA..ORIYA SIGN VISARGA
-0B05..0B0C    ; OLetter # Lo   [8] ORIYA LETTER A..ORIYA LETTER VOCALIC L
-0B0F..0B10    ; OLetter # Lo   [2] ORIYA LETTER E..ORIYA LETTER AI
-0B13..0B28    ; OLetter # Lo  [22] ORIYA LETTER O..ORIYA LETTER NA
-0B2A..0B30    ; OLetter # Lo   [7] ORIYA LETTER PA..ORIYA LETTER RA
-0B32..0B33    ; OLetter # Lo   [2] ORIYA LETTER LA..ORIYA LETTER LLA
-0B35..0B39    ; OLetter # Lo   [5] ORIYA LETTER VA..ORIYA LETTER HA
-0B3D          ; OLetter # Lo       ORIYA SIGN AVAGRAHA
-0B40          ; OLetter # Mc       ORIYA VOWEL SIGN II
-0B47..0B48    ; OLetter # Mc   [2] ORIYA VOWEL SIGN E..ORIYA VOWEL SIGN AI
-0B4B..0B4C    ; OLetter # Mc   [2] ORIYA VOWEL SIGN O..ORIYA VOWEL SIGN AU
-0B5C..0B5D    ; OLetter # Lo   [2] ORIYA LETTER RRA..ORIYA LETTER RHA
-0B5F..0B61    ; OLetter # Lo   [3] ORIYA LETTER YYA..ORIYA LETTER VOCALIC LL
-0B71          ; OLetter # Lo       ORIYA LETTER WA
-0B83          ; OLetter # Lo       TAMIL SIGN VISARGA
-0B85..0B8A    ; OLetter # Lo   [6] TAMIL LETTER A..TAMIL LETTER UU
-0B8E..0B90    ; OLetter # Lo   [3] TAMIL LETTER E..TAMIL LETTER AI
-0B92..0B95    ; OLetter # Lo   [4] TAMIL LETTER O..TAMIL LETTER KA
-0B99..0B9A    ; OLetter # Lo   [2] TAMIL LETTER NGA..TAMIL LETTER CA
-0B9C          ; OLetter # Lo       TAMIL LETTER JA
-0B9E..0B9F    ; OLetter # Lo   [2] TAMIL LETTER NYA..TAMIL LETTER TTA
-0BA3..0BA4    ; OLetter # Lo   [2] TAMIL LETTER NNA..TAMIL LETTER TA
-0BA8..0BAA    ; OLetter # Lo   [3] TAMIL LETTER NA..TAMIL LETTER PA
-0BAE..0BB9    ; OLetter # Lo  [12] TAMIL LETTER MA..TAMIL LETTER HA
-0BBF          ; OLetter # Mc       TAMIL VOWEL SIGN I
-0BC1..0BC2    ; OLetter # Mc   [2] TAMIL VOWEL SIGN U..TAMIL VOWEL SIGN UU
-0BC6..0BC8    ; OLetter # Mc   [3] TAMIL VOWEL SIGN E..TAMIL VOWEL SIGN AI
-0BCA..0BCC    ; OLetter # Mc   [3] TAMIL VOWEL SIGN O..TAMIL VOWEL SIGN AU
-0C01..0C03    ; OLetter # Mc   [3] TELUGU SIGN CANDRABINDU..TELUGU SIGN VISARGA
-0C05..0C0C    ; OLetter # Lo   [8] TELUGU LETTER A..TELUGU LETTER VOCALIC L
-0C0E..0C10    ; OLetter # Lo   [3] TELUGU LETTER E..TELUGU LETTER AI
-0C12..0C28    ; OLetter # Lo  [23] TELUGU LETTER O..TELUGU LETTER NA
-0C2A..0C33    ; OLetter # Lo  [10] TELUGU LETTER PA..TELUGU LETTER LLA
-0C35..0C39    ; OLetter # Lo   [5] TELUGU LETTER VA..TELUGU LETTER HA
-0C41..0C44    ; OLetter # Mc   [4] TELUGU VOWEL SIGN U..TELUGU VOWEL SIGN VOCALIC RR
-0C60..0C61    ; OLetter # Lo   [2] TELUGU LETTER VOCALIC RR..TELUGU LETTER VOCALIC LL
-0C82..0C83    ; OLetter # Mc   [2] KANNADA SIGN ANUSVARA..KANNADA SIGN VISARGA
-0C85..0C8C    ; OLetter # Lo   [8] KANNADA LETTER A..KANNADA LETTER VOCALIC L
-0C8E..0C90    ; OLetter # Lo   [3] KANNADA LETTER E..KANNADA LETTER AI
-0C92..0CA8    ; OLetter # Lo  [23] KANNADA LETTER O..KANNADA LETTER NA
-0CAA..0CB3    ; OLetter # Lo  [10] KANNADA LETTER PA..KANNADA LETTER LLA
-0CB5..0CB9    ; OLetter # Lo   [5] KANNADA LETTER VA..KANNADA LETTER HA
-0CBD          ; OLetter # Lo       KANNADA SIGN AVAGRAHA
-0CBE          ; OLetter # Mc       KANNADA VOWEL SIGN AA
-0CC0..0CC1    ; OLetter # Mc   [2] KANNADA VOWEL SIGN II..KANNADA VOWEL SIGN U
-0CC3..0CC4    ; OLetter # Mc   [2] KANNADA VOWEL SIGN VOCALIC R..KANNADA VOWEL SIGN VOCALIC RR
-0CC7..0CC8    ; OLetter # Mc   [2] KANNADA VOWEL SIGN EE..KANNADA VOWEL SIGN AI
-0CCA..0CCB    ; OLetter # Mc   [2] KANNADA VOWEL SIGN O..KANNADA VOWEL SIGN OO
-0CDE          ; OLetter # Lo       KANNADA LETTER FA
-0CE0..0CE1    ; OLetter # Lo   [2] KANNADA LETTER VOCALIC RR..KANNADA LETTER VOCALIC LL
-0D02..0D03    ; OLetter # Mc   [2] MALAYALAM SIGN ANUSVARA..MALAYALAM SIGN VISARGA
-0D05..0D0C    ; OLetter # Lo   [8] MALAYALAM LETTER A..MALAYALAM LETTER VOCALIC L
-0D0E..0D10    ; OLetter # Lo   [3] MALAYALAM LETTER E..MALAYALAM LETTER AI
-0D12..0D28    ; OLetter # Lo  [23] MALAYALAM LETTER O..MALAYALAM LETTER NA
-0D2A..0D39    ; OLetter # Lo  [16] MALAYALAM LETTER PA..MALAYALAM LETTER HA
-0D3F..0D40    ; OLetter # Mc   [2] MALAYALAM VOWEL SIGN I..MALAYALAM VOWEL SIGN II
-0D46..0D48    ; OLetter # Mc   [3] MALAYALAM VOWEL SIGN E..MALAYALAM VOWEL SIGN AI
-0D4A..0D4C    ; OLetter # Mc   [3] MALAYALAM VOWEL SIGN O..MALAYALAM VOWEL SIGN AU
-0D60..0D61    ; OLetter # Lo   [2] MALAYALAM LETTER VOCALIC RR..MALAYALAM LETTER VOCALIC LL
-0D82..0D83    ; OLetter # Mc   [2] SINHALA SIGN ANUSVARAYA..SINHALA SIGN VISARGAYA
-0D85..0D96    ; OLetter # Lo  [18] SINHALA LETTER AYANNA..SINHALA LETTER AUYANNA
-0D9A..0DB1    ; OLetter # Lo  [24] SINHALA LETTER ALPAPRAANA KAYANNA..SINHALA LETTER DANTAJA NAYANNA
-0DB3..0DBB    ; OLetter # Lo   [9] SINHALA LETTER SANYAKA DAYANNA..SINHALA LETTER RAYANNA
-0DBD          ; OLetter # Lo       SINHALA LETTER DANTAJA LAYANNA
-0DC0..0DC6    ; OLetter # Lo   [7] SINHALA LETTER VAYANNA..SINHALA LETTER FAYANNA
-0DD0..0DD1    ; OLetter # Mc   [2] SINHALA VOWEL SIGN KETTI AEDA-PILLA..SINHALA VOWEL SIGN DIGA AEDA-PILLA
-0DD8..0DDE    ; OLetter # Mc   [7] SINHALA VOWEL SIGN GAETTA-PILLA..SINHALA VOWEL SIGN KOMBUVA HAA GAYANUKITTA
-0DF2..0DF3    ; OLetter # Mc   [2] SINHALA VOWEL SIGN DIGA GAETTA-PILLA..SINHALA VOWEL SIGN DIGA GAYANUKITTA
-0E01..0E30    ; OLetter # Lo  [48] THAI CHARACTER KO KAI..THAI CHARACTER SARA A
-0E32..0E33    ; OLetter # Lo   [2] THAI CHARACTER SARA AA..THAI CHARACTER SARA AM
-0E40..0E45    ; OLetter # Lo   [6] THAI CHARACTER SARA E..THAI CHARACTER LAKKHANGYAO
-0E46          ; OLetter # Lm       THAI CHARACTER MAIYAMOK
-0E81..0E82    ; OLetter # Lo   [2] LAO LETTER KO..LAO LETTER KHO SUNG
-0E84          ; OLetter # Lo       LAO LETTER KHO TAM
-0E87..0E88    ; OLetter # Lo   [2] LAO LETTER NGO..LAO LETTER CO
-0E8A          ; OLetter # Lo       LAO LETTER SO TAM
-0E8D          ; OLetter # Lo       LAO LETTER NYO
-0E94..0E97    ; OLetter # Lo   [4] LAO LETTER DO..LAO LETTER THO TAM
-0E99..0E9F    ; OLetter # Lo   [7] LAO LETTER NO..LAO LETTER FO SUNG
-0EA1..0EA3    ; OLetter # Lo   [3] LAO LETTER MO..LAO LETTER LO LING
-0EA5          ; OLetter # Lo       LAO LETTER LO LOOT
-0EA7          ; OLetter # Lo       LAO LETTER WO
-0EAA..0EAB    ; OLetter # Lo   [2] LAO LETTER SO SUNG..LAO LETTER HO SUNG
-0EAD..0EB0    ; OLetter # Lo   [4] LAO LETTER O..LAO VOWEL SIGN A
-0EB2..0EB3    ; OLetter # Lo   [2] LAO VOWEL SIGN AA..LAO VOWEL SIGN AM
-0EBD          ; OLetter # Lo       LAO SEMIVOWEL SIGN NYO
-0EC0..0EC4    ; OLetter # Lo   [5] LAO VOWEL SIGN E..LAO VOWEL SIGN AI
-0EC6          ; OLetter # Lm       LAO KO LA
-0EDC..0EDD    ; OLetter # Lo   [2] LAO HO NO..LAO HO MO
-0F00          ; OLetter # Lo       TIBETAN SYLLABLE OM
-0F40..0F47    ; OLetter # Lo   [8] TIBETAN LETTER KA..TIBETAN LETTER JA
-0F49..0F6A    ; OLetter # Lo  [34] TIBETAN LETTER NYA..TIBETAN LETTER FIXED-FORM RA
-0F7F          ; OLetter # Mc       TIBETAN SIGN RNAM BCAD
-0F88..0F8B    ; OLetter # Lo   [4] TIBETAN SIGN LCE TSA CAN..TIBETAN SIGN GRU MED RGYINGS
-1000..1021    ; OLetter # Lo  [34] MYANMAR LETTER KA..MYANMAR LETTER A
-1023..1027    ; OLetter # Lo   [5] MYANMAR LETTER I..MYANMAR LETTER E
-1029..102A    ; OLetter # Lo   [2] MYANMAR LETTER O..MYANMAR LETTER AU
-102C          ; OLetter # Mc       MYANMAR VOWEL SIGN AA
-1031          ; OLetter # Mc       MYANMAR VOWEL SIGN E
-1038          ; OLetter # Mc       MYANMAR SIGN VISARGA
-1050..1055    ; OLetter # Lo   [6] MYANMAR LETTER SHA..MYANMAR LETTER VOCALIC LL
-1056..1057    ; OLetter # Mc   [2] MYANMAR VOWEL SIGN VOCALIC R..MYANMAR VOWEL SIGN VOCALIC RR
-10D0..10FA    ; OLetter # Lo  [43] GEORGIAN LETTER AN..GEORGIAN LETTER AIN
-10FC          ; OLetter # Lm       MODIFIER LETTER GEORGIAN NAR
-1100..1159    ; OLetter # Lo  [90] HANGUL CHOSEONG KIYEOK..HANGUL CHOSEONG YEORINHIEUH
-115F..11A2    ; OLetter # Lo  [68] HANGUL CHOSEONG FILLER..HANGUL JUNGSEONG SSANGARAEA
-11A8..11F9    ; OLetter # Lo  [82] HANGUL JONGSEONG KIYEOK..HANGUL JONGSEONG YEORINHIEUH
-1200..1248    ; OLetter # Lo  [73] ETHIOPIC SYLLABLE HA..ETHIOPIC SYLLABLE QWA
-124A..124D    ; OLetter # Lo   [4] ETHIOPIC SYLLABLE QWI..ETHIOPIC SYLLABLE QWE
-1250..1256    ; OLetter # Lo   [7] ETHIOPIC SYLLABLE QHA..ETHIOPIC SYLLABLE QHO
-1258          ; OLetter # Lo       ETHIOPIC SYLLABLE QHWA
-125A..125D    ; OLetter # Lo   [4] ETHIOPIC SYLLABLE QHWI..ETHIOPIC SYLLABLE QHWE
-1260..1288    ; OLetter # Lo  [41] ETHIOPIC SYLLABLE BA..ETHIOPIC SYLLABLE XWA
-128A..128D    ; OLetter # Lo   [4] ETHIOPIC SYLLABLE XWI..ETHIOPIC SYLLABLE XWE
-1290..12B0    ; OLetter # Lo  [33] ETHIOPIC SYLLABLE NA..ETHIOPIC SYLLABLE KWA
-12B2..12B5    ; OLetter # Lo   [4] ETHIOPIC SYLLABLE KWI..ETHIOPIC SYLLABLE KWE
-12B8..12BE    ; OLetter # Lo   [7] ETHIOPIC SYLLABLE KXA..ETHIOPIC SYLLABLE KXO
-12C0          ; OLetter # Lo       ETHIOPIC SYLLABLE KXWA
-12C2..12C5    ; OLetter # Lo   [4] ETHIOPIC SYLLABLE KXWI..ETHIOPIC SYLLABLE KXWE
-12C8..12D6    ; OLetter # Lo  [15] ETHIOPIC SYLLABLE WA..ETHIOPIC SYLLABLE PHARYNGEAL O
-12D8..1310    ; OLetter # Lo  [57] ETHIOPIC SYLLABLE ZA..ETHIOPIC SYLLABLE GWA
-1312..1315    ; OLetter # Lo   [4] ETHIOPIC SYLLABLE GWI..ETHIOPIC SYLLABLE GWE
-1318..135A    ; OLetter # Lo  [67] ETHIOPIC SYLLABLE GGA..ETHIOPIC SYLLABLE FYA
-1380..138F    ; OLetter # Lo  [16] ETHIOPIC SYLLABLE SEBATBEIT MWA..ETHIOPIC SYLLABLE PWE
-13A0..13F4    ; OLetter # Lo  [85] CHEROKEE LETTER A..CHEROKEE LETTER YV
-1401..166C    ; OLetter # Lo [620] CANADIAN SYLLABICS E..CANADIAN SYLLABICS CARRIER TTSA
-166F..1676    ; OLetter # Lo   [8] CANADIAN SYLLABICS QAI..CANADIAN SYLLABICS NNGAA
-1681..169A    ; OLetter # Lo  [26] OGHAM LETTER BEITH..OGHAM LETTER PEITH
-16A0..16EA    ; OLetter # Lo  [75] RUNIC LETTER FEHU FEOH FE F..RUNIC LETTER X
-16EE..16F0    ; OLetter # Nl   [3] RUNIC ARLAUG SYMBOL..RUNIC BELGTHOR SYMBOL
-1700..170C    ; OLetter # Lo  [13] TAGALOG LETTER A..TAGALOG LETTER YA
-170E..1711    ; OLetter # Lo   [4] TAGALOG LETTER LA..TAGALOG LETTER HA
-1720..1731    ; OLetter # Lo  [18] HANUNOO LETTER A..HANUNOO LETTER HA
-1740..1751    ; OLetter # Lo  [18] BUHID LETTER A..BUHID LETTER HA
-1760..176C    ; OLetter # Lo  [13] TAGBANWA LETTER A..TAGBANWA LETTER YA
-176E..1770    ; OLetter # Lo   [3] TAGBANWA LETTER LA..TAGBANWA LETTER SA
-1780..17B3    ; OLetter # Lo  [52] KHMER LETTER KA..KHMER INDEPENDENT VOWEL QAU
-17B6          ; OLetter # Mc       KHMER VOWEL SIGN AA
-17BE..17C5    ; OLetter # Mc   [8] KHMER VOWEL SIGN OE..KHMER VOWEL SIGN AU
-17C7..17C8    ; OLetter # Mc   [2] KHMER SIGN REAHMUK..KHMER SIGN YUUKALEAPINTU
-17D7          ; OLetter # Lm       KHMER SIGN LEK TOO
-17DC          ; OLetter # Lo       KHMER SIGN AVAKRAHASANYA
-1820..1842    ; OLetter # Lo  [35] MONGOLIAN LETTER A..MONGOLIAN LETTER CHI
-1843          ; OLetter # Lm       MONGOLIAN LETTER TODO LONG VOWEL SIGN
-1844..1877    ; OLetter # Lo  [52] MONGOLIAN LETTER TODO E..MONGOLIAN LETTER MANCHU ZHA
-1880..18A8    ; OLetter # Lo  [41] MONGOLIAN LETTER ALI GALI ANUSVARA ONE..MONGOLIAN LETTER MANCHU ALI GALI BHA
-1900..191C    ; OLetter # Lo  [29] LIMBU VOWEL-CARRIER LETTER..LIMBU LETTER HA
-1923..1926    ; OLetter # Mc   [4] LIMBU VOWEL SIGN EE..LIMBU VOWEL SIGN AU
-1929..192B    ; OLetter # Mc   [3] LIMBU SUBJOINED LETTER YA..LIMBU SUBJOINED LETTER WA
-1930..1931    ; OLetter # Mc   [2] LIMBU SMALL LETTER KA..LIMBU SMALL LETTER NGA
-1933..1938    ; OLetter # Mc   [6] LIMBU SMALL LETTER TA..LIMBU SMALL LETTER LA
-1950..196D    ; OLetter # Lo  [30] TAI LE LETTER KA..TAI LE LETTER AI
-1970..1974    ; OLetter # Lo   [5] TAI LE LETTER TONE-2..TAI LE LETTER TONE-6
-1980..19A9    ; OLetter # Lo  [42] NEW TAI LUE LETTER HIGH QA..NEW TAI LUE LETTER LOW XVA
-19B0..19C0    ; OLetter # Mc  [17] NEW TAI LUE VOWEL SIGN VOWEL SHORTENER..NEW TAI LUE VOWEL SIGN IY
-19C1..19C7    ; OLetter # Lo   [7] NEW TAI LUE LETTER FINAL V..NEW TAI LUE LETTER FINAL B
-19C8..19C9    ; OLetter # Mc   [2] NEW TAI LUE TONE MARK-1..NEW TAI LUE TONE MARK-2
-1A00..1A16    ; OLetter # Lo  [23] BUGINESE LETTER KA..BUGINESE LETTER HA
-1A19..1A1B    ; OLetter # Mc   [3] BUGINESE VOWEL SIGN E..BUGINESE VOWEL SIGN AE
-1B04          ; OLetter # Mc       BALINESE SIGN BISAH
-1B05..1B33    ; OLetter # Lo  [47] BALINESE LETTER AKARA..BALINESE LETTER HA
-1B35          ; OLetter # Mc       BALINESE VOWEL SIGN TEDUNG
-1B3B          ; OLetter # Mc       BALINESE VOWEL SIGN RA REPA TEDUNG
-1B3D..1B41    ; OLetter # Mc   [5] BALINESE VOWEL SIGN LA LENGA TEDUNG..BALINESE VOWEL SIGN TALING REPA TEDUNG
-1B43          ; OLetter # Mc       BALINESE VOWEL SIGN PEPET TEDUNG
-1B45..1B4B    ; OLetter # Lo   [7] BALINESE LETTER KAF SASAK..BALINESE LETTER ASYURA SASAK
-2135..2138    ; OLetter # Lo   [4] ALEF SYMBOL..DALET SYMBOL
-2180..2182    ; OLetter # Nl   [3] ROMAN NUMERAL ONE THOUSAND C D..ROMAN NUMERAL TEN THOUSAND
-2D30..2D65    ; OLetter # Lo  [54] TIFINAGH LETTER YA..TIFINAGH LETTER YAZZ
-2D6F          ; OLetter # Lm       TIFINAGH MODIFIER LETTER LABIALIZATION MARK
-2D80..2D96    ; OLetter # Lo  [23] ETHIOPIC SYLLABLE LOA..ETHIOPIC SYLLABLE GGWE
-2DA0..2DA6    ; OLetter # Lo   [7] ETHIOPIC SYLLABLE SSA..ETHIOPIC SYLLABLE SSO
-2DA8..2DAE    ; OLetter # Lo   [7] ETHIOPIC SYLLABLE CCA..ETHIOPIC SYLLABLE CCO
-2DB0..2DB6    ; OLetter # Lo   [7] ETHIOPIC SYLLABLE ZZA..ETHIOPIC SYLLABLE ZZO
-2DB8..2DBE    ; OLetter # Lo   [7] ETHIOPIC SYLLABLE CCHA..ETHIOPIC SYLLABLE CCHO
-2DC0..2DC6    ; OLetter # Lo   [7] ETHIOPIC SYLLABLE QYA..ETHIOPIC SYLLABLE QYO
-2DC8..2DCE    ; OLetter # Lo   [7] ETHIOPIC SYLLABLE KYA..ETHIOPIC SYLLABLE KYO
-2DD0..2DD6    ; OLetter # Lo   [7] ETHIOPIC SYLLABLE XYA..ETHIOPIC SYLLABLE XYO
-2DD8..2DDE    ; OLetter # Lo   [7] ETHIOPIC SYLLABLE GYA..ETHIOPIC SYLLABLE GYO
-3005          ; OLetter # Lm       IDEOGRAPHIC ITERATION MARK
-3006          ; OLetter # Lo       IDEOGRAPHIC CLOSING MARK
-3007          ; OLetter # Nl       IDEOGRAPHIC NUMBER ZERO
-3021..3029    ; OLetter # Nl   [9] HANGZHOU NUMERAL ONE..HANGZHOU NUMERAL NINE
-3031..3035    ; OLetter # Lm   [5] VERTICAL KANA REPEAT MARK..VERTICAL KANA REPEAT MARK LOWER HALF
-3038..303A    ; OLetter # Nl   [3] HANGZHOU NUMERAL TEN..HANGZHOU NUMERAL THIRTY
-303B          ; OLetter # Lm       VERTICAL IDEOGRAPHIC ITERATION MARK
-303C          ; OLetter # Lo       MASU MARK
-3041..3096    ; OLetter # Lo  [86] HIRAGANA LETTER SMALL A..HIRAGANA LETTER SMALL KE
-309D..309E    ; OLetter # Lm   [2] HIRAGANA ITERATION MARK..HIRAGANA VOICED ITERATION MARK
-309F          ; OLetter # Lo       HIRAGANA DIGRAPH YORI
-30A1..30FA    ; OLetter # Lo  [90] KATAKANA LETTER SMALL A..KATAKANA LETTER VO
-30FC..30FE    ; OLetter # Lm   [3] KATAKANA-HIRAGANA PROLONGED SOUND MARK..KATAKANA VOICED ITERATION MARK
-30FF          ; OLetter # Lo       KATAKANA DIGRAPH KOTO
-3105..312C    ; OLetter # Lo  [40] BOPOMOFO LETTER B..BOPOMOFO LETTER GN
-3131..318E    ; OLetter # Lo  [94] HANGUL LETTER KIYEOK..HANGUL LETTER ARAEAE
-31A0..31B7    ; OLetter # Lo  [24] BOPOMOFO LETTER BU..BOPOMOFO FINAL LETTER H
-31F0..31FF    ; OLetter # Lo  [16] KATAKANA LETTER SMALL KU..KATAKANA LETTER SMALL RO
-3400..4DB5    ; OLetter # Lo [6582] CJK UNIFIED IDEOGRAPH-3400..CJK UNIFIED IDEOGRAPH-4DB5
-4E00..9FBB    ; OLetter # Lo [20924] CJK UNIFIED IDEOGRAPH-4E00..CJK UNIFIED IDEOGRAPH-9FBB
-A000..A014    ; OLetter # Lo  [21] YI SYLLABLE IT..YI SYLLABLE E
-A015          ; OLetter # Lm       YI SYLLABLE WU
-A016..A48C    ; OLetter # Lo [1143] YI SYLLABLE BIT..YI SYLLABLE YYR
-A717..A71A    ; OLetter # Lm   [4] MODIFIER LETTER DOT VERTICAL BAR..MODIFIER LETTER LOWER RIGHT CORNER ANGLE
-A800..A801    ; OLetter # Lo   [2] SYLOTI NAGRI LETTER A..SYLOTI NAGRI LETTER I
-A803..A805    ; OLetter # Lo   [3] SYLOTI NAGRI LETTER U..SYLOTI NAGRI LETTER O
-A807..A80A    ; OLetter # Lo   [4] SYLOTI NAGRI LETTER KO..SYLOTI NAGRI LETTER GHO
-A80C..A822    ; OLetter # Lo  [23] SYLOTI NAGRI LETTER CO..SYLOTI NAGRI LETTER HO
-A823..A824    ; OLetter # Mc   [2] SYLOTI NAGRI VOWEL SIGN A..SYLOTI NAGRI VOWEL SIGN I
-A827          ; OLetter # Mc       SYLOTI NAGRI VOWEL SIGN OO
-A840..A873    ; OLetter # Lo  [52] PHAGS-PA LETTER KA..PHAGS-PA LETTER CANDRABINDU
-AC00..D7A3    ; OLetter # Lo [11172] HANGUL SYLLABLE GA..HANGUL SYLLABLE HIH
-F900..FA2D    ; OLetter # Lo [302] CJK COMPATIBILITY IDEOGRAPH-F900..CJK COMPATIBILITY IDEOGRAPH-FA2D
-FA30..FA6A    ; OLetter # Lo  [59] CJK COMPATIBILITY IDEOGRAPH-FA30..CJK COMPATIBILITY IDEOGRAPH-FA6A
-FA70..FAD9    ; OLetter # Lo [106] CJK COMPATIBILITY IDEOGRAPH-FA70..CJK COMPATIBILITY IDEOGRAPH-FAD9
-FB1D          ; OLetter # Lo       HEBREW LETTER YOD WITH HIRIQ
-FB1F..FB28    ; OLetter # Lo  [10] HEBREW LIGATURE YIDDISH YOD YOD PATAH..HEBREW LETTER WIDE TAV
-FB2A..FB36    ; OLetter # Lo  [13] HEBREW LETTER SHIN WITH SHIN DOT..HEBREW LETTER ZAYIN WITH DAGESH
-FB38..FB3C    ; OLetter # Lo   [5] HEBREW LETTER TET WITH DAGESH..HEBREW LETTER LAMED WITH DAGESH
-FB3E          ; OLetter # Lo       HEBREW LETTER MEM WITH DAGESH
-FB40..FB41    ; OLetter # Lo   [2] HEBREW LETTER NUN WITH DAGESH..HEBREW LETTER SAMEKH WITH DAGESH
-FB43..FB44    ; OLetter # Lo   [2] HEBREW LETTER FINAL PE WITH DAGESH..HEBREW LETTER PE WITH DAGESH
-FB46..FBB1    ; OLetter # Lo [108] HEBREW LETTER TSADI WITH DAGESH..ARABIC LETTER YEH BARREE WITH HAMZA ABOVE FINAL FORM
-FBD3..FD3D    ; OLetter # Lo [363] ARABIC LETTER NG ISOLATED FORM..ARABIC LIGATURE ALEF WITH FATHATAN ISOLATED FORM
-FD50..FD8F    ; OLetter # Lo  [64] ARABIC LIGATURE TEH WITH JEEM WITH MEEM INITIAL FORM..ARABIC LIGATURE MEEM WITH KHAH WITH MEEM INITIAL FORM
-FD92..FDC7    ; OLetter # Lo  [54] ARABIC LIGATURE MEEM WITH JEEM WITH KHAH INITIAL FORM..ARABIC LIGATURE NOON WITH JEEM WITH YEH FINAL FORM
-FDF0..FDFB    ; OLetter # Lo  [12] ARABIC LIGATURE SALLA USED AS KORANIC STOP SIGN ISOLATED FORM..ARABIC LIGATURE JALLAJALALOUHOU
-FE70..FE74    ; OLetter # Lo   [5] ARABIC FATHATAN ISOLATED FORM..ARABIC KASRATAN ISOLATED FORM
-FE76..FEFC    ; OLetter # Lo [135] ARABIC FATHA ISOLATED FORM..ARABIC LIGATURE LAM WITH ALEF FINAL FORM
-FF66..FF6F    ; OLetter # Lo  [10] HALFWIDTH KATAKANA LETTER WO..HALFWIDTH KATAKANA LETTER SMALL TU
-FF70          ; OLetter # Lm       HALFWIDTH KATAKANA-HIRAGANA PROLONGED SOUND MARK
-FF71..FF9D    ; OLetter # Lo  [45] HALFWIDTH KATAKANA LETTER A..HALFWIDTH KATAKANA LETTER N
-FF9E..FF9F    ; OLetter # Lm   [2] HALFWIDTH KATAKANA VOICED SOUND MARK..HALFWIDTH KATAKANA SEMI-VOICED SOUND MARK
-FFA0..FFBE    ; OLetter # Lo  [31] HALFWIDTH HANGUL FILLER..HALFWIDTH HANGUL LETTER HIEUH
-FFC2..FFC7    ; OLetter # Lo   [6] HALFWIDTH HANGUL LETTER A..HALFWIDTH HANGUL LETTER E
-FFCA..FFCF    ; OLetter # Lo   [6] HALFWIDTH HANGUL LETTER YEO..HALFWIDTH HANGUL LETTER OE
-FFD2..FFD7    ; OLetter # Lo   [6] HALFWIDTH HANGUL LETTER YO..HALFWIDTH HANGUL LETTER YU
-FFDA..FFDC    ; OLetter # Lo   [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HANGUL LETTER I
-10000..1000B  ; OLetter # Lo  [12] LINEAR B SYLLABLE B008 A..LINEAR B SYLLABLE B046 JE
-1000D..10026  ; OLetter # Lo  [26] LINEAR B SYLLABLE B036 JO..LINEAR B SYLLABLE B032 QO
-10028..1003A  ; OLetter # Lo  [19] LINEAR B SYLLABLE B060 RA..LINEAR B SYLLABLE B042 WO
-1003C..1003D  ; OLetter # Lo   [2] LINEAR B SYLLABLE B017 ZA..LINEAR B SYLLABLE B074 ZE
-1003F..1004D  ; OLetter # Lo  [15] LINEAR B SYLLABLE B020 ZO..LINEAR B SYLLABLE B091 TWO
-10050..1005D  ; OLetter # Lo  [14] LINEAR B SYMBOL B018..LINEAR B SYMBOL B089
-10080..100FA  ; OLetter # Lo [123] LINEAR B IDEOGRAM B100 MAN..LINEAR B IDEOGRAM VESSEL B305
-10140..10174  ; OLetter # Nl  [53] GREEK ACROPHONIC ATTIC ONE QUARTER..GREEK ACROPHONIC STRATIAN FIFTY MNAS
-10300..1031E  ; OLetter # Lo  [31] OLD ITALIC LETTER A..OLD ITALIC LETTER UU
-10330..10340  ; OLetter # Lo  [17] GOTHIC LETTER AHSA..GOTHIC LETTER PAIRTHRA
-10341         ; OLetter # Nl       GOTHIC LETTER NINETY
-10342..10349  ; OLetter # Lo   [8] GOTHIC LETTER RAIDA..GOTHIC LETTER OTHAL
-1034A         ; OLetter # Nl       GOTHIC LETTER NINE HUNDRED
-10380..1039D  ; OLetter # Lo  [30] UGARITIC LETTER ALPA..UGARITIC LETTER SSU
-103A0..103C3  ; OLetter # Lo  [36] OLD PERSIAN SIGN A..OLD PERSIAN SIGN HA
-103C8..103CF  ; OLetter # Lo   [8] OLD PERSIAN SIGN AURAMAZDAA..OLD PERSIAN SIGN BUUMISH
-103D1..103D5  ; OLetter # Nl   [5] OLD PERSIAN NUMBER ONE..OLD PERSIAN NUMBER HUNDRED
-10450..1049D  ; OLetter # Lo  [78] SHAVIAN LETTER PEEP..OSMANYA LETTER OO
-10800..10805  ; OLetter # Lo   [6] CYPRIOT SYLLABLE A..CYPRIOT SYLLABLE JA
-10808         ; OLetter # Lo       CYPRIOT SYLLABLE JO
-1080A..10835  ; OLetter # Lo  [44] CYPRIOT SYLLABLE KA..CYPRIOT SYLLABLE WO
-10837..10838  ; OLetter # Lo   [2] CYPRIOT SYLLABLE XA..CYPRIOT SYLLABLE XE
-1083C         ; OLetter # Lo       CYPRIOT SYLLABLE ZA
-1083F         ; OLetter # Lo       CYPRIOT SYLLABLE ZO
-10900..10915  ; OLetter # Lo  [22] PHOENICIAN LETTER ALF..PHOENICIAN LETTER TAU
-10A00         ; OLetter # Lo       KHAROSHTHI LETTER A
-10A10..10A13  ; OLetter # Lo   [4] KHAROSHTHI LETTER KA..KHAROSHTHI LETTER GHA
-10A15..10A17  ; OLetter # Lo   [3] KHAROSHTHI LETTER CA..KHAROSHTHI LETTER JA
-10A19..10A33  ; OLetter # Lo  [27] KHAROSHTHI LETTER NYA..KHAROSHTHI LETTER TTTHA
-12000..1236E  ; OLetter # Lo [879] CUNEIFORM SIGN A..CUNEIFORM SIGN ZUM
-12400..12462  ; OLetter # Nl  [99] CUNEIFORM NUMERIC SIGN TWO ASH..CUNEIFORM NUMERIC SIGN OLD ASSYRIAN ONE QUARTER
-20000..2A6D6  ; OLetter # Lo [42711] CJK UNIFIED IDEOGRAPH-20000..CJK UNIFIED IDEOGRAPH-2A6D6
-2F800..2FA1D  ; OLetter # Lo [542] CJK COMPATIBILITY IDEOGRAPH-2F800..CJK COMPATIBILITY IDEOGRAPH-2FA1D
-
-# Total code points: 89727
-
-# ================================================
-
-0030..0039    ; Numeric # Nd  [10] DIGIT ZERO..DIGIT NINE
-0660..0669    ; Numeric # Nd  [10] ARABIC-INDIC DIGIT ZERO..ARABIC-INDIC DIGIT NINE
-066B..066C    ; Numeric # Po   [2] ARABIC DECIMAL SEPARATOR..ARABIC THOUSANDS SEPARATOR
-06F0..06F9    ; Numeric # Nd  [10] EXTENDED ARABIC-INDIC DIGIT ZERO..EXTENDED ARABIC-INDIC DIGIT NINE
-07C0..07C9    ; Numeric # Nd  [10] NKO DIGIT ZERO..NKO DIGIT NINE
-0966..096F    ; Numeric # Nd  [10] DEVANAGARI DIGIT ZERO..DEVANAGARI DIGIT NINE
-09E6..09EF    ; Numeric # Nd  [10] BENGALI DIGIT ZERO..BENGALI DIGIT NINE
-0A66..0A6F    ; Numeric # Nd  [10] GURMUKHI DIGIT ZERO..GURMUKHI DIGIT NINE
-0AE6..0AEF    ; Numeric # Nd  [10] GUJARATI DIGIT ZERO..GUJARATI DIGIT NINE
-0B66..0B6F    ; Numeric # Nd  [10] ORIYA DIGIT ZERO..ORIYA DIGIT NINE
-0BE6..0BEF    ; Numeric # Nd  [10] TAMIL DIGIT ZERO..TAMIL DIGIT NINE
-0C66..0C6F    ; Numeric # Nd  [10] TELUGU DIGIT ZERO..TELUGU DIGIT NINE
-0CE6..0CEF    ; Numeric # Nd  [10] KANNADA DIGIT ZERO..KANNADA DIGIT NINE
-0D66..0D6F    ; Numeric # Nd  [10] MALAYALAM DIGIT ZERO..MALAYALAM DIGIT NINE
-0E50..0E59    ; Numeric # Nd  [10] THAI DIGIT ZERO..THAI DIGIT NINE
-0ED0..0ED9    ; Numeric # Nd  [10] LAO DIGIT ZERO..LAO DIGIT NINE
-0F20..0F29    ; Numeric # Nd  [10] TIBETAN DIGIT ZERO..TIBETAN DIGIT NINE
-1040..1049    ; Numeric # Nd  [10] MYANMAR DIGIT ZERO..MYANMAR DIGIT NINE
-17E0..17E9    ; Numeric # Nd  [10] KHMER DIGIT ZERO..KHMER DIGIT NINE
-1810..1819    ; Numeric # Nd  [10] MONGOLIAN DIGIT ZERO..MONGOLIAN DIGIT NINE
-1946..194F    ; Numeric # Nd  [10] LIMBU DIGIT ZERO..LIMBU DIGIT NINE
-19D0..19D9    ; Numeric # Nd  [10] NEW TAI LUE DIGIT ZERO..NEW TAI LUE DIGIT NINE
-1B50..1B59    ; Numeric # Nd  [10] BALINESE DIGIT ZERO..BALINESE DIGIT NINE
-104A0..104A9  ; Numeric # Nd  [10] OSMANYA DIGIT ZERO..OSMANYA DIGIT NINE
-1D7CE..1D7FF  ; Numeric # Nd  [50] MATHEMATICAL BOLD DIGIT ZERO..MATHEMATICAL MONOSPACE DIGIT NINE
-
-# Total code points: 282
-
-# ================================================
-
-002E          ; ATerm # Po       FULL STOP
-
-# Total code points: 1
-
-# ================================================
-
-0021          ; STerm # Po       EXCLAMATION MARK
-003F          ; STerm # Po       QUESTION MARK
-055C          ; STerm # Po       ARMENIAN EXCLAMATION MARK
-055E          ; STerm # Po       ARMENIAN QUESTION MARK
-0589          ; STerm # Po       ARMENIAN FULL STOP
-061F          ; STerm # Po       ARABIC QUESTION MARK
-06D4          ; STerm # Po       ARABIC FULL STOP
-0700..0702    ; STerm # Po   [3] SYRIAC END OF PARAGRAPH..SYRIAC SUBLINEAR FULL STOP
-07F9          ; STerm # Po       NKO EXCLAMATION MARK
-0964..0965    ; STerm # Po   [2] DEVANAGARI DANDA..DEVANAGARI DOUBLE DANDA
-104A..104B    ; STerm # Po   [2] MYANMAR SIGN LITTLE SECTION..MYANMAR SIGN SECTION
-1362          ; STerm # Po       ETHIOPIC FULL STOP
-1367..1368    ; STerm # Po   [2] ETHIOPIC QUESTION MARK..ETHIOPIC PARAGRAPH SEPARATOR
-166E          ; STerm # Po       CANADIAN SYLLABICS FULL STOP
-1803          ; STerm # Po       MONGOLIAN FULL STOP
-1809          ; STerm # Po       MONGOLIAN MANCHU FULL STOP
-1944..1945    ; STerm # Po   [2] LIMBU EXCLAMATION MARK..LIMBU QUESTION MARK
-1B5A..1B5B    ; STerm # Po   [2] BALINESE PANTI..BALINESE PAMADA
-1B5E..1B5F    ; STerm # Po   [2] BALINESE CARIK SIKI..BALINESE CARIK PAREREN
-203C..203D    ; STerm # Po   [2] DOUBLE EXCLAMATION MARK..INTERROBANG
-2047..2049    ; STerm # Po   [3] DOUBLE QUESTION MARK..EXCLAMATION QUESTION MARK
-3002          ; STerm # Po       IDEOGRAPHIC FULL STOP
-A876..A877    ; STerm # Po   [2] PHAGS-PA MARK SHAD..PHAGS-PA MARK DOUBLE SHAD
-FE52          ; STerm # Po       SMALL FULL STOP
-FE56..FE57    ; STerm # Po   [2] SMALL QUESTION MARK..SMALL EXCLAMATION MARK
-FF01          ; STerm # Po       FULLWIDTH EXCLAMATION MARK
-FF0E          ; STerm # Po       FULLWIDTH FULL STOP
-FF1F          ; STerm # Po       FULLWIDTH QUESTION MARK
-FF61          ; STerm # Po       HALFWIDTH IDEOGRAPHIC FULL STOP
-
-# Total code points: 42
-
-# ================================================
-
-0022          ; Close # Po       QUOTATION MARK
-0027          ; Close # Po       APOSTROPHE
-0028          ; Close # Ps       LEFT PARENTHESIS
-0029          ; Close # Pe       RIGHT PARENTHESIS
-005B          ; Close # Ps       LEFT SQUARE BRACKET
-005D          ; Close # Pe       RIGHT SQUARE BRACKET
-007B          ; Close # Ps       LEFT CURLY BRACKET
-007D          ; Close # Pe       RIGHT CURLY BRACKET
-00AB          ; Close # Pi       LEFT-POINTING DOUBLE ANGLE QUOTATION MARK
-00BB          ; Close # Pf       RIGHT-POINTING DOUBLE ANGLE QUOTATION MARK
-0F3A          ; Close # Ps       TIBETAN MARK GUG RTAGS GYON
-0F3B          ; Close # Pe       TIBETAN MARK GUG RTAGS GYAS
-0F3C          ; Close # Ps       TIBETAN MARK ANG KHANG GYON
-0F3D          ; Close # Pe       TIBETAN MARK ANG KHANG GYAS
-169B          ; Close # Ps       OGHAM FEATHER MARK
-169C          ; Close # Pe       OGHAM REVERSED FEATHER MARK
-2018          ; Close # Pi       LEFT SINGLE QUOTATION MARK
-2019          ; Close # Pf       RIGHT SINGLE QUOTATION MARK
-201A          ; Close # Ps       SINGLE LOW-9 QUOTATION MARK
-201B..201C    ; Close # Pi   [2] SINGLE HIGH-REVERSED-9 QUOTATION MARK..LEFT DOUBLE QUOTATION MARK
-201D          ; Close # Pf       RIGHT DOUBLE QUOTATION MARK
-201E          ; Close # Ps       DOUBLE LOW-9 QUOTATION MARK
-201F          ; Close # Pi       DOUBLE HIGH-REVERSED-9 QUOTATION MARK
-2039          ; Close # Pi       SINGLE LEFT-POINTING ANGLE QUOTATION MARK
-203A          ; Close # Pf       SINGLE RIGHT-POINTING ANGLE QUOTATION MARK
-2045          ; Close # Ps       LEFT SQUARE BRACKET WITH QUILL
-2046          ; Close # Pe       RIGHT SQUARE BRACKET WITH QUILL
-207D          ; Close # Ps       SUPERSCRIPT LEFT PARENTHESIS
-207E          ; Close # Pe       SUPERSCRIPT RIGHT PARENTHESIS
-208D          ; Close # Ps       SUBSCRIPT LEFT PARENTHESIS
-208E          ; Close # Pe       SUBSCRIPT RIGHT PARENTHESIS
-2329          ; Close # Ps       LEFT-POINTING ANGLE BRACKET
-232A          ; Close # Pe       RIGHT-POINTING ANGLE BRACKET
-275B..275E    ; Close # So   [4] HEAVY SINGLE TURNED COMMA QUOTATION MARK ORNAMENT..HEAVY DOUBLE COMMA QUOTATION MARK ORNAMENT
-2768          ; Close # Ps       MEDIUM LEFT PARENTHESIS ORNAMENT
-2769          ; Close # Pe       MEDIUM RIGHT PARENTHESIS ORNAMENT
-276A          ; Close # Ps       MEDIUM FLATTENED LEFT PARENTHESIS ORNAMENT
-276B          ; Close # Pe       MEDIUM FLATTENED RIGHT PARENTHESIS ORNAMENT
-276C          ; Close # Ps       MEDIUM LEFT-POINTING ANGLE BRACKET ORNAMENT
-276D          ; Close # Pe       MEDIUM RIGHT-POINTING ANGLE BRACKET ORNAMENT
-276E          ; Close # Ps       HEAVY LEFT-POINTING ANGLE QUOTATION MARK ORNAMENT
-276F          ; Close # Pe       HEAVY RIGHT-POINTING ANGLE QUOTATION MARK ORNAMENT
-2770          ; Close # Ps       HEAVY LEFT-POINTING ANGLE BRACKET ORNAMENT
-2771          ; Close # Pe       HEAVY RIGHT-POINTING ANGLE BRACKET ORNAMENT
-2772          ; Close # Ps       LIGHT LEFT TORTOISE SHELL BRACKET ORNAMENT
-2773          ; Close # Pe       LIGHT RIGHT TORTOISE SHELL BRACKET ORNAMENT
-2774          ; Close # Ps       MEDIUM LEFT CURLY BRACKET ORNAMENT
-2775          ; Close # Pe       MEDIUM RIGHT CURLY BRACKET ORNAMENT
-27C5          ; Close # Ps       LEFT S-SHAPED BAG DELIMITER
-27C6          ; Close # Pe       RIGHT S-SHAPED BAG DELIMITER
-27E6          ; Close # Ps       MATHEMATICAL LEFT WHITE SQUARE BRACKET
-27E7          ; Close # Pe       MATHEMATICAL RIGHT WHITE SQUARE BRACKET
-27E8          ; Close # Ps       MATHEMATICAL LEFT ANGLE BRACKET
-27E9          ; Close # Pe       MATHEMATICAL RIGHT ANGLE BRACKET
-27EA          ; Close # Ps       MATHEMATICAL LEFT DOUBLE ANGLE BRACKET
-27EB          ; Close # Pe       MATHEMATICAL RIGHT DOUBLE ANGLE BRACKET
-2983          ; Close # Ps       LEFT WHITE CURLY BRACKET
-2984          ; Close # Pe       RIGHT WHITE CURLY BRACKET
-2985          ; Close # Ps       LEFT WHITE PARENTHESIS
-2986          ; Close # Pe       RIGHT WHITE PARENTHESIS
-2987          ; Close # Ps       Z NOTATION LEFT IMAGE BRACKET
-2988          ; Close # Pe       Z NOTATION RIGHT IMAGE BRACKET
-2989          ; Close # Ps       Z NOTATION LEFT BINDING BRACKET
-298A          ; Close # Pe       Z NOTATION RIGHT BINDING BRACKET
-298B          ; Close # Ps       LEFT SQUARE BRACKET WITH UNDERBAR
-298C          ; Close # Pe       RIGHT SQUARE BRACKET WITH UNDERBAR
-298D          ; Close # Ps       LEFT SQUARE BRACKET WITH TICK IN TOP CORNER
-298E          ; Close # Pe       RIGHT SQUARE BRACKET WITH TICK IN BOTTOM CORNER
-298F          ; Close # Ps       LEFT SQUARE BRACKET WITH TICK IN BOTTOM CORNER
-2990          ; Close # Pe       RIGHT SQUARE BRACKET WITH TICK IN TOP CORNER
-2991          ; Close # Ps       LEFT ANGLE BRACKET WITH DOT
-2992          ; Close # Pe       RIGHT ANGLE BRACKET WITH DOT
-2993          ; Close # Ps       LEFT ARC LESS-THAN BRACKET
-2994          ; Close # Pe       RIGHT ARC GREATER-THAN BRACKET
-2995          ; Close # Ps       DOUBLE LEFT ARC GREATER-THAN BRACKET
-2996          ; Close # Pe       DOUBLE RIGHT ARC LESS-THAN BRACKET
-2997          ; Close # Ps       LEFT BLACK TORTOISE SHELL BRACKET
-2998          ; Close # Pe       RIGHT BLACK TORTOISE SHELL BRACKET
-29D8          ; Close # Ps       LEFT WIGGLY FENCE
-29D9          ; Close # Pe       RIGHT WIGGLY FENCE
-29DA          ; Close # Ps       LEFT DOUBLE WIGGLY FENCE
-29DB          ; Close # Pe       RIGHT DOUBLE WIGGLY FENCE
-29FC          ; Close # Ps       LEFT-POINTING CURVED ANGLE BRACKET
-29FD          ; Close # Pe       RIGHT-POINTING CURVED ANGLE BRACKET
-2E00..2E01    ; Close # Po   [2] RIGHT ANGLE SUBSTITUTION MARKER..RIGHT ANGLE DOTTED SUBSTITUTION MARKER
-2E02          ; Close # Pi       LEFT SUBSTITUTION BRACKET
-2E03          ; Close # Pf       RIGHT SUBSTITUTION BRACKET
-2E04          ; Close # Pi       LEFT DOTTED SUBSTITUTION BRACKET
-2E05          ; Close # Pf       RIGHT DOTTED SUBSTITUTION BRACKET
-2E06..2E08    ; Close # Po   [3] RAISED INTERPOLATION MARKER..DOTTED TRANSPOSITION MARKER
-2E09          ; Close # Pi       LEFT TRANSPOSITION BRACKET
-2E0A          ; Close # Pf       RIGHT TRANSPOSITION BRACKET
-2E0B          ; Close # Po       RAISED SQUARE
-2E0C          ; Close # Pi       LEFT RAISED OMISSION BRACKET
-2E0D          ; Close # Pf       RIGHT RAISED OMISSION BRACKET
-2E1C          ; Close # Pi       LEFT LOW PARAPHRASE BRACKET
-2E1D          ; Close # Pf       RIGHT LOW PARAPHRASE BRACKET
-3008          ; Close # Ps       LEFT ANGLE BRACKET
-3009          ; Close # Pe       RIGHT ANGLE BRACKET
-300A          ; Close # Ps       LEFT DOUBLE ANGLE BRACKET
-300B          ; Close # Pe       RIGHT DOUBLE ANGLE BRACKET
-300C          ; Close # Ps       LEFT CORNER BRACKET
-300D          ; Close # Pe       RIGHT CORNER BRACKET
-300E          ; Close # Ps       LEFT WHITE CORNER BRACKET
-300F          ; Close # Pe       RIGHT WHITE CORNER BRACKET
-3010          ; Close # Ps       LEFT BLACK LENTICULAR BRACKET
-3011          ; Close # Pe       RIGHT BLACK LENTICULAR BRACKET
-3014          ; Close # Ps       LEFT TORTOISE SHELL BRACKET
-3015          ; Close # Pe       RIGHT TORTOISE SHELL BRACKET
-3016          ; Close # Ps       LEFT WHITE LENTICULAR BRACKET
-3017          ; Close # Pe       RIGHT WHITE LENTICULAR BRACKET
-3018          ; Close # Ps       LEFT WHITE TORTOISE SHELL BRACKET
-3019          ; Close # Pe       RIGHT WHITE TORTOISE SHELL BRACKET
-301A          ; Close # Ps       LEFT WHITE SQUARE BRACKET
-301B          ; Close # Pe       RIGHT WHITE SQUARE BRACKET
-301D          ; Close # Ps       REVERSED DOUBLE PRIME QUOTATION MARK
-301E..301F    ; Close # Pe   [2] DOUBLE PRIME QUOTATION MARK..LOW DOUBLE PRIME QUOTATION MARK
-FD3E          ; Close # Ps       ORNATE LEFT PARENTHESIS
-FD3F          ; Close # Pe       ORNATE RIGHT PARENTHESIS
-FE17          ; Close # Ps       PRESENTATION FORM FOR VERTICAL LEFT WHITE LENTICULAR BRACKET
-FE18          ; Close # Pe       PRESENTATION FORM FOR VERTICAL RIGHT WHITE LENTICULAR BRAKCET
-FE35          ; Close # Ps       PRESENTATION FORM FOR VERTICAL LEFT PARENTHESIS
-FE36          ; Close # Pe       PRESENTATION FORM FOR VERTICAL RIGHT PARENTHESIS
-FE37          ; Close # Ps       PRESENTATION FORM FOR VERTICAL LEFT CURLY BRACKET
-FE38          ; Close # Pe       PRESENTATION FORM FOR VERTICAL RIGHT CURLY BRACKET
-FE39          ; Close # Ps       PRESENTATION FORM FOR VERTICAL LEFT TORTOISE SHELL BRACKET
-FE3A          ; Close # Pe       PRESENTATION FORM FOR VERTICAL RIGHT TORTOISE SHELL BRACKET
-FE3B          ; Close # Ps       PRESENTATION FORM FOR VERTICAL LEFT BLACK LENTICULAR BRACKET
-FE3C          ; Close # Pe       PRESENTATION FORM FOR VERTICAL RIGHT BLACK LENTICULAR BRACKET
-FE3D          ; Close # Ps       PRESENTATION FORM FOR VERTICAL LEFT DOUBLE ANGLE BRACKET
-FE3E          ; Close # Pe       PRESENTATION FORM FOR VERTICAL RIGHT DOUBLE ANGLE BRACKET
-FE3F          ; Close # Ps       PRESENTATION FORM FOR VERTICAL LEFT ANGLE BRACKET
-FE40          ; Close # Pe       PRESENTATION FORM FOR VERTICAL RIGHT ANGLE BRACKET
-FE41          ; Close # Ps       PRESENTATION FORM FOR VERTICAL LEFT CORNER BRACKET
-FE42          ; Close # Pe       PRESENTATION FORM FOR VERTICAL RIGHT CORNER BRACKET
-FE43          ; Close # Ps       PRESENTATION FORM FOR VERTICAL LEFT WHITE CORNER BRACKET
-FE44          ; Close # Pe       PRESENTATION FORM FOR VERTICAL RIGHT WHITE CORNER BRACKET
-FE47          ; Close # Ps       PRESENTATION FORM FOR VERTICAL LEFT SQUARE BRACKET
-FE48          ; Close # Pe       PRESENTATION FORM FOR VERTICAL RIGHT SQUARE BRACKET
-FE59          ; Close # Ps       SMALL LEFT PARENTHESIS
-FE5A          ; Close # Pe       SMALL RIGHT PARENTHESIS
-FE5B          ; Close # Ps       SMALL LEFT CURLY BRACKET
-FE5C          ; Close # Pe       SMALL RIGHT CURLY BRACKET
-FE5D          ; Close # Ps       SMALL LEFT TORTOISE SHELL BRACKET
-FE5E          ; Close # Pe       SMALL RIGHT TORTOISE SHELL BRACKET
-FF08          ; Close # Ps       FULLWIDTH LEFT PARENTHESIS
-FF09          ; Close # Pe       FULLWIDTH RIGHT PARENTHESIS
-FF3B          ; Close # Ps       FULLWIDTH LEFT SQUARE BRACKET
-FF3D          ; Close # Pe       FULLWIDTH RIGHT SQUARE BRACKET
-FF5B          ; Close # Ps       FULLWIDTH LEFT CURLY BRACKET
-FF5D          ; Close # Pe       FULLWIDTH RIGHT CURLY BRACKET
-FF5F          ; Close # Ps       FULLWIDTH LEFT WHITE PARENTHESIS
-FF60          ; Close # Pe       FULLWIDTH RIGHT WHITE PARENTHESIS
-FF62          ; Close # Ps       HALFWIDTH LEFT CORNER BRACKET
-FF63          ; Close # Pe       HALFWIDTH RIGHT CORNER BRACKET
-
-# Total code points: 163
-
-# EOF
diff --git a/ucd/auxiliary/SentenceBreakTest.txt b/ucd/auxiliary/SentenceBreakTest.txt
deleted file mode 100644
index 431d0e6..0000000
--- a/ucd/auxiliary/SentenceBreakTest.txt
+++ /dev/null
@@ -1,307 +0,0 @@
-# SentenceBreakTest-5.0.0.txt
-# Date: 2006-06-11, 20:09:14 GMT [MD]
-#
-# Unicode Character Database
-# Copyright (c) 1991-2006 Unicode, Inc.
-# For terms of use, see http://www.unicode.org/terms_of_use.html
-# For documentation, see UCD.html
-#
-# Default Sentence Break Test
-#
-# Format:
-# <string> (# <comment>)? 
-#  <string> contains hex Unicode code points, with 
-#	÷ wherever there is a break opportunity, and 
-#	× wherever there is not.
-#  <comment> the format can change, but currently it shows:
-#	- the sample character name
-#	- (x) the line_break property* for the sample character
-#	- [x] the rule that determines whether there is a break or not
-#
-# These samples may be extended or changed in the future.
-#
-÷ 0023 × 0023 ÷	#  ÷ [0.2] NUMBER SIGN (Other) × [12.0] NUMBER SIGN (Other) ÷ [0.3]
-÷ 0023 × 0001 ÷	#  ÷ [0.2] NUMBER SIGN (Other) × [12.0] <START OF HEADING> (GCControl) ÷ [0.3]
-÷ 0023 × 0300 ÷	#  ÷ [0.2] NUMBER SIGN (Other) × [5.0] COMBINING GRAVE ACCENT (GCExtend) ÷ [0.3]
-÷ 0023 × 00AD ÷	#  ÷ [0.2] NUMBER SIGN (Other) × [5.0] SOFT HYPHEN (GCControl_Format) ÷ [0.3]
-÷ 0023 × 000A ÷	#  ÷ [0.2] NUMBER SIGN (Other) × [12.0] <LINE FEED (LF)> (GCLF_Sep) ÷ [0.3]
-÷ 0023 × 000D ÷	#  ÷ [0.2] NUMBER SIGN (Other) × [12.0] <CARRIAGE RETURN (CR)> (GCCR_Sep) ÷ [0.3]
-÷ 0023 × 0085 ÷	#  ÷ [0.2] NUMBER SIGN (Other) × [12.0] <NEXT LINE (NEL)> (GCControl_Sep) ÷ [0.3]
-÷ 0023 × 0009 ÷	#  ÷ [0.2] NUMBER SIGN (Other) × [12.0] <CHARACTER TABULATION> (GCControl_Sp) ÷ [0.3]
-÷ 0023 × 0020 ÷	#  ÷ [0.2] NUMBER SIGN (Other) × [12.0] SPACE (Sp) ÷ [0.3]
-÷ 0023 × 0061 ÷	#  ÷ [0.2] NUMBER SIGN (Other) × [12.0] LATIN SMALL LETTER A (Lower) ÷ [0.3]
-÷ 0023 × 0041 ÷	#  ÷ [0.2] NUMBER SIGN (Other) × [12.0] LATIN CAPITAL LETTER A (Upper) ÷ [0.3]
-÷ 0023 × 00A0 ÷	#  ÷ [0.2] NUMBER SIGN (Other) × [12.0] NO-BREAK SPACE (OLetter) ÷ [0.3]
-÷ 0023 × 0030 ÷	#  ÷ [0.2] NUMBER SIGN (Other) × [12.0] DIGIT ZERO (Numeric) ÷ [0.3]
-÷ 0023 × 002E ÷	#  ÷ [0.2] NUMBER SIGN (Other) × [12.0] FULL STOP (ATerm) ÷ [0.3]
-÷ 0023 × 0021 ÷	#  ÷ [0.2] NUMBER SIGN (Other) × [12.0] EXCLAMATION MARK (STerm) ÷ [0.3]
-÷ 0023 × 0022 ÷	#  ÷ [0.2] NUMBER SIGN (Other) × [12.0] QUOTATION MARK (Close) ÷ [0.3]
-÷ 0001 × 0023 ÷	#  ÷ [0.2] <START OF HEADING> (GCControl) × [12.0] NUMBER SIGN (Other) ÷ [0.3]
-÷ 0001 × 0001 ÷	#  ÷ [0.2] <START OF HEADING> (GCControl) × [12.0] <START OF HEADING> (GCControl) ÷ [0.3]
-÷ 0001 × 0300 ÷	#  ÷ [0.2] <START OF HEADING> (GCControl) × [5.0] COMBINING GRAVE ACCENT (GCExtend) ÷ [0.3]
-÷ 0001 × 00AD ÷	#  ÷ [0.2] <START OF HEADING> (GCControl) × [5.0] SOFT HYPHEN (GCControl_Format) ÷ [0.3]
-÷ 0001 × 000A ÷	#  ÷ [0.2] <START OF HEADING> (GCControl) × [12.0] <LINE FEED (LF)> (GCLF_Sep) ÷ [0.3]
-÷ 0001 × 000D ÷	#  ÷ [0.2] <START OF HEADING> (GCControl) × [12.0] <CARRIAGE RETURN (CR)> (GCCR_Sep) ÷ [0.3]
-÷ 0001 × 0085 ÷	#  ÷ [0.2] <START OF HEADING> (GCControl) × [12.0] <NEXT LINE (NEL)> (GCControl_Sep) ÷ [0.3]
-÷ 0001 × 0009 ÷	#  ÷ [0.2] <START OF HEADING> (GCControl) × [12.0] <CHARACTER TABULATION> (GCControl_Sp) ÷ [0.3]
-÷ 0001 × 0020 ÷	#  ÷ [0.2] <START OF HEADING> (GCControl) × [12.0] SPACE (Sp) ÷ [0.3]
-÷ 0001 × 0061 ÷	#  ÷ [0.2] <START OF HEADING> (GCControl) × [12.0] LATIN SMALL LETTER A (Lower) ÷ [0.3]
-÷ 0001 × 0041 ÷	#  ÷ [0.2] <START OF HEADING> (GCControl) × [12.0] LATIN CAPITAL LETTER A (Upper) ÷ [0.3]
-÷ 0001 × 00A0 ÷	#  ÷ [0.2] <START OF HEADING> (GCControl) × [12.0] NO-BREAK SPACE (OLetter) ÷ [0.3]
-÷ 0001 × 0030 ÷	#  ÷ [0.2] <START OF HEADING> (GCControl) × [12.0] DIGIT ZERO (Numeric) ÷ [0.3]
-÷ 0001 × 002E ÷	#  ÷ [0.2] <START OF HEADING> (GCControl) × [12.0] FULL STOP (ATerm) ÷ [0.3]
-÷ 0001 × 0021 ÷	#  ÷ [0.2] <START OF HEADING> (GCControl) × [12.0] EXCLAMATION MARK (STerm) ÷ [0.3]
-÷ 0001 × 0022 ÷	#  ÷ [0.2] <START OF HEADING> (GCControl) × [12.0] QUOTATION MARK (Close) ÷ [0.3]
-÷ 0300 × 0023 ÷	#  ÷ [0.2] COMBINING GRAVE ACCENT (GCExtend) × [12.0] NUMBER SIGN (Other) ÷ [0.3]
-÷ 0300 × 0001 ÷	#  ÷ [0.2] COMBINING GRAVE ACCENT (GCExtend) × [12.0] <START OF HEADING> (GCControl) ÷ [0.3]
-÷ 0300 × 0300 ÷	#  ÷ [0.2] COMBINING GRAVE ACCENT (GCExtend) × [5.0] COMBINING GRAVE ACCENT (GCExtend) ÷ [0.3]
-÷ 0300 × 00AD ÷	#  ÷ [0.2] COMBINING GRAVE ACCENT (GCExtend) × [5.0] SOFT HYPHEN (GCControl_Format) ÷ [0.3]
-÷ 0300 × 000A ÷	#  ÷ [0.2] COMBINING GRAVE ACCENT (GCExtend) × [12.0] <LINE FEED (LF)> (GCLF_Sep) ÷ [0.3]
-÷ 0300 × 000D ÷	#  ÷ [0.2] COMBINING GRAVE ACCENT (GCExtend) × [12.0] <CARRIAGE RETURN (CR)> (GCCR_Sep) ÷ [0.3]
-÷ 0300 × 0085 ÷	#  ÷ [0.2] COMBINING GRAVE ACCENT (GCExtend) × [12.0] <NEXT LINE (NEL)> (GCControl_Sep) ÷ [0.3]
-÷ 0300 × 0009 ÷	#  ÷ [0.2] COMBINING GRAVE ACCENT (GCExtend) × [12.0] <CHARACTER TABULATION> (GCControl_Sp) ÷ [0.3]
-÷ 0300 × 0020 ÷	#  ÷ [0.2] COMBINING GRAVE ACCENT (GCExtend) × [12.0] SPACE (Sp) ÷ [0.3]
-÷ 0300 × 0061 ÷	#  ÷ [0.2] COMBINING GRAVE ACCENT (GCExtend) × [12.0] LATIN SMALL LETTER A (Lower) ÷ [0.3]
-÷ 0300 × 0041 ÷	#  ÷ [0.2] COMBINING GRAVE ACCENT (GCExtend) × [12.0] LATIN CAPITAL LETTER A (Upper) ÷ [0.3]
-÷ 0300 × 00A0 ÷	#  ÷ [0.2] COMBINING GRAVE ACCENT (GCExtend) × [12.0] NO-BREAK SPACE (OLetter) ÷ [0.3]
-÷ 0300 × 0030 ÷	#  ÷ [0.2] COMBINING GRAVE ACCENT (GCExtend) × [12.0] DIGIT ZERO (Numeric) ÷ [0.3]
-÷ 0300 × 002E ÷	#  ÷ [0.2] COMBINING GRAVE ACCENT (GCExtend) × [12.0] FULL STOP (ATerm) ÷ [0.3]
-÷ 0300 × 0021 ÷	#  ÷ [0.2] COMBINING GRAVE ACCENT (GCExtend) × [12.0] EXCLAMATION MARK (STerm) ÷ [0.3]
-÷ 0300 × 0022 ÷	#  ÷ [0.2] COMBINING GRAVE ACCENT (GCExtend) × [12.0] QUOTATION MARK (Close) ÷ [0.3]
-÷ 00AD × 0023 ÷	#  ÷ [0.2] SOFT HYPHEN (GCControl_Format) × [12.0] NUMBER SIGN (Other) ÷ [0.3]
-÷ 00AD × 0001 ÷	#  ÷ [0.2] SOFT HYPHEN (GCControl_Format) × [12.0] <START OF HEADING> (GCControl) ÷ [0.3]
-÷ 00AD × 0300 ÷	#  ÷ [0.2] SOFT HYPHEN (GCControl_Format) × [5.0] COMBINING GRAVE ACCENT (GCExtend) ÷ [0.3]
-÷ 00AD × 00AD ÷	#  ÷ [0.2] SOFT HYPHEN (GCControl_Format) × [5.0] SOFT HYPHEN (GCControl_Format) ÷ [0.3]
-÷ 00AD × 000A ÷	#  ÷ [0.2] SOFT HYPHEN (GCControl_Format) × [12.0] <LINE FEED (LF)> (GCLF_Sep) ÷ [0.3]
-÷ 00AD × 000D ÷	#  ÷ [0.2] SOFT HYPHEN (GCControl_Format) × [12.0] <CARRIAGE RETURN (CR)> (GCCR_Sep) ÷ [0.3]
-÷ 00AD × 0085 ÷	#  ÷ [0.2] SOFT HYPHEN (GCControl_Format) × [12.0] <NEXT LINE (NEL)> (GCControl_Sep) ÷ [0.3]
-÷ 00AD × 0009 ÷	#  ÷ [0.2] SOFT HYPHEN (GCControl_Format) × [12.0] <CHARACTER TABULATION> (GCControl_Sp) ÷ [0.3]
-÷ 00AD × 0020 ÷	#  ÷ [0.2] SOFT HYPHEN (GCControl_Format) × [12.0] SPACE (Sp) ÷ [0.3]
-÷ 00AD × 0061 ÷	#  ÷ [0.2] SOFT HYPHEN (GCControl_Format) × [12.0] LATIN SMALL LETTER A (Lower) ÷ [0.3]
-÷ 00AD × 0041 ÷	#  ÷ [0.2] SOFT HYPHEN (GCControl_Format) × [12.0] LATIN CAPITAL LETTER A (Upper) ÷ [0.3]
-÷ 00AD × 00A0 ÷	#  ÷ [0.2] SOFT HYPHEN (GCControl_Format) × [12.0] NO-BREAK SPACE (OLetter) ÷ [0.3]
-÷ 00AD × 0030 ÷	#  ÷ [0.2] SOFT HYPHEN (GCControl_Format) × [12.0] DIGIT ZERO (Numeric) ÷ [0.3]
-÷ 00AD × 002E ÷	#  ÷ [0.2] SOFT HYPHEN (GCControl_Format) × [12.0] FULL STOP (ATerm) ÷ [0.3]
-÷ 00AD × 0021 ÷	#  ÷ [0.2] SOFT HYPHEN (GCControl_Format) × [12.0] EXCLAMATION MARK (STerm) ÷ [0.3]
-÷ 00AD × 0022 ÷	#  ÷ [0.2] SOFT HYPHEN (GCControl_Format) × [12.0] QUOTATION MARK (Close) ÷ [0.3]
-÷ 000A ÷ 0023 ÷	#  ÷ [0.2] <LINE FEED (LF)> (GCLF_Sep) ÷ [4.0] NUMBER SIGN (Other) ÷ [0.3]
-÷ 000A ÷ 0001 ÷	#  ÷ [0.2] <LINE FEED (LF)> (GCLF_Sep) ÷ [4.0] <START OF HEADING> (GCControl) ÷ [0.3]
-÷ 000A ÷ 0300 ÷	#  ÷ [0.2] <LINE FEED (LF)> (GCLF_Sep) ÷ [4.0] COMBINING GRAVE ACCENT (GCExtend) ÷ [0.3]
-÷ 000A ÷ 00AD ÷	#  ÷ [0.2] <LINE FEED (LF)> (GCLF_Sep) ÷ [4.0] SOFT HYPHEN (GCControl_Format) ÷ [0.3]
-÷ 000A ÷ 000A ÷	#  ÷ [0.2] <LINE FEED (LF)> (GCLF_Sep) ÷ [4.0] <LINE FEED (LF)> (GCLF_Sep) ÷ [0.3]
-÷ 000A ÷ 000D ÷	#  ÷ [0.2] <LINE FEED (LF)> (GCLF_Sep) ÷ [4.0] <CARRIAGE RETURN (CR)> (GCCR_Sep) ÷ [0.3]
-÷ 000A ÷ 0085 ÷	#  ÷ [0.2] <LINE FEED (LF)> (GCLF_Sep) ÷ [4.0] <NEXT LINE (NEL)> (GCControl_Sep) ÷ [0.3]
-÷ 000A ÷ 0009 ÷	#  ÷ [0.2] <LINE FEED (LF)> (GCLF_Sep) ÷ [4.0] <CHARACTER TABULATION> (GCControl_Sp) ÷ [0.3]
-÷ 000A ÷ 0020 ÷	#  ÷ [0.2] <LINE FEED (LF)> (GCLF_Sep) ÷ [4.0] SPACE (Sp) ÷ [0.3]
-÷ 000A ÷ 0061 ÷	#  ÷ [0.2] <LINE FEED (LF)> (GCLF_Sep) ÷ [4.0] LATIN SMALL LETTER A (Lower) ÷ [0.3]
-÷ 000A ÷ 0041 ÷	#  ÷ [0.2] <LINE FEED (LF)> (GCLF_Sep) ÷ [4.0] LATIN CAPITAL LETTER A (Upper) ÷ [0.3]
-÷ 000A ÷ 00A0 ÷	#  ÷ [0.2] <LINE FEED (LF)> (GCLF_Sep) ÷ [4.0] NO-BREAK SPACE (OLetter) ÷ [0.3]
-÷ 000A ÷ 0030 ÷	#  ÷ [0.2] <LINE FEED (LF)> (GCLF_Sep) ÷ [4.0] DIGIT ZERO (Numeric) ÷ [0.3]
-÷ 000A ÷ 002E ÷	#  ÷ [0.2] <LINE FEED (LF)> (GCLF_Sep) ÷ [4.0] FULL STOP (ATerm) ÷ [0.3]
-÷ 000A ÷ 0021 ÷	#  ÷ [0.2] <LINE FEED (LF)> (GCLF_Sep) ÷ [4.0] EXCLAMATION MARK (STerm) ÷ [0.3]
-÷ 000A ÷ 0022 ÷	#  ÷ [0.2] <LINE FEED (LF)> (GCLF_Sep) ÷ [4.0] QUOTATION MARK (Close) ÷ [0.3]
-÷ 000D ÷ 0023 ÷	#  ÷ [0.2] <CARRIAGE RETURN (CR)> (GCCR_Sep) ÷ [4.0] NUMBER SIGN (Other) ÷ [0.3]
-÷ 000D ÷ 0001 ÷	#  ÷ [0.2] <CARRIAGE RETURN (CR)> (GCCR_Sep) ÷ [4.0] <START OF HEADING> (GCControl) ÷ [0.3]
-÷ 000D ÷ 0300 ÷	#  ÷ [0.2] <CARRIAGE RETURN (CR)> (GCCR_Sep) ÷ [4.0] COMBINING GRAVE ACCENT (GCExtend) ÷ [0.3]
-÷ 000D ÷ 00AD ÷	#  ÷ [0.2] <CARRIAGE RETURN (CR)> (GCCR_Sep) ÷ [4.0] SOFT HYPHEN (GCControl_Format) ÷ [0.3]
-÷ 000D × 000A ÷	#  ÷ [0.2] <CARRIAGE RETURN (CR)> (GCCR_Sep) × [3.0] <LINE FEED (LF)> (GCLF_Sep) ÷ [0.3]
-÷ 000D ÷ 000D ÷	#  ÷ [0.2] <CARRIAGE RETURN (CR)> (GCCR_Sep) ÷ [4.0] <CARRIAGE RETURN (CR)> (GCCR_Sep) ÷ [0.3]
-÷ 000D ÷ 0085 ÷	#  ÷ [0.2] <CARRIAGE RETURN (CR)> (GCCR_Sep) ÷ [4.0] <NEXT LINE (NEL)> (GCControl_Sep) ÷ [0.3]
-÷ 000D ÷ 0009 ÷	#  ÷ [0.2] <CARRIAGE RETURN (CR)> (GCCR_Sep) ÷ [4.0] <CHARACTER TABULATION> (GCControl_Sp) ÷ [0.3]
-÷ 000D ÷ 0020 ÷	#  ÷ [0.2] <CARRIAGE RETURN (CR)> (GCCR_Sep) ÷ [4.0] SPACE (Sp) ÷ [0.3]
-÷ 000D ÷ 0061 ÷	#  ÷ [0.2] <CARRIAGE RETURN (CR)> (GCCR_Sep) ÷ [4.0] LATIN SMALL LETTER A (Lower) ÷ [0.3]
-÷ 000D ÷ 0041 ÷	#  ÷ [0.2] <CARRIAGE RETURN (CR)> (GCCR_Sep) ÷ [4.0] LATIN CAPITAL LETTER A (Upper) ÷ [0.3]
-÷ 000D ÷ 00A0 ÷	#  ÷ [0.2] <CARRIAGE RETURN (CR)> (GCCR_Sep) ÷ [4.0] NO-BREAK SPACE (OLetter) ÷ [0.3]
-÷ 000D ÷ 0030 ÷	#  ÷ [0.2] <CARRIAGE RETURN (CR)> (GCCR_Sep) ÷ [4.0] DIGIT ZERO (Numeric) ÷ [0.3]
-÷ 000D ÷ 002E ÷	#  ÷ [0.2] <CARRIAGE RETURN (CR)> (GCCR_Sep) ÷ [4.0] FULL STOP (ATerm) ÷ [0.3]
-÷ 000D ÷ 0021 ÷	#  ÷ [0.2] <CARRIAGE RETURN (CR)> (GCCR_Sep) ÷ [4.0] EXCLAMATION MARK (STerm) ÷ [0.3]
-÷ 000D ÷ 0022 ÷	#  ÷ [0.2] <CARRIAGE RETURN (CR)> (GCCR_Sep) ÷ [4.0] QUOTATION MARK (Close) ÷ [0.3]
-÷ 0085 ÷ 0023 ÷	#  ÷ [0.2] <NEXT LINE (NEL)> (GCControl_Sep) ÷ [4.0] NUMBER SIGN (Other) ÷ [0.3]
-÷ 0085 ÷ 0001 ÷	#  ÷ [0.2] <NEXT LINE (NEL)> (GCControl_Sep) ÷ [4.0] <START OF HEADING> (GCControl) ÷ [0.3]
-÷ 0085 ÷ 0300 ÷	#  ÷ [0.2] <NEXT LINE (NEL)> (GCControl_Sep) ÷ [4.0] COMBINING GRAVE ACCENT (GCExtend) ÷ [0.3]
-÷ 0085 ÷ 00AD ÷	#  ÷ [0.2] <NEXT LINE (NEL)> (GCControl_Sep) ÷ [4.0] SOFT HYPHEN (GCControl_Format) ÷ [0.3]
-÷ 0085 ÷ 000A ÷	#  ÷ [0.2] <NEXT LINE (NEL)> (GCControl_Sep) ÷ [4.0] <LINE FEED (LF)> (GCLF_Sep) ÷ [0.3]
-÷ 0085 ÷ 000D ÷	#  ÷ [0.2] <NEXT LINE (NEL)> (GCControl_Sep) ÷ [4.0] <CARRIAGE RETURN (CR)> (GCCR_Sep) ÷ [0.3]
-÷ 0085 ÷ 0085 ÷	#  ÷ [0.2] <NEXT LINE (NEL)> (GCControl_Sep) ÷ [4.0] <NEXT LINE (NEL)> (GCControl_Sep) ÷ [0.3]
-÷ 0085 ÷ 0009 ÷	#  ÷ [0.2] <NEXT LINE (NEL)> (GCControl_Sep) ÷ [4.0] <CHARACTER TABULATION> (GCControl_Sp) ÷ [0.3]
-÷ 0085 ÷ 0020 ÷	#  ÷ [0.2] <NEXT LINE (NEL)> (GCControl_Sep) ÷ [4.0] SPACE (Sp) ÷ [0.3]
-÷ 0085 ÷ 0061 ÷	#  ÷ [0.2] <NEXT LINE (NEL)> (GCControl_Sep) ÷ [4.0] LATIN SMALL LETTER A (Lower) ÷ [0.3]
-÷ 0085 ÷ 0041 ÷	#  ÷ [0.2] <NEXT LINE (NEL)> (GCControl_Sep) ÷ [4.0] LATIN CAPITAL LETTER A (Upper) ÷ [0.3]
-÷ 0085 ÷ 00A0 ÷	#  ÷ [0.2] <NEXT LINE (NEL)> (GCControl_Sep) ÷ [4.0] NO-BREAK SPACE (OLetter) ÷ [0.3]
-÷ 0085 ÷ 0030 ÷	#  ÷ [0.2] <NEXT LINE (NEL)> (GCControl_Sep) ÷ [4.0] DIGIT ZERO (Numeric) ÷ [0.3]
-÷ 0085 ÷ 002E ÷	#  ÷ [0.2] <NEXT LINE (NEL)> (GCControl_Sep) ÷ [4.0] FULL STOP (ATerm) ÷ [0.3]
-÷ 0085 ÷ 0021 ÷	#  ÷ [0.2] <NEXT LINE (NEL)> (GCControl_Sep) ÷ [4.0] EXCLAMATION MARK (STerm) ÷ [0.3]
-÷ 0085 ÷ 0022 ÷	#  ÷ [0.2] <NEXT LINE (NEL)> (GCControl_Sep) ÷ [4.0] QUOTATION MARK (Close) ÷ [0.3]
-÷ 0009 × 0023 ÷	#  ÷ [0.2] <CHARACTER TABULATION> (GCControl_Sp) × [12.0] NUMBER SIGN (Other) ÷ [0.3]
-÷ 0009 × 0001 ÷	#  ÷ [0.2] <CHARACTER TABULATION> (GCControl_Sp) × [12.0] <START OF HEADING> (GCControl) ÷ [0.3]
-÷ 0009 × 0300 ÷	#  ÷ [0.2] <CHARACTER TABULATION> (GCControl_Sp) × [5.0] COMBINING GRAVE ACCENT (GCExtend) ÷ [0.3]
-÷ 0009 × 00AD ÷	#  ÷ [0.2] <CHARACTER TABULATION> (GCControl_Sp) × [5.0] SOFT HYPHEN (GCControl_Format) ÷ [0.3]
-÷ 0009 × 000A ÷	#  ÷ [0.2] <CHARACTER TABULATION> (GCControl_Sp) × [12.0] <LINE FEED (LF)> (GCLF_Sep) ÷ [0.3]
-÷ 0009 × 000D ÷	#  ÷ [0.2] <CHARACTER TABULATION> (GCControl_Sp) × [12.0] <CARRIAGE RETURN (CR)> (GCCR_Sep) ÷ [0.3]
-÷ 0009 × 0085 ÷	#  ÷ [0.2] <CHARACTER TABULATION> (GCControl_Sp) × [12.0] <NEXT LINE (NEL)> (GCControl_Sep) ÷ [0.3]
-÷ 0009 × 0009 ÷	#  ÷ [0.2] <CHARACTER TABULATION> (GCControl_Sp) × [12.0] <CHARACTER TABULATION> (GCControl_Sp) ÷ [0.3]
-÷ 0009 × 0020 ÷	#  ÷ [0.2] <CHARACTER TABULATION> (GCControl_Sp) × [12.0] SPACE (Sp) ÷ [0.3]
-÷ 0009 × 0061 ÷	#  ÷ [0.2] <CHARACTER TABULATION> (GCControl_Sp) × [12.0] LATIN SMALL LETTER A (Lower) ÷ [0.3]
-÷ 0009 × 0041 ÷	#  ÷ [0.2] <CHARACTER TABULATION> (GCControl_Sp) × [12.0] LATIN CAPITAL LETTER A (Upper) ÷ [0.3]
-÷ 0009 × 00A0 ÷	#  ÷ [0.2] <CHARACTER TABULATION> (GCControl_Sp) × [12.0] NO-BREAK SPACE (OLetter) ÷ [0.3]
-÷ 0009 × 0030 ÷	#  ÷ [0.2] <CHARACTER TABULATION> (GCControl_Sp) × [12.0] DIGIT ZERO (Numeric) ÷ [0.3]
-÷ 0009 × 002E ÷	#  ÷ [0.2] <CHARACTER TABULATION> (GCControl_Sp) × [12.0] FULL STOP (ATerm) ÷ [0.3]
-÷ 0009 × 0021 ÷	#  ÷ [0.2] <CHARACTER TABULATION> (GCControl_Sp) × [12.0] EXCLAMATION MARK (STerm) ÷ [0.3]
-÷ 0009 × 0022 ÷	#  ÷ [0.2] <CHARACTER TABULATION> (GCControl_Sp) × [12.0] QUOTATION MARK (Close) ÷ [0.3]
-÷ 0020 × 0023 ÷	#  ÷ [0.2] SPACE (Sp) × [12.0] NUMBER SIGN (Other) ÷ [0.3]
-÷ 0020 × 0001 ÷	#  ÷ [0.2] SPACE (Sp) × [12.0] <START OF HEADING> (GCControl) ÷ [0.3]
-÷ 0020 × 0300 ÷	#  ÷ [0.2] SPACE (Sp) × [5.0] COMBINING GRAVE ACCENT (GCExtend) ÷ [0.3]
-÷ 0020 × 00AD ÷	#  ÷ [0.2] SPACE (Sp) × [5.0] SOFT HYPHEN (GCControl_Format) ÷ [0.3]
-÷ 0020 × 000A ÷	#  ÷ [0.2] SPACE (Sp) × [12.0] <LINE FEED (LF)> (GCLF_Sep) ÷ [0.3]
-÷ 0020 × 000D ÷	#  ÷ [0.2] SPACE (Sp) × [12.0] <CARRIAGE RETURN (CR)> (GCCR_Sep) ÷ [0.3]
-÷ 0020 × 0085 ÷	#  ÷ [0.2] SPACE (Sp) × [12.0] <NEXT LINE (NEL)> (GCControl_Sep) ÷ [0.3]
-÷ 0020 × 0009 ÷	#  ÷ [0.2] SPACE (Sp) × [12.0] <CHARACTER TABULATION> (GCControl_Sp) ÷ [0.3]
-÷ 0020 × 0020 ÷	#  ÷ [0.2] SPACE (Sp) × [12.0] SPACE (Sp) ÷ [0.3]
-÷ 0020 × 0061 ÷	#  ÷ [0.2] SPACE (Sp) × [12.0] LATIN SMALL LETTER A (Lower) ÷ [0.3]
-÷ 0020 × 0041 ÷	#  ÷ [0.2] SPACE (Sp) × [12.0] LATIN CAPITAL LETTER A (Upper) ÷ [0.3]
-÷ 0020 × 00A0 ÷	#  ÷ [0.2] SPACE (Sp) × [12.0] NO-BREAK SPACE (OLetter) ÷ [0.3]
-÷ 0020 × 0030 ÷	#  ÷ [0.2] SPACE (Sp) × [12.0] DIGIT ZERO (Numeric) ÷ [0.3]
-÷ 0020 × 002E ÷	#  ÷ [0.2] SPACE (Sp) × [12.0] FULL STOP (ATerm) ÷ [0.3]
-÷ 0020 × 0021 ÷	#  ÷ [0.2] SPACE (Sp) × [12.0] EXCLAMATION MARK (STerm) ÷ [0.3]
-÷ 0020 × 0022 ÷	#  ÷ [0.2] SPACE (Sp) × [12.0] QUOTATION MARK (Close) ÷ [0.3]
-÷ 0061 × 0023 ÷	#  ÷ [0.2] LATIN SMALL LETTER A (Lower) × [12.0] NUMBER SIGN (Other) ÷ [0.3]
-÷ 0061 × 0001 ÷	#  ÷ [0.2] LATIN SMALL LETTER A (Lower) × [12.0] <START OF HEADING> (GCControl) ÷ [0.3]
-÷ 0061 × 0300 ÷	#  ÷ [0.2] LATIN SMALL LETTER A (Lower) × [5.0] COMBINING GRAVE ACCENT (GCExtend) ÷ [0.3]
-÷ 0061 × 00AD ÷	#  ÷ [0.2] LATIN SMALL LETTER A (Lower) × [5.0] SOFT HYPHEN (GCControl_Format) ÷ [0.3]
-÷ 0061 × 000A ÷	#  ÷ [0.2] LATIN SMALL LETTER A (Lower) × [12.0] <LINE FEED (LF)> (GCLF_Sep) ÷ [0.3]
-÷ 0061 × 000D ÷	#  ÷ [0.2] LATIN SMALL LETTER A (Lower) × [12.0] <CARRIAGE RETURN (CR)> (GCCR_Sep) ÷ [0.3]
-÷ 0061 × 0085 ÷	#  ÷ [0.2] LATIN SMALL LETTER A (Lower) × [12.0] <NEXT LINE (NEL)> (GCControl_Sep) ÷ [0.3]
-÷ 0061 × 0009 ÷	#  ÷ [0.2] LATIN SMALL LETTER A (Lower) × [12.0] <CHARACTER TABULATION> (GCControl_Sp) ÷ [0.3]
-÷ 0061 × 0020 ÷	#  ÷ [0.2] LATIN SMALL LETTER A (Lower) × [12.0] SPACE (Sp) ÷ [0.3]
-÷ 0061 × 0061 ÷	#  ÷ [0.2] LATIN SMALL LETTER A (Lower) × [12.0] LATIN SMALL LETTER A (Lower) ÷ [0.3]
-÷ 0061 × 0041 ÷	#  ÷ [0.2] LATIN SMALL LETTER A (Lower) × [12.0] LATIN CAPITAL LETTER A (Upper) ÷ [0.3]
-÷ 0061 × 00A0 ÷	#  ÷ [0.2] LATIN SMALL LETTER A (Lower) × [12.0] NO-BREAK SPACE (OLetter) ÷ [0.3]
-÷ 0061 × 0030 ÷	#  ÷ [0.2] LATIN SMALL LETTER A (Lower) × [12.0] DIGIT ZERO (Numeric) ÷ [0.3]
-÷ 0061 × 002E ÷	#  ÷ [0.2] LATIN SMALL LETTER A (Lower) × [12.0] FULL STOP (ATerm) ÷ [0.3]
-÷ 0061 × 0021 ÷	#  ÷ [0.2] LATIN SMALL LETTER A (Lower) × [12.0] EXCLAMATION MARK (STerm) ÷ [0.3]
-÷ 0061 × 0022 ÷	#  ÷ [0.2] LATIN SMALL LETTER A (Lower) × [12.0] QUOTATION MARK (Close) ÷ [0.3]
-÷ 0041 × 0023 ÷	#  ÷ [0.2] LATIN CAPITAL LETTER A (Upper) × [12.0] NUMBER SIGN (Other) ÷ [0.3]
-÷ 0041 × 0001 ÷	#  ÷ [0.2] LATIN CAPITAL LETTER A (Upper) × [12.0] <START OF HEADING> (GCControl) ÷ [0.3]
-÷ 0041 × 0300 ÷	#  ÷ [0.2] LATIN CAPITAL LETTER A (Upper) × [5.0] COMBINING GRAVE ACCENT (GCExtend) ÷ [0.3]
-÷ 0041 × 00AD ÷	#  ÷ [0.2] LATIN CAPITAL LETTER A (Upper) × [5.0] SOFT HYPHEN (GCControl_Format) ÷ [0.3]
-÷ 0041 × 000A ÷	#  ÷ [0.2] LATIN CAPITAL LETTER A (Upper) × [12.0] <LINE FEED (LF)> (GCLF_Sep) ÷ [0.3]
-÷ 0041 × 000D ÷	#  ÷ [0.2] LATIN CAPITAL LETTER A (Upper) × [12.0] <CARRIAGE RETURN (CR)> (GCCR_Sep) ÷ [0.3]
-÷ 0041 × 0085 ÷	#  ÷ [0.2] LATIN CAPITAL LETTER A (Upper) × [12.0] <NEXT LINE (NEL)> (GCControl_Sep) ÷ [0.3]
-÷ 0041 × 0009 ÷	#  ÷ [0.2] LATIN CAPITAL LETTER A (Upper) × [12.0] <CHARACTER TABULATION> (GCControl_Sp) ÷ [0.3]
-÷ 0041 × 0020 ÷	#  ÷ [0.2] LATIN CAPITAL LETTER A (Upper) × [12.0] SPACE (Sp) ÷ [0.3]
-÷ 0041 × 0061 ÷	#  ÷ [0.2] LATIN CAPITAL LETTER A (Upper) × [12.0] LATIN SMALL LETTER A (Lower) ÷ [0.3]
-÷ 0041 × 0041 ÷	#  ÷ [0.2] LATIN CAPITAL LETTER A (Upper) × [12.0] LATIN CAPITAL LETTER A (Upper) ÷ [0.3]
-÷ 0041 × 00A0 ÷	#  ÷ [0.2] LATIN CAPITAL LETTER A (Upper) × [12.0] NO-BREAK SPACE (OLetter) ÷ [0.3]
-÷ 0041 × 0030 ÷	#  ÷ [0.2] LATIN CAPITAL LETTER A (Upper) × [12.0] DIGIT ZERO (Numeric) ÷ [0.3]
-÷ 0041 × 002E ÷	#  ÷ [0.2] LATIN CAPITAL LETTER A (Upper) × [12.0] FULL STOP (ATerm) ÷ [0.3]
-÷ 0041 × 0021 ÷	#  ÷ [0.2] LATIN CAPITAL LETTER A (Upper) × [12.0] EXCLAMATION MARK (STerm) ÷ [0.3]
-÷ 0041 × 0022 ÷	#  ÷ [0.2] LATIN CAPITAL LETTER A (Upper) × [12.0] QUOTATION MARK (Close) ÷ [0.3]
-÷ 00A0 × 0023 ÷	#  ÷ [0.2] NO-BREAK SPACE (OLetter) × [12.0] NUMBER SIGN (Other) ÷ [0.3]
-÷ 00A0 × 0001 ÷	#  ÷ [0.2] NO-BREAK SPACE (OLetter) × [12.0] <START OF HEADING> (GCControl) ÷ [0.3]
-÷ 00A0 × 0300 ÷	#  ÷ [0.2] NO-BREAK SPACE (OLetter) × [5.0] COMBINING GRAVE ACCENT (GCExtend) ÷ [0.3]
-÷ 00A0 × 00AD ÷	#  ÷ [0.2] NO-BREAK SPACE (OLetter) × [5.0] SOFT HYPHEN (GCControl_Format) ÷ [0.3]
-÷ 00A0 × 000A ÷	#  ÷ [0.2] NO-BREAK SPACE (OLetter) × [12.0] <LINE FEED (LF)> (GCLF_Sep) ÷ [0.3]
-÷ 00A0 × 000D ÷	#  ÷ [0.2] NO-BREAK SPACE (OLetter) × [12.0] <CARRIAGE RETURN (CR)> (GCCR_Sep) ÷ [0.3]
-÷ 00A0 × 0085 ÷	#  ÷ [0.2] NO-BREAK SPACE (OLetter) × [12.0] <NEXT LINE (NEL)> (GCControl_Sep) ÷ [0.3]
-÷ 00A0 × 0009 ÷	#  ÷ [0.2] NO-BREAK SPACE (OLetter) × [12.0] <CHARACTER TABULATION> (GCControl_Sp) ÷ [0.3]
-÷ 00A0 × 0020 ÷	#  ÷ [0.2] NO-BREAK SPACE (OLetter) × [12.0] SPACE (Sp) ÷ [0.3]
-÷ 00A0 × 0061 ÷	#  ÷ [0.2] NO-BREAK SPACE (OLetter) × [12.0] LATIN SMALL LETTER A (Lower) ÷ [0.3]
-÷ 00A0 × 0041 ÷	#  ÷ [0.2] NO-BREAK SPACE (OLetter) × [12.0] LATIN CAPITAL LETTER A (Upper) ÷ [0.3]
-÷ 00A0 × 00A0 ÷	#  ÷ [0.2] NO-BREAK SPACE (OLetter) × [12.0] NO-BREAK SPACE (OLetter) ÷ [0.3]
-÷ 00A0 × 0030 ÷	#  ÷ [0.2] NO-BREAK SPACE (OLetter) × [12.0] DIGIT ZERO (Numeric) ÷ [0.3]
-÷ 00A0 × 002E ÷	#  ÷ [0.2] NO-BREAK SPACE (OLetter) × [12.0] FULL STOP (ATerm) ÷ [0.3]
-÷ 00A0 × 0021 ÷	#  ÷ [0.2] NO-BREAK SPACE (OLetter) × [12.0] EXCLAMATION MARK (STerm) ÷ [0.3]
-÷ 00A0 × 0022 ÷	#  ÷ [0.2] NO-BREAK SPACE (OLetter) × [12.0] QUOTATION MARK (Close) ÷ [0.3]
-÷ 0030 × 0023 ÷	#  ÷ [0.2] DIGIT ZERO (Numeric) × [12.0] NUMBER SIGN (Other) ÷ [0.3]
-÷ 0030 × 0001 ÷	#  ÷ [0.2] DIGIT ZERO (Numeric) × [12.0] <START OF HEADING> (GCControl) ÷ [0.3]
-÷ 0030 × 0300 ÷	#  ÷ [0.2] DIGIT ZERO (Numeric) × [5.0] COMBINING GRAVE ACCENT (GCExtend) ÷ [0.3]
-÷ 0030 × 00AD ÷	#  ÷ [0.2] DIGIT ZERO (Numeric) × [5.0] SOFT HYPHEN (GCControl_Format) ÷ [0.3]
-÷ 0030 × 000A ÷	#  ÷ [0.2] DIGIT ZERO (Numeric) × [12.0] <LINE FEED (LF)> (GCLF_Sep) ÷ [0.3]
-÷ 0030 × 000D ÷	#  ÷ [0.2] DIGIT ZERO (Numeric) × [12.0] <CARRIAGE RETURN (CR)> (GCCR_Sep) ÷ [0.3]
-÷ 0030 × 0085 ÷	#  ÷ [0.2] DIGIT ZERO (Numeric) × [12.0] <NEXT LINE (NEL)> (GCControl_Sep) ÷ [0.3]
-÷ 0030 × 0009 ÷	#  ÷ [0.2] DIGIT ZERO (Numeric) × [12.0] <CHARACTER TABULATION> (GCControl_Sp) ÷ [0.3]
-÷ 0030 × 0020 ÷	#  ÷ [0.2] DIGIT ZERO (Numeric) × [12.0] SPACE (Sp) ÷ [0.3]
-÷ 0030 × 0061 ÷	#  ÷ [0.2] DIGIT ZERO (Numeric) × [12.0] LATIN SMALL LETTER A (Lower) ÷ [0.3]
-÷ 0030 × 0041 ÷	#  ÷ [0.2] DIGIT ZERO (Numeric) × [12.0] LATIN CAPITAL LETTER A (Upper) ÷ [0.3]
-÷ 0030 × 00A0 ÷	#  ÷ [0.2] DIGIT ZERO (Numeric) × [12.0] NO-BREAK SPACE (OLetter) ÷ [0.3]
-÷ 0030 × 0030 ÷	#  ÷ [0.2] DIGIT ZERO (Numeric) × [12.0] DIGIT ZERO (Numeric) ÷ [0.3]
-÷ 0030 × 002E ÷	#  ÷ [0.2] DIGIT ZERO (Numeric) × [12.0] FULL STOP (ATerm) ÷ [0.3]
-÷ 0030 × 0021 ÷	#  ÷ [0.2] DIGIT ZERO (Numeric) × [12.0] EXCLAMATION MARK (STerm) ÷ [0.3]
-÷ 0030 × 0022 ÷	#  ÷ [0.2] DIGIT ZERO (Numeric) × [12.0] QUOTATION MARK (Close) ÷ [0.3]
-÷ 002E ÷ 0023 ÷	#  ÷ [0.2] FULL STOP (ATerm) ÷ [11.0] NUMBER SIGN (Other) ÷ [0.3]
-÷ 002E ÷ 0001 ÷	#  ÷ [0.2] FULL STOP (ATerm) ÷ [11.0] <START OF HEADING> (GCControl) ÷ [0.3]
-÷ 002E × 0300 ÷	#  ÷ [0.2] FULL STOP (ATerm) × [5.0] COMBINING GRAVE ACCENT (GCExtend) ÷ [0.3]
-÷ 002E × 00AD ÷	#  ÷ [0.2] FULL STOP (ATerm) × [5.0] SOFT HYPHEN (GCControl_Format) ÷ [0.3]
-÷ 002E × 000A ÷	#  ÷ [0.2] FULL STOP (ATerm) × [9.0] <LINE FEED (LF)> (GCLF_Sep) ÷ [0.3]
-÷ 002E × 000D ÷	#  ÷ [0.2] FULL STOP (ATerm) × [9.0] <CARRIAGE RETURN (CR)> (GCCR_Sep) ÷ [0.3]
-÷ 002E × 0085 ÷	#  ÷ [0.2] FULL STOP (ATerm) × [9.0] <NEXT LINE (NEL)> (GCControl_Sep) ÷ [0.3]
-÷ 002E × 0009 ÷	#  ÷ [0.2] FULL STOP (ATerm) × [9.0] <CHARACTER TABULATION> (GCControl_Sp) ÷ [0.3]
-÷ 002E × 0020 ÷	#  ÷ [0.2] FULL STOP (ATerm) × [9.0] SPACE (Sp) ÷ [0.3]
-÷ 002E × 0061 ÷	#  ÷ [0.2] FULL STOP (ATerm) × [8.0] LATIN SMALL LETTER A (Lower) ÷ [0.3]
-÷ 002E ÷ 0041 ÷	#  ÷ [0.2] FULL STOP (ATerm) ÷ [11.0] LATIN CAPITAL LETTER A (Upper) ÷ [0.3]
-÷ 002E ÷ 00A0 ÷	#  ÷ [0.2] FULL STOP (ATerm) ÷ [11.0] NO-BREAK SPACE (OLetter) ÷ [0.3]
-÷ 002E × 0030 ÷	#  ÷ [0.2] FULL STOP (ATerm) × [6.0] DIGIT ZERO (Numeric) ÷ [0.3]
-÷ 002E × 002E ÷	#  ÷ [0.2] FULL STOP (ATerm) × [8.1] FULL STOP (ATerm) ÷ [0.3]
-÷ 002E × 0021 ÷	#  ÷ [0.2] FULL STOP (ATerm) × [8.1] EXCLAMATION MARK (STerm) ÷ [0.3]
-÷ 002E × 0022 ÷	#  ÷ [0.2] FULL STOP (ATerm) × [9.0] QUOTATION MARK (Close) ÷ [0.3]
-÷ 0021 ÷ 0023 ÷	#  ÷ [0.2] EXCLAMATION MARK (STerm) ÷ [11.0] NUMBER SIGN (Other) ÷ [0.3]
-÷ 0021 ÷ 0001 ÷	#  ÷ [0.2] EXCLAMATION MARK (STerm) ÷ [11.0] <START OF HEADING> (GCControl) ÷ [0.3]
-÷ 0021 × 0300 ÷	#  ÷ [0.2] EXCLAMATION MARK (STerm) × [5.0] COMBINING GRAVE ACCENT (GCExtend) ÷ [0.3]
-÷ 0021 × 00AD ÷	#  ÷ [0.2] EXCLAMATION MARK (STerm) × [5.0] SOFT HYPHEN (GCControl_Format) ÷ [0.3]
-÷ 0021 × 000A ÷	#  ÷ [0.2] EXCLAMATION MARK (STerm) × [9.0] <LINE FEED (LF)> (GCLF_Sep) ÷ [0.3]
-÷ 0021 × 000D ÷	#  ÷ [0.2] EXCLAMATION MARK (STerm) × [9.0] <CARRIAGE RETURN (CR)> (GCCR_Sep) ÷ [0.3]
-÷ 0021 × 0085 ÷	#  ÷ [0.2] EXCLAMATION MARK (STerm) × [9.0] <NEXT LINE (NEL)> (GCControl_Sep) ÷ [0.3]
-÷ 0021 × 0009 ÷	#  ÷ [0.2] EXCLAMATION MARK (STerm) × [9.0] <CHARACTER TABULATION> (GCControl_Sp) ÷ [0.3]
-÷ 0021 × 0020 ÷	#  ÷ [0.2] EXCLAMATION MARK (STerm) × [9.0] SPACE (Sp) ÷ [0.3]
-÷ 0021 ÷ 0061 ÷	#  ÷ [0.2] EXCLAMATION MARK (STerm) ÷ [11.0] LATIN SMALL LETTER A (Lower) ÷ [0.3]
-÷ 0021 ÷ 0041 ÷	#  ÷ [0.2] EXCLAMATION MARK (STerm) ÷ [11.0] LATIN CAPITAL LETTER A (Upper) ÷ [0.3]
-÷ 0021 ÷ 00A0 ÷	#  ÷ [0.2] EXCLAMATION MARK (STerm) ÷ [11.0] NO-BREAK SPACE (OLetter) ÷ [0.3]
-÷ 0021 ÷ 0030 ÷	#  ÷ [0.2] EXCLAMATION MARK (STerm) ÷ [11.0] DIGIT ZERO (Numeric) ÷ [0.3]
-÷ 0021 × 002E ÷	#  ÷ [0.2] EXCLAMATION MARK (STerm) × [8.1] FULL STOP (ATerm) ÷ [0.3]
-÷ 0021 × 0021 ÷	#  ÷ [0.2] EXCLAMATION MARK (STerm) × [8.1] EXCLAMATION MARK (STerm) ÷ [0.3]
-÷ 0021 × 0022 ÷	#  ÷ [0.2] EXCLAMATION MARK (STerm) × [9.0] QUOTATION MARK (Close) ÷ [0.3]
-÷ 0022 × 0023 ÷	#  ÷ [0.2] QUOTATION MARK (Close) × [12.0] NUMBER SIGN (Other) ÷ [0.3]
-÷ 0022 × 0001 ÷	#  ÷ [0.2] QUOTATION MARK (Close) × [12.0] <START OF HEADING> (GCControl) ÷ [0.3]
-÷ 0022 × 0300 ÷	#  ÷ [0.2] QUOTATION MARK (Close) × [5.0] COMBINING GRAVE ACCENT (GCExtend) ÷ [0.3]
-÷ 0022 × 00AD ÷	#  ÷ [0.2] QUOTATION MARK (Close) × [5.0] SOFT HYPHEN (GCControl_Format) ÷ [0.3]
-÷ 0022 × 000A ÷	#  ÷ [0.2] QUOTATION MARK (Close) × [12.0] <LINE FEED (LF)> (GCLF_Sep) ÷ [0.3]
-÷ 0022 × 000D ÷	#  ÷ [0.2] QUOTATION MARK (Close) × [12.0] <CARRIAGE RETURN (CR)> (GCCR_Sep) ÷ [0.3]
-÷ 0022 × 0085 ÷	#  ÷ [0.2] QUOTATION MARK (Close) × [12.0] <NEXT LINE (NEL)> (GCControl_Sep) ÷ [0.3]
-÷ 0022 × 0009 ÷	#  ÷ [0.2] QUOTATION MARK (Close) × [12.0] <CHARACTER TABULATION> (GCControl_Sp) ÷ [0.3]
-÷ 0022 × 0020 ÷	#  ÷ [0.2] QUOTATION MARK (Close) × [12.0] SPACE (Sp) ÷ [0.3]
-÷ 0022 × 0061 ÷	#  ÷ [0.2] QUOTATION MARK (Close) × [12.0] LATIN SMALL LETTER A (Lower) ÷ [0.3]
-÷ 0022 × 0041 ÷	#  ÷ [0.2] QUOTATION MARK (Close) × [12.0] LATIN CAPITAL LETTER A (Upper) ÷ [0.3]
-÷ 0022 × 00A0 ÷	#  ÷ [0.2] QUOTATION MARK (Close) × [12.0] NO-BREAK SPACE (OLetter) ÷ [0.3]
-÷ 0022 × 0030 ÷	#  ÷ [0.2] QUOTATION MARK (Close) × [12.0] DIGIT ZERO (Numeric) ÷ [0.3]
-÷ 0022 × 002E ÷	#  ÷ [0.2] QUOTATION MARK (Close) × [12.0] FULL STOP (ATerm) ÷ [0.3]
-÷ 0022 × 0021 ÷	#  ÷ [0.2] QUOTATION MARK (Close) × [12.0] EXCLAMATION MARK (STerm) ÷ [0.3]
-÷ 0022 × 0022 ÷	#  ÷ [0.2] QUOTATION MARK (Close) × [12.0] QUOTATION MARK (Close) ÷ [0.3]
-÷ 0028 × 0022 × 0047 × 006F × 002E × 0022 × 0029 × 0020 ÷ 0028 × 0048 × 0065 × 0020 × 0064 × 0069 × 0064 × 002E × 0029 ÷	#  ÷ [0.2] LEFT PARENTHESIS (Close) × [12.0] QUOTATION MARK (Close) × [12.0] LATIN CAPITAL LETTER G (Upper) × [12.0] LATIN SMALL LETTER O (Lower) × [12.0] FULL STOP (ATerm) × [9.0] QUOTATION MARK (Close) × [9.0] RIGHT PARENTHESIS (Close) × [9.0] SPACE (Sp) ÷ [11.0] LEFT PARENTHESIS (Close) × [12.0] LATIN CAPITAL LETTER H (Upper) × [12.0] LATIN SMALL LETTER E (Lower) × [12.0] SPACE (Sp) × [12.0] LATIN SMALL LETTER D (Lower) × [12.0] LATIN SMALL LETTER I (Lower) × [12.0] LATIN SMALL LETTER D (Lower) × [12.0] FULL STOP (ATerm) × [9.0] RIGHT PARENTHESIS (Close) ÷ [0.3]
-÷ 0028 × 201C × 0047 × 006F × 003F × 201D × 0029 × 0020 ÷ 0028 × 0048 × 0065 × 0020 × 0064 × 0069 × 0064 × 002E × 0029 ÷	#  ÷ [0.2] LEFT PARENTHESIS (Close) × [12.0] LEFT DOUBLE QUOTATION MARK (Close) × [12.0] LATIN CAPITAL LETTER G (Upper) × [12.0] LATIN SMALL LETTER O (Lower) × [12.0] QUESTION MARK (STerm) × [9.0] RIGHT DOUBLE QUOTATION MARK (Close) × [9.0] RIGHT PARENTHESIS (Close) × [9.0] SPACE (Sp) ÷ [11.0] LEFT PARENTHESIS (Close) × [12.0] LATIN CAPITAL LETTER H (Upper) × [12.0] LATIN SMALL LETTER E (Lower) × [12.0] SPACE (Sp) × [12.0] LATIN SMALL LETTER D (Lower) × [12.0] LATIN SMALL LETTER I (Lower) × [12.0] LATIN SMALL LETTER D (Lower) × [12.0] FULL STOP (ATerm) × [9.0] RIGHT PARENTHESIS (Close) ÷ [0.3]
-÷ 0055 × 002E × 0053 × 002E × 0041 × 0300 × 002E × 0020 × 0069 × 0073 ÷	#  ÷ [0.2] LATIN CAPITAL LETTER U (Upper) × [12.0] FULL STOP (ATerm) × [7.0] LATIN CAPITAL LETTER S (Upper) × [12.0] FULL STOP (ATerm) × [7.0] LATIN CAPITAL LETTER A (Upper) × [5.0] COMBINING GRAVE ACCENT (GCExtend) × [12.0] FULL STOP (ATerm) × [8.0] SPACE (Sp) × [8.0] LATIN SMALL LETTER I (Lower) × [12.0] LATIN SMALL LETTER S (Lower) ÷ [0.3]
-÷ 0055 × 002E × 0053 × 002E × 0041 × 0300 × 003F × 0020 ÷ 0048 × 0065 ÷	#  ÷ [0.2] LATIN CAPITAL LETTER U (Upper) × [12.0] FULL STOP (ATerm) × [7.0] LATIN CAPITAL LETTER S (Upper) × [12.0] FULL STOP (ATerm) × [7.0] LATIN CAPITAL LETTER A (Upper) × [5.0] COMBINING GRAVE ACCENT (GCExtend) × [12.0] QUESTION MARK (STerm) × [9.0] SPACE (Sp) ÷ [11.0] LATIN CAPITAL LETTER H (Upper) × [12.0] LATIN SMALL LETTER E (Lower) ÷ [0.3]
-÷ 0055 × 002E × 0053 × 002E × 0041 × 0300 × 002E ÷	#  ÷ [0.2] LATIN CAPITAL LETTER U (Upper) × [12.0] FULL STOP (ATerm) × [7.0] LATIN CAPITAL LETTER S (Upper) × [12.0] FULL STOP (ATerm) × [7.0] LATIN CAPITAL LETTER A (Upper) × [5.0] COMBINING GRAVE ACCENT (GCExtend) × [12.0] FULL STOP (ATerm) ÷ [0.3]
-÷ 0033 × 002E × 0034 ÷	#  ÷ [0.2] DIGIT THREE (Numeric) × [12.0] FULL STOP (ATerm) × [6.0] DIGIT FOUR (Numeric) ÷ [0.3]
-÷ 0063 × 002E × 0064 ÷	#  ÷ [0.2] LATIN SMALL LETTER C (Lower) × [12.0] FULL STOP (ATerm) × [8.0] LATIN SMALL LETTER D (Lower) ÷ [0.3]
-÷ 0065 × 0074 × 0063 × 002E × 0029 × 2019 ÷ 00A0 × 2018 × 0028 × 0074 × 0068 × 0065 ÷	#  ÷ [0.2] LATIN SMALL LETTER E (Lower) × [12.0] LATIN SMALL LETTER T (Lower) × [12.0] LATIN SMALL LETTER C (Lower) × [12.0] FULL STOP (ATerm) × [9.0] RIGHT PARENTHESIS (Close) × [9.0] RIGHT SINGLE QUOTATION MARK (Close) ÷ [11.0] NO-BREAK SPACE (OLetter) × [12.0] LEFT SINGLE QUOTATION MARK (Close) × [12.0] LEFT PARENTHESIS (Close) × [12.0] LATIN SMALL LETTER T (Lower) × [12.0] LATIN SMALL LETTER H (Lower) × [12.0] LATIN SMALL LETTER E (Lower) ÷ [0.3]
-÷ 0065 × 0074 × 0063 × 002E × 0029 × 2019 ÷ 00A0 × 2018 × 0028 × 0054 × 0068 × 0065 ÷	#  ÷ [0.2] LATIN SMALL LETTER E (Lower) × [12.0] LATIN SMALL LETTER T (Lower) × [12.0] LATIN SMALL LETTER C (Lower) × [12.0] FULL STOP (ATerm) × [9.0] RIGHT PARENTHESIS (Close) × [9.0] RIGHT SINGLE QUOTATION MARK (Close) ÷ [11.0] NO-BREAK SPACE (OLetter) × [12.0] LEFT SINGLE QUOTATION MARK (Close) × [12.0] LEFT PARENTHESIS (Close) × [12.0] LATIN CAPITAL LETTER T (Upper) × [12.0] LATIN SMALL LETTER H (Lower) × [12.0] LATIN SMALL LETTER E (Lower) ÷ [0.3]
-÷ 0074 × 0068 × 0065 × 0020 × 0072 × 0065 × 0073 × 0070 × 002E × 0020 × 006C × 0065 × 0061 × 0064 × 0065 × 0072 × 0073 × 0020 × 0061 × 0072 × 0065 ÷	#  ÷ [0.2] LATIN SMALL LETTER T (Lower) × [12.0] LATIN SMALL LETTER H (Lower) × [12.0] LATIN SMALL LETTER E (Lower) × [12.0] SPACE (Sp) × [12.0] LATIN SMALL LETTER R (Lower) × [12.0] LATIN SMALL LETTER E (Lower) × [12.0] LATIN SMALL LETTER S (Lower) × [12.0] LATIN SMALL LETTER P (Lower) × [12.0] FULL STOP (ATerm) × [8.0] SPACE (Sp) × [8.0] LATIN SMALL LETTER L (Lower) × [12.0] LATIN SMALL LETTER E (Lower) × [12.0] LATIN SMALL LETTER A (Lower) × [12.0] LATIN SMALL LETTER D (Lower) × [12.0] LATIN SMALL LETTER E (Lower) × [12.0] LATIN SMALL LETTER R (Lower) × [12.0] LATIN SMALL LETTER S (Lower) × [12.0] SPACE (Sp) × [12.0] LATIN SMALL LETTER A (Lower) × [12.0] LATIN SMALL LETTER R (Lower) × [12.0] LATIN SMALL LETTER E (Lower) ÷ [0.3]
-÷ 5B57 × 002E ÷ 5B57 ÷	#  ÷ [0.2] CJK UNIFIED IDEOGRAPH-5B57 (OLetter) × [12.0] FULL STOP (ATerm) ÷ [11.0] CJK UNIFIED IDEOGRAPH-5B57 (OLetter) ÷ [0.3]
-÷ 0065 × 0074 × 0063 × 002E ÷ 5B83 ÷	#  ÷ [0.2] LATIN SMALL LETTER E (Lower) × [12.0] LATIN SMALL LETTER T (Lower) × [12.0] LATIN SMALL LETTER C (Lower) × [12.0] FULL STOP (ATerm) ÷ [11.0] CJK UNIFIED IDEOGRAPH-5B83 (OLetter) ÷ [0.3]
-÷ 0065 × 0074 × 0063 × 002E × 3002 ÷	#  ÷ [0.2] LATIN SMALL LETTER E (Lower) × [12.0] LATIN SMALL LETTER T (Lower) × [12.0] LATIN SMALL LETTER C (Lower) × [12.0] FULL STOP (ATerm) × [8.1] IDEOGRAPHIC FULL STOP (STerm) ÷ [0.3]
-÷ 5B57 × 3002 ÷ 5B83 ÷	#  ÷ [0.2] CJK UNIFIED IDEOGRAPH-5B57 (OLetter) × [12.0] IDEOGRAPHIC FULL STOP (STerm) ÷ [11.0] CJK UNIFIED IDEOGRAPH-5B83 (OLetter) ÷ [0.3]
-÷ 2060 × 0028 × 2060 × 0022 × 2060 × 0047 × 2060 × 006F × 2060 × 002E × 2060 × 0022 × 2060 × 0029 × 2060 × 0020 × 2060 ÷ 0028 × 2060 × 0048 × 2060 × 0065 × 2060 × 0020 × 2060 × 0064 × 2060 × 0069 × 2060 × 0064 × 2060 × 002E × 2060 × 0029 × 2060 × 2060 ÷	#  ÷ [0.2] WORD JOINER (GCControl_Format) × [12.0] LEFT PARENTHESIS (Close) × [5.0] WORD JOINER (GCControl_Format) × [12.0] QUOTATION MARK (Close) × [5.0] WORD JOINER (GCControl_Format) × [12.0] LATIN CAPITAL LETTER G (Upper) × [5.0] WORD JOINER (GCControl_Format) × [12.0] LATIN SMALL LETTER O (Lower) × [5.0] WORD JOINER (GCControl_Format) × [12.0] FULL STOP (ATerm) × [5.0] WORD JOINER (GCControl_Format) × [9.0] QUOTATION MARK (Close) × [5.0] WORD JOINER (GCControl_Format) × [9.0] RIGHT PARENTHESIS (Close) × [5.0] WORD JOINER (GCControl_Format) × [9.0] SPACE (Sp) × [5.0] WORD JOINER (GCControl_Format) ÷ [11.0] LEFT PARENTHESIS (Close) × [5.0] WORD JOINER (GCControl_Format) × [12.0] LATIN CAPITAL LETTER H (Upper) × [5.0] WORD JOINER (GCControl_Format) × [12.0] LATIN SMALL LETTER E (Lower) × [5.0] WORD JOINER (GCControl_Format) × [12.0] SPACE (Sp) × [5.0] WORD JOINER (GCControl_Format) × [12.0] LATIN SMALL LETTER D (Lower) × [5.0] WORD JOINER (GCControl_Format) × [12.0] LATIN SMALL LETTER I (Lower) × [5.0] WORD JOINER (GCControl_Format) × [12.0] LATIN SMALL LETTER D (Lower) × [5.0] WORD JOINER (GCControl_Format) × [12.0] FULL STOP (ATerm) × [5.0] WORD JOINER (GCControl_Format) × [9.0] RIGHT PARENTHESIS (Close) × [5.0] WORD JOINER (GCControl_Format) × [5.0] WORD JOINER (GCControl_Format) ÷ [0.3]
-÷ 2060 × 0028 × 2060 × 201C × 2060 × 0047 × 2060 × 006F × 2060 × 003F × 2060 × 201D × 2060 × 0029 × 2060 × 0020 × 2060 ÷ 0028 × 2060 × 0048 × 2060 × 0065 × 2060 × 0020 × 2060 × 0064 × 2060 × 0069 × 2060 × 0064 × 2060 × 002E × 2060 × 0029 × 2060 × 2060 ÷	#  ÷ [0.2] WORD JOINER (GCControl_Format) × [12.0] LEFT PARENTHESIS (Close) × [5.0] WORD JOINER (GCControl_Format) × [12.0] LEFT DOUBLE QUOTATION MARK (Close) × [5.0] WORD JOINER (GCControl_Format) × [12.0] LATIN CAPITAL LETTER G (Upper) × [5.0] WORD JOINER (GCControl_Format) × [12.0] LATIN SMALL LETTER O (Lower) × [5.0] WORD JOINER (GCControl_Format) × [12.0] QUESTION MARK (STerm) × [5.0] WORD JOINER (GCControl_Format) × [9.0] RIGHT DOUBLE QUOTATION MARK (Close) × [5.0] WORD JOINER (GCControl_Format) × [9.0] RIGHT PARENTHESIS (Close) × [5.0] WORD JOINER (GCControl_Format) × [9.0] SPACE (Sp) × [5.0] WORD JOINER (GCControl_Format) ÷ [11.0] LEFT PARENTHESIS (Close) × [5.0] WORD JOINER (GCControl_Format) × [12.0] LATIN CAPITAL LETTER H (Upper) × [5.0] WORD JOINER (GCControl_Format) × [12.0] LATIN SMALL LETTER E (Lower) × [5.0] WORD JOINER (GCControl_Format) × [12.0] SPACE (Sp) × [5.0] WORD JOINER (GCControl_Format) × [12.0] LATIN SMALL LETTER D (Lower) × [5.0] WORD JOINER (GCControl_Format) × [12.0] LATIN SMALL LETTER I (Lower) × [5.0] WORD JOINER (GCControl_Format) × [12.0] LATIN SMALL LETTER D (Lower) × [5.0] WORD JOINER (GCControl_Format) × [12.0] FULL STOP (ATerm) × [5.0] WORD JOINER (GCControl_Format) × [9.0] RIGHT PARENTHESIS (Close) × [5.0] WORD JOINER (GCControl_Format) × [5.0] WORD JOINER (GCControl_Format) ÷ [0.3]
-÷ 2060 × 0055 × 2060 × 002E × 2060 × 0053 × 2060 × 002E × 2060 × 0041 × 2060 × 0300 × 002E × 2060 × 0020 × 2060 × 0069 × 2060 × 0073 × 2060 × 2060 ÷	#  ÷ [0.2] WORD JOINER (GCControl_Format) × [12.0] LATIN CAPITAL LETTER U (Upper) × [5.0] WORD JOINER (GCControl_Format) × [12.0] FULL STOP (ATerm) × [5.0] WORD JOINER (GCControl_Format) × [7.0] LATIN CAPITAL LETTER S (Upper) × [5.0] WORD JOINER (GCControl_Format) × [12.0] FULL STOP (ATerm) × [5.0] WORD JOINER (GCControl_Format) × [7.0] LATIN CAPITAL LETTER A (Upper) × [5.0] WORD JOINER (GCControl_Format) × [5.0] COMBINING GRAVE ACCENT (GCExtend) × [12.0] FULL STOP (ATerm) × [5.0] WORD JOINER (GCControl_Format) × [9.0] SPACE (Sp) × [5.0] WORD JOINER (GCControl_Format) × [8.0] LATIN SMALL LETTER I (Lower) × [5.0] WORD JOINER (GCControl_Format) × [12.0] LATIN SMALL LETTER S (Lower) × [5.0] WORD JOINER (GCControl_Format) × [5.0] WORD JOINER (GCControl_Format) ÷ [0.3]
-÷ 2060 × 0055 × 2060 × 002E × 2060 × 0053 × 2060 × 002E × 2060 × 0041 × 2060 × 0300 × 003F × 2060 × 0020 × 2060 ÷ 0048 × 2060 × 0065 × 2060 × 2060 ÷	#  ÷ [0.2] WORD JOINER (GCControl_Format) × [12.0] LATIN CAPITAL LETTER U (Upper) × [5.0] WORD JOINER (GCControl_Format) × [12.0] FULL STOP (ATerm) × [5.0] WORD JOINER (GCControl_Format) × [7.0] LATIN CAPITAL LETTER S (Upper) × [5.0] WORD JOINER (GCControl_Format) × [12.0] FULL STOP (ATerm) × [5.0] WORD JOINER (GCControl_Format) × [7.0] LATIN CAPITAL LETTER A (Upper) × [5.0] WORD JOINER (GCControl_Format) × [5.0] COMBINING GRAVE ACCENT (GCExtend) × [12.0] QUESTION MARK (STerm) × [5.0] WORD JOINER (GCControl_Format) × [9.0] SPACE (Sp) × [5.0] WORD JOINER (GCControl_Format) ÷ [11.0] LATIN CAPITAL LETTER H (Upper) × [5.0] WORD JOINER (GCControl_Format) × [12.0] LATIN SMALL LETTER E (Lower) × [5.0] WORD JOINER (GCControl_Format) × [5.0] WORD JOINER (GCControl_Format) ÷ [0.3]
-÷ 2060 × 0055 × 2060 × 002E × 2060 × 0053 × 2060 × 002E × 2060 × 0041 × 2060 × 0300 × 002E × 2060 × 2060 ÷	#  ÷ [0.2] WORD JOINER (GCControl_Format) × [12.0] LATIN CAPITAL LETTER U (Upper) × [5.0] WORD JOINER (GCControl_Format) × [12.0] FULL STOP (ATerm) × [5.0] WORD JOINER (GCControl_Format) × [7.0] LATIN CAPITAL LETTER S (Upper) × [5.0] WORD JOINER (GCControl_Format) × [12.0] FULL STOP (ATerm) × [5.0] WORD JOINER (GCControl_Format) × [7.0] LATIN CAPITAL LETTER A (Upper) × [5.0] WORD JOINER (GCControl_Format) × [5.0] COMBINING GRAVE ACCENT (GCExtend) × [12.0] FULL STOP (ATerm) × [5.0] WORD JOINER (GCControl_Format) × [5.0] WORD JOINER (GCControl_Format) ÷ [0.3]
-÷ 2060 × 0033 × 2060 × 002E × 2060 × 0034 × 2060 × 2060 ÷	#  ÷ [0.2] WORD JOINER (GCControl_Format) × [12.0] DIGIT THREE (Numeric) × [5.0] WORD JOINER (GCControl_Format) × [12.0] FULL STOP (ATerm) × [5.0] WORD JOINER (GCControl_Format) × [6.0] DIGIT FOUR (Numeric) × [5.0] WORD JOINER (GCControl_Format) × [5.0] WORD JOINER (GCControl_Format) ÷ [0.3]
-÷ 2060 × 0063 × 2060 × 002E × 2060 × 0064 × 2060 × 2060 ÷	#  ÷ [0.2] WORD JOINER (GCControl_Format) × [12.0] LATIN SMALL LETTER C (Lower) × [5.0] WORD JOINER (GCControl_Format) × [12.0] FULL STOP (ATerm) × [5.0] WORD JOINER (GCControl_Format) × [8.0] LATIN SMALL LETTER D (Lower) × [5.0] WORD JOINER (GCControl_Format) × [5.0] WORD JOINER (GCControl_Format) ÷ [0.3]
-÷ 2060 × 0065 × 2060 × 0074 × 2060 × 0063 × 2060 × 002E × 2060 × 0029 × 2060 × 2019 × 2060 ÷ 00A0 × 2060 × 2018 × 2060 × 0028 × 2060 × 0074 × 2060 × 0068 × 2060 × 0065 × 2060 × 2060 ÷	#  ÷ [0.2] WORD JOINER (GCControl_Format) × [12.0] LATIN SMALL LETTER E (Lower) × [5.0] WORD JOINER (GCControl_Format) × [12.0] LATIN SMALL LETTER T (Lower) × [5.0] WORD JOINER (GCControl_Format) × [12.0] LATIN SMALL LETTER C (Lower) × [5.0] WORD JOINER (GCControl_Format) × [12.0] FULL STOP (ATerm) × [5.0] WORD JOINER (GCControl_Format) × [9.0] RIGHT PARENTHESIS (Close) × [5.0] WORD JOINER (GCControl_Format) × [9.0] RIGHT SINGLE QUOTATION MARK (Close) × [5.0] WORD JOINER (GCControl_Format) ÷ [11.0] NO-BREAK SPACE (OLetter) × [5.0] WORD JOINER (GCControl_Format) × [12.0] LEFT SINGLE QUOTATION MARK (Close) × [5.0] WORD JOINER (GCControl_Format) × [12.0] LEFT PARENTHESIS (Close) × [5.0] WORD JOINER (GCControl_Format) × [12.0] LATIN SMALL LETTER T (Lower) × [5.0] WORD JOINER (GCControl_Format) × [12.0] LATIN SMALL LETTER H (Lower) × [5.0] WORD JOINER (GCControl_Format) × [12.0] LATIN SMALL LETTER E (Lower) × [5.0] WORD JOINER (GCControl_Format) × [5.0] WORD JOINER (GCControl_Format) ÷ [0.3]
-÷ 2060 × 0065 × 2060 × 0074 × 2060 × 0063 × 2060 × 002E × 2060 × 0029 × 2060 × 2019 × 2060 ÷ 00A0 × 2060 × 2018 × 2060 × 0028 × 2060 × 0054 × 2060 × 0068 × 2060 × 0065 × 2060 × 2060 ÷	#  ÷ [0.2] WORD JOINER (GCControl_Format) × [12.0] LATIN SMALL LETTER E (Lower) × [5.0] WORD JOINER (GCControl_Format) × [12.0] LATIN SMALL LETTER T (Lower) × [5.0] WORD JOINER (GCControl_Format) × [12.0] LATIN SMALL LETTER C (Lower) × [5.0] WORD JOINER (GCControl_Format) × [12.0] FULL STOP (ATerm) × [5.0] WORD JOINER (GCControl_Format) × [9.0] RIGHT PARENTHESIS (Close) × [5.0] WORD JOINER (GCControl_Format) × [9.0] RIGHT SINGLE QUOTATION MARK (Close) × [5.0] WORD JOINER (GCControl_Format) ÷ [11.0] NO-BREAK SPACE (OLetter) × [5.0] WORD JOINER (GCControl_Format) × [12.0] LEFT SINGLE QUOTATION MARK (Close) × [5.0] WORD JOINER (GCControl_Format) × [12.0] LEFT PARENTHESIS (Close) × [5.0] WORD JOINER (GCControl_Format) × [12.0] LATIN CAPITAL LETTER T (Upper) × [5.0] WORD JOINER (GCControl_Format) × [12.0] LATIN SMALL LETTER H (Lower) × [5.0] WORD JOINER (GCControl_Format) × [12.0] LATIN SMALL LETTER E (Lower) × [5.0] WORD JOINER (GCControl_Format) × [5.0] WORD JOINER (GCControl_Format) ÷ [0.3]
-÷ 2060 × 0074 × 2060 × 0068 × 2060 × 0065 × 2060 × 0020 × 2060 × 0072 × 2060 × 0065 × 2060 × 0073 × 2060 × 0070 × 2060 × 002E × 2060 × 0020 × 2060 × 006C × 2060 × 0065 × 2060 × 0061 × 2060 × 0064 × 2060 × 0065 × 2060 × 0072 × 2060 × 0073 × 2060 × 0020 × 2060 × 0061 × 2060 × 0072 × 2060 × 0065 × 2060 × 2060 ÷	#  ÷ [0.2] WORD JOINER (GCControl_Format) × [12.0] LATIN SMALL LETTER T (Lower) × [5.0] WORD JOINER (GCControl_Format) × [12.0] LATIN SMALL LETTER H (Lower) × [5.0] WORD JOINER (GCControl_Format) × [12.0] LATIN SMALL LETTER E (Lower) × [5.0] WORD JOINER (GCControl_Format) × [12.0] SPACE (Sp) × [5.0] WORD JOINER (GCControl_Format) × [12.0] LATIN SMALL LETTER R (Lower) × [5.0] WORD JOINER (GCControl_Format) × [12.0] LATIN SMALL LETTER E (Lower) × [5.0] WORD JOINER (GCControl_Format) × [12.0] LATIN SMALL LETTER S (Lower) × [5.0] WORD JOINER (GCControl_Format) × [12.0] LATIN SMALL LETTER P (Lower) × [5.0] WORD JOINER (GCControl_Format) × [12.0] FULL STOP (ATerm) × [5.0] WORD JOINER (GCControl_Format) × [9.0] SPACE (Sp) × [5.0] WORD JOINER (GCControl_Format) × [8.0] LATIN SMALL LETTER L (Lower) × [5.0] WORD JOINER (GCControl_Format) × [12.0] LATIN SMALL LETTER E (Lower) × [5.0] WORD JOINER (GCControl_Format) × [12.0] LATIN SMALL LETTER A (Lower) × [5.0] WORD JOINER (GCControl_Format) × [12.0] LATIN SMALL LETTER D (Lower) × [5.0] WORD JOINER (GCControl_Format) × [12.0] LATIN SMALL LETTER E (Lower) × [5.0] WORD JOINER (GCControl_Format) × [12.0] LATIN SMALL LETTER R (Lower) × [5.0] WORD JOINER (GCControl_Format) × [12.0] LATIN SMALL LETTER S (Lower) × [5.0] WORD JOINER (GCControl_Format) × [12.0] SPACE (Sp) × [5.0] WORD JOINER (GCControl_Format) × [12.0] LATIN SMALL LETTER A (Lower) × [5.0] WORD JOINER (GCControl_Format) × [12.0] LATIN SMALL LETTER R (Lower) × [5.0] WORD JOINER (GCControl_Format) × [12.0] LATIN SMALL LETTER E (Lower) × [5.0] WORD JOINER (GCControl_Format) × [5.0] WORD JOINER (GCControl_Format) ÷ [0.3]
-÷ 2060 × 5B57 × 2060 × 002E × 2060 ÷ 5B57 × 2060 × 2060 ÷	#  ÷ [0.2] WORD JOINER (GCControl_Format) × [12.0] CJK UNIFIED IDEOGRAPH-5B57 (OLetter) × [5.0] WORD JOINER (GCControl_Format) × [12.0] FULL STOP (ATerm) × [5.0] WORD JOINER (GCControl_Format) ÷ [11.0] CJK UNIFIED IDEOGRAPH-5B57 (OLetter) × [5.0] WORD JOINER (GCControl_Format) × [5.0] WORD JOINER (GCControl_Format) ÷ [0.3]
-÷ 2060 × 0065 × 2060 × 0074 × 2060 × 0063 × 2060 × 002E × 2060 ÷ 5B83 × 2060 × 2060 ÷	#  ÷ [0.2] WORD JOINER (GCControl_Format) × [12.0] LATIN SMALL LETTER E (Lower) × [5.0] WORD JOINER (GCControl_Format) × [12.0] LATIN SMALL LETTER T (Lower) × [5.0] WORD JOINER (GCControl_Format) × [12.0] LATIN SMALL LETTER C (Lower) × [5.0] WORD JOINER (GCControl_Format) × [12.0] FULL STOP (ATerm) × [5.0] WORD JOINER (GCControl_Format) ÷ [11.0] CJK UNIFIED IDEOGRAPH-5B83 (OLetter) × [5.0] WORD JOINER (GCControl_Format) × [5.0] WORD JOINER (GCControl_Format) ÷ [0.3]
-÷ 2060 × 0065 × 2060 × 0074 × 2060 × 0063 × 2060 × 002E × 2060 × 3002 × 2060 × 2060 ÷	#  ÷ [0.2] WORD JOINER (GCControl_Format) × [12.0] LATIN SMALL LETTER E (Lower) × [5.0] WORD JOINER (GCControl_Format) × [12.0] LATIN SMALL LETTER T (Lower) × [5.0] WORD JOINER (GCControl_Format) × [12.0] LATIN SMALL LETTER C (Lower) × [5.0] WORD JOINER (GCControl_Format) × [12.0] FULL STOP (ATerm) × [5.0] WORD JOINER (GCControl_Format) × [8.1] IDEOGRAPHIC FULL STOP (STerm) × [5.0] WORD JOINER (GCControl_Format) × [5.0] WORD JOINER (GCControl_Format) ÷ [0.3]
-÷ 2060 × 5B57 × 2060 × 3002 × 2060 ÷ 5B83 × 2060 × 2060 ÷	#  ÷ [0.2] WORD JOINER (GCControl_Format) × [12.0] CJK UNIFIED IDEOGRAPH-5B57 (OLetter) × [5.0] WORD JOINER (GCControl_Format) × [12.0] IDEOGRAPHIC FULL STOP (STerm) × [5.0] WORD JOINER (GCControl_Format) ÷ [11.0] CJK UNIFIED IDEOGRAPH-5B83 (OLetter) × [5.0] WORD JOINER (GCControl_Format) × [5.0] WORD JOINER (GCControl_Format) ÷ [0.3]
-# Lines: 256
diff --git a/ucd/auxiliary/WordBreakProperty.txt b/ucd/auxiliary/WordBreakProperty.txt
deleted file mode 100644
index 6803c2a..0000000
--- a/ucd/auxiliary/WordBreakProperty.txt
+++ /dev/null
@@ -1,521 +0,0 @@
-# WordBreakProperty-5.0.0.txt
-# Date: 2006-06-07, 23:23:03 GMT [MD]
-#
-# Unicode Character Database
-# Copyright (c) 1991-2006 Unicode, Inc.
-# For terms of use, see http://www.unicode.org/terms_of_use.html
-# For documentation, see UCD.html
-
-# ================================================
-
-# Property:	Word_Break
-
-#  All code points not explicitly listed for Word_Break
-#  have the value Other (XX).
-
-# @missing: 0000..10FFFF; Other
-
-# ================================================
-
-00AD          ; Format # Cf       SOFT HYPHEN
-0600..0603    ; Format # Cf   [4] ARABIC NUMBER SIGN..ARABIC SIGN SAFHA
-06DD          ; Format # Cf       ARABIC END OF AYAH
-070F          ; Format # Cf       SYRIAC ABBREVIATION MARK
-17B4..17B5    ; Format # Cf   [2] KHMER VOWEL INHERENT AQ..KHMER VOWEL INHERENT AA
-200B          ; Format # Cf       ZERO WIDTH SPACE
-200E..200F    ; Format # Cf   [2] LEFT-TO-RIGHT MARK..RIGHT-TO-LEFT MARK
-202A..202E    ; Format # Cf   [5] LEFT-TO-RIGHT EMBEDDING..RIGHT-TO-LEFT OVERRIDE
-2060..2063    ; Format # Cf   [4] WORD JOINER..INVISIBLE SEPARATOR
-206A..206F    ; Format # Cf   [6] INHIBIT SYMMETRIC SWAPPING..NOMINAL DIGIT SHAPES
-FEFF          ; Format # Cf       ZERO WIDTH NO-BREAK SPACE
-FFF9..FFFB    ; Format # Cf   [3] INTERLINEAR ANNOTATION ANCHOR..INTERLINEAR ANNOTATION TERMINATOR
-1D173..1D17A  ; Format # Cf   [8] MUSICAL SYMBOL BEGIN BEAM..MUSICAL SYMBOL END PHRASE
-E0001         ; Format # Cf       LANGUAGE TAG
-E0020..E007F  ; Format # Cf  [96] TAG SPACE..CANCEL TAG
-
-# Total code points: 136
-
-# ================================================
-
-3031..3035    ; Katakana # Lm   [5] VERTICAL KANA REPEAT MARK..VERTICAL KANA REPEAT MARK LOWER HALF
-309B..309C    ; Katakana # Sk   [2] KATAKANA-HIRAGANA VOICED SOUND MARK..KATAKANA-HIRAGANA SEMI-VOICED SOUND MARK
-30A0          ; Katakana # Pd       KATAKANA-HIRAGANA DOUBLE HYPHEN
-30A1..30FA    ; Katakana # Lo  [90] KATAKANA LETTER SMALL A..KATAKANA LETTER VO
-30FC..30FE    ; Katakana # Lm   [3] KATAKANA-HIRAGANA PROLONGED SOUND MARK..KATAKANA VOICED ITERATION MARK
-30FF          ; Katakana # Lo       KATAKANA DIGRAPH KOTO
-31F0..31FF    ; Katakana # Lo  [16] KATAKANA LETTER SMALL KU..KATAKANA LETTER SMALL RO
-FF66..FF6F    ; Katakana # Lo  [10] HALFWIDTH KATAKANA LETTER WO..HALFWIDTH KATAKANA LETTER SMALL TU
-FF70          ; Katakana # Lm       HALFWIDTH KATAKANA-HIRAGANA PROLONGED SOUND MARK
-FF71..FF9D    ; Katakana # Lo  [45] HALFWIDTH KATAKANA LETTER A..HALFWIDTH KATAKANA LETTER N
-FF9E..FF9F    ; Katakana # Lm   [2] HALFWIDTH KATAKANA VOICED SOUND MARK..HALFWIDTH KATAKANA SEMI-VOICED SOUND MARK
-
-# Total code points: 176
-
-# ================================================
-
-0041..005A    ; ALetter # L&  [26] LATIN CAPITAL LETTER A..LATIN CAPITAL LETTER Z
-0061..007A    ; ALetter # L&  [26] LATIN SMALL LETTER A..LATIN SMALL LETTER Z
-00AA          ; ALetter # L&       FEMININE ORDINAL INDICATOR
-00B5          ; ALetter # L&       MICRO SIGN
-00BA          ; ALetter # L&       MASCULINE ORDINAL INDICATOR
-00C0..00D6    ; ALetter # L&  [23] LATIN CAPITAL LETTER A WITH GRAVE..LATIN CAPITAL LETTER O WITH DIAERESIS
-00D8..00F6    ; ALetter # L&  [31] LATIN CAPITAL LETTER O WITH STROKE..LATIN SMALL LETTER O WITH DIAERESIS
-00F8..01BA    ; ALetter # L& [195] LATIN SMALL LETTER O WITH STROKE..LATIN SMALL LETTER EZH WITH TAIL
-01BB          ; ALetter # Lo       LATIN LETTER TWO WITH STROKE
-01BC..01BF    ; ALetter # L&   [4] LATIN CAPITAL LETTER TONE FIVE..LATIN LETTER WYNN
-01C0..01C3    ; ALetter # Lo   [4] LATIN LETTER DENTAL CLICK..LATIN LETTER RETROFLEX CLICK
-01C4..0293    ; ALetter # L& [208] LATIN CAPITAL LETTER DZ WITH CARON..LATIN SMALL LETTER EZH WITH CURL
-0294          ; ALetter # Lo       LATIN LETTER GLOTTAL STOP
-0295..02AF    ; ALetter # L&  [27] LATIN LETTER PHARYNGEAL VOICED FRICATIVE..LATIN SMALL LETTER TURNED H WITH FISHHOOK AND TAIL
-02B0..02C1    ; ALetter # Lm  [18] MODIFIER LETTER SMALL H..MODIFIER LETTER REVERSED GLOTTAL STOP
-02C6..02D1    ; ALetter # Lm  [12] MODIFIER LETTER CIRCUMFLEX ACCENT..MODIFIER LETTER HALF TRIANGULAR COLON
-02E0..02E4    ; ALetter # Lm   [5] MODIFIER LETTER SMALL GAMMA..MODIFIER LETTER SMALL REVERSED GLOTTAL STOP
-02EE          ; ALetter # Lm       MODIFIER LETTER DOUBLE APOSTROPHE
-037A          ; ALetter # Lm       GREEK YPOGEGRAMMENI
-037B..037D    ; ALetter # L&   [3] GREEK SMALL REVERSED LUNATE SIGMA SYMBOL..GREEK SMALL REVERSED DOTTED LUNATE SIGMA SYMBOL
-0386          ; ALetter # L&       GREEK CAPITAL LETTER ALPHA WITH TONOS
-0388..038A    ; ALetter # L&   [3] GREEK CAPITAL LETTER EPSILON WITH TONOS..GREEK CAPITAL LETTER IOTA WITH TONOS
-038C          ; ALetter # L&       GREEK CAPITAL LETTER OMICRON WITH TONOS
-038E..03A1    ; ALetter # L&  [20] GREEK CAPITAL LETTER UPSILON WITH TONOS..GREEK CAPITAL LETTER RHO
-03A3..03CE    ; ALetter # L&  [44] GREEK CAPITAL LETTER SIGMA..GREEK SMALL LETTER OMEGA WITH TONOS
-03D0..03F5    ; ALetter # L&  [38] GREEK BETA SYMBOL..GREEK LUNATE EPSILON SYMBOL
-03F7..0481    ; ALetter # L& [139] GREEK CAPITAL LETTER SHO..CYRILLIC SMALL LETTER KOPPA
-048A..0513    ; ALetter # L& [138] CYRILLIC CAPITAL LETTER SHORT I WITH TAIL..CYRILLIC SMALL LETTER EL WITH HOOK
-0531..0556    ; ALetter # L&  [38] ARMENIAN CAPITAL LETTER AYB..ARMENIAN CAPITAL LETTER FEH
-0559          ; ALetter # Lm       ARMENIAN MODIFIER LETTER LEFT HALF RING
-0561..0587    ; ALetter # L&  [39] ARMENIAN SMALL LETTER AYB..ARMENIAN SMALL LIGATURE ECH YIWN
-05D0..05EA    ; ALetter # Lo  [27] HEBREW LETTER ALEF..HEBREW LETTER TAV
-05F0..05F2    ; ALetter # Lo   [3] HEBREW LIGATURE YIDDISH DOUBLE VAV..HEBREW LIGATURE YIDDISH DOUBLE YOD
-05F3          ; ALetter # Po       HEBREW PUNCTUATION GERESH
-0621..063A    ; ALetter # Lo  [26] ARABIC LETTER HAMZA..ARABIC LETTER GHAIN
-0640          ; ALetter # Lm       ARABIC TATWEEL
-0641..064A    ; ALetter # Lo  [10] ARABIC LETTER FEH..ARABIC LETTER YEH
-066E..066F    ; ALetter # Lo   [2] ARABIC LETTER DOTLESS BEH..ARABIC LETTER DOTLESS QAF
-0671..06D3    ; ALetter # Lo  [99] ARABIC LETTER ALEF WASLA..ARABIC LETTER YEH BARREE WITH HAMZA ABOVE
-06D5          ; ALetter # Lo       ARABIC LETTER AE
-06E5..06E6    ; ALetter # Lm   [2] ARABIC SMALL WAW..ARABIC SMALL YEH
-06EE..06EF    ; ALetter # Lo   [2] ARABIC LETTER DAL WITH INVERTED V..ARABIC LETTER REH WITH INVERTED V
-06FA..06FC    ; ALetter # Lo   [3] ARABIC LETTER SHEEN WITH DOT BELOW..ARABIC LETTER GHAIN WITH DOT BELOW
-06FF          ; ALetter # Lo       ARABIC LETTER HEH WITH INVERTED V
-0710          ; ALetter # Lo       SYRIAC LETTER ALAPH
-0712..072F    ; ALetter # Lo  [30] SYRIAC LETTER BETH..SYRIAC LETTER PERSIAN DHALATH
-074D..076D    ; ALetter # Lo  [33] SYRIAC LETTER SOGDIAN ZHAIN..ARABIC LETTER SEEN WITH TWO DOTS VERTICALLY ABOVE
-0780..07A5    ; ALetter # Lo  [38] THAANA LETTER HAA..THAANA LETTER WAAVU
-07B1          ; ALetter # Lo       THAANA LETTER NAA
-07CA..07EA    ; ALetter # Lo  [33] NKO LETTER A..NKO LETTER JONA RA
-07F4..07F5    ; ALetter # Lm   [2] NKO HIGH TONE APOSTROPHE..NKO LOW TONE APOSTROPHE
-07FA          ; ALetter # Lm       NKO LAJANYALAN
-0903          ; ALetter # Mc       DEVANAGARI SIGN VISARGA
-0904..0939    ; ALetter # Lo  [54] DEVANAGARI LETTER SHORT A..DEVANAGARI LETTER HA
-093D          ; ALetter # Lo       DEVANAGARI SIGN AVAGRAHA
-093E..0940    ; ALetter # Mc   [3] DEVANAGARI VOWEL SIGN AA..DEVANAGARI VOWEL SIGN II
-0949..094C    ; ALetter # Mc   [4] DEVANAGARI VOWEL SIGN CANDRA O..DEVANAGARI VOWEL SIGN AU
-0950          ; ALetter # Lo       DEVANAGARI OM
-0958..0961    ; ALetter # Lo  [10] DEVANAGARI LETTER QA..DEVANAGARI LETTER VOCALIC LL
-097B..097F    ; ALetter # Lo   [5] DEVANAGARI LETTER GGA..DEVANAGARI LETTER BBA
-0982..0983    ; ALetter # Mc   [2] BENGALI SIGN ANUSVARA..BENGALI SIGN VISARGA
-0985..098C    ; ALetter # Lo   [8] BENGALI LETTER A..BENGALI LETTER VOCALIC L
-098F..0990    ; ALetter # Lo   [2] BENGALI LETTER E..BENGALI LETTER AI
-0993..09A8    ; ALetter # Lo  [22] BENGALI LETTER O..BENGALI LETTER NA
-09AA..09B0    ; ALetter # Lo   [7] BENGALI LETTER PA..BENGALI LETTER RA
-09B2          ; ALetter # Lo       BENGALI LETTER LA
-09B6..09B9    ; ALetter # Lo   [4] BENGALI LETTER SHA..BENGALI LETTER HA
-09BD          ; ALetter # Lo       BENGALI SIGN AVAGRAHA
-09BF..09C0    ; ALetter # Mc   [2] BENGALI VOWEL SIGN I..BENGALI VOWEL SIGN II
-09C7..09C8    ; ALetter # Mc   [2] BENGALI VOWEL SIGN E..BENGALI VOWEL SIGN AI
-09CB..09CC    ; ALetter # Mc   [2] BENGALI VOWEL SIGN O..BENGALI VOWEL SIGN AU
-09CE          ; ALetter # Lo       BENGALI LETTER KHANDA TA
-09DC..09DD    ; ALetter # Lo   [2] BENGALI LETTER RRA..BENGALI LETTER RHA
-09DF..09E1    ; ALetter # Lo   [3] BENGALI LETTER YYA..BENGALI LETTER VOCALIC LL
-09F0..09F1    ; ALetter # Lo   [2] BENGALI LETTER RA WITH MIDDLE DIAGONAL..BENGALI LETTER RA WITH LOWER DIAGONAL
-0A03          ; ALetter # Mc       GURMUKHI SIGN VISARGA
-0A05..0A0A    ; ALetter # Lo   [6] GURMUKHI LETTER A..GURMUKHI LETTER UU
-0A0F..0A10    ; ALetter # Lo   [2] GURMUKHI LETTER EE..GURMUKHI LETTER AI
-0A13..0A28    ; ALetter # Lo  [22] GURMUKHI LETTER OO..GURMUKHI LETTER NA
-0A2A..0A30    ; ALetter # Lo   [7] GURMUKHI LETTER PA..GURMUKHI LETTER RA
-0A32..0A33    ; ALetter # Lo   [2] GURMUKHI LETTER LA..GURMUKHI LETTER LLA
-0A35..0A36    ; ALetter # Lo   [2] GURMUKHI LETTER VA..GURMUKHI LETTER SHA
-0A38..0A39    ; ALetter # Lo   [2] GURMUKHI LETTER SA..GURMUKHI LETTER HA
-0A3E..0A40    ; ALetter # Mc   [3] GURMUKHI VOWEL SIGN AA..GURMUKHI VOWEL SIGN II
-0A59..0A5C    ; ALetter # Lo   [4] GURMUKHI LETTER KHHA..GURMUKHI LETTER RRA
-0A5E          ; ALetter # Lo       GURMUKHI LETTER FA
-0A72..0A74    ; ALetter # Lo   [3] GURMUKHI IRI..GURMUKHI EK ONKAR
-0A83          ; ALetter # Mc       GUJARATI SIGN VISARGA
-0A85..0A8D    ; ALetter # Lo   [9] GUJARATI LETTER A..GUJARATI VOWEL CANDRA E
-0A8F..0A91    ; ALetter # Lo   [3] GUJARATI LETTER E..GUJARATI VOWEL CANDRA O
-0A93..0AA8    ; ALetter # Lo  [22] GUJARATI LETTER O..GUJARATI LETTER NA
-0AAA..0AB0    ; ALetter # Lo   [7] GUJARATI LETTER PA..GUJARATI LETTER RA
-0AB2..0AB3    ; ALetter # Lo   [2] GUJARATI LETTER LA..GUJARATI LETTER LLA
-0AB5..0AB9    ; ALetter # Lo   [5] GUJARATI LETTER VA..GUJARATI LETTER HA
-0ABD          ; ALetter # Lo       GUJARATI SIGN AVAGRAHA
-0ABE..0AC0    ; ALetter # Mc   [3] GUJARATI VOWEL SIGN AA..GUJARATI VOWEL SIGN II
-0AC9          ; ALetter # Mc       GUJARATI VOWEL SIGN CANDRA O
-0ACB..0ACC    ; ALetter # Mc   [2] GUJARATI VOWEL SIGN O..GUJARATI VOWEL SIGN AU
-0AD0          ; ALetter # Lo       GUJARATI OM
-0AE0..0AE1    ; ALetter # Lo   [2] GUJARATI LETTER VOCALIC RR..GUJARATI LETTER VOCALIC LL
-0B02..0B03    ; ALetter # Mc   [2] ORIYA SIGN ANUSVARA..ORIYA SIGN VISARGA
-0B05..0B0C    ; ALetter # Lo   [8] ORIYA LETTER A..ORIYA LETTER VOCALIC L
-0B0F..0B10    ; ALetter # Lo   [2] ORIYA LETTER E..ORIYA LETTER AI
-0B13..0B28    ; ALetter # Lo  [22] ORIYA LETTER O..ORIYA LETTER NA
-0B2A..0B30    ; ALetter # Lo   [7] ORIYA LETTER PA..ORIYA LETTER RA
-0B32..0B33    ; ALetter # Lo   [2] ORIYA LETTER LA..ORIYA LETTER LLA
-0B35..0B39    ; ALetter # Lo   [5] ORIYA LETTER VA..ORIYA LETTER HA
-0B3D          ; ALetter # Lo       ORIYA SIGN AVAGRAHA
-0B40          ; ALetter # Mc       ORIYA VOWEL SIGN II
-0B47..0B48    ; ALetter # Mc   [2] ORIYA VOWEL SIGN E..ORIYA VOWEL SIGN AI
-0B4B..0B4C    ; ALetter # Mc   [2] ORIYA VOWEL SIGN O..ORIYA VOWEL SIGN AU
-0B5C..0B5D    ; ALetter # Lo   [2] ORIYA LETTER RRA..ORIYA LETTER RHA
-0B5F..0B61    ; ALetter # Lo   [3] ORIYA LETTER YYA..ORIYA LETTER VOCALIC LL
-0B71          ; ALetter # Lo       ORIYA LETTER WA
-0B83          ; ALetter # Lo       TAMIL SIGN VISARGA
-0B85..0B8A    ; ALetter # Lo   [6] TAMIL LETTER A..TAMIL LETTER UU
-0B8E..0B90    ; ALetter # Lo   [3] TAMIL LETTER E..TAMIL LETTER AI
-0B92..0B95    ; ALetter # Lo   [4] TAMIL LETTER O..TAMIL LETTER KA
-0B99..0B9A    ; ALetter # Lo   [2] TAMIL LETTER NGA..TAMIL LETTER CA
-0B9C          ; ALetter # Lo       TAMIL LETTER JA
-0B9E..0B9F    ; ALetter # Lo   [2] TAMIL LETTER NYA..TAMIL LETTER TTA
-0BA3..0BA4    ; ALetter # Lo   [2] TAMIL LETTER NNA..TAMIL LETTER TA
-0BA8..0BAA    ; ALetter # Lo   [3] TAMIL LETTER NA..TAMIL LETTER PA
-0BAE..0BB9    ; ALetter # Lo  [12] TAMIL LETTER MA..TAMIL LETTER HA
-0BBF          ; ALetter # Mc       TAMIL VOWEL SIGN I
-0BC1..0BC2    ; ALetter # Mc   [2] TAMIL VOWEL SIGN U..TAMIL VOWEL SIGN UU
-0BC6..0BC8    ; ALetter # Mc   [3] TAMIL VOWEL SIGN E..TAMIL VOWEL SIGN AI
-0BCA..0BCC    ; ALetter # Mc   [3] TAMIL VOWEL SIGN O..TAMIL VOWEL SIGN AU
-0C01..0C03    ; ALetter # Mc   [3] TELUGU SIGN CANDRABINDU..TELUGU SIGN VISARGA
-0C05..0C0C    ; ALetter # Lo   [8] TELUGU LETTER A..TELUGU LETTER VOCALIC L
-0C0E..0C10    ; ALetter # Lo   [3] TELUGU LETTER E..TELUGU LETTER AI
-0C12..0C28    ; ALetter # Lo  [23] TELUGU LETTER O..TELUGU LETTER NA
-0C2A..0C33    ; ALetter # Lo  [10] TELUGU LETTER PA..TELUGU LETTER LLA
-0C35..0C39    ; ALetter # Lo   [5] TELUGU LETTER VA..TELUGU LETTER HA
-0C41..0C44    ; ALetter # Mc   [4] TELUGU VOWEL SIGN U..TELUGU VOWEL SIGN VOCALIC RR
-0C60..0C61    ; ALetter # Lo   [2] TELUGU LETTER VOCALIC RR..TELUGU LETTER VOCALIC LL
-0C82..0C83    ; ALetter # Mc   [2] KANNADA SIGN ANUSVARA..KANNADA SIGN VISARGA
-0C85..0C8C    ; ALetter # Lo   [8] KANNADA LETTER A..KANNADA LETTER VOCALIC L
-0C8E..0C90    ; ALetter # Lo   [3] KANNADA LETTER E..KANNADA LETTER AI
-0C92..0CA8    ; ALetter # Lo  [23] KANNADA LETTER O..KANNADA LETTER NA
-0CAA..0CB3    ; ALetter # Lo  [10] KANNADA LETTER PA..KANNADA LETTER LLA
-0CB5..0CB9    ; ALetter # Lo   [5] KANNADA LETTER VA..KANNADA LETTER HA
-0CBD          ; ALetter # Lo       KANNADA SIGN AVAGRAHA
-0CBE          ; ALetter # Mc       KANNADA VOWEL SIGN AA
-0CC0..0CC1    ; ALetter # Mc   [2] KANNADA VOWEL SIGN II..KANNADA VOWEL SIGN U
-0CC3..0CC4    ; ALetter # Mc   [2] KANNADA VOWEL SIGN VOCALIC R..KANNADA VOWEL SIGN VOCALIC RR
-0CC7..0CC8    ; ALetter # Mc   [2] KANNADA VOWEL SIGN EE..KANNADA VOWEL SIGN AI
-0CCA..0CCB    ; ALetter # Mc   [2] KANNADA VOWEL SIGN O..KANNADA VOWEL SIGN OO
-0CDE          ; ALetter # Lo       KANNADA LETTER FA
-0CE0..0CE1    ; ALetter # Lo   [2] KANNADA LETTER VOCALIC RR..KANNADA LETTER VOCALIC LL
-0D02..0D03    ; ALetter # Mc   [2] MALAYALAM SIGN ANUSVARA..MALAYALAM SIGN VISARGA
-0D05..0D0C    ; ALetter # Lo   [8] MALAYALAM LETTER A..MALAYALAM LETTER VOCALIC L
-0D0E..0D10    ; ALetter # Lo   [3] MALAYALAM LETTER E..MALAYALAM LETTER AI
-0D12..0D28    ; ALetter # Lo  [23] MALAYALAM LETTER O..MALAYALAM LETTER NA
-0D2A..0D39    ; ALetter # Lo  [16] MALAYALAM LETTER PA..MALAYALAM LETTER HA
-0D3F..0D40    ; ALetter # Mc   [2] MALAYALAM VOWEL SIGN I..MALAYALAM VOWEL SIGN II
-0D46..0D48    ; ALetter # Mc   [3] MALAYALAM VOWEL SIGN E..MALAYALAM VOWEL SIGN AI
-0D4A..0D4C    ; ALetter # Mc   [3] MALAYALAM VOWEL SIGN O..MALAYALAM VOWEL SIGN AU
-0D60..0D61    ; ALetter # Lo   [2] MALAYALAM LETTER VOCALIC RR..MALAYALAM LETTER VOCALIC LL
-0D82..0D83    ; ALetter # Mc   [2] SINHALA SIGN ANUSVARAYA..SINHALA SIGN VISARGAYA
-0D85..0D96    ; ALetter # Lo  [18] SINHALA LETTER AYANNA..SINHALA LETTER AUYANNA
-0D9A..0DB1    ; ALetter # Lo  [24] SINHALA LETTER ALPAPRAANA KAYANNA..SINHALA LETTER DANTAJA NAYANNA
-0DB3..0DBB    ; ALetter # Lo   [9] SINHALA LETTER SANYAKA DAYANNA..SINHALA LETTER RAYANNA
-0DBD          ; ALetter # Lo       SINHALA LETTER DANTAJA LAYANNA
-0DC0..0DC6    ; ALetter # Lo   [7] SINHALA LETTER VAYANNA..SINHALA LETTER FAYANNA
-0DD0..0DD1    ; ALetter # Mc   [2] SINHALA VOWEL SIGN KETTI AEDA-PILLA..SINHALA VOWEL SIGN DIGA AEDA-PILLA
-0DD8..0DDE    ; ALetter # Mc   [7] SINHALA VOWEL SIGN GAETTA-PILLA..SINHALA VOWEL SIGN KOMBUVA HAA GAYANUKITTA
-0DF2..0DF3    ; ALetter # Mc   [2] SINHALA VOWEL SIGN DIGA GAETTA-PILLA..SINHALA VOWEL SIGN DIGA GAYANUKITTA
-0F00          ; ALetter # Lo       TIBETAN SYLLABLE OM
-0F40..0F47    ; ALetter # Lo   [8] TIBETAN LETTER KA..TIBETAN LETTER JA
-0F49..0F6A    ; ALetter # Lo  [34] TIBETAN LETTER NYA..TIBETAN LETTER FIXED-FORM RA
-0F7F          ; ALetter # Mc       TIBETAN SIGN RNAM BCAD
-0F88..0F8B    ; ALetter # Lo   [4] TIBETAN SIGN LCE TSA CAN..TIBETAN SIGN GRU MED RGYINGS
-10A0..10C5    ; ALetter # L&  [38] GEORGIAN CAPITAL LETTER AN..GEORGIAN CAPITAL LETTER HOE
-10D0..10FA    ; ALetter # Lo  [43] GEORGIAN LETTER AN..GEORGIAN LETTER AIN
-10FC          ; ALetter # Lm       MODIFIER LETTER GEORGIAN NAR
-1100..1159    ; ALetter # Lo  [90] HANGUL CHOSEONG KIYEOK..HANGUL CHOSEONG YEORINHIEUH
-115F..11A2    ; ALetter # Lo  [68] HANGUL CHOSEONG FILLER..HANGUL JUNGSEONG SSANGARAEA
-11A8..11F9    ; ALetter # Lo  [82] HANGUL JONGSEONG KIYEOK..HANGUL JONGSEONG YEORINHIEUH
-1200..1248    ; ALetter # Lo  [73] ETHIOPIC SYLLABLE HA..ETHIOPIC SYLLABLE QWA
-124A..124D    ; ALetter # Lo   [4] ETHIOPIC SYLLABLE QWI..ETHIOPIC SYLLABLE QWE
-1250..1256    ; ALetter # Lo   [7] ETHIOPIC SYLLABLE QHA..ETHIOPIC SYLLABLE QHO
-1258          ; ALetter # Lo       ETHIOPIC SYLLABLE QHWA
-125A..125D    ; ALetter # Lo   [4] ETHIOPIC SYLLABLE QHWI..ETHIOPIC SYLLABLE QHWE
-1260..1288    ; ALetter # Lo  [41] ETHIOPIC SYLLABLE BA..ETHIOPIC SYLLABLE XWA
-128A..128D    ; ALetter # Lo   [4] ETHIOPIC SYLLABLE XWI..ETHIOPIC SYLLABLE XWE
-1290..12B0    ; ALetter # Lo  [33] ETHIOPIC SYLLABLE NA..ETHIOPIC SYLLABLE KWA
-12B2..12B5    ; ALetter # Lo   [4] ETHIOPIC SYLLABLE KWI..ETHIOPIC SYLLABLE KWE
-12B8..12BE    ; ALetter # Lo   [7] ETHIOPIC SYLLABLE KXA..ETHIOPIC SYLLABLE KXO
-12C0          ; ALetter # Lo       ETHIOPIC SYLLABLE KXWA
-12C2..12C5    ; ALetter # Lo   [4] ETHIOPIC SYLLABLE KXWI..ETHIOPIC SYLLABLE KXWE
-12C8..12D6    ; ALetter # Lo  [15] ETHIOPIC SYLLABLE WA..ETHIOPIC SYLLABLE PHARYNGEAL O
-12D8..1310    ; ALetter # Lo  [57] ETHIOPIC SYLLABLE ZA..ETHIOPIC SYLLABLE GWA
-1312..1315    ; ALetter # Lo   [4] ETHIOPIC SYLLABLE GWI..ETHIOPIC SYLLABLE GWE
-1318..135A    ; ALetter # Lo  [67] ETHIOPIC SYLLABLE GGA..ETHIOPIC SYLLABLE FYA
-1380..138F    ; ALetter # Lo  [16] ETHIOPIC SYLLABLE SEBATBEIT MWA..ETHIOPIC SYLLABLE PWE
-13A0..13F4    ; ALetter # Lo  [85] CHEROKEE LETTER A..CHEROKEE LETTER YV
-1401..166C    ; ALetter # Lo [620] CANADIAN SYLLABICS E..CANADIAN SYLLABICS CARRIER TTSA
-166F..1676    ; ALetter # Lo   [8] CANADIAN SYLLABICS QAI..CANADIAN SYLLABICS NNGAA
-1681..169A    ; ALetter # Lo  [26] OGHAM LETTER BEITH..OGHAM LETTER PEITH
-16A0..16EA    ; ALetter # Lo  [75] RUNIC LETTER FEHU FEOH FE F..RUNIC LETTER X
-16EE..16F0    ; ALetter # Nl   [3] RUNIC ARLAUG SYMBOL..RUNIC BELGTHOR SYMBOL
-1700..170C    ; ALetter # Lo  [13] TAGALOG LETTER A..TAGALOG LETTER YA
-170E..1711    ; ALetter # Lo   [4] TAGALOG LETTER LA..TAGALOG LETTER HA
-1720..1731    ; ALetter # Lo  [18] HANUNOO LETTER A..HANUNOO LETTER HA
-1740..1751    ; ALetter # Lo  [18] BUHID LETTER A..BUHID LETTER HA
-1760..176C    ; ALetter # Lo  [13] TAGBANWA LETTER A..TAGBANWA LETTER YA
-176E..1770    ; ALetter # Lo   [3] TAGBANWA LETTER LA..TAGBANWA LETTER SA
-1820..1842    ; ALetter # Lo  [35] MONGOLIAN LETTER A..MONGOLIAN LETTER CHI
-1843          ; ALetter # Lm       MONGOLIAN LETTER TODO LONG VOWEL SIGN
-1844..1877    ; ALetter # Lo  [52] MONGOLIAN LETTER TODO E..MONGOLIAN LETTER MANCHU ZHA
-1880..18A8    ; ALetter # Lo  [41] MONGOLIAN LETTER ALI GALI ANUSVARA ONE..MONGOLIAN LETTER MANCHU ALI GALI BHA
-1900..191C    ; ALetter # Lo  [29] LIMBU VOWEL-CARRIER LETTER..LIMBU LETTER HA
-1923..1926    ; ALetter # Mc   [4] LIMBU VOWEL SIGN EE..LIMBU VOWEL SIGN AU
-1929..192B    ; ALetter # Mc   [3] LIMBU SUBJOINED LETTER YA..LIMBU SUBJOINED LETTER WA
-1930..1931    ; ALetter # Mc   [2] LIMBU SMALL LETTER KA..LIMBU SMALL LETTER NGA
-1933..1938    ; ALetter # Mc   [6] LIMBU SMALL LETTER TA..LIMBU SMALL LETTER LA
-1A00..1A16    ; ALetter # Lo  [23] BUGINESE LETTER KA..BUGINESE LETTER HA
-1A19..1A1B    ; ALetter # Mc   [3] BUGINESE VOWEL SIGN E..BUGINESE VOWEL SIGN AE
-1B04          ; ALetter # Mc       BALINESE SIGN BISAH
-1B05..1B33    ; ALetter # Lo  [47] BALINESE LETTER AKARA..BALINESE LETTER HA
-1B35          ; ALetter # Mc       BALINESE VOWEL SIGN TEDUNG
-1B3B          ; ALetter # Mc       BALINESE VOWEL SIGN RA REPA TEDUNG
-1B3D..1B41    ; ALetter # Mc   [5] BALINESE VOWEL SIGN LA LENGA TEDUNG..BALINESE VOWEL SIGN TALING REPA TEDUNG
-1B43          ; ALetter # Mc       BALINESE VOWEL SIGN PEPET TEDUNG
-1B45..1B4B    ; ALetter # Lo   [7] BALINESE LETTER KAF SASAK..BALINESE LETTER ASYURA SASAK
-1D00..1D2B    ; ALetter # L&  [44] LATIN LETTER SMALL CAPITAL A..CYRILLIC LETTER SMALL CAPITAL EL
-1D2C..1D61    ; ALetter # Lm  [54] MODIFIER LETTER CAPITAL A..MODIFIER LETTER SMALL CHI
-1D62..1D77    ; ALetter # L&  [22] LATIN SUBSCRIPT SMALL LETTER I..LATIN SMALL LETTER TURNED G
-1D78          ; ALetter # Lm       MODIFIER LETTER CYRILLIC EN
-1D79..1D9A    ; ALetter # L&  [34] LATIN SMALL LETTER INSULAR G..LATIN SMALL LETTER EZH WITH RETROFLEX HOOK
-1D9B..1DBF    ; ALetter # Lm  [37] MODIFIER LETTER SMALL TURNED ALPHA..MODIFIER LETTER SMALL THETA
-1E00..1E9B    ; ALetter # L& [156] LATIN CAPITAL LETTER A WITH RING BELOW..LATIN SMALL LETTER LONG S WITH DOT ABOVE
-1EA0..1EF9    ; ALetter # L&  [90] LATIN CAPITAL LETTER A WITH DOT BELOW..LATIN SMALL LETTER Y WITH TILDE
-1F00..1F15    ; ALetter # L&  [22] GREEK SMALL LETTER ALPHA WITH PSILI..GREEK SMALL LETTER EPSILON WITH DASIA AND OXIA
-1F18..1F1D    ; ALetter # L&   [6] GREEK CAPITAL LETTER EPSILON WITH PSILI..GREEK CAPITAL LETTER EPSILON WITH DASIA AND OXIA
-1F20..1F45    ; ALetter # L&  [38] GREEK SMALL LETTER ETA WITH PSILI..GREEK SMALL LETTER OMICRON WITH DASIA AND OXIA
-1F48..1F4D    ; ALetter # L&   [6] GREEK CAPITAL LETTER OMICRON WITH PSILI..GREEK CAPITAL LETTER OMICRON WITH DASIA AND OXIA
-1F50..1F57    ; ALetter # L&   [8] GREEK SMALL LETTER UPSILON WITH PSILI..GREEK SMALL LETTER UPSILON WITH DASIA AND PERISPOMENI
-1F59          ; ALetter # L&       GREEK CAPITAL LETTER UPSILON WITH DASIA
-1F5B          ; ALetter # L&       GREEK CAPITAL LETTER UPSILON WITH DASIA AND VARIA
-1F5D          ; ALetter # L&       GREEK CAPITAL LETTER UPSILON WITH DASIA AND OXIA
-1F5F..1F7D    ; ALetter # L&  [31] GREEK CAPITAL LETTER UPSILON WITH DASIA AND PERISPOMENI..GREEK SMALL LETTER OMEGA WITH OXIA
-1F80..1FB4    ; ALetter # L&  [53] GREEK SMALL LETTER ALPHA WITH PSILI AND YPOGEGRAMMENI..GREEK SMALL LETTER ALPHA WITH OXIA AND YPOGEGRAMMENI
-1FB6..1FBC    ; ALetter # L&   [7] GREEK SMALL LETTER ALPHA WITH PERISPOMENI..GREEK CAPITAL LETTER ALPHA WITH PROSGEGRAMMENI
-1FBE          ; ALetter # L&       GREEK PROSGEGRAMMENI
-1FC2..1FC4    ; ALetter # L&   [3] GREEK SMALL LETTER ETA WITH VARIA AND YPOGEGRAMMENI..GREEK SMALL LETTER ETA WITH OXIA AND YPOGEGRAMMENI
-1FC6..1FCC    ; ALetter # L&   [7] GREEK SMALL LETTER ETA WITH PERISPOMENI..GREEK CAPITAL LETTER ETA WITH PROSGEGRAMMENI
-1FD0..1FD3    ; ALetter # L&   [4] GREEK SMALL LETTER IOTA WITH VRACHY..GREEK SMALL LETTER IOTA WITH DIALYTIKA AND OXIA
-1FD6..1FDB    ; ALetter # L&   [6] GREEK SMALL LETTER IOTA WITH PERISPOMENI..GREEK CAPITAL LETTER IOTA WITH OXIA
-1FE0..1FEC    ; ALetter # L&  [13] GREEK SMALL LETTER UPSILON WITH VRACHY..GREEK CAPITAL LETTER RHO WITH DASIA
-1FF2..1FF4    ; ALetter # L&   [3] GREEK SMALL LETTER OMEGA WITH VARIA AND YPOGEGRAMMENI..GREEK SMALL LETTER OMEGA WITH OXIA AND YPOGEGRAMMENI
-1FF6..1FFC    ; ALetter # L&   [7] GREEK SMALL LETTER OMEGA WITH PERISPOMENI..GREEK CAPITAL LETTER OMEGA WITH PROSGEGRAMMENI
-2071          ; ALetter # L&       SUPERSCRIPT LATIN SMALL LETTER I
-207F          ; ALetter # L&       SUPERSCRIPT LATIN SMALL LETTER N
-2090..2094    ; ALetter # Lm   [5] LATIN SUBSCRIPT SMALL LETTER A..LATIN SUBSCRIPT SMALL LETTER SCHWA
-2102          ; ALetter # L&       DOUBLE-STRUCK CAPITAL C
-2107          ; ALetter # L&       EULER CONSTANT
-210A..2113    ; ALetter # L&  [10] SCRIPT SMALL G..SCRIPT SMALL L
-2115          ; ALetter # L&       DOUBLE-STRUCK CAPITAL N
-2119..211D    ; ALetter # L&   [5] DOUBLE-STRUCK CAPITAL P..DOUBLE-STRUCK CAPITAL R
-2124          ; ALetter # L&       DOUBLE-STRUCK CAPITAL Z
-2126          ; ALetter # L&       OHM SIGN
-2128          ; ALetter # L&       BLACK-LETTER CAPITAL Z
-212A..212D    ; ALetter # L&   [4] KELVIN SIGN..BLACK-LETTER CAPITAL C
-212F..2134    ; ALetter # L&   [6] SCRIPT SMALL E..SCRIPT SMALL O
-2135..2138    ; ALetter # Lo   [4] ALEF SYMBOL..DALET SYMBOL
-2139          ; ALetter # L&       INFORMATION SOURCE
-213C..213F    ; ALetter # L&   [4] DOUBLE-STRUCK SMALL PI..DOUBLE-STRUCK CAPITAL PI
-2145..2149    ; ALetter # L&   [5] DOUBLE-STRUCK ITALIC CAPITAL D..DOUBLE-STRUCK ITALIC SMALL J
-214E          ; ALetter # L&       TURNED SMALL F
-2160..2182    ; ALetter # Nl  [35] ROMAN NUMERAL ONE..ROMAN NUMERAL TEN THOUSAND
-2183..2184    ; ALetter # L&   [2] ROMAN NUMERAL REVERSED ONE HUNDRED..LATIN SMALL LETTER REVERSED C
-24B6..24E9    ; ALetter # So  [52] CIRCLED LATIN CAPITAL LETTER A..CIRCLED LATIN SMALL LETTER Z
-2C00..2C2E    ; ALetter # L&  [47] GLAGOLITIC CAPITAL LETTER AZU..GLAGOLITIC CAPITAL LETTER LATINATE MYSLITE
-2C30..2C5E    ; ALetter # L&  [47] GLAGOLITIC SMALL LETTER AZU..GLAGOLITIC SMALL LETTER LATINATE MYSLITE
-2C60..2C6C    ; ALetter # L&  [13] LATIN CAPITAL LETTER L WITH DOUBLE BAR..LATIN SMALL LETTER Z WITH DESCENDER
-2C74..2C77    ; ALetter # L&   [4] LATIN SMALL LETTER V WITH CURL..LATIN SMALL LETTER TAILLESS PHI
-2C80..2CE4    ; ALetter # L& [101] COPTIC CAPITAL LETTER ALFA..COPTIC SYMBOL KAI
-2D00..2D25    ; ALetter # L&  [38] GEORGIAN SMALL LETTER AN..GEORGIAN SMALL LETTER HOE
-2D30..2D65    ; ALetter # Lo  [54] TIFINAGH LETTER YA..TIFINAGH LETTER YAZZ
-2D6F          ; ALetter # Lm       TIFINAGH MODIFIER LETTER LABIALIZATION MARK
-2D80..2D96    ; ALetter # Lo  [23] ETHIOPIC SYLLABLE LOA..ETHIOPIC SYLLABLE GGWE
-2DA0..2DA6    ; ALetter # Lo   [7] ETHIOPIC SYLLABLE SSA..ETHIOPIC SYLLABLE SSO
-2DA8..2DAE    ; ALetter # Lo   [7] ETHIOPIC SYLLABLE CCA..ETHIOPIC SYLLABLE CCO
-2DB0..2DB6    ; ALetter # Lo   [7] ETHIOPIC SYLLABLE ZZA..ETHIOPIC SYLLABLE ZZO
-2DB8..2DBE    ; ALetter # Lo   [7] ETHIOPIC SYLLABLE CCHA..ETHIOPIC SYLLABLE CCHO
-2DC0..2DC6    ; ALetter # Lo   [7] ETHIOPIC SYLLABLE QYA..ETHIOPIC SYLLABLE QYO
-2DC8..2DCE    ; ALetter # Lo   [7] ETHIOPIC SYLLABLE KYA..ETHIOPIC SYLLABLE KYO
-2DD0..2DD6    ; ALetter # Lo   [7] ETHIOPIC SYLLABLE XYA..ETHIOPIC SYLLABLE XYO
-2DD8..2DDE    ; ALetter # Lo   [7] ETHIOPIC SYLLABLE GYA..ETHIOPIC SYLLABLE GYO
-3005          ; ALetter # Lm       IDEOGRAPHIC ITERATION MARK
-303B          ; ALetter # Lm       VERTICAL IDEOGRAPHIC ITERATION MARK
-303C          ; ALetter # Lo       MASU MARK
-3105..312C    ; ALetter # Lo  [40] BOPOMOFO LETTER B..BOPOMOFO LETTER GN
-3131..318E    ; ALetter # Lo  [94] HANGUL LETTER KIYEOK..HANGUL LETTER ARAEAE
-31A0..31B7    ; ALetter # Lo  [24] BOPOMOFO LETTER BU..BOPOMOFO FINAL LETTER H
-A000..A014    ; ALetter # Lo  [21] YI SYLLABLE IT..YI SYLLABLE E
-A015          ; ALetter # Lm       YI SYLLABLE WU
-A016..A48C    ; ALetter # Lo [1143] YI SYLLABLE BIT..YI SYLLABLE YYR
-A717..A71A    ; ALetter # Lm   [4] MODIFIER LETTER DOT VERTICAL BAR..MODIFIER LETTER LOWER RIGHT CORNER ANGLE
-A800..A801    ; ALetter # Lo   [2] SYLOTI NAGRI LETTER A..SYLOTI NAGRI LETTER I
-A803..A805    ; ALetter # Lo   [3] SYLOTI NAGRI LETTER U..SYLOTI NAGRI LETTER O
-A807..A80A    ; ALetter # Lo   [4] SYLOTI NAGRI LETTER KO..SYLOTI NAGRI LETTER GHO
-A80C..A822    ; ALetter # Lo  [23] SYLOTI NAGRI LETTER CO..SYLOTI NAGRI LETTER HO
-A823..A824    ; ALetter # Mc   [2] SYLOTI NAGRI VOWEL SIGN A..SYLOTI NAGRI VOWEL SIGN I
-A827          ; ALetter # Mc       SYLOTI NAGRI VOWEL SIGN OO
-A840..A873    ; ALetter # Lo  [52] PHAGS-PA LETTER KA..PHAGS-PA LETTER CANDRABINDU
-AC00..D7A3    ; ALetter # Lo [11172] HANGUL SYLLABLE GA..HANGUL SYLLABLE HIH
-FA30..FA6A    ; ALetter # Lo  [59] CJK COMPATIBILITY IDEOGRAPH-FA30..CJK COMPATIBILITY IDEOGRAPH-FA6A
-FB00..FB06    ; ALetter # L&   [7] LATIN SMALL LIGATURE FF..LATIN SMALL LIGATURE ST
-FB13..FB17    ; ALetter # L&   [5] ARMENIAN SMALL LIGATURE MEN NOW..ARMENIAN SMALL LIGATURE MEN XEH
-FB1D          ; ALetter # Lo       HEBREW LETTER YOD WITH HIRIQ
-FB1F..FB28    ; ALetter # Lo  [10] HEBREW LIGATURE YIDDISH YOD YOD PATAH..HEBREW LETTER WIDE TAV
-FB2A..FB36    ; ALetter # Lo  [13] HEBREW LETTER SHIN WITH SHIN DOT..HEBREW LETTER ZAYIN WITH DAGESH
-FB38..FB3C    ; ALetter # Lo   [5] HEBREW LETTER TET WITH DAGESH..HEBREW LETTER LAMED WITH DAGESH
-FB3E          ; ALetter # Lo       HEBREW LETTER MEM WITH DAGESH
-FB40..FB41    ; ALetter # Lo   [2] HEBREW LETTER NUN WITH DAGESH..HEBREW LETTER SAMEKH WITH DAGESH
-FB43..FB44    ; ALetter # Lo   [2] HEBREW LETTER FINAL PE WITH DAGESH..HEBREW LETTER PE WITH DAGESH
-FB46..FBB1    ; ALetter # Lo [108] HEBREW LETTER TSADI WITH DAGESH..ARABIC LETTER YEH BARREE WITH HAMZA ABOVE FINAL FORM
-FBD3..FD3D    ; ALetter # Lo [363] ARABIC LETTER NG ISOLATED FORM..ARABIC LIGATURE ALEF WITH FATHATAN ISOLATED FORM
-FD50..FD8F    ; ALetter # Lo  [64] ARABIC LIGATURE TEH WITH JEEM WITH MEEM INITIAL FORM..ARABIC LIGATURE MEEM WITH KHAH WITH MEEM INITIAL FORM
-FD92..FDC7    ; ALetter # Lo  [54] ARABIC LIGATURE MEEM WITH JEEM WITH KHAH INITIAL FORM..ARABIC LIGATURE NOON WITH JEEM WITH YEH FINAL FORM
-FDF0..FDFB    ; ALetter # Lo  [12] ARABIC LIGATURE SALLA USED AS KORANIC STOP SIGN ISOLATED FORM..ARABIC LIGATURE JALLAJALALOUHOU
-FE70..FE74    ; ALetter # Lo   [5] ARABIC FATHATAN ISOLATED FORM..ARABIC KASRATAN ISOLATED FORM
-FE76..FEFC    ; ALetter # Lo [135] ARABIC FATHA ISOLATED FORM..ARABIC LIGATURE LAM WITH ALEF FINAL FORM
-FF21..FF3A    ; ALetter # L&  [26] FULLWIDTH LATIN CAPITAL LETTER A..FULLWIDTH LATIN CAPITAL LETTER Z
-FF41..FF5A    ; ALetter # L&  [26] FULLWIDTH LATIN SMALL LETTER A..FULLWIDTH LATIN SMALL LETTER Z
-FFA0..FFBE    ; ALetter # Lo  [31] HALFWIDTH HANGUL FILLER..HALFWIDTH HANGUL LETTER HIEUH
-FFC2..FFC7    ; ALetter # Lo   [6] HALFWIDTH HANGUL LETTER A..HALFWIDTH HANGUL LETTER E
-FFCA..FFCF    ; ALetter # Lo   [6] HALFWIDTH HANGUL LETTER YEO..HALFWIDTH HANGUL LETTER OE
-FFD2..FFD7    ; ALetter # Lo   [6] HALFWIDTH HANGUL LETTER YO..HALFWIDTH HANGUL LETTER YU
-FFDA..FFDC    ; ALetter # Lo   [3] HALFWIDTH HANGUL LETTER EU..HALFWIDTH HANGUL LETTER I
-10000..1000B  ; ALetter # Lo  [12] LINEAR B SYLLABLE B008 A..LINEAR B SYLLABLE B046 JE
-1000D..10026  ; ALetter # Lo  [26] LINEAR B SYLLABLE B036 JO..LINEAR B SYLLABLE B032 QO
-10028..1003A  ; ALetter # Lo  [19] LINEAR B SYLLABLE B060 RA..LINEAR B SYLLABLE B042 WO
-1003C..1003D  ; ALetter # Lo   [2] LINEAR B SYLLABLE B017 ZA..LINEAR B SYLLABLE B074 ZE
-1003F..1004D  ; ALetter # Lo  [15] LINEAR B SYLLABLE B020 ZO..LINEAR B SYLLABLE B091 TWO
-10050..1005D  ; ALetter # Lo  [14] LINEAR B SYMBOL B018..LINEAR B SYMBOL B089
-10080..100FA  ; ALetter # Lo [123] LINEAR B IDEOGRAM B100 MAN..LINEAR B IDEOGRAM VESSEL B305
-10140..10174  ; ALetter # Nl  [53] GREEK ACROPHONIC ATTIC ONE QUARTER..GREEK ACROPHONIC STRATIAN FIFTY MNAS
-10300..1031E  ; ALetter # Lo  [31] OLD ITALIC LETTER A..OLD ITALIC LETTER UU
-10330..10340  ; ALetter # Lo  [17] GOTHIC LETTER AHSA..GOTHIC LETTER PAIRTHRA
-10341         ; ALetter # Nl       GOTHIC LETTER NINETY
-10342..10349  ; ALetter # Lo   [8] GOTHIC LETTER RAIDA..GOTHIC LETTER OTHAL
-1034A         ; ALetter # Nl       GOTHIC LETTER NINE HUNDRED
-10380..1039D  ; ALetter # Lo  [30] UGARITIC LETTER ALPA..UGARITIC LETTER SSU
-103A0..103C3  ; ALetter # Lo  [36] OLD PERSIAN SIGN A..OLD PERSIAN SIGN HA
-103C8..103CF  ; ALetter # Lo   [8] OLD PERSIAN SIGN AURAMAZDAA..OLD PERSIAN SIGN BUUMISH
-103D1..103D5  ; ALetter # Nl   [5] OLD PERSIAN NUMBER ONE..OLD PERSIAN NUMBER HUNDRED
-10400..1044F  ; ALetter # L&  [80] DESERET CAPITAL LETTER LONG I..DESERET SMALL LETTER EW
-10450..1049D  ; ALetter # Lo  [78] SHAVIAN LETTER PEEP..OSMANYA LETTER OO
-10800..10805  ; ALetter # Lo   [6] CYPRIOT SYLLABLE A..CYPRIOT SYLLABLE JA
-10808         ; ALetter # Lo       CYPRIOT SYLLABLE JO
-1080A..10835  ; ALetter # Lo  [44] CYPRIOT SYLLABLE KA..CYPRIOT SYLLABLE WO
-10837..10838  ; ALetter # Lo   [2] CYPRIOT SYLLABLE XA..CYPRIOT SYLLABLE XE
-1083C         ; ALetter # Lo       CYPRIOT SYLLABLE ZA
-1083F         ; ALetter # Lo       CYPRIOT SYLLABLE ZO
-10900..10915  ; ALetter # Lo  [22] PHOENICIAN LETTER ALF..PHOENICIAN LETTER TAU
-10A00         ; ALetter # Lo       KHAROSHTHI LETTER A
-10A10..10A13  ; ALetter # Lo   [4] KHAROSHTHI LETTER KA..KHAROSHTHI LETTER GHA
-10A15..10A17  ; ALetter # Lo   [3] KHAROSHTHI LETTER CA..KHAROSHTHI LETTER JA
-10A19..10A33  ; ALetter # Lo  [27] KHAROSHTHI LETTER NYA..KHAROSHTHI LETTER TTTHA
-12000..1236E  ; ALetter # Lo [879] CUNEIFORM SIGN A..CUNEIFORM SIGN ZUM
-12400..12462  ; ALetter # Nl  [99] CUNEIFORM NUMERIC SIGN TWO ASH..CUNEIFORM NUMERIC SIGN OLD ASSYRIAN ONE QUARTER
-1D400..1D454  ; ALetter # L&  [85] MATHEMATICAL BOLD CAPITAL A..MATHEMATICAL ITALIC SMALL G
-1D456..1D49C  ; ALetter # L&  [71] MATHEMATICAL ITALIC SMALL I..MATHEMATICAL SCRIPT CAPITAL A
-1D49E..1D49F  ; ALetter # L&   [2] MATHEMATICAL SCRIPT CAPITAL C..MATHEMATICAL SCRIPT CAPITAL D
-1D4A2         ; ALetter # L&       MATHEMATICAL SCRIPT CAPITAL G
-1D4A5..1D4A6  ; ALetter # L&   [2] MATHEMATICAL SCRIPT CAPITAL J..MATHEMATICAL SCRIPT CAPITAL K
-1D4A9..1D4AC  ; ALetter # L&   [4] MATHEMATICAL SCRIPT CAPITAL N..MATHEMATICAL SCRIPT CAPITAL Q
-1D4AE..1D4B9  ; ALetter # L&  [12] MATHEMATICAL SCRIPT CAPITAL S..MATHEMATICAL SCRIPT SMALL D
-1D4BB         ; ALetter # L&       MATHEMATICAL SCRIPT SMALL F
-1D4BD..1D4C3  ; ALetter # L&   [7] MATHEMATICAL SCRIPT SMALL H..MATHEMATICAL SCRIPT SMALL N
-1D4C5..1D505  ; ALetter # L&  [65] MATHEMATICAL SCRIPT SMALL P..MATHEMATICAL FRAKTUR CAPITAL B
-1D507..1D50A  ; ALetter # L&   [4] MATHEMATICAL FRAKTUR CAPITAL D..MATHEMATICAL FRAKTUR CAPITAL G
-1D50D..1D514  ; ALetter # L&   [8] MATHEMATICAL FRAKTUR CAPITAL J..MATHEMATICAL FRAKTUR CAPITAL Q
-1D516..1D51C  ; ALetter # L&   [7] MATHEMATICAL FRAKTUR CAPITAL S..MATHEMATICAL FRAKTUR CAPITAL Y
-1D51E..1D539  ; ALetter # L&  [28] MATHEMATICAL FRAKTUR SMALL A..MATHEMATICAL DOUBLE-STRUCK CAPITAL B
-1D53B..1D53E  ; ALetter # L&   [4] MATHEMATICAL DOUBLE-STRUCK CAPITAL D..MATHEMATICAL DOUBLE-STRUCK CAPITAL G
-1D540..1D544  ; ALetter # L&   [5] MATHEMATICAL DOUBLE-STRUCK CAPITAL I..MATHEMATICAL DOUBLE-STRUCK CAPITAL M
-1D546         ; ALetter # L&       MATHEMATICAL DOUBLE-STRUCK CAPITAL O
-1D54A..1D550  ; ALetter # L&   [7] MATHEMATICAL DOUBLE-STRUCK CAPITAL S..MATHEMATICAL DOUBLE-STRUCK CAPITAL Y
-1D552..1D6A5  ; ALetter # L& [340] MATHEMATICAL DOUBLE-STRUCK SMALL A..MATHEMATICAL ITALIC SMALL DOTLESS J
-1D6A8..1D6C0  ; ALetter # L&  [25] MATHEMATICAL BOLD CAPITAL ALPHA..MATHEMATICAL BOLD CAPITAL OMEGA
-1D6C2..1D6DA  ; ALetter # L&  [25] MATHEMATICAL BOLD SMALL ALPHA..MATHEMATICAL BOLD SMALL OMEGA
-1D6DC..1D6FA  ; ALetter # L&  [31] MATHEMATICAL BOLD EPSILON SYMBOL..MATHEMATICAL ITALIC CAPITAL OMEGA
-1D6FC..1D714  ; ALetter # L&  [25] MATHEMATICAL ITALIC SMALL ALPHA..MATHEMATICAL ITALIC SMALL OMEGA
-1D716..1D734  ; ALetter # L&  [31] MATHEMATICAL ITALIC EPSILON SYMBOL..MATHEMATICAL BOLD ITALIC CAPITAL OMEGA
-1D736..1D74E  ; ALetter # L&  [25] MATHEMATICAL BOLD ITALIC SMALL ALPHA..MATHEMATICAL BOLD ITALIC SMALL OMEGA
-1D750..1D76E  ; ALetter # L&  [31] MATHEMATICAL BOLD ITALIC EPSILON SYMBOL..MATHEMATICAL SANS-SERIF BOLD CAPITAL OMEGA
-1D770..1D788  ; ALetter # L&  [25] MATHEMATICAL SANS-SERIF BOLD SMALL ALPHA..MATHEMATICAL SANS-SERIF BOLD SMALL OMEGA
-1D78A..1D7A8  ; ALetter # L&  [31] MATHEMATICAL SANS-SERIF BOLD EPSILON SYMBOL..MATHEMATICAL SANS-SERIF BOLD ITALIC CAPITAL OMEGA
-1D7AA..1D7C2  ; ALetter # L&  [25] MATHEMATICAL SANS-SERIF BOLD ITALIC SMALL ALPHA..MATHEMATICAL SANS-SERIF BOLD ITALIC SMALL OMEGA
-1D7C4..1D7CB  ; ALetter # L&   [8] MATHEMATICAL SANS-SERIF BOLD ITALIC EPSILON SYMBOL..MATHEMATICAL BOLD SMALL DIGAMMA
-
-# Total code points: 21149
-
-# ================================================
-
-0027          ; MidLetter # Po       APOSTROPHE
-003A          ; MidLetter # Po       COLON
-00B7          ; MidLetter # Po       MIDDLE DOT
-05F4          ; MidLetter # Po       HEBREW PUNCTUATION GERSHAYIM
-2019          ; MidLetter # Pf       RIGHT SINGLE QUOTATION MARK
-2027          ; MidLetter # Po       HYPHENATION POINT
-
-# Total code points: 6
-
-# ================================================
-
-002C          ; MidNum # Po       COMMA
-002E          ; MidNum # Po       FULL STOP
-003B          ; MidNum # Po       SEMICOLON
-037E          ; MidNum # Po       GREEK QUESTION MARK
-0589          ; MidNum # Po       ARMENIAN FULL STOP
-060D          ; MidNum # Po       ARABIC DATE SEPARATOR
-07F8          ; MidNum # Po       NKO COMMA
-2044          ; MidNum # Sm       FRACTION SLASH
-FE10          ; MidNum # Po       PRESENTATION FORM FOR VERTICAL COMMA
-FE13..FE14    ; MidNum # Po   [2] PRESENTATION FORM FOR VERTICAL COLON..PRESENTATION FORM FOR VERTICAL SEMICOLON
-
-# Total code points: 11
-
-# ================================================
-
-0030..0039    ; Numeric # Nd  [10] DIGIT ZERO..DIGIT NINE
-0660..0669    ; Numeric # Nd  [10] ARABIC-INDIC DIGIT ZERO..ARABIC-INDIC DIGIT NINE
-066B..066C    ; Numeric # Po   [2] ARABIC DECIMAL SEPARATOR..ARABIC THOUSANDS SEPARATOR
-06F0..06F9    ; Numeric # Nd  [10] EXTENDED ARABIC-INDIC DIGIT ZERO..EXTENDED ARABIC-INDIC DIGIT NINE
-07C0..07C9    ; Numeric # Nd  [10] NKO DIGIT ZERO..NKO DIGIT NINE
-0966..096F    ; Numeric # Nd  [10] DEVANAGARI DIGIT ZERO..DEVANAGARI DIGIT NINE
-09E6..09EF    ; Numeric # Nd  [10] BENGALI DIGIT ZERO..BENGALI DIGIT NINE
-0A66..0A6F    ; Numeric # Nd  [10] GURMUKHI DIGIT ZERO..GURMUKHI DIGIT NINE
-0AE6..0AEF    ; Numeric # Nd  [10] GUJARATI DIGIT ZERO..GUJARATI DIGIT NINE
-0B66..0B6F    ; Numeric # Nd  [10] ORIYA DIGIT ZERO..ORIYA DIGIT NINE
-0BE6..0BEF    ; Numeric # Nd  [10] TAMIL DIGIT ZERO..TAMIL DIGIT NINE
-0C66..0C6F    ; Numeric # Nd  [10] TELUGU DIGIT ZERO..TELUGU DIGIT NINE
-0CE6..0CEF    ; Numeric # Nd  [10] KANNADA DIGIT ZERO..KANNADA DIGIT NINE
-0D66..0D6F    ; Numeric # Nd  [10] MALAYALAM DIGIT ZERO..MALAYALAM DIGIT NINE
-0E50..0E59    ; Numeric # Nd  [10] THAI DIGIT ZERO..THAI DIGIT NINE
-0ED0..0ED9    ; Numeric # Nd  [10] LAO DIGIT ZERO..LAO DIGIT NINE
-0F20..0F29    ; Numeric # Nd  [10] TIBETAN DIGIT ZERO..TIBETAN DIGIT NINE
-1040..1049    ; Numeric # Nd  [10] MYANMAR DIGIT ZERO..MYANMAR DIGIT NINE
-17E0..17E9    ; Numeric # Nd  [10] KHMER DIGIT ZERO..KHMER DIGIT NINE
-1810..1819    ; Numeric # Nd  [10] MONGOLIAN DIGIT ZERO..MONGOLIAN DIGIT NINE
-1946..194F    ; Numeric # Nd  [10] LIMBU DIGIT ZERO..LIMBU DIGIT NINE
-19D0..19D9    ; Numeric # Nd  [10] NEW TAI LUE DIGIT ZERO..NEW TAI LUE DIGIT NINE
-1B50..1B59    ; Numeric # Nd  [10] BALINESE DIGIT ZERO..BALINESE DIGIT NINE
-104A0..104A9  ; Numeric # Nd  [10] OSMANYA DIGIT ZERO..OSMANYA DIGIT NINE
-1D7CE..1D7FF  ; Numeric # Nd  [50] MATHEMATICAL BOLD DIGIT ZERO..MATHEMATICAL MONOSPACE DIGIT NINE
-
-# Total code points: 282
-
-# ================================================
-
-005F          ; ExtendNumLet # Pc       LOW LINE
-203F..2040    ; ExtendNumLet # Pc   [2] UNDERTIE..CHARACTER TIE
-2054          ; ExtendNumLet # Pc       INVERTED UNDERTIE
-FE33..FE34    ; ExtendNumLet # Pc   [2] PRESENTATION FORM FOR VERTICAL LOW LINE..PRESENTATION FORM FOR VERTICAL WAVY LOW LINE
-FE4D..FE4F    ; ExtendNumLet # Pc   [3] DASHED LOW LINE..WAVY LOW LINE
-FF3F          ; ExtendNumLet # Pc       FULLWIDTH LOW LINE
-
-# Total code points: 10
-
-# EOF
diff --git a/ucd/auxiliary/WordBreakTest.txt b/ucd/auxiliary/WordBreakTest.txt
deleted file mode 100644
index 2ece456..0000000
--- a/ucd/auxiliary/WordBreakTest.txt
+++ /dev/null
@@ -1,517 +0,0 @@
-# WordBreakTest-5.0.0.txt
-# Date: 2006-06-11, 20:09:15 GMT [MD]
-#
-# Unicode Character Database
-# Copyright (c) 1991-2006 Unicode, Inc.
-# For terms of use, see http://www.unicode.org/terms_of_use.html
-# For documentation, see UCD.html
-#
-# Default Word Break Test
-#
-# Format:
-# <string> (# <comment>)? 
-#  <string> contains hex Unicode code points, with 
-#	÷ wherever there is a break opportunity, and 
-#	× wherever there is not.
-#  <comment> the format can change, but currently it shows:
-#	- the sample character name
-#	- (x) the line_break property* for the sample character
-#	- [x] the rule that determines whether there is a break or not
-#
-# These samples may be extended or changed in the future.
-#
-÷ 0020 ÷ 0020 ÷	#  ÷ [0.2] SPACE (Other) ÷ [999.0] SPACE (Other) ÷ [0.3]
-÷ 0020 ÷ 0001 ÷	#  ÷ [0.2] SPACE (Other) ÷ [999.0] <START OF HEADING> (GCControl) ÷ [0.3]
-÷ 0020 × 0300 ÷	#  ÷ [0.2] SPACE (Other) × [4.0] COMBINING GRAVE ACCENT (GCExtend) ÷ [0.3]
-÷ 0020 ÷ 000A ÷	#  ÷ [0.2] SPACE (Other) ÷ [999.0] <LINE FEED (LF)> (GCLF_Sep) ÷ [0.3]
-÷ 0020 ÷ 000D ÷	#  ÷ [0.2] SPACE (Other) ÷ [999.0] <CARRIAGE RETURN (CR)> (GCCR_Sep) ÷ [0.3]
-÷ 0020 ÷ 0085 ÷	#  ÷ [0.2] SPACE (Other) ÷ [999.0] <NEXT LINE (NEL)> (GCControl_Sep) ÷ [0.3]
-÷ 0020 × 00AD ÷	#  ÷ [0.2] SPACE (Other) × [4.0] SOFT HYPHEN (GCControl_Format) ÷ [0.3]
-÷ 0020 ÷ 3031 ÷	#  ÷ [0.2] SPACE (Other) ÷ [999.0] VERTICAL KANA REPEAT MARK (Katakana) ÷ [0.3]
-÷ 0020 ÷ 0041 ÷	#  ÷ [0.2] SPACE (Other) ÷ [999.0] LATIN CAPITAL LETTER A (ALetter) ÷ [0.3]
-÷ 0020 ÷ 0027 ÷	#  ÷ [0.2] SPACE (Other) ÷ [999.0] APOSTROPHE (MidLetter) ÷ [0.3]
-÷ 0020 ÷ 002C ÷	#  ÷ [0.2] SPACE (Other) ÷ [999.0] COMMA (MidNum) ÷ [0.3]
-÷ 0020 ÷ 0030 ÷	#  ÷ [0.2] SPACE (Other) ÷ [999.0] DIGIT ZERO (Numeric) ÷ [0.3]
-÷ 0020 ÷ 005F ÷	#  ÷ [0.2] SPACE (Other) ÷ [999.0] LOW LINE (ExtendNumLet) ÷ [0.3]
-÷ 0020 ÷ 0061 × 2060 ÷	#  ÷ [0.2] SPACE (Other) ÷ [999.0] LATIN SMALL LETTER A (ALetter) × [4.0] WORD JOINER (GCControl_Format) ÷ [0.3]
-÷ 0020 ÷ 0061 ÷ 003A ÷	#  ÷ [0.2] SPACE (Other) ÷ [999.0] LATIN SMALL LETTER A (ALetter) ÷ [999.0] COLON (MidLetter) ÷ [0.3]
-÷ 0020 ÷ 0061 ÷ 0027 ÷	#  ÷ [0.2] SPACE (Other) ÷ [999.0] LATIN SMALL LETTER A (ALetter) ÷ [999.0] APOSTROPHE (MidLetter) ÷ [0.3]
-÷ 0020 ÷ 0061 ÷ 0027 × 2060 ÷	#  ÷ [0.2] SPACE (Other) ÷ [999.0] LATIN SMALL LETTER A (ALetter) ÷ [999.0] APOSTROPHE (MidLetter) × [4.0] WORD JOINER (GCControl_Format) ÷ [0.3]
-÷ 0020 ÷ 0061 ÷ 002C ÷	#  ÷ [0.2] SPACE (Other) ÷ [999.0] LATIN SMALL LETTER A (ALetter) ÷ [999.0] COMMA (MidNum) ÷ [0.3]
-÷ 0020 ÷ 0031 ÷ 003A ÷	#  ÷ [0.2] SPACE (Other) ÷ [999.0] DIGIT ONE (Numeric) ÷ [999.0] COLON (MidLetter) ÷ [0.3]
-÷ 0020 ÷ 0031 ÷ 0027 ÷	#  ÷ [0.2] SPACE (Other) ÷ [999.0] DIGIT ONE (Numeric) ÷ [999.0] APOSTROPHE (MidLetter) ÷ [0.3]
-÷ 0020 ÷ 0031 ÷ 002C ÷	#  ÷ [0.2] SPACE (Other) ÷ [999.0] DIGIT ONE (Numeric) ÷ [999.0] COMMA (MidNum) ÷ [0.3]
-÷ 0020 ÷ 0031 ÷ 002E × 2060 ÷	#  ÷ [0.2] SPACE (Other) ÷ [999.0] DIGIT ONE (Numeric) ÷ [999.0] FULL STOP (MidNum) × [4.0] WORD JOINER (GCControl_Format) ÷ [0.3]
-÷ 0001 ÷ 0020 ÷	#  ÷ [0.2] <START OF HEADING> (GCControl) ÷ [999.0] SPACE (Other) ÷ [0.3]
-÷ 0001 ÷ 0001 ÷	#  ÷ [0.2] <START OF HEADING> (GCControl) ÷ [999.0] <START OF HEADING> (GCControl) ÷ [0.3]
-÷ 0001 × 0300 ÷	#  ÷ [0.2] <START OF HEADING> (GCControl) × [4.0] COMBINING GRAVE ACCENT (GCExtend) ÷ [0.3]
-÷ 0001 ÷ 000A ÷	#  ÷ [0.2] <START OF HEADING> (GCControl) ÷ [999.0] <LINE FEED (LF)> (GCLF_Sep) ÷ [0.3]
-÷ 0001 ÷ 000D ÷	#  ÷ [0.2] <START OF HEADING> (GCControl) ÷ [999.0] <CARRIAGE RETURN (CR)> (GCCR_Sep) ÷ [0.3]
-÷ 0001 ÷ 0085 ÷	#  ÷ [0.2] <START OF HEADING> (GCControl) ÷ [999.0] <NEXT LINE (NEL)> (GCControl_Sep) ÷ [0.3]
-÷ 0001 × 00AD ÷	#  ÷ [0.2] <START OF HEADING> (GCControl) × [4.0] SOFT HYPHEN (GCControl_Format) ÷ [0.3]
-÷ 0001 ÷ 3031 ÷	#  ÷ [0.2] <START OF HEADING> (GCControl) ÷ [999.0] VERTICAL KANA REPEAT MARK (Katakana) ÷ [0.3]
-÷ 0001 ÷ 0041 ÷	#  ÷ [0.2] <START OF HEADING> (GCControl) ÷ [999.0] LATIN CAPITAL LETTER A (ALetter) ÷ [0.3]
-÷ 0001 ÷ 0027 ÷	#  ÷ [0.2] <START OF HEADING> (GCControl) ÷ [999.0] APOSTROPHE (MidLetter) ÷ [0.3]
-÷ 0001 ÷ 002C ÷	#  ÷ [0.2] <START OF HEADING> (GCControl) ÷ [999.0] COMMA (MidNum) ÷ [0.3]
-÷ 0001 ÷ 0030 ÷	#  ÷ [0.2] <START OF HEADING> (GCControl) ÷ [999.0] DIGIT ZERO (Numeric) ÷ [0.3]
-÷ 0001 ÷ 005F ÷	#  ÷ [0.2] <START OF HEADING> (GCControl) ÷ [999.0] LOW LINE (ExtendNumLet) ÷ [0.3]
-÷ 0001 ÷ 0061 × 2060 ÷	#  ÷ [0.2] <START OF HEADING> (GCControl) ÷ [999.0] LATIN SMALL LETTER A (ALetter) × [4.0] WORD JOINER (GCControl_Format) ÷ [0.3]
-÷ 0001 ÷ 0061 ÷ 003A ÷	#  ÷ [0.2] <START OF HEADING> (GCControl) ÷ [999.0] LATIN SMALL LETTER A (ALetter) ÷ [999.0] COLON (MidLetter) ÷ [0.3]
-÷ 0001 ÷ 0061 ÷ 0027 ÷	#  ÷ [0.2] <START OF HEADING> (GCControl) ÷ [999.0] LATIN SMALL LETTER A (ALetter) ÷ [999.0] APOSTROPHE (MidLetter) ÷ [0.3]
-÷ 0001 ÷ 0061 ÷ 0027 × 2060 ÷	#  ÷ [0.2] <START OF HEADING> (GCControl) ÷ [999.0] LATIN SMALL LETTER A (ALetter) ÷ [999.0] APOSTROPHE (MidLetter) × [4.0] WORD JOINER (GCControl_Format) ÷ [0.3]
-÷ 0001 ÷ 0061 ÷ 002C ÷	#  ÷ [0.2] <START OF HEADING> (GCControl) ÷ [999.0] LATIN SMALL LETTER A (ALetter) ÷ [999.0] COMMA (MidNum) ÷ [0.3]
-÷ 0001 ÷ 0031 ÷ 003A ÷	#  ÷ [0.2] <START OF HEADING> (GCControl) ÷ [999.0] DIGIT ONE (Numeric) ÷ [999.0] COLON (MidLetter) ÷ [0.3]
-÷ 0001 ÷ 0031 ÷ 0027 ÷	#  ÷ [0.2] <START OF HEADING> (GCControl) ÷ [999.0] DIGIT ONE (Numeric) ÷ [999.0] APOSTROPHE (MidLetter) ÷ [0.3]
-÷ 0001 ÷ 0031 ÷ 002C ÷	#  ÷ [0.2] <START OF HEADING> (GCControl) ÷ [999.0] DIGIT ONE (Numeric) ÷ [999.0] COMMA (MidNum) ÷ [0.3]
-÷ 0001 ÷ 0031 ÷ 002E × 2060 ÷	#  ÷ [0.2] <START OF HEADING> (GCControl) ÷ [999.0] DIGIT ONE (Numeric) ÷ [999.0] FULL STOP (MidNum) × [4.0] WORD JOINER (GCControl_Format) ÷ [0.3]
-÷ 0300 ÷ 0020 ÷	#  ÷ [0.2] COMBINING GRAVE ACCENT (GCExtend) ÷ [999.0] SPACE (Other) ÷ [0.3]
-÷ 0300 ÷ 0001 ÷	#  ÷ [0.2] COMBINING GRAVE ACCENT (GCExtend) ÷ [999.0] <START OF HEADING> (GCControl) ÷ [0.3]
-÷ 0300 × 0300 ÷