/com.adobe.epubcheck/src/com/adobe/epubcheck/dtd/oeb12.dtdinc

http://epubcheck.googlecode.com/ · Unknown · 1136 lines · 789 code · 347 blank · 0 comment · 0 complexity · 5cc0255a1df1a948d74bf0a718b9fce0 MD5 · raw file

  1. <!--
  2. Title:
  3. Mnemonic Character Entities For the Open eBook Publication
  4. Structure Version 1.2
  5. Version:
  6. 1.2
  7. Revision:
  8. 20020930-x (supercedes 20020424-x)
  9. Revision History Note:
  10. This revision, 20020930-x, which supercedes the prior revision
  11. 20020424-x, updates: 1) an email address within this comment
  12. prologue, and 2) the Unicode version number referenced in various
  13. comments throughout this document. No changes whatsoever were
  14. made to the parsed content of this DTD fragment.
  15. Previous Version:
  16. 1.0.1 (Revision of 22-November-2000, "Character Entities for
  17. the Open eBook Publication Structure Version 1.0.1")
  18. Authors:
  19. Version 1.0; 1.0.1
  20. Gunter Hille <hille@abc.de>
  21. Ben Trafford <ben@legendary.org>
  22. Garret Wilson <garret@globalmentor.com>
  23. This Version 1.2 updated and edited by:
  24. Jon Noring <jon@noring.name>
  25. Usage:
  26. <!ENTITY % OEBEntities
  27. PUBLIC "+//ISBN 0-9673008-1-9//DTD OEB 1.2 Entities//EN"
  28. "http://openebook.org/dtds/oeb-1.2/oeb12.ent">
  29. %OEBEntities;
  30. Summary:
  31. This DTD fragment exactly duplicates, with some reorganization,
  32. correction, and reformatting of the descriptive text, the 253
  33. character entity declarations in the XHTML 1.1 DTD. Refer to:
  34. http://www.w3.org/TR/xhtml1/DTD/xhtml-lat1.ent
  35. http://www.w3.org/TR/xhtml1/DTD/xhtml-symbol.ent
  36. http://www.w3.org/TR/xhtml1/DTD/xhtml-special.ent
  37. Relation to OEBPS Version 1.0.1:
  38. The 253 character entities declared herein include all 249 from
  39. Version 1.0.1 plus four of the five pre-defined XML 1.0 character
  40. entities of &amp;, &lt;, &gt;, &quot; (the fifth pre-defined XML
  41. character entity, &apos;, is one of the 249 character entities
  42. already declared in Version 1.0.1.)
  43. The five pre-defined XML 1.0 character entities are included for
  44. completeness and interoperability as recommended by W3C, and to
  45. follow XHTML practice. (Further information on the purpose and
  46. usage of these five pre-defined XML character entities, and the
  47. normative reference, is given in the Usage Note below.)
  48. Relation to Unicode 3.2.0 and ISO/IEC 10646:
  49. The mnemonic character entities declared herein substitute for
  50. numeric character references, the numeric values for the
  51. associated characters specified by Unicode (in turn, the Unicode
  52. Character Data Set conforms with the ISO/IEC 10646 character set
  53. which XML 1.0 specifies.) The current version of Unicode is
  54. 3.2.0. General information on Unicode, including information on
  55. the latest version, is found at
  56. http://www.unicode.org/
  57. In addition, Unicode has categorized the massive number of
  58. characters in its Character Database using two different systems:
  59. Character Blocks and Script Names. These two systems are used
  60. herein for general categorization of the 253 character entities.
  61. The text files listing the code points for these two systems are:
  62. http://www.unicode.org/Public/UNIDATA/Blocks.txt
  63. http://www.unicode.org/Public/UNIDATA/Scripts.txt
  64. Tutorial Note to Document Authors: Character Entity Usage
  65. To insert the desired special character into the content of an
  66. OEBPS Document or Package file (which are XML documents), prefix
  67. the associated mnemonic character entity with the '&' character
  68. and terminate with the ';' character.
  69. Example: to insert the "em dash" character (which has the
  70. mnemonic 'mdash'), use &mdash; .
  71. If preferred, the character can instead be inserted using the
  72. direct (Unicode) numerical character reference, the codes of
  73. which are given herein (see the above note on Unicode.) So, for
  74. the "em dash" character one can use, instead of &mdash;, either
  75. the decimal &#8212; or the hexadecimal equivalent &#x2014; .
  76. Importantly note that within the content (PCDATA) of all OEBPS
  77. documents and package files, the special XML characters '&' and
  78. '<', when intended to be used literally, MUST be represented with
  79. the mnemonic character entities of &amp; and &lt; (or the numerical
  80. character entity equivalents), respectively. In addition, it is
  81. considered good practice to use the &gt; (or numerical equivalent)
  82. for the '>' symbol, although it is not necessary except in very
  83. unusual and rare circumstances. The two other special XML character
  84. entities, apostrophe (&apos;) and quote (&quot;), are only
  85. necessary within element attribute values to literally represent
  86. these characters, and for similar non-content purposes.
  87. (The normative reference on the five XML pre-defined mnemonic
  88. character entities is given in Sections 2.4 and 4.6 of the XML
  89. 1.0 Specification, Second Edition:
  90. http://www.w3.org/TR/2000/REC-xml-20001006
  91. )
  92. +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
  93. Portions (C) International Organization for
  94. Standardization 1986. Permission to copy in any
  95. form is granted for use with conforming SGML
  96. systems and applications as defined in ISO 8879,
  97. provided this notice is included in all copies.
  98. +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
  99. -->
  100. <!--
  101. +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
  102. XML 1.0 Pre-Defined Character Entities
  103. +-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+
  104. Drawn From Unicode 3.2.0 Character Sets:
  105. Block Name(s): Basic Latin (U+0000 to U+007F)
  106. Script Name(s): (none)
  107. -->
  108. <!ENTITY quot "&#34;" ><!-- quotation mark
  109. APL quote
  110. ==================== U+0022 ISOnum -->
  111. <!ENTITY amp "&#38;#38;" ><!-- ampersand
  112. ==================== U+0026 ISOnum -->
  113. <!ENTITY apos "&#39;" ><!-- apostrophe mark
  114. ==================== U+0027 ISOnum -->
  115. <!ENTITY lt "&#38;#60;" ><!-- less-than sign
  116. ==================== U+003C ISOnum -->
  117. <!ENTITY gt "&#62;" ><!-- greater-than sign
  118. ==================== U+003E ISOnum -->
  119. <!--
  120. +-+-+-+-+-+-+-+-+-+-+-+-+
  121. Extended Latin Script
  122. +-+-+-+-+-+-+-+-+-+-+-+-+
  123. Drawn From Unicode 3.2.0 Character Sets:
  124. Block Name(s): Latin-1 Supplement (U+0080 to U+00FF)
  125. Latin Extended-A (U+0100 to U+017F)
  126. Latin Extended-B (U+0180 to U+024F)
  127. Script Name(s): Latin
  128. -->
  129. <!ENTITY ordf "&#170;" ><!-- feminine ordinal indicator
  130. ==================== U+00AA ISOnum -->
  131. <!ENTITY ordm "&#186;" ><!-- masculine ordinal indicator
  132. ==================== U+00BA ISOnum -->
  133. <!ENTITY Agrave "&#192;" ><!-- Latin capital letter A with grave
  134. Latin capital letter A grave
  135. =================== U+00C0 ISOlat1 -->
  136. <!ENTITY Aacute "&#193;" ><!-- Latin capital letter A with acute
  137. =================== U+00C1 ISOlat1 -->
  138. <!ENTITY Acirc "&#194;" ><!-- Latin capital letter A with circumflex
  139. =================== U+00C2 ISOlat1 -->
  140. <!ENTITY Atilde "&#195;" ><!-- Latin capital letter A with tilde
  141. =================== U+00C3 ISOlat1 -->
  142. <!ENTITY Auml "&#196;" ><!-- Latin capital letter A with diaeresis
  143. =================== U+00C4 ISOlat1 -->
  144. <!ENTITY Aring "&#197;" ><!-- Latin capital letter A with ring above
  145. Latin capital letter A ring
  146. =================== U+00C5 ISOlat1 -->
  147. <!ENTITY AElig "&#198;" ><!-- Latin capital letter AE
  148. Latin capital ligature AE
  149. =================== U+00C6 ISOlat1 -->
  150. <!ENTITY Ccedil "&#199;" ><!-- Latin capital letter C with cedilla
  151. =================== U+00C7 ISOlat1 -->
  152. <!ENTITY Egrave "&#200;" ><!-- Latin capital letter E with grave
  153. =================== U+00C8 ISOlat1 -->
  154. <!ENTITY Eacute "&#201;" ><!-- Latin capital letter E with acute
  155. =================== U+00C9 ISOlat1 -->
  156. <!ENTITY Ecirc "&#202;" ><!-- Latin capital letter E with circumflex
  157. =================== U+00CA ISOlat1 -->
  158. <!ENTITY Euml "&#203;" ><!-- Latin capital letter E with diaeresis
  159. =================== U+00CB ISOlat1 -->
  160. <!ENTITY Igrave "&#204;" ><!-- Latin capital letter I with grave
  161. =================== U+00CC ISOlat1 -->
  162. <!ENTITY Iacute "&#205;" ><!-- Latin capital letter I with acute
  163. =================== U+00CD ISOlat1 -->
  164. <!ENTITY Icirc "&#206;" ><!-- Latin capital letter I with circumflex
  165. =================== U+00CE ISOlat1 -->
  166. <!ENTITY Iuml "&#207;" ><!-- Latin capital letter I with diaeresis
  167. =================== U+00CF ISOlat1 -->
  168. <!ENTITY ETH "&#208;" ><!-- Latin capital letter ETH
  169. =================== U+00D0 ISOlat1 -->
  170. <!ENTITY Ntilde "&#209;" ><!-- Latin capital letter N with tilde
  171. =================== U+00D1 ISOlat1 -->
  172. <!ENTITY Ograve "&#210;" ><!-- Latin capital letter O with grave
  173. =================== U+00D2 ISOlat1 -->
  174. <!ENTITY Oacute "&#211;" ><!-- Latin capital letter O with acute
  175. =================== U+00D3 ISOlat1 -->
  176. <!ENTITY Ocirc "&#212;" ><!-- Latin capital letter O with circumflex
  177. =================== U+00D4 ISOlat1 -->
  178. <!ENTITY Otilde "&#213;" ><!-- Latin capital letter O with tilde
  179. =================== U+00D5 ISOlat1 -->
  180. <!ENTITY Ouml "&#214;" ><!-- Latin capital letter O with diaeresis
  181. =================== U+00D6 ISOlat1 -->
  182. <!ENTITY Oslash "&#216;" ><!-- Latin capital letter O with stroke
  183. Latin capital letter O slash
  184. =================== U+00D8 ISOlat1 -->
  185. <!ENTITY Ugrave "&#217;" ><!-- Latin capital letter U with grave
  186. =================== U+00D9 ISOlat1 -->
  187. <!ENTITY Uacute "&#218;" ><!-- Latin capital letter U with acute
  188. =================== U+00DA ISOlat1 -->
  189. <!ENTITY Ucirc "&#219;" ><!-- Latin capital letter U with circumflex
  190. =================== U+00DB ISOlat1 -->
  191. <!ENTITY Uuml "&#220;" ><!-- Latin capital letter U with diaeresis
  192. =================== U+00DC ISOlat1 -->
  193. <!ENTITY Yacute "&#221;" ><!-- Latin capital letter Y with acute
  194. =================== U+00DD ISOlat1 -->
  195. <!ENTITY THORN "&#222;" ><!-- Latin capital letter THORN
  196. =================== U+00DE ISOlat1 -->
  197. <!ENTITY szlig "&#223;" ><!-- Latin small letter sharp s
  198. ess-zed
  199. =================== U+00DF ISOlat1 -->
  200. <!ENTITY agrave "&#224;" ><!-- Latin small letter a with grave
  201. Latin small letter a grave
  202. =================== U+00E0 ISOlat1 -->
  203. <!ENTITY aacute "&#225;" ><!-- Latin small letter a with acute
  204. =================== U+00E1 ISOlat1 -->
  205. <!ENTITY acirc "&#226;" ><!-- Latin small letter a with circumflex
  206. =================== U+00E2 ISOlat1 -->
  207. <!ENTITY atilde "&#227;" ><!-- Latin small letter a with tilde
  208. =================== U+00E3 ISOlat1 -->
  209. <!ENTITY auml "&#228;" ><!-- Latin small letter a with diaeresis
  210. =================== U+00E4 ISOlat1 -->
  211. <!ENTITY aring "&#229;" ><!-- Latin small letter a with ring above
  212. Latin small letter a ring
  213. =================== U+00E5 ISOlat1 -->
  214. <!ENTITY aelig "&#230;" ><!-- Latin small letter ae
  215. Latin small ligature ae
  216. =================== U+00E6 ISOlat1 -->
  217. <!ENTITY ccedil "&#231;" ><!-- Latin small letter c with cedilla
  218. =================== U+00E7 ISOlat1 -->
  219. <!ENTITY egrave "&#232;" ><!-- Latin small letter e with grave
  220. =================== U+00E8 ISOlat1 -->
  221. <!ENTITY eacute "&#233;" ><!-- Latin small letter e with acute
  222. =================== U+00E9 ISOlat1 -->
  223. <!ENTITY ecirc "&#234;" ><!-- Latin small letter e with circumflex
  224. =================== U+00EA ISOlat1 -->
  225. <!ENTITY euml "&#235;" ><!-- Latin small letter e with diaeresis
  226. =================== U+00EB ISOlat1 -->
  227. <!ENTITY igrave "&#236;" ><!-- Latin small letter i with grave
  228. =================== U+00EC ISOlat1 -->
  229. <!ENTITY iacute "&#237;" ><!-- Latin small letter i with acute
  230. =================== U+00ED ISOlat1 -->
  231. <!ENTITY icirc "&#238;" ><!-- Latin small letter i with circumflex
  232. =================== U+00EE ISOlat1 -->
  233. <!ENTITY iuml "&#239;" ><!-- Latin small letter i with diaeresis
  234. =================== U+00EF ISOlat1 -->
  235. <!ENTITY eth "&#240;" ><!-- Latin small letter eth
  236. =================== U+00F0 ISOlat1 -->
  237. <!ENTITY ntilde "&#241;" ><!-- Latin small letter n with tilde
  238. =================== U+00F1 ISOlat1 -->
  239. <!ENTITY ograve "&#242;" ><!-- Latin small letter o with grave
  240. =================== U+00F2 ISOlat1 -->
  241. <!ENTITY oacute "&#243;" ><!-- Latin small letter o with acute
  242. =================== U+00F3 ISOlat1 -->
  243. <!ENTITY ocirc "&#244;" ><!-- Latin small letter o with circumflex
  244. =================== U+00F4 ISOlat1 -->
  245. <!ENTITY otilde "&#245;" ><!-- Latin small letter o with tilde
  246. =================== U+00F5 ISOlat1 -->
  247. <!ENTITY ouml "&#246;" ><!-- Latin small letter o with diaeresis
  248. =================== U+00F6 ISOlat1 -->
  249. <!ENTITY oslash "&#248;" ><!-- Latin small letter o with stroke
  250. Latin small letter o slash
  251. =================== U+00F8 ISOlat1 -->
  252. <!ENTITY ugrave "&#249;" ><!-- Latin small letter u with grave
  253. =================== U+00F9 ISOlat1 -->
  254. <!ENTITY uacute "&#250;" ><!-- Latin small letter u with acute
  255. =================== U+00FA ISOlat1 -->
  256. <!ENTITY ucirc "&#251;" ><!-- Latin small letter u with circumflex
  257. =================== U+00FB ISOlat1 -->
  258. <!ENTITY uuml "&#252;" ><!-- Latin small letter u with diaeresis
  259. =================== U+00FC ISOlat1 -->
  260. <!ENTITY yacute "&#253;" ><!-- Latin small letter y with acute
  261. =================== U+00FD ISOlat1 -->
  262. <!ENTITY thorn "&#254;" ><!-- Latin small letter thorn with
  263. =================== U+00FE ISOlat1 -->
  264. <!ENTITY yuml "&#255;" ><!-- Latin small letter y with diaeresis
  265. =================== U+00FF ISOlat1 -->
  266. <!ENTITY OElig "&#338;" ><!-- Latin capital ligature OE
  267. =================== U+0152 ISOlat2 -->
  268. <!ENTITY oelig "&#339;" ><!-- Latin small ligature oe
  269. =================== U+0153 ISOlat2 -->
  270. <!ENTITY Scaron "&#352;" ><!-- Latin capital letter S with caron
  271. =================== U+0160 ISOlat2 -->
  272. <!ENTITY scaron "&#353;" ><!-- Latin small letter s with caron
  273. =================== U+0161 ISOlat2 -->
  274. <!ENTITY Yuml "&#376;" ><!-- Latin capital letter Y with diaeresis
  275. =================== U+0178 ISOlat2 -->
  276. <!ENTITY fnof "&#402;" ><!-- Latin small f with hook
  277. function
  278. florin
  279. =================== U+0192 ISOtech -->
  280. <!--
  281. +-+-+-+-+-+-+-+
  282. Greek Script
  283. +-+-+-+-+-+-+-+
  284. Drawn From Unicode 3.2.0 Character Sets:
  285. Block Name(s): Greek (U+0370 to U+03FF)
  286. Script Name(s): Greek
  287. -->
  288. <!ENTITY Alpha "&#913;" ><!-- Greek capital letter alpha
  289. =========================== U+0391 -->
  290. <!ENTITY Beta "&#914;" ><!-- Greek capital letter beta
  291. =========================== U+0392 -->
  292. <!ENTITY Gamma "&#915;" ><!-- Greek capital letter gamma
  293. =================== U+0393 ISOgrk3 -->
  294. <!ENTITY Delta "&#916;" ><!-- Greek capital letter delta
  295. =================== U+0394 ISOgrk3 -->
  296. <!ENTITY Epsilon "&#917;" ><!-- Greek capital letter epsilon
  297. =========================== U+0395 -->
  298. <!ENTITY Zeta "&#918;" ><!-- Greek capital letter zeta
  299. =========================== U+0396 -->
  300. <!ENTITY Eta "&#919;" ><!-- Greek capital letter eta
  301. =========================== U+0397 -->
  302. <!ENTITY Theta "&#920;" ><!-- Greek capital letter theta
  303. =================== U+0398 ISOgrk3 -->
  304. <!ENTITY Iota "&#921;" ><!-- Greek capital letter iota
  305. =========================== U+0399 -->
  306. <!ENTITY Kappa "&#922;" ><!-- Greek capital letter kappa
  307. =========================== U+039A -->
  308. <!ENTITY Lambda "&#923;" ><!-- Greek capital letter lambda
  309. =================== U+039B ISOgrk3 -->
  310. <!ENTITY Mu "&#924;" ><!-- Greek capital letter mu
  311. =========================== U+039C -->
  312. <!ENTITY Nu "&#925;" ><!-- Greek capital letter nu
  313. =========================== U+039D -->
  314. <!ENTITY Xi "&#926;" ><!-- Greek capital letter xi
  315. =================== U+039E ISOgrk3 -->
  316. <!ENTITY Omicron "&#927;" ><!-- Greek capital letter omicron
  317. =========================== U+039F -->
  318. <!ENTITY Pi "&#928;" ><!-- Greek capital letter pi
  319. =================== U+03A0 ISOgrk3 -->
  320. <!ENTITY Rho "&#929;" ><!-- Greek capital letter rho
  321. =========================== U+03A1 -->
  322. <!ENTITY Sigma "&#931;" ><!-- Greek capital letter sigma
  323. =================== U+03A3 ISOgrk3 -->
  324. <!ENTITY Tau "&#932;" ><!-- Greek capital letter tau
  325. =========================== U+03A4 -->
  326. <!ENTITY Upsilon "&#933;" ><!-- Greek capital letter upsilon
  327. =================== U+03A5 ISOgrk3 -->
  328. <!ENTITY Phi "&#934;" ><!-- Greek capital letter phi
  329. =================== U+03A6 ISOgrk3 -->
  330. <!ENTITY Chi "&#935;" ><!-- Greek capital letter chi
  331. =========================== U+03A7 -->
  332. <!ENTITY Psi "&#936;" ><!-- Greek capital letter psi
  333. =================== U+03A8 ISOgrk3 -->
  334. <!ENTITY Omega "&#937;" ><!-- Greek capital letter omega
  335. =================== U+03A9 ISOgrk3 -->
  336. <!ENTITY alpha "&#945;" ><!-- Greek small letter alpha
  337. =================== U+03B1 ISOgrk3 -->
  338. <!ENTITY beta "&#946;" ><!-- Greek small letter beta
  339. =================== U+03B2 ISOgrk3 -->
  340. <!ENTITY gamma "&#947;" ><!-- Greek small letter gamma
  341. =================== U+03B3 ISOgrk3 -->
  342. <!ENTITY delta "&#948;" ><!-- Greek small letter delta
  343. =================== U+03B4 ISOgrk3 -->
  344. <!ENTITY epsilon "&#949;" ><!-- Greek small letter epsilon
  345. =================== U+03B5 ISOgrk3 -->
  346. <!ENTITY zeta "&#950;" ><!-- Greek small letter zeta
  347. =================== U+03B6 ISOgrk3 -->
  348. <!ENTITY eta "&#951;" ><!-- Greek small letter eta
  349. =================== U+03B7 ISOgrk3 -->
  350. <!ENTITY theta "&#952;" ><!-- Greek small letter theta
  351. =================== U+03B8 ISOgrk3 -->
  352. <!ENTITY iota "&#953;" ><!-- Greek small letter iota
  353. =================== U+03B9 ISOgrk3 -->
  354. <!ENTITY kappa "&#954;" ><!-- Greek small letter kappa
  355. =================== U+03BA ISOgrk3 -->
  356. <!ENTITY lambda "&#955;" ><!-- Greek small letter lambda
  357. =================== U+03BB ISOgrk3 -->
  358. <!ENTITY mu "&#956;" ><!-- Greek small letter mu
  359. =================== U+03BC ISOgrk3 -->
  360. <!ENTITY nu "&#957;" ><!-- Greek small letter nu
  361. =================== U+03BD ISOgrk3 -->
  362. <!ENTITY xi "&#958;" ><!-- Greek small letter xi
  363. =================== U+03BE ISOgrk3 -->
  364. <!ENTITY omicron "&#959;" ><!-- Greek small letter omicron
  365. ======================= U+03BF NEW -->
  366. <!ENTITY pi "&#960;" ><!-- Greek small letter pi
  367. =================== U+03C0 ISOgrk3 -->
  368. <!ENTITY rho "&#961;" ><!-- Greek small letter rho
  369. =================== U+03C1 ISOgrk3 -->
  370. <!ENTITY sigmaf "&#962;" ><!-- Greek small letter final sigma
  371. =================== U+03C2 ISOgrk3 -->
  372. <!ENTITY sigma "&#963;" ><!-- Greek small letter sigma
  373. =================== U+03C3 ISOgrk3 -->
  374. <!ENTITY tau "&#964;" ><!-- Greek small letter tau
  375. =================== U+03C4 ISOgrk3 -->
  376. <!ENTITY upsilon "&#965;" ><!-- Greek small letter upsilon
  377. =================== U+03C5 ISOgrk3 -->
  378. <!ENTITY phi "&#966;" ><!-- Greek small letter phi
  379. =================== U+03C6 ISOgrk3 -->
  380. <!ENTITY chi "&#967;" ><!-- Greek small letter chi
  381. =================== U+03C7 ISOgrk3 -->
  382. <!ENTITY psi "&#968;" ><!-- Greek small letter psi
  383. =================== U+03C8 ISOgrk3 -->
  384. <!ENTITY omega "&#969;" ><!-- Greek small letter omega
  385. =================== U+03C9 ISOgrk3 -->
  386. <!ENTITY thetasym "&#977;" ><!-- Greek small letter theta symbol
  387. ======================= U+03D1 NEW -->
  388. <!ENTITY upsih "&#978;" ><!-- Greek upsilon with hook symbol
  389. ======================= U+03D2 NEW -->
  390. <!ENTITY piv "&#982;" ><!-- Greek pi symbol
  391. =================== U+03D6 ISOgrk3 -->
  392. <!--
  393. +-+-+-+-+-+-+-+-+-+-+-+
  394. General Punctuation
  395. +-+-+-+-+-+-+-+-+-+-+-+
  396. Drawn From Unicode 3.2.0 Character Sets:
  397. Block Name(s): General Punctuation (U+2000 to U+206F)
  398. Script Name(s): (none)
  399. -->
  400. <!ENTITY ensp "&#8194;" ><!-- en space
  401. ==================== U+2002 ISOpub -->
  402. <!ENTITY emsp "&#8195;" ><!-- em space
  403. ==================== U+2003 ISOpub -->
  404. <!ENTITY thinsp "&#8201;" ><!-- thin space
  405. ==================== U+2009 ISOpub -->
  406. <!ENTITY zwnj "&#8204;" ><!-- zero width non-joiner
  407. ============== U+200C NEW RFC 2070 -->
  408. <!ENTITY zwj "&#8205;" ><!-- zero width joiner
  409. ============== U+200D NEW RFC 2070 -->
  410. <!ENTITY lrm "&#8206;" ><!-- left-to-right mark
  411. ============== U+200E NEW RFC 2070 -->
  412. <!ENTITY rlm "&#8207;" ><!-- right-to-left mark
  413. ============== U+200F NEW RFC 2070 -->
  414. <!ENTITY ndash "&#8211;" ><!-- en dash
  415. ==================== U+2013 ISOpub -->
  416. <!ENTITY mdash "&#8212;" ><!-- em dash
  417. ==================== U+2014 ISOpub -->
  418. <!ENTITY lsquo "&#8216;" ><!-- left single quotation mark
  419. ==================== U+2018 ISOnum -->
  420. <!ENTITY rsquo "&#8217;" ><!-- right single quotation mark
  421. ==================== U+2019 ISOnum -->
  422. <!ENTITY sbquo "&#8218;" ><!-- single low-9 quotation mark
  423. ======================= U+201A NEW -->
  424. <!ENTITY ldquo "&#8220;" ><!-- left double quotation mark
  425. ==================== U+201C ISOnum -->
  426. <!ENTITY rdquo "&#8221;" ><!-- right double quotation mark
  427. ==================== U+201D ISOnum -->
  428. <!ENTITY bdquo "&#8222;" ><!-- double low-9 quotation mark
  429. ======================= U+201E NEW -->
  430. <!ENTITY dagger "&#8224;" ><!-- dagger
  431. ==================== U+2020 ISOpub -->
  432. <!ENTITY Dagger "&#8225;" ><!-- double dagger
  433. ==================== U+2021 ISOpub -->
  434. <!ENTITY bull "&#8226;" ><!-- bullet
  435. black small circle
  436. ==================== U+2022 ISOpub -->
  437. <!-- bullet is NOT the same as U+2219,
  438. 'bullet operator' -->
  439. <!ENTITY hellip "&#8230;" ><!-- horizontal ellipsis
  440. three dot leader
  441. ==================== U+2026 ISOpub -->
  442. <!ENTITY permil "&#8240;" ><!-- per mille sign
  443. =================== U+2030 ISOtech -->
  444. <!ENTITY prime "&#8242;" ><!-- prime
  445. minutes
  446. feet
  447. =================== U+2032 ISOtech -->
  448. <!ENTITY Prime "&#8243;" ><!-- double prime
  449. seconds
  450. inches
  451. =================== U+2033 ISOtech -->
  452. <!ENTITY lsaquo "&#8249;" ><!-- single left-pointing angle quotation
  453. mark
  454. ============== U+2039 ISO proposed -->
  455. <!ENTITY rsaquo "&#8250;" ><!-- single right-pointing angle quotation
  456. ============== U+203A ISO proposed -->
  457. <!ENTITY oline "&#8254;" ><!-- overline
  458. spacing overscore
  459. ======================= U+203E NEW -->
  460. <!ENTITY frasl "&#8260;" ><!-- fraction slash
  461. ======================= U+2044 NEW -->
  462. <!--
  463. +-+-+-+-+-+-+-+-+-+-+
  464. Spacing Modifiers
  465. +-+-+-+-+-+-+-+-+-+-+
  466. Drawn From Unicode 3.2.0 Character Sets:
  467. Block Name(s): Spacing Modifier Letters (U+0280 to U+02FF)
  468. Script Name(s): (none)
  469. Note: The Spacing Modifier Letters are an unusual class of
  470. characters. They are an assorted collection of small signs
  471. used to indicate modifications of the preceding or
  472. following character, and sometimes to be an independent
  473. character. They differ from diacritical marks in that they
  474. are treated as free-standing, independent characters, which
  475. form part of the word and do not break up the word. They
  476. have the "letter" property. Most of the characters are
  477. phonetic modifiers. For further information, refer to
  478. Section 7.8 of the Unicode 3.2 manual, an online version is
  479. at http://www.unicode.org/unicode/uni2book/ch07.pdf .
  480. -->
  481. <!ENTITY circ "&#710;" ><!-- modifier letter circumflex accent
  482. ==================== U+02C6 ISOpub -->
  483. <!ENTITY tilde "&#732;" ><!-- small tilde
  484. ==================== U+02DC ISOdia -->
  485. <!--
  486. +-+-+-+-+-+-+-+-+-+
  487. Various Symbols
  488. +-+-+-+-+-+-+-+-+-+
  489. Drawn From Unicode 3.2.0 Character Sets:
  490. Block Name(s): Latin-1 Supplement (U+0080 to U+00FF)
  491. Currency Symbols (U+20A0 to U+20CF)
  492. Letterlike Symbols (U+2100 to U+214F)
  493. Arrows (U+2190 to U+21FF)
  494. Mathematical Operators (U+2200 to U+22FF)
  495. Miscellaneous Technical (U+2300 to U+23FF)
  496. Geometric Shapes (U+25A0 to U+25FF)
  497. Miscellaneous Symbols (U+2600 to U+26FF)
  498. Script Name(s): (none, except Greek for "micro", U+00B5)
  499. -->
  500. <!ENTITY nbsp "&#160;" ><!-- no-break space
  501. non-breaking space
  502. ==================== U+00A0 ISOnum -->
  503. <!ENTITY iexcl "&#161;" ><!-- inverted exclamation mark
  504. ==================== U+00A1 ISOnum -->
  505. <!ENTITY cent "&#162;" ><!-- cent sign
  506. ==================== U+00A2 ISOnum -->
  507. <!ENTITY pound "&#163;" ><!-- pound sign
  508. ==================== U+00A3 ISOnum -->
  509. <!ENTITY curren "&#164;" ><!-- currency sign
  510. ==================== U+00A4 ISOnum -->
  511. <!ENTITY yen "&#165;" ><!-- yen sign
  512. yuan sign
  513. ==================== U+00A5 ISOnum -->
  514. <!ENTITY brvbar "&#166;" ><!-- broken bar
  515. broken vertical bar
  516. ==================== U+00A6 ISOnum -->
  517. <!ENTITY sect "&#167;" ><!-- section sign
  518. ==================== U+00A7 ISOnum -->
  519. <!ENTITY uml "&#168;" ><!-- diaeresis
  520. spacing diaeresis
  521. ==================== U+00A8 ISOdia -->
  522. <!ENTITY copy "&#169;" ><!-- copyright sign
  523. ==================== U+00A9 ISOnum -->
  524. <!ENTITY laquo "&#171;" ><!-- left-pointing double angle quotation
  525. mark
  526. left pointing guillemet
  527. ==================== U+00AB ISOnum -->
  528. <!ENTITY not "&#172;" ><!-- not sign
  529. ==================== U+00AC ISOnum -->
  530. <!ENTITY shy "&#173;" ><!-- soft hyphen
  531. discretionary hyphen
  532. ==================== U+00AD ISOnum -->
  533. <!ENTITY reg "&#174;" ><!-- registered sign
  534. registered trade mark sign
  535. ==================== U+00AE ISOnum -->
  536. <!ENTITY macr "&#175;" ><!-- macron
  537. spacing macron
  538. overline
  539. APL overbar
  540. ==================== U+00AF ISOdia -->
  541. <!ENTITY deg "&#176;" ><!-- degree sign
  542. ==================== U+00B0 ISOnum -->
  543. <!ENTITY plusmn "&#177;" ><!-- plus-minus sign
  544. plus-or-minus sign
  545. ==================== U+00B1 ISOnum -->
  546. <!ENTITY sup2 "&#178;" ><!-- superscript two
  547. superscript digit two
  548. squared
  549. ==================== U+00B2 ISOnum -->
  550. <!ENTITY sup3 "&#179;" ><!-- superscript three
  551. superscript digit three
  552. cubed
  553. ==================== U+00B3 ISOnum -->
  554. <!ENTITY acute "&#180;" ><!-- acute accent
  555. spacing acute
  556. ==================== U+00B4 ISOdia -->
  557. <!ENTITY micro "&#181;" ><!-- micro sign
  558. ==================== U+00B5 ISOnum -->
  559. <!ENTITY para "&#182;" ><!-- pilcrow sign
  560. paragraph sign
  561. ==================== U+00B6 ISOnum -->
  562. <!ENTITY middot "&#183;" ><!-- middle dot
  563. Georgian comma
  564. Greek middle dot
  565. ==================== U+00B7 ISOnum -->
  566. <!ENTITY cedil "&#184;" ><!-- cedilla
  567. spacing cedilla
  568. ==================== U+00B8 ISOdia -->
  569. <!ENTITY sup1 "&#185;" ><!-- superscript one
  570. superscript digit one
  571. ==================== U+00B9 ISOnum -->
  572. <!ENTITY raquo "&#187;" ><!-- right-pointing double angle quotation
  573. mark
  574. right pointing guillemet
  575. ==================== U+00BB ISOnum -->
  576. <!ENTITY frac14 "&#188;" ><!-- vulgar fraction one quarter
  577. fraction one quarter
  578. ==================== U+00BC ISOnum -->
  579. <!ENTITY frac12 "&#189;" ><!-- vulgar fraction one half
  580. fraction one half
  581. ==================== U+00BD ISOnum -->
  582. <!ENTITY frac34 "&#190;" ><!-- vulgar fraction three quarters
  583. fraction three quarters
  584. ==================== U+00BE ISOnum -->
  585. <!ENTITY iquest "&#191;" ><!-- inverted question mark
  586. turned question mark
  587. ==================== U+00BF ISOnum -->
  588. <!ENTITY times "&#215;" ><!-- multiplication sign
  589. ==================== U+00D7 ISOnum -->
  590. <!ENTITY divide "&#247;" ><!-- division sign
  591. ==================== U+00F7 ISOnum -->
  592. <!ENTITY euro "&#8364;" ><!-- euro sign
  593. ======================= U+20AC NEW -->
  594. <!ENTITY image "&#8465;" ><!-- blackletter capital I
  595. imaginary part
  596. =================== U+2111 ISOamso -->
  597. <!ENTITY weierp "&#8472;" ><!-- script capital P
  598. power set
  599. Weierstrass p
  600. =================== U+2118 ISOamso -->
  601. <!ENTITY real "&#8476;" ><!-- blackletter capital R
  602. real part symbol
  603. =================== U+211C ISOamso -->
  604. <!ENTITY trade "&#8482;" ><!-- trade mark sign
  605. ==================== U+2122 ISOnum -->
  606. <!ENTITY alefsym "&#8501;" ><!-- alef symbol
  607. first transfinite cardinal
  608. ======================= U+2135 NEW -->
  609. <!-- alef symbol is NOT the same as
  610. U+05D0, 'Hebrew letter alef',
  611. although the same glyph could be
  612. used to represent both -->
  613. <!ENTITY larr "&#8592;" ><!-- leftwards arrow
  614. ==================== U+2190 ISOnum -->
  615. <!ENTITY uarr "&#8593;" ><!-- upwards arrow
  616. ==================== U+2191 ISOnum -->
  617. <!ENTITY rarr "&#8594;" ><!-- rightwards arrow
  618. ==================== U+2192 ISOnum -->
  619. <!ENTITY darr "&#8595;" ><!-- downwards arrow
  620. ==================== U+2193 ISOnum -->
  621. <!ENTITY harr "&#8596;" ><!-- left right arrow
  622. =================== U+2194 ISOamsa -->
  623. <!ENTITY crarr "&#8629;" ><!-- downwards arrow with corner leftwards
  624. carriage return
  625. ======================= U+21B5 NEW -->
  626. <!ENTITY lArr "&#8656;" ><!-- leftwards double arrow
  627. =================== U+21D0 ISOtech -->
  628. <!-- Unicode does not say that lArr is
  629. the same as the 'is implied by'
  630. arrow, but also does not have any
  631. other character for that function.
  632. As ISOtech suggests, lArr can be
  633. used for 'is implied by'. -->
  634. <!ENTITY uArr "&#8657;" ><!-- upwards double arrow
  635. =================== U+21D1 ISOamsa -->
  636. <!ENTITY rArr "&#8658;" ><!-- rightwards double arrow
  637. =================== U+21D2 ISOtech -->
  638. <!-- Unicode does not say that rArr is
  639. the same as the 'implies' arrow,
  640. but also does not have any other
  641. character for that function. As
  642. ISOtech suggests, rArr can be used
  643. for 'implies'. -->
  644. <!ENTITY dArr "&#8659;" ><!-- downwards double arrow
  645. =================== U+21D3 ISOamsa -->
  646. <!ENTITY hArr "&#8660;" ><!-- left right double arrow
  647. =================== U+21D4 ISOamsa -->
  648. <!ENTITY forall "&#8704;" ><!-- for all
  649. =================== U+2200 ISOtech -->
  650. <!ENTITY part "&#8706;" ><!-- partial differential
  651. =================== U+2202 ISOtech -->
  652. <!ENTITY exist "&#8707;" ><!-- there exists
  653. =================== U+2203 ISOtech -->
  654. <!ENTITY empty "&#8709;" ><!-- empty set
  655. null set
  656. diameter
  657. =================== U+2205 ISOamso -->
  658. <!ENTITY nabla "&#8711;" ><!-- nabla
  659. backward difference
  660. =================== U+2207 ISOtech -->
  661. <!ENTITY isin "&#8712;" ><!-- element of
  662. =================== U+2208 ISOtech -->
  663. <!ENTITY notin "&#8713;" ><!-- not an element of
  664. =================== U+2209 ISOtech -->
  665. <!ENTITY ni "&#8715;" ><!-- contains as member
  666. =================== U+220B ISOtech -->
  667. <!ENTITY prod "&#8719;" ><!-- n-ary product
  668. product sign
  669. =================== U+220F ISOamsb -->
  670. <!-- prod is NOT the same character as
  671. U+03A0, 'Greek capital letter pi',
  672. although the same glyph could be
  673. used to represent both -->
  674. <!ENTITY sum "&#8721;" ><!-- n-ary summation
  675. =================== U+2211 ISOamsb -->
  676. <!-- sum is NOT the same character as
  677. U+03A3, 'Greek capital letter sigma',
  678. although the same glyph could be
  679. used to represent both -->
  680. <!ENTITY minus "&#8722;" ><!-- minus sign
  681. =================== U+2212 ISOtech -->
  682. <!ENTITY lowast "&#8727;" ><!-- asterisk operator
  683. =================== U+2217 ISOtech -->
  684. <!ENTITY radic "&#8730;" ><!-- square root
  685. radical sign
  686. =================== U+221A ISOtech -->
  687. <!ENTITY prop "&#8733;" ><!-- proportional to
  688. =================== U+221D ISOtech -->
  689. <!ENTITY infin "&#8734;" ><!-- infinity
  690. =================== U+221E ISOtech -->
  691. <!ENTITY ang "&#8736;" ><!-- angle
  692. =================== U+2220 ISOamso -->
  693. <!ENTITY and "&#8743;" ><!-- logical and
  694. wedge
  695. =================== U+2227 ISOtech -->
  696. <!ENTITY or "&#8744;" ><!-- logical or
  697. vee
  698. =================== U+2228 ISOtech -->
  699. <!ENTITY cap "&#8745;" ><!-- intersection
  700. cap
  701. =================== U+2229 ISOtech -->
  702. <!ENTITY cup "&#8746;" ><!-- union
  703. cup
  704. =================== U+222A ISOtech -->
  705. <!ENTITY int "&#8747;" ><!-- integral
  706. =================== U+222B ISOtech -->
  707. <!ENTITY there4 "&#8756;" ><!-- therefore
  708. =================== U+2234 ISOtech -->
  709. <!ENTITY sim "&#8764;" ><!-- tilde operator
  710. varies with
  711. similar to
  712. =================== U+223C ISOtech -->
  713. <!-- tilde operator is NOT the same
  714. character as U+007E, 'tilde',
  715. although the same glyph could be
  716. used to represent both -->
  717. <!ENTITY cong "&#8773;" ><!-- approximately equal to
  718. =================== U+2245 ISOtech -->
  719. <!ENTITY asymp "&#8776;" ><!-- almost equal to
  720. asymptotic to
  721. =================== U+2248 ISOamsr -->
  722. <!ENTITY ne "&#8800;" ><!-- not equal to
  723. =================== U+2260 ISOtech -->
  724. <!ENTITY equiv "&#8801;" ><!-- identical to
  725. =================== U+2261 ISOtech -->
  726. <!ENTITY le "&#8804;" ><!-- less-than or equal to
  727. =================== U+2264 ISOtech -->
  728. <!ENTITY ge "&#8805;" ><!-- greater-than or equal to
  729. =================== U+2265 ISOtech -->
  730. <!ENTITY sub "&#8834;" ><!-- subset of
  731. =================== U+2282 ISOtech -->
  732. <!ENTITY sup "&#8835;" ><!-- superset of
  733. =================== U+2283 ISOtech -->
  734. <!ENTITY nsub "&#8836;" ><!-- not a subset of
  735. =================== U+2284 ISOamsn -->
  736. <!ENTITY sube "&#8838;" ><!-- subset of or equal to
  737. =================== U+2286 ISOtech -->
  738. <!ENTITY supe "&#8839;" ><!-- superset of or equal to
  739. =================== U+2287 ISOtech -->
  740. <!ENTITY oplus "&#8853;" ><!-- circled plus
  741. direct sum
  742. =================== U+2295 ISOamsb -->
  743. <!ENTITY otimes "&#8855;" ><!-- circled times
  744. vector product
  745. =================== U+2297 ISOamsb -->
  746. <!ENTITY perp "&#8869;" ><!-- up tack
  747. orthogonal to
  748. perpendicular
  749. =================== U+22A5 ISOtech -->
  750. <!ENTITY sdot "&#8901;" ><!-- dot operator
  751. =================== U+22C5 ISOamsb -->
  752. <!-- dot operator is NOT the same
  753. character as U+00B7, 'middle dot' -->
  754. <!ENTITY lceil "&#8968;" ><!-- left ceiling
  755. APL upstile
  756. =================== U+2308 ISOamsc -->
  757. <!ENTITY rceil "&#8969;" ><!-- right ceiling
  758. =================== U+2309 ISOamsc -->
  759. <!ENTITY lfloor "&#8970;" ><!-- left floor
  760. APL downstile
  761. =================== U+230A ISOamsc -->
  762. <!ENTITY rfloor "&#8971;" ><!-- right floor
  763. =================== U+230B ISOamsc -->
  764. <!ENTITY lang "&#9001;" ><!-- left-pointing angle bracket
  765. bra
  766. =================== U+2329 ISOtech -->
  767. <!-- lang is NOT the same character as
  768. U+003C, 'less than', or U+2039,
  769. 'single left-pointing angle quotation
  770. mark' -->
  771. <!ENTITY rang "&#9002;" ><!-- right-pointing angle bracket
  772. ket
  773. =================== U+232A ISOtech -->
  774. <!-- rang is NOT the same character as
  775. U+003E, 'greater than', or U+203A,
  776. 'single right-pointing angle quotation
  777. mark' -->
  778. <!ENTITY loz "&#9674;" ><!-- lozenge
  779. ==================== U+25CA ISOpub -->
  780. <!ENTITY spades "&#9824;" ><!-- black spade suit
  781. ==================== U+2660 ISOpub -->
  782. <!ENTITY clubs "&#9827;" ><!-- black club suit
  783. shamrock
  784. ==================== U+2663 ISOpub -->
  785. <!ENTITY hearts "&#9829;" ><!-- black heart suit
  786. valentine
  787. ==================== U+2665 ISOpub -->
  788. <!ENTITY diams "&#9830;" ><!-- black diamond suit
  789. ==================== U+2666 ISOpub -->