This page is intended to help analyze troublesome characters like punctuation and symbols. It is not comprehensive at all yet.

Category Codes:

Code | UTR50 | MSFT | Meaning |
---|---|---|---|

U | U | S | Upright; translates between horizontal and vertical |

R | S | R | Sideways; rotates between horizontal and vertical |

T_{U} | T | ST | Typeset upright with alternate glyph. Best fallback is just upright. |

T_{R} | SB | RT | Typeset upright with alternate glyph. Best fallback is just sideways. |

Two modes are presented: Stacking (`text-orientation: upright`

) and Default (TBD).

Default orientation is not covered yet; focusing on stacked mode first because it's simpler.

See Ken Whistler's notes on this category.

Code | Description | Char | Stack | Mixed | Memo |
---|---|---|---|---|---|

U+005F | LOW LINE | _ | U | R | Match double low line, overline |

U+203F | UNDERTIE | ‿ | R | R | Intended to link consecutive letters |

U+2040 | CHARACTER TIE | ⁀ | R | R | Intended to link consecutive letters |

U+2054 | INVERTED UNDERTIE | ⁔ | R | R | Intended to link consecutive letters |

U+FE33 | PRESENTATION FORM FOR VERTICAL LOW LINE | ︳ | U | U | Vertical presentation forms always upright |

U+FE34 | PRESENTATION FORM FOR VERTICAL WAVY LOW LINE | ︴ | U | U | Vertical presentation forms always upright |

U+FE4D | DASHED LOW LINE | ﹍ | U | R | Match low line |

U+FE4E | CENTRELINE LOW LINE | ﹎ | U | R | Match low line |

U+FE4F | WAVY LOW LINE | ﹏ | U | R | Match low line |

U+FF3F | FULLWIDTH LOW LINE | ＿ | U | R | Match low line |

Code | Description | Char | Stack | Mix | Memo |
---|---|---|---|---|---|

U+002D | HYPHEN-MINUS | - | R | R | This character is used as hyphen, as minus, and as dash. Suggested to treating as dash / hyphen-bullet, since this seems to be more common than use as hyphen or minus. |

U+058A | ARMENIAN HYPHEN | ֊ | R | R | Hyphens are sideways |

U+05BE | HEBREW PUNCTUATION MAQAF | ־ | R | R | |

U+1400 | CANADIAN SYLLABICS HYPHEN | ᐀ | R | R | Hyphens are sideways |

U+1806 | MONGOLIAN TODO SOFT HYPHEN | ᠆ | V | V | Mongolian is always sideways ?:? DVO=U |

U+2010 | HYPHEN | ‐ | R | R | Hyphens are sideways |

U+2011 | NON-BREAKING HYPHEN | ‑ | R | R | Hyphens are sideways |

U+2012 | FIGURE DASH | ‒ | R | R | Dashes are always sideways |

U+2013 | EN DASH | – | R | R | Dashes are always sideways |

U+2014 | EM DASH | — | R | R | Dashes are always sideways |

U+2015 | HORIZONTAL BAR | ― | R | R | Dashes are always sideways (EM DASH in Windows code page) |

U+2E17 | DOUBLE OBLIQUE HYPHEN | ⸗ | R | R | Hyphens are sideways |

U+2E1A | HYPHEN WITH DIAERESIS | ⸚ | R | R | Hyphens are sideways |

U+2E3A | TWO-EM DASH | ⸺ | R | R | Dashes are always sideways |

U+2E3B | THREE-EM DASH | ⸻ | R | R | Dashes are always sideways |

U+301C | WAVE DASH | 〜 | T_{R} | T_{R} | Wave dash must transform DVO=T |

U+3030 | WAVY DASH | 〰 | T_{R} | T_{R} | Wave dash must transform DVO=T |

U+30A0 | KATAKANA-HIRAGANA DOUBLE HYPHEN | ゠ | T_{R} | T_{R} | Sideways in JIS. Japanese fonts with this glyph expected to have vertical alternate. |

U+FE31 | PRESENTATION FORM FOR VERTICAL EM DASH | ︱ | U | U | Vertical presentation forms are always upright |

U+FE32 | PRESENTATION FORM FOR VERTICAL EN DASH | ︲ | U | U | Vertical presentation forms are always upright |

U+FE58 | SMALL EM DASH | ﹘ | R | R | Dashes are always sideways |

U+FF5E | FULLWIDTH TILDE | ～ | T | T | Dashes are sideways, and this is considered equivalent to WAVE DASH U+301C even though it's technically a Math Symbol (Sm) fullwidth variant of U+007E |

U+FE63 | SMALL HYPHEN-MINUS | ﹣ | R | R | Match fullwidth variant |

U+FF0D | FULLWIDTH HYPHEN-MINUS | － | R | R | Used as dash |

Code | Description | Char | Stack | Mix | Memo |
---|---|---|---|---|---|

U+0028 | LEFT PARENTHESIS | `(` | R | R | Brackets are sideways to enclose their text |

U+005B | LEFT SQUARE BRACKET | `[` | R | R | Brackets are sideways to enclose their text |

U+007B | LEFT CURLY BRACKET | `{` | R | R | Brackets are sideways to enclose their text |

U+0F3A | TIBETAN MARK GUG RTAGS GYON | `༺` | U | R | Unsure about Tibetan, assuming upright for stacked mode |

U+0F3C | TIBETAN MARK ANG KHANG GYON | `༼` | U | R | Unsure about Tibetan, assuming upright for stacked mode |

U+169B | OGHAM FEATHER MARK | `᚛` | R | R | Ogham is always sideways |

U+201A | SINGLE LOW-9 QUOTATION MARK | `‚` | T_{U} | R | Quotation marks are upright in stacked mode DVO=S |

U+201E | DOUBLE LOW-9 QUOTATION MARK | `„` | T_{U} | R | Quotation marks are upright in stacked mode DVO=S |

U+2045 | LEFT SQUARE BRACKET WITH QUILL | `⁅` | R | R | Brackets are sideways to enclose their text |

U+207D | SUPERSCRIPT LEFT PARENTHESIS | `⁽` | R | R | Brackets are sideways to enclose their text DVO=U |

U+208D | SUBSCRIPT LEFT PARENTHESIS | `₍` | R | R | Brackets are sideways to enclose their text DVO=U |

U+2329 | LEFT-POINTING ANGLE BRACKET | `〈` | R | R | Brackets are sideways to enclose their text |

U+2768 | MEDIUM LEFT PARENTHESIS ORNAMENT | `❨` | R | R | Brackets are sideways to enclose their text |

U+276A | MEDIUM FLATTENED LEFT PARENTHESIS ORNAMENT | `❪` | R | R | Brackets are sideways to enclose their text |

U+276C | MEDIUM LEFT-POINTING ANGLE BRACKET ORNAMENT | `❬` | R | R | Brackets are sideways to enclose their text |

U+276E | HEAVY LEFT-POINTING ANGLE QUOTATION MARK ORNAMENT | `❮` | R | R | Guillemets are sideways to enclose their text |

U+2770 | HEAVY LEFT-POINTING ANGLE BRACKET ORNAMENT | `❰` | R | R | Brackets are sideways to enclose their text |

U+2772 | LIGHT LEFT TORTOISE SHELL BRACKET ORNAMENT | `❲` | R | R | Brackets are sideways to enclose their text |

U+2774 | MEDIUM LEFT CURLY BRACKET ORNAMENT | `❴` | R | R | Brackets are sideways to enclose their text |

U+27C5 | LEFT S-SHAPED BAG DELIMITER | `⟅` | R | R | Brackets are sideways to enclose their text |

U+27E6 | MATHEMATICAL LEFT WHITE SQUARE BRACKET | `⟦` | R | R | Brackets are sideways to enclose their text |

U+27E8 | MATHEMATICAL LEFT ANGLE BRACKET | `⟨` | R | R | Brackets are sideways to enclose their text |

U+27EA | MATHEMATICAL LEFT DOUBLE ANGLE BRACKET | `⟪` | R | R | Brackets are sideways to enclose their text |

U+27EC | MATHEMATICAL LEFT WHITE TORTOISE SHELL BRACKET | `⟬` | R | R | Brackets are sideways to enclose their text |

U+27EE | MATHEMATICAL LEFT FLATTENED PARENTHESIS | `⟮` | R | R | Brackets are sideways to enclose their text |

U+2983 | LEFT WHITE CURLY BRACKET | `⦃` | R | R | Brackets are sideways to enclose their text |

U+2985 | LEFT WHITE PARENTHESIS | `⦅` | R | R | Brackets are sideways to enclose their text |

U+2987 | Z NOTATION LEFT IMAGE BRACKET | `⦇` | R | R | Brackets are sideways to enclose their text |

U+2989 | Z NOTATION LEFT BINDING BRACKET | `⦉` | R | R | Brackets are sideways to enclose their text |

U+298B | LEFT SQUARE BRACKET WITH UNDERBAR | `⦋` | R | R | Brackets are sideways to enclose their text |

U+298D | LEFT SQUARE BRACKET WITH TICK IN TOP CORNER | `⦍` | R | R | Brackets are sideways to enclose their text |

U+298F | LEFT SQUARE BRACKET WITH TICK IN BOTTOM CORNER | `⦏` | R | R | Brackets are sideways to enclose their text |

U+2991 | LEFT ANGLE BRACKET WITH DOT | `⦑` | R | R | Brackets are sideways to enclose their text |

U+2993 | LEFT ARC LESS-THAN BRACKET | `⦓` | R | R | Brackets are sideways to enclose their text |

U+2995 | DOUBLE LEFT ARC GREATER-THAN BRACKET | `⦕` | R | R | Brackets are sideways to enclose their text |

U+2997 | LEFT BLACK TORTOISE SHELL BRACKET | `⦗` | R | R | Brackets are sideways to enclose their text |

U+29D8 | LEFT WIGGLY FENCE | `⧘` | R | R | Brackets are sideways to enclose their text |

U+29DA | LEFT DOUBLE WIGGLY FENCE | `⧚` | R | R | Brackets are sideways to enclose their text |

U+29FC | LEFT-POINTING CURVED ANGLE BRACKET | `⧼` | R | R | Brackets are sideways to enclose their text |

U+2E22 | TOP LEFT HALF BRACKET | `⸢` | R | R | Brackets are sideways to enclose their text |

U+2E24 | BOTTOM LEFT HALF BRACKET | `⸤` | R | R | Brackets are sideways to enclose their text |

U+2E26 | LEFT SIDEWAYS U BRACKET | `⸦` | R | R | Brackets are sideways to enclose their text |

U+2E28 | LEFT DOUBLE PARENTHESIS | `⸨` | R | R | Brackets are sideways to enclose their text |

U+3008 | LEFT ANGLE BRACKET | `〈` | T_{R} | T_{R} | Brackets are sideways to enclose their text (CJK fonts usually have vertical glyph) |

U+300A | LEFT DOUBLE ANGLE BRACKET | `《` | T_{R} | T_{R} | Brackets are sideways to enclose their text (CJK fonts usually have vertical glyph) |

U+300C | LEFT CORNER BRACKET | `「` | T_{R} | T_{R} | Brackets are sideways to enclose their text (CJK fonts usually have vertical glyph) |

U+300E | LEFT WHITE CORNER BRACKET | `『` | T_{R} | T_{R} | Brackets are sideways to enclose their text (CJK fonts usually have vertical glyph) |

U+3010 | LEFT BLACK LENTICULAR BRACKET | `【` | T_{R} | T_{R} | Brackets are sideways to enclose their text (CJK fonts usually have vertical glyph) |

U+3014 | LEFT TORTOISE SHELL BRACKET | `〔` | T_{R} | T_{R} | Brackets are sideways to enclose their text (CJK fonts usually have vertical glyph) |

U+3016 | LEFT WHITE LENTICULAR BRACKET | `〖` | T_{R} | T_{R} | Brackets are sideways to enclose their text (CJK fonts usually have vertical glyph) |

U+3018 | LEFT WHITE TORTOISE SHELL BRACKET | `〘` | T_{R} | T_{R} | Brackets are sideways to enclose their text (CJK fonts usually have vertical glyph) |

U+301A | LEFT WHITE SQUARE BRACKET | `〚` | T_{R} | T_{R} | Brackets are sideways to enclose their text (CJK fonts usually have vertical glyph) |

U+301D | REVERSED DOUBLE PRIME QUOTATION MARK | `〝` | T_{U} | T_{U} | Quotation marks are upright in stacked mode DVO=S |

U+FD3E | ORNATE LEFT PARENTHESIS | `﴾` | R | R | Brackets are sideways to enclose their text |

U+FE17 | PRESENTATION FORM FOR VERTICAL LEFT WHITE LENTICULAR BRACKET | `︗` | U | U | Vertical presentation forms are always upright |

U+FE35 | PRESENTATION FORM FOR VERTICAL LEFT PARENTHESIS | `︵` | U | U | Vertical presentation forms are always upright |

U+FE37 | PRESENTATION FORM FOR VERTICAL LEFT CURLY BRACKET | `︷` | U | U | Vertical presentation forms are always upright |

U+FE39 | PRESENTATION FORM FOR VERTICAL LEFT TORTOISE SHELL BRACKET | `︹` | U | U | Vertical presentation forms are always upright |

U+FE3B | PRESENTATION FORM FOR VERTICAL LEFT BLACK LENTICULAR BRACKET | `︻` | U | U | Vertical presentation forms are always upright |

U+FE3D | PRESENTATION FORM FOR VERTICAL LEFT DOUBLE ANGLE BRACKET | `︽` | U | U | Vertical presentation forms are always upright |

U+FE3F | PRESENTATION FORM FOR VERTICAL LEFT ANGLE BRACKET | `︿` | U | U | Vertical presentation forms are always upright |

U+FE41 | PRESENTATION FORM FOR VERTICAL LEFT CORNER BRACKET | `﹁` | U | U | Vertical presentation forms are always upright |

U+FE43 | PRESENTATION FORM FOR VERTICAL LEFT WHITE CORNER BRACKET | `﹃` | U | U | Vertical presentation forms are always upright |

U+FE47 | PRESENTATION FORM FOR VERTICAL LEFT SQUARE BRACKET | `﹇` | U | U | Vertical presentation forms are always upright |

U+FE59 | SMALL LEFT PARENTHESIS | `﹙` | R | R | Brackets are sideways to enclose their text |

U+FE5B | SMALL LEFT CURLY BRACKET | `﹛` | R | R | Brackets are sideways to enclose their text |

U+FE5D | SMALL LEFT TORTOISE SHELL BRACKET | `﹝` | R | R | Brackets are sideways to enclose their text |

U+FF08 | FULLWIDTH LEFT PARENTHESIS | `（` | T_{R} | T_{R} | Brackets are sideways to enclose their text (CJK fonts usually have vertical glyph) |

U+FF3B | FULLWIDTH LEFT SQUARE BRACKET | `［` | T_{R} | T_{R} | Brackets are sideways to enclose their text (CJK fonts usually have vertical glyph) |

U+FF5B | FULLWIDTH LEFT CURLY BRACKET | `｛` | T_{R} | T_{R} | Brackets are sideways to enclose their text (CJK fonts usually have vertical glyph) |

U+FF5F | FULLWIDTH LEFT WHITE PARENTHESIS | `｟` | T_{R} | T_{R} | Brackets are sideways to enclose their text (CJK fonts usually have vertical glyph) |

U+FF62 | HALFWIDTH LEFT CORNER BRACKET | `｢` | R | R | Brackets are sideways to enclose their text DVO=SB |

Code | Description | Char | Stack | Mix | Memo |
---|---|---|---|---|---|

U+0029 | RIGHT PARENTHESIS | ) | R | R | Brackets are sideways to enclose their text |

U+005D | RIGHT SQUARE BRACKET | ] | R | R | Brackets are sideways to enclose their text |

U+007D | RIGHT CURLY BRACKET | } | R | R | Brackets are sideways to enclose their text |

U+0F3B | TIBETAN MARK GUG RTAGS GYAS | ༻ | U | R | Unsure about Tibetan, assuming upright for stacked mode |

U+0F3D | TIBETAN MARK ANG KHANG GYAS | ༽ | U | R | Unsure about Tibetan, assuming upright for stacked mode |

U+169C | OGHAM REVERSED FEATHER MARK | ᚜ | R | R | Ogham is always sideways |

U+2046 | RIGHT SQUARE BRACKET WITH QUILL | ⁆ | R | R | Brackets are sideways to enclose their text |

U+207E | SUPERSCRIPT RIGHT PARENTHESIS | ⁾ | R | R | Brackets are sideways to enclose their text |

U+208E | SUBSCRIPT RIGHT PARENTHESIS | ₎ | R | R | Brackets are sideways to enclose their text |

U+232A | RIGHT-POINTING ANGLE BRACKET | 〉 | R | R | Brackets are sideways to enclose their text |

U+2769 | MEDIUM RIGHT PARENTHESIS ORNAMENT | ❩ | R | R | Brackets are sideways to enclose their text |

U+276B | MEDIUM FLATTENED RIGHT PARENTHESIS ORNAMENT | ❫ | R | R | Brackets are sideways to enclose their text |

U+276D | MEDIUM RIGHT-POINTING ANGLE BRACKET ORNAMENT | ❭ | R | R | Brackets are sideways to enclose their text |

U+276F | HEAVY RIGHT-POINTING ANGLE QUOTATION MARK ORNAMENT | ❯ | R | R | Guillemets are sideways to enclose their text |

U+2771 | HEAVY RIGHT-POINTING ANGLE BRACKET ORNAMENT | ❱ | R | R | Brackets are sideways to enclose their text |

U+2773 | LIGHT RIGHT TORTOISE SHELL BRACKET ORNAMENT | ❳ | R | R | Brackets are sideways to enclose their text |

U+2775 | MEDIUM RIGHT CURLY BRACKET ORNAMENT | ❵ | R | R | Brackets are sideways to enclose their text |

U+27C6 | RIGHT S-SHAPED BAG DELIMITER | ⟆ | R | R | Brackets are sideways to enclose their text |

U+27E7 | MATHEMATICAL RIGHT WHITE SQUARE BRACKET | ⟧ | R | R | Brackets are sideways to enclose their text |

U+27E9 | MATHEMATICAL RIGHT ANGLE BRACKET | ⟩ | R | R | Brackets are sideways to enclose their text |

U+27EB | MATHEMATICAL RIGHT DOUBLE ANGLE BRACKET | ⟫ | R | R | Brackets are sideways to enclose their text |

U+27ED | MATHEMATICAL RIGHT WHITE TORTOISE SHELL BRACKET | ⟭ | R | R | Brackets are sideways to enclose their text |

U+27EF | MATHEMATICAL RIGHT FLATTENED PARENTHESIS | ⟯ | R | R | Brackets are sideways to enclose their text |

U+2984 | RIGHT WHITE CURLY BRACKET | ⦄ | R | R | Brackets are sideways to enclose their text |

U+2986 | RIGHT WHITE PARENTHESIS | ⦆ | R | R | Brackets are sideways to enclose their text |

U+2988 | Z NOTATION RIGHT IMAGE BRACKET | ⦈ | R | R | Brackets are sideways to enclose their text |

U+298A | Z NOTATION RIGHT BINDING BRACKET | ⦊ | R | R | Brackets are sideways to enclose their text |

U+298C | RIGHT SQUARE BRACKET WITH UNDERBAR | ⦌ | R | R | Brackets are sideways to enclose their text |

U+298E | RIGHT SQUARE BRACKET WITH TICK IN BOTTOM CORNER | ⦎ | R | R | Brackets are sideways to enclose their text |

U+2990 | RIGHT SQUARE BRACKET WITH TICK IN TOP CORNER | ⦐ | R | R | Brackets are sideways to enclose their text |

U+2992 | RIGHT ANGLE BRACKET WITH DOT | ⦒ | R | R | Brackets are sideways to enclose their text |

U+2994 | RIGHT ARC GREATER-THAN BRACKET | ⦔ | R | R | Brackets are sideways to enclose their text |

U+2996 | DOUBLE RIGHT ARC LESS-THAN BRACKET | ⦖ | R | R | Brackets are sideways to enclose their text |

U+2998 | RIGHT BLACK TORTOISE SHELL BRACKET | ⦘ | R | R | Brackets are sideways to enclose their text |

U+29D9 | RIGHT WIGGLY FENCE | ⧙ | R | R | Brackets are sideways to enclose their text |

U+29DB | RIGHT DOUBLE WIGGLY FENCE | ⧛ | R | R | Brackets are sideways to enclose their text |

U+29FD | RIGHT-POINTING CURVED ANGLE BRACKET | ⧽ | R | R | Brackets are sideways to enclose their text |

U+2E23 | TOP RIGHT HALF BRACKET | ⸣ | R | R | Brackets are sideways to enclose their text |

U+2E25 | BOTTOM RIGHT HALF BRACKET | ⸥ | R | R | Brackets are sideways to enclose their text |

U+2E27 | RIGHT SIDEWAYS U BRACKET | ⸧ | R | R | Brackets are sideways to enclose their text |

U+2E29 | RIGHT DOUBLE PARENTHESIS | ⸩ | R | R | Brackets are sideways to enclose their text |

U+3009 | RIGHT ANGLE BRACKET | 〉 | T_{R} | T_{R} | Brackets are sideways to enclose their text |

U+300B | RIGHT DOUBLE ANGLE BRACKET | 》 | T_{R} | T_{R} | Brackets are sideways to enclose their text |

U+300D | RIGHT CORNER BRACKET | 」 | T_{R} | T_{R} | Brackets are sideways to enclose their text |

U+300F | RIGHT WHITE CORNER BRACKET | 』 | T_{R} | T_{R} | Brackets are sideways to enclose their text |

U+3011 | RIGHT BLACK LENTICULAR BRACKET | 】 | T_{R} | T_{R} | Brackets are sideways to enclose their text |

U+3015 | RIGHT TORTOISE SHELL BRACKET | 〕 | T_{R} | T_{R} | Brackets are sideways to enclose their text |

U+3017 | RIGHT WHITE LENTICULAR BRACKET | 〗 | T_{R} | T_{R} | Brackets are sideways to enclose their text |

U+3019 | RIGHT WHITE TORTOISE SHELL BRACKET | 〙 | T_{R} | T_{R} | Brackets are sideways to enclose their text |

U+301B | RIGHT WHITE SQUARE BRACKET | 〛 | T_{R} | T_{R} | Brackets are sideways to enclose their text |

U+301E | DOUBLE PRIME QUOTATION MARK | 〞 | T_{U} | T_{U} | Quotation marks are upright, but need some shifting. Prime quotes are mainly used for CJK, should be upright. UTR#50 DVO has quotes S |

U+301F | LOW DOUBLE PRIME QUOTATION MARK | 〟 | T_{U} | T_{U} | Quotation marks are upright, but need some shifting. Prime quotes are mainly used for CJK, should be upright. UTR#50 DVO has quotes S |

U+FD3F | ORNATE RIGHT PARENTHESIS | ﴿ | R | R | Brackets are sideways to enclose their text |

U+FE18 | PRESENTATION FORM FOR VERTICAL RIGHT WHITE LENTICULAR BRAKCET | ︘ | U | U | Vertical presentation forms are always upright |

U+FE36 | PRESENTATION FORM FOR VERTICAL RIGHT PARENTHESIS | ︶ | U | U | Vertical presentation forms are always upright |

U+FE38 | PRESENTATION FORM FOR VERTICAL RIGHT CURLY BRACKET | ︸ | U | U | Vertical presentation forms are always upright |

U+FE3A | PRESENTATION FORM FOR VERTICAL RIGHT TORTOISE SHELL BRACKET | ︺ | U | U | Vertical presentation forms are always upright |

U+FE3C | PRESENTATION FORM FOR VERTICAL RIGHT BLACK LENTICULAR BRACKET | ︼ | U | U | Vertical presentation forms are always upright |

U+FE3E | PRESENTATION FORM FOR VERTICAL RIGHT DOUBLE ANGLE BRACKET | ︾ | U | U | Vertical presentation forms are always upright |

U+FE40 | PRESENTATION FORM FOR VERTICAL RIGHT ANGLE BRACKET | ﹀ | U | U | Vertical presentation forms are always upright |

U+FE42 | PRESENTATION FORM FOR VERTICAL RIGHT CORNER BRACKET | ﹂ | U | U | Vertical presentation forms are always upright |

U+FE44 | PRESENTATION FORM FOR VERTICAL RIGHT WHITE CORNER BRACKET | ﹄ | U | U | Vertical presentation forms are always upright |

U+FE48 | PRESENTATION FORM FOR VERTICAL RIGHT SQUARE BRACKET | ﹈ | U | U | Vertical presentation forms are always upright |

U+FE5A | SMALL RIGHT PARENTHESIS | ﹚ | R | R | Brackets are sideways to enclose their text |

U+FE5C | SMALL RIGHT CURLY BRACKET | ﹜ | R | R | Brackets are sideways to enclose their text |

U+FE5E | SMALL RIGHT TORTOISE SHELL BRACKET | ﹞ | R | R | Brackets are sideways to enclose their text |

U+FF09 | FULLWIDTH RIGHT PARENTHESIS | ） | T_{R} | T_{R} | Brackets are sideways to enclose their text (CJK fonts usually have vertical glyph) |

U+FF3D | FULLWIDTH RIGHT SQUARE BRACKET | ］ | T_{R} | T_{R} | Brackets are sideways to enclose their text (CJK fonts usually have vertical glyph) |

U+FF5D | FULLWIDTH RIGHT CURLY BRACKET | ｝ | T_{R} | T_{R} | Brackets are sideways to enclose their text (CJK fonts usually have vertical glyph) |

U+FF60 | FULLWIDTH RIGHT WHITE PARENTHESIS | ｠ | T_{R} | T_{R} | Brackets are sideways to enclose their text (CJK fonts usually have vertical glyph) |

U+FF63 | HALFWIDTH RIGHT CORNER BRACKET | ｣ | R | R | Brackets are sideways to enclose their text |

Code | Description | Char | Stack | Mix | Memo |
---|---|---|---|---|---|

U+00AB | LEFT-POINTING DOUBLE ANGLE QUOTATION MARK | « | R | R | Guillmets are sideways to enclose text |

U+2018 | LEFT SINGLE QUOTATION MARK | ‘ | T_{U} | R | Quotation marks are upright in stacked mode, but need some shifting UTR#50 DVO has quotes S |

U+201B | SINGLE HIGH-REVERSED-9 QUOTATION MARK | ‛ | T_{U} | R | Quotation marks are upright in stacked mode, but need some shifting UTR#50 DVO has quotes S |

U+201C | LEFT DOUBLE QUOTATION MARK | “ | T_{U} | R | Quotation marks are upright in stacked mode, but need some shifting UTR#50 DVO has quotes S |

U+201F | DOUBLE HIGH-REVERSED-9 QUOTATION MARK | ‟ | T_{U} | R | Quotation marks are upright in stacked mode, but need some shifting UTR#50 DVO has quotes S |

U+2039 | SINGLE LEFT-POINTING ANGLE QUOTATION MARK | ‹ | R | R | Guillmets are sideways to enclose text |

U+2E02 | LEFT SUBSTITUTION BRACKET | ⸂ | U? | R | New Testament Editorial Symbols… DVO=U |

U+2E04 | LEFT DOTTED SUBSTITUTION BRACKET | ⸄ | U? | R | New Testament Editorial Symbols… DVO=U |

U+2E09 | LEFT TRANSPOSITION BRACKET | ⸉ | U? | R | New Testament Editorial Symbols… DVO=U |

U+2E0C | LEFT RAISED OMISSION BRACKET | ⸌ | U? | R | New Testament Editorial Symbols… DVO=U |

U+2E1C | LEFT LOW PARAPHRASE BRACKET | ⸜ | U? | R | N'Ko punctuation DVO=U |

U+2E20 | LEFT VERTICAL BAR WITH QUILL | ⸠ | R | R | Brackets are sideways to enclose text |

Code | Description | Char | Stack | Mix | Memo |
---|---|---|---|---|---|

U+00BB | RIGHT-POINTING DOUBLE ANGLE QUOTATION MARK | » | R | R | Guillmets are sideways to enclose text |

U+2019 | RIGHT SINGLE QUOTATION MARK | ’ | T_{U} | R | Quotation marks are upright in stacked mode, but need some shifting UTR#50 DVO has quotes S |

U+201D | RIGHT DOUBLE QUOTATION MARK | ” | T_{U} | R | Quotation marks are upright in stacked mode, but need some shifting UTR#50 DVO has quotes S |

U+203A | SINGLE RIGHT-POINTING ANGLE QUOTATION MARK | › | R | R | Guillmets are sideways to enclose text |

U+2E03 | RIGHT SUBSTITUTION BRACKET | ⸃ | U? | R | New Testament Editorial Symbols… DVO=U |

U+2E05 | RIGHT DOTTED SUBSTITUTION BRACKET | ⸅ | U? | R | New Testament Editorial Symbols… DVO=U |

U+2E0A | RIGHT TRANSPOSITION BRACKET | ⸊ | U? | R | New Testament Editorial Symbols… DVO=U |

U+2E0D | RIGHT RAISED OMISSION BRACKET | ⸍ | U? | R | New Testament Editorial Symbols… DVO=U |

U+2E1D | RIGHT LOW PARAPHRASE BRACKET | ⸝ | U? | R | N'Ko punctuation DVO=U |

U+2E21 | RIGHT VERTICAL BAR WITH QUILL | ⸡ | R | R | Brackets are sideways to enclose text |

Code | Description | Char | Stack | Mix | Memo |
---|---|---|---|---|---|

U+0020 | SPACE | ` ` | U | R | Probably better to stack upright like letters, allow font to set vertical metrics. DVO=S |

U+00A0 | NO-BREAK SPACE | ` ` | U | R | Must match U+0020 DVO=S |

U+1680 | OGHAM SPACE MARK | ` ` | R | R | Ogham is sideways |

U+180E | MONGOLIAN VOWEL SEPARATOR | `` | V | V | Mongolian is sideways DVO=U |

U+2000 | EN QUAD | ` ` | R | R | Fixed-size spacing. Provide spacing in advance direction. |

U+2001 | EM QUAD | ` ` | R | R | Fixed-size spacing. Provide spacing in advance direction. |

U+2002 | EN SPACE | ` ` | R | R | Fixed-size spacing. Provide spacing in advance direction. |

U+2003 | EM SPACE | ` ` | R | R | Fixed-size spacing. Provide spacing in advance direction. |

U+2004 | THREE-PER-EM SPACE | ` ` | R | R | Fixed-size spacing. Provide spacing in advance direction.. |

U+2005 | FOUR-PER-EM SPACE | ` ` | R | R | Fixed-size spacing. Provide spacing in advance direction. |

U+2006 | SIX-PER-EM SPACE | ` ` | R | R | Fixed-size spacing. Provide spacing in advance direction. |

U+2007 | FIGURE SPACE | ` ` | U | R | Should provide same advance as a digit, so match digits. |

U+2008 | PUNCTUATION SPACE | ` ` | T | R | Should match advance of comma/period. |

U+2009 | THIN SPACE | ` ` | R | R | Provide spacing in advance direction. Often used with e.g. dashes and guillmets. |

U+200A | HAIR SPACE | ` ` | R | R | Provide spacing in advance direction. Often used with e.g. dashes and guillmets. |

U+202F | NARROW NO-BREAK SPACE | ` ` | R | R | Provide spacing in advance direction. Often used with e.g. dashes and guillmets. |

U+205F | MEDIUM MATHEMATICAL SPACE | ` ` | R | R | Provide spacing in advance direction. Used to space mathematical operators. |

U+3000 | IDEOGRAPHIC SPACE | ` ` | U | U | Make upright so that vertical metrics can be used to match non-square ideographic characters. DVO=S |

Code | Description | Char | Stack | Mix | Memo |
---|---|---|---|---|---|

U+0021 | EXCLAMATION MARK | ! | U | R | |

U+0022 | QUOTATION MARK | “ | T_{U} | R | Needs different position within bounding box, and/or different advance width (esp. when used as open-quote). |

U+0023 | NUMBER SIGN | # | U | R | |

U+0025 | PERCENT SIGN | % | U | R | |

U+0026 | AMPERSAND | & | U | R | |

U+0027 | APOSTROPHE | ' | T_{U} | R | Needs different position within bounding box and/or different advance width. |

U+002A | ASTERISK | * | U | R | |

U+002C | COMMA | , | T_{U} | R | Needs different position within bounding box and/or different advance width. |

U+002E | FULL STOP | . | T_{U} | R | Needs different position within bounding box and/or different advance width. |

U+002F | SOLIDUS | / | U | R | |

U+003A | COLON | : | U | R | |

U+003B | SEMICOLON | ; | U | R | |

U+003F | QUESTION MARK | ? | U | R | |

U+0040 | COMMERCIAL AT | @ | U | R | |

U+005C | REVERSE SOLIDUS | \ | U | R | |

U+00A1 | INVERTED EXCLAMATION MARK | ¡ | U | R | |

U+00A7 | SECTION SIGN | § | U | U | |

U+00B6 | PILCROW SIGN | ¶ | U | U | |

U+00B7 | MIDDLE DOT | · | U | R | |

U+00BF | INVERTED QUESTION MARK | ¿ | U | R | |

U+2016 | DOUBLE VERTICAL LINE | ‖ | U | U | Most modern fonts have rotated vert, but JIS0213 says U. Taro wants R |

U+2017 | DOUBLE LOW LINE | ‗ | U | R | Match LOW LINE? |

U+2020 | DAGGER | † | U | U | |

U+2021 | DOUBLE DAGGER | ‡ | U | U | |

U+2022 | BULLET | • | U | R | |

U+2023 | TRIANGULAR BULLET | ‣ | U | R | |

U+2024 | ONE DOT LEADER | ․ | R | R | Leaders always parallel to inline direction DVO=U |

U+2025 | TWO DOT LEADER | ‥ | R | R | Leaders always parallel to inline direction DVO=U |

U+2026 | HORIZONTAL ELLIPSIS | … | R | R | Ellipsis always parallel to inline direction DVO=U |

U+2027 | HYPHENATION POINT | ‧ | U | R | |

U+2030 | PER MILLE SIGN | ‰ | U | U | Used in East Asian codepages |

U+2031 | PER TEN THOUSAND SIGN | ‱ | U | U | Used in East Asian codepages |

U+2032 | PRIME | ′ | U | R | |

U+2033 | DOUBLE PRIME | ″ | U | R | |

U+2034 | TRIPLE PRIME | ‴ | U | R | |

U+2035 | REVERSED PRIME | ‵ | U | R | |

U+2036 | REVERSED DOUBLE PRIME | ‶ | U | R | |

U+2037 | REVERSED TRIPLE PRIME | ‷ | U | R | |

U+2038 | CARET | ‸ | U | R | |

U+203B | REFERENCE MARK | ※ | U | U | |

U+203C | DOUBLE EXCLAMATION MARK | ‼ | U | U | |

U+203D | INTERROBANG | ‽ | U | U | |

U+203E | OVERLINE | ‾ | U | R | Match LOW LINE |

U+2041 | CARET INSERTION POINT | ⁁ | U | R | |

U+2042 | ASTERISM | ⁂ | U | U | In JIS0213 |

U+2043 | HYPHEN BULLET | ⁃ | R | R | Match hyphen |

U+2047 | DOUBLE QUESTION MARK | ⁇ | U | U | |

U+2048 | QUESTION EXCLAMATION MARK | ⁈ | U | U | |

U+2049 | EXCLAMATION QUESTION MARK | ⁉ | U | U | |

U+204A | TIRONIAN SIGN ET | ⁊ | U | R | Used with Latin |

U+204B | REVERSED PILCROW SIGN | ⁋ | U | U | Match PILCROW |

U+204C | BLACK LEFTWARDS BULLET | ⁌ | U | R | |

U+204D | BLACK RIGHTWARDS BULLET | ⁍ | U | R | |

U+204E | LOW ASTERISK | ⁎ | U | R | Match asterisk |

U+204F | REVERSED SEMICOLON | ⁏ | U | R | Match semicolon |

U+2050 | CLOSE UP | ⁐ | R | R | Copyediting symbol |

U+2051 | TWO ASTERISKS ALIGNED VERTICALLY | ⁑ | U | U | In JIS0213 |

U+2053 | SWUNG DASH | ⁓ | R | R | Dashes are always sideways |

U+2055 | FLOWER PUNCTUATION MARK | ⁕ | U | R | |

U+2056 | THREE DOT PUNCTUATION | ⁖ | U | R | |

U+2057 | QUADRUPLE PRIME | ⁗ | U | R | |

U+2058 | FOUR DOT PUNCTUATION | ⁘ | U | R | |

U+2059 | FIVE DOT PUNCTUATION | ⁙ | U | R | |

U+205A | TWO DOT PUNCTUATION | ⁚ | U | R | See picture |

U+205B | FOUR DOT MARK | ⁛ | U | R | |

U+205C | DOTTED CROSS | ⁜ | U | R | |

U+205D | TRICOLON | ⁝ | U | R | |

U+205E | VERTICAL FOUR DOTS | ⁞ | U | R | |

U+2E00 | RIGHT ANGLE SUBSTITUTION MARKER | ⸀ | U | R | |

U+2E01 | RIGHT ANGLE DOTTED SUBSTITUTION MARKER | ⸁ | U | R | |

U+2E06 | RAISED INTERPOLATION MARKER | ⸆ | U | R | |

U+2E07 | RAISED DOTTED INTERPOLATION MARKER | ⸇ | U | R | |

U+2E08 | DOTTED TRANSPOSITION MARKER | ⸈ | U | R | |

U+2E0B | RAISED SQUARE | ⸋ | U | R | |

U+2E0E | EDITORIAL CORONIS | ⸎ | U | R | |

U+2E0F | PARAGRAPHOS | ⸏ | U | R | |

U+2E10 | FORKED PARAGRAPHOS | ⸐ | U | R | |

U+2E11 | REVERSED FORKED PARAGRAPHOS | ⸑ | U | R | |

U+2E12 | HYPODIASTOLE | ⸒ | U | R | |

U+2E13 | DOTTED OBELOS | ⸓ | U | R | |

U+2E14 | DOWNWARDS ANCORA | ⸔ | U | R | |

U+2E15 | UPWARDS ANCORA | ⸕ | U | R | |

U+2E16 | DOTTED RIGHT-POINTING ANGLE | ⸖ | U | R | |

U+2E18 | INVERTED INTERROBANG | ⸘ | U | R | Mismatch with interrobang‽ |

U+2E19 | PALM BRANCH | ⸙ | U | R | |

U+2E1B | TILDE WITH RING ABOVE | ⸛ | U | R | |

U+2E1E | TILDE WITH DOT ABOVE | ⸞ | U | R | |

U+2E1F | TILDE WITH DOT BELOW | ⸟ | U | R | |

U+2E2A | TWO DOTS OVER ONE DOT PUNCTUATION | ⸪ | U | R | |

U+2E2B | ONE DOT OVER TWO DOTS PUNCTUATION | ⸫ | U | R | |

U+2E2C | SQUARED FOUR DOT PUNCTUATION | ⸬ | U | R | |

U+2E2D | FIVE DOT MARK | ⸭ | U | R | |

U+2E2E | REVERSED QUESTION MARK | ⸮ | U | R | |

U+2E30 | RING POINT | ⸰ | U | R | |

U+2E31 | WORD SEPARATOR MIDDLE DOT | ⸱ | U | R | |

U+2E32 | TURNED COMMA | ⸲ | U | R | |

U+2E33 | RAISED DOT | ⸳ | U | R | |

U+2E34 | RAISED COMMA | ⸴ | U | R | |

U+2E35 | TURNED SEMICOLON | ⸵ | U | R | |

U+2E36 | DAGGER WITH LEFT GUARD | ⸶ | U | U | Match DAGGER |

U+2E37 | DAGGER WITH RIGHT GUARD | ⸷ | U | U | Match DAGGER |

U+2E38 | TURNED DAGGER | ⸸ | U | U | Match DAGGER |

U+2E39 | TOP HALF SECTION SIGN | ⸹ | U | U | Match SECTION SIGN |

U+3001 | IDEOGRAPHIC COMMA | 、 | T_{U} | T_{U} | Ideographic variants upright; comma needs shifting |

U+3002 | IDEOGRAPHIC FULL STOP | 。 | T_{U} | T_{U} | Ideographic variants upright; full stop needs shifting |

U+3003 | DITTO MARK | 〃 | U | U | |

U+303D | PART ALTERNATION MARK | 〽 | U | U | Used in Japanese verse notation |

U+30FB | KATAKANA MIDDLE DOT | ・ | U | U | Katakana middle dot is upright |

U+FE45 | SESAME DOT | ﹅ | U | U | Sesame dots are always upright |

U+FE46 | WHITE SESAME DOT | ﹆ | U | U | Sesame dots are always upright |

U+FE49 | DASHED OVERLINE | ﹉ | U | R | Match dashed low line |

U+FE4A | CENTRELINE OVERLINE | ﹊ | U | R | Match centerline low line |

U+FE4B | WAVY OVERLINE | ﹋ | U | R | Match wavy low line |

U+FE4C | DOUBLE WAVY OVERLINE | ﹌ | U | R | Match wavy low line |

U+10B3A | TINY TWO DOTS OVER ONE DOT PUNCTUATION | 𐬺 | U | R | Match Avestan |

U+10B3B | SMALL TWO DOTS OVER ONE DOT PUNCTUATION | 𐬻 | U | R | Match Avestan |

U+10B3C | LARGE TWO DOTS OVER ONE DOT PUNCTUATION | 𐬼 | U | R | Match Avestan |

U+10B3D | LARGE ONE DOT OVER TWO DOTS PUNCTUATION | 𐬽 | U | R | Match Avestan |

U+10B3E | LARGE TWO RINGS OVER ONE RING PUNCTUATION | 𐬾 | U | R | Match Avestan |

U+10B3F | LARGE ONE RING OVER TWO RINGS PUNCTUATION | 𐬿 | U | R | Match Avestan |

Code | Description | Char | Stack | Mix | Memo |
---|---|---|---|---|---|

U+FE50 | SMALL COMMA | ﹐ | T_{U} | T_{U} | Small variants upright; comma needs shifting DVO=U |

U+FE51 | SMALL IDEOGRAPHIC COMMA | ﹑ | T_{U} | T_{U} | Small variants upright; comma needs shifting DVO=U |

U+FE52 | SMALL FULL STOP | ﹒ | T_{U} | T_{U} | Small variants upright; full stop needs shifting DVO=U |

U+FE54 | SMALL SEMICOLON | ﹔ | T_{U} | T_{U} | Small semicolons are either upright (Chinese-style) or rotated sideways (Japanese-style) DVO=U |

U+FE55 | SMALL COLON | ﹕ | T_{U} | T_{U} | Small colons are either upright (Chinese-style) or rotated sideways (Japanese-style) DVO=U |

U+FE56 | SMALL QUESTION MARK | ﹖ | U | U | Small variants upright |

U+FE57 | SMALL EXCLAMATION MARK | ﹗ | U | U | Small variants upright |

U+FE5F | SMALL NUMBER SIGN | ﹟ | U | U | Small variants upright |

U+FE60 | SMALL AMPERSAND | ﹠ | U | U | Small variants upright |

U+FE61 | SMALL ASTERISK | ﹡ | U | U | Small variants upright |

U+FE68 | SMALL REVERSE SOLIDUS | ﹨ | U | U | Small variants upright |

U+FE6A | SMALL PERCENT SIGN | ﹪ | U | U | Small variants upright |

U+FE6B | SMALL COMMERCIAL AT | ﹫ | U | U | Small variants upright |

U+FF01 | FULLWIDTH EXCLAMATION MARK | ！ | U | U | Fullwidth variants are upright |

U+FF02 | FULLWIDTH QUOTATION MARK | ＂ | T_{U} | T_{U} | Fullwidth variants are upright; quotes need alt glyph UTR#50 DVO has U |

U+FF03 | FULLWIDTH NUMBER SIGN | ＃ | U | U | Fullwidth variants are upright |

U+FF05 | FULLWIDTH PERCENT SIGN | ％ | U | U | Fullwidth variants are upright |

U+FF06 | FULLWIDTH AMPERSAND | ＆ | U | U | Fullwidth variants are upright |

U+FF07 | FULLWIDTH APOSTROPHE | ＇ | T_{U} | T_{U} | Fullwidth variants are upright; quotes need alt glyph DVO=U |

U+FF0A | FULLWIDTH ASTERISK | ＊ | U | U | Fullwidth variants are upright |

U+FF0C | FULLWIDTH COMMA | ， | T_{U} | T_{U} | Fullwidth variants are upright; comma needs shifting DVO=U |

U+FF0E | FULLWIDTH FULL STOP | ． | T_{U} | T_{U} | Fullwidth variants are upright; full stop needs shifting DVO=U |

U+FF0F | FULLWIDTH SOLIDUS | ／ | U | U | Fullwidth variants are upright |

U+FF1A | FULLWIDTH COLON | ： | T_{U} | T_{U} | Fullwidth colons are either upright (Chinese-style) or rotated sideways (Japanese-style) DVO=U |

U+FF1B | FULLWIDTH SEMICOLON | ； | T_{U} | T_{U} | Fullwidth semicolons are either upright (Chinese-style) or rotated sideways (Japanese-style) DVO=U |

U+FF1F | FULLWIDTH QUESTION MARK | ？ | U | U | Fullwidth variants are upright |

U+FF20 | FULLWIDTH COMMERCIAL AT | ＠ | U | U | Fullwidth variants are upright |

U+FF3C | FULLWIDTH REVERSE SOLIDUS | ＼ | U | U | Fullwidth variants are upright |

U+FF61 | HALFWIDTH IDEOGRAPHIC FULL STOP | ｡ | T_{U} | R | Halfwidth is R. Upright full stop needs shifting; Halfwidth needs transform to be half-width |

U+FF64 | HALFWIDTH IDEOGRAPHIC COMMA | ､ | T_{U} | R | Halfwidth is R. Upright comma needs shifting; Halfwidth needs transform to be half-width |

U+FF65 | HALFWIDTH KATAKANA MIDDLE DOT | ･ | T_{U} | R | Halfwidth is R. Halfwidth needs transform to be half-width? DVO=U |

U+FE10 | PRESENTATION FORM FOR VERTICAL COMMA | ︐ | U | U | Vertical presentation forms are always upright |

U+FE11 | PRESENTATION FORM FOR VERTICAL IDEOGRAPHIC COMMA | ︑ | U | U | Vertical presentation forms are always upright |

U+FE12 | PRESENTATION FORM FOR VERTICAL IDEOGRAPHIC FULL STOP | ︒ | U | U | Vertical presentation forms are always upright |

U+FE13 | PRESENTATION FORM FOR VERTICAL COLON | ︓ | U | U | Vertical presentation forms are always upright |

U+FE14 | PRESENTATION FORM FOR VERTICAL SEMICOLON | ︔ | U | U | Vertical presentation forms are always upright |

U+FE15 | PRESENTATION FORM FOR VERTICAL EXCLAMATION MARK | ︕ | U | U | Vertical presentation forms are always upright |

U+FE16 | PRESENTATION FORM FOR VERTICAL QUESTION MARK | ︖ | U | U | Vertical presentation forms are always upright |

U+FE19 | PRESENTATION FORM FOR VERTICAL HORIZONTAL ELLIPSIS | ︙ | U | U | Vertical presentation forms are always upright |

U+FE30 | PRESENTATION FORM FOR VERTICAL TWO DOT LEADER | ︰ | U | U | Vertical presentation forms are always upright |

Code | Description | Char | Stack | Mix | Memo |
---|---|---|---|---|---|

U+037E | GREEK QUESTION MARK | ; | U | R | |

U+0387 | GREEK ANO TELEIA | · | U | R | |

U+055A | ARMENIAN APOSTROPHE | ՚ | U | R | |

U+055B | ARMENIAN EMPHASIS MARK | ՛ | U | R | |

U+055C | ARMENIAN EXCLAMATION MARK | ՜ | U | R | |

U+055D | ARMENIAN COMMA | ՝ | U | R | |

U+055E | ARMENIAN QUESTION MARK | ՞ | U | R | |

U+055F | ARMENIAN ABBREVIATION MARK | ՟ | U | R | |

U+0589 | ARMENIAN FULL STOP | ։ | U | R | |

U+05C0 | HEBREW PUNCTUATION PASEQ | ׀ | U | R | |

U+05C3 | HEBREW PUNCTUATION SOF PASUQ | ׃ | U | R | |

U+05C6 | HEBREW PUNCTUATION NUN HAFUKHA | ׆ | U | R | |

U+05F3 | HEBREW PUNCTUATION GERESH | ׳ | U | R | |

U+05F4 | HEBREW PUNCTUATION GERSHAYIM | ״ | U | R | |

U+0609 | ARABIC-INDIC PER MILLE SIGN | ؉ | U | R | |

U+060A | ARABIC-INDIC PER TEN THOUSAND SIGN | ؊ | U | R | |

U+060C | ARABIC COMMA | ، | U | R | |

U+060D | ARABIC DATE SEPARATOR | ؍ | U | R | |

U+061B | ARABIC SEMICOLON | ؛ | U | R | |

U+061E | ARABIC TRIPLE DOT PUNCTUATION MARK | ؞ | U | R | |

U+061F | ARABIC QUESTION MARK | ؟ | U | R | |

U+066A | ARABIC PERCENT SIGN | ٪ | U | R | |

U+066B | ARABIC DECIMAL SEPARATOR | ٫ | U | R | |

U+066C | ARABIC THOUSANDS SEPARATOR | ٬ | U | R | |

U+066D | ARABIC FIVE POINTED STAR | ٭ | U | R | |

U+06D4 | ARABIC FULL STOP | ۔ | U | R | |

U+0700 | SYRIAC END OF PARAGRAPH | ܀ | U | R | |

U+0701 | SYRIAC SUPRALINEAR FULL STOP | ܁ | U | R | |

U+0702 | SYRIAC SUBLINEAR FULL STOP | ܂ | U | R | |

U+0703 | SYRIAC SUPRALINEAR COLON | ܃ | U | R | |

U+0704 | SYRIAC SUBLINEAR COLON | ܄ | U | R | |

U+0705 | SYRIAC HORIZONTAL COLON | ܅ | U | R | |

U+0706 | SYRIAC COLON SKEWED LEFT | ܆ | U | R | |

U+0707 | SYRIAC COLON SKEWED RIGHT | ܇ | U | R | |

U+0708 | SYRIAC SUPRALINEAR COLON SKEWED LEFT | ܈ | U | R | |

U+0709 | SYRIAC SUBLINEAR COLON SKEWED RIGHT | ܉ | U | R | |

U+070A | SYRIAC CONTRACTION | ܊ | U | R | |

U+070B | SYRIAC HARKLEAN OBELUS | ܋ | U | R | |

U+070C | SYRIAC HARKLEAN METOBELUS | ܌ | U | R | |

U+070D | SYRIAC HARKLEAN ASTERISCUS | ܍ | U | R | |

U+07F7 | NKO SYMBOL GBAKURUNEN | ߷ | U | R | |

U+07F8 | NKO COMMA | ߸ | U | R | |

U+07F9 | NKO EXCLAMATION MARK | ߹ | U | R | |

U+0830 | SAMARITAN PUNCTUATION NEQUDAA | ࠰ | U | R | |

U+0831 | SAMARITAN PUNCTUATION AFSAAQ | ࠱ | U | R | |

U+0832 | SAMARITAN PUNCTUATION ANGED | ࠲ | U | R | |

U+0833 | SAMARITAN PUNCTUATION BAU | ࠳ | U | R | |

U+0834 | SAMARITAN PUNCTUATION ATMAAU | ࠴ | U | R | |

U+0835 | SAMARITAN PUNCTUATION SHIYYAALAA | ࠵ | U | R | |

U+0836 | SAMARITAN ABBREVIATION MARK | ࠶ | U | R | |

U+0837 | SAMARITAN PUNCTUATION MELODIC QITSA | ࠷ | U | R | |

U+0838 | SAMARITAN PUNCTUATION ZIQAA | ࠸ | U | R | |

U+0839 | SAMARITAN PUNCTUATION QITSA | ࠹ | U | R | |

U+083A | SAMARITAN PUNCTUATION ZAEF | ࠺ | U | R | |

U+083B | SAMARITAN PUNCTUATION TURU | ࠻ | U | R | |

U+083C | SAMARITAN PUNCTUATION ARKAANU | ࠼ | U | R | |

U+083D | SAMARITAN PUNCTUATION SOF MASHFAAT | ࠽ | U | R | |

U+083E | SAMARITAN PUNCTUATION ANNAAU | ࠾ | U | R | |

U+085E | MANDAIC PUNCTUATION | ࡞ | U | R | Mandaic DVO=U |

U+0964 | DEVANAGARI DANDA | । | U | R | |

U+0965 | DEVANAGARI DOUBLE DANDA | ॥ | U | R | |

U+0970 | DEVANAGARI ABBREVIATION SIGN | ॰ | U | R | |

U+0AF0 | GUJARATI ABBREVIATION SIGN | ૰ | U | R | |

U+0DF4 | SINHALA PUNCTUATION KUNDDALIYA | ෴ | U | R | |

U+0E4F | THAI CHARACTER FONGMAN | ๏ | U | R | |

U+0E5A | THAI CHARACTER ANGKHANKHU | ๚ | U | R | |

U+0E5B | THAI CHARACTER KHOMUT | ๛ | U | R | |

U+0F04 | TIBETAN MARK INITIAL YIG MGO MDUN MA | ༄ | U | R | |

U+0F05 | TIBETAN MARK CLOSING YIG MGO SGAB MA | ༅ | U | R | |

U+0F06 | TIBETAN MARK CARET YIG MGO PHUR SHAD MA | ༆ | U | R | |

U+0F07 | TIBETAN MARK YIG MGO TSHEG SHAD MA | ༇ | U | R | |

U+0F08 | TIBETAN MARK SBRUL SHAD | ༈ | U | R | |

U+0F09 | TIBETAN MARK BSKUR YIG MGO | ༉ | U | R | |

U+0F0A | TIBETAN MARK BKA- SHOG YIG MGO | ༊ | U | R | |

U+0F0B | TIBETAN MARK INTERSYLLABIC TSHEG | ་ | U | R | |

U+0F0C | TIBETAN MARK DELIMITER TSHEG BSTAR | ༌ | U | R | |

U+0F0D | TIBETAN MARK SHAD | ། | U | R | |

U+0F0E | TIBETAN MARK NYIS SHAD | ༎ | U | R | |

U+0F0F | TIBETAN MARK TSHEG SHAD | ༏ | U | R | |

U+0F10 | TIBETAN MARK NYIS TSHEG SHAD | ༐ | U | R | |

U+0F11 | TIBETAN MARK RIN CHEN SPUNGS SHAD | ༑ | U | R | |

U+0F12 | TIBETAN MARK RGYA GRAM SHAD | ༒ | U | R | |

U+0F14 | TIBETAN MARK GTER TSHEG | ༔ | U | R | |

U+0F85 | TIBETAN MARK PALUTA | ྅ | U | R | |

U+0FD0 | TIBETAN MARK BSKA- SHOG GI MGO RGYAN | ࿐ | U | R | |

U+0FD1 | TIBETAN MARK MNYAM YIG GI MGO RGYAN | ࿑ | U | R | |

U+0FD2 | TIBETAN MARK NYIS TSHEG | ࿒ | U | R | |

U+0FD3 | TIBETAN MARK INITIAL BRDA RNYING YIG MGO MDUN MA | ࿓ | U | R | |

U+0FD4 | TIBETAN MARK CLOSING BRDA RNYING YIG MGO SGAB MA | ࿔ | U | R | |

U+0FD9 | TIBETAN MARK LEADING MCHAN RTAGS | ࿙ | U | R | |

U+0FDA | TIBETAN MARK TRAILING MCHAN RTAGS | ࿚ | U | R | |

U+104A | MYANMAR SIGN LITTLE SECTION | ၊ | U | R | |

U+104B | MYANMAR SIGN SECTION | ။ | U | R | |

U+104C | MYANMAR SYMBOL LOCATIVE | ၌ | U | R | |

U+104D | MYANMAR SYMBOL COMPLETED | ၍ | U | R | |

U+104E | MYANMAR SYMBOL AFOREMENTIONED | ၎ | U | R | |

U+104F | MYANMAR SYMBOL GENITIVE | ၏ | U | R | |

U+10FB | GEORGIAN PARAGRAPH SEPARATOR | ჻ | U | R | |

U+1360 | ETHIOPIC SECTION MARK | ፠ | U | R | |

U+1361 | ETHIOPIC WORDSPACE | ፡ | U | R | |

U+1362 | ETHIOPIC FULL STOP | ። | U | R | |

U+1363 | ETHIOPIC COMMA | ፣ | U | R | |

U+1364 | ETHIOPIC SEMICOLON | ፤ | U | R | |

U+1365 | ETHIOPIC COLON | ፥ | U | R | |

U+1366 | ETHIOPIC PREFACE COLON | ፦ | U | R | |

U+1367 | ETHIOPIC QUESTION MARK | ፧ | U | R | |

U+1368 | ETHIOPIC PARAGRAPH SEPARATOR | ፨ | U | R | |

U+166D | CANADIAN SYLLABICS CHI SIGN | ᙭ | U | R | |

U+166E | CANADIAN SYLLABICS FULL STOP | ᙮ | U | R | |

U+16EB | RUNIC SINGLE PUNCTUATION | ᛫ | U | R | |

U+16EC | RUNIC MULTIPLE PUNCTUATION | ᛬ | U | R | |

U+16ED | RUNIC CROSS PUNCTUATION | ᛭ | U | R | |

U+1735 | PHILIPPINE SINGLE PUNCTUATION | ᜵ | U | R | |

U+1736 | PHILIPPINE DOUBLE PUNCTUATION | ᜶ | U | R | |

U+17D4 | KHMER SIGN KHAN | ។ | U | R | |

U+17D5 | KHMER SIGN BARIYOOSAN | ៕ | U | R | |

U+17D6 | KHMER SIGN CAMNUC PII KUUH | ៖ | U | R | |

U+17D8 | KHMER SIGN BEYYAL | ៘ | U | R | |

U+17D9 | KHMER SIGN PHNAEK MUAN | ៙ | U | R | |

U+17DA | KHMER SIGN KOOMUUT | ៚ | U | R | |

U+1800 | MONGOLIAN BIRGA | ᠀ | V | V | Match Mongolian letters DVO=U |

U+1801 | MONGOLIAN ELLIPSIS | ᠁ | V | V | Match Mongolian letters DVO=U |

U+1802 | MONGOLIAN COMMA | ᠂ | V | V | Match Mongolian letters DVO=U |

U+1803 | MONGOLIAN FULL STOP | ᠃ | V | V | Match Mongolian letters DVO=U |

U+1804 | MONGOLIAN COLON | ᠄ | V | V | Match Mongolian letters DVO=U |

U+1805 | MONGOLIAN FOUR DOTS | ᠅ | V | V | Match Mongolian letters DVO=U |

U+1807 | MONGOLIAN SIBE SYLLABLE BOUNDARY MARKER | ᠇ | V | V | Match Mongolian letters DVO=U |

U+1808 | MONGOLIAN MANCHU COMMA | ᠈ | V | V | Match Mongolian letters DVO=U |

U+1809 | MONGOLIAN MANCHU FULL STOP | ᠉ | V | V | Match Mongolian letters DVO=U |

U+180A | MONGOLIAN NIRUGU | ᠊ | V | V | Match Mongolian letters DVO=U |

U+1944 | LIMBU EXCLAMATION MARK | ᥄ | U | R | |

U+1945 | LIMBU QUESTION MARK | ᥅ | U | R | |

U+1A1E | BUGINESE PALLAWA | ᨞ | U | R | |

U+1A1F | BUGINESE END OF SECTION | ᨟ | U | R | |

U+1AA0 | TAI THAM SIGN WIANG | ᪠ | U | R | |

U+1AA1 | TAI THAM SIGN WIANGWAAK | ᪡ | U | R | |

U+1AA2 | TAI THAM SIGN SAWAN | ᪢ | U | R | |

U+1AA3 | TAI THAM SIGN KEOW | ᪣ | U | R | |

U+1AA4 | TAI THAM SIGN HOY | ᪤ | U | R | |

U+1AA5 | TAI THAM SIGN DOKMAI | ᪥ | U | R | |

U+1AA6 | TAI THAM SIGN REVERSED ROTATED RANA | ᪦ | U | R | |

U+1AA8 | TAI THAM SIGN KAAN | ᪨ | U | R | |

U+1AA9 | TAI THAM SIGN KAANKUU | ᪩ | U | R | |

U+1AAA | TAI THAM SIGN SATKAAN | ᪪ | U | R | |

U+1AAB | TAI THAM SIGN SATKAANKUU | ᪫ | U | R | |

U+1AAC | TAI THAM SIGN HANG | ᪬ | U | R | |

U+1AAD | TAI THAM SIGN CAANG | ᪭ | U | R | |

U+1B5A | BALINESE PANTI | ᭚ | U | R | |

U+1B5B | BALINESE PAMADA | ᭛ | U | R | |

U+1B5C | BALINESE WINDU | ᭜ | U | R | |

U+1B5D | BALINESE CARIK PAMUNGKAH | ᭝ | U | R | |

U+1B5E | BALINESE CARIK SIKI | ᭞ | U | R | |

U+1B5F | BALINESE CARIK PAREREN | ᭟ | U | R | |

U+1B60 | BALINESE PAMENENG | ᭠ | U | R | |

U+1BFC | BATAK SYMBOL BINDU NA METEK | ᯼ | U | R | |

U+1BFD | BATAK SYMBOL BINDU PINARBORAS | ᯽ | U | R | |

U+1BFE | BATAK SYMBOL BINDU JUDUL | ᯾ | U | R | |

U+1BFF | BATAK SYMBOL BINDU PANGOLAT | ᯿ | U | R | |

U+1C3B | LEPCHA PUNCTUATION TA-ROL | ᰻ | U | R | |

U+1C3C | LEPCHA PUNCTUATION NYET THYOOM TA-ROL | ᰼ | U | R | |

U+1C3D | LEPCHA PUNCTUATION CER-WA | ᰽ | U | R | |

U+1C3E | LEPCHA PUNCTUATION TSHOOK CER-WA | ᰾ | U | R | |

U+1C3F | LEPCHA PUNCTUATION TSHOOK | ᰿ | U | R | |

U+1C7E | OL CHIKI PUNCTUATION MUCAAD | ᱾ | U | R | |

U+1C7F | OL CHIKI PUNCTUATION DOUBLE MUCAAD | ᱿ | U | R | |

U+1CC0 | SUNDANESE PUNCTUATION BINDU SURYA | ᳀ | U | R | |

U+1CC1 | SUNDANESE PUNCTUATION BINDU PANGLONG | ᳁ | U | R | |

U+1CC2 | SUNDANESE PUNCTUATION BINDU PURNAMA | ᳂ | U | R | |

U+1CC3 | SUNDANESE PUNCTUATION BINDU CAKRA | ᳃ | U | R | |

U+1CC4 | SUNDANESE PUNCTUATION BINDU LEU SATANGA | ᳄ | U | R | |

U+1CC5 | SUNDANESE PUNCTUATION BINDU KA SATANGA | ᳅ | U | R | |

U+1CC6 | SUNDANESE PUNCTUATION BINDU DA SATANGA | ᳆ | U | R | |

U+1CC7 | SUNDANESE PUNCTUATION BINDU BA SATANGA | ᳇ | U | R | |

U+1CD3 | VEDIC SIGN NIHSHVASA | ᳓ | U | R | |

U+2CF9 | COPTIC OLD NUBIAN FULL STOP | ⳹ | U | R | |

U+2CFA | COPTIC OLD NUBIAN DIRECT QUESTION MARK | ⳺ | U | R | |

U+2CFB | COPTIC OLD NUBIAN INDIRECT QUESTION MARK | ⳻ | U | R | |

U+2CFC | COPTIC OLD NUBIAN VERSE DIVIDER | ⳼ | U | R | |

U+2CFE | COPTIC FULL STOP | ⳾ | U | R | |

U+2CFF | COPTIC MORPHOLOGICAL DIVIDER | ⳿ | U | R | |

U+2D70 | TIFINAGH SEPARATOR MARK | ⵰ | U | R | |

U+A4FE | LISU PUNCTUATION COMMA | ꓾ | U | R | |

U+A4FF | LISU PUNCTUATION FULL STOP | ꓿ | U | R | |

U+A60D | VAI COMMA | ꘍ | U | R | |

U+A60E | VAI FULL STOP | ꘎ | U | R | |

U+A60F | VAI QUESTION MARK | ꘏ | U | R | |

U+A673 | SLAVONIC ASTERISK | ꙳ | U | R | |

U+A67E | CYRILLIC KAVYKA | ꙾ | U | R | |

U+A6F2 | BAMUM NJAEMLI | ꛲ | U | R | |

U+A6F3 | BAMUM FULL STOP | ꛳ | U | R | |

U+A6F4 | BAMUM COLON | ꛴ | U | R | |

U+A6F5 | BAMUM COMMA | ꛵ | U | R | |

U+A6F6 | BAMUM SEMICOLON | ꛶ | U | R | |

U+A6F7 | BAMUM QUESTION MARK | ꛷ | U | R | |

U+A874 | PHAGS-PA SINGLE HEAD MARK | ꡴ | V | V | Match Phags-pa letters DVO=U |

U+A875 | PHAGS-PA DOUBLE HEAD MARK | ꡵ | V | V | Match Phags-pa letters DVO=U |

U+A876 | PHAGS-PA MARK SHAD | ꡶ | V | V | Match Phags-pa letters DVO=U |

U+A877 | PHAGS-PA MARK DOUBLE SHAD | ꡷ | V | V | Match Phags-pa letters DVO=U |

U+A8CE | SAURASHTRA DANDA | ꣎ | U | R | |

U+A8CF | SAURASHTRA DOUBLE DANDA | ꣏ | U | R | |

U+A8F8 | DEVANAGARI SIGN PUSHPIKA | ꣸ | U | R | |

U+A8F9 | DEVANAGARI GAP FILLER | ꣹ | U | R | |

U+A8FA | DEVANAGARI CARET | ꣺ | U | R | |

U+A92E | KAYAH LI SIGN CWI | ꤮ | U | R | |

U+A92F | KAYAH LI SIGN SHYA | ꤯ | U | R | |

U+A95F | REJANG SECTION MARK | ꥟ | U | R | |

U+A9C1 | JAVANESE LEFT RERENGGAN | ꧁ | U | R | |

U+A9C2 | JAVANESE RIGHT RERENGGAN | ꧂ | U | R | |

U+A9C3 | JAVANESE PADA ANDAP | ꧃ | U | R | |

U+A9C4 | JAVANESE PADA MADYA | ꧄ | U | R | |

U+A9C5 | JAVANESE PADA LUHUR | ꧅ | U | R | |

U+A9C6 | JAVANESE PADA WINDU | ꧆ | U | R | |

U+A9C7 | JAVANESE PADA PANGKAT | ꧇ | U | R | |

U+A9C8 | JAVANESE PADA LINGSA | ꧈ | U | R | |

U+A9C9 | JAVANESE PADA LUNGSI | ꧉ | U | R | |

U+A9CA | JAVANESE PADA ADEG | ꧊ | U | R | |

U+A9CB | JAVANESE PADA ADEG ADEG | ꧋ | U | R | |

U+A9CC | JAVANESE PADA PISELEH | ꧌ | U | R | |

U+A9CD | JAVANESE TURNED PADA PISELEH | ꧍ | U | R | |

U+A9DE | JAVANESE PADA TIRTA TUMETES | ꧞ | U | R | |

U+A9DF | JAVANESE PADA ISEN-ISEN | ꧟ | U | R | |

U+AA5C | CHAM PUNCTUATION SPIRAL | ꩜ | U | R | |

U+AA5D | CHAM PUNCTUATION DANDA | ꩝ | U | R | |

U+AA5E | CHAM PUNCTUATION DOUBLE DANDA | ꩞ | U | R | |

U+AA5F | CHAM PUNCTUATION TRIPLE DANDA | ꩟ | U | R | |

U+AADE | TAI VIET SYMBOL HO HOI | ꫞ | U | R | |

U+AADF | TAI VIET SYMBOL KOI KOI | ꫟ | U | R | |

U+AAF0 | MEETEI MAYEK CHEIKHAN | ꫰ | U | R | |

U+AAF1 | MEETEI MAYEK AHANG KHUDAM | ꫱ | U | R | |

U+ABEB | MEETEI MAYEK CHEIKHEI | ꯫ | U | R | |

U+10100 | AEGEAN WORD SEPARATOR LINE | 𐄀 | U | R | |

U+10101 | AEGEAN WORD SEPARATOR DOT | 𐄁 | U | R | |

U+10102 | AEGEAN CHECK MARK | 𐄂 | U | R | |

U+1039F | UGARITIC WORD DIVIDER | 𐎟 | U | R | |

U+103D0 | OLD PERSIAN WORD DIVIDER | 𐏐 | U | R | |

U+10857 | IMPERIAL ARAMAIC SECTION SIGN | 𐡗 | U | R | |

U+1091F | PHOENICIAN WORD SEPARATOR | 𐤟 | U | R | |

U+1093F | LYDIAN TRIANGULAR MARK | 𐤿 | U | R | |

U+10A50 | KHAROSHTHI PUNCTUATION DOT | 𐩐 | U | R | |

U+10A51 | KHAROSHTHI PUNCTUATION SMALL CIRCLE | 𐩑 | U | R | |

U+10A52 | KHAROSHTHI PUNCTUATION CIRCLE | 𐩒 | U | R | |

U+10A53 | KHAROSHTHI PUNCTUATION CRESCENT BAR | 𐩓 | U | R | |

U+10A54 | KHAROSHTHI PUNCTUATION MANGALAM | 𐩔 | U | R | |

U+10A55 | KHAROSHTHI PUNCTUATION LOTUS | 𐩕 | U | R | |

U+10A56 | KHAROSHTHI PUNCTUATION DANDA | 𐩖 | U | R | |

U+10A57 | KHAROSHTHI PUNCTUATION DOUBLE DANDA | 𐩗 | U | R | |

U+10A58 | KHAROSHTHI PUNCTUATION LINES | 𐩘 | U | R | |

U+10A7F | OLD SOUTH ARABIAN NUMERIC INDICATOR | 𐩿 | U | R | |

U+10B39 | AVESTAN ABBREVIATION MARK | 𐬹 | U | R | |

U+11047 | BRAHMI DANDA | 𑁇 | U | R | |

U+11048 | BRAHMI DOUBLE DANDA | 𑁈 | U | R | |

U+11049 | BRAHMI PUNCTUATION DOT | 𑁉 | U | R | |

U+1104A | BRAHMI PUNCTUATION DOUBLE DOT | 𑁊 | U | R | |

U+1104B | BRAHMI PUNCTUATION LINE | 𑁋 | U | R | |

U+1104C | BRAHMI PUNCTUATION CRESCENT BAR | 𑁌 | U | R | |

U+1104D | BRAHMI PUNCTUATION LOTUS | 𑁍 | U | R | |

U+110BB | KAITHI ABBREVIATION SIGN | 𑂻 | U | R | |

U+110BC | KAITHI ENUMERATION SIGN | 𑂼 | U | R | |

U+110BE | KAITHI SECTION MARK | 𑂾 | U | R | |

U+110BF | KAITHI DOUBLE SECTION MARK | 𑂿 | U | R | |

U+110C0 | KAITHI DANDA | 𑃀 | U | R | |

U+110C1 | KAITHI DOUBLE DANDA | 𑃁 | U | R | |

U+11140 | CHAKMA SECTION MARK | 𑅀 | U | R | |

U+11141 | CHAKMA DANDA | 𑅁 | U | R | |

U+11142 | CHAKMA DOUBLE DANDA | 𑅂 | U | R | |

U+11143 | CHAKMA QUESTION MARK | 𑅃 | U | R | |

U+111C5 | SHARADA DANDA | 𑇅 | U | R | |

U+111C6 | SHARADA DOUBLE DANDA | 𑇆 | U | R | |

U+111C7 | SHARADA ABBREVIATION SIGN | 𑇇 | U | R | |

U+111C8 | SHARADA SEPARATOR | 𑇈 | U | R | |

U+12470 | CUNEIFORM PUNCTUATION SIGN OLD ASSYRIAN WORD DIVIDER | 𒑰 | U | R | |

U+12471 | CUNEIFORM PUNCTUATION SIGN VERTICAL COLON | 𒑱 | U | R | |

U+12472 | CUNEIFORM PUNCTUATION SIGN DIAGONAL COLON | 𒑲 | U | R | |

U+12473 | CUNEIFORM PUNCTUATION SIGN DIAGONAL TRICOLON | 𒑳 | U | R |

Code | Description | UTR | WM | Memo |
---|---|---|---|---|

U+2018-2019 | LEFT/RIGHT SINGLE QUOTATION MARK | T | UF | Some people says T, while some says SB. Some says these should be consistent with U+201C/201D. |

U+201C-201D | Curly quotes | SB | V | JLREQ defines “use double curly quotes in horizontal and double prime in vertical”. Some people think these are SB because these code points are for horizontal only, and it's author's responsibility to replace them to U+301D/301F in vertical flow. Some says T, so that replacement can happen automatically when user switched text flow just like small Kana. Since it's split, probably T is better. |

U+201E-201F | DOUBLE LOW/HIGH-REVERSED–9 QUOTATION MARK | SB | UF | Some people says T, while some says SB. Some says these should be consistent with U+201C/201D. |

U+301D/301F | REVERSED/Low DOUBLE PRIME QUOTATION MARK | T | V | These are double-quotes for vertical flow in Japanese. Some fonts use these glyphs as vert for U+2018/2019. Some fonts (Meiryo) uses DOUBLE PRIME glyphs for U+2018/2019 even in horizontal flow. These can also be used in math as double-dashes? T is probably good. |

U+301E | DOUBLE PRIME QUOTATION MARK | T | V | These are glyphs only for vertical. T is probably good. UTR#50 DVO has quotes S |

Code | Description | UTR | WM | Memo |
---|---|---|---|---|

U+00A9 | COPYRIGHT SIGN | U | S | Examples: copyright_horz.png copyright_vert.jpg |

U+00AE | REGISTERED SIGN | U | S | UTR#50 4.4 mentions this could be contextual. Tailoring can be another option along with U+00A9 COPYRIGHT SIGN. |

U+2016 | Double vertical line | U | V | Typically rotated in Japanese typesetting |

U+3033-3035 | VERTICAL KANA REPEAT MARK | U | U | These are glyphs only for vertical. T is probably good. |

U+303B | VERTICAL IDEOGRAPHIC ITERATION MARK | U | U | These are glyphs only for vertical. T is probably good. |

U+3303, 3305, 3306, etc. | SQUARE of 3 chars | T | U | When one char in a line, it can be either align center or left (top) and that should be up to font designers. Showing representative glyphs can be misleading. |

U+337B-337F | Square era names | T | U | Adobe/Apple fonts use vertically compressed glyphs, while MS/Ricoh fonts use horizontally compressed glyphs using Tate-Chu-Yoko. JLTF says both should be allowed. |

U+FF0C | FULLWIDTH COMMA | U | U | Should be T, to make consistent with U+FF0C. |

U+FF0E | FULLWIDTH FULL STOP | U | U | Should be T, to make consistent with U+3002. |

U+FF1A | Fullwidth colon | U | V | Typically rotated in Japanese typesetting. Traditional Chinese typesets upright. Should be T. |

U+FF1B | Fullwidth semicolon | U | V | Not used in Japanese, typically upright with no vert altglyph. Fonts seem inconsistent. Traditional Chinese typesets upright. Should be T. |

U+FF1D | Fullwidth equals sign | U | U | Typically rotated in Japanese fonts. What about Chinese? |

U+FF5C | Fullwidth vertical line | U | U | Typically rotated in Japanese typesetting |

- EAW=JKST means the code point existed in legacy encoding of Japan/Korean/Simplified Chinese/Traditional Chinese. One could argue that EAC is legacy, but even today's Input Methods emit them, so EAW=A can indicate that the code are likely to be used even today.

Code | Name | Cat | EAC | EAO | WM | EAW | Comments |
---|---|---|---|---|---|---|---|

2100 | ACCOUNT OF | So | 19.3 | U | S | N | |

2101 | ADDRESSED TO THE SUBJECT | So | 19.3 | U | S | N | |

2102 | DOUBLE-STRUCK CAPITAL C | Lu | 19.3 | U | S | N | |

2103 | DEGREE CELSIUS | So | 13 | U | S | JKST | Googling “air temperature” (気温) hits several usage of this code point; e.g., 1, 2, 3. They appear after ASCII/full-width digits because (I guess) they assume web is horizontal, but I expect it be formatted as full-width or Han digits if vertical. Type “do” (degree) in MS-IME and you get this. |

2104 | CENTRE LINE SYMBOL | So | 19.3 | U | S | N | |

2105 | CARE OF | So | 19.3 | U | S | ST | |

2106 | CADA UNA | So | 19.3 | U | S | N | |

2107 | EULER CONSTANT | Lu | 19.3 | U | S | N | |

2108 | SCRUPLE | So | 19.3 | U | S | N | |

2109 | DEGREE FAHRENHEIT | So | 13 | U | S | KST | |

210A | SCRIPT SMALL G | Ll | 19.3 | U | S | N | |

210B | SCRIPT CAPITAL H | Lu | 19.3 | U | S | N | |

210C | BLACK-LETTER CAPITAL H | Lu | 19.3 | U | S | N | |

210D | DOUBLE-STRUCK CAPITAL H | Lu | 19.3 | U | S | N | |

210E | PLANCK CONSTANT | Ll | 19.3 | U | S | N | |

210F | PLANCK CONSTANT OVER TWO PI | Ll | 19.3 | U | S | N | |

2110 | SCRIPT CAPITAL I | Lu | 19.3 | U | S | N | |

2111 | BLACK-LETTER CAPITAL I | Lu | 19.3 | U | S | N | |

2112 | SCRIPT CAPITAL L | Lu | 19.3 | U | S | N | |

2113 | SCRIPT SMALL L | Ll | 13 | U | S | K | |

2114 | L B BAR SYMBOL | So | 19.3 | U | S | N | |

2115 | DOUBLE-STRUCK CAPITAL N | Lu | 19.3 | U | S | N | |

2116 | NUMERO SIGN | So | 12 | U | S | JKS | Upright makes sense to me and I remember I saw some instances, but can't find right now. I'll look for further. Type “bangou” (number) in MS-IME and you get this. |

2117 | SOUND RECORDING COPYRIGHT | So | 19.3 | U | S | N | |

2118 | SCRIPT CAPITAL P | Sm | 19.3 | U | S | N | |

2119 | DOUBLE-STRUCK CAPITAL P | Lu | 19.3 | U | S | N | |

211A | DOUBLE-STRUCK CAPITAL Q | Lu | 19.3 | U | S | N | |

211B | SCRIPT CAPITAL R | Lu | 19.3 | U | S | N | |

211C | BLACK-LETTER CAPITAL R | Lu | 19.3 | U | S | N | |

211D | DOUBLE-STRUCK CAPITAL R | Lu | 19.3 | U | S | N | |

211E | PRESCRIPTION TAKE | So | 19.3 | U | S | N | |

211F | RESPONSE | So | 19.3 | U | S | N | |

2120 | SERVICE MARK | So | 19.3 | U | S | N | |

2121 | TELEPHONE SIGN | So | 19.3 | U | S | JKS | I thought I can find examples in vertical-flow-name cards, but could not find the use of this code. I'm pretty sure if I send this code as text to printing company, they'll set upright though. I'll look for further. Type “denwa” (phone) in MS-IME and you get this. |

2122 | TRADE MARK SIGN | So | 19.3 | U | S | K | |

2123 | VERSICLE | So | 19.3 | U | S | N | |

2124 | DOUBLE-STRUCK CAPITAL Z | Lu | 19.3 | U | S | N | |

2125 | OUNCE SIGN | So | 19.3 | U | S | N | |

2126 | OHM SIGN | Lu | 19.3 | U | S | K | |

2127 | INVERTED OHM SIGN | So | 19.3 | U | S | N | |

2128 | BLACK-LETTER CAPITAL Z | Lu | 19.3 | U | S | N | |

2129 | TURNED GREEK SMALL LETTER IOTA | So | 19.3 | U | S | N | |

212A | KELVIN SIGN | Lu | 19.3 | U | S | N | |

212B | ANGSTROM SIGN | Lu | 19.3 | U | S | JK | Type “ongusutoro-mu” (angstrom) in MS-IME and you get this; e.g., 1, 2. Interestingly, there're questions on the web asking “how to type half-width angstrom.” Answers are “apply English font” or “type using English keyboard.” The code point is the same, but Japanese fonts usually have full-width glyph here, while roman fonts have proportional, so that's what they mean by “half-width”. I couldn't find “how to type full-width angstrom” question; no idea because it's not used, or because it's too easy to figure out (use Input Method), but my guess is this is less commonly used unit symbol and therefore far less important to set upright than other symbols such as U+2103, U+2116, or U+2121. |

212C | SCRIPT CAPITAL B | Lu | 19.3 | U | S | N | |

212D | BLACK-LETTER CAPITAL C | Lu | 19.3 | U | S | N | |

212E | ESTIMATED SYMBOL | So | 19.3 | U | S | N | |

212F | SCRIPT SMALL E | Ll | 19.3 | U | S | N | |

2130 | SCRIPT CAPITAL E | Lu | 19.3 | U | S | N | |

2131 | SCRIPT CAPITAL F | Lu | 19.3 | U | S | N | |

2132 | TURNED CAPITAL F | Lu | 19.3 | U | S | N | |

2133 | SCRIPT CAPITAL M | Lu | 19.3 | U | S | N | |

2134 | SCRIPT SMALL O | Ll | 19.3 | U | S | N | |

2135 | ALEF SYMBOL | Lo | 19.3 | U | S | N | |

2136 | BET SYMBOL | Lo | 19.3 | U | S | N | |

2137 | GIMEL SYMBOL | Lo | 19.3 | U | S | N | |

2138 | DALET SYMBOL | Lo | 19.3 | U | S | N | |

2139 | INFORMATION SOURCE | Ll | 19.3 | U | S | N | |

213A | ROTATED CAPITAL Q | So | 19.3 | U | S | N | |

213B | FACSIMILE SIGN | So | 19.3 | U | S | N | |

213C | DOUBLE-STRUCK SMALL PI | Ll | 19.3 | U | S | N | |

213D | DOUBLE-STRUCK SMALL GAMMA | Ll | 19.3 | U | S | N | |

213E | DOUBLE-STRUCK CAPITAL GAMMA | Lu | 19.3 | U | S | N | |

213F | DOUBLE-STRUCK CAPITAL PI | Lu | 19.3 | U | S | N | |

2140 | DOUBLE-STRUCK N-ARY SUMMATION | Sm | 19.3 | U | S | N | |

2141 | TURNED SANS-SERIF CAPITAL G | Sm | 19.3 | U | S | N | |

2142 | TURNED SANS-SERIF CAPITAL L | Sm | 19.3 | U | S | N | |

2143 | REVERSED SANS-SERIF CAPITAL L | Sm | 19.3 | U | S | N | |

2144 | TURNED SANS-SERIF CAPITAL Y | Sm | 19.3 | U | S | N | |

2145 | DOUBLE-STRUCK ITALIC CAPITAL D | Lu | 19.3 | U | S | N | |

2146 | DOUBLE-STRUCK ITALIC SMALL D | Ll | 19.3 | U | S | N | |

2147 | DOUBLE-STRUCK ITALIC SMALL E | Ll | 19.3 | U | S | N | |

2148 | DOUBLE-STRUCK ITALIC SMALL I | Ll | 19.3 | U | S | N | |

2149 | DOUBLE-STRUCK ITALIC SMALL J | Ll | 19.3 | U | S | N | |

214A | PROPERTY LINE | So | 19.3 | U | S | N | |

214B | TURNED AMPERSAND | Sm | 19.3 | U | S | N | |

214C | PER SIGN | So | 19.3 | U | S | N | |

214D | AKTIESELSKAB | So | 19.3 | U | S | N | |

214E | TURNED SMALL F | Ll | 19.3 | U | S | N | |

214F | SYMBOL FOR SAMARITAN SOURCE | So | 19.3 | U | S | N |