Punctuation Orientation By Codepoint

This page is intended to help analyze troublesome characters like punctuation and symbols. It is not comprehensive at all yet.

Category Codes:

CodeUTR50MSFTMeaning
UUSUpright; translates between horizontal and vertical
RSRSideways; rotates between horizontal and vertical
TUTSTTypeset upright with alternate glyph. Best fallback is just upright.
TRSBRTTypeset upright with alternate glyph. Best fallback is just sideways.

Two modes are presented: Stacking (text-orientation: upright) and Default (TBD).

FIXME Default orientation is not covered yet; focusing on stacked mode first because it's simpler.

Connecting Punctuation

Punctuation, Connector (Pc)

See Ken Whistler's notes on this category.

CodeDescriptionCharStackMixedMemo
U+005FLOW LINE_URMatch double low line, overline
U+203FUNDERTIERRIntended to link consecutive letters
U+2040CHARACTER TIERRIntended to link consecutive letters
U+2054INVERTED UNDERTIERRIntended to link consecutive letters
U+FE33PRESENTATION FORM FOR VERTICAL LOW LINEUUVertical presentation forms always upright
U+FE34PRESENTATION FORM FOR VERTICAL WAVY LOW LINEUUVertical presentation forms always upright
U+FE4DDASHED LOW LINEURMatch low line
U+FE4ECENTRELINE LOW LINEURMatch low line
U+FE4FWAVY LOW LINEURMatch low line
U+FF3FFULLWIDTH LOW LINE_URMatch low line

Punctuation, Dash (Pd)

CodeDescriptionCharStackMixMemo
U+002DHYPHEN-MINUS-RRThis character is used as hyphen, as minus, and as dash. Suggested to treating as dash / hyphen-bullet, since this seems to be more common than use as hyphen or minus.
U+058AARMENIAN HYPHEN֊RRHyphens are sideways
U+05BEHEBREW PUNCTUATION MAQAF־RR
U+1400CANADIAN SYLLABICS HYPHENRRHyphens are sideways
U+1806MONGOLIAN TODO SOFT HYPHENVVMongolian is always sideways ?:? DVO=U
U+2010HYPHENRRHyphens are sideways
U+2011NON-BREAKING HYPHENRRHyphens are sideways
U+2012FIGURE DASHRRDashes are always sideways
U+2013EN DASHRRDashes are always sideways
U+2014EM DASHRRDashes are always sideways
U+2015HORIZONTAL BARRRDashes are always sideways (EM DASH in Windows code page)
U+2E17DOUBLE OBLIQUE HYPHENRRHyphens are sideways
U+2E1AHYPHEN WITH DIAERESISRRHyphens are sideways
U+2E3ATWO-EM DASHRRDashes are always sideways
U+2E3BTHREE-EM DASHRRDashes are always sideways
U+301CWAVE DASHTRTRWave dash must transform :?: DVO=T
U+3030WAVY DASHTRTRWave dash must transform :?: DVO=T
U+30A0KATAKANA-HIRAGANA DOUBLE HYPHENTRTR Sideways in JIS. Japanese fonts with this glyph expected to have vertical alternate.
U+FE31PRESENTATION FORM FOR VERTICAL EM DASHUUVertical presentation forms are always upright
U+FE32PRESENTATION FORM FOR VERTICAL EN DASHUUVertical presentation forms are always upright
U+FE58SMALL EM DASHRRDashes are always sideways
U+FF5EFULLWIDTH TILDETT:!: Dashes are sideways, and this is considered equivalent to WAVE DASH U+301C even though it's technically a Math Symbol (Sm) fullwidth variant of U+007E
U+FE63SMALL HYPHEN-MINUSRRMatch fullwidth variant
U+FF0DFULLWIDTH HYPHEN-MINUSRRUsed as dash

Enclosing Punctuation

Punctuation, Open (Ps)

CodeDescriptionCharStackMixMemo
U+0028LEFT PARENTHESIS(RRBrackets are sideways to enclose their text
U+005BLEFT SQUARE BRACKET[RRBrackets are sideways to enclose their text
U+007BLEFT CURLY BRACKET{RRBrackets are sideways to enclose their text
U+0F3ATIBETAN MARK GUG RTAGS GYONUR:?: Unsure about Tibetan, assuming upright for stacked mode
U+0F3CTIBETAN MARK ANG KHANG GYONUR:?: Unsure about Tibetan, assuming upright for stacked mode
U+169BOGHAM FEATHER MARKRROgham is always sideways
U+201ASINGLE LOW-9 QUOTATION MARKTURQuotation marks are upright in stacked mode :?: DVO=S
U+201EDOUBLE LOW-9 QUOTATION MARKTURQuotation marks are upright in stacked mode :?: DVO=S
U+2045LEFT SQUARE BRACKET WITH QUILLRRBrackets are sideways to enclose their text
U+207DSUPERSCRIPT LEFT PARENTHESISRRBrackets are sideways to enclose their text :?: DVO=U
U+208DSUBSCRIPT LEFT PARENTHESISRRBrackets are sideways to enclose their text :?: DVO=U
U+2329LEFT-POINTING ANGLE BRACKETRRBrackets are sideways to enclose their text
U+2768MEDIUM LEFT PARENTHESIS ORNAMENTRRBrackets are sideways to enclose their text
U+276AMEDIUM FLATTENED LEFT PARENTHESIS ORNAMENTRRBrackets are sideways to enclose their text
U+276CMEDIUM LEFT-POINTING ANGLE BRACKET ORNAMENTRRBrackets are sideways to enclose their text
U+276EHEAVY LEFT-POINTING ANGLE QUOTATION MARK ORNAMENTRRGuillemets are sideways to enclose their text
U+2770HEAVY LEFT-POINTING ANGLE BRACKET ORNAMENTRRBrackets are sideways to enclose their text
U+2772LIGHT LEFT TORTOISE SHELL BRACKET ORNAMENTRRBrackets are sideways to enclose their text
U+2774MEDIUM LEFT CURLY BRACKET ORNAMENTRRBrackets are sideways to enclose their text
U+27C5LEFT S-SHAPED BAG DELIMITERRRBrackets are sideways to enclose their text
U+27E6MATHEMATICAL LEFT WHITE SQUARE BRACKETRRBrackets are sideways to enclose their text
U+27E8MATHEMATICAL LEFT ANGLE BRACKETRRBrackets are sideways to enclose their text
U+27EAMATHEMATICAL LEFT DOUBLE ANGLE BRACKETRRBrackets are sideways to enclose their text
U+27ECMATHEMATICAL LEFT WHITE TORTOISE SHELL BRACKETRRBrackets are sideways to enclose their text
U+27EEMATHEMATICAL LEFT FLATTENED PARENTHESISRRBrackets are sideways to enclose their text
U+2983LEFT WHITE CURLY BRACKETRRBrackets are sideways to enclose their text
U+2985LEFT WHITE PARENTHESISRRBrackets are sideways to enclose their text
U+2987Z NOTATION LEFT IMAGE BRACKETRRBrackets are sideways to enclose their text
U+2989Z NOTATION LEFT BINDING BRACKETRRBrackets are sideways to enclose their text
U+298BLEFT SQUARE BRACKET WITH UNDERBARRRBrackets are sideways to enclose their text
U+298DLEFT SQUARE BRACKET WITH TICK IN TOP CORNERRRBrackets are sideways to enclose their text
U+298FLEFT SQUARE BRACKET WITH TICK IN BOTTOM CORNERRRBrackets are sideways to enclose their text
U+2991LEFT ANGLE BRACKET WITH DOTRRBrackets are sideways to enclose their text
U+2993LEFT ARC LESS-THAN BRACKETRRBrackets are sideways to enclose their text
U+2995DOUBLE LEFT ARC GREATER-THAN BRACKETRRBrackets are sideways to enclose their text
U+2997LEFT BLACK TORTOISE SHELL BRACKETRRBrackets are sideways to enclose their text
U+29D8LEFT WIGGLY FENCERRBrackets are sideways to enclose their text
U+29DALEFT DOUBLE WIGGLY FENCERRBrackets are sideways to enclose their text
U+29FCLEFT-POINTING CURVED ANGLE BRACKETRRBrackets are sideways to enclose their text
U+2E22TOP LEFT HALF BRACKETRRBrackets are sideways to enclose their text
U+2E24BOTTOM LEFT HALF BRACKETRRBrackets are sideways to enclose their text
U+2E26LEFT SIDEWAYS U BRACKETRRBrackets are sideways to enclose their text
U+2E28LEFT DOUBLE PARENTHESISRRBrackets are sideways to enclose their text
U+3008LEFT ANGLE BRACKETTRTRBrackets are sideways to enclose their text (CJK fonts usually have vertical glyph)
U+300ALEFT DOUBLE ANGLE BRACKETTRTRBrackets are sideways to enclose their text (CJK fonts usually have vertical glyph)
U+300CLEFT CORNER BRACKETTRTRBrackets are sideways to enclose their text (CJK fonts usually have vertical glyph)
U+300ELEFT WHITE CORNER BRACKETTRTRBrackets are sideways to enclose their text (CJK fonts usually have vertical glyph)
U+3010LEFT BLACK LENTICULAR BRACKETTRTRBrackets are sideways to enclose their text (CJK fonts usually have vertical glyph)
U+3014LEFT TORTOISE SHELL BRACKETTRTRBrackets are sideways to enclose their text (CJK fonts usually have vertical glyph)
U+3016LEFT WHITE LENTICULAR BRACKETTRTRBrackets are sideways to enclose their text (CJK fonts usually have vertical glyph)
U+3018LEFT WHITE TORTOISE SHELL BRACKETTRTRBrackets are sideways to enclose their text (CJK fonts usually have vertical glyph)
U+301ALEFT WHITE SQUARE BRACKETTRTRBrackets are sideways to enclose their text (CJK fonts usually have vertical glyph)
U+301DREVERSED DOUBLE PRIME QUOTATION MARKTUTUQuotation marks are upright in stacked mode :?: DVO=S
U+FD3EORNATE LEFT PARENTHESISRRBrackets are sideways to enclose their text
U+FE17PRESENTATION FORM FOR VERTICAL LEFT WHITE LENTICULAR BRACKETUUVertical presentation forms are always upright
U+FE35PRESENTATION FORM FOR VERTICAL LEFT PARENTHESISUUVertical presentation forms are always upright
U+FE37PRESENTATION FORM FOR VERTICAL LEFT CURLY BRACKETUUVertical presentation forms are always upright
U+FE39PRESENTATION FORM FOR VERTICAL LEFT TORTOISE SHELL BRACKETUUVertical presentation forms are always upright
U+FE3BPRESENTATION FORM FOR VERTICAL LEFT BLACK LENTICULAR BRACKETUUVertical presentation forms are always upright
U+FE3DPRESENTATION FORM FOR VERTICAL LEFT DOUBLE ANGLE BRACKETUUVertical presentation forms are always upright
U+FE3FPRESENTATION FORM FOR VERTICAL LEFT ANGLE BRACKET︿UUVertical presentation forms are always upright
U+FE41PRESENTATION FORM FOR VERTICAL LEFT CORNER BRACKETUUVertical presentation forms are always upright
U+FE43PRESENTATION FORM FOR VERTICAL LEFT WHITE CORNER BRACKETUUVertical presentation forms are always upright
U+FE47PRESENTATION FORM FOR VERTICAL LEFT SQUARE BRACKETUUVertical presentation forms are always upright
U+FE59SMALL LEFT PARENTHESISRR:?: Brackets are sideways to enclose their text
U+FE5BSMALL LEFT CURLY BRACKETRR:?: Brackets are sideways to enclose their text
U+FE5DSMALL LEFT TORTOISE SHELL BRACKETRR:?: Brackets are sideways to enclose their text
U+FF08FULLWIDTH LEFT PARENTHESISTRTRBrackets are sideways to enclose their text (CJK fonts usually have vertical glyph)
U+FF3BFULLWIDTH LEFT SQUARE BRACKETTRTRBrackets are sideways to enclose their text (CJK fonts usually have vertical glyph)
U+FF5BFULLWIDTH LEFT CURLY BRACKETTRTRBrackets are sideways to enclose their text (CJK fonts usually have vertical glyph)
U+FF5FFULLWIDTH LEFT WHITE PARENTHESISTRTRBrackets are sideways to enclose their text (CJK fonts usually have vertical glyph)
U+FF62HALFWIDTH LEFT CORNER BRACKETRRBrackets are sideways to enclose their text :?: DVO=SB

Punctuation, Close (Pe)

CodeDescriptionCharStackMixMemo
U+0029RIGHT PARENTHESIS)RRBrackets are sideways to enclose their text
U+005DRIGHT SQUARE BRACKET]RRBrackets are sideways to enclose their text
U+007DRIGHT CURLY BRACKET}RRBrackets are sideways to enclose their text
U+0F3BTIBETAN MARK GUG RTAGS GYASUR:?: Unsure about Tibetan, assuming upright for stacked mode
U+0F3DTIBETAN MARK ANG KHANG GYASUR:?: Unsure about Tibetan, assuming upright for stacked mode
U+169COGHAM REVERSED FEATHER MARKRROgham is always sideways
U+2046RIGHT SQUARE BRACKET WITH QUILLRRBrackets are sideways to enclose their text
U+207ESUPERSCRIPT RIGHT PARENTHESISRRBrackets are sideways to enclose their text
U+208ESUBSCRIPT RIGHT PARENTHESISRRBrackets are sideways to enclose their text
U+232ARIGHT-POINTING ANGLE BRACKETRRBrackets are sideways to enclose their text
U+2769MEDIUM RIGHT PARENTHESIS ORNAMENTRRBrackets are sideways to enclose their text
U+276BMEDIUM FLATTENED RIGHT PARENTHESIS ORNAMENTRRBrackets are sideways to enclose their text
U+276DMEDIUM RIGHT-POINTING ANGLE BRACKET ORNAMENTRRBrackets are sideways to enclose their text
U+276FHEAVY RIGHT-POINTING ANGLE QUOTATION MARK ORNAMENTRRGuillemets are sideways to enclose their text
U+2771HEAVY RIGHT-POINTING ANGLE BRACKET ORNAMENTRRBrackets are sideways to enclose their text
U+2773LIGHT RIGHT TORTOISE SHELL BRACKET ORNAMENTRRBrackets are sideways to enclose their text
U+2775MEDIUM RIGHT CURLY BRACKET ORNAMENTRRBrackets are sideways to enclose their text
U+27C6RIGHT S-SHAPED BAG DELIMITERRRBrackets are sideways to enclose their text
U+27E7MATHEMATICAL RIGHT WHITE SQUARE BRACKETRRBrackets are sideways to enclose their text
U+27E9MATHEMATICAL RIGHT ANGLE BRACKETRRBrackets are sideways to enclose their text
U+27EBMATHEMATICAL RIGHT DOUBLE ANGLE BRACKETRRBrackets are sideways to enclose their text
U+27EDMATHEMATICAL RIGHT WHITE TORTOISE SHELL BRACKETRRBrackets are sideways to enclose their text
U+27EFMATHEMATICAL RIGHT FLATTENED PARENTHESISRRBrackets are sideways to enclose their text
U+2984RIGHT WHITE CURLY BRACKETRRBrackets are sideways to enclose their text
U+2986RIGHT WHITE PARENTHESISRRBrackets are sideways to enclose their text
U+2988Z NOTATION RIGHT IMAGE BRACKETRRBrackets are sideways to enclose their text
U+298AZ NOTATION RIGHT BINDING BRACKETRRBrackets are sideways to enclose their text
U+298CRIGHT SQUARE BRACKET WITH UNDERBARRRBrackets are sideways to enclose their text
U+298ERIGHT SQUARE BRACKET WITH TICK IN BOTTOM CORNERRRBrackets are sideways to enclose their text
U+2990RIGHT SQUARE BRACKET WITH TICK IN TOP CORNERRRBrackets are sideways to enclose their text
U+2992RIGHT ANGLE BRACKET WITH DOTRRBrackets are sideways to enclose their text
U+2994RIGHT ARC GREATER-THAN BRACKETRRBrackets are sideways to enclose their text
U+2996DOUBLE RIGHT ARC LESS-THAN BRACKETRRBrackets are sideways to enclose their text
U+2998RIGHT BLACK TORTOISE SHELL BRACKETRRBrackets are sideways to enclose their text
U+29D9RIGHT WIGGLY FENCERRBrackets are sideways to enclose their text
U+29DBRIGHT DOUBLE WIGGLY FENCERRBrackets are sideways to enclose their text
U+29FDRIGHT-POINTING CURVED ANGLE BRACKETRRBrackets are sideways to enclose their text
U+2E23TOP RIGHT HALF BRACKETRRBrackets are sideways to enclose their text
U+2E25BOTTOM RIGHT HALF BRACKETRRBrackets are sideways to enclose their text
U+2E27RIGHT SIDEWAYS U BRACKETRRBrackets are sideways to enclose their text
U+2E29RIGHT DOUBLE PARENTHESISRRBrackets are sideways to enclose their text
U+3009RIGHT ANGLE BRACKETTRTRBrackets are sideways to enclose their text
U+300BRIGHT DOUBLE ANGLE BRACKETTRTRBrackets are sideways to enclose their text
U+300DRIGHT CORNER BRACKETTRTRBrackets are sideways to enclose their text
U+300FRIGHT WHITE CORNER BRACKETTRTRBrackets are sideways to enclose their text
U+3011RIGHT BLACK LENTICULAR BRACKETTRTRBrackets are sideways to enclose their text
U+3015RIGHT TORTOISE SHELL BRACKETTRTRBrackets are sideways to enclose their text
U+3017RIGHT WHITE LENTICULAR BRACKETTRTRBrackets are sideways to enclose their text
U+3019RIGHT WHITE TORTOISE SHELL BRACKETTRTRBrackets are sideways to enclose their text
U+301BRIGHT WHITE SQUARE BRACKETTRTRBrackets are sideways to enclose their text
U+301EDOUBLE PRIME QUOTATION MARKTUTUQuotation marks are upright, but need some shifting. Prime quotes are mainly used for CJK, should be upright. :?: UTR#50 DVO has quotes S
U+301FLOW DOUBLE PRIME QUOTATION MARKTUTUQuotation marks are upright, but need some shifting. Prime quotes are mainly used for CJK, should be upright. :?: UTR#50 DVO has quotes S
U+FD3FORNATE RIGHT PARENTHESIS﴿RRBrackets are sideways to enclose their text
U+FE18PRESENTATION FORM FOR VERTICAL RIGHT WHITE LENTICULAR BRAKCETUUVertical presentation forms are always upright
U+FE36PRESENTATION FORM FOR VERTICAL RIGHT PARENTHESISUUVertical presentation forms are always upright
U+FE38PRESENTATION FORM FOR VERTICAL RIGHT CURLY BRACKETUUVertical presentation forms are always upright
U+FE3APRESENTATION FORM FOR VERTICAL RIGHT TORTOISE SHELL BRACKETUUVertical presentation forms are always upright
U+FE3CPRESENTATION FORM FOR VERTICAL RIGHT BLACK LENTICULAR BRACKETUUVertical presentation forms are always upright
U+FE3EPRESENTATION FORM FOR VERTICAL RIGHT DOUBLE ANGLE BRACKETUUVertical presentation forms are always upright
U+FE40PRESENTATION FORM FOR VERTICAL RIGHT ANGLE BRACKETUUVertical presentation forms are always upright
U+FE42PRESENTATION FORM FOR VERTICAL RIGHT CORNER BRACKETUUVertical presentation forms are always upright
U+FE44PRESENTATION FORM FOR VERTICAL RIGHT WHITE CORNER BRACKETUUVertical presentation forms are always upright
U+FE48PRESENTATION FORM FOR VERTICAL RIGHT SQUARE BRACKETUUVertical presentation forms are always upright
U+FE5ASMALL RIGHT PARENTHESISRRBrackets are sideways to enclose their text
U+FE5CSMALL RIGHT CURLY BRACKETRRBrackets are sideways to enclose their text
U+FE5ESMALL RIGHT TORTOISE SHELL BRACKETRRBrackets are sideways to enclose their text
U+FF09FULLWIDTH RIGHT PARENTHESISTRTRBrackets are sideways to enclose their text (CJK fonts usually have vertical glyph)
U+FF3DFULLWIDTH RIGHT SQUARE BRACKETTRTRBrackets are sideways to enclose their text (CJK fonts usually have vertical glyph)
U+FF5DFULLWIDTH RIGHT CURLY BRACKETTRTRBrackets are sideways to enclose their text (CJK fonts usually have vertical glyph)
U+FF60FULLWIDTH RIGHT WHITE PARENTHESISTRTRBrackets are sideways to enclose their text (CJK fonts usually have vertical glyph)
U+FF63HALFWIDTH RIGHT CORNER BRACKETRRBrackets are sideways to enclose their text

Punctuation, Initial quote (Pi)

CodeDescriptionCharStackMixMemo
U+00ABLEFT-POINTING DOUBLE ANGLE QUOTATION MARK«RRGuillmets are sideways to enclose text
U+2018LEFT SINGLE QUOTATION MARKTURQuotation marks are upright in stacked mode, but need some shifting :?: UTR#50 DVO has quotes S
U+201BSINGLE HIGH-REVERSED-9 QUOTATION MARKTURQuotation marks are upright in stacked mode, but need some shifting :?: UTR#50 DVO has quotes S
U+201CLEFT DOUBLE QUOTATION MARKTURQuotation marks are upright in stacked mode, but need some shifting :?: UTR#50 DVO has quotes S
U+201FDOUBLE HIGH-REVERSED-9 QUOTATION MARKTURQuotation marks are upright in stacked mode, but need some shifting :?: UTR#50 DVO has quotes S
U+2039SINGLE LEFT-POINTING ANGLE QUOTATION MARKRRGuillmets are sideways to enclose text
U+2E02LEFT SUBSTITUTION BRACKETU?R:?: New Testament Editorial Symbols… :?: DVO=U
U+2E04LEFT DOTTED SUBSTITUTION BRACKETU?R:?: New Testament Editorial Symbols… :?: DVO=U
U+2E09LEFT TRANSPOSITION BRACKETU?R:?: New Testament Editorial Symbols… :?: DVO=U
U+2E0CLEFT RAISED OMISSION BRACKETU?R:?: New Testament Editorial Symbols… :?: DVO=U
U+2E1CLEFT LOW PARAPHRASE BRACKETU?R:?: N'Ko punctuation :?: DVO=U
U+2E20LEFT VERTICAL BAR WITH QUILLRRBrackets are sideways to enclose text

Punctuation, Final quote (Pf)

CodeDescriptionCharStackMixMemo
U+00BBRIGHT-POINTING DOUBLE ANGLE QUOTATION MARK»RRGuillmets are sideways to enclose text
U+2019RIGHT SINGLE QUOTATION MARKTURQuotation marks are upright in stacked mode, but need some shifting :?: UTR#50 DVO has quotes S
U+201DRIGHT DOUBLE QUOTATION MARKTURQuotation marks are upright in stacked mode, but need some shifting :?: UTR#50 DVO has quotes S
U+203ASINGLE RIGHT-POINTING ANGLE QUOTATION MARKRRGuillmets are sideways to enclose text
U+2E03RIGHT SUBSTITUTION BRACKETU?R:?: New Testament Editorial Symbols… :?: DVO=U
U+2E05RIGHT DOTTED SUBSTITUTION BRACKETU?R:?: New Testament Editorial Symbols… :?: DVO=U
U+2E0ARIGHT TRANSPOSITION BRACKETU?R:?: New Testament Editorial Symbols… :?: DVO=U
U+2E0DRIGHT RAISED OMISSION BRACKETU?R:?: New Testament Editorial Symbols… :?: DVO=U
U+2E1DRIGHT LOW PARAPHRASE BRACKETU?R:?: N'Ko punctuation :?: DVO=U
U+2E21RIGHT VERTICAL BAR WITH QUILLRRBrackets are sideways to enclose text

Separating Punctuation

Spaces (Zs)

CodeDescriptionCharStackMixMemo
U+0020SPACE URProbably better to stack upright like letters, allow font to set vertical metrics. :?: DVO=S
U+00A0NO-BREAK SPACE URMust match U+0020 :?: DVO=S
U+1680OGHAM SPACE MARKRROgham is sideways
U+180EMONGOLIAN VOWEL SEPARATORVVMongolian is sideways :?: DVO=U
U+2000EN QUADRRFixed-size spacing. Provide spacing in advance direction.
U+2001EM QUADRRFixed-size spacing. Provide spacing in advance direction.
U+2002EN SPACERRFixed-size spacing. Provide spacing in advance direction.
U+2003EM SPACERRFixed-size spacing. Provide spacing in advance direction.
U+2004THREE-PER-EM SPACERR Fixed-size spacing. Provide spacing in advance direction..
U+2005FOUR-PER-EM SPACERRFixed-size spacing. Provide spacing in advance direction.
U+2006SIX-PER-EM SPACERRFixed-size spacing. Provide spacing in advance direction.
U+2007FIGURE SPACEURShould provide same advance as a digit, so match digits.
U+2008PUNCTUATION SPACETR:?: Should match advance of comma/period.
U+2009THIN SPACERRProvide spacing in advance direction. Often used with e.g. dashes and guillmets.
U+200AHAIR SPACERRProvide spacing in advance direction. Often used with e.g. dashes and guillmets.
U+202FNARROW NO-BREAK SPACERRProvide spacing in advance direction. Often used with e.g. dashes and guillmets.
U+205FMEDIUM MATHEMATICAL SPACERRProvide spacing in advance direction. Used to space mathematical operators.
U+3000IDEOGRAPHIC SPACE UUMake upright so that vertical metrics can be used to match non-square ideographic characters. :?: DVO=S

Common Other Punctuation (Po)

CodeDescriptionCharStackMixMemo
U+0021EXCLAMATION MARK!UR
U+0022QUOTATION MARKTURNeeds different position within bounding box, and/or different advance width (esp. when used as open-quote).
U+0023NUMBER SIGN#UR
U+0025PERCENT SIGN%UR
U+0026AMPERSAND&UR
U+0027APOSTROPHE'TURNeeds different position within bounding box and/or different advance width.
U+002AASTERISK*UR
U+002CCOMMA,TURNeeds different position within bounding box and/or different advance width.
U+002EFULL STOP.TURNeeds different position within bounding box and/or different advance width.
U+002FSOLIDUS/UR
U+003ACOLON:UR
U+003BSEMICOLON;UR
U+003FQUESTION MARK?UR
U+0040COMMERCIAL AT@UR
U+005CREVERSE SOLIDUS\UR
U+00A1INVERTED EXCLAMATION MARK¡UR
U+00A7SECTION SIGN§UU:?:
U+00B6PILCROW SIGNUU
U+00B7MIDDLE DOT·UR
U+00BFINVERTED QUESTION MARK¿UR
U+2016DOUBLE VERTICAL LINEUU:?: Most modern fonts have rotated vert, but JIS0213 says U. Taro wants R
U+2017DOUBLE LOW LINEURMatch LOW LINE?
U+2020DAGGERUU
U+2021DOUBLE DAGGERUU
U+2022BULLETUR:?:
U+2023TRIANGULAR BULLETUR:?:
U+2024ONE DOT LEADERRRLeaders always parallel to inline direction :?: DVO=U
U+2025TWO DOT LEADERRRLeaders always parallel to inline direction :?: DVO=U
U+2026HORIZONTAL ELLIPSISRREllipsis always parallel to inline direction :?: DVO=U
U+2027HYPHENATION POINTUR
U+2030PER MILLE SIGNUUUsed in East Asian codepages
U+2031PER TEN THOUSAND SIGNUUUsed in East Asian codepages
U+2032PRIMEUR
U+2033DOUBLE PRIMEUR
U+2034TRIPLE PRIMEUR
U+2035REVERSED PRIMEUR
U+2036REVERSED DOUBLE PRIMEUR
U+2037REVERSED TRIPLE PRIMEUR
U+2038CARETUR
U+203BREFERENCE MARKUU
U+203CDOUBLE EXCLAMATION MARKUU
U+203DINTERROBANGUU:?:
U+203EOVERLINEURMatch LOW LINE
U+2041CARET INSERTION POINTUR
U+2042ASTERISMUU:?: In JIS0213
U+2043HYPHEN BULLETRRMatch hyphen
U+2047DOUBLE QUESTION MARKUU
U+2048QUESTION EXCLAMATION MARKUU
U+2049EXCLAMATION QUESTION MARKUU
U+204ATIRONIAN SIGN ETURUsed with Latin
U+204BREVERSED PILCROW SIGNUUMatch PILCROW
U+204CBLACK LEFTWARDS BULLETUR
U+204DBLACK RIGHTWARDS BULLETUR
U+204ELOW ASTERISKURMatch asterisk
U+204FREVERSED SEMICOLONURMatch semicolon
U+2050CLOSE UPRRCopyediting symbol
U+2051TWO ASTERISKS ALIGNED VERTICALLYUU:?: In JIS0213
U+2053SWUNG DASHRRDashes are always sideways
U+2055FLOWER PUNCTUATION MARKUR
U+2056THREE DOT PUNCTUATIONUR
U+2057QUADRUPLE PRIMEUR
U+2058FOUR DOT PUNCTUATIONUR
U+2059FIVE DOT PUNCTUATIONUR
U+205ATWO DOT PUNCTUATIONURSee picture
U+205BFOUR DOT MARKUR
U+205CDOTTED CROSSUR
U+205DTRICOLONUR
U+205EVERTICAL FOUR DOTSUR
U+2E00RIGHT ANGLE SUBSTITUTION MARKERUR
U+2E01RIGHT ANGLE DOTTED SUBSTITUTION MARKERUR
U+2E06RAISED INTERPOLATION MARKERUR
U+2E07RAISED DOTTED INTERPOLATION MARKERUR
U+2E08DOTTED TRANSPOSITION MARKERUR
U+2E0BRAISED SQUAREUR
U+2E0EEDITORIAL CORONISUR
U+2E0FPARAGRAPHOSUR
U+2E10FORKED PARAGRAPHOSUR
U+2E11REVERSED FORKED PARAGRAPHOSUR
U+2E12HYPODIASTOLEUR
U+2E13DOTTED OBELOSUR
U+2E14DOWNWARDS ANCORAUR
U+2E15UPWARDS ANCORAUR
U+2E16DOTTED RIGHT-POINTING ANGLEUR
U+2E18INVERTED INTERROBANGUR:?: Mismatch with interrobang‽
U+2E19PALM BRANCHUR
U+2E1BTILDE WITH RING ABOVEUR
U+2E1ETILDE WITH DOT ABOVEUR
U+2E1FTILDE WITH DOT BELOWUR
U+2E2ATWO DOTS OVER ONE DOT PUNCTUATIONUR
U+2E2BONE DOT OVER TWO DOTS PUNCTUATIONUR
U+2E2CSQUARED FOUR DOT PUNCTUATIONUR
U+2E2DFIVE DOT MARKUR
U+2E2EREVERSED QUESTION MARKUR
U+2E30RING POINTUR
U+2E31WORD SEPARATOR MIDDLE DOTUR
U+2E32TURNED COMMAUR
U+2E33RAISED DOTUR
U+2E34RAISED COMMAUR
U+2E35TURNED SEMICOLONUR
U+2E36DAGGER WITH LEFT GUARDUUMatch DAGGER
U+2E37DAGGER WITH RIGHT GUARDUUMatch DAGGER
U+2E38TURNED DAGGERUUMatch DAGGER
U+2E39TOP HALF SECTION SIGNUUMatch SECTION SIGN
U+3001IDEOGRAPHIC COMMATUTUIdeographic variants upright; comma needs shifting
U+3002IDEOGRAPHIC FULL STOPTUTUIdeographic variants upright; full stop needs shifting
U+3003DITTO MARKUU
U+303DPART ALTERNATION MARKUUUsed in Japanese verse notation
U+30FBKATAKANA MIDDLE DOTUUKatakana middle dot is upright
U+FE45SESAME DOTUUSesame dots are always upright
U+FE46WHITE SESAME DOTUUSesame dots are always upright
U+FE49DASHED OVERLINEURMatch dashed low line
U+FE4ACENTRELINE OVERLINEURMatch centerline low line
U+FE4BWAVY OVERLINEURMatch wavy low line
U+FE4CDOUBLE WAVY OVERLINEURMatch wavy low line
U+10B3ATINY TWO DOTS OVER ONE DOT PUNCTUATION𐬺URMatch Avestan
U+10B3BSMALL TWO DOTS OVER ONE DOT PUNCTUATION𐬻URMatch Avestan
U+10B3CLARGE TWO DOTS OVER ONE DOT PUNCTUATION𐬼URMatch Avestan
U+10B3DLARGE ONE DOT OVER TWO DOTS PUNCTUATION𐬽URMatch Avestan
U+10B3ELARGE TWO RINGS OVER ONE RING PUNCTUATION𐬾URMatch Avestan
U+10B3FLARGE ONE RING OVER TWO RINGS PUNCTUATION𐬿URMatch Avestan

Small/Fullwidth/Halfwidth/Vertical Other Punctuation (Po)

CodeDescriptionCharStackMixMemo
U+FE50SMALL COMMATUTUSmall variants upright; comma needs shifting :?: DVO=U
U+FE51SMALL IDEOGRAPHIC COMMATUTUSmall variants upright; comma needs shifting :?: DVO=U
U+FE52SMALL FULL STOPTUTUSmall variants upright; full stop needs shifting :?: DVO=U
U+FE54SMALL SEMICOLONTUTUSmall semicolons are either upright (Chinese-style) or rotated sideways (Japanese-style) :?: DVO=U
U+FE55SMALL COLONTUTUSmall colons are either upright (Chinese-style) or rotated sideways (Japanese-style) :?: DVO=U
U+FE56SMALL QUESTION MARKUUSmall variants upright
U+FE57SMALL EXCLAMATION MARKUUSmall variants upright
U+FE5FSMALL NUMBER SIGNUUSmall variants upright
U+FE60SMALL AMPERSANDUUSmall variants upright
U+FE61SMALL ASTERISKUUSmall variants upright
U+FE68SMALL REVERSE SOLIDUSUUSmall variants upright
U+FE6ASMALL PERCENT SIGNUUSmall variants upright
U+FE6BSMALL COMMERCIAL ATUUSmall variants upright
U+FF01FULLWIDTH EXCLAMATION MARKUUFullwidth variants are upright
U+FF02FULLWIDTH QUOTATION MARKTUTUFullwidth variants are upright; quotes need alt glyph :?: UTR#50 DVO has U
U+FF03FULLWIDTH NUMBER SIGNUUFullwidth variants are upright
U+FF05FULLWIDTH PERCENT SIGNUUFullwidth variants are upright
U+FF06FULLWIDTH AMPERSANDUUFullwidth variants are upright
U+FF07FULLWIDTH APOSTROPHETUTUFullwidth variants are upright; :?: quotes need alt glyph :?: DVO=U
U+FF0AFULLWIDTH ASTERISKUUFullwidth variants are upright
U+FF0CFULLWIDTH COMMATUTUFullwidth variants are upright; comma needs shifting :?: DVO=U
U+FF0EFULLWIDTH FULL STOPTUTUFullwidth variants are upright; full stop needs shifting :?: DVO=U
U+FF0FFULLWIDTH SOLIDUSUUFullwidth variants are upright
U+FF1AFULLWIDTH COLONTUTUFullwidth colons are either upright (Chinese-style) or rotated sideways (Japanese-style) :?: DVO=U
U+FF1BFULLWIDTH SEMICOLONTUTUFullwidth semicolons are either upright (Chinese-style) or rotated sideways (Japanese-style) :?: DVO=U
U+FF1FFULLWIDTH QUESTION MARKUUFullwidth variants are upright
U+FF20FULLWIDTH COMMERCIAL ATUUFullwidth variants are upright
U+FF3CFULLWIDTH REVERSE SOLIDUSUUFullwidth variants are upright
U+FF61HALFWIDTH IDEOGRAPHIC FULL STOPTURHalfwidth is R. Upright full stop needs shifting; :?: Halfwidth needs transform to be half-width
U+FF64HALFWIDTH IDEOGRAPHIC COMMATURHalfwidth is R. Upright comma needs shifting; :?: Halfwidth needs transform to be half-width
U+FF65HALFWIDTH KATAKANA MIDDLE DOTTURHalfwidth is R. :?: Halfwidth needs transform to be half-width? :?: DVO=U
U+FE10PRESENTATION FORM FOR VERTICAL COMMAUUVertical presentation forms are always upright
U+FE11PRESENTATION FORM FOR VERTICAL IDEOGRAPHIC COMMAUUVertical presentation forms are always upright
U+FE12PRESENTATION FORM FOR VERTICAL IDEOGRAPHIC FULL STOPUUVertical presentation forms are always upright
U+FE13PRESENTATION FORM FOR VERTICAL COLONUUVertical presentation forms are always upright
U+FE14PRESENTATION FORM FOR VERTICAL SEMICOLONUUVertical presentation forms are always upright
U+FE15PRESENTATION FORM FOR VERTICAL EXCLAMATION MARKUUVertical presentation forms are always upright
U+FE16PRESENTATION FORM FOR VERTICAL QUESTION MARKUUVertical presentation forms are always upright
U+FE19PRESENTATION FORM FOR VERTICAL HORIZONTAL ELLIPSISUUVertical presentation forms are always upright
U+FE30PRESENTATION FORM FOR VERTICAL TWO DOT LEADERUUVertical presentation forms are always upright

Script-specific Other Punctuation (Po)

CodeDescriptionCharStackMixMemo
U+037EGREEK QUESTION MARK;UR
U+0387GREEK ANO TELEIA·UR
U+055AARMENIAN APOSTROPHE՚UR
U+055BARMENIAN EMPHASIS MARK՛UR
U+055CARMENIAN EXCLAMATION MARK՜UR
U+055DARMENIAN COMMA՝UR
U+055EARMENIAN QUESTION MARK՞UR
U+055FARMENIAN ABBREVIATION MARK՟UR
U+0589ARMENIAN FULL STOP։UR
U+05C0HEBREW PUNCTUATION PASEQ׀UR
U+05C3HEBREW PUNCTUATION SOF PASUQ׃UR
U+05C6HEBREW PUNCTUATION NUN HAFUKHA׆UR
U+05F3HEBREW PUNCTUATION GERESH׳UR
U+05F4HEBREW PUNCTUATION GERSHAYIM״UR
U+0609ARABIC-INDIC PER MILLE SIGN؉UR
U+060AARABIC-INDIC PER TEN THOUSAND SIGN؊UR
U+060CARABIC COMMA،UR
U+060DARABIC DATE SEPARATOR؍UR
U+061BARABIC SEMICOLON؛UR
U+061EARABIC TRIPLE DOT PUNCTUATION MARK؞UR
U+061FARABIC QUESTION MARK؟UR
U+066AARABIC PERCENT SIGN٪UR
U+066BARABIC DECIMAL SEPARATOR٫UR
U+066CARABIC THOUSANDS SEPARATOR٬UR
U+066DARABIC FIVE POINTED STAR٭UR
U+06D4ARABIC FULL STOP۔UR
U+0700SYRIAC END OF PARAGRAPH܀UR
U+0701SYRIAC SUPRALINEAR FULL STOP܁UR
U+0702SYRIAC SUBLINEAR FULL STOP܂UR
U+0703SYRIAC SUPRALINEAR COLON܃UR
U+0704SYRIAC SUBLINEAR COLON܄UR
U+0705SYRIAC HORIZONTAL COLON܅UR
U+0706SYRIAC COLON SKEWED LEFT܆UR
U+0707SYRIAC COLON SKEWED RIGHT܇UR
U+0708SYRIAC SUPRALINEAR COLON SKEWED LEFT܈UR
U+0709SYRIAC SUBLINEAR COLON SKEWED RIGHT܉UR
U+070ASYRIAC CONTRACTION܊UR
U+070BSYRIAC HARKLEAN OBELUS܋UR
U+070CSYRIAC HARKLEAN METOBELUS܌UR
U+070DSYRIAC HARKLEAN ASTERISCUS܍UR
U+07F7NKO SYMBOL GBAKURUNEN߷UR
U+07F8NKO COMMA߸UR
U+07F9NKO EXCLAMATION MARK߹UR
U+0830SAMARITAN PUNCTUATION NEQUDAAUR
U+0831SAMARITAN PUNCTUATION AFSAAQUR
U+0832SAMARITAN PUNCTUATION ANGEDUR
U+0833SAMARITAN PUNCTUATION BAUUR
U+0834SAMARITAN PUNCTUATION ATMAAUUR
U+0835SAMARITAN PUNCTUATION SHIYYAALAAUR
U+0836SAMARITAN ABBREVIATION MARKUR
U+0837SAMARITAN PUNCTUATION MELODIC QITSAUR
U+0838SAMARITAN PUNCTUATION ZIQAAUR
U+0839SAMARITAN PUNCTUATION QITSAUR
U+083ASAMARITAN PUNCTUATION ZAEFUR
U+083BSAMARITAN PUNCTUATION TURUUR
U+083CSAMARITAN PUNCTUATION ARKAANUUR
U+083DSAMARITAN PUNCTUATION SOF MASHFAATUR
U+083ESAMARITAN PUNCTUATION ANNAAUUR
U+085EMANDAIC PUNCTUATIONUR:?: Mandaic :?: DVO=U
U+0964DEVANAGARI DANDAUR
U+0965DEVANAGARI DOUBLE DANDAUR
U+0970DEVANAGARI ABBREVIATION SIGNUR
U+0AF0GUJARATI ABBREVIATION SIGNUR
U+0DF4SINHALA PUNCTUATION KUNDDALIYAUR
U+0E4FTHAI CHARACTER FONGMANUR
U+0E5ATHAI CHARACTER ANGKHANKHUUR
U+0E5BTHAI CHARACTER KHOMUTUR
U+0F04TIBETAN MARK INITIAL YIG MGO MDUN MAUR
U+0F05TIBETAN MARK CLOSING YIG MGO SGAB MAUR
U+0F06TIBETAN MARK CARET YIG MGO PHUR SHAD MAUR
U+0F07TIBETAN MARK YIG MGO TSHEG SHAD MAUR
U+0F08TIBETAN MARK SBRUL SHADUR
U+0F09TIBETAN MARK BSKUR YIG MGOUR
U+0F0ATIBETAN MARK BKA- SHOG YIG MGOUR
U+0F0BTIBETAN MARK INTERSYLLABIC TSHEGUR
U+0F0CTIBETAN MARK DELIMITER TSHEG BSTARUR
U+0F0DTIBETAN MARK SHADUR
U+0F0ETIBETAN MARK NYIS SHADUR
U+0F0FTIBETAN MARK TSHEG SHADUR
U+0F10TIBETAN MARK NYIS TSHEG SHADUR
U+0F11TIBETAN MARK RIN CHEN SPUNGS SHADUR
U+0F12TIBETAN MARK RGYA GRAM SHADUR
U+0F14TIBETAN MARK GTER TSHEGUR
U+0F85TIBETAN MARK PALUTAUR
U+0FD0TIBETAN MARK BSKA- SHOG GI MGO RGYANUR
U+0FD1TIBETAN MARK MNYAM YIG GI MGO RGYANUR
U+0FD2TIBETAN MARK NYIS TSHEGUR
U+0FD3TIBETAN MARK INITIAL BRDA RNYING YIG MGO MDUN MAUR
U+0FD4TIBETAN MARK CLOSING BRDA RNYING YIG MGO SGAB MAUR
U+0FD9TIBETAN MARK LEADING MCHAN RTAGSUR
U+0FDATIBETAN MARK TRAILING MCHAN RTAGSUR
U+104AMYANMAR SIGN LITTLE SECTIONUR
U+104BMYANMAR SIGN SECTIONUR
U+104CMYANMAR SYMBOL LOCATIVEUR
U+104DMYANMAR SYMBOL COMPLETEDUR
U+104EMYANMAR SYMBOL AFOREMENTIONEDUR
U+104FMYANMAR SYMBOL GENITIVEUR
U+10FBGEORGIAN PARAGRAPH SEPARATORUR
U+1360ETHIOPIC SECTION MARKUR
U+1361ETHIOPIC WORDSPACEUR
U+1362ETHIOPIC FULL STOPUR
U+1363ETHIOPIC COMMAUR
U+1364ETHIOPIC SEMICOLONUR
U+1365ETHIOPIC COLONUR
U+1366ETHIOPIC PREFACE COLONUR
U+1367ETHIOPIC QUESTION MARKUR
U+1368ETHIOPIC PARAGRAPH SEPARATORUR
U+166DCANADIAN SYLLABICS CHI SIGNUR
U+166ECANADIAN SYLLABICS FULL STOPUR
U+16EBRUNIC SINGLE PUNCTUATIONUR
U+16ECRUNIC MULTIPLE PUNCTUATIONUR
U+16EDRUNIC CROSS PUNCTUATIONUR
U+1735PHILIPPINE SINGLE PUNCTUATIONUR
U+1736PHILIPPINE DOUBLE PUNCTUATIONUR
U+17D4KHMER SIGN KHANUR
U+17D5KHMER SIGN BARIYOOSANUR
U+17D6KHMER SIGN CAMNUC PII KUUHUR
U+17D8KHMER SIGN BEYYALUR
U+17D9KHMER SIGN PHNAEK MUANUR
U+17DAKHMER SIGN KOOMUUTUR
U+1800MONGOLIAN BIRGAVVMatch Mongolian letters :?: DVO=U
U+1801MONGOLIAN ELLIPSISVVMatch Mongolian letters :?: DVO=U
U+1802MONGOLIAN COMMAVVMatch Mongolian letters :?: DVO=U
U+1803MONGOLIAN FULL STOPVVMatch Mongolian letters :?: DVO=U
U+1804MONGOLIAN COLONVVMatch Mongolian letters :?: DVO=U
U+1805MONGOLIAN FOUR DOTSVVMatch Mongolian letters :?: DVO=U
U+1807MONGOLIAN SIBE SYLLABLE BOUNDARY MARKERVVMatch Mongolian letters :?: DVO=U
U+1808MONGOLIAN MANCHU COMMAVVMatch Mongolian letters :?: DVO=U
U+1809MONGOLIAN MANCHU FULL STOPVVMatch Mongolian letters :?: DVO=U
U+180AMONGOLIAN NIRUGUVVMatch Mongolian letters :?: DVO=U
U+1944LIMBU EXCLAMATION MARKUR
U+1945LIMBU QUESTION MARKUR
U+1A1EBUGINESE PALLAWAUR
U+1A1FBUGINESE END OF SECTIONUR
U+1AA0TAI THAM SIGN WIANGUR
U+1AA1TAI THAM SIGN WIANGWAAKUR
U+1AA2TAI THAM SIGN SAWANUR
U+1AA3TAI THAM SIGN KEOWUR
U+1AA4TAI THAM SIGN HOYUR
U+1AA5TAI THAM SIGN DOKMAIUR
U+1AA6TAI THAM SIGN REVERSED ROTATED RANAUR
U+1AA8TAI THAM SIGN KAANUR
U+1AA9TAI THAM SIGN KAANKUUUR
U+1AAATAI THAM SIGN SATKAANUR
U+1AABTAI THAM SIGN SATKAANKUUUR
U+1AACTAI THAM SIGN HANGUR
U+1AADTAI THAM SIGN CAANGUR
U+1B5ABALINESE PANTIUR
U+1B5BBALINESE PAMADAUR
U+1B5CBALINESE WINDUUR
U+1B5DBALINESE CARIK PAMUNGKAHUR
U+1B5EBALINESE CARIK SIKIUR
U+1B5FBALINESE CARIK PARERENUR
U+1B60BALINESE PAMENENGUR
U+1BFCBATAK SYMBOL BINDU NA METEKUR:?:
U+1BFDBATAK SYMBOL BINDU PINARBORASUR:?:
U+1BFEBATAK SYMBOL BINDU JUDULUR:?:
U+1BFFBATAK SYMBOL BINDU PANGOLAT᯿UR:?:
U+1C3BLEPCHA PUNCTUATION TA-ROLUR
U+1C3CLEPCHA PUNCTUATION NYET THYOOM TA-ROLUR
U+1C3DLEPCHA PUNCTUATION CER-WAUR
U+1C3ELEPCHA PUNCTUATION TSHOOK CER-WAUR
U+1C3FLEPCHA PUNCTUATION TSHOOK᰿UR
U+1C7EOL CHIKI PUNCTUATION MUCAADUR
U+1C7FOL CHIKI PUNCTUATION DOUBLE MUCAAD᱿UR
U+1CC0SUNDANESE PUNCTUATION BINDU SURYAUR
U+1CC1SUNDANESE PUNCTUATION BINDU PANGLONGUR
U+1CC2SUNDANESE PUNCTUATION BINDU PURNAMAUR
U+1CC3SUNDANESE PUNCTUATION BINDU CAKRAUR
U+1CC4SUNDANESE PUNCTUATION BINDU LEU SATANGAUR
U+1CC5SUNDANESE PUNCTUATION BINDU KA SATANGAUR
U+1CC6SUNDANESE PUNCTUATION BINDU DA SATANGAUR
U+1CC7SUNDANESE PUNCTUATION BINDU BA SATANGAUR
U+1CD3VEDIC SIGN NIHSHVASAUR
U+2CF9COPTIC OLD NUBIAN FULL STOPUR
U+2CFACOPTIC OLD NUBIAN DIRECT QUESTION MARKUR
U+2CFBCOPTIC OLD NUBIAN INDIRECT QUESTION MARKUR
U+2CFCCOPTIC OLD NUBIAN VERSE DIVIDERUR
U+2CFECOPTIC FULL STOPUR
U+2CFFCOPTIC MORPHOLOGICAL DIVIDER⳿UR
U+2D70TIFINAGH SEPARATOR MARKUR
U+A4FELISU PUNCTUATION COMMAUR
U+A4FFLISU PUNCTUATION FULL STOPUR
U+A60DVAI COMMAUR
U+A60EVAI FULL STOPUR
U+A60FVAI QUESTION MARKUR
U+A673SLAVONIC ASTERISKUR
U+A67ECYRILLIC KAVYKAUR
U+A6F2BAMUM NJAEMLIUR
U+A6F3BAMUM FULL STOPUR
U+A6F4BAMUM COLONUR
U+A6F5BAMUM COMMAUR
U+A6F6BAMUM SEMICOLONUR
U+A6F7BAMUM QUESTION MARKUR
U+A874PHAGS-PA SINGLE HEAD MARKVVMatch Phags-pa letters :?: DVO=U
U+A875PHAGS-PA DOUBLE HEAD MARKVVMatch Phags-pa letters :?: DVO=U
U+A876PHAGS-PA MARK SHADVVMatch Phags-pa letters :?: DVO=U
U+A877PHAGS-PA MARK DOUBLE SHADVVMatch Phags-pa letters :?: DVO=U
U+A8CESAURASHTRA DANDAUR
U+A8CFSAURASHTRA DOUBLE DANDAUR
U+A8F8DEVANAGARI SIGN PUSHPIKAUR
U+A8F9DEVANAGARI GAP FILLERUR
U+A8FADEVANAGARI CARETUR
U+A92EKAYAH LI SIGN CWIUR
U+A92FKAYAH LI SIGN SHYAUR
U+A95FREJANG SECTION MARKUR
U+A9C1JAVANESE LEFT RERENGGANUR
U+A9C2JAVANESE RIGHT RERENGGANUR
U+A9C3JAVANESE PADA ANDAPUR
U+A9C4JAVANESE PADA MADYAUR
U+A9C5JAVANESE PADA LUHURUR
U+A9C6JAVANESE PADA WINDUUR
U+A9C7JAVANESE PADA PANGKATUR
U+A9C8JAVANESE PADA LINGSAUR
U+A9C9JAVANESE PADA LUNGSIUR
U+A9CAJAVANESE PADA ADEGUR
U+A9CBJAVANESE PADA ADEG ADEGUR
U+A9CCJAVANESE PADA PISELEHUR
U+A9CDJAVANESE TURNED PADA PISELEHUR
U+A9DEJAVANESE PADA TIRTA TUMETESUR
U+A9DFJAVANESE PADA ISEN-ISENUR
U+AA5CCHAM PUNCTUATION SPIRALUR
U+AA5DCHAM PUNCTUATION DANDAUR
U+AA5ECHAM PUNCTUATION DOUBLE DANDAUR
U+AA5FCHAM PUNCTUATION TRIPLE DANDAUR
U+AADETAI VIET SYMBOL HO HOIUR
U+AADFTAI VIET SYMBOL KOI KOIUR
U+AAF0MEETEI MAYEK CHEIKHANUR
U+AAF1MEETEI MAYEK AHANG KHUDAMUR
U+ABEBMEETEI MAYEK CHEIKHEIUR
U+10100AEGEAN WORD SEPARATOR LINE𐄀UR
U+10101AEGEAN WORD SEPARATOR DOT𐄁UR
U+10102AEGEAN CHECK MARK𐄂UR
U+1039FUGARITIC WORD DIVIDER𐎟UR
U+103D0OLD PERSIAN WORD DIVIDER𐏐UR
U+10857IMPERIAL ARAMAIC SECTION SIGN𐡗UR
U+1091FPHOENICIAN WORD SEPARATOR𐤟UR
U+1093FLYDIAN TRIANGULAR MARK𐤿UR
U+10A50KHAROSHTHI PUNCTUATION DOT𐩐UR
U+10A51KHAROSHTHI PUNCTUATION SMALL CIRCLE𐩑UR
U+10A52KHAROSHTHI PUNCTUATION CIRCLE𐩒UR
U+10A53KHAROSHTHI PUNCTUATION CRESCENT BAR𐩓UR
U+10A54KHAROSHTHI PUNCTUATION MANGALAM𐩔UR
U+10A55KHAROSHTHI PUNCTUATION LOTUS𐩕UR
U+10A56KHAROSHTHI PUNCTUATION DANDA𐩖UR
U+10A57KHAROSHTHI PUNCTUATION DOUBLE DANDA𐩗UR
U+10A58KHAROSHTHI PUNCTUATION LINES𐩘UR
U+10A7FOLD SOUTH ARABIAN NUMERIC INDICATOR𐩿UR
U+10B39AVESTAN ABBREVIATION MARK𐬹UR
U+11047BRAHMI DANDA𑁇UR
U+11048BRAHMI DOUBLE DANDA𑁈UR
U+11049BRAHMI PUNCTUATION DOT𑁉UR
U+1104ABRAHMI PUNCTUATION DOUBLE DOT𑁊UR
U+1104BBRAHMI PUNCTUATION LINE𑁋UR
U+1104CBRAHMI PUNCTUATION CRESCENT BAR𑁌UR
U+1104DBRAHMI PUNCTUATION LOTUS𑁍UR
U+110BBKAITHI ABBREVIATION SIGN𑂻UR
U+110BCKAITHI ENUMERATION SIGN𑂼UR
U+110BEKAITHI SECTION MARK𑂾UR
U+110BFKAITHI DOUBLE SECTION MARK𑂿UR
U+110C0KAITHI DANDA𑃀UR
U+110C1KAITHI DOUBLE DANDA𑃁UR
U+11140CHAKMA SECTION MARK𑅀UR
U+11141CHAKMA DANDA𑅁UR
U+11142CHAKMA DOUBLE DANDA𑅂UR
U+11143CHAKMA QUESTION MARK𑅃UR
U+111C5SHARADA DANDA𑇅UR
U+111C6SHARADA DOUBLE DANDA𑇆UR
U+111C7SHARADA ABBREVIATION SIGN𑇇UR
U+111C8SHARADA SEPARATOR𑇈UR
U+12470CUNEIFORM PUNCTUATION SIGN OLD ASSYRIAN WORD DIVIDER𒑰UR
U+12471CUNEIFORM PUNCTUATION SIGN VERTICAL COLON𒑱UR
U+12472CUNEIFORM PUNCTUATION SIGN DIAGONAL COLON𒑲UR
U+12473CUNEIFORM PUNCTUATION SIGN DIAGONAL TRICOLON𒑳UR

Symbols

Old Notes

More Notes on Quotes

CodeDescriptionUTRWMMemo
U+2018-2019LEFT/RIGHT SINGLE QUOTATION MARKTUFSome people says T, while some says SB. Some says these should be consistent with U+201C/201D.
U+201C-201DCurly quotesSBVJLREQ defines “use double curly quotes in horizontal and double prime in vertical”. Some people think these are SB because these code points are for horizontal only, and it's author's responsibility to replace them to U+301D/301F in vertical flow. Some says T, so that replacement can happen automatically when user switched text flow just like small Kana. Since it's split, probably T is better.
U+201E-201FDOUBLE LOW/HIGH-REVERSED–9 QUOTATION MARKSBUFSome people says T, while some says SB. Some says these should be consistent with U+201C/201D.
U+301D/301FREVERSED/Low DOUBLE PRIME QUOTATION MARKTVThese are double-quotes for vertical flow in Japanese. Some fonts use these glyphs as vert for U+2018/2019. Some fonts (Meiryo) uses DOUBLE PRIME glyphs for U+2018/2019 even in horizontal flow. These can also be used in math as double-dashes? T is probably good.
U+301EDOUBLE PRIME QUOTATION MARKTVThese are glyphs only for vertical. T is probably good. :?: UTR#50 DVO has quotes S

General Punctuation

CodeDescriptionUTRWMMemo
U+00A9COPYRIGHT SIGNUSExamples: copyright_horz.png copyright_vert.jpg
U+00AEREGISTERED SIGNUSUTR#50 4.4 mentions this could be contextual. Tailoring can be another option along with U+00A9 COPYRIGHT SIGN.
U+2016Double vertical lineUVTypically rotated in Japanese typesetting
U+3033-3035VERTICAL KANA REPEAT MARKUUThese are glyphs only for vertical. T is probably good.
U+303BVERTICAL IDEOGRAPHIC ITERATION MARKUUThese are glyphs only for vertical. T is probably good.
U+3303, 3305, 3306, etc.SQUARE of 3 charsTUWhen one char in a line, it can be either align center or left (top) and that should be up to font designers. Showing representative glyphs can be misleading.
U+337B-337FSquare era namesTUAdobe/Apple fonts use vertically compressed glyphs, while MS/Ricoh fonts use horizontally compressed glyphs using Tate-Chu-Yoko. JLTF says both should be allowed.
U+FF0CFULLWIDTH COMMAUUShould be T, to make consistent with U+FF0C.
U+FF0EFULLWIDTH FULL STOPUUShould be T, to make consistent with U+3002.
U+FF1AFullwidth colonUVTypically rotated in Japanese typesetting. Traditional Chinese typesets upright. Should be T.
U+FF1BFullwidth semicolonUVNot used in Japanese, typically upright with no vert altglyph. Fonts seem inconsistent. Traditional Chinese typesets upright. Should be T.
U+FF1DFullwidth equals signUUTypically rotated in Japanese fonts. What about Chinese?
U+FF5CFullwidth vertical lineUUTypically rotated in Japanese typesetting

Letterlike Symbols

  • EAW=JKST means the code point existed in legacy encoding of Japan/Korean/Simplified Chinese/Traditional Chinese. One could argue that EAC is legacy, but even today's Input Methods emit them, so EAW=A can indicate that the code are likely to be used even today.
CodeNameCatEACEAOWMEAWComments
2100ACCOUNT OFSo19.3USN
2101ADDRESSED TO THE SUBJECTSo19.3USN
2102DOUBLE-STRUCK CAPITAL CLu19.3USN
2103DEGREE CELSIUSSo13USJKSTGoogling “air temperature” (気温) hits several usage of this code point; e.g., 1, 2, 3. They appear after ASCII/full-width digits because (I guess) they assume web is horizontal, but I expect it be formatted as full-width or Han digits if vertical. Type “do” (degree) in MS-IME and you get this.
2104CENTRE LINE SYMBOLSo19.3USN
2105CARE OFSo19.3USST
2106CADA UNASo19.3USN
2107EULER CONSTANTLu19.3USN
2108SCRUPLESo19.3USN
2109DEGREE FAHRENHEITSo13USKST
210ASCRIPT SMALL GLl19.3USN
210BSCRIPT CAPITAL HLu19.3USN
210CBLACK-LETTER CAPITAL HLu19.3USN
210DDOUBLE-STRUCK CAPITAL HLu19.3USN
210EPLANCK CONSTANTLl19.3USN
210FPLANCK CONSTANT OVER TWO PILl19.3USN
2110SCRIPT CAPITAL ILu19.3USN
2111BLACK-LETTER CAPITAL ILu19.3USN
2112SCRIPT CAPITAL LLu19.3USN
2113SCRIPT SMALL LLl13USK
2114L B BAR SYMBOLSo19.3USN
2115DOUBLE-STRUCK CAPITAL NLu19.3USN
2116NUMERO SIGNSo12USJKSUpright makes sense to me and I remember I saw some instances, but can't find right now. I'll look for further. Type “bangou” (number) in MS-IME and you get this.
2117SOUND RECORDING COPYRIGHTSo19.3USN
2118SCRIPT CAPITAL PSm19.3USN
2119DOUBLE-STRUCK CAPITAL PLu19.3USN
211ADOUBLE-STRUCK CAPITAL QLu19.3USN
211BSCRIPT CAPITAL RLu19.3USN
211CBLACK-LETTER CAPITAL RLu19.3USN
211DDOUBLE-STRUCK CAPITAL RLu19.3USN
211EPRESCRIPTION TAKESo19.3USN
211FRESPONSESo19.3USN
2120SERVICE MARKSo19.3USN
2121TELEPHONE SIGNSo19.3USJKSI thought I can find examples in vertical-flow-name cards, but could not find the use of this code. I'm pretty sure if I send this code as text to printing company, they'll set upright though. I'll look for further. Type “denwa” (phone) in MS-IME and you get this.
2122TRADE MARK SIGNSo19.3USK
2123VERSICLESo19.3USN
2124DOUBLE-STRUCK CAPITAL ZLu19.3USN
2125OUNCE SIGNSo19.3USN
2126OHM SIGNLu19.3USK
2127INVERTED OHM SIGNSo19.3USN
2128BLACK-LETTER CAPITAL ZLu19.3USN
2129TURNED GREEK SMALL LETTER IOTASo19.3USN
212AKELVIN SIGNLu19.3USN
212BANGSTROM SIGNLu19.3USJKType “ongusutoro-mu” (angstrom) in MS-IME and you get this; e.g., 1, 2. Interestingly, there're questions on the web asking “how to type half-width angstrom.” Answers are “apply English font” or “type using English keyboard.” The code point is the same, but Japanese fonts usually have full-width glyph here, while roman fonts have proportional, so that's what they mean by “half-width”. I couldn't find “how to type full-width angstrom” question; no idea because it's not used, or because it's too easy to figure out (use Input Method), but my guess is this is less commonly used unit symbol and therefore far less important to set upright than other symbols such as U+2103, U+2116, or U+2121.
212CSCRIPT CAPITAL BLu19.3USN
212DBLACK-LETTER CAPITAL CLu19.3USN
212EESTIMATED SYMBOLSo19.3USN
212FSCRIPT SMALL ELl19.3USN
2130SCRIPT CAPITAL ELu19.3USN
2131SCRIPT CAPITAL FLu19.3USN
2132TURNED CAPITAL FLu19.3USN
2133SCRIPT CAPITAL MLu19.3USN
2134SCRIPT SMALL OLl19.3USN
2135ALEF SYMBOLLo19.3USN
2136BET SYMBOLLo19.3USN
2137GIMEL SYMBOLLo19.3USN
2138DALET SYMBOLLo19.3USN
2139INFORMATION SOURCELl19.3USN
213AROTATED CAPITAL QSo19.3USN
213BFACSIMILE SIGNSo19.3USN
213CDOUBLE-STRUCK SMALL PILl19.3USN
213DDOUBLE-STRUCK SMALL GAMMALl19.3USN
213EDOUBLE-STRUCK CAPITAL GAMMALu19.3USN
213FDOUBLE-STRUCK CAPITAL PILu19.3USN
2140DOUBLE-STRUCK N-ARY SUMMATIONSm19.3USN
2141TURNED SANS-SERIF CAPITAL GSm19.3USN
2142TURNED SANS-SERIF CAPITAL LSm19.3USN
2143REVERSED SANS-SERIF CAPITAL LSm19.3USN
2144TURNED SANS-SERIF CAPITAL YSm19.3USN
2145DOUBLE-STRUCK ITALIC CAPITAL DLu19.3USN
2146DOUBLE-STRUCK ITALIC SMALL DLl19.3USN
2147DOUBLE-STRUCK ITALIC SMALL ELl19.3USN
2148DOUBLE-STRUCK ITALIC SMALL ILl19.3USN
2149DOUBLE-STRUCK ITALIC SMALL JLl19.3USN
214APROPERTY LINESo19.3USN
214BTURNED AMPERSANDSm19.3USN
214CPER SIGNSo19.3USN
214DAKTIESELSKABSo19.3USN
214ETURNED SMALL FLl19.3USN
214FSYMBOL FOR SAMARITAN SOURCESo19.3USN
 
spec/utr50/punctuation.txt · Last modified: 2012/06/08 05:03 by kojiishi
Recent changes RSS feed Valid XHTML 1.0 Valid CSS Driven by DokuWiki