Convert > Other
- Fullwidth and Halfwidth to Normal Width: Converts fullwidth and halfwidth characters in the "Halfwidth and Fullwidth Forms" block (U+FF00 through U+FFEF) to their corresponding normal-width forms in other blocks. In effect, this applies the compatibility decompositions for these characters; it is equivalent to Convert > Normalization Forms > To NFKD, but applied to fullwidth and halfwidth characters only.
- Control Codes to Control Pictures: Converts control codes in the Basic Latin block (U+0000 through U+001F and U+007F) to the corresponding graphic characters in the Control Pictures block (U+2400 through U+2426).
- ASCII to Tag Characters: Converts Basic Latin characters in the range of U+0020 through U+007E to their corresponding Tag characters in the Tags block (U+E0020 through U+E007E), converts U+00A7 § (Section Sign) to U+E0001 (Language Tag), and converts U+00A2 ¢ (Cent Sign) to U+E007F (Cancel Tag).
- Tag Characters to ASCII: Converts Tag characters in the Tags block (U+E0020 through U+E007E) to their corresponding Basic Latin characters in the range of U+0020 through U+007E, converts U+E0001 (Language Tag) to U+00A7 § (Section Sign), and converts U+E007F (Cancel Tag) to U+00A2 ¢ (Cent Sign).
- ASCII to Typographic Characters: Applies the following conversions:
  - U+0022 " (Quotation Mark) to either U+201C (Left Double Quotation Mark) or U+201D (Right Double Quotation Mark) depending on context;
  - U+0027 ' (Apostrophe) to either U+2018 (Left Single Quotation Mark) or U+2019 (Right Single Quotation Mark) depending on context;
  - Two consecutive U+002D -- (Hyphen-Minus) to U+2014 (Em Dash);
  - Sequence of digit, slash, digit to the corresponding vulgar fraction (e.g. 1/2 to U+00BD ½).
- Strip Diacritics: Converts precomposed Latin, Greek, Cyrillic and Hebrew letters with diacritical marks to the corresponding base letters (e.g. á to a). It also deletes any combining diacritical marks in the selected text.
- Strip Default Ignorable Characters: Removes all default ignorable characters from the selected text. See section headed "Default Ignorable Code Point" in the Unicode Standard section 5.21.
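
The Fullwidth and Halfwidth to Normal Width conversion can be approximated in Python as follows. This is a minimal sketch, not the application's own code: the function name to_normal_width is hypothetical, and it assumes that applying NFKD one character at a time to the U+FF00 through U+FFEF range matches the behaviour described above.

```python
import unicodedata

def to_normal_width(text: str) -> str:
    """Apply NFKD only to characters in U+FF00..U+FFEF (assumed behaviour)."""
    out = []
    for ch in text:
        if 0xFF00 <= ord(ch) <= 0xFFEF:
            # Compatibility decomposition of a fullwidth/halfwidth form.
            out.append(unicodedata.normalize("NFKD", ch))
        else:
            # Characters outside the block are left untouched, even if
            # they have compatibility decompositions of their own.
            out.append(ch)
    return "".join(out)

if __name__ == "__main__":
    print(to_normal_width("\uff21\uff22\uff23\uff11\uff12\uff13"))
```

Characters outside the block, such as the ligature U+FB01, pass through unchanged, which is the difference from running NFKD over the whole text.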
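
The Control Codes to Control Pictures conversion amounts to a fixed code point offset. The sketch below assumes the usual correspondence (U+0000 through U+001F to U+2400 through U+241F, and U+007F to U+2421); the function name controls_to_pictures is hypothetical.

```python
def controls_to_pictures(text: str) -> str:
    """Map C0 control codes and DEL to Control Pictures (assumed mapping)."""
    # U+0000..U+001F map to U+2400..U+241F by adding 0x2400.
    table = {cp: 0x2400 + cp for cp in range(0x20)}
    # U+007F (DELETE) maps to U+2421 (SYMBOL FOR DELETE).
    table[0x7F] = 0x2421
    return text.translate(table)

if __name__ == "__main__":
    print(controls_to_pictures("a\tb\n"))
```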
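
The two Tag character conversions are inverses of each other and can be sketched with a pair of translation tables. The function names ascii_to_tags and tags_to_ascii are hypothetical; the mappings follow the descriptions above, including the use of § for the Language Tag and ¢ for the Cancel Tag.

```python
def ascii_to_tags(text: str) -> str:
    """Map U+0020..U+007E to U+E0020..U+E007E, plus the two special cases."""
    table = {cp: 0xE0000 + cp for cp in range(0x20, 0x7F)}
    table[0xA7] = 0xE0001  # SECTION SIGN -> LANGUAGE TAG
    table[0xA2] = 0xE007F  # CENT SIGN -> CANCEL TAG
    return text.translate(table)

def tags_to_ascii(text: str) -> str:
    """Inverse mapping: Tag characters back to Basic Latin."""
    table = {0xE0000 + cp: cp for cp in range(0x20, 0x7F)}
    table[0xE0001] = 0xA7  # LANGUAGE TAG -> SECTION SIGN
    table[0xE007F] = 0xA2  # CANCEL TAG -> CENT SIGN
    return text.translate(table)

if __name__ == "__main__":
    tagged = ascii_to_tags("\u00a7en-US\u00a2")
    print([f"U+{ord(c):04X}" for c in tagged])
    print(tags_to_ascii(tagged))
```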
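
The ASCII to Typographic Characters conversion depends on context rules that are not spelled out here. The sketch below assumes one simple rule: a quote at the start of the text, or after whitespace or an opening bracket, is an opening quote, and any other quote is a closing quote. The function name to_typographic and the FRACTIONS table (deliberately partial) are hypothetical.

```python
import re

# Partial table for illustration; Unicode defines more vulgar fractions.
FRACTIONS = {"1/2": "\u00bd", "1/4": "\u00bc", "3/4": "\u00be"}

def to_typographic(text: str) -> str:
    """Convert ASCII quotes, double hyphens and simple fractions (assumed rules)."""
    # A quote not preceded by a non-space, non-bracket character opens.
    text = re.sub(r'(?<![^\s(\[{])"', "\u201c", text)
    text = text.replace('"', "\u201d")  # every remaining one closes
    text = re.sub(r"(?<![^\s(\[{])'", "\u2018", text)
    text = text.replace("'", "\u2019")  # closing quote or apostrophe
    # Two consecutive hyphen-minus characters become an em dash.
    text = text.replace("--", "\u2014")
    # A single digit, slash, single digit, not part of a longer number.
    return re.sub(
        r"(?<![\d/])\d/\d(?![\d/])",
        lambda m: FRACTIONS.get(m.group(0), m.group(0)),
        text,
    )

if __name__ == "__main__":
    print(to_typographic('"Hi" -- it\'s 1/2'))
```

A real implementation would need more careful context handling, for example for leading apostrophes in contractions such as 'tis.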
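
Strip Diacritics can be approximated with canonical decomposition. This is a sketch of a common technique rather than the application's actual algorithm: it decomposes each character with NFD and drops every nonspacing mark (general category Mn), which is broader than the four scripts named above. The function name strip_diacritics is hypothetical.

```python
import unicodedata

def strip_diacritics(text: str) -> str:
    """Remove nonspacing marks, decomposing precomposed letters first."""
    out = []
    for ch in text:
        decomposed = unicodedata.normalize("NFD", ch)
        kept = "".join(c for c in decomposed if unicodedata.category(c) != "Mn")
        # Only substitute when a mark was actually removed, so characters
        # that decompose without marks (e.g. Hangul syllables) stay intact.
        out.append(kept if len(kept) != len(decomposed) else ch)
    return "".join(out)

if __name__ == "__main__":
    print(strip_diacritics("caf\u00e9 \u0439 \u03ac"))
```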