An Unicode vendor-specific character table for japanese

  1. JIS version
  2. Shift-JIS version
  3. UTF-8 version

ISO/IEC 646 Shift-JIS JIS X 0221 (JIS X 0201) JIS X 0221 (ISO/IEC 646-IRV) Unicode Consortium Java (SJIS & EUCJIS) Java (JIS) Windows 95/NT MacOS
\ (D/12, REVERSE SOLIDUS) 0x5C - U+005C (REVERSE SOLIDUS) - - U+005C (REVERSE SOLIDUS) - -
~ (F/14, TILDE) 0x7E - U+007E (TILDE) - - U+007E (TILDE) - -
JIS X 0201 Shift-JIS JIS X 0221 (JIS X 0201) JIS X 0221 (ISO/IEC 646-IRV) Unicode Consortium Java (SJIS & EUCJIS) Java (JIS) Windows 95/NT MacOS
\ (D/12, YEN SIGN) 0x5C U+00A5 (YEN SIGN) - U+00A5 (YEN SIGN) U+005C (REVERSE SOLIDUS) U+00A5 (YEN SIGN) U+005C (REVERSE SOLIDUS) U+00A5 (YEN SIGN)
~ (F/14, OVERLINE) 0x7E U+203E (OVERLINE) - U+203E (OVERLINE) U+007E (TILDE) U+203E (OVERLINE) U+007E (TILDE) U+007E (TILDE)
JIS X 0208 Shift-JIS JIS X 0221 (JIS X 0201) JIS X 0221 (ISO/IEC 646-IRV) Unicode Consortium Java (SJIS & EUCJIS) Java (JIS) Windows 95/NT MacOS
P (1-17, OVERLINE) 0x8150 U+FFE3 (FULLWIDTH MACRON) U+203E (OVERLINE) U+FFE3 (FULLWIDTH MACRON) U+FFE3 (FULLWIDTH MACRON) U+FFE3 (FULLWIDTH MACRON) U+FFE3 (FULLWIDTH MACRON) U+203E (OVERLINE)
\ (1-29, EM DASH) 0x815C U+2014 (EM DASH) U+2014 (EM DASH) U+2015 (HORIZONTAL BAR) U+2015 (HORIZONTAL BAR) U+2015 (HORIZONTAL BAR) U+2015 (HORIZONTAL BAR) U+2014 (EM DASH)
_ (1-32, REVERSE SOLIDUS) 0x815F U+005C (REVERSE SOLIDUS) U+FF3C (FULLWIDTH REVERSE SOLIDUS) U+005C (REVERSE SOLIDUS) U+FF3C (FULLWIDTH REVERSE SOLIDUS) U+FF3C (FULLWIDTH REVERSE SOLIDUS) U+FF3C (FULLWIDTH REVERSE SOLIDUS) U+FF3C (FULLWIDTH REVERSE SOLIDUS)
` (1-33, WAVE DASH) 0x8160 U+301C (WAVE DASH) U+301C (WAVE DASH) U+301C (WAVE DASH) U+301C (WAVE DASH) U+301C (WAVE DASH) U+FF5E (FULLWIDTH TILDE) U+301C (WAVE DASH)
a (1-34, DOUBLE VERTICAL LINE) 0x8161 U+2016 (DOUBLE VERTICAL LINE) U+2016 (DOUBLE VERTICAL LINE) U+2016 (DOUBLE VERTICAL LINE) U+2016 (DOUBLE VERTICAL LINE) U+2016 (DOUBLE VERTICAL LINE) U+2225 (PARALLEL TO) U+2016 (DOUBLE VERTICAL LINE)
c (1-36, HORIZONTAL ELLIPSIS) 0x8163 U+2026 (HORIZONTAL ELLIPSIS) U+2026 (HORIZONTAL ELLIPSIS) U+2026 (HORIZONTAL ELLIPSIS) U+2026 (HORIZONTAL ELLIPSIS) U+2026 (HORIZONTAL ELLIPSIS) U+2026 (HORIZONTAL ELLIPSIS) U+22EF (MIDLINE HORIZONTAL ELLIPSIS)
| (1-61, MINUS SIGN) 0x817C U+2212 (MINUS SIGN) U+2212 (MINUS SIGN) U+2212 (MINUS SIGN) U+2212 (MINUS SIGN) U+2212 (MINUS SIGN) U+FF0D (FULLWIDTH HYPHEN-MINUS) U+2212 (MINUS SIGN)
(1-79, YEN SIGN) 0x818F U+FFE5 (FULLWIDTH YEN SIGN) U+00A5 (YEN SIGN) U+FFE5 (FULLWIDTH YEN SIGN) U+FFE5 (FULLWIDTH YEN SIGN) U+FFE5 (FULLWIDTH YEN SIGN) U+FFE5 (FULLWIDTH YEN SIGN) U+FFE5 (FULLWIDTH YEN SIGN)
(1-81, CENT SIGN) 0x8191 U+00A2 (CENT SIGN) U+00A2 (CENT SIGN) U+00A2 (CENT SIGN) U+00A2 (CENT SIGN) U+00A2 (CENT SIGN) U+FFE0 (FULLWIDTH CENT SIGN) U+00A2 (CENT SIGN)
(1-82, POUND SIGN) 0x8192 U+00A3 (POUND SIGN) U+00A3 (POUND SIGN) U+00A3 (POUND SIGN) U+00A3 (POUND SIGN) U+00A3 (POUND SIGN) U+FFE1 (FULLWIDTH POUND SIGN) U+00A3 (POUND SIGN)
(2-44, NOT SIGN) 0x81CA U+00AC (NOT SIGN) U+00AC (NOT SIGN) U+00AC (NOT SIGN) U+00AC (NOT SIGN) U+00AC (NOT SIGN) U+FFE2 (FULLWIDTH NOT SIGN) U+00AC (NOT SIGN)
JIS X 0212 Shift-JIS JIS X 0221 (JIS X 0201) JIS X 0221 (ISO/IEC 646-IRV) Unicode Consortium Java (SJIS & EUCJIS) Java (JIS) Windows 95/NT MacOS
(2-23, TILDE) - U+007E (TILDE) U+FF5E (FULL WIDTH TILDE) - - - - -

Special thanks to Lori Brownell <loribr@microsoft.com>, Toshihiro Nishimura <nishi@jd.cs.fujitsu.co.jp>, Yoshiyuki Yamaguchi <yama@kanto-ele.co.jp>, Sakita Takumi <sakita@ffc.newton.co.jp>, Katsuhiro Mihara <mihara@is.titech.ac.jp>.


References

  1. The Unicode Consortium: The Unicode Standard, Version 2.0, Addison Wesley, 1996.
  2. Japanese Industrial Standards Committee: Universal Multiple-Octet Coded Character Set (UCS) - Part 1: Architecture and Basic Multilingual Plane JIS X 0221-1995 (ISO/IEC 10646-1:1993)
  3. Glenn Adams, John H. Jenkins: Shift-JIS to Unicode (version 0.9), Unicode 1.1 & 2.0.
  4. Lori Brownell, K.D. Chang: cp932_ShiftJIS to Unicode table (version 2.0), Unicode 2.0.
  5. Peter Edberg: MacOS_Japanese [to Unicode] (version 0.2), Unicode 1.1 & 2.0.
  6. TOG/JVC CDE/Motif Technical WG: Problems and Solutions for Unicode and User/Vendor Defined Characters

Kazuhiro Kazama (Ingrid Project)