To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 鸚??意??議??艶k?悠??猿??? 1110101001011111001111110011111110001000110100110011111100111111100010110110001100111111001111111000100110010000100000101000101100111111100101110100100100111111001111111000100110001110001111110011111100111111 ea5f3f3f88d33f3f8b633f3f8990828b3f97493f3f898e3f3f3f
EUC-JP 鸚??意??議??艶k?悠??猿??孼 11110011110000000011111100111111101100001101010100111111001111111011010111000100001111110011111110110001111100001010001111101011001111111100110110101010001111110011111110110001111011100011111100111111100011111011101011000011 f3c03f3fb0d53f3fb5c43f3fb1f0a3eb3fcdaa3f3fb1ee3f3f8fbac3
UTF-8 鸚쒓퍓意쎿룚議용옜艶k챷悠뉐쩂猿딆뵛孼 111010011011100010011010111011001001001010010011111011011000110110010011111001101000010010001111111011001000111010111111111010111010001110011010111010001010110110110000111011001001101010101001111011001001100010011100111010001000100110110110111011111011110110001011111011001011000110110111111001101000001010100000111010111000100110010000111011001010100110000010111001111000110010111111111010111001010010000110111010111011010110011011111001011010110110111100 e9b89aec9293ed8d93e6848fec8ebfeba39ae8adb0ec9aa9ec989ce889b6efbd8becb1b7e682a0eb8990eca982e78cbfeb9486ebb59be5adbc
UHC 鸚쒓퍓意쎿룚議용옜艶k챷悠뉐쩂猿딆뵛孼 1110010110100100100111001110101010111011100010101110101111110010100110111110011010001111100101101110110010100001101111111110101110111111101111111110011011111101101000111110101110101010100001001110101011101101100001111110010110100100100111001110101010111011100010101110110010010100100110111110010111101101 e5a49ceabb8aebf29be68f96eca1bfebbfbfe6fda3ebaa84eaed87e5a49ceabb8aec949be5ed

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)