To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????\}?????????\{^ 0011111100111111001111110011111100111111001111110011111100111111001111110101110001111101001111110011111100111111001111110011111100111111001111110011111100111111010111000111101101011110 3f3f3f3f3f3f3f3f3f5c7d3f3f3f3f3f3f3f3f3f5c7b5e
SJIS-WIN 嚥?????違??\}嚥?????違??\{^ 100110101000101100111111001111110011111100111111001111111000100011100001001111110011111101011100011111011001101010001011001111110011111100111111001111110011111110001000111000010011111100111111010111000111101101011110 9a8b3f3f3f3f3f88e13f3f5c7d9a8b3f3f3f3f3f88e13f3f5c7b5e
EUC-JP 嚥??瑗??違??\}嚥??瑗??違??\{^ 11010011111010110011111100111111100011111100110011000000001111110011111110110000111000110011111100111111010111000111110111010011111010110011111100111111100011111100110011000000001111110011111110110000111000110011111100111111010111000111101101011110 d3eb3f3f8fccc03f3fb0e33f3f5c7dd3eb3f3f8fccc03f3fb0e33f3f5c7b5e
UTF-8 嚥싳쉸瑗뜸뭄違꾩뒜\}嚥싳쉸瑗뜸뭄違꾩뒜\{^ 1110010110011010101001011110110010001011101100111110110010001001101110001110011110010001100101111110101110011100101110001110101110101101100001001110100110000001100101011110101010111110101010011110101110010010100111000101110001111101111001011001101010100101111011001000101110110011111011001000100110111000111001111001000110010111111010111001110010111000111010111010110110000100111010011000000110010101111010101011111010101001111010111001001010011100010111000111101101011110 e59aa5ec8bb3ec89b8e79197eb9cb8ebad84e98195eabea9eb929c5c7de59aa5ec8bb3ec89b8e79197eb9cb8ebad84e98195eabea9eb929c5c7b5e
UHC 嚥싳쉸瑗뜸뭄違꾩뒜\}嚥싳쉸瑗뜸뭄違꾩뒜\{^ 1110011010111111100110101110110010011010100011101110101010111100101101101110010010111001101100111110101011011110100001001110110010001010100110010101110001111101111001101011111110011010111011001001101010001110111010101011110010110110111001001011100110110011111010101101111010000100111011001000101010011001010111000111101101011110 e6bf9aec9a8eeabcb6e4b9b3eade84ec8a995c7de6bf9aec9a8eeabcb6e4b9b3eade84ec8a995c7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)