Which bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??悠??扱碣押り?幼?い 11100001100111110011111100111111100101110100100100111111001111111000100010110101111000011111000010001001100111111000001011101000001111111001011101100011001111111000001010100010 e19f3f3f97493f3f88b5e1f0899f82e83f97633f82a2
EUC-JP 癲??悠??扱碣押り?幼?い 11100010101000010011111100111111110011011010101000111111001111111011000010110111111000101111001010110010101000011010010011101010001111111100110111000100001111111010010010100100 e2a13f3fcdaa3f3fb0b7e2f2b2a1a4ea3fcdc43fa4a4
UTF-8 癲됱떜悠띷룚扱碣押り램幼뗨い 111001111001100110110010111010111001000010110001111010111001011010011100111001101000001010100000111010111001110110110111111010111010001110011010111001101000100110110001111001111010001010100011111001101000101010111100111000111000001010001010111010111001111010101000111001011011100110111100111010111001011110101000111000111000000110000100 e799b2eb90b1eb969ce682a0eb9db7eba39ae689b1e7a2a3e68abce3828aeb9ea8e5b9bceb97a8e38184
UHC 癲됱떜悠띷룚扱碣押り램幼뗨い 11101111101001101000100111101100100010111011001011101010111011011000110111100110100011111001011011010000111000101100101011100101111001001110001110101010111010101011011110100101111010101110101010001011111010001010101010100100 efa689ec8bb2eaed8de68f96d0e2cae5e4e3aaeab7a5eaea8be8aaa4

SJIS-Win, EUC-JP: Legacy character sets used mainly for encoding Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
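
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the tool's own implementation: the helper name `bit_strings` and the `CODECS` mapping are hypothetical, and it assumes Python's `cp932` and `cp949` codecs are close enough stand-ins for SJIS-WIN and UHC. Characters that a charset cannot represent are replaced with `?` (byte `3f`), which is the pattern visible in the ISO-8859-1 row.

```python
# Assumed mapping from the table's charset labels to Python codec names.
# These names are Python's; the converter itself may use a different library
# whose mapping tables differ in edge cases.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def bit_strings(text, codec):
    """Return (binary, hexadecimal) strings for text encoded with codec.

    Unencodable characters are replaced with '?' (0x3f).
    """
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


if __name__ == "__main__":
    for label, codec in CODECS.items():
        binary, hexadecimal = bit_strings("い", codec)
        print(label, binary, hexadecimal)
```

For the single character い this reproduces the final bytes of each row above: `3f` for ISO-8859-1, `82a2` for SJIS-WIN, `a4a4` for EUC-JP, `e38184` for UTF-8, and `aaa4` for UHC.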