To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 繒攀?垣?彫逗??? 111110111000111110011101101100110011111110001010010111110011111110010010101001001001000010000000001111110011111100111111 fb8f9db33f8a5f3f92a490803f3f3f
EUC-JP 繒攀?垣?彫逗??? 10001111110101001101010011011010101101010011111110110011110000000011111111000100101001101011111111100000001111110011111100111111 8fd4d4dab53fb3c03fc4a6bfe03f3f3f
UTF-8 繒攀쑨垣렖彫逗렭렞炡 111001111011100110010010111001101001010010000000111011001001000110101000111001011001111010100011111010111010000010010110111001011011110110101011111010011000000010010111111010111010000010101101111010111010000010011110111001111000001010100001 e7b992e69480ec91a8e59ea3eba096e5bdabe98097eba0adeba09ee782a1
UHC 繒攀쑨垣렖彫逗렭렞炡 1111000111111001110110101110011110111110101001111110101010101111100011101010101111110000110000011101010011101000100011101011101010001110101011111110111111101000 f1f9dae7bea7eaaf8eabf0c1d4e88eba8eafefe8

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)