To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???C???U 0011111100111111001111110100001100111111001111110011111101010101 3f3f3f433f3f3f55
SJIS-WIN ??縟C??縟U 00111111001111111110001101110100010000110011111100111111111000110111010001010101 3f3fe374433f3fe37455
EUC-JP ??縟C??縟U 00111111001111111110010111010101010000110011111100111111111001011101010101010101 3f3fe5d5433f3fe5d555
UTF-8 쐛숰縟C쐛숰縟U 1110110010010000100110111110110010001000101100001110011110111000100111110100001111101100100100001001101111101100100010001011000011100111101110001001111101010101 ec909bec88b0e7b89f43ec909bec88b0e7b89f55
UHC 쐛숰縟C쐛숰縟U 1001110010000001100110100100100011101001101100100100001110011100100000011001101001001000111010011011001001010101 9c819a48e9b2439c819a48e9b255

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)