To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????C???? 00111111001111110011111100111111001111110100001100111111001111110011111100111111 3f3f3f3f3f433f3f3f3f
SJIS-WIN ????港C???太 001111110011111100111111001111111000110101100000010000110011111100111111001111111001000110111110 3f3f3f3f8d60433f3f3f91be
EUC-JP ????港C???太 001111110011111100111111001111111011100111000001010000110011111100111111001111111100001011000000 3f3f3f3fb9c1433f3f3fc2c0
UTF-8 뤶쫷띵딪港C쫷띵딪太 11101011101001001011011011101100101010111011011111101011100111011011010111101011100101001010101011100110101110001010111101000011111011001010101110110111111010111001110110110101111010111001010010101010111001011010010010101010 eba4b6ecabb7eb9db5eb94aae6b8af43ecabb7eb9db5eb94aae5a4aa
UHC 뤶쫷띵딪港C쫷띵딪太 10001111111001001010011010001110101101101111001010110101111110101111100111111011010000111010011010001110101101101111001010110101111110101111011110111100 8fe4a68eb6f2b5faf9fb43a68eb6f2b5faf7bc

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)