To what bit string is a character (or a short string) encoded in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
| Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
|---|---|---|---|
| ISO-8859-1 | ??????B | 00111111001111110011111100111111001111110011111101000010 | 3f3f3f3f3f3f42 |
| SJIS-WIN | 綬鼇碍綬鼇碍B | 10001110111110001110101010000111100010100101011010001110111110001110101010000111100010100101011001000010 | 8ef8ea878a568ef8ea878a5642 |
| EUC-JP | 綬鼇碍綬鼇碍B | 10111100111110101111001111100111101100111011011110111100111110101111001111100111101100111011011101000010 | bcfaf3e7b3b7bcfaf3e7b3b742 |
| UTF-8 | 綬鼇碍綬鼇碍B | 11100111101101101010110011101001101111001000011111100111101000101000110111100111101101101010110011101001101111001000011111100111101000101000110101000010 | e7b6ace9bc87e7a28de7b6ace9bc87e7a28d42 |
| UHC | 綬鼇碍綬鼇碍B | 11100010101110001110100010101000111001001111010011100010101110001110100010101000111001001111010001000010 | e2b8e8a8e4f4e2b8e8a8e4f442 |

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
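
The conversion shown in the table can be approximated with a short Python sketch. This is not the code behind this page. The function name `encode_to_bits` is hypothetical, and the codec names (`cp932` for SJIS-Win, `euc_jp` for EUC-JP, `cp949` for UHC) are assumed to be the closest Python equivalents. Characters that a charset cannot represent are replaced with `?` (0x3f), which would explain the ISO-8859-1 row above.

```python
def encode_to_bits(text, charset):
    """Return (binary string, hex string) for text encoded in charset.

    Hypothetical helper for illustration. Characters the charset cannot
    represent are replaced with '?' instead of raising an error.
    """
    data = text.encode(charset, errors="replace")
    # Each byte becomes exactly 8 binary digits, most significant bit first.
    bits = "".join(format(byte, "08b") for byte in data)
    return bits, data.hex()


if __name__ == "__main__":
    # Assumed mapping from the table's charset labels to Python codec names.
    codecs_by_label = {
        "ISO-8859-1": "iso-8859-1",
        "SJIS-WIN": "cp932",
        "EUC-JP": "euc_jp",
        "UTF-8": "utf-8",
        "UHC": "cp949",
    }
    for label, codec in codecs_by_label.items():
        bits, hex_string = encode_to_bits("綬B", codec)
        print(label, bits, hex_string)
```

The binary and hexadecimal columns are two renderings of the same bytes: every pair of hexadecimal digits corresponds to one 8-bit group in the binary string.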