To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????M@????M@B 00111111001111110011111100111111010011010100000000111111001111110011111100111111010011010100000001000010 3f3f3f3f4d403f3f3f3f4d4042
SJIS-WIN ????M@????M@B 00111111001111110011111100111111010011010100000000111111001111110011111100111111010011010100000001000010 3f3f3f3f4d403f3f3f3f4d4042
EUC-JP ????M@????M@B 00111111001111110011111100111111010011010100000000111111001111110011111100111111010011010100000001000010 3f3f3f3f4d403f3f3f3f4d4042
UTF-8 횙짤횙쨔M@횙짤횙쨔M@B 1110110110011010100110011110110010100111101001001110110110011010100110011110110010101000100101000100110101000000111011011001101010011001111011001010011110100100111011011001101010011001111011001010100010010100010011010100000001000010 ed9a99eca7a4ed9a99eca8944d40ed9a99eca7a4ed9a99eca8944d4042
UHC 횙짤횙쨔M@횙짤횙쨔M@B 110000111001001111000010101010011100001110010011110000101011100101001101010000001100001110010011110000101010100111000011100100111100001010111001010011010100000001000010 c393c2a9c393c2b94d40c393c2a9c393c2b94d4042

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)