To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????B 001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f42
SJIS-WIN 哀???哀???B 1000100010100011001111110011111100111111100010001010001100111111001111110011111101000010 88a33f3f3f88a33f3f3f42
EUC-JP 哀???哀???B 1011000010100101001111110011111100111111101100001010010100111111001111110011111101000010 b0a53f3f3fb0a53f3f3f42
UTF-8 哀넘됱죦哀넘됱죦B 11100101100100111000000011101011100001001001100011101011100100001011000111101100101000111010011011100101100100111000000011101011100001001001100011101011100100001011000111101100101000111010011001000010 e59380eb8498eb90b1eca3a6e59380eb8498eb90b1eca3a642
UHC 哀넘됱죦哀넘됱죦B 1110010011101110101100111101000110001001111011001010000110000001111001001110111010110011110100011000100111101100101000011000000101000010 e4eeb3d189eca181e4eeb3d189eca18142

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)