To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????AB 00111111001111110011111100111111001111110011111100111111001111110100000101000010 3f3f3f3f3f3f3f3f4142
SJIS-WIN 哀???哀??4AB 10001000101000110011111100111111001111111000100010100011001111110011111110000010010100110100000101000010 88a33f3f3f88a33f3f82534142
EUC-JP 哀???哀??4AB 10110000101001010011111100111111001111111011000010100101001111110011111110100011101101000100000101000010 b0a53f3f3fb0a53f3fa3b44142
UTF-8 哀읪딄퐤哀읪낅4AB 1110010110010011100000001110110010011101101010101110101110010100100001001110110110010000101001001110010110010011100000001110110010011101101010101110101110000010100001011110111110111100100101000100000101000010 e59380ec9daaeb9484ed90a4e59380ec9daaeb8285efbc944142
UHC 哀읪딄퐤哀읪낅4AB 111001001110111010011111110100011000101011101010101111011000110111100100111011101001111111010001100001011110101110100011101101000100000101000010 e4ee9fd18aeabd8de4ee9fd185eba3b44142

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)