To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??碇ヒ〓??碇ぎ?羈 0011111100111111100100101111010010000011011100011000000110101100001111110011111110010010111101001000001010101100001111111110001110110001 3f3f92f4837181ac3f3f92f482ac3fe3b1
EUC-JP ??碇ヒ〓??碇ぎ?羈 0011111100111111110001001111011010100101110100101010001010101110001111110011111111000100111101101010010010101110001111111110011010110011 3f3fc4f6a5d2a2ae3f3fc4f6a4ae3fe6b3
UTF-8 ∼룫碇ヒ〓∼룫碇ぎ룶羈 111000101000100010111100111010111010001110101011111001111010001010000111111000111000001110010010111000111000000010010011111000101000100010111100111010111010001110101011111001111010001010000111111000111000000110001110111010111010001110110110111001111011111010001000 e288bceba3abe7a287e38392e38093e288bceba3abe7a287e3818eeba3b6e7be88
UHC ∼룫碇ヒ〓∼룫碇ぎ룶羈 10100001101011011000111110100010111011111110110110101011110100101010000111101011101000011010110110001111101000101110111111101101101010101010111010001111101010111101000110111100 a1ad8fa2efedabd2a1eba1ad8fa2efedaaae8fabd1bc

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)