What bit string is a character (or a short string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 弔????耿??趙貊? 100100101010001000111111001111110011111100111111111000111101010000111111001111111110011011100010111001101011101100111111 92a23f3f3f3fe3d43f3fe6e2e6bb3f
EUC-JP 弔?勖?饔耿??趙貊? 11000100101001000011111110001111101100111110110100111111100011111110100011101111111001101101011000111111001111111110110011100100111011001011110100111111 c4a43f8fb3ed3f8fe8efe6d63f3fece4ecbd3f
UTF-8 弔렲勖렢饔耿렕렟趙貊긺 111001011011110010010100111010111010000010110010111001011000101110010110111010111010000010100010111010011010010110010100111010001000000010111111111010111010000010010101111010111010000010011111111010001011011010011001111010001011001010001010111010101011100010111010 e5bc94eba0b2e58b96eba0a2e9a594e880bfeba095eba09fe8b699e8b28aeab8ba
UHC 弔렲勖렢饔耿렕렟趙貊긺 11110000110000001000111010111111111010011110110110001110101100111110100010111101110011001110101010001110101010101000111010110000111100001110000111011000111001111011000111100111 f0c08ebfe9ed8eb3e8bdccea8eaa8eb0f0e1d8e7b1e7

SJIS-Win, EUC-JP: Legacy charsets used mainly for encoding Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
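
A conversion like the one in the table can be sketched in Python. This is only an illustration, not the code behind this page: the mapping from the charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) is an assumption, and `encode_table` is a hypothetical helper name. Characters that a charset cannot represent are replaced with `?` (0x3f), which is consistent with the runs of `3f` shown above, although the output of Python's codecs may not match this page byte for byte in every case.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CODECS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        raw = text.encode(codec, errors="replace")
        # Each byte is written as exactly 8 binary digits.
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_table("弔"):
        print(label, bits, hex_string)
```

Padding each byte to 8 bits keeps the binary and hexadecimal columns aligned: every two hex digits correspond to one group of eight binary digits.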