What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN ?????躪畯 001111110011111100111111001111110011111111100111010110001111101101101111 3f3f3f3f3fe758fb6f
EUC-JP 焌??饔?躪畯 1000111111001001111010000011111100111111100011111110100011101111001111111110110110111001100011111100110110111011 8fc9e83f3f8fe8ef3fedb98fcdbb
UTF-8 焌셍렟饔닺躪畯 111001111000010010001100111011001000010110001101111010111010000010011111111010011010010110010100111010111000101110111010111010001011101010101010111001111001010110101111 e7848cec858deba09fe9a594eb8bbae8baaae795af
UHC 焌셍렟饔닺躪畯 1111000111100000101111001100010010001110101100001110100010111101101101001110100011010111111101011111000111100001 f1e0bcc48eb0e8bdb4e8d7f5f1e1

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP). A character that a charset cannot represent is shown as "?" (byte 3f).
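
A table like the one above can be approximated with a short Python sketch. This is not the converter's own implementation. The function name `encode_table` and the mapping from the charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) are assumptions made for illustration. Python's codecs may also map a few characters differently from the tool, so the output for rare characters is not guaranteed to match byte for byte.

```python
# Assumed mapping from the labels used in the table to Python codec names.
CHARSETS = [
    ("ISO-8859-1", "iso-8859-1"),
    ("SJIS-WIN", "cp932"),   # assumption: CP932 stands in for SJIS-win
    ("EUC-JP", "euc_jp"),
    ("UTF-8", "utf-8"),
    ("UHC", "cp949"),        # assumption: CP949 stands in for UHC
]


def encode_table(text, charsets=CHARSETS):
    """Return (label, round-tripped text, binary string, hex string) per charset."""
    rows = []
    for label, codec in charsets:
        # Characters the charset cannot represent become "?" (byte 0x3f).
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        shown = raw.decode(codec, errors="replace")
        rows.append((label, shown, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, shown, bits, hexstr in encode_table("あ"):
        print(label, shown, bits, hexstr)
```

Each byte is written as eight binary digits, so the binary column is always four times as long as the hexadecimal column.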