To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???殃≫?鴉???? 0011111100111111001111111001111101101001100000011110001000111111111010011110101100111111001111110011111100111111 3f3f3f9f6981e23fe9eb3f3f3f3f
EUC-JP ???殃≫?鴉???? 0011111100111111001111111101110111001010101000101110010000111111111100101110110100111111001111110011111100111111 3f3f3fddcaa2e43ff2ed3f3f3f3f
UTF-8 溜븐쓻殃≫뙇鴉쇰㎗溜칆 111011111010011110001011111010111011100010010000111011001001001110111011111001101010111010000011111000101000100110101011111010111001100110000111111010011011010010001001111011001000011110110000111000111000111010010111111011111010011110001011111011001011100110000110 efa78bebb890ec93bbe6ae83e289abeb9987e9b489ec87b0e38e97efa78becb986
UHC 溜븐쓻殃≫뙇鴉쇰㎗溜칆 11101010111111101011101011101100100111011001011011100100111010101010000111101101100011001000110111100100101111001011110011101011101001111010001111101010111111101010111101010111 eafebaec9d96e4eaa1ed8c8de4bcbceba7a3eafeaf57

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)