To what bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
| Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
| --- | --- | --- | --- |
| ISO-8859-1 | ???^?????????^??????^ | 001111110011111100111111010111100011111100111111001111110011111100111111001111110011111100111111001111110101111000111111001111110011111100111111001111110011111101011110 | 3f3f3f5e3f3f3f3f3f3f3f3f3f5e3f3f3f3f3f3f5e |
| SJIS-WIN | ???^??咐??咐???^??咐??咐^ | 00111111001111110011111101011110001111110011111110011001111100110011111100111111100110011111001100111111001111110011111101011110001111110011111110011001111100110011111100111111100110011111001101011110 | 3f3f3f5e3f3f99f33f3f99f33f3f3f5e3f3f99f33f3f99f35e |
| EUC-JP | ???^??咐??咐???^??咐??咐^ | 00111111001111110011111101011110001111110011111111010010111101010011111100111111110100101111010100111111001111110011111101011110001111110011111111010010111101010011111100111111110100101111010101011110 | 3f3f3f5e3f3fd2f53f3fd2f53f3f3f5e3f3fd2f53f3fd2f55e |
| UTF-8 | 룶웩熉^룶웩咐룶웩咐룶웩熉^룶웩咐룶웩咐^ | 111010111010001110110110111011001001101110101001111001111000011010001001010111101110101110100011101101101110110010011011101010011110010110010010100100001110101110100011101101101110110010011011101010011110010110010010100100001110101110100011101101101110110010011011101010011110011110000110100010010101111011101011101000111011011011101100100110111010100111100101100100101001000011101011101000111011011011101100100110111010100111100101100100101001000001011110 | eba3b6ec9ba9e786895eeba3b6ec9ba9e59290eba3b6ec9ba9e59290eba3b6ec9ba9e786895eeba3b6ec9ba9e59290eba3b6ec9ba9e592905e |
| UHC | 룶웩熉^룶웩咐룶웩咐룶웩熉^룶웩咐룶웩咐^ | 100011111010101111000000101000011110100111111011010111101000111110101011110000001010000111011100111110111000111110101011110000001010000111011100111110111000111110101011110000001010000111101001111110110101111010001111101010111100000010100001110111001111101110001111101010111100000010100001110111001111101101011110 | 8fabc0a1e9fb5e8fabc0a1dcfb8fabc0a1dcfb8fabc0a1e9fb5e8fabc0a1dcfb8fabc0a1dcfb5e |

SJIS-Win, EUC-JP: Classic character sets used mainly for Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
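
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the converter's actual implementation: the helper names `encode_row` and `encode_table` are hypothetical, and mapping the page's labels SJIS-WIN and UHC to Python's `cp932` and `cp949` codecs is an assumption. Characters that a charset cannot represent are replaced with "?" (0x3f), which is the behavior visible in the ISO-8859-1 row.

```python
# Assumed mapping from the page's charset labels to Python codec names.
# SJIS-WIN -> cp932 and UHC -> cp949 are assumptions, not confirmed by the page.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (displayed text, binary bit string, hex bit string) for one codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    # Each byte becomes 8 zero-padded binary digits, concatenated without spaces.
    binary = "".join(f"{byte:08b}" for byte in raw)
    # Decode the bytes again to show what actually survived the conversion.
    shown = raw.decode(codec, errors="replace")
    return shown, binary, raw.hex()


def encode_table(text):
    """Encode the same text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (shown, binary, hexa) in encode_table("あ^").items():
        print(label, shown, binary, hexa)
```

For example, `encode_table("A")["UTF-8"]` returns `("A", "01000001", "41")`.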