To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???{??????L???{??????L^ 0011111100111111001111110111101100111111001111110011111100111111001111110011111101001100001111110011111100111111011110110011111100111111001111110011111100111111001111110100110001011110 3f3f3f7b3f3f3f3f3f3f4c3f3f3f7b3f3f3f3f3f3f4c5e
SJIS-WIN ??咐{??咐??咐L??咐{??咐??咐L^ 0011111100111111100110011111001101111011001111110011111110011001111100110011111100111111100110011111001101001100001111110011111110011001111100110111101100111111001111111001100111110011001111110011111110011001111100110100110001011110 3f3f99f37b3f3f99f33f3f99f34c3f3f99f37b3f3f99f33f3f99f34c5e
EUC-JP ??咐{??咐??咐L??咐{??咐??咐L^ 0011111100111111110100101111010101111011001111110011111111010010111101010011111100111111110100101111010101001100001111110011111111010010111101010111101100111111001111111101001011110101001111110011111111010010111101010100110001011110 3f3fd2f57b3f3fd2f53f3fd2f54c3f3fd2f57b3f3fd2f53f3fd2f54c5e
UTF-8 룶웩咐{룶웩咐룶웩咐L룶웩咐{룶웩咐룶웩咐L^ 1110101110100011101101101110110010011011101010011110010110010010100100000111101111101011101000111011011011101100100110111010100111100101100100101001000011101011101000111011011011101100100110111010100111100101100100101001000001001100111010111010001110110110111011001001101110101001111001011001001010010000011110111110101110100011101101101110110010011011101010011110010110010010100100001110101110100011101101101110110010011011101010011110010110010010100100000100110001011110 eba3b6ec9ba9e592907beba3b6ec9ba9e59290eba3b6ec9ba9e592904ceba3b6ec9ba9e592907beba3b6ec9ba9e59290eba3b6ec9ba9e592904c5e
UHC 룶웩咐{룶웩咐룶웩咐L룶웩咐{룶웩咐룶웩咐L^ 1000111110101011110000001010000111011100111110110111101110001111101010111100000010100001110111001111101110001111101010111100000010100001110111001111101101001100100011111010101111000000101000011101110011111011011110111000111110101011110000001010000111011100111110111000111110101011110000001010000111011100111110110100110001011110 8fabc0a1dcfb7b8fabc0a1dcfb8fabc0a1dcfb4c8fabc0a1dcfb7b8fabc0a1dcfb8fabc0a1dcfb4c5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)