To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????O^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110100111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f4f5e
SJIS-WIN ?????????????????????O^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110100111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f4f5e
EUC-JP ?????????????????????O^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110100111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f4f5e
UTF-8 챕혻혗챘쨔혙챘혻째챘혻짙챔짯혟챙혝짚챙혦혙O^ 1110110010110001100101011110110110011000101110111110110110011000100101111110110010110001100110001110110010101000100101001110110110011000100110011110110010110001100110001110110110011000101110111110110010100111101110001110110010110001100110001110110110011000101110111110110010100111100110011110110010110001100101001110110010100111101011111110110110011000100111111110110010110001100110011110110110011000100111011110110010100111100110101110110010110001100110011110110110011000101001101110110110011000100110010100111101011110 ecb195ed98bbed9897ecb198eca894ed9899ecb198ed98bbeca7b8ecb198ed98bbeca799ecb194eca7afed989fecb199ed989deca79aecb199ed98a6ed98994f5e
UHC 챕혻혗챘쨔혙챘혻째챘혻짙챔짯혟챙혝짚챙혦혙O^ 1100001110101001110000101010000011000010100000101100001110101011110000101011100111000010100001001100001110101011110000101010000011000010101100001100001110101011110000101010000011000010101000111100001110101000110000101010110111000010100010011100001110101100110000101000011111000010101001001100001110101100110000101000111011000010100001000100111101011110 c3a9c2a0c282c3abc2b9c284c3abc2a0c2b0c3abc2a0c2a3c3a8c2adc289c3acc287c2a4c3acc28ec2844f5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)