To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
EUC-JP ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
UTF-8 챦짝혞챘짧혘챘챰쨌챦짠혡횖창챘짙혳챕혺쨀챘창 111011001011000110100110111011001010011110011101111011011001100010011110111011001011000110011000111011001010011110100111111011011001100010011000111011001011000110011000111011001011000110110000111011001010100010001100111011001011000110100110111011001010011110100000111011011001100010100001111011011001101010010110111011001011000010111101111011001011000110011000111011001010011110011001111011011001100010110011111011001011000110010101111011011001100010111010111011001010100010000000111011001011000110011000111011001011000010111101 ecb1a6eca79ded989eecb198eca7a7ed9898ecb198ecb1b0eca88cecb1a6eca7a0ed98a1ed9a96ecb0bdecb198eca799ed98b3ecb195ed98baeca880ecb198ecb0bd
UHC 챦짝혞챘짧혘챘챰쨌챦짠혡횖창챘짙혳챕혺쨀챘창 1100001110101111110000101010011011000010100010001100001110101011110000101010101011000010100000111100001110101011110000111011000111000010101101111100001110101111110000101010011111000010100010101100001110010000110000111010001011000011101010111100001010100011110000101001101011000011101010011100001010011111110000101011001111000011101010111100001110100010 c3afc2a6c288c3abc2aac283c3abc3b1c2b7c3afc2a7c28ac390c3a2c3abc2a3c29ac3a9c29fc2b3c3abc3a2

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)