To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
EUC-JP ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
UTF-8 챦짠짙챙쨉쨩챦짠혷챙혢쨀챙쩍혶챦짠쨉횓쩌챙째 111011001011000110100110111011001010011110100000111011001010011110011001111011001011000110011001111011001010100010001001111011001010100010101001111011001011000110100110111011001010011110100000111011011001100010110111111011001011000110011001111011011001100010100010111011001010100010000000111011001011000110011001111011001010100110001101111011011001100010110110111011001011000110100110111011001010011110100000111011001010100010001001111011011001101010010011111011001010100110001100111011001011000110011001111011001010011110111000 ecb1a6eca7a0eca799ecb199eca889eca8a9ecb1a6eca7a0ed98b7ecb199ed98a2eca880ecb199eca98ded98b6ecb1a6eca7a0eca889ed9a93eca98cecb199eca7b8
UHC 챦짠짙챙쨉쨩챦짠혷챙혢쨀챙쩍혶챦짠쨉횓쩌챙째 1100001110101111110000101010011111000010101000111100001110101100110000101011010111000010101110111100001110101111110000101010011111000010100111101100001110101100110000101000101111000010101100111100001110101100110000101011110111000010100111011100001110101111110000101010011111000010101101011100001110001110110000101011110011000011101011001100001010110000 c3afc2a7c2a3c3acc2b5c2bbc3afc2a7c29ec3acc28bc2b3c3acc2bdc29dc3afc2a7c2b5c38ec2bcc3acc2b0

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)