Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????????C???C?????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110100001100111111001111110011111101000011001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f433f3f3f433f3f3f3f3f3f
SJIS-WIN ???????????C???C?????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110100001100111111001111110011111101000011001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f433f3f3f433f3f3f3f3f3f
EUC-JP ???????????C???C?????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110100001100111111001111110011111101000011001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f433f3f3f433f3f3f3f3f3f
UTF-8 창횁천쨉챨챦횒횤찾횙체C횒횤체C챨챦창횁철째 1110110010110000101111011110110110011010100000011110110010110010100111001110110010101000100010011110110010110001101010001110110010110001101001101110110110011010100100101110110110011010101001001110110010110000101111101110110110011010100110011110110010110010101101000100001111101101100110101001001011101101100110101010010011101100101100101011010001000011111011001011000110101000111011001011000110100110111011001011000010111101111011011001101010000001111011001011001010100000111011001010011110111000 ecb0bded9a81ecb29ceca889ecb1a8ecb1a6ed9a92ed9aa4ecb0beed9a99ecb2b443ed9a92ed9aa4ecb2b443ecb1a8ecb1a6ecb0bded9a81ecb2a0eca7b8
UHC 창횁천쨉챨챦횒횤찾횙체C횒횤체C챨챦창횁철째 110000111010001011000011100000011100001110110101110000101011010111000011101100001100001110101111110000111000110111000011100110111100001110100011110000111001001111000011101111000100001111000011100011011100001110011011110000111011110001000011110000111011000011000011101011111100001110100010110000111000000111000011101101101100001010110000 c3a2c381c3b5c2b5c3b0c3afc38dc39bc3a3c393c3bc43c38dc39bc3bc43c3b0c3afc3a2c381c3b6c2b0

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
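
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the function name encode_all and the CHARSETS mapping are hypothetical, the Python codec names (cp932 for SJIS-Win, cp949 for UHC) are assumed equivalents of the charsets listed above, and replacing unencodable characters with "?" (hex 3f) is an assumption chosen because it matches the 3f bytes in the table.

```python
# Map the charset labels used in the table to Python codec names.
# The pairing is an assumption: cp932 stands in for SJIS-Win, cp949 for UHC.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_all(text):
    """Return {charset label: (binary bit string, hex string)} for text."""
    rows = {}
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" (0x3f) for characters the
        # charset cannot represent, instead of raising an exception.
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        rows[label] = (bits, data.hex())
    return rows


# Example: print one row per charset for a single Korean syllable.
for label, (bits, hex_string) in encode_all("창").items():
    print(label, bits, hex_string)
```

A character that a charset cannot represent comes out as 3f in that row, which is why the ISO-8859-1, SJIS-WIN and EUC-JP rows above consist almost entirely of 3f bytes.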