To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????C???????????C??^ 00111111001111110011111100111111001111110011111100111111001111110011111101000011001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000011001111110011111101011110 3f3f3f3f3f3f3f3f3f433f3f3f3f3f3f3f3f3f3f3f433f3f5e
SJIS-WIN ?????????C???????????C??^ 00111111001111110011111100111111001111110011111100111111001111110011111101000011001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000011001111110011111101011110 3f3f3f3f3f3f3f3f3f433f3f3f3f3f3f3f3f3f3f3f433f3f5e
EUC-JP ?????????C???????????C??^ 00111111001111110011111100111111001111110011111100111111001111110011111101000011001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000011001111110011111101011110 3f3f3f3f3f3f3f3f3f433f3f3f3f3f3f3f3f3f3f3f433f3f5e
UTF-8 횦횥챰체챨챠창쨉체C찼횦횦횥챰체챨챠창쨉체C찼횦^ 111011011001101010100110111011011001101010100101111011001011000110110000111011001011001010110100111011001011000110101000111011001011000110100000111011001011000010111101111011001010100010001001111011001011001010110100010000111110110010110000101111001110110110011010101001101110110110011010101001101110110110011010101001011110110010110001101100001110110010110010101101001110110010110001101010001110110010110001101000001110110010110000101111011110110010101000100010011110110010110010101101000100001111101100101100001011110011101101100110101010011001011110 ed9aa6ed9aa5ecb1b0ecb2b4ecb1a8ecb1a0ecb0bdeca889ecb2b443ecb0bced9aa6ed9aa6ed9aa5ecb1b0ecb2b4ecb1a8ecb1a0ecb0bdeca889ecb2b443ecb0bced9aa65e
UHC 횦횥챰체챨챠창쨉체C찼횦횦횥챰체챨챠창쨉체C찼횦^ 1100001110011101110000111001110011000011101100011100001110111100110000111011000011000011101011011100001110100010110000101011010111000011101111000100001111000011101000011100001110011101110000111001110111000011100111001100001110110001110000111011110011000011101100001100001110101101110000111010001011000010101101011100001110111100010000111100001110100001110000111001110101011110 c39dc39cc3b1c3bcc3b0c3adc3a2c2b5c3bc43c3a1c39dc39dc39cc3b1c3bcc3b0c3adc3a2c2b5c3bc43c3a1c39d5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)