To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????C?????????]K 001111110011111100111111001111110011111100111111001111110011111100111111010000110011111100111111001111110011111100111111001111110011111100111111001111110101110101001011 3f3f3f3f3f3f3f3f3f433f3f3f3f3f3f3f3f3f5d4b
SJIS-WIN ???仰??燁??C???仰??燁??]K 00111111001111110011111110001011110000100011111100111111111110110101100100111111001111110100001100111111001111110011111110001011110000100011111100111111111110110101100100111111001111110101110101001011 3f3f3f8bc23f3ffb593f3f433f3f3f8bc23f3ffb593f3f5d4b
EUC-JP ???仰??燁??C???仰??燁??]K 001111110011111100111111101101101100010000111111001111111000111111001010101100110011111100111111010000110011111100111111001111111011011011000100001111110011111110001111110010101011001100111111001111110101110101001011 3f3f3fb6c43f3f8fcab33f3f433f3f3fb6c43f3f8fcab33f3f5d4b
UTF-8 琉꿩뫖仰띠빴燁섎젗C琉꿩뫖仰띠빴燁섎젗]K 111011111010011110001100111010101011111110101001111010111010101110010110111001001011101110110000111010111001110110100000111010111011100110110100111001111000011110000001111011001000010010001110111011001010000010010111010000111110111110100111100011001110101010111111101010011110101110101011100101101110010010111011101100001110101110011101101000001110101110111001101101001110011110000111100000011110110010000100100011101110110010100000100101110101110101001011 efa78ceabfa9ebab96e4bbb0eb9da0ebb9b4e78781ec848eeca09743efa78ceabfa9ebab96e4bbb0eb9da0ebb9b4e78781ec848eeca0975d4b
UHC 琉꿩뫖仰띠빴燁섎젗C琉꿩뫖仰띠빴燁섎젗]K 111010111010010010110010111001101001000110111000111001001110011010110110111011001011101110100110111001111010011110011000111010111010000010010011010000111110101110100100101100101110011010010001101110001110010011100110101101101110110010111011101001101110011110100111100110001110101110100000100100110101110101001011 eba4b2e691b8e4e6b6ecbba6e7a798eba09343eba4b2e691b8e4e6b6ecbba6e7a798eba0935d4b

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)