What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ?????????烏??曜??????③?? 00111111001111110011111100111111001111110011111100111111001111110011111110001001010001110011111100111111100101110110101000111111001111110011111100111111001111110011111110000111010000100011111100111111 3f3f3f3f3f3f3f3f3f89473f3f976a3f3f3f3f3f3f87423f3f
EUC-JP ??????瑗??烏??曜?????獒??? 00111111001111110011111100111111001111110011111110001111110011001100000000111111001111111011000110101000001111110011111111001101110010110011111100111111001111110011111100111111100011111100101110111011001111110011111100111111 3f3f3f3f3f3f8fccc03f3fb1a83f3fcdcb3f3f3f3f3f8fcbbb3f3f3f
UTF-8 琉꿩캒殮귛맽瑗꿰썏烏룸젗曜쒕툍溜잜쓼獒③빓溜 111011111010011110001100111010101011111110101001111011001011101010010010111011111010011010100101111010101011011110011011111010111010011110111101111001111001000110010111111010101011111110110000111011001000110110001111111001111000001110001111111010111010001110111000111011001010000010010111111001101001101110011100111011001001001010010101111011011000100010001101111011111010011110001011111011001001111010011100111011001001001110111100111001111000110110010010111000101001000110100010111010111011100110010011111011111010011110001011 efa78ceabfa9ecba92efa6a5eab79beba7bde79197eabfb0ec8d8fe7838feba3b8eca097e69b9cec9295ed888defa78bec9e9cec93bce78d92e291a2ebb993efa78b
UHC 琉꿩캒殮귛맽瑗꿰썏烏룸젗曜쒕툍溜잜쓼獒③빓溜 1110101110100100101100101110011010101111100110111110011011111001100000101110010110010000101111101110101010111100101100101110011110011011100000101110100010100001101101111110101110100000100100111110100011111000100111001110101110111000100001011110101011111110100111111110110110011101100101111110100010100011101010001110100110010101101101111110101011111110 eba4b2e6af9be6f982e590beeabcb2e79b82e8a1b7eba093e8f89cebb885eafe9fed9d97e8a3a8e995b7eafe
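
The kind of conversion shown in the table can be approximated offline. The following is a minimal sketch, not the code behind this page: the function name `encode_table` is hypothetical, and the mapping of the page's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) is an assumption. Characters that a charset cannot represent are replaced with `?` (byte 0x3f), which is consistent with the runs of `3f` in the rows above.

```python
# Sketch: encode a string in several charsets and show the bytes
# as a binary bit string and as hexadecimal.
# Codec names are Python's; their correspondence to the page's
# labels (SJIS-WIN -> cp932, UHC -> cp949) is an assumption.

CHARSETS = ("iso-8859-1", "cp932", "euc_jp", "utf-8", "cp949")


def encode_table(text, charsets=CHARSETS):
    """Return a list of (charset, bit_string, hex_string) tuples."""
    rows = []
    for name in charsets:
        # errors="replace" substitutes "?" for unencodable characters
        data = text.encode(name, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((name, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for name, bits, hexa in encode_table("あ"):
        print(name, bits, hexa)
```

An ASCII letter such as "A" comes out as the same single byte (hex 41) in all five charsets, while a non-ASCII character generally produces a different byte sequence in each.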

SJIS-Win, EUC-JP: Legacy charsets used mainly for Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
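
The difference between the two Japanese charsets can be checked with a short sketch. It assumes Python's `cp932`, `shift_jis`, and `euc_jp` codecs as stand-ins for the charsets named above, and `try_encode` is a hypothetical helper. It shows that the same character gets different bytes in each charset, and that CP932 includes vendor extension characters (such as the circled digit ③) that plain Shift_JIS does not.

```python
# Sketch: compare how CP932 (SJIS-Win), plain Shift_JIS and EUC-JP
# encode the same character. Codec names are Python's and are assumed
# to correspond to the charsets described in the text.


def try_encode(ch, codec):
    """Return the hex string for ch in codec, or None if unencodable."""
    try:
        return ch.encode(codec).hex()
    except UnicodeEncodeError:
        return None


if __name__ == "__main__":
    for ch in ("烏", "③"):
        for codec in ("cp932", "shift_jis", "euc_jp"):
            print(ch, codec, try_encode(ch, codec))
```

For 烏, the CP932 and EUC-JP results are the byte pairs 8947 and b1a8, the same values that appear in the SJIS-WIN and EUC-JP rows of the table.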