To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??肢?淀斑??劑???漬蛟??淀斑??劑 001111110011111110001110100010000011111110010111100001001001010011000001001111110011111110011001100111010011111100111111001111111001001011010000111001011000000000111111001111111001011110000100100101001100000100111111001111111001100110011101 3f3f8e883f978494c13f3f999d3f3f3f92d0e5803f3f978494c13f3f999d
EUC-JP ??肢?淀斑??劑泮??漬蛟??淀斑??劑 0011111100111111101110111110100000111111110011011110010011001000110000110011111100111111110100011111110110001111110001111010100000111111001111111100010011010010111010011110000000111111001111111100110111100100110010001100001100111111001111111101000111111101 3f3fbbe83fcde4c8c33f3fd1fd8fc7a83f3fc4d2e9e03f3fcde4c8c33f3fd1fd
UTF-8 裏렦肢렖淀斑렡렮劑泮렠렋漬蛟렰렡淀斑렡렮劑 111011111010011110100111111010111010000010100110111010001000001010100010111010111010000010010110111001101011011110000000111001101001011010010001111010111010000010100001111010111010000010101110111001011000101010010001111001101011001110101110111010111010000010100000111010111010000010001011111001101011110010101100111010001001101110011111111010111010000010110000111010111010000010100001111001101011011110000000111001101001011010010001111010111010000010100001111010111010000010101110111001011000101010010001 efa7a7eba0a6e882a2eba096e6b780e69691eba0a1eba0aee58a91e6b3aeeba0a0eba08be6bcace89b9feba0b0eba0a1e6b780e69691eba0a1eba0aee58a91
UHC 裏렦肢렖淀斑렡렮劑泮렠렋漬蛟렰렡淀斑렡렮劑 111011001100000010001110101101011111001010110110100011101010101111101111111000111101101011101000100011101011001010001110101110111111000010100101110110101110101010001110101100011000111010100010111100101011000011001110111100011000111010111101100011101011001011101111111000111101101011101000100011101011001010001110101110111111000010100101 ecc08eb5f2b68eabefe3dae88eb28ebbf0a5daea8eb18ea2f2b0cef18ebd8eb2efe3dae88eb28ebbf0a5

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)