To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???鍮?ぜ邑??閻??議?┃怨??沃 001111110011111100111111111010000100101000111111100000101011101010010111010101110011111100111111111010001000010100111111001111111000101101100011001111111000010010101011100010011000010100111111001111111001011110000000 3f3f3fe84a3f82ba97573f3fe8853f3f8b633f84ab89853f3f9780
EUC-JP ???鍮?ぜ邑??閻??議?┃怨??沃 001111110011111100111111111011111010101100111111101001001011110011001101101110000011111100111111111011111110010100111111001111111011010111000100001111111010100010101101101100011110010100111111001111111100110111100000 3f3f3fefab3fa4bccdb83f3fefe53f3fb5c43fa8adb1e53f3fcde0
UTF-8 捻뀀쉼鍮뽬ぜ邑룐뵺閻롢끉議뗰┃怨대궙沃 111011111010011010100100111010111000000010000000111011001000100110111100111010011000110110101110111010111011110110101100111000111000000110011100111010011000001010010001111010111010001110010000111010111011010110111010111010011001011010111011111010111010000110100010111010111000000110001001111010001010110110110000111010111001011110110000111000101001010010000011111001101000000010101000111010111000110010000000111010101011011010011001111001101011001010000011 efa6a4eb8080ec89bce98daeebbdace3819ce98291eba390ebb5bae996bbeba1a2eb8189e8adb0eb97b0e29483e680a8eb8c80eab699e6b283
UHC 捻뀀쉼鍮뽬ぜ邑룐뵺閻롢끉議뗰┃怨대궙沃 1110011011110111101100101110101110111101101100001110101110111001100101101110100010101010101111001110101111101001101101111110001010010100101110001110011110100010100011101110001110000101101111001110110010100001100010111110111110100110101011011110101010110011101101001110101110000010101011101110100010101010 e6f7b2ebbdb0ebb996e8aabcebe9b7e294b8e7a28ee385bceca18befa6adeab3b4eb82aee8aa

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)