To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????v?????????vB 001111110011111100111111001111110011111100111111001111110011111100111111011101100011111100111111001111110011111100111111001111110011111100111111001111110111011001000010 3f3f3f3f3f3f3f3f3f763f3f3f3f3f3f3f3f3f7642
SJIS-WIN ??????茹??v??????茹??vB 0011111100111111001111110011111100111111001111111110010010100101001111110011111101110110001111110011111100111111001111110011111100111111111001001010010100111111001111110111011001000010 3f3f3f3f3f3fe4a53f3f763f3f3f3f3f3fe4a53f3f7642
EUC-JP ???旿??茹??v???旿??茹??vB 001111110011111100111111100011111100000111110100001111110011111111101000101001110011111100111111011101100011111100111111001111111000111111000001111101000011111100111111111010001010011100111111001111110111011001000010 3f3f3f8fc1f43f3fe8a73f3f763f3f3f8fc1f43f3fe8a73f3f7642
UTF-8 若노젧旿껊젇茹띾뮮v若노젧旿껊젇茹띾뮮vB 111011111010010110110100111010111000010110111000111011001010000010100111111001101001011110111111111010101011101110001010111011001010000010000111111010001000110010111001111010111001110110111110111010111010111010101110011101101110111110100101101101001110101110000101101110001110110010100000101001111110011010010111101111111110101010111011100010101110110010100000100001111110100010001100101110011110101110011101101111101110101110101110101011100111011001000010 efa5b4eb85b8eca0a7e697bfeabb8aeca087e88cb9eb9dbeebaeae76efa5b4eb85b8eca0a7e697bfeabb8aeca087e88cb9eb9dbeebaeae7642
UHC 若노젧旿껊젇茹띾뮮v若노젧旿껊젇茹띾뮮vB 111001011010111010110011111010111010000010011111111001111111101010000011111010111010000010001010111001101010101010001101111010111001001010110111011101101110010110101110101100111110101110100000100111111110011111111010100000111110101110100000100010101110011010101010100011011110101110010010101101110111011001000010 e5aeb3eba09fe7fa83eba08ae6aa8deb92b776e5aeb3eba09fe7fa83eba08ae6aa8deb92b77642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)