To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 藥≪?訝??淫??[藥≪?訝??淫??[^ 1110010101011010100000011110000100111111111001100110001000111111001111111000100011111010001111110011111101011011111001010101101010000001111000010011111111100110011000100011111100111111100010001111101000111111001111110101101101011110 e55a81e13fe6623f3f88fa3f3f5be55a81e13fe6623f3f88fa3f3f5b5e
EUC-JP 藥≪?訝??淫??[藥≪?訝??淫??[^ 1110100110111011101000101110001100111111111010111100001100111111001111111011000011111100001111110011111101011011111010011011101110100010111000110011111111101011110000110011111100111111101100001111110000111111001111110101101101011110 e9bba2e33febc33f3fb0fc3f3f5be9bba2e33febc33f3fb0fc3f3f5b5e
UTF-8 藥≪꼹訝뽩래淫묇툟[藥≪꼹訝뽩래淫묇툟[^ 111010001001011110100101111000101000100110101010111010101011110010111001111010001010100010011101111010111011110110101001111010111001111010011000111001101011011110101011111010111010110010000111111011011000100010011111010110111110100010010111101001011110001010001001101010101110101010111100101110011110100010101000100111011110101110111101101010011110101110011110100110001110011010110111101010111110101110101100100001111110110110001000100111110101101101011110 e897a5e289aaeabcb9e8a89debbda9eb9e98e6b7abebac87ed889f5be897a5e289aaeabcb9e8a89debbda9eb9e98e6b7abebac87ed889f5b5e
UHC 藥≪꼹訝뽩래淫묇툟[藥≪꼹訝뽩래淫묇툟[^ 111001011011011110100001111011001000010010010001111001001011100010010110111001011011011110100001111010111110001010010001111001001011100010010110010110111110010110110111101000011110110010000100100100011110010010111000100101101110010110110111101000011110101111100010100100011110010010111000100101100101101101011110 e5b7a1ec8491e4b896e5b7a1ebe291e4b8965be5b7a1ec8491e4b896e5b7a1ebe291e4b8965b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)