To what bit string is a character (or string of characters) encoded in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 藥≪????淫??[藥≪????淫??[^ 111001010101101010000001111000010011111100111111001111110011111110001000111110100011111100111111010110111110010101011010100000011110000100111111001111110011111100111111100010001111101000111111001111110101101101011110 e55a81e13f3f3f3f88fa3f3f5be55a81e13f3f3f3f88fa3f3f5b5e
EUC-JP 藥≪????淫??[藥≪????淫??[^ 111010011011101110100010111000110011111100111111001111110011111110110000111111000011111100111111010110111110100110111011101000101110001100111111001111110011111100111111101100001111110000111111001111110101101101011110 e9bba2e33f3f3f3fb0fc3f3f5be9bba2e33f3f3f3fb0fc3f3f5b5e
UTF-8 藥≪꼹殮쎾래淫묊썪[藥≪꼹殮쎾래淫묊썪[^ 111010001001011110100101111000101000100110101010111010101011110010111001111011111010011010100101111011001000111010111110111010111001111010011000111001101011011110101011111010111010110010001010111011001000110110101010010110111110100010010111101001011110001010001001101010101110101010111100101110011110111110100110101001011110110010001110101111101110101110011110100110001110011010110111101010111110101110101100100010101110110010001101101010100101101101011110 e897a5e289aaeabcb9efa6a5ec8ebeeb9e98e6b7abebac8aec8daa5be897a5e289aaeabcb9efa6a5ec8ebeeb9e98e6b7abebac8aec8daa5b5e
UHC 藥≪꼹殮쎾래淫묊썪[藥≪꼹殮쎾래淫묊썪[^ 111001011011011110100001111011001000010010010001111001101111100110011011111001011011011110100001111010111110001010010001111001111001101110011011010110111110010110110111101000011110110010000100100100011110011011111001100110111110010110110111101000011110101111100010100100011110011110011011100110110101101101011110 e5b7a1ec8491e6f99be5b7a1ebe291e79b9b5be5b7a1ec8491e6f99be5b7a1ebe291e79b9b5b5e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
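
The conversion shown in the table can be approximated offline with a short Python sketch. It is not the code behind this page. The codec names are assumptions about which Python codecs correspond to the table's labels (cp932 for SJIS-WIN, cp949 for UHC), and the helper name to_bit_strings is made up for illustration. Characters that a charset cannot represent are replaced with "?" (0x3f), which is consistent with the runs of 3f in the rows above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Return (binary, hexadecimal) strings for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    raw = text.encode(codec, errors="replace")
    # Each byte becomes exactly eight binary digits, zero-padded on the left.
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


if __name__ == "__main__":
    # Print one row per charset for a sample input.
    for label, codec in CODECS.items():
        binary, hexadecimal = to_bit_strings("[^", codec)
        print(label, binary, hexadecimal)
```

For the ASCII input "[^", every charset listed yields the same bytes, 5b5e, which matches the last four hexadecimal digits of each row in the table.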