What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 藥??鎰??酉??鴦??鎰??酉??罌??違 111001010101101000111111001111111110100001001100001111110011111110010011110100010011111100111111111010011111000100111111001111111110100001001100001111110011111110010011110100010011111100111111111000111010000000111111001111111000100011100001 e55a3f3fe84c3f3f93d13f3fe9f13f3fe84c3f3f93d13f3fe3a03f3f88e1
EUC-JP 藥??鎰??酉??鴦??鎰??酉??罌??違 111010011011101100111111001111111110111110101101001111110011111111000110110100110011111100111111111100101111001100111111001111111110111110101101001111110011111111000110110100110011111100111111111001101010001000111111001111111011000011100011 e9bb3f3fefad3f3fc6d33f3ff2f33f3fefad3f3fc6d33f3fe6a23f3fb0e3
UTF-8 藥띲끏鎰먪독酉곴굻鴦볦빢鎰싩독酉귦뮁罌삘꽓違 111010001001011110100101111010111001110110110010111010111000000110001111111010011000111010110000111010111010100010101010111010111000111110000101111010011000010110001001111010101011001110110100111010101011010110111011111010011011010010100110111010111011001110100110111010111011100110100010111010011000111010110000111011001000101110101001111010111000111110000101111010011000010110001001111010101011011110100110111010111010111010000001111001111011110110001100111011001000001010011000111010101011110110010011111010011000000110010101 e897a5eb9db2eb818fe98eb0eba8aaeb8f85e98589eab3b4eab5bbe9b4a6ebb3a6ebb9a2e98eb0ec8ba9eb8f85e98589eab7a6ebae81e7bd8cec8298eabd93e98195
UHC 藥띲끏鎰먪독酉곴굻鴦볦빢鎰싩독酉귦뮁罌삘꽓違 1110010110110111100011011110001110000101101111111110110011110000100100001110011110110101101101101110101110110111100000011110101010110001101111111110010011101100100100111110110010010101101111101110110011110000100110101110011110110101101101101110101110110111100000101110110110010010100100001110010110100010101110111110001010000100101000101110101011011110 e5b78de385bfecf090e7b5b6ebb781eab1bfe4ec93ec95beecf09ae7b5b6ebb782ed9290e5a2bbe284a2eade

SJIS-win, EUC-JP: Legacy Japanese character encodings, mainly used on Windows (SJIS-win = CP932) and UNIX (EUC-JP).
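
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the function name `bit_strings` and the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-win, `cp949` for UHC) are assumptions, and Python's codecs may differ from this page's converter for a few vendor-specific characters. Characters that a charset cannot represent are replaced with "?" (hex 3f), which matches the runs of 3f in the table above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: CP932 stands in for SJIS-win
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: CP949 stands in for UHC
}


def bit_strings(text):
    """Return {charset label: (binary string, hex string)} for text."""
    rows = {}
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" (0x3f) for unencodable characters.
        raw = text.encode(codec, errors="replace")
        binary = "".join(f"{byte:08b}" for byte in raw)
        rows[label] = (binary, raw.hex())
    return rows


if __name__ == "__main__":
    for label, (binary, hexa) in bit_strings("あ").items():
        print(label, binary, hexa)
```

For example, `bit_strings("A")["UTF-8"]` gives `("01000001", "41")`, because "A" is the single byte 0x41 in UTF-8.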