To what bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????R?????^[?????R?????^[^ 001111110011111100111111001111110011111101010010001111110011111100111111001111110011111101011110010110110011111100111111001111110011111100111111010100100011111100111111001111110011111100111111010111100101101101011110 3f3f3f3f3f523f3f3f3f3f5e5b3f3f3f3f3f523f3f3f3f3f5e5b5e
SJIS-WIN 藥??壹?R藥??壹?^[藥??壹?R藥??壹?^[^ 1110010101011010001111110011111110011010111000110011111101010010111001010101101000111111001111111001101011100011001111110101111001011011111001010101101000111111001111111001101011100011001111110101001011100101010110100011111100111111100110101110001100111111010111100101101101011110 e55a3f3f9ae33f52e55a3f3f9ae33f5e5be55a3f3f9ae33f52e55a3f3f9ae33f5e5b5e
EUC-JP 藥??壹?R藥??壹?^[藥??壹?R藥??壹?^[^ 1110100110111011001111110011111111010100111001010011111101010010111010011011101100111111001111111101010011100101001111110101111001011011111010011011101100111111001111111101010011100101001111110101001011101001101110110011111100111111110101001110010100111111010111100101101101011110 e9bb3f3fd4e53f52e9bb3f3fd4e53f5e5be9bb3f3fd4e53f52e9bb3f3fd4e53f5e5b5e
UTF-8 藥쎈㉡壹퐊R藥쎈㉡壹퐊^[藥쎈㉡壹퐊R藥쎈㉡壹퐊^[^ 11101000100101111010010111101100100011101000100011100011100010011010000111100101101000111011100111101101100100001000101001010010111010001001011110100101111011001000111010001000111000111000100110100001111001011010001110111001111011011001000010001010010111100101101111101000100101111010010111101100100011101000100011100011100010011010000111100101101000111011100111101101100100001000101001010010111010001001011110100101111011001000111010001000111000111000100110100001111001011010001110111001111011011001000010001010010111100101101101011110 e897a5ec8e88e389a1e5a3b9ed908a52e897a5ec8e88e389a1e5a3b9ed908a5e5be897a5ec8e88e389a1e5a3b9ed908a52e897a5ec8e88e389a1e5a3b9ed908a5e5b5e
UHC 藥쎈㉡壹퐊R藥쎈㉡壹퐊^[藥쎈㉡壹퐊R藥쎈㉡壹퐊^[^ 1110010110110111101111011110101110101000101100101110110011101100101111010110111001010010111001011011011110111101111010111010100010110010111011001110110010111101011011100101111001011011111001011011011110111101111010111010100010110010111011001110110010111101011011100101001011100101101101111011110111101011101010001011001011101100111011001011110101101110010111100101101101011110 e5b7bdeba8b2ececbd6e52e5b7bdeba8b2ececbd6e5e5be5b7bdeba8b2ececbd6e52e5b7bdeba8b2ececbd6e5e5b5e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
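
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The names CHARSETS and encode_bits are hypothetical, and the mapping of the table's labels to Python codec names (SJIS-WIN to cp932, UHC to cp949) is an assumption. It also assumes that characters a charset cannot represent are replaced with "?" (0x3f), as the 3f bytes in the table suggest. Python's codecs may differ from this page's converter in edge cases.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_bits(text, codec):
    """Encode text with the given codec and return (binary, hex) strings.

    Characters the codec cannot represent are replaced with "?" (0x3f).
    """
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


if __name__ == "__main__":
    sample = "R^["  # ASCII characters taken from the table above
    for label, codec in CHARSETS.items():
        binary, hexadecimal = encode_bits(sample, codec)
        print(label, sample, binary, hexadecimal)
```

ASCII characters such as "R" come out as the same single byte (52) in every charset listed, which is why the 52, 5e and 5b bytes recur in all rows of the table. Only the non-ASCII characters differ between rows.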