To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 藥??秧??弱??藥?????B 1110010101011010001111110011111111100010010111100011111100111111100011101110001100111111001111111110010101011010001111110011111100111111001111110011111101000010 e55a3f3fe25e3f3f8ee33f3fe55a3f3f3f3f3f42
EUC-JP 藥??秧??弱??藥??孼??B 11101001101110110011111100111111111000111011111100111111001111111011110011100101001111110011111111101001101110110011111100111111100011111011101011000011001111110011111101000010 e9bb3f3fe3bf3f3fbce53f3fe9bb3f3f8fbac33f3f42
UTF-8 藥썹떥秧녘쒼弱듾퓘藥썲룴孼껆떻B 11101000100101111010010111101100100011011011100111101011100101101010010111100111101001111010011111101011100001011001100011101100100100101011110011100101101111001011000111101011100100111011111011101101100100111001100011101000100101111010010111101100100011011011001011101011101000111011010011100101101011011011110011101010101110111000011011101011100101101011101101000010 e897a5ec8db9eb96a5e7a7a7eb8598ec92bce5bcb1eb93beed9398e897a5ec8db2eba3b4e5adbceabb86eb96bb42
UHC 藥썹떥秧녘쒼弱듾퓘藥썲룴孼껆떻B 11100101101101111011110111100111100010111011100011100100111010111011001111101000101111101011000011100101101100001000101011100100101111111000001111100101101101111011110111100101100011111010100111100101111011011000001111100111101101101011101101000010 e5b7bde78bb8e4ebb3e8beb0e5b08ae4bf83e5b7bde58fa9e5ed83e7b6bb42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)