To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????}v?????????}vB 0011111100111111001111110011111100111111001111110011111100111111001111110111110101110110001111110011111100111111001111110011111100111111001111110011111100111111011111010111011001000010 3f3f3f3f3f3f3f3f3f7d763f3f3f3f3f3f3f3f3f7d7642
SJIS-WIN 藥??秧?オ倭??}v藥??秧?オ倭??}vB 11100101010110100011111100111111111000100101111000111111100000110100100110011000011000000011111100111111011111010111011011100101010110100011111100111111111000100101111000111111100000110100100110011000011000000011111100111111011111010111011001000010 e55a3f3fe25e3f834998603f3f7d76e55a3f3fe25e3f834998603f3f7d7642
EUC-JP 藥??秧?オ倭??}v藥??秧?オ倭??}vB 11101001101110110011111100111111111000111011111100111111101001011010101011001111110000010011111100111111011111010111011011101001101110110011111100111111111000111011111100111111101001011010101011001111110000010011111100111111011111010111011001000010 e9bb3f3fe3bf3fa5aacfc13f3f7d76e9bb3f3fe3bf3fa5aacfc13f3f7d7642
UTF-8 藥썰퍟秧녕オ倭졿릫}v藥썰퍟秧녕オ倭졿릫}vB 1110100010010111101001011110110010001101101100001110110110001101100111111110011110100111101001111110101110000101100101011110001110000010101010101110010110000000101011011110110010100001101111111110101110100110101010110111110101110110111010001001011110100101111011001000110110110000111011011000110110011111111001111010011110100111111010111000010110010101111000111000001010101010111001011000000010101101111011001010000110111111111010111010011010101011011111010111011001000010 e897a5ec8db0ed8d9fe7a7a7eb8595e382aae580adeca1bfeba6ab7d76e897a5ec8db0ed8d9fe7a7a7eb8595e382aae580adeca1bfeba6ab7d7642
UHC 藥썰퍟秧녕オ倭졿릫}v藥썰퍟秧녕オ倭졿릫}vB 1110010110110111101111011110010010111011100101101110010011101011101100111110011110101011101010101110100011011110101000001110011010010000100011010111110101110110111001011011011110111101111001001011101110010110111001001110101110110011111001111010101110101010111010001101111010100000111001101001000010001101011111010111011001000010 e5b7bde4bb96e4ebb3e7abaae8dea0e6908d7d76e5b7bde4bb96e4ebb3e7abaae8dea0e6908d7d7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)