To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 藥〓?竊??節??藥〓?藥〓?竊??節??藥〓?B 11100101010110101000000110101100001111111110001010000110001111110011111110010000110111110011111100111111111001010101101010000001101011000011111111100101010110101000000110101100001111111110001010000110001111110011111110010000110111110011111100111111111001010101101010000001101011000011111101000010 e55a81ac3fe2863f3f90df3f3fe55a81ac3fe55a81ac3fe2863f3f90df3f3fe55a81ac3f42
EUC-JP 藥〓?竊??節??藥〓?藥〓?竊??節??藥〓?B 11101001101110111010001010101110001111111110001111100110001111110011111111000000111000010011111100111111111010011011101110100010101011100011111111101001101110111010001010101110001111111110001111100110001111110011111111000000111000010011111100111111111010011011101110100010101011100011111101000010 e9bba2ae3fe3e63f3fc0e13f3fe9bba2ae3fe9bba2ae3fe3e63f3fc0e13f3fe9bba2ae3f42
UTF-8 藥〓낄竊뺡뒽節껉튅藥〓콝藥〓낄竊뺡뒽節껉튅藥〓콝B 11101000100101111010010111100011100000001001001111101011100000101000010011100111101010111000101011101011101110101010000111101011100100101011110111100111101011111000000011101010101110111000100111101101100010101000010111101000100101111010010111100011100000001001001111101100101111011001110111101000100101111010010111100011100000001001001111101011100000101000010011100111101010111000101011101011101110101010000111101011100100101011110111100111101011111000000011101010101110111000100111101101100010101000010111101000100101111010010111100011100000001001001111101100101111011001110101000010 e897a5e38093eb8284e7ab8aebbaa1eb92bde7af80eabb89ed8a85e897a5e38093ecbd9de897a5e38093eb8284e7ab8aebbaa1eb92bde7af80eabb89ed8a85e897a5e38093ecbd9d42
UHC 藥〓낄竊뺡뒽節껉튅藥〓콝藥〓낄竊뺡뒽節껉튅藥〓콝B 11100101101101111010000111101011101100111010010111101111101111001001010111101001100010101011001111101111101111011000001111101010101110011001101011100101101101111010000111101011101100011001010111100101101101111010000111101011101100111010010111101111101111001001010111101001100010101011001111101111101111011000001111101010101110011001101011100101101101111010000111101011101100011001010101000010 e5b7a1ebb3a5efbc95e98ab3efbd83eab99ae5b7a1ebb195e5b7a1ebb3a5efbc95e98ab3efbd83eab99ae5b7a1ebb19542

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)