To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????h?????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110110100000111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f683f3f3f3f3f3f3f3f3f3f
SJIS-WIN 逖滔ク晧価逖滔ク晧価h逖滔ク晧価逖滔ク晧価 11100111100110001001111111101011101110001001110111101100100010011011111111100111100110001001111111101011101110001001110111101100100010011011111101101000111001111001100010011111111010111011100010011101111011001000100110111111111001111001100010011111111010111011100010011101111011001000100110111111 e7989febb89dec89bfe7989febb89dec89bf68e7989febb89dec89bfe7989febb89dec89bf
EUC-JP 逖滔ク晧価逖滔ク晧価h逖滔ク晧価逖滔ク晧価 1110110111111000110111101110110110001110101110001101101011101110101100101100000111101101111110001101111011101101100011101011100011011010111011101011001011000001011010001110110111111000110111101110110110001110101110001101101011101110101100101100000111101101111110001101111011101101100011101011100011011010111011101011001011000001 edf8deed8eb8daeeb2c1edf8deed8eb8daeeb2c168edf8deed8eb8daeeb2c1edf8deed8eb8daeeb2c1
UTF-8 逖滔ク晧価逖滔ク晧価h逖滔ク晧価逖滔ク晧価 11101001100000001001011011100110101110111001010011101111101111011011100011100110100110011010011111100100101111101010000111101001100000001001011011100110101110111001010011101111101111011011100011100110100110011010011111100100101111101010000101101000111010011000000010010110111001101011101110010100111011111011110110111000111001101001100110100111111001001011111010100001111010011000000010010110111001101011101110010100111011111011110110111000111001101001100110100111111001001011111010100001 e98096e6bb94efbdb8e699a7e4bea1e98096e6bb94efbdb8e699a7e4bea168e98096e6bb94efbdb8e699a7e4bea1e98096e6bb94efbdb8e699a7e4bea1
UHC ?滔?晧??滔?晧?h?滔?晧??滔?晧? 0011111111010100101001010011111111111011110001010011111100111111110101001010010100111111111110111100010100111111011010000011111111010100101001010011111111111011110001010011111100111111110101001010010100111111111110111100010100111111 3fd4a53ffbc53f3fd4a53ffbc53f683fd4a53ffbc53f3fd4a53ffbc53f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)