What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 彗??蘂??臟??炯?彗??蘂??臟??炯?B 10011100011000010011111100111111111001010100000100111111001111111110010001100110001111110011111111100000011101100011111110011100011000010011111100111111111001010100000100111111001111111110010001100110001111110011111111100000011101100011111101000010 9c613f3fe5413f3fe4663f3fe0763f9c613f3fe5413f3fe4663f3fe0763f42
EUC-JP 彗??蘂??臟??炯?彗??蘂??臟??炯?B 11010111110000100011111100111111111010011010001000111111001111111110011111000111001111110011111111011111110101110011111111010111110000100011111100111111111010011010001000111111001111111110011111000111001111110011111111011111110101110011111101000010 d7c23f3fe9a23f3fe7c73f3fdfd73fd7c23f3fe9a23f3fe7c73f3fdfd73f42
UTF-8 彗뚲궙蘂⅛쾷臟뚿옶炯쭿彗뚲궙蘂⅛쾷臟뚿옶炯쭿B 11100101101111011001011111101011100110101011001011101010101101101001100111101000100110001000001011100010100001011001101111101100101111101011011111101000100001111001111111101011100110101011111111101100100110001011011011100111100000101010111111101100101011011011111111100101101111011001011111101011100110101011001011101010101101101001100111101000100110001000001011100010100001011001101111101100101111101011011111101000100001111001111111101011100110101011111111101100100110001011011011100111100000101010111111101100101011011011111101000010 e5bd97eb9ab2eab699e89882e2859becbeb7e8879feb9abfec98b6e782afecadbfe5bd97eb9ab2eab699e89882e2859becbeb7e8879feb9abfec98b6e782afecadbf42
UHC 彗뚲궙蘂⅛쾷臟뚿옶炯쭿彗뚲궙蘂⅛쾷臟뚿옶炯쭿B 111110111011001010001100111011101000001010101110111001111101111010101000111110111011001010001101111011011111010010001100111110111001111010101110111110111010011010101000010100011111101110110010100011001110111010000010101011101110011111011110101010001111101110110010100011011110110111110100100011001111101110011110101011101111101110100110101010000101000101000010 fbb28cee82aee7dea8fbb28dedf48cfb9eaefba6a851fbb28cee82aee7dea8fbb28dedf48cfb9eaefba6a85142

SJIS-Win, EUC-JP: Legacy charsets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
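
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. It assumes that the labels used here correspond to Python's standard codec names as listed in the hypothetical CHARSETS mapping (in particular SJIS-WIN as cp932 and UHC as cp949), and that characters a charset cannot represent are replaced with "?" (0x3f), which is consistent with the ISO-8859-1 row above.

```python
# Assumed mapping from this page's charset labels to Python codec names.
# The mapping is an illustration, not something the page itself states.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def bit_strings(text):
    """Return {label: (binary string, hex string)} for each charset."""
    rows = {}
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        raw = text.encode(codec, errors="replace")
        binary = "".join(f"{byte:08b}" for byte in raw)
        rows[label] = (binary, raw.hex())
    return rows


if __name__ == "__main__":
    for label, (binary, hexadecimal) in bit_strings("彗B").items():
        print(label, binary, hexadecimal)
```

Each byte is printed as eight binary digits, so the binary column is always four times as long as the hexadecimal column.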