To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????h??????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111011010000011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f683f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 彗??蘂??臟??炯?h彗??蘂??臟??炯? 10011100011000010011111100111111111001010100000100111111001111111110010001100110001111110011111111100000011101100011111101101000100111000110000100111111001111111110010101000001001111110011111111100100011001100011111100111111111000000111011000111111 9c613f3fe5413f3fe4663f3fe0763f689c613f3fe5413f3fe4663f3fe0763f
EUC-JP 彗??蘂??臟??炯?h彗??蘂??臟??炯? 11010111110000100011111100111111111010011010001000111111001111111110011111000111001111110011111111011111110101110011111101101000110101111100001000111111001111111110100110100010001111110011111111100111110001110011111100111111110111111101011100111111 d7c23f3fe9a23f3fe7c73f3fdfd73f68d7c23f3fe9a23f3fe7c73f3fdfd73f
UTF-8 彗뚲궙蘂⅛쾷臟뚿옶炯쭼h彗뚲궙蘂⅛쾷臟뚿옶炯쭼 11100101101111011001011111101011100110101011001011101010101101101001100111101000100110001000001011100010100001011001101111101100101111101011011111101000100001111001111111101011100110101011111111101100100110001011011011100111100000101010111111101100101011011011110001101000111001011011110110010111111010111001101010110010111010101011011010011001111010001001100010000010111000101000010110011011111011001011111010110111111010001000011110011111111010111001101010111111111011001001100010110110111001111000001010101111111011001010110110111100 e5bd97eb9ab2eab699e89882e2859becbeb7e8879feb9abfec98b6e782afecadbc68e5bd97eb9ab2eab699e89882e2859becbeb7e8879feb9abfec98b6e782afecadbc
UHC 彗뚲궙蘂⅛쾷臟뚿옶炯쭼h彗뚲궙蘂⅛쾷臟뚿옶炯쭼 111110111011001010001100111011101000001010101110111001111101111010101000111110111011001010001101111011011111010010001100111110111001111010101110111110111010011010101000010011100110100011111011101100101000110011101110100000101010111011100111110111101010100011111011101100101000110111101101111101001000110011111011100111101010111011111011101001101010100001001110 fbb28cee82aee7dea8fbb28dedf48cfb9eaefba6a84e68fbb28cee82aee7dea8fbb28dedf48cfb9eaefba6a84e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)