To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????}B 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110111110101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f7d42
SJIS-WIN 疾鹿シ、疾鹿シ」疾鹿シ、シ」シ瘤セ鹿シエ疾汐シユ}B 100011101011111010001110101011011011110010100100100011101011111010001110101011011011110010100011100011101011111010001110101011011011110010100100101111001010001110111100111000011000111010111110100011101010110110111100101101001000111010111110100011101010110010111100110101010111110101000010 8ebe8eadbca48ebe8eadbca38ebe8eadbca4bca3bce18ebe8eadbcb48ebe8eacbcd57d42
EUC-JP 疾鹿シ、疾鹿シ」疾鹿シ、シ」シ瘤セ鹿シエ疾汐シユ}B 1011110011000000101111001010111110001110101111001000111010100100101111001100000010111100101011111000111010111100100011101010001110111100110000001011110010101111100011101011110010001110101001001000111010111100100011101010001110001110101111001110000111101110100011101011111010111100101011111000111010111100100011101011010010111100110000001011110010101110100011101011110010001110110101010111110101000010 bcc0bcaf8ebc8ea4bcc0bcaf8ebc8ea3bcc0bcaf8ebc8ea48ebc8ea38ebce1ee8ebebcaf8ebc8eb4bcc0bcae8ebc8ed57d42
UTF-8 疾鹿シ、疾鹿シ」疾鹿シ、シ」シ瘤セ鹿シエ疾汐シユ}B 1110011110010110101111101110100110111001101111111110111110111101101111001110111110111101101001001110011110010110101111101110100110111001101111111110111110111101101111001110111110111101101000111110011110010110101111101110100110111001101111111110111110111101101111001110111110111101101001001110111110111101101111001110111110111101101000111110111110111101101111001110011110011000101001001110111110111101101111101110100110111001101111111110111110111101101111001110111110111101101101001110011110010110101111101110011010110001100100001110111110111101101111001110111110111110100101010111110101000010 e796bee9b9bfefbdbcefbda4e796bee9b9bfefbdbcefbda3e796bee9b9bfefbdbcefbda4efbdbcefbda3efbdbce798a4efbdbee9b9bfefbdbcefbdb4e796bee6b190efbdbcefbe957d42
UHC 疾鹿??疾鹿??疾鹿?????瘤?鹿??疾汐??}B 111100101111000011010110111000110011111100111111111100101111000011010110111000110011111100111111111100101111000011010110111000110011111100111111001111110011111100111111110101111011101100111111110101101110001100111111001111111111001011110000111000001011000100111111001111110111110101000010 f2f0d6e33f3ff2f0d6e33f3ff2f0d6e33f3f3f3f3fd7bb3fd6e33f3ff2f0e0b13f3f7d42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)