To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 疾鹿シチ疾鹿シ」疾汐シヲ疾鹿シウシ」シ 100011101011111010001110101011011011110011000001100011101011111010001110101011011011110010100011100011101011111010001110101011001011110010100110100011101011111010001110101011011011110010110011101111001010001110111100 8ebe8eadbcc18ebe8eadbca38ebe8eacbca68ebe8eadbcb3bca3bc
EUC-JP 疾鹿シチ疾鹿シ」疾汐シヲ疾鹿シウシ」シ 1011110011000000101111001010111110001110101111001000111011000001101111001100000010111100101011111000111010111100100011101010001110111100110000001011110010101110100011101011110010001110101001101011110011000000101111001010111110001110101111001000111010110011100011101011110010001110101000111000111010111100 bcc0bcaf8ebc8ec1bcc0bcaf8ebc8ea3bcc0bcae8ebc8ea6bcc0bcaf8ebc8eb38ebc8ea38ebc
UTF-8 疾鹿シチ疾鹿シ」疾汐シヲ疾鹿シウシ」シ 111001111001011010111110111010011011100110111111111011111011110110111100111011111011111010000001111001111001011010111110111010011011100110111111111011111011110110111100111011111011110110100011111001111001011010111110111001101011000110010000111011111011110110111100111011111011110110100110111001111001011010111110111010011011100110111111111011111011110110111100111011111011110110110011111011111011110110111100111011111011110110100011111011111011110110111100 e796bee9b9bfefbdbcefbe81e796bee9b9bfefbdbcefbda3e796bee6b190efbdbcefbda6e796bee9b9bfefbdbcefbdb3efbdbcefbda3efbdbc
UHC 疾鹿??疾鹿??疾汐??疾鹿????? 111100101111000011010110111000110011111100111111111100101111000011010110111000110011111100111111111100101111000011100000101100010011111100111111111100101111000011010110111000110011111100111111001111110011111100111111 f2f0d6e33f3ff2f0d6e33f3ff2f0e0b13f3ff2f0d6e33f3f3f3f3f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)