What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????????\ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011100 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5c
SJIS-WIN 疾鹿シツ疾痔シウ疾痔シウ疾鹿シウシ」シ瘤セ鹿シ・\ 1000111010111110100011101010110110111100110000101000111010111110100011101010010010111100101100111000111010111110100011101010010010111100101100111000111010111110100011101010110110111100101100111011110010100011101111001110000110001110101111101000111010101101101111001010010101011100 8ebe8eadbcc28ebe8ea4bcb38ebe8ea4bcb38ebe8eadbcb3bca3bce18ebe8eadbca55c
EUC-JP 疾鹿シツ疾痔シウ疾痔シウ疾鹿シウシ」シ瘤セ鹿シ・\ 10111100110000001011110010101111100011101011110010001110110000101011110011000000101111001010011010001110101111001000111010110011101111001100000010111100101001101000111010111100100011101011001110111100110000001011110010101111100011101011110010001110101100111000111010111100100011101010001110001110101111001110000111101110100011101011111010111100101011111000111010111100100011101010010101011100 bcc0bcaf8ebc8ec2bcc0bca68ebc8eb3bcc0bca68ebc8eb3bcc0bcaf8ebc8eb38ebc8ea38ebce1ee8ebebcaf8ebc8ea55c
UTF-8 疾鹿シツ疾痔シウ疾痔シウ疾鹿シウシ」シ瘤セ鹿シ・\ 11100111100101101011111011101001101110011011111111101111101111011011110011101111101111101000001011100111100101101011111011100111100101111001010011101111101111011011110011101111101111011011001111100111100101101011111011100111100101111001010011101111101111011011110011101111101111011011001111100111100101101011111011101001101110011011111111101111101111011011110011101111101111011011001111101111101111011011110011101111101111011010001111101111101111011011110011100111100110001010010011101111101111011011111011101001101110011011111111101111101111011011110011101111101111011010010101011100 e796bee9b9bfefbdbcefbe82e796bee79794efbdbcefbdb3e796bee79794efbdbcefbdb3e796bee9b9bfefbdbcefbdb3efbdbcefbda3efbdbce798a4efbdbee9b9bfefbdbcefbda55c
UHC 疾鹿??疾痔??疾痔??疾鹿?????瘤?鹿??\ 1111001011110000110101101110001100111111001111111111001011110000111101101100000000111111001111111111001011110000111101101100000000111111001111111111001011110000110101101110001100111111001111110011111100111111001111111101011110111011001111111101011011100011001111110011111101011100 f2f0d6e33f3ff2f0f6c03f3ff2f0f6c03f3ff2f0d6e33f3f3f3f3fd7bb3fd6e33f3f5c

SJIS-Win, EUC-JP: Legacy character sets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
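
The table above can be approximated offline with a short script. This is only a sketch, not the code behind this page: the function name `encode_table` is hypothetical, and the mapping from the charset labels to Python codec names (in particular SJIS-WIN to `cp932` and UHC to `cp949`) is an assumption. Python's codecs may differ from the converter's in a few edge-case characters. Characters a charset cannot represent are replaced with `?` (0x3f), which matches the `3f` runs in the ISO-8859-1 and UHC rows.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    # Example input: one kanji followed by a backslash.
    for label, bits, hexstr in encode_table("\u75be\\"):
        print(label, bits, hexstr)
```

For the input `疾\`, the UTF-8, SJIS-WIN, and EUC-JP results begin with `e796be`, `8ebe`, and `bcc0` respectively, the same leading bytes as in the corresponding rows above, and every row ends in `5c` for the backslash.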