To what bit string is a character (or a string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN シ」シ瘤セ鹿シ」疾辞シナ疾鹿讌 10111100101000111011110011100001100011101011111010001110101011011011110010100011100011101011111010001110101010111011110011000101100011101011111010001110101011011110011010100101 bca3bce18ebe8eadbca38ebe8eabbcc58ebe8eade6a5
EUC-JP シ」シ瘤セ鹿シ」疾辞シナ疾鹿讌 100011101011110010001110101000111000111010111100111000011110111010001110101111101011110010101111100011101011110010001110101000111011110011000000101111001010110110001110101111001000111011000101101111001100000010111100101011111110110010100111 8ebc8ea38ebce1ee8ebebcaf8ebc8ea3bcc0bcad8ebc8ec5bcc0bcafeca7
UTF-8 シ」シ瘤セ鹿シ」疾辞シナ疾鹿讌 111011111011110110111100111011111011110110100011111011111011110110111100111001111001100010100100111011111011110110111110111010011011100110111111111011111011110110111100111011111011110110100011111001111001011010111110111010001011111010011110111011111011110110111100111011111011111010000101111001111001011010111110111010011011100110111111111010001010111010001100 efbdbcefbda3efbdbce798a4efbdbee9b9bfefbdbcefbda3e796bee8be9eefbdbcefbe85e796bee9b9bfe8ae8c
UHC ???瘤?鹿??疾???疾鹿? 0011111100111111001111111101011110111011001111111101011011100011001111110011111111110010111100000011111100111111001111111111001011110000110101101110001100111111 3f3f3fd7bb3fd6e33f3ff2f03f3f3ff2f0d6e33f

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
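
The kind of conversion shown in the table can be approximated with a short Python sketch. This is an illustration only, not the code behind this page: the `CODECS` mapping and the function names `encode_row` and `convert` are hypothetical, and the Python codecs chosen here (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) are assumed to be close equivalents that may differ from the page's converter for some characters. Characters that a charset cannot represent are replaced with "?" (0x3f), which is consistent with the runs of `3f` in the ISO-8859-1 and UHC rows.

```python
# Assumed mapping from the table's charset labels to Python codec names.
# These names are an illustration; the page's own converter may differ.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Encode text with one codec and return (binary string, hex string)."""
    # errors="replace" substitutes "?" for characters the charset lacks.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


def convert(text):
    """Return {charset label: (binary, hex)} for every charset in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("あ").items():
        print(label, binary, hexadecimal)
```

For example, `encode_row("あ", "utf-8")` yields the three bytes `e38182`, while the same character in ISO-8859-1 collapses to the single replacement byte `3f`.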