To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN シス叱ハス竺 11110000101011111011110011110001100011101011110110001110101101101111000111100100110010101111000110001110101111011000111010110001 f0afbcf18ebd8eb6f1e4caf18ebd8eb1
EUC-JP ?シ?ス叱?ハ?ス竺 00111111100011101011110000111111100011101011110110111100101110000011111110001110110010100011111110001110101111011011110010110011 3f8ebc3f8ebdbcb83f8eca3f8ebdbcb3
UTF-8 シス叱ハス竺 111011101000000110101110111011111011110110111100111011101000010010001001111011111011110110111101111001011000111110110001111011101000010110011111111011111011111010001010111011101000010010001001111011111011110110111101111001111010101110111010 ee81aeefbdbcee8489efbdbde58fb1ee859fefbe8aee8489efbdbde7abba
UHC ????叱????竺 001111110011111100111111001111111111001011101010001111110011111100111111001111111111010111100111 3f3f3f3ff2ea3f3f3f3ff5e7

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)