To what bit string is a character (or a short string) encoded in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 | ??????? | 00111111001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f3f
SJIS-WIN | 瘟?ぐ耶ヨ?? | 1110000110001001001111111000001010101110100101101110101110000011100010000011111100111111 | e1893f82ae96eb83883f3f
EUC-JP | 瘟?ぐ耶ヨ?? | 1110000111101001001111111010010010110000110011001110110110100101111010000011111100111111 | e1e93fa4b0cceda5e83f3f
UTF-8 | 瘟룩ぐ耶ヨ갬銳 | 111001111001100010011111111010111010001110101001111000111000000110010000111010001000000010110110111000111000001110101000111010101011000010101100111010011000101010110011 | e7989feba3a9e38190e880b6e383a8eab0ace98ab3
UHC | 瘟룩ぐ耶ヨ갬銳 | 1110100010110000101101111110100010101010101100001110010110101101101010111110100010110000101101111110011111100101 | e8b0b7e8aab0e5adabe8b0b7e7e5
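
The kind of conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the tool's actual implementation. The function names `encode_row` and `convert` are hypothetical, and the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) is an assumption. Characters that a charset cannot represent are replaced with `?` (0x3f), which mirrors the `3f` bytes in the rows above. Results may differ from the tool's for some charsets.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-WIN corresponds to CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to CP949
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters become '?' (0x3f) via errors="replace".
    """
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


def convert(text):
    """Encode text in every charset listed in CHARSETS."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (bits, hexa) in convert("ぐ").items():
        print(name, bits, hexa)
```

For a single character such as "ぐ", the UTF-8 row comes out as three bytes (`e38190`), while the two Japanese charsets use two bytes each and ISO-8859-1 falls back to a single `3f`.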

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).