What bit string is a character (or string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 | ??????? | 00111111001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f3f
SJIS-WIN | 汚х┴熱?姚ι | 10001001100110001000010010000111100001001010100010010100010011010011111110011011010011001000001111000111 | 8998848784a8944d3f9b4c83c7
EUC-JP | 汚х┴熱?姚ι | 10110001111110001010011111100111101010001010101011000111101011100011111111010101101011011010011011001001 | b1f8a7e7a8aac7ae3fd5ada6c9
UTF-8 | 汚х┴熱풞姚ι | 11100110101100011001101011010001100001011110001010010100101101001110011110000110101100011110110110010010100111101110010110100111100110101100111010111001 | e6b19ad185e294b4e786b1ed929ee5a79aceb9
UHC | 汚х┴熱풞姚ι | 1110011111111101101011001110011110100110101010101110011011110000101111110100000111101000111011101010010111101001 | e7fdace7a6aae6f0bf41e8eea5e9

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
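
A conversion like the one in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page: the helper names `encode_row` and `convert` are hypothetical, and the mapping from the table's charset labels to Python codec names (for example SJIS-WIN to `cp932`, UHC to `cp949`) is an assumption. Characters a charset cannot represent are replaced with "?" (3f), which is the behavior the table shows; other byte values may differ slightly between implementations.

```python
# Assumed mapping from the labels used on this page to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (displayed text, binary bit string, hex string) for one charset."""
    # Unencodable characters become "?" instead of raising an error.
    data = text.encode(codec, errors="replace")
    # Decode again to see what actually survived the conversion.
    shown = data.decode(codec, errors="replace")
    # Eight bits per byte, most significant bit first.
    bits = "".join(f"{byte:08b}" for byte in data)
    return shown, bits, data.hex()


def convert(text):
    """Encode the text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (shown, bits, hexstr) in convert("汚х┴熱풞姚ι").items():
        print(label, shown, bits, hexstr)
```

Decoding the bytes again is what reveals the "?" substitutions in the Character column, since the replacement happens during encoding.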