To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 要?????宋?ゴ 100101110111011000111111001111110011111100111111001111111001000101110110001111111000001101010011 97763f3f3f3f3f91763f8353
EUC-JP 要?????宋?ゴ 110011011101011100111111001111110011111100111111001111111100000111010111001111111010010110110100 cdd73f3f3f3f3fc1d73fa5b4
UTF-8 要랃슉樂뗦쮭宋볣ゴ 111010001010011010000001111010111001111010000011111011001000101010001001111011111010011010111111111010111001011110100110111011001010111010101101111001011010111010001011111010111011001110100011111000111000001010110100 e8a681eb9e83ec8a89efa6bfeb97a6ecaeade5ae8bebb3a3e382b4
UHC 要랃슉樂뗦쮭宋볣ゴ 111010011010100110001101111011111011110110110101111010001111100110001011111001101010100010001010111000011110010010010011111010011010101110110100 e9a98defbdb5e8f98be6a88ae1e493e9abb4

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)