What bit string is a character (or short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN ??????爾?汐 0011111100111111001111110011111100111111001111111000111010100010001111111000111010101100 3f3f3f3f3f3f8ea23f8eac
EUC-JP 瑄?瑄??瑄爾瑄汐 10001111110011001011100100111111100011111100110010111001001111110011111110001111110011001011100110111100101001001000111111001100101110011011110010101110 8fccb93f8fccb93f3f8fccb9bca48fccb9bcae
UTF-8 瑄罹瑄롛롗瑄爾瑄汐 111001111001000110000100111011111010011110100110111001111001000110000100111010111010000110011011111010111010000110010111111001111001000110000100111001111000100010111110111001111001000110000100111001101011000110010000 e79184efa7a6e79184eba19beba197e79184e788bee79184e6b190
UHC 瑄罹瑄롛롗瑄爾瑄汐 111000001100010111101100101110101110000011000101100011101101111110001110110110111110000011000101111011001011001111100000110001011110000010110001 e0c5ecbae0c58edf8edbe0c5ecb3e0c5e0b1

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). A character that a charset cannot represent appears as "?" (0x3f) in the table above.
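
The conversion in the table can be approximated offline. The following is a minimal sketch in Python, not the code this page runs. The codec names in CHARSETS are assumptions about which Python codecs correspond to the charset labels above (for example cp932 for SJIS-WIN and cp949 for UHC), and encode_bits and convert are hypothetical helper names. Mapping tables differ between implementations, so some results may not match the table exactly.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_bits(text, codec):
    """Encode text with codec and return (binary string, hex string)."""
    # errors="replace" substitutes "?" (0x3f) for characters the codec
    # cannot represent, similar to the "?" entries in the table.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


def convert(text):
    """Return {charset label: (binary, hex)} for every charset above."""
    return {label: encode_bits(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("汐").items():
        print(label, binary, hexadecimal)
```

Each byte is printed as eight binary digits, so the binary column is always eight times as long as the byte count and four times as long as the hexadecimal column.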