To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ??????耳?汐??????耳?汐^ 0011111100111111001111110011111100111111001111111000111010101000001111111000111010101100001111110011111100111111001111110011111100111111100011101010100000111111100011101010110001011110 3f3f3f3f3f3f8ea83f8eac3f3f3f3f3f3f8ea83f8eac5e
EUC-JP 瑄?瑄??瑄耳瑄汐瑄?瑄??瑄耳瑄汐^ 100011111100110010111001001111111000111111001100101110010011111100111111100011111100110010111001101111001010101010001111110011001011100110111100101011101000111111001100101110010011111110001111110011001011100100111111001111111000111111001100101110011011110010101010100011111100110010111001101111001010111001011110 8fccb93f8fccb93f3f8fccb9bcaa8fccb9bcae8fccb93f8fccb93f3f8fccb9bcaa8fccb9bcae5e
UTF-8 瑄罹瑄롛롗瑄耳瑄汐瑄罹瑄롛롗瑄耳瑄汐^ 11100111100100011000010011101111101001111010011011100111100100011000010011101011101000011001101111101011101000011001011111100111100100011000010011101000100000001011001111100111100100011000010011100110101100011001000011100111100100011000010011101111101001111010011011100111100100011000010011101011101000011001101111101011101000011001011111100111100100011000010011101000100000001011001111100111100100011000010011100110101100011001000001011110 e79184efa7a6e79184eba19beba197e79184e880b3e79184e6b190e79184efa7a6e79184eba19beba197e79184e880b3e79184e6b1905e
UHC 瑄罹瑄롛롗瑄耳瑄汐瑄罹瑄롛롗瑄耳瑄汐^ 11100000110001011110110010111010111000001100010110001110110111111000111011011011111000001100010111101100101111001110000011000101111000001011000111100000110001011110110010111010111000001100010110001110110111111000111011011011111000001100010111101100101111001110000011000101111000001011000101011110 e0c5ecbae0c58edf8edbe0c5ecbce0c5e0b1e0c5ecbae0c58edf8edbe0c5ecbce0c5e0b15e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)