To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN テ、テ凖姪崚、テ凖姪嫂 1100001110100100110000111001100111000011100101101100001110011011110000111010010011000011100110011100001110010110110000111001101101011110 c3a4c399c396c39bc3a4c399c396c39b5e
EUC-JP テ、テ凖姪崚、テ凖姪嫂 10001110110000111000111010100100100011101100001111010010110001011100110011000101110101101100010110001110101001001000111011000011110100101100010111001100110001011101010110111111 8ec38ea48ec3d2c5ccc5d6c58ea48ec3d2c5ccc5d5bf
UTF-8 テ、テ凖姪崚、テ凖姪嫂 111011111011111010000011111011111011110110100100111011111011111010000011111001011000011110010110111001011010011110101010111001011011010010011010111011111011110110100100111011111011111010000011111001011000011110010110111001011010011110101010111001011010101110000010 efbe83efbda4efbe83e58796e5a7aae5b49aefbda4efbe83e58796e5a7aae5ab82
UHC ????姪????姪嫂 0011111100111111001111110011111111110010111010110011111100111111001111110011111111110010111010111110000111111001 3f3f3f3ff2eb3f3f3f3ff2ebe1f9

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)