To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 而陌????缺耳?圓 100011101010011111101000100110010011111100111111001111110011111111100011100111101000111010101000001111111001101010100010 8ea7e8993f3f3f3fe39e8ea83f9aa2
EUC-JP 而陌????缺耳?圓 101111001010100111101111111110010011111100111111001111110011111111100101111111101011110010101010001111111101010010100100 bca9eff93f3f3f3fe5febcaa3fd4a4
UTF-8 而陌렠염裏곈缺耳렲圓 111010001000000010001100111010011001100110001100111010111010000010100000111011001001011110111100111011111010011110100111111010101011001110001000111001111011110010111010111010001000000010110011111010111010000010110010111001011001110010010011 e8808ce9998ceba0a0ec97bcefa7a7eab388e7bcbae880b3eba0b2e59c93
UHC 而陌렠염裏곈缺耳렲圓 1110110010111011110110001110100010001110101100011011111110110000111011001100000010110000111010011100110011000000111011001011110010001110101111111110101010101101 ecbbd8e88eb1bfb0ecc0b0e9ccc0ecbc8ebfeaad

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)