To what bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
| Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
|---|---|---|---|
| ISO-8859-1 | ??????? | 00111111001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f3f |
| SJIS-WIN | 邱夊エ堺サ∫キ | 1110011110110111100110101110100010110100100011011110010010111011100000011110011110110111 | e7b79ae8b48de4bb81e7b7 |
| EUC-JP | 邱夊エ堺サ∫キ | 1110111010111001110101001110101010001110101101001011101011100110100011101011101110100010111010011000111010110111 | eeb9d4ea8eb4bae68ebba2e98eb7 |
| UTF-8 | 邱夊エ堺サ∫キ | 111010011000001010110001111001011010010010001010111011111011110110110100111001011010000010111010111011111011110110111011111000101000100010101011111011111011110110110111 | e982b1e5a48aefbdb4e5a0baefbdbbe288abefbdb7 |
| UHC | 邱??堺?∫? | 11001111110010000011111100111111110011001111011100111111101000011111001000111111 | cfc83f3fccf73fa1f23f |

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
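
A conversion like the one shown on this page can be sketched in Python. This is only an illustration, not the converter's actual implementation: the mapping from the charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) and the helper name `encode_all` are assumptions, and Python's codecs may differ from the converter in edge cases. Characters that a charset cannot represent are replaced with `?` (hex `3f`), which matches the `3f` bytes visible in the ISO-8859-1 and UHC rows.

```python
# Charset labels from this page mapped to Python codec names.
# The mapping is an assumption: these are the closest standard-library codecs.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_all(text):
    """Return {charset label: (binary bit string, hex string)} for text."""
    rows = {}
    for label, codec in CODECS.items():
        # errors="replace" substitutes '?' (0x3f) for unencodable characters.
        raw = text.encode(codec, errors="replace")
        # Each byte becomes 8 binary digits, concatenated without separators.
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows[label] = (bits, raw.hex())
    return rows


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_all("あ").items():
        print(label, bits, hex_string)
```

Calling `encode_all` with any short string returns one entry per charset, so the result can be printed row by row in the same column order as the listing on this page.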