To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 玉???d?蟻 10001011110010100011111100111111001111111000001010000100001111111000101101100001 8bca3f3f3f82843f8b61
EUC-JP 玉??彛d?蟻 101101101100110000111111001111111000111110111100111110101010001111100100001111111011010111000010 b6cc3f3f8fbcfaa3e43fb5c2
UTF-8 玉좉퀋彛d퓩蟻 111001111000111010001001111011001010001010001001111011011000000010001011111001011011110110011011111011111011110110000100111011011001001110101001111010001001111110111011 e78e89eca289ed808be5bd9befbd84ed93a9e89fbb
UHC 玉좉퀋彛d퓩蟻 1110100010101100101000001110101010110011100000011110110010101101101000111110010010111111100100011110101111111100 e8aca0eab381ecada3e4bf91ebfc

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)