To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 蠢?爰???淀 11100101101111110011111111100000101001110011111100111111001111111001011110000100 e5bf3fe0a73f3f3f9784
EUC-JP 蠢?爰?焌?淀 111010101100000100111111111000001010100100111111100011111100100111101000001111111100110111100100 eac13fe0a93f8fc9e83fcde4
UTF-8 蠢렎爰렪焌렠淀 111010001010000010100010111010111010000010001110111001111000100010110000111010111010000010101010111001111000010010001100111010111010000010100000111001101011011110000000 e8a0a2eba08ee788b0eba0aae7848ceba0a0e6b780
UHC 蠢렎爰렪焌렠淀 1111000111100011100011101010010011101010101110101000111010111000111100011110000010001110101100011110111111100011 f1e38ea4eaba8eb8f1e08eb1efe3

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)