To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 鎰?爭篩??Τ 1110100001001100001111111110000010100101111000101011111100111111001111111000001110110001 e84c3fe0a5e2bf3f3f83b1
EUC-JP 鎰?爭篩??Τ 1110111110101101001111111110000010100111111001001100000100111111001111111010011010110011 efad3fe0a7e4c13f3fa6b3
UTF-8 鎰렏爭篩뤵쇱Τ 1110100110001110101100001110101110100000100011111110011110001000101011011110011110101111101010011110101110100100101101011110110010000111101100011100111010100100 e98eb0eba08fe788ade7afa9eba4b5ec87b1cea4
UHC 鎰렏爭篩뤵쇱Τ 1110110011110000100011101010010111101110101100111101111011101000100011111110001110111100111011001010010111010011 ecf08ea5eeb3dee88fe3bceca5d3

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)