To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 | ??????? | 00111111001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f3f
SJIS-WIN | 塢?????張 | 100110101100011100111111001111110011111100111111001111111001001010100011 | 9ac73f3f3f3f3f92a3
EUC-JP | 塢??縕??張 | 1101010011001001001111110011111110001111110101001100001000111111001111111100010010100101 | d4c93f3f8fd4c23f3fc4a5
UTF-8 | 塢곻슁縕됧막張 | 111001011010000110100010111010101011001110111011111011001000101010000001111001111011100010010101111010111001000010100111111010111010011110001001111001011011110010110101 | e5a1a2eab3bbec8a81e7b895eb90a7eba789e5bcb5
UHC | 塢곻슁縕됧막張 | 1110011111110001100000011110111110111101101100111110100010110010100010011110010110111000101101111110110111100101 | e7f181efbdb3e8b289e5b8b7ede5

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). A character that a charset cannot represent is shown as "?" (hexadecimal 3f).
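
The conversion shown in the table can be approximated with a short Python sketch. This is not the code behind this page; it is a minimal illustration under a few assumptions. The names `CODECS`, `encode_row`, and `convert` are hypothetical, and the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) is an assumption. Python's codecs may also differ from the converter used here for some characters, so the bytes are not guaranteed to match the table in every case.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for unrepresentable characters.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


# Print one row per charset for a sample input.
for label, (bits, hex_string) in convert("あ").items():
    print(label, bits, hex_string)
```

Each byte is rendered as eight binary digits and two hexadecimal digits, so the binary column is always four times as long as the hexadecimal column.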