What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN 鐔常臭荵御音絨坂 11101000010111001000111111101101100011110100110011100100101110011000110011100100100010011011100111100011010011111000110111100010 e85c8fed8f4ce4b98ce489b9e34f8de2
EUC-JP 鐔常臭荵御音絨坂 11101111101111011011111011101111101111011010110111101000101110111011100011100110101100101011101111100101101100001011101011100100 efbdbeefbdade8bbb8e6b2bbe5b0bae4
UTF-8 鐔常臭荵御音絨坂 111010011001000010010100111001011011100010111000111010001000011110101101111010001000110110110101111001011011111010100001111010011001111110110011111001111011010110101000111001011001110110000010 e99094e5b8b8e887ade88db5e5bea1e99fb3e7b5a8e59d82
UHC ?常臭?御音絨坂 0011111111011111110010001111011010101011001111111110010111011001111010111110010111101011110101101111011111111000 3fdfc8f6ab3fe5d9ebe5ebd6f7f8
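The conversion shown in the table can be approximated in a few lines of Python. This is only a minimal sketch, not the code behind this page: the helper name to_bit_strings and the mapping from the table's charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) are assumptions, and Python's codecs may differ from this tool's converter for a few characters. Characters that a charset cannot represent are replaced with "?" (0x3f), as in the ISO-8859-1 row above.

```python
def to_bit_strings(text, codec, errors="replace"):
    """Encode text with the given codec and return (binary, hexadecimal) strings."""
    # errors="replace" substitutes "?" (0x3f) for characters the codec cannot encode.
    data = text.encode(codec, errors=errors)
    # Render every byte as eight binary digits, concatenated without separators.
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


# Assumed mapping from the labels used in the table to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}

for label, codec in CODECS.items():
    binary, hexadecimal = to_bit_strings("常", codec)
    print(label, binary, hexadecimal)
```

For the character 常, the UTF-8 line prints the bytes e5b8b8, which also appear in the UTF-8 row of the table, and the ISO-8859-1 line prints 3f because that charset has no code for the character.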

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).