To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 鮓ッ繞コ妺ュ閧 1110100110110110101011111110001110000101101110101111101010100101101011011110100010000010 e9b6afe385bafaa5ade882
EUC-JP 鮓ッ繞コ妺ュ閧 111100101011100010001110101011111110010111100101100011101011101010001111101110011011011110001110101011011110111111100010 f2b88eafe5e58eba8fb9b78eadefe2
UTF-8 鮓ッ繞コ妺ュ閧 111010011010111010010011111011111011110110101111111001111011100110011110111011111011110110111010111001011010011010111010111011111011110110101101111010011001011010100111 e9ae93efbdafe7b99eefbdbae5a6baefbdade996a7
UHC ??繞???? 0011111100111111111010011010010000111111001111110011111100111111 3f3fe9a43f3f3f3f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)