To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????[????[^ 0011111100111111001111110011111101011011001111110011111100111111001111110101101101011110 3f3f3f3f5b3f3f3f3f5b5e
SJIS-WIN ?勹?勹[?勹?勹[^ 001111111001100110101111001111111001100110101111010110110011111110011001101011110011111110011001101011110101101101011110 3f99af3f99af5b3f99af3f99af5b5e
EUC-JP 伈勹伈勹[伈勹伈勹[^ 1000111110110000110101011101001010110001100011111011000011010101110100101011000101011011100011111011000011010101110100101011000110001111101100001101010111010010101100010101101101011110 8fb0d5d2b18fb0d5d2b15b8fb0d5d2b18fb0d5d2b15b5e
UTF-8 伈勹伈勹[伈勹伈勹[^ 111001001011110010001000111001011000101110111001111001001011110010001000111001011000101110111001010110111110010010111100100010001110010110001011101110011110010010111100100010001110010110001011101110010101101101011110 e4bc88e58bb9e4bc88e58bb95be4bc88e58bb9e4bc88e58bb95b5e
UHC ????[????[^ 0011111100111111001111110011111101011011001111110011111100111111001111110101101101011110 3f3f3f3f5b3f3f3f3f5b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)