To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 騾手ヲ也ァ矩尠 111010011000000010001110111010001010011010010110111001111010011110001011111010011001101110010110 e9808ee8a696e7a78be99b96
EUC-JP 騾手ヲ也ァ矩尠 1111000111100000101111001110101010001110101001101100110011101001100011101010011110110110111010111101010111110110 f1e0bcea8ea6cce98ea7b6ebd5f6
UTF-8 騾手ヲ也ァ矩尠 111010011010100010111110111001101000100110001011111011111011110110100110111001001011100110011111111011111011110110100111111001111001111110101001111001011011000010100000 e9a8bee6898befbda6e4b99fefbda7e79fa9e5b0a0
UHC ?手?也?矩? 00111111111000101010001000111111111001011010010100111111110011111011101100111111 3fe2a23fe5a53fcfbb3f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)