To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 撓??要??汚 10011101100110100011111100111111100101110111011000111111001111111000100110011000 9d9a3f3f97763f3f8998
EUC-JP 撓??要??汚 11011001111110100011111100111111110011011101011100111111001111111011000111111000 d9fa3f3fcdd73f3fb1f8
UTF-8 撓뷂숱要뺧쉰汚 111001101001001010010011111010111011011110000010111011001000100010110001111010001010011010000001111010111011101010100111111011001000100110110000111001101011000110011010 e69293ebb782ec88b1e8a681ebbaa7ec89b0e6b19a
UHC 撓뷂숱要뺧쉰汚 1110100011110101100101001110111110111101101000101110100110101001100101011110111110111101101011101110011111111101 e8f594efbda2e9a995efbdaee7fd

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)