To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 澱?猿校??鬱?? 10010011011000100011111110001001100011101000110101011010001111110011111110011111010101000011111100111111 93623f898e8d5a3f3f9f543f3f
EUC-JP 澱?猿校??鬱?? 11000101110000110011111110110001111011101011100110111011001111110011111111011101101101010011111100111111 c5c33fb1eeb9bb3f3fddb53f3f
UTF-8 澱렰猿校렰렎鬱띳렰 111001101011111010110001111010111010000010110000111001111000110010111111111001101010000010100001111010111010000010110000111010111010000010001110111010011010110010110001111010111001110110110011111010111010000010110000 e6beb1eba0b0e78cbfe6a0a1eba0b0eba08ee9acb1eb9db3eba0b0
UHC 澱렰猿校렰렎鬱띳렰 111011101111111010001110101111011110101010111011110011101110100010001110101111011000111010100100111010101010011010110110111100011000111010111101 eefe8ebdeabbcee88ebd8ea4eaa6b6f18ebd

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)