To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 縡?怨峯????趙陌工 1110001101110001001111111000100110000101100101011111010100111111001111110011111100111111111001101110001011101000100110011000110101001000 e3713f898595f53f3f3f3fe6e2e8998d48
EUC-JP 縡?怨峯????趙陌工 1110010111010010001111111011000111100101110010101111011100111111001111110011111100111111111011001110010011101111111110011011100110101001 e5d23fb1e5caf73f3f3f3fece4eff9b9a9
UTF-8 縡렕怨峯렟닿렕렟趙陌工 111001111011100010100001111010111010000010010101111001101000000010101000111001011011001110101111111010111010000010011111111010111000101110111111111010111010000010010101111010111010000010011111111010001011011010011001111010011001100110001100111001011011011110100101 e7b8a1eba095e680a8e5b3afeba09feb8bbfeba095eba09fe8b699e9998ce5b7a5
UHC 縡렕怨峯렟닿렕렟趙陌工 11101110101011011000111010101010111010101011001111011100111001111000111010110000101101001110101010001110101010101000111010110000111100001110000111011000111010001100110111101111 eead8eaaeab3dce78eb0b4ea8eaa8eb0f0e1d8e8cdef

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)