To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 樗峰???底????? 1001001010010100100101011111010000111111001111110011111110010010111010100011111100111111001111110011111100111111 929495f43f3f3f92ea3f3f3f3f3f
EUC-JP 樗峰???底????? 1100001111110100110010101111011000111111001111110011111111000100111011000011111100111111001111110011111100111111 c3f4caf63f3f3fc4ec3f3f3f3f3f
UTF-8 樗峰렕쇘땐底펭렒곌렫렣 111001101010100010010111111001011011001110110000111010111010000010010101111011001000011110011000111010111001010110010000111001011011101010010101111011011000111010101101111010111010000010010010111010101011001110001100111010111010000010101011111010111010000010100011 e6a897e5b3b0eba095ec8798eb9590e5ba95ed8eadeba092eab38ceba0abeba0a3
UHC 樗峰렕쇘땐底펭렒곌렫렣 11101110110000001101110011101000100011101010101010111100111001111011011010101001111011101011110011000110111010111000111010100111101100001110101010001110101110011000111010110100 eec0dce88eaabce7b6a9eebcc6eb8ea7b0ea8eb98eb4

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)