What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 | ???????? | 0011111100111111001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f3f3f
SJIS-WIN | 岳??松θぜ違? | 10001010011110000011111100111111100011111011110010000011110001101000001010111010100010001110000100111111 | 8a783f3f8fbc83c682ba88e13f
EUC-JP | 岳??松θぜ違? | 10110011110110010011111100111111101111101011111010100110110010001010010010111100101100001110001100111111 | b3d93f3fbebea6c8a4bcb0e33f
UTF-8 | 岳묒빖松θぜ違먯 | 1110010110110010101100111110101110101100100100101110101110111001100101101110011010011101101111101100111010111000111000111000000110011100111010011000000110010101111010111010100010101111 | e5b2b3ebac92ebb996e69dbeceb8e3819ce98195eba8af
UHC | 岳묒빖松θぜ違먯 | 11100100101111111001000111101100100101011011100011100001111001101010010111101000101010101011110011101010110111101001000011101100 | e4bf91ec95b8e1e6a5e8aabceade90ec

SJIS-WIN, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-WIN = CP932) and on UNIX (EUC-JP). A character that a charset cannot represent is shown as "?" (byte 3f).
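
A conversion like the one in the table can be approximated offline. The following is a minimal Python sketch, not the page's actual implementation, which is unknown. The `CODECS` mapping from the page's charset labels to Python codec names is an assumption (in particular `cp932` for SJIS-WIN and `cp949` for UHC), and the helper names `encode_row` and `encode_table` are hypothetical. Unencodable characters are replaced with "?" (byte 3f), which matches what the table shows; results for some characters may still differ from the page's.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: CP932 is the closest Python codec
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: CP949 is Python's name for UHC
}


def encode_row(text, codec):
    """Return (characters as stored, binary bit string, hex string)."""
    # errors="replace" turns unencodable characters into "?" (0x3f).
    data = text.encode(codec, errors="replace")
    # Decode again to show what actually survived the conversion.
    shown = data.decode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)
    return shown, bits, data.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (shown, bits, hex_string) in encode_table("岳").items():
        print(label, shown, bits, hex_string)
```

Calling `encode_table` with the same input string as the page should give one row per charset in the same column order as the table.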