To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 櫻??旬??昻 10011111010011100011111100111111100011110111101100111111001111111111101011010000 9f4e3f3f8f7b3f3ffad0
EUC-JP 櫻??旬??? 110111011010111100111111001111111011110111011100001111110011111100111111 ddaf3f3fbddc3f3f3f
UTF-8 櫻쇘떋旬댐㎢昻 111001101010101110111011111011001000011110011000111010111001011010001011111001101001011110101100111010111000110010010000111000111000111010100010111001101001100010111011 e6abbbec8798eb968be697aceb8c90e38ea2e698bb
UHC 櫻쇘떋旬댐㎢昻 1110010110100001101111001110011110001011101000011110001011100010101101001110111110100111101101001110010011101001 e5a1bce78ba1e2e2b4efa7b4e4e9

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)