To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????B 00111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f42
SJIS-WIN 櫻→?櫻→?B 1001111101001110100000011010100000111111100111110100111010000001101010000011111101000010 9f4e81a83f9f4e81a83f42
EUC-JP 櫻→?櫻→?B 1101110110101111101000101010101000111111110111011010111110100010101010100011111101000010 ddafa2aa3fddafa2aa3f42
UTF-8 櫻→찣櫻→찣B 11100110101010111011101111100010100001101001001011101100101100001010001111100110101010111011101111100010100001101001001011101100101100001010001101000010 e6abbbe28692ecb0a3e6abbbe28692ecb0a342
UHC 櫻→찣櫻→찣B 11100101101000011010000111100110101010011001111111100101101000011010000111100110101010011001111101000010 e5a1a1e6a99fe5a1a1e6a99f42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)