To what bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????^ 00111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f5e
SJIS-WIN 蒸堪?蒸堪?^ 1000111111110110100010101010110000111111100011111111011010001010101011000011111101011110 8ff68aac3f8ff68aac3f5e
EUC-JP 蒸堪?蒸堪?^ 1011111011111000101101001010111000111111101111101111100010110100101011100011111101011110 bef8b4ae3fbef8b4ae3f5e
UTF-8 蒸堪㉩蒸堪㉦^ 11101000100100101011100011100101101000001010101011100011100010011010100111101000100100101011100011100101101000001010101011100011100010011010011001011110 e892b8e5a0aae389a9e892b8e5a0aae389a65e
UHC 蒸堪㉩蒸堪㉦^ 11110001111110101100101011101101101010001011101011110001111110101100101011101101101010001011011101011110 f1facaeda8baf1facaeda8b75e

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). Characters that a charset cannot represent appear in the table as "?" (0x3f).
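
The conversion shown in the table can be approximated offline with a short Python sketch. This is not the code behind this page; it is a minimal illustration that assumes Python's built-in codecs "cp932", "euc_jp" and "cp949" correspond to SJIS-WIN, EUC-JP and UHC, and that unencodable characters are replaced with "?". The function name encode_to_bits and the CHARSETS mapping are made up for this example.

```python
# Assumed mapping from the labels used in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_to_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent are replaced with "?" (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    # Each byte becomes exactly 8 binary digits, most significant bit first.
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


if __name__ == "__main__":
    sample = "蒸堪㉩蒸堪㉦^"
    for label, codec in CHARSETS.items():
        binary, hexadecimal = encode_to_bits(sample, codec)
        print(label, binary, hexadecimal)
```

Each output line has the charset label, the binary bit string and the hexadecimal form, in the same column order as the table above. Results for legacy charsets can differ between implementations, so small deviations from the page's output are possible.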