To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????^ 00111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f5e
SJIS-WIN 聢丞■聢丞■^ 11100011110111001000111111100101100000011010000111100011110111001000111111100101100000011010000101011110 e3dc8fe581a1e3dc8fe581a15e
EUC-JP 聢丞■聢丞■^ 11100110110111101011111011100111101000101010001111100110110111101011111011100111101000101010001101011110 e6debee7a2a3e6debee7a2a35e
UTF-8 聢丞■聢丞■^ 11101000100000011010001011100100101110001001111011100010100101101010000011101000100000011010001011100100101110001001111011100010100101101010000001011110 e881a2e4b89ee296a0e881a2e4b89ee296a05e
UHC ?丞■?丞■^ 0011111111100011101010101010000111100001001111111110001110101010101000011110000101011110 3fe3aaa1e13fe3aaa1e15e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)