To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????^ 00111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f5e
SJIS-WIN ??┏??┏^ 001111110011111110000100101011000011111100111111100001001010110001011110 3f3f84ac3f3f84ac5e
EUC-JP 薏?┏薏?┏^ 10001111110110011101111000111111101010001010111010001111110110011101111000111111101010001010111001011110 8fd9de3fa8ae8fd9de3fa8ae5e
UTF-8 薏멩┏薏멩┏^ 11101000100101101000111111101011101010011010100111100010100101001000111111101000100101101000111111101011101010011010100111100010100101001000111101011110 e8968feba9a9e2948fe8968feba9a9e2948f5e
UHC 薏멩┏薏멩┏^ 11101011111110111011100011100110101001101010111011101011111110111011100011100110101001101010111001011110 ebfbb8e6a6aeebfbb8e6a6ae5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)