To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????^ 00111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f5e
SJIS-WIN ?翹??翹?^ 001111111110001111001001001111110011111111100011110010010011111101011110 3fe3c93f3fe3c93f5e
EUC-JP 荑翹?荑翹?^ 10001111110101111111100111100110110010110011111110001111110101111111100111100110110010110011111101011110 8fd7f9e6cb3f8fd7f9e6cb3f5e
UTF-8 荑翹㉠荑翹㉠^ 11101000100011011001000111100111101111111011100111100011100010011010000011101000100011011001000111100111101111111011100111100011100010011010000001011110 e88d91e7bfb9e389a0e88d91e7bfb9e389a05e
UHC 荑翹㉠荑翹㉠^ 11101100101111111100111011101110101010001011000111101100101111111100111011101110101010001011000101011110 ecbfceeea8b1ecbfceeea8b15e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)