To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????B 00111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f42
SJIS-WIN 垣企?垣企?B 1000101001011111100010101110100100111111100010100101111110001010111010010011111101000010 8a5f8ae93f8a5f8ae93f42
EUC-JP 垣企?垣企?B 1011001111000000101101001110101100111111101100111100000010110100111010110011111101000010 b3c0b4eb3fb3c0b4eb3f42
UTF-8 垣企㉢垣企㉢B 11100101100111101010001111100100101111001000000111100011100010011010001011100101100111101010001111100100101111001000000111100011100010011010001001000010 e59ea3e4bc81e389a2e59ea3e4bc81e389a242
UHC 垣企㉢垣企㉢B 11101010101011111101000011101010101010001011001111101010101011111101000011101010101010001011001101000010 eaafd0eaa8b3eaafd0eaa8b342

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)