What bit string is a character (or string of characters) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????\ 00111111001111110011111100111111001111110011111100111111001111110011111101011100 3f3f3f3f3f3f3f3f3f5c
SJIS-WIN ???誼??攸??\ 001111110011111100111111100010110110001000111111001111111001110110111111001111110011111101011100 3f3f3f8b623f3f9dbf3f3f5c
EUC-JP ???誼??攸??\ 001111110011111100111111101101011100001100111111001111111101101011000001001111110011111101011100 3f3f3fb5c33f3fdac13f3f5c
UTF-8 聯뤿끏誼붷궟攸낆졒\ 11101111101001101001011111101011101001001011111111101011100000011000111111101000101010101011110011101011101101101011011111101010101101101001111111100110100101001011100011101011100000101000011011101100101000011001001001011100 efa697eba4bfeb818fe8aabcebb6b7eab69fe694b8eb8286eca1925c
UHC 聯뤿끏誼붷궟攸낆졒\ 11100110111000011000111111101011100001011011111111101011111111101001010011100101100000101011001011101010111100101000010111101100101000001011111101011100 e6e18feb85bfebfe94e582b2eaf285eca0bf5c

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
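
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page: the function name `encode_table` and the mapping from the table's charset labels to Python codec names are assumptions (in particular, UHC is assumed to correspond to Python's `cp949` codec, and SJIS-WIN to `cp932`). Characters that a charset cannot represent are replaced with "?" (0x3f), which appears to match the behavior seen in the ISO-8859-1 row.

```python
# Assumed mapping from the labels used in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (charset, binary bit string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    # U+8ABC followed by a backslash, as in the last two characters
    # of several rows above.
    for label, bits, hexstr in encode_table("\u8abc\\"):
        print(label, bits, hexstr)
```

For the input U+8ABC (誼) followed by a backslash, the UTF-8 row should read e8aabc5c, in line with the corresponding bytes in the table above.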