To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 鴦??惟??悟??毅??蹂??押り?艤 111010011111000100111111001111111000100011010010001111110011111110001100111001010011111100111111100010110100001000111111001111111110011011111000001111110011111110001001100111111000001011101000001111111110010001111110 e9f13f3f88d23f3f8ce53f3f8b423f3fe6f83f3f899f82e83fe47e
EUC-JP 鴦??惟??悟??毅??蹂??押り?艤 111100101111001100111111001111111011000011010100001111110011111110111000111001110011111100111111101101011010001100111111001111111110110011111010001111110011111110110010101000011010010011101010001111111110011111011111 f2f33f3fb0d43f3fb8e73f3fb5a33f3fecfa3f3fb2a1a4ea3fe7df
UTF-8 鴦꾨땶惟깅덩悟귣슣毅싧퐲蹂잛뫓押り낮艤 111010011011010010100110111010101011111010101000111010111001010110110110111001101000001110011111111010101011100110000101111010111000110110101001111001101000001010011111111010101011011110100011111011001000101010100011111001101010111110000101111011001000101110100111111011011001000010110010111010001011100110000010111011001001111010011011111010111010101110010011111001101000101010111100111000111000001010001010111010111000001010101110111010001000100110100100 e9b4a6eabea8eb95b6e6839feab985eb8da9e6829feab7a3ec8aa3e6af85ec8ba7ed90b2e8b982ec9e9bebab93e68abce3828aeb82aee889a4
UHC 鴦꾨땶惟깅덩悟귣슣毅싧퐲蹂잛뫓押り낮艤 1110010011101100100001001110101110001011100011001110101011101110101100011110101110110101101000101110011111110110100000101110101110011010101011111110101111110110100110101110010110111101100110111110101110110011100111111110110010010001101101011110010011100011101010101110101010110011101101111110101111111010 e4ec84eb8b8ceaeeb1ebb5a2e7f682eb9aafebf69ae5bd9bebb39fec91b5e4e3aaeab3b7ebfa

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)