To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 嶸??揖??蹂??嶸??泣??儒??雅 1111101010110100001111110011111110010111010010110011111100111111111001101111100000111111001111111111101010110100001111110011111110001011100000110011111100111111100011101111001000111111001111111000100111101011 fab43f3f974b3f3fe6f83f3ffab43f3f8b833f3f8ef23f3f89eb
EUC-JP 嶸??揖??蹂??嶸??泣?ˇ儒??雅 100011111011101111110100001111110011111111001101101011000011111100111111111011001111101000111111001111111000111110111011111101000011111100111111101101011110001100111111100011111010001010110000101111001111010000111111001111111011001011101101 8fbbf43f3fcdac3f3fecfa3f3f8fbbf43f3fb5e33f8fa2b0bcf43f3fb2ed
UTF-8 嶸뗭옚揖썸솮蹂⑹뒛嶸뗭옚泣섉ˇ儒묒쓥雅 1110010110110110101110001110101110010111101011011110110010011000100110101110011010001111100101101110110010001101101110001110110010000110101011101110100010111001100000101110001010010001101110011110101110010010100110111110010110110110101110001110101110010111101011011110110010011000100110101110011010110011101000111110110010000100100010011100101110000111111001011000010010010010111010111010110010010010111011001001001110100101111010011001101110000101 e5b6b8eb97adec989ae68f96ec8db8ec86aee8b982e291b9eb929be5b6b8eb97adec989ae6b3a3ec8489cb87e58492ebac92ec93a5e99b85
UHC 嶸뗭옚揖썸솮蹂⑹뒛嶸뗭옚泣섉ˇ儒묒쓥雅 1110011110101110100010111110110010011110100111101110101111100111101111011110011010011001101001001110101110110011101010011110110010001010100110001110011110101110100010111110110010011110100111101110101111101000100110001110011010100010101001111110101011100011100100011110110010011101100001101110010010111010 e7ae8bec9e9eebe7bde699a4ebb3a9ec8a98e7ae8bec9e9eebe898e6a2a7eae391ec9d86e4ba

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)