To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 諺?????柚??矣??葯щ?B 100011001011111100111111001111110011111100111111001111111001011101001101001111110011111111100001111000010011111100111111111001001101111010000100100010110011111101000010 8cbf3f3f3f3f3f974d3f3fe1e13f3fe4de848b3f42
EUC-JP 諺??瑗??柚??矣??葯щ?B 1011100011000001001111110011111110001111110011001100000000111111001111111100110110101110001111110011111111100010111000110011111100111111111010001110000010100111111010110011111101000010 b8c13f3f8fccc03f3fcdae3f3fe2e33f3fe8e0a7eb3f42
UTF-8 諺뚯쉪瑗뉐뎄柚뜻럩矣꾧펶葯щ샄B 111010001010101110111010111010111001101010101111111011001000100110101010111001111001000110010111111010111000100110010000111010111000111010000100111001101001111110011010111010111001110010111011111010111001111110101001111001111001111110100011111010101011111010100111111011011000111010110110111010001001000110101111110100011000100111101100100000111000010001000010 e8abbaeb9aafec89aae79197eb8990eb8e84e69f9aeb9cbbeb9fa9e79fa3eabea7ed8eb6e891afd189ec838442
UHC 諺뚯쉪瑗뉐뎄柚뜻럩矣꾧펶葯щ샄B 11100101111011001000110011101100100110101000010011101010101111001000011111100101101101011010110011101010111101101011011011100110100011101000110011101011111110001000010011101010101111001000011111100101101101011010110011101011100110001011011001000010 e5ec8cec9a84eabc87e5b5aceaf6b6e68e8cebf884eabc87e5b5aceb98b642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)