To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 堰?????矣??嚴щ?異??碎??沃?? 10001001100000010011111100111111001111110011111100111111111000011110000100111111001111111001101010001110100001001000101100111111100010001101100100111111001111111110000111101010001111110011111110010111100000000011111100111111 89813f3f3f3f3fe1e13f3f9a8e848b3f88d93f3fe1ea3f3f97803f3f
EUC-JP 堰?????矣??嚴щ?異??碎??沃?? 10110001111000010011111100111111001111110011111100111111111000101110001100111111001111111101001111101110101001111110101100111111101100001101101100111111001111111110001011101100001111110011111111001101111000000011111100111111 b1e13f3f3f3f3fe2e33f3fd3eea7eb3fb0db3f3fe2ec3f3fcde03f3f
UTF-8 堰묐쓷流쒒걡矣쒖뫒嚴щ벊異룩첑碎ㅻ깹沃쇰궟 1110010110100000101100001110101110101100100100001110110010010011101101111110111110100111100010101110110010010010100100101110101010110001101000011110011110011111101000111110110010010010100101101110101110101011100100101110010110011010101101001101000110001001111010111011001010001010111001111001010110110000111010111010001110101001111011001011001010010001111001111010001010001110111000111000010110111011111010101011100110111001111001101011001010000011111011001000011110110000111010101011011010011111 e5a0b0ebac90ec93b7efa78aec9292eab1a1e79fa3ec9296ebab92e59ab4d189ebb28ae795b0eba3a9ecb291e7a28ee385bbeab9b9e6b283ec87b0eab69f
UHC 堰묐쓷流쒒걡矣쒖뫒嚴щ벊異룩첑碎ㅻ깹沃쇰궟 111001011110100010010001111010111001110110010100111010101111110010011100111010011000000110001010111010111111100010011100111011001001000110110100111001011111000110101100111010111001001110101101111011001011011010110111111010001010101010011110111000011110111110100100111010111011001010100001111010001010101010111100111010111000001010110010 e5e891eb9d94eafc9ce9818aebf89cec91b4e5f1aceb93adecb6b7e8aa9ee1efa4ebb2a1e8aabceb82b2

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)