To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 予??維??純?ぇ予??維??純?ぇ^ 100101110101110000111111001111111000100011011011001111110011111110001111100000110011111110000010101001011001011101011100001111110011111110001000110110110011111100111111100011111000001100111111100000101010010101011110 975c3f3f88db3f3f8f833f82a5975c3f3f88db3f3f8f833f82a55e
EUC-JP 予??維??純?ぇ予??維??純?ぇ^ 110011011011110100111111001111111011000011011101001111110011111110111101111000110011111110100100101001111100110110111101001111110011111110110000110111010011111100111111101111011110001100111111101001001010011101011110 cdbd3f3fb0dd3f3fbde33fa4a7cdbd3f3fb0dd3f3fbde33fa4a75e
UTF-8 予롢룗維껅쉸純볥ぇ予롢룗維껅쉸純볥ぇ^ 11100100101110101000100011101011101000011010001011101011101000111001011111100111101101101010110111101010101110111000010111101100100010011011100011100111101101001001010011101011101100111010010111100011100000011000011111100100101110101000100011101011101000011010001011101011101000111001011111100111101101101010110111101010101110111000010111101100100010011011100011100111101101001001010011101011101100111010010111100011100000011000011101011110 e4ba88eba1a2eba397e7b6adeabb85ec89b8e7b494ebb3a5e38187e4ba88eba1a2eba397e7b6adeabb85ec89b8e7b494ebb3a5e381875e
UHC 予롢룗維껅쉸純볥ぇ予롢룗維껅쉸純볥ぇ^ 11100101111110001000111011100011100011111001001111101011101010111000001111100110100110101000111011100010111011011001001111101011101010101010011111100101111110001000111011100011100011111001001111101011101010111000001111100110100110101000111011100010111011011001001111101011101010101010011101011110 e5f88ee38f93ebab83e69a8ee2ed93ebaaa7e5f88ee38f93ebab83e69a8ee2ed93ebaaa75e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)