To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 竺霑識ハ闔ュ璽竺霑識ハ闔ュ璽B 111100011111101010001110101100011110100010111111100011101010111111110010101010011100101011101000100011101010110110001110101000111111000111111010100011101011000111101000101111111000111010101111111100101010100111001010111010001000111010101101100011101010001101000010 f1fa8eb1e8bf8eaff2a9cae88ead8ea3f1fa8eb1e8bf8eaff2a9cae88ead8ea342
EUC-JP ?竺霑識?ハ闔ュ璽?竺霑識?ハ闔ュ璽B 001111111011110010110011111100001100000110111100101100010011111110001110110010101110111111101110100011101010110110111100101001010011111110111100101100111111000011000001101111001011000100111111100011101100101011101111111011101000111010101101101111001010010101000010 3fbcb3f0c1bcb13f8ecaefee8eadbca53fbcb3f0c1bcb13f8ecaefee8eadbca542
UTF-8 竺霑識ハ闔ュ璽竺霑識ハ闔ュ璽B 11101110100001011011010111100111101010111011101011101001100111001001000111101000101011011001100011101110100001111010000011101111101111101000101011101001100101111001010011101111101111011010110111100111100100101011110111101110100001011011010111100111101010111011101011101001100111001001000111101000101011011001100011101110100001111010000011101111101111101000101011101001100101111001010011101111101111011010110111100111100100101011110101000010 ee85b5e7abbae99c91e8ad98ee87a0efbe8ae99794efbdade792bdee85b5e7abbae99c91e8ad98ee87a0efbe8ae99794efbdade792bd42
UHC ?竺霑識??闔?璽?竺霑識??闔?璽B 0011111111110101111001111110111111000101111000111101101100111111001111111111100111101111001111111101111111011110001111111111010111100111111011111100010111100011110110110011111100111111111110011110111100111111110111111101111001000010 3ff5e7efc5e3db3f3ff9ef3fdfde3ff5e7efc5e3db3f3ff9ef3fdfde42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)