What bit string is a character encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 沚???斐ь??猷宏沚???斐ь??猷槐^ 10011111100011010011111100111111001111111001010011100011100001001000111000111111001111111001011101010001100011010100011110011111100011010011111100111111001111111001010011100011100001001000111000111111001111111001011101010001100111101100010101011110 9f8d3f3f3f94e3848e3f3f97518d479f8d3f3f3f94e3848e3f3f97519ec55e
EUC-JP 沚?鱉?斐ь??猷宏沚?鱉?斐ь??猷槐^ 1101110111101101001111111000111111101011110000000011111111001000111001011010011111101110001111110011111111001101101100101011100110101000110111011110110100111111100011111110101111000000001111111100100011100101101001111110111000111111001111111100110110110010110111001100011101011110 dded3f8febc03fc8e5a7ee3f3fcdb2b9a8dded3f8febc03fc8e5a7ee3f3fcdb2dcc75e
UTF-8 沚어鱉뤋斐ь틢↔猷宏沚어鱉뤋斐ь틢↔猷槐^ 1110011010110010100110101110110010010110101101001110100110110001100010011110101110100100100010111110011010010110100100001101000110001100111011011000101110100010111000101000011010010100111001111000110010110111111001011010111010001111111001101011001010011010111011001001011010110100111010011011000110001001111010111010010010001011111001101001011010010000110100011000110011101101100010111010001011100010100001101001010011100111100011001011011111100110101001111001000001011110 e6b29aec96b4e9b189eba48be69690d18ced8ba2e28694e78cb7e5ae8fe6b29aec96b4e9b189eba48be69690d18ced8ba2e28694e78cb7e6a7905e
UHC 沚어鱉뤋斐ь틢↔猷宏沚어鱉뤋斐ь틢↔猷槐^ 1111001010101111101111101110111011011100101011101000111110111011110111011110110010101100111011101011101010001110101000011110101011101011101000111100111011011011111100101010111110111110111011101101110010101110100011111011101111011101111011001010110011101110101110101000111010100001111010101110101110100011110011101101100101011110 f2afbeeedcae8fbbddecaceeba8ea1eaeba3cedbf2afbeeedcae8fbbddecaceeba8ea1eaeba3ced95e

SJIS-Win, EUC-JP: Classic character sets used mainly for encoding Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
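
A table like the one above can be approximated offline with a short Python sketch. This is not the converter's own code: the codec names (`latin-1`, `cp932`, `euc_jp`, `utf-8`, `cp949`) are Python's closest equivalents and are assumptions, as are the helper names `encode_row` and `encode_table`. Characters a charset cannot represent are replaced with `?` (0x3f), which mirrors the runs of `3f` in the rows above, though individual mappings may differ slightly from the tool's.

```python
# Assumed mapping from the labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes '?' (0x3f) for unencodable characters.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("あ^").items():
        print(label, bits, hex_string)
```

Each byte is printed as eight binary digits and two hexadecimal digits, so the binary column is always four times as long as the hexadecimal one.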