To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 «®«¤¡Þa]n}«®«¤¡Þa]n{^ 10001111101010111010111010001111101010111010010010100001110111100110000101011101011011100111110110001111101010111010111010001111101010111010010010100001110111100110000101011101011011100111101101011110 8fabae8faba4a1de615d6e7d8fabae8faba4a1de615d6e7b5e
SJIS-WIN ????????a]n}????????a]n{^ 00111111001111110011111100111111001111110011111100111111001111110110000101011101011011100111110100111111001111110011111100111111001111110011111100111111001111110110000101011101011011100111101101011110 3f3f3f3f3f3f3f3f615d6e7d3f3f3f3f3f3f3f3f615d6e7b5e
EUC-JP ??®??¤¡Þa]n}??®??¤¡Þa]n{^ 0011111100111111100011111010001011101110001111110011111110001111101000101111000010001111101000101100001010001111101010011011000001100001010111010110111001111101001111110011111110001111101000101110111000111111001111111000111110100010111100001000111110100010110000101000111110101001101100000110000101011101011011100111101101011110 3f3f8fa2ee3f3f8fa2f08fa2c28fa9b0615d6e7d3f3f8fa2ee3f3f8fa2f08fa2c28fa9b0615d6e7b5e
UTF-8 «®«¤¡Þa]n}«®«¤¡Þa]n{^ 1100001010001111110000101010101111000010101011101100001010001111110000101010101111000010101001001100001010100001110000111001111001100001010111010110111001111101110000101000111111000010101010111100001010101110110000101000111111000010101010111100001010100100110000101010000111000011100111100110000101011101011011100111101101011110 c28fc2abc2aec28fc2abc2a4c2a1c39e615d6e7dc28fc2abc2aec28fc2abc2a4c2a1c39e615d6e7b5e
UHC ??®??¤¡Þa]n}??®??¤¡Þa]n{^ 001111110011111110100010111001110011111100111111101000101011010010100010101011101010100010101101011000010101110101101110011111010011111100111111101000101110011100111111001111111010001010110100101000101010111010101000101011010110000101011101011011100111101101011110 3f3fa2e73f3fa2b4a2aea8ad615d6e7d3f3fa2e73f3fa2b4a2aea8ad615d6e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)