What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????n}?????????n{^ 0011111100111111001111110011111100111111001111110011111100111111001111110110111001111101001111110011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 窈??俉??絶??n}窈??俉??絶??n{^ 1110001001110111001111110011111111111010011000010011111100111111100100001110001000111111001111110110111001111101111000100111011100111111001111111111101001100001001111110011111110010000111000100011111100111111011011100111101101011110 e2773f3ffa613f3f90e23f3f6e7de2773f3ffa613f3f90e23f3f6e7b5e
EUC-JP 窈??俉??絶??n}窈??俉??絶??n{^ 11100011110110000011111100111111100011111011000110111011001111110011111111000000111001000011111100111111011011100111110111100011110110000011111100111111100011111011000110111011001111110011111111000000111001000011111100111111011011100111101101011110 e3d83f3f8fb1bb3f3fc0e43f3f6e7de3d83f3f8fb1bb3f3fc0e43f3f6e7b5e
UTF-8 窈놅쉿俉득벳絶잌눣n}窈놅쉿俉득벳絶잌눣n{^ 1110011110101010100010001110101110000110100001011110110010001001101111111110010010111111100010011110101110010011100111011110101110110010101100111110011110110101101101101110110010011110100011001110101110001000101000110110111001111101111001111010101010001000111010111000011010000101111011001000100110111111111001001011111110001001111010111001001110011101111010111011001010110011111001111011010110110110111011001001111010001100111010111000100010100011011011100111101101011110 e7aa88eb8685ec89bfe4bf89eb939debb2b3e7b5b6ec9e8ceb88a36e7de7aa88eb8685ec89bfe4bf89eb939debb2b3e7b5b6ec9e8ceb88a36e7b5e
UHC 窈놅쉿俉득벳絶잌눣n}窈놅쉿俉득벳絶잌눣n{^ 1110100110100001100001101110111110111101101100101110011111101011101101011110011010111010101010101110111110111110100111111110010110000111101110100110111001111101111010011010000110000110111011111011110110110010111001111110101110110101111001101011101010101010111011111011111010011111111001011000011110111010011011100111101101011110 e9a186efbdb2e7ebb5e6baaaefbe9fe587ba6e7de9a186efbdb2e7ebb5e6baaaefbe9fe587ba6e7b5e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
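
The conversion shown in the table can be approximated with a short Python sketch. This is not the converter's own implementation, which is not shown here. The codec names in `CODECS` (for example `cp932` for SJIS-WIN and `cp949` for UHC) and the helper names `encode_row` and `encode_table` are assumptions chosen for illustration. Characters that a charset cannot represent are replaced with "?" (0x3f), which appears to match the runs of 3f in the table above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary bit string, hexadecimal string) for text in one codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("n{^").items():
        print(label, bits, hex_string)
```

For the ASCII tail "n{^" every charset yields the same bytes, 6e7b5e, which is why all five rows of the table end with the same 24 bits.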