To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 窈?????衍?????窈?????衍?????^ 1110001001110111001111110011111100111111001111110011111110011111101001010011111100111111001111110011111100111111111000100111011100111111001111110011111100111111001111111001111110100101001111110011111100111111001111110011111101011110 e2773f3f3f3f3f9fa53f3f3f3f3fe2773f3f3f3f3f9fa53f3f3f3f3f5e
EUC-JP 窈?????衍?????窈?????衍?????^ 1110001111011000001111110011111100111111001111110011111111011110101001110011111100111111001111110011111100111111111000111101100000111111001111110011111100111111001111111101111010100111001111110011111100111111001111110011111101011110 e3d83f3f3f3f3fdea73f3f3f3f3fe3d83f3f3f3f3fdea73f3f3f3f3f5e
UTF-8 窈뚪쟽溜곕젷衍뚪쟽溜곕젍窈뚪쟽溜곕젷衍뚪쟽溜곕젍^ 11100111101010101000100011101011100110101010101011101100100111111011110111101111101001111000101111101010101100111001010111101100101000001011011111101000101000011000110111101011100110101010101011101100100111111011110111101111101001111000101111101010101100111001010111101100101000001000110111100111101010101000100011101011100110101010101011101100100111111011110111101111101001111000101111101010101100111001010111101100101000001011011111101000101000011000110111101011100110101010101011101100100111111011110111101111101001111000101111101010101100111001010111101100101000001000110101011110 e7aa88eb9aaaec9fbdefa78beab395eca0b7e8a18deb9aaaec9fbdefa78beab395eca08de7aa88eb9aaaec9fbdefa78beab395eca0b7e8a18deb9aaaec9fbdefa78beab395eca08d5e
UHC 窈뚪쟽溜곕젷衍뚪쟽溜곕젍窈뚪쟽溜곕젷衍뚪쟽溜곕젍^ 11101001101000011000110011101001101000001000001111101010111111101011000011101011101000001010101111100110111000101000110011101001101000001000001111101010111111101011000011101011101000001000111011101001101000011000110011101001101000001000001111101010111111101011000011101011101000001010101111100110111000101000110011101001101000001000001111101010111111101011000011101011101000001000111001011110 e9a18ce9a083eafeb0eba0abe6e28ce9a083eafeb0eba08ee9a18ce9a083eafeb0eba0abe6e28ce9a083eafeb0eba08e5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)