To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 窈?????縊i????窈?????縊i????B 11100010011101110011111100111111001111110011111100111111111000110110111110000010100010010011111100111111001111110011111111100010011101110011111100111111001111110011111100111111111000110110111110000010100010010011111100111111001111110011111101000010 e2773f3f3f3f3fe36f82893f3f3f3fe2773f3f3f3f3fe36f82893f3f3f3f42
EUC-JP 窈?????縊i????窈?????縊i????B 11100011110110000011111100111111001111110011111100111111111001011101000010100011111010010011111100111111001111110011111111100011110110000011111100111111001111110011111100111111111001011101000010100011111010010011111100111111001111110011111101000010 e3d83f3f3f3f3fe5d0a3e93f3f3f3fe3d83f3f3f3f3fe5d0a3e93f3f3f3f42
UTF-8 窈뚩퀌溜곕젵縊i쟽溜곕젵窈뚮㈇溜좊젛縊i쟽溜곕젵B 11100111101010101000100011101011100110101010100111101101100000001000110011101111101001111000101111101010101100111001010111101100101000001011010111100111101110001000101011101111101111011000100111101100100111111011110111101111101001111000101111101010101100111001010111101100101000001011010111100111101010101000100011101011100110101010111011100011100010001000011111101111101001111000101111101100101000101000101011101100101000001001101111100111101110001000101011101111101111011000100111101100100111111011110111101111101001111000101111101010101100111001010111101100101000001011010101000010 e7aa88eb9aa9ed808cefa78beab395eca0b5e7b88aefbd89ec9fbdefa78beab395eca0b5e7aa88eb9aaee38887efa78beca28aeca09be7b88aefbd89ec9fbdefa78beab395eca0b542
UHC 窈뚩퀌溜곕젵縊i쟽溜곕젵窈뚮㈇溜좊젛縊i쟽溜곕젵B 11101001101000011000110011101000101100111000001011101010111111101011000011101011101000001010100111100100111111001010001111101001101000001000001111101010111111101011000011101011101000001010100111101001101000011000110011101011101010011011100011101010111111101010000011101011101000001001011111100100111111001010001111101001101000001000001111101010111111101011000011101011101000001010100101000010 e9a18ce8b382eafeb0eba0a9e4fca3e9a083eafeb0eba0a9e9a18ceba9b8eafea0eba097e4fca3e9a083eafeb0eba0a942

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)