To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 窈?????衍?????窈?????衍?????^ 1110001001110111001111110011111100111111001111110011111110011111101001010011111100111111001111110011111100111111111000100111011100111111001111110011111100111111001111111001111110100101001111110011111100111111001111110011111101011110 e2773f3f3f3f3f9fa53f3f3f3f3fe2773f3f3f3f3f9fa53f3f3f3f3f5e
EUC-JP 窈?????衍?????窈?????衍?????^ 1110001111011000001111110011111100111111001111110011111111011110101001110011111100111111001111110011111100111111111000111101100000111111001111110011111100111111001111111101111010100111001111110011111100111111001111110011111101011110 e3d83f3f3f3f3fdea73f3f3f3f3fe3d83f3f3f3f3fdea73f3f3f3f3f5e
UTF-8 窈뚮씟溜곕젶衍뚨텚溜뗥씍窈뚮씟溜곕젶衍뚨텚溜뗩턄^ 11100111101010101000100011101011100110101010111011101100100101001001111111101111101001111000101111101010101100111001010111101100101000001011011011101000101000011000110111101011100110101010100011101101100001011001101011101111101001111000101111101011100101111010010111101100100101001000110111100111101010101000100011101011100110101010111011101100100101001001111111101111101001111000101111101010101100111001010111101100101000001011011011101000101000011000110111101011100110101010100011101101100001011001101011101111101001111000101111101011100101111010100111101101100001001000010001011110 e7aa88eb9aaeec949fefa78beab395eca0b6e8a18deb9aa8ed859aefa78beb97a5ec948de7aa88eb9aaeec949fefa78beab395eca0b6e8a18deb9aa8ed859aefa78beb97a9ed84845e
UHC 窈뚮씟溜곕젶衍뚨텚溜뗥씍窈뚮씟溜곕젶衍뚨텚溜뗩턄^ 11101001101000011000110011101011100111011011001111101010111111101011000011101011101000001010101011100110111000101000110011100111101101101001001111101010111111101000101111100101100111011010010011101001101000011000110011101011100111011011001111101010111111101011000011101011101000001010101011100110111000101000110011100111101101101001001111101010111111101000101111101001101101011010000001011110 e9a18ceb9db3eafeb0eba0aae6e28ce7b693eafe8be59da4e9a18ceb9db3eafeb0eba0aae6e28ce7b693eafe8be9b5a05e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)