To what bit string is a character (or a short string) encoded in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 雍??巍??葉??凝②????宥??厭??^ 1110100010110100001111110011111110011011110110010011111100111111100101110111010000111111001111111000101111000011100001110100000100111111001111110011111100111111100101110100011100111111001111111000100101111101001111110011111101011110 e8b43f3f9bd93f3f97743f3f8bc387413f3f3f3f97473f3f897d3f3f5e
EUC-JP 雍??巍??葉??凝?????宥??厭??^ 11110000101101100011111100111111110101101101101100111111001111111100110111010101001111110011111110110110110001010011111100111111001111110011111100111111110011011010100000111111001111111011000111011110001111110011111101011110 f0b63f3fd6db3f3fcdd53f3fb6c53f3f3f3f3fcda83f3fb1de3f3f5e
UTF-8 雍됰젙巍띾떩葉띄뼇凝②펹溜뺡냽宥사㉥厭묒뙟^ 11101001100110111000110111101011100100001011000011101100101000001001100111100101101101111000110111101011100111011011111011101011100101101010100111101000100100011000100111101011100111011000010011101011101111001000011111100101100001111001110111100010100100011010000111101101100011101011100111101111101001111000101111101011101110101010000111101011100000111011110111100101101011101010010111101100100000101010110011100011100010011010010111100101100011101010110111101011101011001001001011101011100110011001111101011110 e99b8deb90b0eca099e5b78deb9dbeeb96a9e89189eb9d84ebbc87e5879de291a1ed8eb9efa78bebbaa1eb83bde5aea5ec82ace389a5e58eadebac92eb999f5e
UHC 雍됰젙巍띾떩葉띄뼇凝②펹溜뺡냽宥사㉥厭묒뙟^ 11101000101111001000100111101011101000001001010111101000111001001000110111101011100010111011101111100111101010001011011011100111100101101001000111101011111010101010100011101000101111001000100111101010111111101001010111101001100001101000110111101010111010011011101111100111101010001011011011100110111101001001000111101100100011001010010001011110 e8bc89eba095e8e48deb8bbbe7a8b6e79691ebeaa8e8bc89eafe95e9868deae9bbe7a8b6e6f491ec8ca45e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
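
The conversion shown in the table can be approximated offline with a minimal Python sketch. It is not the code behind this page. The mapping from the table's charset labels to Python codec names (cp932 for SJIS-WIN, euc_jp for EUC-JP, cp949 for UHC) is an assumption, and the helper names encode_bits and convert are hypothetical. Characters that a charset cannot represent are replaced with "?" (byte 3f), which appears consistent with the 3f bytes in the table above, although the page's own substitution rules may differ.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-Win = CP932, as the note says
    "EUC-JP": "euc_jp",    # assumption: plain EUC-JP, not a vendor variant
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC = Windows code page 949
}


def encode_bits(text, codec):
    """Return (binary string, hex string) of text encoded with codec."""
    # Characters the codec cannot represent are replaced with b"?" (0x3f).
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


def convert(text):
    """Encode text in every charset listed in CHARSETS."""
    return {name: encode_bits(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (binary, hexadecimal) in convert("あ^").items():
        print(name, binary, hexadecimal)
```

Running the script prints one line per charset with the binary and hexadecimal forms of the encoded bytes, in the same column order as the table.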