To what bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 嚥??巍??葉??凝?????宥??厭??^ 10011010100010110011111100111111100110111101100100111111001111111001011101110100001111110011111110001011110000110011111100111111001111110011111100111111100101110100011100111111001111111000100101111101001111110011111101011110 9a8b3f3f9bd93f3f97743f3f8bc33f3f3f3f3f97473f3f897d3f3f5e
EUC-JP 嚥??巍??葉??凝?????宥??厭??^ 11010011111010110011111100111111110101101101101100111111001111111100110111010101001111110011111110110110110001010011111100111111001111110011111100111111110011011010100000111111001111111011000111011110001111110011111101011110 d3eb3f3fd6db3f3fcdd53f3fb6c53f3f3f3f3fcda83f3fb1de3f3f5e
UTF-8 嚥좊젙巍띾떩葉띄뼇凝ⓩ퓼溜뺡냽宥사㉥厭묒뙟^ 11100101100110101010010111101100101000101000101011101100101000001001100111100101101101111000110111101011100111011011111011101011100101101010100111101000100100011000100111101011100111011000010011101011101111001000011111100101100001111001110111100010100100111010100111101101100100111011110011101111101001111000101111101011101110101010000111101011100000111011110111100101101011101010010111101100100000101010110011100011100010011010010111100101100011101010110111101011101011001001001011101011100110011001111101011110 e59aa5eca28aeca099e5b78deb9dbeeb96a9e89189eb9d84ebbc87e5879de293a9ed93bcefa78bebbaa1eb83bde5aea5ec82ace389a5e58eadebac92eb999f5e
UHC 嚥좊젙巍띾떩葉띄뼇凝ⓩ퓼溜뺡냽宥사㉥厭묒뙟^ 11100110101111111010000011101011101000001001010111101000111001001000110111101011100010111011101111100111101010001011011011100111100101101001000111101011111010101010100011100110101111111010000011101010111111101001010111101001100001101000110111101010111010011011101111100111101010001011011011100110111101001001000111101100100011001010010001011110 e6bfa0eba095e8e48deb8bbbe7a8b6e79691ebeaa8e6bfa0eafe95e9868deae9bbe7a8b6e6f491ec8ca45e

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
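
The conversion in the table above can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) and the helper names `encode_row` and `convert` are assumptions made for illustration. It also assumes that characters a charset cannot represent are replaced with "?" (0x3f), which would be consistent with the runs of 3f in the rows above.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary, hexadecimal) bit strings for text in the given codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


def convert(text):
    """Encode text in every charset of CHARSETS, keyed by charset label."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (binary, hexadecimal) in convert("あ^").items():
        print(name, binary, hexadecimal)
```

Python's codecs may differ from the page's converter in edge cases (for example, vendor-specific extensions to CP932), so the output should be treated as a cross-check and not as a guaranteed byte-for-byte match.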