To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 嚥??爰??誘λき???萸??兢???銀ы?^ 1001101010001011001111110011111111100000101001110011111100111111100101110101010110000011110010011000001010101011001111110011111100111111111001001100111000111111001111111001100101011101001111110011111100111111100010111110001010000100100011010011111101011110 9a8b3f3fe0a73f3f975583c982ab3f3f3fe4ce3f3f995d3f3f3f8be2848d3f5e
EUC-JP 嚥??爰??誘λき???萸??兢堉??銀ы?^ 11010011111010110011111100111111111000001010100100111111001111111100110110110110101001101100101110100100101011010011111100111111001111111110100011010000001111110011111111010001101111101000111110110111111111010011111100111111101101101110010010100111111011010011111101011110 d3eb3f3fe0a93f3fcdb6a6cba4ad3f3f3fe8d03f3fd1be8fb7fd3f3fb6e4a7ed3f5e
UTF-8 嚥싲갇爰껓쭏誘λき廉띾맟萸먩걗兢堉딉쬉銀ы넺^ 1110010110011010101001011110110010001011101100101110101010110000100001111110011110001000101100001110101010111011100100111110110010101101100011111110100010101010100110001100111010111011111000111000000110001101111011111010011010100010111010111001110110111110111010111010011110011111111010001001000010111000111010111010100010101001111010101011000110010111111001011000010110100010111001011010000010001001111010111001010010001001111011001010110010001001111010011000101010000000110100011000101111101011100001001011101001011110 e59aa5ec8bb2eab087e788b0eabb93ecad8fe8aa98cebbe3818defa6a2eb9dbeeba79fe890b8eba8a9eab197e585a2e5a089eb9489ecac89e98a80d18beb84ba5e
UHC 嚥싲갇爰껓쭏誘λき廉띾맟萸먩걗兢堉딉쬉銀ы넺^ 111001101011111110011010111010111011000010100100111010101011101010000011111011111010011110001000111010111010111110100101111010111010101010101101111001101111010110001101111010111001000010101100111010111010110110010000111001101000000110000010110100001110011111101011101111001000101011101111101001101001111111101011110111101010110011101101100001101011010001011110 e6bf9aebb0a4eaba83efa788ebafa5ebaaade6f58deb90acebad90e68182d0e7ebbc8aefa69febdeaced86b45e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)