To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 異?障?????啼?虞?異?障?????啼?虞?^ 100010001101100100111111100011111110000100111111001111110011111100111111001111111001101001100101001111111000101111110001001111111000100011011001001111111000111111100001001111110011111100111111001111110011111110011010011001010011111110001011111100010011111101011110 88d93f8fe13f3f3f3f3f9a653f8bf13f88d93f8fe13f3f3f3f3f9a653f8bf13f5e
EUC-JP 異?障?勖???啼?虞?異?障?勖???啼?虞?^ 10110000110110110011111110111110111000110011111110001111101100111110110100111111001111110011111111010011110001100011111110110110111100110011111110110000110110110011111110111110111000110011111110001111101100111110110100111111001111110011111111010011110001100011111110110110111100110011111101011110 b0db3fbee33f8fb3ed3f3f3fd3c63fb6f33fb0db3fbee33f8fb3ed3f3f3fd3c63fb6f33f5e
UTF-8 異렔障렚勖쾨렕렟啼렮虞렧異렔障렚勖쾨렕렟啼렮虞렧^ 11100111100101011011000011101011101000001001010011101001100110101001110011101011101000001001101011100101100010111001011011101100101111101010100011101011101000001001010111101011101000001001111111100101100101011011110011101011101000001010111011101000100110011001111011101011101000001010011111100111100101011011000011101011101000001001010011101001100110101001110011101011101000001001101011100101100010111001011011101100101111101010100011101011101000001001010111101011101000001001111111100101100101011011110011101011101000001010111011101000100110011001111011101011101000001010011101011110 e795b0eba094e99a9ceba09ae58b96ecbea8eba095eba09fe595bceba0aee8999eeba0a7e795b0eba094e99a9ceba09ae58b96ecbea8eba095eba09fe595bceba0aee8999eeba0a75e
UHC 異렔障렚勖쾨렕렟啼렮虞렧異렔障렚勖쾨렕렟啼렮虞렧^ 11101100101101101000111010101001111011101010000110001110101011011110100111101101110001001110101010001110101010101000111010110000111100001010011010001110101110111110100111100101100011101011011011101100101101101000111010101001111011101010000110001110101011011110100111101101110001001110101010001110101010101000111010110000111100001010011010001110101110111110100111100101100011101011011001011110 ecb68ea9eea18eade9edc4ea8eaa8eb0f0a68ebbe9e58eb6ecb68ea9eea18eade9edc4ea8eaa8eb0f0a68ebbe9e58eb65e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)