To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 語??誼??袁⑥???‥揄??受??亦??違? 1000110011101010001111110011111110001011011000100011111100111111111001011100110110000111010001010011111100111111001111111000000101100100100111011000100100111111001111111000111011110011001111110011111110010110100100100011111100111111100010001110000100111111 8cea3f3f8b623f3fe5cd87453f3f3f81649d893f3f8ef33f3f96923f3f88e13f
EUC-JP 語??誼??袁????‥揄??受??亦??違? 10111000111011000011111100111111101101011100001100111111001111111110101011001111001111110011111100111111001111111010000111000101110110011110100100111111001111111011110011110101001111110011111111001011111100100011111100111111101100001110001100111111 b8ec3f3fb5c33f3feacf3f3f3f3fa1c5d9e93f3fbcf53f3fcbf23f3fb0e33f
UTF-8 語륁뼔誼⒴짃袁⑥몱廬믩‥揄뗰쭓受쇨샹亦밸갇違뷓 111010001010101010011110111010111010010110000001111010111011110010010100111010001010101010111100111000101001001010110100111011001010011110000011111010001010001010000001111000101001000110100101111010111010101010110001111011111010011010000010111010111010111110101001111000101000000010100101111001101000111110000100111010111001011110110000111011001010110110010011111001011000111110010111111011001000011110101000111011001000001110111001111001001011101010100110111010111011000010111000111010101011000010000111111010011000000110010101111010111011011110010011 e8aa9eeba581ebbc94e8aabce292b4eca783e8a281e291a5ebaab1efa682ebafa9e280a5e68f84eb97b0ecad93e58f97ec87a8ec83b9e4baa6ebb0b8eab087e98195ebb793
UHC 語륁뼔誼⒴짃袁⑥몱廬믩‥揄뗰쭓受쇨샹亦밸갇違뷓 11100101110111101000111111101100100101101001110011101011111111101010100111100101101000111001001111101010101111101010100011101100100100011001101011100101111111101001001011101011101000011010010111101010111100011000101111101111101001111000101111100001111101001011110011101010101111001010011111100110101100101011100111101011101100001010010011101010110111101001010101000010 e5de8fec969cebfea9e5a393eabea8ec919ae5fe92eba1a5eaf18befa78be1f4bceabca7e6b2b9ebb0a4eade9542

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)