To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 輿??援??碎⑥サ語η?諛??奄???δ? 10010111011000000011111100111111100010011000011100111111001111111110000111101010100001110100010110000011010101001000110011101010100000111100010100111111111001101000011100111111001111111000100110000010001111110011111100111111100000111100001000111111 97603f3f89873f3fe1ea874583548cea83c53fe6873f3f89823f3f3f83c23f
EUC-JP 輿??援??碎?サ語η?諛??奄??堉δ? 1100110111000001001111110011111110110001111001110011111100111111111000101110110000111111101001011011010110111000111011001010011011000111001111111110101111100111001111110011111110110001111000100011111100111111100011111011011111111101101001101100010000111111 cdc13f3fb1e73f3fe2ec3fa5b5b8eca6c73febe73f3fb1e23f3f8fb7fda6c43f
UTF-8 輿삳뿣援좑쭏碎⑥サ語η럤諛㏃꽑奄멸랜堉δ벧 11101000101111001011111111101100100000101011001111101011101111111010001111100110100011111011010011101100101000101001000111101100101011011000111111100111101000101000111011100010100100011010010111100011100000101011010111101000101010101001111011001110101101111110101110011111101001001110100010101011100110111110001110001111100000111110101010111101100100011110010110100101100001001110101110101001101110001110101110011110100111001110010110100000100010011100111010110100111010111011001010100111 e8bcbfec82b3ebbfa3e68fb4eca291ecad8fe7a28ee291a5e382b5e8aa9eceb7eb9fa4e8ab9be38f83eabd91e5a584eba9b8eb9e9ce5a089ceb4ebb2a7
UHC 輿삳뿣援좑쭏碎⑥サ語η럤諛㏃꽑奄멸랜堉δ벧 111001101010101110111011111010111001011110100011111010101011010110100000111011111010011110001000111000011110111110101000111011001010101110110101111001011101111010100101111001111000111010000111111010111011000010100111111011001000010010100000111001011111001010111000111010101011011110100011111010111011110010100101111001001011101010100110 e6abbbeb97a3eab5a0efa788e1efa8ecabb5e5dea5e78e87ebb0a7ec84a0e5f2b8eab7a3ebbca5e4baa6

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)