To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ??蔚???源叛???蔚???源叛?B 00111111001111111000100101010101001111110011111100111111100011001011100110010100101111100011111100111111001111111000100101010101001111110011111100111111100011001011100110010100101111100011111101000010 3f3f89553f3f3f8cb994be3f3f3f89553f3f3f8cb994be3f42
EUC-JP 焌?蔚???源叛?焌?蔚???源叛?B 1000111111001001111010000011111110110001101101100011111100111111001111111011100010111011110010001100000000111111100011111100100111101000001111111011000110110110001111110011111100111111101110001011101111001000110000000011111101000010 8fc9e83fb1b63f3f3fb8bbc8c03f8fc9e83fb1b63f3f3fb8bbc8c03f42
UTF-8 焌렊蔚목렰렪源叛헹焌렊蔚목렰렪源叛헹B 11100111100001001000110011101011101000001000101011101000100101001001101011101011101010101010100111101011101000001011000011101011101000001010101011100110101110101001000011100101100011111001101111101101100101111011100111100111100001001000110011101011101000001000101011101000100101001001101011101011101010101010100111101011101000001011000011101011101000001010101011100110101110101001000011100101100011111001101111101101100101111011100101000010 e7848ceba08ae8949aebaaa9eba0b0eba0aae6ba90e58f9bed97b9e7848ceba08ae8949aebaaa9eba0b0eba0aae6ba90e58f9bed97b942
UHC 焌렊蔚목렰렪源叛헹焌렊蔚목렰렪源叛헹B 11110001111000001000111010100001111010101010010110111000111100011000111010111101100011101011100011101010101110011101101011100100110001111111001111110001111000001000111010100001111010101010010110111000111100011000111010111101100011101011100011101010101110011101101011100100110001111111001101000010 f1e08ea1eaa5b8f18ebd8eb8eab9dae4c7f3f1e08ea1eaa5b8f18ebd8eb8eab9dae4c7f342

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)