What bit string is a character (or string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ?????劑?錚???諸勘???肯?旭? 0011111100111111001111110011111100111111100110011001110100111111111010000100001000111111001111110011111110001111100101001000101010101000001111110011111100111111100011010110110100111111100010001010111000111111 3f3f3f3f3f999d3fe8423f3f3f8f948aa83f3f3f8d6d3f88ae3f
EUC-JP 焌????劑?錚???諸勘???肯?旭? 10001111110010011110100000111111001111110011111100111111110100011111110100111111111011111010001100111111001111110011111110111101111101001011010010101010001111110011111100111111101110011100111000111111101100001011000000111111 8fc9e83f3f3f3fd1fd3fefa33f3f3fbdf4b4aa3f3f3fb9ce3fb0b03f
UTF-8 焌셍렟熉렟劑렫錚漏렋렟諸勘렟렫롈肯짇旭렎 111001111000010010001100111011001000010110001101111010111010000010011111111001111000011010001001111010111010000010011111111001011000101010010001111010111010000010101011111010011000110010011010111011111010010110001110111010111010000010001011111010111010000010011111111010001010101110111000111001011000101110011000111010111010000010011111111010111010000010101011111010111010000110001000111010001000001010101111111011001010011110000111111001101001011110101101111010111010000010001110 e7848cec858deba09fe78689eba09fe58a91eba0abe98c9aefa58eeba08beba09fe8abb8e58b98eba09feba0abeba188e882afeca787e697adeba08e
UHC 焌셍렟熉렟劑렫錚漏렋렟諸勘렟렫롈肯짇旭렎 11110001111000001011110011000100100011101011000011101001111110111000111010110000111100001010010110001110101110011110111010110110110100101110100010001110101000101000111010110000111100001011001111001010111010111000111010110000100011101011100110001110110011101101000011101001110000011111100111101001111011111000111010100100 f1e0bcc48eb0e9fb8eb0f0a58eb9eeb6d2e88ea28eb0f0b3caeb8eb08eb98eced0e9c1f9e9ef8ea4

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
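
The same kind of conversion can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The mapping from the charset labels to Python codec names (`latin-1`, `cp932`, `euc_jp`, `utf-8`, `cp949`) is an assumption, and `to_bit_strings` is a hypothetical helper name. Characters that a charset cannot represent are replaced with `?` (byte `3f`), which is consistent with the `3f` runs in the table, although the page's actual substitution rules may differ.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings.

    Unencodable characters are replaced with '?' (0x3f) instead of raising.
    """
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


if __name__ == "__main__":
    sample = "\u3042"  # HIRAGANA LETTER A, used here only as sample input
    for label, codec in CHARSETS.items():
        binary, hexadecimal = to_bit_strings(sample, codec)
        print(label, binary, hexadecimal)
```

Because the helper encodes the whole string at once, multi-character input yields one concatenated bit string per charset, as in the table rows.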