What bit string is a character (or short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 鴦??幽??????ゅ?蹂≪?筌??誼? 111010011111000100111111001111111001011101001000001111110011111100111111001111110011111100111111100000101110001100111111111001101111100010000001111000010011111111100010101000110011111100111111100010110110001000111111 e9f13f3f97483f3f3f3f3f3f82e33fe6f881e13fe2a33f3f8b623f
EUC-JP 鴦??幽?????沅ゅ?蹂≪?筌??誼? 1111001011110011001111110011111111001101101010010011111100111111001111110011111100111111100011111100011011101001101001001110010100111111111011001111101010100010111000110011111111100100101001010011111100111111101101011100001100111111 f2f33f3fcda93f3f3f3f3f8fc6e9a4e53fecfaa2e33fe4a53f3fb5c33f
UTF-8 鴦꾨땶幽됰쳥樂낅슣沅ゅ퐲蹂≪쪠筌뤾쑴誼딞 111010011011010010100110111010101011111010101000111010111001010110110110111001011011100110111101111010111001000010110000111011001011001110100101111011111010011010111111111010111000001010000101111011001000101010100011111001101011001010000101111000111000001010000101111011011001000010110010111010001011100110000010111000101000100110101010111011001010101010100000111001111010110110001100111010111010010010111110111011001001000110110100111010001010101010111100111010111001010010011110 e9b4a6eabea8eb95b6e5b9bdeb90b0ecb3a5efa6bfeb8285ec8aa3e6b285e38285ed90b2e8b982e289aaecaaa0e7ad8ceba4beec91b4e8aabceb949e
UHC 鴦꾨땶幽됰쳥樂낅슣沅ゅ퐲蹂≪쪠筌뤾쑴誼딞 11100100111011001000010011101011100010111000110011101010111010111000100111101011101010111000101011101000111110011000010111101011100110101010111111101010101101101010101011100101101111011001101111101011101100111010000111101100101001011001100111101111101001111000111111101010101111101010100111101011111111101000101101000001 e4ec84eb8b8ceaeb89ebab8ae8f985eb9aafeab6aae5bd9bebb3a1eca599efa78feabea9ebfe8b41

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
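
The conversion in the table can be approximated offline with a short Python sketch. This is not the page's own implementation: the mapping from the table's charset labels to Python codec names (for example `cp932` for SJIS-WIN and `cp949` for UHC) and the helper names `to_bit_strings` and `convert` are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the `3f` bytes in the table, but the results for other characters may differ from the page's converter.

```python
# Charset labels follow the table above; the Python codec names on the
# right are this sketch's assumption about the closest standard codec.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with codec and return (binary string, hex string)."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


def convert(text):
    """Return {charset label: (binary, hex)} for every charset in CODECS."""
    return {label: to_bit_strings(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (binary, hexa) in convert("あ").items():
        print(label, binary, hexa)
```

For example, `to_bit_strings("あ", "utf-8")` returns the 24-bit string for the bytes `e3 81 82`, while `to_bit_strings("あ", "iso-8859-1")` returns `("00111111", "3f")` because ISO-8859-1 has no code for that character.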