What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 辱ゅ?榮??円?コ辱ゅ?榮??円??^ 10010000010010101000001011100011001111111001111011000100001111110011111110001001011111100011111110000011010100101001000001001010100000101110001100111111100111101100010000111111001111111000100101111110001111110011111101011110 904a82e33f9ec43f3f897e3f8352904a82e33f9ec43f3f897e3f3f5e
EUC-JP 辱ゅ?榮??円?コ辱ゅ?榮??円??^ 10111111101010111010010011100101001111111101110011000110001111110011111110110001110111110011111110100101101100111011111110101011101001001110010100111111110111001100011000111111001111111011000111011111001111110011111101011110 bfaba4e53fdcc63f3fb1df3fa5b3bfaba4e53fdcc63f3fb1df3f3f5e
UTF-8 辱ゅ츒榮쀯슈円녘コ辱ゅ츒榮쀯슈円녘쳷^ 11101000101111101011000111100011100000101000010111101100101110001001001011100110101001101010111011101100100000001010111111101100100010101000100011100101100001101000011011101011100001011001100011100011100000101011001111101000101111101011000111100011100000101000010111101100101110001001001011100110101001101010111011101100100000001010111111101100100010101000100011100101100001101000011011101011100001011001100011101100101100111011011101011110 e8beb1e38285ecb892e6a6aeec80afec8a88e58686eb8598e382b3e8beb1e38285ecb892e6a6aeec80afec8a88e58686eb8598ecb3b75e
UHC 辱ゅ츒榮쀯슈円녘コ辱ゅ츒榮쀯슈円녘쳷^ 11101001101101001010101011100101101011101000110111100111101101001001011111101111101111011011010011100101111101111011001111101000101010111011001111101001101101001010101011100101101011101000110111100111101101001001011111101111101111011011010011100101111101111011001111101000101010111001101001011110 e9b4aae5ae8de7b497efbdb4e5f7b3e8abb3e9b4aae5ae8de7b497efbdb4e5f7b3e8ab9a5e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
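
A similar table can be produced offline. The following is a minimal sketch in Python, not the code behind this page. The mapping from the charset labels above to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) is an assumption, and the two may differ for a few vendor-specific characters. The helper names encode_row and encode_table are hypothetical. Characters that a charset cannot represent are replaced with "?" (0x3f), which appears consistent with the 3f bytes in the table.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hexstr) in encode_table("コ^").items():
        print(label, bits, hexstr)
```

Each row prints the charset label, the bit string, and the hexadecimal form, in the same column order as the table above.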