To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 菴殊ソエ蟷エ郢堤イ厭菴殊ソエ蟷エ郢堤イ閲^ 111001001011110110001110111010101011111110110100111001011011100110110100111001111011100110010010111001111011001010001001011111011110010010111101100011101110101010111111101101001110010110111001101101001110011110111001100100101110011110110010100010010111101101011110 e4bd8eeabfb4e5b9b4e7b992e7b2897de4bd8eeabfb4e5b9b4e7b992e7b2897b5e
EUC-JP 菴殊ソエ蟷エ郢堤イ厭菴殊ソエ蟷エ郢堤イ閲^ 1110100010111111101111001110110010001110101111111000111010110100111010101011101110001110101101001110111010111011110001001110100110001110101100101011000111011110111010001011111110111100111011001000111010111111100011101011010011101010101110111000111010110100111011101011101111000100111010011000111010110010101100011101110001011110 e8bfbcec8ebf8eb4eabb8eb4eebbc4e98eb2b1dee8bfbcec8ebf8eb4eabb8eb4eebbc4e98eb2b1dc5e
UTF-8 菴殊ソエ蟷エ郢堤イ厭菴殊ソエ蟷エ郢堤イ閲^ 11101000100011111011010011100110101011101000101011101111101111011011111111101111101111011011010011101000100111111011011111101111101111011011010011101001100000111010001011100101101000001010010011101111101111011011001011100101100011101010110111101000100011111011010011100110101011101000101011101111101111011011111111101111101111011011010011101000100111111011011111101111101111011011010011101001100000111010001011100101101000001010010011101111101111011011001011101001100101101011001001011110 e88fb4e6ae8aefbdbfefbdb4e89fb7efbdb4e983a2e5a0a4efbdb2e58eade88fb4e6ae8aefbdbfefbdb4e89fb7efbdb4e983a2e5a0a4efbdb2e996b25e
UHC 菴殊?????堤?厭菴殊?????堤??^ 11100100111000001110001010101000001111110011111100111111001111110011111111110000101001110011111111100110111101001110010011100000111000101010100000111111001111110011111100111111001111111111000010100111001111110011111101011110 e4e0e2a83f3f3f3f3ff0a73fe6f4e4e0e2a83f3f3f3f3ff0a73f3f5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)