To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????nf????n^}Y????nf????n^}bE 0011111100111111001111110011111101101110011001100011111100111111001111110011111101101110010111100111110101011001001111110011111100111111001111110110111001100110001111110011111100111111001111110110111001011110011111010110001001000101 3f3f3f3f6e663f3f3f3f6e5e7d593f3f3f3f6e663f3f3f3f6e5e7d6245
SJIS-WIN 砥?砥?nf砥?砥?n^}Y砥?砥?nf砥?砥?n^}bE 10010011011101010011111110010011011101010011111101101110011001101001001101110101001111111001001101110101001111110110111001011110011111010101100110010011011101010011111110010011011101010011111101101110011001101001001101110101001111111001001101110101001111110110111001011110011111010110001001000101 93753f93753f6e6693753f93753f6e5e7d5993753f93753f6e6693753f93753f6e5e7d6245
EUC-JP 砥?砥?nf砥?砥?n^}Y砥?砥?nf砥?砥?n^}bE 11000101110101100011111111000101110101100011111101101110011001101100010111010110001111111100010111010110001111110110111001011110011111010101100111000101110101100011111111000101110101100011111101101110011001101100010111010110001111111100010111010110001111110110111001011110011111010110001001000101 c5d63fc5d63f6e66c5d63fc5d63f6e5e7d59c5d63fc5d63f6e66c5d63fc5d63f6e5e7d6245
UTF-8 砥렡砥렡nf砥렡砥렡n^}Y砥렡砥렡nf砥렡砥렡n^}bE 11100111101000001010010111101011101000001010000111100111101000001010010111101011101000001010000101101110011001101110011110100000101001011110101110100000101000011110011110100000101001011110101110100000101000010110111001011110011111010101100111100111101000001010010111101011101000001010000111100111101000001010010111101011101000001010000101101110011001101110011110100000101001011110101110100000101000011110011110100000101001011110101110100000101000010110111001011110011111010110001001000101 e7a0a5eba0a1e7a0a5eba0a16e66e7a0a5eba0a1e7a0a5eba0a16e5e7d59e7a0a5eba0a1e7a0a5eba0a16e66e7a0a5eba0a1e7a0a5eba0a16e5e7d6245
UHC 砥렡砥렡nf砥렡砥렡n^}Y砥렡砥렡nf砥렡砥렡n^}bE 111100101011001010001110101100101111001010110010100011101011001001101110011001101111001010110010100011101011001011110010101100101000111010110010011011100101111001111101010110011111001010110010100011101011001011110010101100101000111010110010011011100110011011110010101100101000111010110010111100101011001010001110101100100110111001011110011111010110001001000101 f2b28eb2f2b28eb26e66f2b28eb2f2b28eb26e5e7d59f2b28eb2f2b28eb26e66f2b28eb2f2b28eb26e5e7d6245

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)