To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?n?nf?n?n^}Y?n?nf?n?n^}bE 00111111011011100011111101101110011001100011111101101110001111110110111001011110011111010101100100111111011011100011111101101110011001100011111101101110001111110110111001011110011111010110001001000101 3f6e3f6e663f6e3f6e5e7d593f6e3f6e663f6e3f6e5e7d6245
SJIS-WIN 痔n痔nf痔n痔n^}Y痔n痔nf痔n痔n^}bE 100011101010010001101110100011101010010001101110011001101000111010100100011011101000111010100100011011100101111001111101010110011000111010100100011011101000111010100100011011100110011010001110101001000110111010001110101001000110111001011110011111010110001001000101 8ea46e8ea46e668ea46e8ea46e5e7d598ea46e8ea46e668ea46e8ea46e5e7d6245
EUC-JP 痔n痔nf痔n痔n^}Y痔n痔nf痔n痔n^}bE 101111001010011001101110101111001010011001101110011001101011110010100110011011101011110010100110011011100101111001111101010110011011110010100110011011101011110010100110011011100110011010111100101001100110111010111100101001100110111001011110011111010110001001000101 bca66ebca66e66bca66ebca66e5e7d59bca66ebca66e66bca66ebca66e5e7d6245
UTF-8 痔n痔nf痔n痔n^}Y痔n痔nf痔n痔n^}bE 1110011110010111100101000110111011100111100101111001010001101110011001101110011110010111100101000110111011100111100101111001010001101110010111100111110101011001111001111001011110010100011011101110011110010111100101000110111001100110111001111001011110010100011011101110011110010111100101000110111001011110011111010110001001000101 e797946ee797946e66e797946ee797946e5e7d59e797946ee797946e66e797946ee797946e5e7d6245
UHC 痔n痔nf痔n痔n^}Y痔n痔nf痔n痔n^}bE 111101101100000001101110111101101100000001101110011001101111011011000000011011101111011011000000011011100101111001111101010110011111011011000000011011101111011011000000011011100110011011110110110000000110111011110110110000000110111001011110011111010110001001000101 f6c06ef6c06e66f6c06ef6c06e5e7d59f6c06ef6c06e66f6c06ef6c06e5e7d6245

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)