To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???nR???n^[???nR???n^[^ 0011111100111111001111110110111001010010001111110011111100111111011011100101111001011011001111110011111100111111011011100101001000111111001111110011111101101110010111100101101101011110 3f3f3f6e523f3f3f6e5e5b3f3f3f6e523f3f3f6e5e5b5e
SJIS-WIN 霆狡鵡nR霆狡鵡n^[霆狡鵡nR霆狡鵡n^[^ 1110100010111011111000001100001010010110101101110110111001010010111010001011101111100000110000101001011010110111011011100101111001011011111010001011101111100000110000101001011010110111011011100101001011101000101110111110000011000010100101101011011101101110010111100101101101011110 e8bbe0c296b76e52e8bbe0c296b76e5e5be8bbe0c296b76e52e8bbe0c296b76e5e5b5e
EUC-JP 霆狡鵡nR霆狡鵡n^[霆狡鵡nR霆狡鵡n^[^ 1111000010111101111000001100010011001100101110010110111001010010111100001011110111100000110001001100110010111001011011100101111001011011111100001011110111100000110001001100110010111001011011100101001011110000101111011110000011000100110011001011100101101110010111100101101101011110 f0bde0c4ccb96e52f0bde0c4ccb96e5e5bf0bde0c4ccb96e52f0bde0c4ccb96e5e5b5e
UTF-8 霆狡鵡nR霆狡鵡n^[霆狡鵡nR霆狡鵡n^[^ 1110100110011100100001101110011110001011101000011110100110110101101000010110111001010010111010011001110010000110111001111000101110100001111010011011010110100001011011100101111001011011111010011001110010000110111001111000101110100001111010011011010110100001011011100101001011101001100111001000011011100111100010111010000111101001101101011010000101101110010111100101101101011110 e99c86e78ba1e9b5a16e52e99c86e78ba1e9b5a16e5e5be99c86e78ba1e9b5a16e52e99c86e78ba1e9b5a16e5e5b5e
UHC 霆狡鵡nR霆狡鵡n^[霆狡鵡nR霆狡鵡n^[^ 1110111111111101110011101110101011011001111101110110111001010010111011111111110111001110111010101101100111110111011011100101111001011011111011111111110111001110111010101101100111110111011011100101001011101111111111011100111011101010110110011111011101101110010111100101101101011110 effdceead9f76e52effdceead9f76e5e5beffdceead9f76e52effdceead9f76e5e5b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)