To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?R?^f?R?^^}Y?R?^f?R?^^}bE 00111111010100100011111101011110011001100011111101010010001111110101111001011110011111010101100100111111010100100011111101011110011001100011111101010010001111110101111001011110011111010110001001000101 3f523f5e663f523f5e5e7d593f523f5e663f523f5e5e7d6245
SJIS-WIN 般R般^f般R般^^}Y般R般^f般R般^^}bE 100101001100101001010010100101001100101001011110011001101001010011001010010100101001010011001010010111100101111001111101010110011001010011001010010100101001010011001010010111100110011010010100110010100101001010010100110010100101111001011110011111010110001001000101 94ca5294ca5e6694ca5294ca5e5e7d5994ca5294ca5e6694ca5294ca5e5e7d6245
EUC-JP 般R般^f般R般^^}Y般R般^f般R般^^}bE 110010001100110001010010110010001100110001011110011001101100100011001100010100101100100011001100010111100101111001111101010110011100100011001100010100101100100011001100010111100110011011001000110011000101001011001000110011000101111001011110011111010110001001000101 c8cc52c8cc5e66c8cc52c8cc5e5e7d59c8cc52c8cc5e66c8cc52c8cc5e5e7d6245
UTF-8 般R般^f般R般^^}Y般R般^f般R般^^}bE 1110100010001000101011000101001011101000100010001010110001011110011001101110100010001000101011000101001011101000100010001010110001011110010111100111110101011001111010001000100010101100010100101110100010001000101011000101111001100110111010001000100010101100010100101110100010001000101011000101111001011110011111010110001001000101 e888ac52e888ac5e66e888ac52e888ac5e5e7d59e888ac52e888ac5e66e888ac52e888ac5e5e7d6245
UHC 般R般^f般R般^^}Y般R般^f般R般^^}bE 110110101111010101010010110110101111010101011110011001101101101011110101010100101101101011110101010111100101111001111101010110011101101011110101010100101101101011110101010111100110011011011010111101010101001011011010111101010101111001011110011111010110001001000101 daf552daf55e66daf552daf55e5e7d59daf552daf55e66daf552daf55e5e7d6245

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)