To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??R??^f??R??^^}Y??R??^f??R??^^}bE 001111110011111101010010001111110011111101011110011001100011111100111111010100100011111100111111010111100101111001111101010110010011111100111111010100100011111100111111010111100110011000111111001111110101001000111111001111110101111001011110011111010110001001000101 3f3f523f3f5e663f3f523f3f5e5e7d593f3f523f3f5e663f3f523f3f5e5e7d6245
SJIS-WIN 癌癌R癌癌^f癌癌R癌癌^^}Y癌癌R癌癌^f癌癌R癌癌^^}bE 10001010111000001000101011100000010100101000101011100000100010101110000001011110011001101000101011100000100010101110000001010010100010101110000010001010111000000101111001011110011111010101100110001010111000001000101011100000010100101000101011100000100010101110000001011110011001101000101011100000100010101110000001010010100010101110000010001010111000000101111001011110011111010110001001000101 8ae08ae0528ae08ae05e668ae08ae0528ae08ae05e5e7d598ae08ae0528ae08ae05e668ae08ae0528ae08ae05e5e7d6245
EUC-JP 癌癌R癌癌^f癌癌R癌癌^^}Y癌癌R癌癌^f癌癌R癌癌^^}bE 10110100111000101011010011100010010100101011010011100010101101001110001001011110011001101011010011100010101101001110001001010010101101001110001010110100111000100101111001011110011111010101100110110100111000101011010011100010010100101011010011100010101101001110001001011110011001101011010011100010101101001110001001010010101101001110001010110100111000100101111001011110011111010110001001000101 b4e2b4e252b4e2b4e25e66b4e2b4e252b4e2b4e25e5e7d59b4e2b4e252b4e2b4e25e66b4e2b4e252b4e2b4e25e5e7d6245
UTF-8 癌癌R癌癌^f癌癌R癌癌^^}Y癌癌R癌癌^f癌癌R癌癌^^}bE 1110011110011001100011001110011110011001100011000101001011100111100110011000110011100111100110011000110001011110011001101110011110011001100011001110011110011001100011000101001011100111100110011000110011100111100110011000110001011110010111100111110101011001111001111001100110001100111001111001100110001100010100101110011110011001100011001110011110011001100011000101111001100110111001111001100110001100111001111001100110001100010100101110011110011001100011001110011110011001100011000101111001011110011111010110001001000101 e7998ce7998c52e7998ce7998c5e66e7998ce7998c52e7998ce7998c5e5e7d59e7998ce7998c52e7998ce7998c5e66e7998ce7998c52e7998ce7998c5e5e7d6245
UHC 癌癌R癌癌^f癌癌R癌癌^^}Y癌癌R癌癌^f癌癌R癌癌^^}bE 11100100110111111110010011011111010100101110010011011111111001001101111101011110011001101110010011011111111001001101111101010010111001001101111111100100110111110101111001011110011111010101100111100100110111111110010011011111010100101110010011011111111001001101111101011110011001101110010011011111111001001101111101010010111001001101111111100100110111110101111001011110011111010110001001000101 e4dfe4df52e4dfe4df5e66e4dfe4df52e4dfe4df5e5e7d59e4dfe4df52e4dfe4df5e66e4dfe4df52e4dfe4df5e5e7d6245

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)