To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?R?^}?R?^u?R?^sK}?R?^}?R?^u?R?^sK{^ 0011111101010010001111110101111001111101001111110101001000111111010111100111010100111111010100100011111101011110011100110100101101111101001111110101001000111111010111100111110100111111010100100011111101011110011101010011111101010010001111110101111001110011010010110111101101011110 3f523f5e7d3f523f5e753f523f5e734b7d3f523f5e7d3f523f5e753f523f5e734b7b5e
SJIS-WIN 癌R癌^}癌R癌^u癌R癌^sK}癌R癌^}癌R癌^u癌R癌^sK{^ 1000101011100000010100101000101011100000010111100111110110001010111000000101001010001010111000000101111001110101100010101110000001010010100010101110000001011110011100110100101101111101100010101110000001010010100010101110000001011110011111011000101011100000010100101000101011100000010111100111010110001010111000000101001010001010111000000101111001110011010010110111101101011110 8ae0528ae05e7d8ae0528ae05e758ae0528ae05e734b7d8ae0528ae05e7d8ae0528ae05e758ae0528ae05e734b7b5e
EUC-JP 癌R癌^}癌R癌^u癌R癌^sK}癌R癌^}癌R癌^u癌R癌^sK{^ 1011010011100010010100101011010011100010010111100111110110110100111000100101001010110100111000100101111001110101101101001110001001010010101101001110001001011110011100110100101101111101101101001110001001010010101101001110001001011110011111011011010011100010010100101011010011100010010111100111010110110100111000100101001010110100111000100101111001110011010010110111101101011110 b4e252b4e25e7db4e252b4e25e75b4e252b4e25e734b7db4e252b4e25e7db4e252b4e25e75b4e252b4e25e734b7b5e
UTF-8 癌R癌^}癌R癌^u癌R癌^sK}癌R癌^}癌R癌^u癌R癌^sK{^ 1110011110011001100011000101001011100111100110011000110001011110011111011110011110011001100011000101001011100111100110011000110001011110011101011110011110011001100011000101001011100111100110011000110001011110011100110100101101111101111001111001100110001100010100101110011110011001100011000101111001111101111001111001100110001100010100101110011110011001100011000101111001110101111001111001100110001100010100101110011110011001100011000101111001110011010010110111101101011110 e7998c52e7998c5e7de7998c52e7998c5e75e7998c52e7998c5e734b7de7998c52e7998c5e7de7998c52e7998c5e75e7998c52e7998c5e734b7b5e
UHC 癌R癌^}癌R癌^u癌R癌^sK}癌R癌^}癌R癌^u癌R癌^sK{^ 1110010011011111010100101110010011011111010111100111110111100100110111110101001011100100110111110101111001110101111001001101111101010010111001001101111101011110011100110100101101111101111001001101111101010010111001001101111101011110011111011110010011011111010100101110010011011111010111100111010111100100110111110101001011100100110111110101111001110011010010110111101101011110 e4df52e4df5e7de4df52e4df5e75e4df52e4df5e734b7de4df52e4df5e7de4df52e4df5e75e4df52e4df5e734b7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)