To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 陋帷鮒闢夂弗迴守鮒陋帷鮒闢夂弗迴手鴇陋帷鮒 111010001001101110011011111001111001010110101001111010001001001110011010111001111001010110100100111001111000111110001110111001111001010110101001111010001001101110011011111001111001010110101001111010001001001110011010111001111001010110100100111001111000111110001110111010001001001110111100111010001001101110011011111001111001010110101001 e89b9be795a9e8939ae795a4e78f8ee795a9e89b9be795a9e8939ae795a4e78f8ee893bce89b9be795a9
EUC-JP 陋帷鮒闢夂弗迴守鮒陋帷鮒闢夂弗迴手鴇陋帷鮒 111011111111101111010110111010011100101010101011111011111111001111010100111010011100101010100110111011011110111110111100111010011100101010101011111011111111101111010110111010011100101010101011111011111111001111010100111010011100101010100110111011011110111110111100111010101100011010111110111011111111101111010110111010011100101010101011 effbd6e9caabeff3d4e9caa6edefbce9caabeffbd6e9caabeff3d4e9caa6edefbceac6beeffbd6e9caab
UTF-8 陋帷鮒闢夂弗迴守鮒陋帷鮒闢夂弗迴手鴇陋帷鮒 111010011001100110001011111001011011100010110111111010011010111010010010111010011001011110100010111001011010010010000010111001011011110010010111111010001011111110110100111001011010111010001000111010011010111010010010111010011001100110001011111001011011100010110111111010011010111010010010111010011001011110100010111001011010010010000010111001011011110010010111111010001011111110110100111001101000100110001011111010011011010010000111111010011001100110001011111001011011100010110111111010011010111010010010 e9998be5b8b7e9ae92e997a2e5a482e5bc97e8bfb4e5ae88e9ae92e9998be5b8b7e9ae92e997a2e5a482e5bc97e8bfb4e6898be9b487e9998be5b8b7e9ae92
UHC 陋??闢?弗?守?陋??闢?弗?手?陋?? 110101111011000000111111001111111101110010100011001111111101110111010111001111111110000111111010001111111101011110110000001111110011111111011100101000110011111111011101110101110011111111100010101000100011111111010111101100000011111100111111 d7b03f3fdca33fddd73fe1fa3fd7b03f3fdca33fddd73fe2a23fd7b03f3f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)