To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????S?????????IB 001111110011111100111111001111110011111100111111001111110011111100111111010100110011111100111111001111110011111100111111001111110011111100111111001111110100100101000010 3f3f3f3f3f3f3f3f3f533f3f3f3f3f3f3f3f3f4942
SJIS-WIN 陋帷鮒闢夂弗迴手弊S陋帷鮒闢夂弗迴手弊IB 111010001001101110011011111001111001010110101001111010001001001110011010111001111001010110100100111001111000111110001110111010001001010110111110010100111110100010011011100110111110011110010101101010011110100010010011100110101110011110010101101001001110011110001111100011101110100010010101101111100100100101000010 e89b9be795a9e8939ae795a4e78f8ee895be53e89b9be795a9e8939ae795a4e78f8ee895be4942
EUC-JP 陋帷鮒闢夂弗迴手弊S陋帷鮒闢夂弗迴手弊IB 111011111111101111010110111010011100101010101011111011111111001111010100111010011100101010100110111011011110111110111100111010101100101011000000010100111110111111111011110101101110100111001010101010111110111111110011110101001110100111001010101001101110110111101111101111001110101011001010110000000100100101000010 effbd6e9caabeff3d4e9caa6edefbceacac053effbd6e9caabeff3d4e9caa6edefbceacac04942
UTF-8 陋帷鮒闢夂弗迴手弊S陋帷鮒闢夂弗迴手弊IB 111010011001100110001011111001011011100010110111111010011010111010010010111010011001011110100010111001011010010010000010111001011011110010010111111010001011111110110100111001101000100110001011111001011011110010001010010100111110100110011001100010111110010110111000101101111110100110101110100100101110100110010111101000101110010110100100100000101110010110111100100101111110100010111111101101001110011010001001100010111110010110111100100010100100100101000010 e9998be5b8b7e9ae92e997a2e5a482e5bc97e8bfb4e6898be5bc8a53e9998be5b8b7e9ae92e997a2e5a482e5bc97e8bfb4e6898be5bc8a4942
UHC 陋??闢?弗?手弊S陋??闢?弗?手弊IB 11010111101100000011111100111111110111001010001100111111110111011101011100111111111000101010001011111000110010010101001111010111101100000011111100111111110111001010001100111111110111011101011100111111111000101010001011111000110010010100100101000010 d7b03f3fdca33fddd73fe2a2f8c953d7b03f3fdca33fddd73fe2a2f8c94942

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)