What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????????????????????gB 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110110011101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f6742
SJIS-WIN 馭???筌l?肄∽?弛f?純???淫??筌?ЗgB 111010010110011000111111001111110011111111100010101000111000001010001100001111111110001111100101100000011110010000111111100100100110111110000010100001100011111110001111100000110011111100111111001111111000100011111010001111110011111111100010101000110011111110000100010010000110011101000010 e9663f3f3fe2a3828c3fe3e581e43f926f82863f8f833f3f3f88fa3f3fe2a33f84486742
EUC-JP 馭???筌l?肄∽?弛f?純???淫??筌?ЗgB 111100011100011100111111001111110011111111100100101001011010001111101100001111111110011011100111101000101110011000111111110000111101000010100011111001100011111110111101111000110011111100111111001111111011000011111100001111110011111111100100101001010011111110100111101010010110011101000010 f1c73f3f3fe4a5a3ec3fe6e7a2e63fc3d0a3e63fbde33f3f3fb0fc3f3fe4a53fa7a96742
UTF-8 馭곥룂흟筌l꼷肄∽쭪弛f걖純꺟곻㎗淫륁숯筌욎ЗgB 11101001101001101010110111101010101100111010010111101011101000111000001011101101100111011001111111100111101011011000110011101111101111011000110011101010101111001011011111101000100000101000010011100010100010001011110111101100101011011010101011100101101111001001101111101111101111011000011011101010101100011001011011100111101101001001010011101010101110101001111111101010101100111011101111100011100011101001011111100110101101111010101111101011101001011000000111101100100010001010111111100111101011011000110011101100100110101000111011010000100101110110011101000010 e9a6adeab3a5eba382ed9d9fe7ad8cefbd8ceabcb7e88284e288bdecadaae5bc9befbd86eab196e7b494eaba9feab3bbe38e97e6b7abeba581ec88afe7ad8cec9a8ed0976742
UHC 馭곥룂흟筌l꼷肄∽쭪弛f걖純꺟곻㎗淫륁숯筌욎ЗgB 111001011101111110000001111000111000111110000011110001011000000111101111101001111010001111101100100001001000111111101100101111011010000111101111101001111001111011101100101011001010001111100110100000011000000111100010111011011000001111000101100000011110111110100111101000111110101111100010100011111110110010111101101000011110111110100111100111101110110010101100101010010110011101000010 e5df81e38f83c581efa7a3ec848fecbda1efa79eecaca3e68181e2ed83c581efa7a3ebe28fecbda1efa79eecaca96742
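The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the page's actual implementation: the mapping from the page's charset labels to Python codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) is an assumption, and the helper names `encode_row` and `encode_table` are hypothetical. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the runs of `3f` in the table; individual byte values may differ from the page for characters where the codec tables disagree.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters are replaced with '?' (byte 0x3f).
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS (hypothetical helper)."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    # Print one row per charset, in the same column order as the table.
    for label, (bits, hex_string) in encode_table("gB").items():
        print(label, "gB", bits, hex_string)
```

For ASCII input such as `gB`, every charset in the list produces the same two bytes, `67 42`, which is why each row of the table ends in `6742`.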

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).