To what bit string is a character (or a sequence of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ???鍮??罐徇??銀??檍??宥ゆ?音??B 00111111001111110011111111101000010010100011111100111111111000111010001110011100011011010011111100111111100010111110001000111111001111111001111011111000001111110011111110010111010001111000001011100100001111111000100110111001001111110011111101000010 3f3f3fe84a3f3fe3a39c6d3f3f8be23f3f9ef83f3f974782e43f89b93f3f42
EUC-JP ???鍮??罐徇??銀??檍?ħ宥ゆ?音??B 001111110011111100111111111011111010101100111111001111111110011010100101110101111100111000111111001111111011011011100100001111110011111111011100111110100011111110001111101010011100010011001101101010001010010011100110001111111011001010111011001111110011111101000010 3f3f3fefab3f3fe6a5d7ce3f3fb6e43f3fdcfa3f8fa9c4cda8a4e63fb2bb3f3f42
UTF-8 略노쵐鍮뽪씭罐徇쒏룚銀ㅽ벃檍용ħ宥ゆ쾬音깅룇B 111011111010010110110110111010111000010110111000111011001011010110010000111010011000110110101110111010111011110110101010111011001001010010101101111001111011110110010000111001011011111010000111111011001001001010001111111010111010001110011010111010011000101010000000111000111000010110111101111010111011001010000011111001101010101010001101111011001001101010101001110001001010011111100101101011101010010111100011100000101000011011101100101111101010110011101001100111111011001111101010101110011000010111101011101000111000011101000010 efa5b6eb85b8ecb590e98daeebbdaaec94ade7bd90e5be87ec928feba39ae98a80e385bdebb283e6aa8dec9aa9c4a7e5aea5e38286ecbeace99fb3eab985eba38742
UHC 略노쵐鍮뽪씭罐徇쒏룚銀ㅽ벃檍용ħ宥ゆ쾬音깅룇B 111001011011001010110011111010111010110010010010111010111011100110010110111001101001110110111110110011101011100011100010110111111001110011100110100011111001011011101011110111101010010011101101100100111010100111100101111001011011111111101011101010011010010011101010111010011010101011100110101100101000001111101011111001011011000111101011100011111000011001000010 e5b2b3ebac92ebb996e69dbeceb8e2df9ce68f96ebdea4ed93a9e5e5bfeba9a4eae9aae6b283ebe5b1eb8f8642

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
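
The kind of conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the converter's actual implementation, which is not shown on this page. The names `CHARSETS`, `encode_to_bits`, and `convert` are hypothetical, and the mapping of the table's labels to Python codec names (in particular UHC to `cp949`) is an assumption. Characters that a charset cannot represent are replaced with "?" (0x3f), which resembles the runs of 3f in the table.

```python
# Assumed mapping from the labels used in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # the page states SJIS-Win = CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC treated as code page 949
}


def encode_to_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes b"?" for characters the codec cannot encode.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


def convert(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_to_bits(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("ゆB").items():
        print(label, binary, hexadecimal)
```

For the sample input "ゆB", the hexadecimal column comes out as 82e442 for CP932, a4e642 for EUC-JP, and e3828642 for UTF-8, which agrees with the bytes shown for "ゆ" and "B" in the table above. ISO-8859-1 cannot represent "ゆ", so it yields 3f42.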