To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ?????????吟щ?佯??壹?????B 0011111100111111001111110011111100111111001111110011111100111111001111111000101111100001100001001000101100111111100110001101000100111111001111111001101011100011001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f8be1848b3f98d13f3f9ae33f3f3f3f3f42
EUC-JP ??????佾??吟щ?佯??壹?????B 00111111001111110011111100111111001111110011111110001111101100001111101100111111001111111011011011100011101001111110101100111111110100001101001100111111001111111101010011100101001111110011111100111111001111110011111101000010 3f3f3f3f3f3f8fb0fb3f3fb6e3a7eb3fd0d33f3fd4e53f3f3f3f3f42
UTF-8 琉뜹쓮療명뀞佾섇퉬吟щ쨽佯얜씡壹욱녆亮꾩꺃B 111011111010011110001100111010111001110010111001111011001001001110101110111011111010011110000001111010111010101010000101111010111000000010011110111001001011110110111110111011001000010010000111111011011000100110101100111001011001000010011111110100011000100111101100101010001011110111100100101111011010111111101100100101101001110011101100100101001010000111100101101000111011100111101100100110101011000111101011100001011000011011101111101001011011011111101010101111101010100111101010101110101000001101000010 efa78ceb9cb9ec93aeefa781ebaa85eb809ee4bdbeec8487ed89ace5909fd189eca8bde4bdafec969cec94a1e5a3b9ec9ab1eb8586efa5b7eabea9eaba8342
UHC 琉뜹쓮療명뀞佾섇퉬吟щ쨽佯얜씡壹욱녆亮꾩꺃B 11101011101001001011011011100101100111011000111011101000111111101011100011101101100001011001010111101100111010111001100011100101101110011000010011101011111000011010110011101011101001001001011111100101101110101011111011101011100111011011010111101100111011001011111111101101100001101011110111100101101110011000010011101100100000111010110001000010 eba4b6e59d8ee8feb8ed8595eceb98e5b984ebe1aceba497e5babeeb9db5ececbfed86bde5b984ec83ac42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)