To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ?????????吟щ?鎰??壹?????B 0011111100111111001111110011111100111111001111110011111100111111001111111000101111100001100001001000101100111111111010000100110000111111001111111001101011100011001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f8be1848b3fe84c3f3f9ae33f3f3f3f3f42
EUC-JP ??????佾??吟щ?鎰??壹?????B 00111111001111110011111100111111001111110011111110001111101100001111101100111111001111111011011011100011101001111110101100111111111011111010110100111111001111111101010011100101001111110011111100111111001111110011111101000010 3f3f3f3f3f3f8fb0fb3f3fb6e3a7eb3fefad3f3fd4e53f3f3f3f3f42
UTF-8 琉뜹쓮療명뀞佾섇퉬吟щ쨰鎰녿씡壹욱녆亮꾩꺃B 111011111010011110001100111010111001110010111001111011001001001110101110111011111010011110000001111010111010101010000101111010111000000010011110111001001011110110111110111011001000010010000111111011011000100110101100111001011001000010011111110100011000100111101100101010001011000011101001100011101011000011101011100001011011111111101100100101001010000111100101101000111011100111101100100110101011000111101011100001011000011011101111101001011011011111101010101111101010100111101010101110101000001101000010 efa78ceb9cb9ec93aeefa781ebaa85eb809ee4bdbeec8487ed89ace5909fd189eca8b0e98eb0eb85bfec94a1e5a3b9ec9ab1eb8586efa5b7eabea9eaba8342
UHC 琉뜹쓮療명뀞佾섇퉬吟щ쨰鎰녿씡壹욱녆亮꾩꺃B 11101011101001001011011011100101100111011000111011101000111111101011100011101101100001011001010111101100111010111001100011100101101110011000010011101011111000011010110011101011101001001000101011101100111100001000011011101011100111011011010111101100111011001011111111101101100001101011110111100101101110011000010011101100100000111010110001000010 eba4b6e59d8ee8feb8ed8595eceb98e5b984ebe1aceba48aecf086eb9db5ececbfed86bde5b984ec83ac42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)