To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????i??????????iB 0011111100111111001111110011111100111111001111110011111100111111001111110011111101101001001111110011111100111111001111110011111100111111001111110011111100111111001111110110100101000010 3f3f3f3f3f3f3f3f3f3f693f3f3f3f3f3f3f3f3f3f6942
SJIS-WIN 贈?????爰?殞?i贈?????爰?殞?iB 1001000110100001001111110011111100111111001111110011111111100000101001110011111110011111011011010011111101101001100100011010000100111111001111110011111100111111001111111110000010100111001111111001111101101101001111110110100101000010 91a13f3f3f3f3fe0a73f9f6d3f6991a13f3f3f3f3fe0a73f9f6d3f6942
EUC-JP 贈?????爰?殞?i贈?????爰?殞?iB 1100001010100011001111110011111100111111001111110011111111100000101010010011111111011101110011100011111101101001110000101010001100111111001111110011111100111111001111111110000010101001001111111101110111001110001111110110100101000010 c2a33f3f3f3f3fe0a93fddce3f69c2a33f3f3f3f3fe0a93fddce3f6942
UTF-8 贈계렢목렰렚爰렧殞렗i贈계렢목렰렚爰렧殞렗iB 111010001011010010001000111010101011001110000100111010111010000010100010111010111010101010101001111010111010000010110000111010111010000010011010111001111000100010110000111010111010000010100111111001101010111010011110111010111010000010010111011010011110100010110100100010001110101010110011100001001110101110100000101000101110101110101010101010011110101110100000101100001110101110100000100110101110011110001000101100001110101110100000101001111110011010101110100111101110101110100000100101110110100101000010 e8b488eab384eba0a2ebaaa9eba0b0eba09ae788b0eba0a7e6ae9eeba09769e8b488eab384eba0a2ebaaa9eba0b0eba09ae788b0eba0a7e6ae9eeba0976942
UHC 贈계렢목렰렚爰렧殞렗i贈계렢목렰렚爰렧殞렗iB 11110001111111001011000011101000100011101011001110111000111100011000111010111101100011101010110111101010101110101000111010110110111010011111100110001110101011000110100111110001111111001011000011101000100011101011001110111000111100011000111010111101100011101010110111101010101110101000111010110110111010011111100110001110101011000110100101000010 f1fcb0e88eb3b8f18ebd8eadeaba8eb6e9f98eac69f1fcb0e88eb3b8f18ebd8eadeaba8eb6e9f98eac6942

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)