To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 止??逗?彫?烝?止??逗?彫?壺?^ 100011100111111000111111001111111001000010000000001111111001001010100100001111111110000001111110001111111000111001111110001111110011111110010000100000000011111110010010101001000011111110011010111000100011111101011110 8e7e3f3f90803f92a43fe07e3f8e7e3f3f90803f92a43f9ae23f5e
EUC-JP 止?瀣逗?彫?烝?止?瀣逗?彫?壺?^ 10111011110111110011111110001111110010011011000110111111111000000011111111000100101001100011111111011111110111110011111110111011110111110011111110001111110010011011000110111111111000000011111111000100101001100011111111010100111001000011111101011110 bbdf3f8fc9b1bfe03fc4a63fdfdf3fbbdf3f8fc9b1bfe03fc4a63fd4e43f5e
UTF-8 止렍瀣逗류彫렣烝렎止렍瀣逗류彫렣壺렲^ 11100110101011011010001011101011101000001000110111100111100000001010001111101001100000001001011111101011101001011001100011100101101111011010101111101011101000001010001111100111100000111001110111101011101000001000111011100110101011011010001011101011101000001000110111100111100000001010001111101001100000001001011111101011101001011001100011100101101111011010101111101011101000001010001111100101101000111011101011101011101000001011001001011110 e6ada2eba08de780a3e98097eba598e5bdabeba0a3e7839deba08ee6ada2eba08de780a3e98097eba598e5bdabeba0a3e5a3baeba0b25e
UHC 止렍瀣逗류彫렣烝렎止렍瀣逗류彫렣壺렲^ 11110010101011011000111010100011111110101010111011010100111010001011011111111001111100001100000110001110101101001111000111110110100011101010010011110010101011011000111010100011111110101010111011010100111010001011011111111001111100001100000110001110101101001111101110111110100011101011111101011110 f2ad8ea3faaed4e8b7f9f0c18eb4f1f68ea4f2ad8ea3faaed4e8b7f9f0c18eb4fbbe8ebf5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)