What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????N}????????N{^ 001111110011111100111111001111110011111100111111001111110011111101001110011111010011111100111111001111110011111100111111001111110011111100111111010011100111101101011110 3f3f3f3f3f3f3f3f4e7d3f3f3f3f3f3f3f3f4e7b5e
SJIS-WIN 晶ミシュ上骼キ灼N}晶ミシュ上骼キ灼N{^ 1000111110111011110100001011110010101101100011111110001111101001100011101011011110001110110111000100111001111101100011111011101111010000101111001010110110001111111000111110100110001110101101111000111011011100010011100111101101011110 8fbbd0bcad8fe3e98eb78edc4e7d8fbbd0bcad8fe3e98eb78edc4e7b5e
EUC-JP 晶ミシュ上骼キ灼N}晶ミシュ上骼キ灼N{^ 10111110101111011000111011010000100011101011110010001110101011011011111011100101111100011110111010001110101101111011110011011110010011100111110110111110101111011000111011010000100011101011110010001110101011011011111011100101111100011110111010001110101101111011110011011110010011100111101101011110 bebd8ed08ebc8eadbee5f1ee8eb7bcde4e7dbebd8ed08ebc8eadbee5f1ee8eb7bcde4e7b5e
UTF-8 晶ミシュ上骼キ灼N}晶ミシュ上骼キ灼N{^ 1110011010011001101101101110111110111110100100001110111110111101101111001110111110111101101011011110010010111000100010101110100110101010101111001110111110111101101101111110011110000001101111000100111001111101111001101001100110110110111011111011111010010000111011111011110110111100111011111011110110101101111001001011100010001010111010011010101010111100111011111011110110110111111001111000000110111100010011100111101101011110 e699b6efbe90efbdbcefbdade4b88ae9aabcefbdb7e781bc4e7de699b6efbe90efbdbcefbdade4b88ae9aabcefbdb7e781bc4e7b5e
UHC 晶???上??灼N}晶???上??灼N{^ 111011111101110000111111001111110011111111011111101111100011111100111111111011011100011101001110011111011110111111011100001111110011111100111111110111111011111000111111001111111110110111000111010011100111101101011110 efdc3f3f3fdfbe3f3fedc74e7defdc3f3f3fdfbe3f3fedc74e7b5e

SJIS-WIN, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-WIN = CP932) and on UNIX (EUC-JP).
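
The same kind of conversion can be approximated offline. The following is a minimal Python sketch, not this tool's actual implementation. The mapping from the charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) and the helper name encode_bits are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (0x3f), which mirrors the 3f bytes in the table above.

```python
# Charset labels from the table mapped to Python codec names.
# This mapping is an assumption; the tool may use different converters.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_bits(text, codec):
    """Encode text with the given codec and return (binary, hex) strings.

    Unencodable characters are replaced with "?" instead of raising an error.
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


# Print one row per charset for a sample character.
for charset, codec in CODECS.items():
    binary, hexadecimal = encode_bits("晶", codec)
    print(charset, binary, hexadecimal)
```

Because the replacement policy and the exact code page tables are assumptions, results for rare characters may differ from what the tool reports.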