To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????h??????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111011010000011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f683f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 閼ア莉冶「冶セソ蜊呈據h閼ア莉冶「冶セソ蜊呈據 11101000100001001011000111100100101110111001011011101000101000101001011011101000101111101011111111100101100011011001001011100110100111011001111101101000111010001000010010110001111001001011101110010110111010001010001010010110111010001011111010111111111001011000110110010010111001101001110110011111 e884b1e4bb96e8a296e8bebfe58d92e69d9f68e884b1e4bb96e8a296e8bebfe58d92e69d9f
EUC-JP 閼ア莉冶「冶セソ蜊呈據h閼ア莉冶「冶セソ蜊呈據 111011111110010010001110101100011110100010111101110011001110101010001110101000101100110011101010100011101011111010001110101111111110100111101101110001001110100011011010101000010110100011101111111001001000111010110001111010001011110111001100111010101000111010100010110011001110101010001110101111101000111010111111111010011110110111000100111010001101101010100001 efe48eb1e8bdccea8ea2ccea8ebe8ebfe9edc4e8daa168efe48eb1e8bdccea8ea2ccea8ebe8ebfe9edc4e8daa1
UTF-8 閼ア莉冶「冶セソ蜊呈據h閼ア莉冶「冶セソ蜊呈據 11101001100101101011110011101111101111011011000111101000100011101000100111100101100001101011011011101111101111011010001011100101100001101011011011101111101111011011111011101111101111011011111111101000100111001000101011100101100100011000100011100110100100111001101001101000111010011001011010111100111011111011110110110001111010001000111010001001111001011000011010110110111011111011110110100010111001011000011010110110111011111011110110111110111011111011110110111111111010001001110010001010111001011001000110001000111001101001001110011010 e996bcefbdb1e88e89e586b6efbda2e586b6efbdbeefbdbfe89c8ae59188e6939a68e996bcefbdb1e88e89e586b6efbda2e586b6efbdbeefbdbfe89c8ae59188e6939a
UHC 閼?莉冶?冶???呈據h閼?莉冶?冶???呈據 1110010011011001001111111101011111101001111001011010011100111111111001011010011100111111001111110011111111101111110100001100101111100000011010001110010011011001001111111101011111101001111001011010011100111111111001011010011100111111001111110011111111101111110100001100101111100000 e4d93fd7e9e5a73fe5a73f3f3fefd0cbe068e4d93fd7e9e5a73fe5a73f3f3fefd0cbe0

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)