To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 荳茨セ呈訒コ趙シ鮨奇スエ螻。渧。遉コ 111001001011100010001000111011111011111010010010111001101111010110010100111110111010001110111010111001101110001010111100111010011011110110001010111011111011110110110100111001011011000110100001111110110100100010100001111001111010010010111010 e4b888efbe92e6f594fba3bae6e2bce9bd8aefbdb4e5b1a1fb48a1e7a4ba
EUC-JP 荳茨セ呈?訒コ趙シ鮨奇スエ螻。渧。遉コ 111010001011101010110000111100011000111010111110110001001110100000111111100011111101110111001000100011101011101011101100111001001000111010111100111100101011111110110100111100011000111010111101100011101011010011101010101100111000111010100001100011111100011111101011100011101010000111101110101001101000111010111010 e8bab0f18ebec4e83f8fddc88ebaece48ebcf2bfb4f18ebd8eb4eab38ea18fc7eb8ea1eea68eba
UTF-8 荳茨セ呈訒コ趙シ鮨奇スエ螻。渧。遉コ 111010001000110110110011111010001000110010101000111011111011110110111110111001011001000110001000111011101000111110111111111010001010100010010010111011111011110110111010111010001011011010011001111011111011110110111100111010011010111010101000111001011010010110000111111011111011110110111101111011111011110110110100111010001001111010111011111011111011110110100001111001101011100010100111111011111011110110100001111010011000000110001001111011111011110110111010 e88db3e88ca8efbdbee59188ee8fbfe8a892efbdbae8b699efbdbce9aea8e5a587efbdbdefbdb4e89ebbefbda1e6b8a7efbda1e98189efbdba
UHC 荳茨?呈???趙??奇???????? 110101001110010111101101101111000011111111101111110100000011111100111111001111111111000011100001001111110011111111010000111101000011111100111111001111110011111100111111001111110011111100111111 d4e5edbc3fefd03f3f3ff0e13f3fd0f43f3f3f3f3f3f3f3f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)