What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 荳茨セ呈訒コ趙シ鮨奇スエ螻。 111001001011100010001000111011111011111010010010111001101111010110010100111110111010001110111010111001101110001010111100111010011011110110001010111011111011110110110100111001011011000110100001 e4b888efbe92e6f594fba3bae6e2bce9bd8aefbdb4e5b1a1
EUC-JP 荳茨セ呈?訒コ趙シ鮨奇スエ螻。 111010001011101010110000111100011000111010111110110001001110100000111111100011111101110111001000100011101011101011101100111001001000111010111100111100101011111110110100111100011000111010111101100011101011010011101010101100111000111010100001 e8bab0f18ebec4e83f8fddc88ebaece48ebcf2bfb4f18ebd8eb4eab38ea1
UTF-8 荳茨セ呈訒コ趙シ鮨奇スエ螻。 111010001000110110110011111010001000110010101000111011111011110110111110111001011001000110001000111011101000111110111111111010001010100010010010111011111011110110111010111010001011011010011001111011111011110110111100111010011010111010101000111001011010010110000111111011111011110110111101111011111011110110110100111010001001111010111011111011111011110110100001 e88db3e88ca8efbdbee59188ee8fbfe8a892efbdbae8b699efbdbce9aea8e5a587efbdbdefbdb4e89ebbefbda1
UHC 荳茨?呈???趙??奇???? 1101010011100101111011011011110000111111111011111101000000111111001111110011111111110000111000010011111100111111110100001111010000111111001111110011111100111111 d4e5edbc3fefd03f3f3ff0e13f3fd0f43f3f3f3f

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
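
The conversion in the table can be approximated with a short Python sketch. This is not the code behind this page. The helper name to_bit_strings and the mapping from the page's charset labels to Python codec names (cp932 for SJIS-Win, cp949 for UHC) are assumptions, and Python's codecs may differ from this tool's in edge cases. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the 3f bytes in the ISO-8859-1 row.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Return (binary, hexadecimal) strings for text encoded with codec.

    Hypothetical helper for illustration only.
    """
    # errors="replace" substitutes '?' (0x3f) for unencodable characters.
    data = text.encode(codec, errors="replace")
    # Each byte becomes eight bits, keeping leading zeros.
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


if __name__ == "__main__":
    for label, codec in CHARSETS.items():
        binary, hexadecimal = to_bit_strings("あ", codec)
        print(label, binary, hexadecimal)
```

For example, the hiragana "あ" encodes to e38182 in UTF-8, 82a0 in CP932, and a4a2 in EUC-JP, while ISO-8859-1 cannot represent it and yields 3f.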