To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 而?毅驀缺錚狡??而?毅驀缺錚狡??^ 10001110101001110011111110001011010000101110100101111101111000111001111011101000010000101110000011000010001111110011111110001110101001110011111110001011010000101110100101111101111000111001111011101000010000101110000011000010001111110011111101011110 8ea73f8b42e97de39ee842e0c23f3f8ea73f8b42e97de39ee842e0c23f3f5e
EUC-JP 而?毅驀缺錚狡??而?毅驀缺錚狡??^ 10111100101010010011111110110101101000111111000111011110111001011111111011101111101000111110000011000100001111110011111110111100101010010011111110110101101000111111000111011110111001011111111011101111101000111110000011000100001111110011111101011110 bca93fb5a3f1dee5feefa3e0c43f3fbca93fb5a3f1dee5feefa3e0c43f3f5e
UTF-8 而렲毅驀缺錚狡렣㎌而렲毅驀缺錚狡렣㎊^ 11101000100000001000110011101011101000001011001011100110101011111000010111101001101010011000000011100111101111001011101011101001100011001001101011100111100010111010000111101011101000001010001111100011100011101000110011101000100000001000110011101011101000001011001011100110101011111000010111101001101010011000000011100111101111001011101011101001100011001001101011100111100010111010000111101011101000001010001111100011100011101000101001011110 e8808ceba0b2e6af85e9a980e7bcbae98c9ae78ba1eba0a3e38e8ce8808ceba0b2e6af85e9a980e7bcbae98c9ae78ba1eba0a3e38e8a5e
UHC 而렲毅驀缺錚狡렣㎌而렲毅驀缺錚狡렣㎊^ 11101100101110111000111010111111111010111111011011011000111010011100110011000000111011101011011011001110111010101000111010110100101001111101111011101100101110111000111010111111111010111111011011011000111010011100110011000000111011101011011011001110111010101000111010110100101001111101110001011110 ecbb8ebfebf6d8e9ccc0eeb6ceea8eb4a7deecbb8ebfebf6d8e9ccc0eeb6ceea8eb4a7dc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)