To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 韋慕甥銷ォ魄暦スサ髣泌鞫﨟埼ョ暦スサ^ 11101000111010001001010111100111100010011001100111100111111101111010101111101001101011101001011111101111101111011011101111101001100101111001010011100101111001101001000111111011100111011000110111101001101011101001011111101111101111011011101101011110 e8e895e78999e7f7abe9ae97efbdbbe99794e5e691fb9d8de9ae97efbdbb5e
EUC-JP 韋慕甥銷ォ魄暦スサ髣泌鞫?埼ョ暦スサ^ 111100001110101011001010111010011011000111111001111011101111100110001110101010111111001010110000110011101111000110001110101111011000111010111011111100011111011111001000111001111110101111110001001111111011101011101011100011101010111011001110111100011000111010111101100011101011101101011110 f0eacae9b1f9eef98eabf2b0cef18ebd8ebbf1f7c8e7ebf13fbaeb8eaecef18ebd8ebb5e
UTF-8 韋慕甥銷ォ魄暦スサ髣泌鞫﨟埼ョ暦スサ^ 11101001100111111000101111100110100001011001010111100111100101001010010111101001100010101011011111101111101111011010101111101001101011011000010011100110100110101010011011101111101111011011110111101111101111011011101111101001101010111010001111100110101100111000110011101001100111101010101111101111101010001001111111100101100111111011110011101111101111011010111011100110100110101010011011101111101111011011110111101111101111011011101101011110 e99f8be68595e794a5e98ab7efbdabe9ad84e69aa6efbdbdefbdbbe9aba3e6b38ce99eabefa89fe59fbcefbdaee69aa6efbdbdefbdbb5e
UHC 韋慕甥銷?魄????泌鞫?埼????^ 111010101101111111011001101101111101111111100111111000011101000100111111110110111101111000111111001111110011111100111111111110011011001011001111110101000011111111010000111100100011111100111111001111110011111101011110 eadfd9b7dfe7e1d13fdbde3f3f3f3ff9b2cfd43fd0f23f3f3f3f5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)