To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????^ 001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f5e
SJIS-WIN 薰・遲掵薰・遲掉^ 111110111001111010100101111001111010110110011101011111011111101110011110101001011110011110101101100111010111101101011110 fb9ea5e7ad9d7dfb9ea5e7ad9d7b5e
EUC-JP ?・遲掵?・遲掉^ 001111111000111010100101111011101010111111011001110111100011111110001110101001011110111010101111110110011101110001011110 3f8ea5eeafd9de3f8ea5eeafd9dc5e
UTF-8 薰・遲掵薰・遲掉^ 11101000100101101011000011101111101111011010010111101001100000011011001011100110100011101011010111101000100101101011000011101111101111011010010111101001100000011011001011100110100011101000100101011110 e896b0efbda5e981b2e68eb5e896b0efbda5e981b2e68e895e
UHC 薰?遲?薰?遲掉^ 1111110110111001001111111111001011000000001111111111110110111001001111111111001011000000110100111111110001011110 fdb93ff2c03ffdb93ff2c0d3fc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)