To what bit string is a character (or a short string) encoded in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
| Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
|---|---|---|---|
| ISO-8859-1 | ?????????????? | 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f3f3f3f3f3f3f3f3f |
| SJIS-WIN | 薰ゥ雖牙∪霎樶か雖蛾エ」霆ク | 111110111001111010101001111001011010101110001001111001011000000110111110111010001011111010011110111011101000001010101001111001011010101110001001111010011011010010100011111010001011101110111000 | fb9ea9e5ab89e581bee8be9eee82a9e5ab89e9b4a3e8bbb8 |
| EUC-JP | ?ゥ雖牙∪霎樶か雖蛾エ」霆ク | 001111111000111010101001111010101010110110110010111001111010001011000000111100001100000011011100111100001010010010101011111010101010110110110010111010111000111010110100100011101010001111110000101111011000111010111000 | 3f8ea9eaadb2e7a2c0f0c0dcf0a4abeaadb2eb8eb48ea3f0bd8eb8 |
| UTF-8 | 薰ゥ雖牙∪霎樶か雖蛾エ」霆ク | 111010001001011010110000111011111011110110101001111010011001101110010110111001111000100110011001111000101000100010101010111010011001110010001110111001101010100010110110111000111000000110001011111010011001101110010110111010001001101110111110111011111011110110110100111011111011110110100011111010011001110010000110111011111011110110111000 | e896b0efbda9e99b96e78999e288aae99c8ee6a8b6e3818be99b96e89bbeefbdb4efbda3e99c86efbdb8 |
| UHC | 薰?雖牙∪??か雖蛾??霆? | 11111101101110010011111111100010110011001110010010110011101000011111101000111111001111111010101010101011111000101100110011100100101101100011111100111111111011111111110100111111 | fdb93fe2cce4b3a1fa3f3faaabe2cce4b63f3feffd3f |

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
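
A conversion of this kind can be approximated offline. The following is a minimal Python sketch, not the tool's actual implementation. The helper name `encode_table` and the mapping from the labels above to Python codec names are assumptions: `cp932` is used for SJIS-WIN, as the note above suggests, and `cp949` is assumed for UHC. Characters that a charset cannot represent are replaced with `?` (byte `3f`), which appears to match the `3f` bytes in the results above.

```python
# Assumed mapping from the labels used above to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # SJIS-Win = CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to code page 949
}


def encode_table(text):
    """Return (label, displayed text, binary string, hex string) per charset.

    Hypothetical helper for illustration only.
    """
    rows = []
    for label, codec in CODECS.items():
        # Unencodable characters become "?" instead of raising an error.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
        rows.append((label, raw.decode(codec), bits, raw.hex()))
    return rows


for label, shown, bits, hexs in encode_table("か"):
    print(label, shown, bits, hexs)
```

For the single character `か`, this should give the byte sequences `82a9` (SJIS-WIN), `a4ab` (EUC-JP), `e3818b` (UTF-8) and `aaab` (UHC), which also occur inside the longer strings above. ISO-8859-1 cannot represent the character and yields `3f`.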