To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????W????Jn}????W????Jn{^ 00111111001111110011111100111111010101110011111100111111001111110011111101001010011011100111110100111111001111110011111100111111010101110011111100111111001111110011111101001010011011100111101101011110 3f3f3f3f573f3f3f3f4a6e7d3f3f3f3f573f3f3f3f4a6e7b5e
SJIS-WIN 薰ァ貉ソW薰ァ貉ソJn}薰ァ貉ソW薰ァ貉ソJn{^ 111110111001111010100111111001101011100110111111010101111111101110011110101001111110011010111001101111110100101001101110011111011111101110011110101001111110011010111001101111110101011111111011100111101010011111100110101110011011111101001010011011100111101101011110 fb9ea7e6b9bf57fb9ea7e6b9bf4a6e7dfb9ea7e6b9bf57fb9ea7e6b9bf4a6e7b5e
EUC-JP ?ァ貉ソW?ァ貉ソJn}?ァ貉ソW?ァ貉ソJn{^ 00111111100011101010011111101100101110111000111010111111010101110011111110001110101001111110110010111011100011101011111101001010011011100111110100111111100011101010011111101100101110111000111010111111010101110011111110001110101001111110110010111011100011101011111101001010011011100111101101011110 3f8ea7ecbb8ebf573f8ea7ecbb8ebf4a6e7d3f8ea7ecbb8ebf573f8ea7ecbb8ebf4a6e7b5e
UTF-8 薰ァ貉ソW薰ァ貉ソJn}薰ァ貉ソW薰ァ貉ソJn{^ 111010001001011010110000111011111011110110100111111010001011001010001001111011111011110110111111010101111110100010010110101100001110111110111101101001111110100010110010100010011110111110111101101111110100101001101110011111011110100010010110101100001110111110111101101001111110100010110010100010011110111110111101101111110101011111101000100101101011000011101111101111011010011111101000101100101000100111101111101111011011111101001010011011100111101101011110 e896b0efbda7e8b289efbdbf57e896b0efbda7e8b289efbdbf4a6e7de896b0efbda7e8b289efbdbf57e896b0efbda7e8b289efbdbf4a6e7b5e
UHC 薰???W薰???Jn}薰???W薰???Jn{^ 1111110110111001001111110011111100111111010101111111110110111001001111110011111100111111010010100110111001111101111111011011100100111111001111110011111101010111111111011011100100111111001111110011111101001010011011100111101101011110 fdb93f3f3f57fdb93f3f3f4a6e7dfdb93f3f3f57fdb93f3f3f4a6e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)