To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????E 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000101 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f45
SJIS-WIN 霍狗、シ遐りキ也く霍狗、シ遐りキ夜惚E 11101000101101111000101111100111101001001011110011100111101000001000001011101000101101111001011011100111100000101010110111101000101101111000101111100111101001001011110011100111101000001000001011101000101101111001011011101001100011011001101101000101 e8b78be7a4bce7a082e8b796e782ade8b78be7a4bce7a082e8b796e98d9b45
EUC-JP 霍狗、シ遐りキ也く霍狗、シ遐りキ夜惚E 11110000101110011011011011101001100011101010010010001110101111001110111010100010101001001110101010001110101101111100110011101001101001001010111111110000101110011011011011101001100011101010010010001110101111001110111010100010101001001110101010001110101101111100110011101011101110011111101101000101 f0b9b6e98ea48ebceea2a4ea8eb7cce9a4aff0b9b6e98ea48ebceea2a4ea8eb7ccebb9fb45
UTF-8 霍狗、シ遐りキ也く霍狗、シ遐りキ夜惚E 11101001100111001000110111100111100010111001011111101111101111011010010011101111101111011011110011101001100000011001000011100011100000101000101011101111101111011011011111100100101110011001111111100011100000011000111111101001100111001000110111100111100010111001011111101111101111011010010011101111101111011011110011101001100000011001000011100011100000101000101011101111101111011011011111100101101001001001110011100110100000111001101001000101 e99c8de78b97efbda4efbdbce98190e3828aefbdb7e4b99fe3818fe99c8de78b97efbda4efbdbce98190e3828aefbdb7e5a49ce6839a45
UHC ?狗??遐り?也く?狗??遐り?夜惚E 0011111111001111101101110011111100111111111110011100011010101010111010100011111111100101101001011010101010101111001111111100111110110111001111110011111111111001110001101010101011101010001111111110010110101000111110111110110101000101 3fcfb73f3ff9c6aaea3fe5a5aaaf3fcfb73f3ff9c6aaea3fe5a8fbed45

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)