To what bit string is a character (or a sequence of characters) encoded in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????B 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 淏晢スエ涬会スス雖永淏晢スエ涬会スス雖永B 111110110100001010011101111011111011110110110100111110110100000110001001111011111011110110111101111001011010101110001001011010011111101101000010100111011110111110111101101101001111101101000001100010011110111110111101101111011110010110101011100010010110100101000010 fb429defbdb4fb4189efbdbde5ab8969fb429defbdb4fb4189efbdbde5ab896942
EUC-JP 淏晢スエ涬会スス雖永淏晢スエ涬会スス雖永B 100011111100011111011001110110101111000110001110101111011000111010110100100011111100011111001111101100101111000110001110101111011000111010111101111010101010110110110001110010101000111111000111110110011101101011110001100011101011110110001110101101001000111111000111110011111011001011110001100011101011110110001110101111011110101010101101101100011100101001000010 8fc7d9daf18ebd8eb48fc7cfb2f18ebd8ebdeaadb1ca8fc7d9daf18ebd8eb48fc7cfb2f18ebd8ebdeaadb1ca42
UTF-8 淏晢スエ涬会スス雖永淏晢スエ涬会スス雖永B 11100110101101111000111111100110100110011010001011101111101111011011110111101111101111011011010011100110101101101010110011100100101111001001101011101111101111011011110111101111101111011011110111101001100110111001011011100110101100001011100011100110101101111000111111100110100110011010001011101111101111011011110111101111101111011011010011100110101101101010110011100100101111001001101011101111101111011011110111101111101111011011110111101001100110111001011011100110101100001011100001000010 e6b78fe699a2efbdbdefbdb4e6b6ace4bc9aefbdbdefbdbde99b96e6b0b8e6b78fe699a2efbdbdefbdb4e6b6ace4bc9aefbdbdefbdbde99b96e6b0b842
UHC 淏???????雖永淏???????雖永B 111110111100100000111111001111110011111100111111001111110011111100111111111000101100110011100111101101011111101111001000001111110011111100111111001111110011111100111111001111111110001011001100111001111011010101000010 fbc83f3f3f3f3f3f3fe2cce7b5fbc83f3f3f3f3f3f3fe2cce7b542

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
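
A conversion like the one in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The mapping from the charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) and the helper name `encode_rows` are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (hex 3f), which resembles the `3f` runs in the ISO-8859-1 and UHC rows, but byte-for-byte agreement with the table is not guaranteed.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_rows(text):
    """Return (label, binary string, hex string) for each charset.

    Unencodable characters are replaced with '?' (0x3f).
    """
    rows = []
    for label, codec in CHARSETS.items():
        data = text.encode(codec, errors="replace")
        # Render every byte as eight binary digits, most significant bit first.
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, bits, data.hex()))
    return rows


# Example: one hiragana character followed by an ASCII letter.
for label, bits, hexa in encode_rows("あB"):
    print(label, bits, hexa)
```

Each output line lists the charset label, the binary bit string, and the hexadecimal form, in the same column order as the table above.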