To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 貉ソ驥域ケソ驟碁楜貉ソ驥域ケソ驟碁楜B 11100110101110011011111111101001100001111000100011100110101110011011111111101001100001011000110011101001100111101011001111100110101110011011111111101001100001111000100011100110101110011011111111101001100001011000110011101001100111101011001101000010 e6b9bfe98788e6b9bfe9858ce99eb3e6b9bfe98788e6b9bfe9858ce99eb342
EUC-JP 貉ソ驥域ケソ驟碁楜貉ソ驥域ケソ驟碁楜B 11101100101110111000111010111111111100011110011110110000111010001000111010111001100011101011111111110001111001011011100011101011110111001011010111101100101110111000111010111111111100011110011110110000111010001000111010111001100011101011111111110001111001011011100011101011110111001011010101000010 ecbb8ebff1e7b0e88eb98ebff1e5b8ebdcb5ecbb8ebff1e7b0e88eb98ebff1e5b8ebdcb542
UTF-8 貉ソ驥域ケソ驟碁楜貉ソ驥域ケソ驟碁楜B 11101000101100101000100111101111101111011011111111101001101010011010010111100101100111111001111111101111101111011011100111101111101111011011111111101001101010011001111111100111101000101000000111100110101001011001110011101000101100101000100111101111101111011011111111101001101010011010010111100101100111111001111111101111101111011011100111101111101111011011111111101001101010011001111111100111101000101000000111100110101001011001110001000010 e8b289efbdbfe9a9a5e59f9fefbdb9efbdbfe9a99fe7a281e6a59ce8b289efbdbfe9a9a5e59f9fefbdb9efbdbfe9a99fe7a281e6a59c42
UHC ??驥域??驟碁???驥域??驟碁?B 001111110011111111010001110010101110011010110100001111110011111111110110101011101101000110110011001111110011111100111111110100011100101011100110101101000011111100111111111101101010111011010001101100110011111101000010 3f3fd1cae6b43f3ff6aed1b33f3f3fd1cae6b43f3ff6aed1b33f42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)