To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 松?6?ル+鈺??松?6?ル+鈺??B 1000111110111100001111111000001001010101001111111000001110001011100000010111101111111011110001000011111100111111100011111011110000111111100000100101010100111111100000111000101110000001011110111111101111000100001111110011111101000010 8fbc3f82553f838b817bfbc43f3f8fbc3f82553f838b817bfbc43f3f42
EUC-JP 松?6?ル+鈺??松?6?ル+鈺??B 10111110101111100011111110100011101101100011111110100101111010111010000111011100100011111110001111010101001111110011111110111110101111100011111110100011101101100011111110100101111010111010000111011100100011111110001111010101001111110011111101000010 bebe3fa3b63fa5eba1dc8fe3d53f3fbebe3fa3b63fa5eba1dc8fe3d53f3f42
UTF-8 松듬6略ル+鈺곈띃松듬6略ル+鈺곈띃B 11100110100111011011111011101011100100111010110011101111101111001001011011101111101001011011011011100011100000111010101111101111101111001000101111101001100010001011101011101010101100111000100011101011100111011000001111100110100111011011111011101011100100111010110011101111101111001001011011101111101001011011011011100011100000111010101111101111101111001000101111101001100010001011101011101010101100111000100011101011100111011000001101000010 e69dbeeb93acefbc96efa5b6e383abefbc8be988baeab388eb9d83e69dbeeb93acefbc96efa5b6e383abefbc8be988baeab388eb9d8342
UHC 松듬6略ル+鈺곈띃松듬6略ル+鈺곈띃B 11100001111001101011010111101011101000111011011011100101101100101010101111101011101000111010101111101000101011011011000011101001100011011011111011100001111001101011010111101011101000111011011011100101101100101010101111101011101000111010101111101000101011011011000011101001100011011011111001000010 e1e6b5eba3b6e5b2abeba3abe8adb0e98dbee1e6b5eba3b6e5b2abeba3abe8adb0e98dbe42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)