To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????R????^[????R????^[^ 0011111100111111001111110011111101010010001111110011111100111111001111110101111001011011001111110011111100111111001111110101001000111111001111110011111100111111010111100101101101011110 3f3f3f3f523f3f3f3f5e5b3f3f3f3f523f3f3f3f5e5b5e
SJIS-WIN 鬘矩。杵R鬘矩。杵^[鬘矩。杵R鬘矩。杵^[^ 1110100110100001100010111110100110100001100010110110111001010010111010011010000110001011111010011010000110001011011011100101111001011011111010011010000110001011111010011010000110001011011011100101001011101001101000011000101111101001101000011000101101101110010111100101101101011110 e9a18be9a18b6e52e9a18be9a18b6e5e5be9a18be9a18b6e52e9a18be9a18b6e5e5b5e
EUC-JP 鬘矩。杵R鬘矩。杵^[鬘矩。杵R鬘矩。杵^[^ 111100101010001110110110111010111000111010100001101101011100111101010010111100101010001110110110111010111000111010100001101101011100111101011110010110111111001010100011101101101110101110001110101000011011010111001111010100101111001010100011101101101110101110001110101000011011010111001111010111100101101101011110 f2a3b6eb8ea1b5cf52f2a3b6eb8ea1b5cf5e5bf2a3b6eb8ea1b5cf52f2a3b6eb8ea1b5cf5e5b5e
UTF-8 鬘矩。杵R鬘矩。杵^[鬘矩。杵R鬘矩。杵^[^ 11101001101011001001100011100111100111111010100111101111101111011010000111100110100111011011010101010010111010011010110010011000111001111001111110101001111011111011110110100001111001101001110110110101010111100101101111101001101011001001100011100111100111111010100111101111101111011010000111100110100111011011010101010010111010011010110010011000111001111001111110101001111011111011110110100001111001101001110110110101010111100101101101011110 e9ac98e79fa9efbda1e69db552e9ac98e79fa9efbda1e69db55e5be9ac98e79fa9efbda1e69db552e9ac98e79fa9efbda1e69db55e5b5e
UHC ?矩?杵R?矩?杵^[?矩?杵R?矩?杵^[^ 00111111110011111011101100111111111011101011111001010010001111111100111110111011001111111110111010111110010111100101101100111111110011111011101100111111111011101011111001010010001111111100111110111011001111111110111010111110010111100101101101011110 3fcfbb3feebe523fcfbb3feebe5e5b3fcfbb3feebe523fcfbb3feebe5e5b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)