To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 褥?∥意??儀??繹??儒??誘??鵝 111001011111000100111111100000010110000110001000110100110011111100111111100010110101011000111111001111111110001110001000001111110011111110001110111100100011111100111111100101110101010100111111001111111110101001000000 e5f13f816188d33f3f8b563f3fe3883f3f8ef23f3f97553f3fea40
EUC-JP 褥?‖意??儀??繹??儒??誘??鵝 111010101111001100111111101000011100001010110000110101010011111100111111101101011011011100111111001111111110010111101000001111110011111110111100111101000011111100111111110011011011011000111111001111111111001110100001 eaf33fa1c2b0d53f3fb5b73f3fe5e83f3fbcf43f3fcdb63f3ff3a1
UTF-8 褥띕∥意덌쭏儀숈춳繹먭낯儒껃짃誘⑹탪鵝 111010001010010010100101111010111001110110010101111000101000100010100101111001101000010010001111111010111000110110001100111011001010110110001111111001011000010010000000111011001000100010001000111011001011011010110011111001111011100110111001111010111010100010101101111010111000001010101111111001011000010010010010111010101011101110000011111011001010011110000011111010001010101010011000111000101001000110111001111011011000001110101010111010011011010110011101 e8a4a5eb9d95e288a5e6848feb8d8cecad8fe58480ec8888ecb6b3e7b9b9eba8adeb82afe58492eabb83eca783e8aa98e291b9ed83aae9b59d
UHC 褥띕∥意덌쭏儀숈춳繹먭낯儒껃짃誘⑹탪鵝 1110100110110011101101101110101110100001101010111110101111110010100010001110111110100111100010001110101111110000100110011110110010101101100011111110011010111010100100001110101010110011101110001110101011100011100000111110010110100011100100111110101110101111101010011110110010110101100011001110010010111101 e9b3b6eba1abebf288efa788ebf099ecad8fe6ba90eab3b8eae383e5a393ebafa9ecb58ce4bd

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)