What bit string is a character (or string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
| Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
| --- | --- | --- | --- |
| ISO-8859-1 | ?????????^ | 00111111001111110011111100111111001111110011111100111111001111110011111101011110 | 3f3f3f3f3f3f3f3f3f5e |
| SJIS-WIN | ???項??嚥??^ | 001111110011111100111111100011011000000000111111001111111001101010001011001111110011111101011110 | 3f3f3f8d803f3f9a8b3f3f5e |
| EUC-JP | ???項??嚥??^ | 001111110011111100111111101110011110000000111111001111111101001111101011001111110011111101011110 | 3f3f3fb9e03f3fd3eb3f3f5e |
| UTF-8 | 呂얏영項믤씭嚥따쨶^ | 11101111101001101000000011101100100101101000111111101100100110001000000111101001101000001000010111101011101011111010010011101100100101001010110111100101100110101010010111101011100101001011000011101100101010001011011001011110 | efa680ec968fec9881e9a085ebafa4ec94ade59aa5eb94b0eca8b65e |
| UHC | 呂얏영項믤씭嚥따쨶^ | 11100101111110111011111011100110101111111011010111111010101000111001001011100110100111011011111011100110101111111011010111111011101001001001000001011110 | e5fbbee6bfb5faa392e69dbee6bfb5fba4905e |

SJIS-Win, EUC-JP: Legacy charsets mainly used for Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
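
A conversion of this kind can be approximated in a few lines of Python. This is only a minimal sketch, not the tool's actual implementation: the codec names (`cp932` for SJIS-Win, `cp949` for UHC, and so on) are Python's closest equivalents and are an assumption, and the helper names `encode_row` and `conversion_table` are hypothetical. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the runs of `3f` in the results above. Python's mapping tables may still differ from the tool's for some edge-case characters.

```python
# Labels from the result table mapped to Python codec names.
# The codec choices are assumptions, not the tool's own identifiers.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (round-tripped text, binary string, hex string) for one charset."""
    # errors="replace" substitutes "?" for characters the charset lacks.
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)
    return data.decode(codec), bits, data.hex()


def conversion_table(text):
    """Encode the same text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


# Example: U+9805 followed by "^"; only ASCII-safe columns are printed.
for label, (_, bits, hex_string) in conversion_table("\u9805^").items():
    print(label, bits, hex_string)
```

The first element of each tuple corresponds to the "Character" column: it is the encoded byte string decoded back with the same codec, so unrepresentable characters show up as `?`.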