To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ???午ヨ?節?????節??五??雅ζ?^ 0011111100111111001111111000110011011111100000111000100000111111100100001101111100111111001111110011111100111111001111111001000011011111001111110011111110001100110111000011111100111111100010011110101110000011110001000011111101011110 3f3f3f8cdf83883f90df3f3f3f3f3f90df3f3f8cdc3f3f89eb83c43f5e
EUC-JP 旿??午ヨ?節??孼??節??五??雅ζ?^ 100011111100000111110100001111110011111110111000111000011010010111101000001111111100000011100001001111110011111110001111101110101100001100111111001111111100000011100001001111110011111110111000110111100011111100111111101100101110110110100110110001100011111101011110 8fc1f43f3fb8e1a5e83fc0e13f3f8fbac33f3fc0e13f3fb8de3f3fb2eda6c63f5e
UTF-8 旿울쉘午ヨ땽節답벤孼닺쑠節⑵룶五볩슈雅ζ뤃^ 111001101001011110111111111011001001101010111000111011001000100110011000111001011000110110001000111000111000001110101000111010111001010110111101111001111010111110000000111010111000101110110101111010111011001010100100111001011010110110111100111010111000101110111010111011001001000110100000111001111010111110000000111000101001000110110101111010111010001110110110111001001011101010010100111010111011001110101001111011001000101010001000111010011001101110000101110011101011011011101011101001001000001101011110 e697bfec9ab8ec8998e58d88e383a8eb95bde7af80eb8bb5ebb2a4e5adbceb8bbaec91a0e7af80e291b5eba3b6e4ba94ebb3a9ec8a88e99b85ceb6eba4835e
UHC 旿울쉘午ヨ땽節답벤孼닺쑠節⑵룶五볩슈雅ζ뤃^ 11100111111110101011111111101111101111011010100111100111111011011010101111101000100010111001001111101111101111011011010011100100101110101010010111100101111011011011010011101000100111001011111111101111101111011010100111101000100011111010101111100111111010011001001111101111101111011011010011100100101110101010010111100110100011111011010001011110 e7fabfefbda9e7edabe88b93efbdb4e4baa5e5edb4e89cbfefbda9e88fabe7e993efbdb4e4baa5e68fb45e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)