What bit string is a character (or short string of characters) encoded as in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset  Character  Bit string (binary)  Bit string (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 蜈??嗽??巍??隘??冗??蜈??節??^ 1110010110000101001111110011111110011010011101010011111100111111100110111101100100111111001111111110100010100101001111110011111110001111111001110011111100111111111001011000010100111111001111111001000011011111001111110011111101011110 e5853f3f9a753f3f9bd93f3fe8a53f3f8fe73f3fe5853f3f90df3f3f5e
EUC-JP 蜈??嗽??巍??隘??冗??蜈??節??^ 1110100111100101001111110011111111010011110101100011111100111111110101101101101100111111001111111111000010100111001111110011111110111110111010010011111100111111111010011110010100111111001111111100000011100001001111110011111101011110 e9e53f3fd3d63f3fd6db3f3ff0a73f3fbee93f3fe9e53f3fc0e13f3f5e
UTF-8 蜈좈레嗽뤺쪧巍랃슐隘꿱쪧冗밟뿈蜈좈냽節뜹샂^ 11101000100111001000100011101100101000101000100011101011101000001000100011100101100101111011110111101011101001001011101011101100101010101010011111100101101101111000110111101011100111101000001111101100100010101001000011101001100110101001100011101010101111111011000111101100101010101010011111100101100001101001011111101011101100001001111111101011101111111000100011101000100111001000100011101100101000101000100011101011100000111011110111100111101011111000000011101011100111001011100111101100100000111000001001011110 e89c88eca288eba088e597bdeba4baecaaa7e5b78deb9e83ec8a90e99a98eabfb1ecaaa7e58697ebb09febbf88e89c88eca288eb83bde7af80eb9cb9ec83825e
UHC 蜈좈레嗽뤺쪧巍랃슐隘꿱쪧冗밟뿈蜈좈냽節뜹샂^ 11101000101001011010000011101001101101111011100111100001111101011000111111101000101001011010000011101000111001001000110111101111101111011011011011100100111101101011001011101000101001011010000011101001101101111011100111100010100101111000111111101000101001011010000011101001100001101000110111101111101111011011011011100101100110001011010001011110 e8a5a0e9b7b9e1f58fe8a5a0e8e48defbdb6e4f6b2e8a5a0e9b7b9e2978fe8a5a0e9868defbdb6e598b45e

SJIS-win, EUC-JP: Legacy character sets used mainly for Japanese text, on Windows (SJIS-win = CP932) and on UNIX (EUC-JP).
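
The same kind of conversion can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the helper name `encode_table` is hypothetical, and the mapping from the charset labels above to Python codec names (`cp932` for SJIS-win, `cp949` for UHC) is an assumption. Characters that a charset cannot represent are replaced with `?` (hex 3f), which resembles the `3f` runs in the table above.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-win": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_table("^"):
        print(label, bits, hex_string)
```

Because `^` is an ASCII character and all five charsets are ASCII-compatible, every row for that input is `01011110` / `5e`, matching the last byte of each row in the table.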