Which bit string is a character (or string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 迥匁ケソ遽晁サ顔刊迚倡竃貉ソ遽晁サ顔刊迚録 11100111100010101001011011100110101110011011111111100111101011111001110111101000101110111000101011100111100010101010011111100111100010011001100011100111100010101001011011100110101110011011111111100111101011111001110111101000101110111000101011100111100010101010011111100111100010011001100001011110 e78a96e6b9bfe7af9de8bb8ae78aa7e78998e78a96e6b9bfe7af9de8bb8ae78aa7e789985e
EUC-JP 迥匁ケソ遽晁サ顔刊迚倡竃貉ソ遽晁サ顔刊迚録 111011011110101011001100111010001000111010111001100011101011111111101110101100011101101011101010100011101011101110110100111010011011010010101001111011011110100111010000111010011011001111110110111011001011101110001110101111111110111010110001110110101110101010001110101110111011010011101001101101001010100111101101111010011100111110111111 edeacce88eb98ebfeeb1daea8ebbb4e9b4a9ede9d0e9b3f6ecbb8ebfeeb1daea8ebbb4e9b4a9ede9cfbf
UTF-8 迥匁ケソ遽晁サ顔刊迚倡竃貉ソ遽晁サ顔刊迚録 111010001011111110100101111001011000110010000001111011111011110110111001111011111011110110111111111010011000000110111101111001101001100110000001111011111011110110111011111010011010000110010100111001011000100010001010111010001011111110011010111001011000000010100001111001111010101110000011111010001011001010001001111011111011110110111111111010011000000110111101111001101001100110000001111011111011110110111011111010011010000110010100111001011000100010001010111010001011111110011010111010011000110010110010 e8bfa5e58c81efbdb9efbdbfe981bde69981efbdbbe9a194e5888ae8bf9ae580a1e7ab83e8b289efbdbfe981bde69981efbdbbe9a194e5888ae8bf9ae98cb2
UHC ????遽晁?顔刊?倡???遽晁?顔刊?? 001111110011111100111111001111111100101111101000111100001100010100111111111001001101010011001010110010100011111111110011110110110011111100111111001111111100101111101000111100001100010100111111111001001101010011001010110010100011111100111111 3f3f3f3fcbe8f0c53fe4d4caca3ff3db3f3f3fcbe8f0c53fe4d4caca3f3f

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
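
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the implementation behind this page. The mapping from the charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) is an assumption, and the helper names `encode_row` and `encode_table` are hypothetical. Characters a charset cannot represent are replaced with `?`.

```python
# Assumed mapping from the table's charset labels to Python codec names.
# The page itself may use a different conversion library.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary bit string, hexadecimal string) for text in one codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS (hypothetical helper)."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("あ").items():
        print(label, bits, hex_string)
```

With `errors="replace"`, every unencodable character becomes the single byte 3f, the same pattern as the ISO-8859-1 row above.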