To what bit string is a character (or a sequence of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?æ??????æ??????æ???? 0011111111100110001111110011111100111111001111110011111100111111111001100011111100111111001111110011111100111111001111111110011000111111001111110011111100111111 3fe63f3f3f3f3f3fe63f3f3f3f3f3fe63f3f3f3f
SJIS-WIN ??????怡??????醫?????’ 0011111100111111001111110011111100111111001111111001110001111101001111110011111100111111001111110011111100111111111001111100111000111111001111110011111100111111001111111000000101100110 3f3f3f3f3f3f9c7d3f3f3f3f3f3fe7ce3f3f3f3f3f8166
EUC-JP ?æ????怡?æ????醫?æ???’ 0011111110001111101010011100000100111111001111110011111100111111110101111101111000111111100011111010100111000001001111110011111100111111001111111110111011010000001111111000111110101001110000010011111100111111001111111010000111000111 3f8fa9c13f3f3f3fd7de3f8fa9c13f3f3f3feed03f8fa9c13f3f3fa1c7
UTF-8 룶æ熉룶절룫怡룶æ熉룶절룫醫룶æ熉룶절’ 111010111010001110110110110000111010011011100111100001101000100111101011101000111011011011101100101000001000100011101011101000111010101111100110100000001010000111101011101000111011011011000011101001101110011110000110100010011110101110100011101101101110110010100000100010001110101110100011101010111110100110000110101010111110101110100011101101101100001110100110111001111000011010001001111010111010001110110110111011001010000010001000111000101000000010011001 eba3b6c3a6e78689eba3b6eca088eba3abe680a1eba3b6c3a6e78689eba3b6eca088eba3abe986abeba3b6c3a6e78689eba3b6eca088e28099
UHC 룶æ熉룶절룫怡룶æ熉룶절룫醫룶æ熉룶절’ 10001111101010111010100110100001111010011111101110001111101010111100000011111101100011111010001011101100101011101000111110101011101010011010000111101001111110111000111110101011110000001111110110001111101000101110110010100010100011111010101110101001101000011110100111111011100011111010101111000000111111011010000110101111 8faba9a1e9fb8fabc0fd8fa2ecae8faba9a1e9fb8fabc0fd8fa2eca28faba9a1e9fb8fabc0fda1af

SJIS-Win, EUC-JP: Classic charsets used mainly for encoding Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
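
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the converter's actual implementation. The function name `to_bit_strings` and the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) are assumptions made for illustration. It assumes that characters a charset cannot represent are replaced with "?" (0x3f), which matches the `3f` bytes in the ISO-8859-1 row.

```python
# Assumed mapping from the labels in the table to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Return (binary, hexadecimal) bit strings for text in the given codec."""
    # errors="replace" substitutes "?" (0x3f) for characters the charset lacks.
    data = text.encode(codec, errors="replace")
    # Each byte becomes eight binary digits, zero-padded on the left.
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


if __name__ == "__main__":
    # Print one row per charset for a single sample character.
    for label, codec in CODECS.items():
        binary, hexadecimal = to_bit_strings("怡", codec)
        print(label, binary, hexadecimal)
```

Python's codecs may not agree byte for byte with the converter for every character, so results for SJIS-WIN, EUC-JP, and UHC should be treated as approximate.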