To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????}?????????{^ 001111110011111100111111001111110011111100111111001111110011111100111111011111010011111100111111001111110011111100111111001111110011111100111111001111110111101101011110 3f3f3f3f3f3f3f3f3f7d3f3f3f3f3f3f3f3f3f7b5e
SJIS-WIN 衍??諭???l?}衍??諭???l?{^ 100111111010010100111111001111111001011101000000001111110011111100111111100000101000110000111111011111011001111110100101001111110011111110010111010000000011111100111111001111111000001010001100001111110111101101011110 9fa53f3f97403f3f3f828c3f7d9fa53f3f97403f3f3f828c3f7b5e
EUC-JP 衍??諭???l?}衍??諭???l?{^ 110111101010011100111111001111111100110110100001001111110011111100111111101000111110110000111111011111011101111010100111001111110011111111001101101000010011111100111111001111111010001111101100001111110111101101011110 dea73f3fcda13f3f3fa3ec3f7ddea73f3fcda13f3f3fa3ec3f7b5e
UTF-8 衍됰맚諭싨에類l죷}衍됰맚諭싨에類l죷{^ 111010001010000110001101111010111001000010110000111010111010011110011010111010001010101110101101111011001000101110101000111011001001011110010000111011111010011110010000111011111011110110001100111011001010001110110111011111011110100010100001100011011110101110010000101100001110101110100111100110101110100010101011101011011110110010001011101010001110110010010111100100001110111110100111100100001110111110111101100011001110110010100011101101110111101101011110 e8a18deb90b0eba79ae8abadec8ba8ec9790efa790efbd8ceca3b77de8a18deb90b0eba79ae8abadec8ba8ec9790efa790efbd8ceca3b77b5e
UHC 衍됰맚諭싨에類l죷}衍됰맚諭싨에類l죷{^ 111001101110001010001001111010111001000010101010111010111011000110011010111001101011111110100001111010111011101010100011111011001010000110010001011111011110011011100010100010011110101110010000101010101110101110110001100110101110011010111111101000011110101110111010101000111110110010100001100100010111101101011110 e6e289eb90aaebb19ae6bfa1ebbaa3eca1917de6e289eb90aaebb19ae6bfa1ebbaa3eca1917b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)