To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ????→?循??嚥△?蚓??儒??魚 0011111100111111001111110011111110000001101010000011111110001111011110100011111100111111100110101000101110000001101000100011111111100101011011010011111100111111100011101111001000111111001111111000101110011011 3f3f3f3f81a83f8f7a3f3f9a8b81a23fe56d3f3f8ef23f3f8b9b
EUC-JP ???沅→?循??嚥△?蚓??儒??魚 00111111001111110011111110001111110001101110100110100010101010100011111110111101110110110011111100111111110100111110101110100010101001000011111111101001110011100011111100111111101111001111010000111111001111111011010111111011 3f3f3f8fc6e9a2aa3fbddb3f3fd3eba2a43fe9ce3f3fbcf43f3fb5fb
UTF-8 蓮잙슣沅→쾮循됰젧嚥△뫁蚓껆뫀儒밸윪魚 111011111010011010011001111011001001111010011001111011001000101010100011111001101011001010000101111000101000011010010010111011001011111010101110111001011011111010101010111010111001000010110000111011001010000010100111111001011001101010100101111000101001011010110011111010111010101110000001111010001001101010010011111010101011101110000110111010111010101110000000111001011000010010010010111010111011000010111000111011001001110010101010111010011010110110011010 efa699ec9e99ec8aa3e6b285e28692ecbeaee5beaaeb90b0eca0a7e59aa5e296b3ebab81e89a93eabb86ebab80e58492ebb0b8ec9caae9ad9a
UHC 蓮잙슣沅→쾮循됰젧嚥△뫁蚓껆뫀儒밸윪魚 1110011011100101100111111110101110011010101011111110101010110110101000011110011010110010100001011110001011100000100010011110101110100000100111111110011010111111101000011110001010010001101001011110110011100010100000111110011110010001101001001110101011100011101110011110101110011111101010011110010111100000 e6e59feb9aafeab6a1e6b285e2e089eba09fe6bfa1e291a5ece283e791a4eae3b9eb9fa9e5e0

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)