To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????v?????????vB 001111110011111100111111001111110011111100111111001111110011111100111111011101100011111100111111001111110011111100111111001111110011111100111111001111110111011001000010 3f3f3f3f3f3f3f3f3f763f3f3f3f3f3f3f3f3f7642
SJIS-WIN 闕ウ莨∵▲魄滂スィv闕ウ莨∵▲魄滂スィvB 111010001000110110110011111001001011110010000001111001101000000110100011111010011010111010011111111011111011110110101000011101101110100010001101101100111110010010111100100000011110011010000001101000111110100110101110100111111110111110111101101010000111011001000010 e88db3e4bc81e681a3e9ae9fefbda876e88db3e4bc81e681a3e9ae9fefbda87642
EUC-JP 闕ウ莨∵▲魄滂スィv闕ウ莨∵▲魄滂スィvB 111011111110110110001110101100111110100010111110101000101110100010100010101001011111001010110000110111101111000110001110101111011000111010101000011101101110111111101101100011101011001111101000101111101010001011101000101000101010010111110010101100001101111011110001100011101011110110001110101010000111011001000010 efed8eb3e8bea2e8a2a5f2b0def18ebd8ea876efed8eb3e8bea2e8a2a5f2b0def18ebd8ea87642
UTF-8 闕ウ莨∵▲魄滂スィv闕ウ莨∵▲魄滂スィvB 111010011001011110010101111011111011110110110011111010001000111010101000111000101000100010110101111000101001011010110010111010011010110110000100111001101011101110000010111011111011110110111101111011111011110110101000011101101110100110010111100101011110111110111101101100111110100010001110101010001110001010001000101101011110001010010110101100101110100110101101100001001110011010111011100000101110111110111101101111011110111110111101101010000111011001000010 e99795efbdb3e88ea8e288b5e296b2e9ad84e6bb82efbdbdefbda876e99795efbdb3e88ea8e288b5e296b2e9ad84e6bb82efbdbdefbda87642
UHC 闕??∵▲魄滂??v闕??∵▲魄滂??vB 11001111111101000011111100111111101000011111000110100001111000111101101111011110110110111011010100111111001111110111011011001111111101000011111100111111101000011111000110100001111000111101101111011110110110111011010100111111001111110111011001000010 cff43f3fa1f1a1e3dbdedbb53f3f76cff43f3fa1f1a1e3dbdedbb53f3f7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)