To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 倭??踰??飮??沃???腋??維????? 10011000011000000011111100111111111001101111101000111111001111111001111101011010001111110011111110010111100000000011111100111111001111111110001111111100001111110011111110001000110110110011111100111111001111110011111100111111 98603f3fe6fa3f3f9f5a3f3f97803f3f3fe3fc3f3f88db3f3f3f3f3f
EUC-JP 倭??踰??飮??沃???腋??維????? 11001111110000010011111100111111111011001111110000111111001111111101110110111011001111110011111111001101111000000011111100111111001111111110011011111110001111110011111110110000110111010011111100111111001111110011111100111111 cfc13f3fecfc3f3fddbb3f3fcde03f3f3fe6fe3f3fb0dd3f3f3f3f3f
UTF-8 倭녾낮踰졿뤃飮껉턀沃샩쎈떈腋잙컾維귟짆琉멸강 111001011000000010101101111010111000010110111110111010111000001010101110111010001011100010110000111011001010000110111111111010111010010010000011111010011010001110101110111010101011101110001001111011011000010010000000111001101011001010000011111011001000001110101001111011001000111010001000111010111001011010001000111010001000010110001011111011001001111010011001111011001011101110111110111001111011011010101101111010101011011110011111111011001010011110000110111011111010011110001100111010111010100110111000111010101011000010010101 e580adeb85beeb82aee8b8b0eca1bfeba483e9a3aeeabb89ed8480e6b283ec83a9ec8e88eb9688e8858bec9e99ecbbbee7b6adeab79feca786efa78ceba9b8eab095
UHC 倭녾낮踰졿뤃飮껉턀沃샩쎈떈腋잙컾維귟짆琉멸강 1110100011011110100001101110101010110011101101111110101110110010101000001110011010001111101101001110101111100110100000111110101010110101100111001110100010101010100110001100111010111101111010111000101110011110111001001111110110011111111010111011000010011111111010111010101110000010111010001010001110010101111010111010010010111000111010101011000010101101 e8de86eab3b7ebb2a0e68fb4ebe683eab59ce8aa98cebdeb8b9ee4fd9febb09febab82e8a395eba4b8eab0ad

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)