To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 蘂??意??儒???△?釗??巡??齬??異 111001010100000100111111001111111000100011010011001111110011111110001110111100100011111100111111001111111000000110100010001111111111101110111011001111110011111110001111100001000011111100111111111010101001011100111111001111111000100011011001 e5413f3f88d33f3f8ef23f3f3f81a23ffbbb3f3f8f843f3fea973f3f88d9
EUC-JP 蘂??意??儒??璵△?釗??巡??齬??異 111010011010001000111111001111111011000011010101001111110011111110111100111101000011111100111111100011111100110011100110101000101010010000111111100011111110001110100110001111110011111110111101111001000011111100111111111100111111011100111111001111111011000011011011 e9a23f3fb0d53f3fbcf43f3f8fcce6a2a43f8fe3a63f3fbde43f3ff3f73f3fb0db
UTF-8 蘂띠눖意덄뙴儒삠걶璵△뫀釗껇뿥巡볤퍘齬잙벊異 111010001001100010000010111010111001110110100000111010111000100010010110111001101000010010001111111010111000110110000100111010111001100110110100111001011000010010010010111011001000001010100000111010101011000110110110111001111001001010110101111000101001011010110011111010111010101110000000111010011000011110010111111010101011101110000111111010111011111110100101111001011011011110100001111010111011001110100100111011011000110110011000111010011011110110101100111011001001111010011001111010111011001010001010111001111001010110110000 e89882eb9da0eb8896e6848feb8d84eb99b4e58492ec82a0eab1b6e792b5e296b3ebab80e98797eabb87ebbfa5e5b7a1ebb3a4ed8d98e9bdacec9e99ebb28ae795b0
UHC 蘂띠눖意덄뙴儒삠걶璵△뫀釗껇뿥巡볤퍘齬잙벊異 1110011111011110101101101110110010000111101100001110101111110010100010001110011110001100101101111110101011100011101110111110001110000001100111001110011010100101101000011110001010010001101001001110000111110010100000111110100010010111101001011110001011011110100100111110101010111011100011111110010111100001100111111110101110010011101011011110110010110110 e7deb6ec87b0ebf288e78cb7eae3bbe3819ce6a5a1e291a4e1f283e897a5e2de93eabb8fe5e19feb93adecb6

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)