To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 罌??魏??幽??憶??飮??循??永??飮 111000111010000000111111001111111110100110110000001111110011111110010111010010000011111100111111100010011010111100111111001111111001111101011010001111110011111110001111011110100011111100111111100010010110100100111111001111111001111101011010 e3a03f3fe9b03f3f97483f3f89af3f3f9f5a3f3f8f7a3f3f89693f3f9f5a
EUC-JP 罌??魏??幽??憶??飮??循??永??飮 111001101010001000111111001111111111001010110010001111110011111111001101101010010011111100111111101100101011000100111111001111111101110110111011001111110011111110111101110110110011111100111111101100011100101000111111001111111101110110111011 e6a23f3ff2b23f3fcda93f3fb2b13f3fddbb3f3fbddb3f3fb1ca3f3fddbb
UTF-8 罌삼퐟魏섌찛幽뚰뮉憶귣뵃飮김삃循놃뫝永띠닂飮 111001111011110110001100111011001000001010111100111011011001000010011111111010011010110110001111111011001000010010001100111011001011000010011011111001011011100110111101111010111001101010110000111010111010111010001001111001101000011010110110111010101011011110100011111010111011010110000011111010011010001110101110111010101011100110000000111011001000001010000011111001011011111010101010111010111000011010000011111010111010101110011101111001101011000010111000111010111001110110100000111010111000101110000010111010011010001110101110 e7bd8cec82bced909fe9ad8fec848cecb09be5b9bdeb9ab0ebae89e686b6eab7a3ebb583e9a3aeeab980ec8283e5beaaeb8683ebab9de6b0b8eb9da0eb8b82e9a3ae
UHC 罌삼퐟魏섌찛幽뚰뮉憶귣뵃飮김삃循놃뫝永띠닂飮 1110010110100010101110111110111110111101100010001110101011100000100110001110100110101001100110111110101011101011100011001110110110010010100101111110010111100011100000101110101110010100100010011110101111100110101100011110100010011000100010101110001011100000100001101110110110010001101111011110011110110101101101101110110010001000100010111110101111100110 e5a2bbefbd88eae098e9a99beaeb8ced9297e5e382eb9489ebe6b1e8988ae2e086ed91bde7b5b6ec888bebe6

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)