To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????U}?????????U{^ 0011111100111111001111110011111100111111001111110011111100111111001111110101010101111101001111110011111100111111001111110011111100111111001111110011111100111111010101010111101101011110 3f3f3f3f3f3f3f3f3f557d3f3f3f3f3f3f3f3f3f557b5e
SJIS-WIN 竪他属竪属揃辰谷続U}竪他属竪属揃辰谷続U{^ 1001001001000111100100011011110010010001101011101001001001000111100100011010111010010001101101011001001001000011100100100100101010010001101100010101010101111101100100100100011110010001101111001001000110101110100100100100011110010001101011101001000110110101100100100100001110010010010010101001000110110001010101010111101101011110 924791bc91ae924791ae91b59243924a91b1557d924791bc91ae924791ae91b59243924a91b1557b5e
EUC-JP 竪他属竪属揃辰谷続U}竪他属竪属揃辰谷続U{^ 1100001110101000110000101011111011000010101100001100001110101000110000101011000011000010101101111100001110100100110000111010101111000010101100110101010101111101110000111010100011000010101111101100001010110000110000111010100011000010101100001100001010110111110000111010010011000011101010111100001010110011010101010111101101011110 c3a8c2bec2b0c3a8c2b0c2b7c3a4c3abc2b3557dc3a8c2bec2b0c3a8c2b0c2b7c3a4c3abc2b3557b5e
UTF-8 竪他属竪属揃辰谷続U}竪他属竪属揃辰谷続U{^ 1110011110101011101010101110010010111011100101101110010110110001100111101110011110101011101010101110010110110001100111101110011010001111100000111110100010111110101100001110100010110000101101111110011110110110100110100101010101111101111001111010101110101010111001001011101110010110111001011011000110011110111001111010101110101010111001011011000110011110111001101000111110000011111010001011111010110000111010001011000010110111111001111011011010011010010101010111101101011110 e7abaae4bb96e5b19ee7abaae5b19ee68f83e8beb0e8b0b7e7b69a557de7abaae4bb96e5b19ee7abaae5b19ee68f83e8beb0e8b0b7e7b69a557b5e
UHC 竪他?竪??辰谷?U}竪他?竪??辰谷?U{^ 111000101011010111110110111000100011111111100010101101010011111100111111111100101110001111001101110110110011111101010101011111011110001010110101111101101110001000111111111000101011010100111111001111111111001011100011110011011101101100111111010101010111101101011110 e2b5f6e23fe2b53f3ff2e3cddb3f557de2b5f6e23fe2b53f3ff2e3cddb3f557b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)