To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲щ?踰?━臾??永??逾??碎⑦?搖 1110000110011111100001001000101100111111111001101111101000111111100001001010101011100100011010110011111100111111100010010110100100111111001111111110011110100101001111110011111111100001111010101000011101000110001111111001110110001010 e19f848b3fe6fa3f84aae46b3f3f89693f3fe7a53f3fe1ea87463f9d8a
EUC-JP 癲щ?踰?━臾??永??逾??碎??搖 11100010101000011010011111101011001111111110110011111100001111111010100010101100111001111100110000111111001111111011000111001010001111110011111111101110101001110011111100111111111000101110110000111111001111111101100111101010 e2a1a7eb3fecfc3fa8ace7cc3f3fb1ca3f3feea73f3fe2ec3f3fd9ea
UTF-8 癲щ톩踰섓━臾뺥뮍永띠뮊逾껇쁻碎⑦뮄搖 1110011110011001101100101101000110001001111011011000011010101001111010001011100010110000111011001000010010010011111000101001010010000001111010001000011110111110111010111011101010100101111010111010111010001101111001101011000010111000111010111001110110100000111010111010111010001010111010011000000010111110111010101011101110000111111011001000000110111011111001111010001010001110111000101001000110100110111010111010111010000100111001101001000010010110 e799b2d189ed86a9e8b8b0ec8493e29481e887beebbaa5ebae8de6b0b8eb9da0ebae8ae980beeabb87ec81bbe7a28ee291a6ebae84e69096
UHC 癲щ톩踰섓━臾뺥뮍永띠뮊逾껇쁻碎⑦뮄搖 1110111110100110101011001110101110110111100000011110101110110010100110001110111110100110101011001110101110101100100101011110110110010010100110101110011110110101101101101110110010010010100110001110101110110101100000111110100010011000100000101110000111101111101010001110110110010010100100111110100011110100 efa6acebb781ebb298efa6acebac95ed929ae7b5b6ec9298ebb583e89882e1efa8ed9293e8f4

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)