To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 永??逸?ゥ韋?┰永??逸??惟??? 100010010110100100111111001111111000100011101101001111111000001101000100111010001110100000111111100001001011101110001001011010010011111100111111100010001110110100111111001111111000100011010010001111110011111100111111 89693f3f88ed3f8344e8e83f84bb89693f3f88ed3f3f88d23f3f3f
EUC-JP 永??逸?ゥ韋?┰永??逸??惟??? 101100011100101000111111001111111011000011101111001111111010010110100101111100001110101000111111101010001011110110110001110010100011111100111111101100001110111100111111001111111011000011010100001111110011111100111111 b1ca3f3fb0ef3fa5a5f0ea3fa8bdb1ca3f3fb0ef3f3fb0d43f3f3f
UTF-8 永띠닂逸썽ゥ韋얠┰永띠닂逸썹쳽惟듯뮉玲 111001101011000010111000111010111001110110100000111010111000101110000010111010011000000010111000111011001000110110111101111000111000001010100101111010011001111110001011111011001001011010100000111000101001010010110000111001101011000010111000111010111001110110100000111010111000101110000010111010011000000010111000111011001000110110111001111011001011001110111101111001101000001110011111111010111001001110101111111010111010111010001001111011111010011010101101 e6b0b8eb9da0eb8b82e980b8ec8dbde382a5e99f8bec96a0e294b0e6b0b8eb9da0eb8b82e980b8ec8db9ecb3bde6839feb93afebae89efa6ad
UHC 永띠닂逸썽ゥ韋얠┰永띠닂逸썹쳽惟듯뮉玲 1110011110110101101101101110110010001000100010111110110011101111101111011110100110101011101001011110101011011111101111101110110010100110101111011110011110110101101101101110110010001000100010111110110011101111101111011110011110101011101000001110101011101110101101011110110110010010100101111110011110111111 e7b5b6ec888becefbde9aba5eadfbeeca6bde7b5b6ec888becefbde7aba0eaeeb5ed9297e7bf

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)