To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 永?????柔??永??揖??猿??永??揖 1000100101101001001111110011111100111111001111110011111110001111010111110011111100111111100010010110100100111111001111111001011101001011001111110011111110001001100011100011111100111111100010010110100100111111001111111001011101001011 89693f3f3f3f3f8f5f3f3f89693f3f974b3f3f898e3f3f89693f3f974b
EUC-JP 永?????柔??永??揖??猿??永??揖 1011000111001010001111110011111100111111001111110011111110111101110000000011111100111111101100011100101000111111001111111100110110101100001111110011111110110001111011100011111100111111101100011100101000111111001111111100110110101100 b1ca3f3f3f3f3fbdc03f3fb1ca3f3fcdac3f3fb1ee3f3fb1ca3f3fcdac
UTF-8 永띔퇌麟귝슅柔곗뒡永띔퍜揖곁독猿볦돽永띔퍜揖 111001101011000010111000111010111001110110010100111011011000011110001100111011111010011110110011111010101011011110011101111011001000101010000101111001101001111110010100111010101011001110010111111010111001001010100001111001101011000010111000111010111001110110010100111011011000110110011100111001101000111110010110111010101011001110000001111010111000111110000101111001111000110010111111111010111011001110100110111010111000111110111101111001101011000010111000111010111001110110010100111011011000110110011100111001101000111110010110 e6b0b8eb9d94ed878cefa7b3eab79dec8a85e69f94eab397eb92a1e6b0b8eb9d94ed8d9ce68f96eab381eb8f85e78cbfebb3a6eb8fbde6b0b8eb9d94ed8d9ce68f96
UHC 永띔퇌麟귝슅柔곗뒡永띔퍜揖곁독猿볦돽永띔퍜揖 1110011110110101101101101110101010110111100111011110110011101000100000101110011010011010100101111110101011110101101100001110110010001010100111011110011110110101101101101110101010111011100100111110101111100111101100001110011110110101101101101110101010111011100100111110110010001001101111111110011110110101101101101110101010111011100100111110101111100111 e7b5b6eab79dece882e69a97eaf5b0ec8a9de7b5b6eabb93ebe7b0e7b5b6eabb93ec89bfe7b5b6eabb93ebe7

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)