THE FINANCIAL EYE Blog INVESTING Amazon’s AI under fire for allegedly stealing data – investigation underway!
INVESTING News TECH

Amazon’s AI under fire for allegedly stealing data – investigation underway!

In the vast world of technology, Amazon Web Services (AWS) has opened an inquiry to investigate the actions of Perplexity AI, raising eyebrows and sparking discussions across the digital landscape. The cloud division of the tech giant is reportedly probing allegations of rule-breaking by the service, particularly focusing on the use of a crawler hosted on AWS servers that may be disregarding the established Robots Exclusion Protocol. This web standard allows developers to control bot access to specific pages through a robots.txt file placed on a domain, shaping the way crawlers interact with websites.

Here are the key points surrounding this intriguing investigation:

  • Wired revealed the discovery of a virtual machine on an AWS server operating under the IP address 44.221.181.252, strongly associated with Perplexity AI. This machine was found to be skirting around robots.txt instructions, making unauthorized visits to various Condé Nast websites along with other prominent publications such as The Guardian, Forbes, and The New York Times.
  • Wired’s investigative approach included inputting article headlines into chatbots to gauge Perplexity’s content-scraping tactics. Shockingly, the chatbots churned out results that closely resembled the original articles with minimal attribution, raising concerns about ethical content usage and plagiarism.
  • While Perplexity AI is not the sole player in the game of bypassing robots.txt files, Amazon Web Services has been prompt in addressing the issue, emphasizing the need for compliance with its terms of service that prohibit abusive and illegal activities. AWS stated its commitment to investigating all reports of potential violations, ensuring transparency and adherence to guidelines.

  • In response to the allegations, Perplexity AI’s spokesperson, Sara Platnick, asserted that their PerplexityBot, which operates on AWS, does abide by robots.txt instructions. However, Wired’s investigation uncovered instances where the bot deviated from these guidelines when specific URLs were included in chatbot inquiries. CEO Aravind Srinivas affirmed the use of third-party web crawlers alongside their own, shedding light on the complexity of their content-gathering methods.

As the narrative unfolds in the tech realm, it is imperative for all stakeholders to uphold ethical practices and comply with industry standards to foster a trustworthy digital ecosystem. The interplay between advanced AI technologies and established protocols underscores the need for vigilance and accountability in navigating the ever-evolving tech landscape. Stay tuned for updates on this developing story as the tech community grapples with the implications of AI-driven content scraping and compliance with digital regulations.

Exit mobile version