In a new report, Cloudflare accused the artificial intelligence service Perplexity of using unauthorized methods to collect website data, even from sites that explicitly block its bots through a robots.txt file or through security systems.
According to the report, Perplexity was able to display new website content that had not previously been indexed, even though its official PerplexityBot and Perplexity-User crawlers were blocked in robots.txt. It was also able to bypass firewall rules designed to prevent automated crawling. Cloudflare pointed out that Perplexity relies on a generic browser that emulates Google Chrome on macOS, which allows it to get around these restrictions.
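
To illustrate why a robots.txt block by crawler name does not cover a generic browser identity, here is a minimal sketch using Python's standard `urllib.robotparser`. The robots.txt contents, the `example.com` URL, and the Chrome-on-macOS user-agent string are assumptions made up for illustration; they are not taken from the Cloudflare report.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt that blocks the two declared Perplexity crawlers
# and leaves every other user agent unrestricted.
ROBOTS_TXT = """\
User-agent: PerplexityBot
Disallow: /

User-agent: Perplexity-User
Disallow: /

User-agent: *
Allow: /
"""

# Illustrative generic browser identity (assumed, not quoted from the report).
GENERIC_CHROME_MAC = (
    "Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) "
    "AppleWebKit/537.36 (KHTML, like Gecko) Chrome/124.0.0.0 Safari/537.36"
)


def build_parser(robots_txt: str) -> RobotFileParser:
    """Parse robots.txt text without making a network request."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser


def is_allowed(parser: RobotFileParser, user_agent: str, url: str) -> bool:
    """Return True if the rules permit this user agent to fetch the URL."""
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    rules = build_parser(ROBOTS_TXT)
    page = "https://example.com/news/article"  # placeholder URL
    for agent in ("PerplexityBot", "Perplexity-User", GENERIC_CHROME_MAC):
        print(agent[:40], "->", is_allowed(rules, agent, page))
```

In this sketch the two named crawlers are refused while the browser-like string is permitted, because robots.txt rules match only the name a client declares and depend on the client choosing to honor them.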