How OpenAI’s Bot Overwhelmed a Seven-Person Company’s Website Like a DDoS Attack

January 11, 2025January 11, 2025

On Saturday, Triplegangers CEO Oleksandr Tomchuk faced a significant challenge when his company’s e-commerce website went offline. This incident was caused by a distributed denial-of-service (DDoS) attack, which turned out to be the result of a bot from OpenAI relentlessly trying to scrape data from his extensive online catalog.

Incident Overview: DDoS Attack by OpenAI Bot

Tomchuk quickly realized that the source of the disruption was an OpenAI bot that was attempting to download a vast amount of information from Triplegangers’ site. He explained, “We have over 65,000 products, each with its own page, and every page contains at least three images.” The bot was sending “tens of thousands” of requests to the server, resulting in hundreds of thousands of photo downloads along with detailed product descriptions.

OpenAI utilized approximately 600 IP addresses to conduct the scraping.
Tomchuk and his team are still analyzing server logs to determine the full extent of the attack.

The Impact on Triplegangers

Triplegangers, a small company with only seven employees, has dedicated over a decade to building what it claims is the largest database of “human digital doubles” online. This database includes 3D scanned images of real human models, which are sold to various industries, including video game development and digital art.

Despite having a terms of service page that prohibits unauthorized bot activity, the site’s protection was inadequate. Tomchuk noted that sites need to configure their robots.txt file correctly to prevent bots like OpenAI’s GPTBot from accessing their content. This file, known as the Robots Exclusion Protocol, is designed to inform search engines which parts of a website should not be crawled.

Understanding Robots.txt and Its Limitations

OpenAI states that it respects robots.txt configurations; however, there are limitations. For instance, it can take up to 24 hours for OpenAI’s bots to recognize updates to these files. This delay can leave websites vulnerable in the meantime.

Tomchuk expressed frustration, stating, “If a site isn’t using robots.txt properly, companies like OpenAI assume they can scrape data freely.” Unfortunately, the damage was already done, and Triplegangers faced unexpected server costs due to the excessive activity generated by the bot.

Mitigating Future Risks

By mid-week, after enduring days of bot attacks, Triplegangers implemented a correctly configured robots.txt file and established a Cloudflare account to block the GPTBot and other unwanted crawlers. Tomchuk reported that their site remained stable following these changes.

However, the challenge remains: Tomchuk has no clear way to identify what data was successfully scraped or to retrieve that material. He noted, “I found no way to contact OpenAI for assistance.” Furthermore, the long-promised opt-out tool from OpenAI has yet to be delivered.

Legal Implications and Industry Concerns

The situation is particularly precarious for Triplegangers due to the nature of their work, which involves scanning actual individuals. Laws such as the General Data Protection Regulation (GDPR) emphasize that companies cannot use images of individuals without consent.

Triplegangers’ site is a goldmine for AI crawlers because it includes detailed tagging of images by ethnicity, age, and other characteristics—valuable data for AI training.

A Call for Action: Protecting Online Businesses

Tomchuk wants other small online businesses to be aware of the risks posed by AI bots. “The only way to know if an AI bot is stealing your copyrighted material is to actively monitor your server logs,” he cautioned. The issue is widespread, with many businesses reporting similar experiences with bots crashing their sites.

According to recent research from DoubleVerify, there has been an 86% increase in “general invalid traffic” attributed to AI crawlers in 2024, highlighting the urgency of addressing this issue.

In conclusion, Tomchuk likened the behavior of AI bots to a “mafia shakedown,” asserting that companies should seek permission rather than scrape data without consent. As the digital landscape evolves, it is imperative for online businesses to implement robust measures to safeguard their content.

For more insights into AI and its impact on the digital economy, subscribe to TechCrunch’s AI-focused newsletter delivered every Wednesday.

Industry News

Adam Neumann’s Flow Secures $100M+ Investment, Doubling Valuation to $2.5B

Bysupport April 25, 2025April 25, 2025

Former WeWork CEO Adam Neumann has raised over $100 million for his proptech startup, Flow, which is now valued at approximately $2.5 billion. Prominent investor Andreessen Horowitz played a key role in this funding round. Neumann is optimistic about Flow’s potential to go public in the future. The company focuses on residential rentals and co-living spaces and previously raised $350 million in 2022, achieving a $1 billion valuation. Flow’s emergence comes amid scrutiny of Neumann’s controversial history with WeWork, which filed for bankruptcy in 2023. Observers are watching how Flow navigates the challenges of the proptech industry.

Apple Alerts Global Users of Recent Spyware Attacks: Protect Yourself Now!

Industry News

Apple to Unveil iPhone 18 in Two Exciting Phases in 2026: What to Expect!

Bysupport May 5, 2025May 5, 2025

Apple is preparing for the iPhone 18 launch, expected in 2026, with a phased release strategy. The Pro models are set to debut in fall 2026, followed by budget-friendly options, including the iPhone 16e, in spring 2027. Additionally, a foldable iPhone may also launch in fall 2026. Rumors suggest a slimmer model, possibly the iPhone 17 Air, could be released later this year. To reduce supply chain risks, Apple is considering manufacturing some iPhone 18 models in India, aiming to lessen reliance on Chinese production amid tariff challenges. Stay tuned for more updates on these developments.

Industry News

IBM Completes $6.4 Billion Acquisition of HashiCorp: A Major Move in Cloud Infrastructure

Bysupport February 27, 2025February 27, 2025

IBM has completed its $6.4 billion acquisition of HashiCorp, enhancing its cloud computing capabilities. This deal was expedited after receiving approvals from the U.S. Federal Trade Commission and the U.K.’s antitrust regulator. HashiCorp, known for its Terraform infrastructure management tool, became an attractive target following a controversial licensing shift to proprietary software in 2023. The acquisition aligns with IBM’s transition from traditional systems to a cloud-focused strategy, aiming to bolster its hybrid cloud offerings. IBM’s recent acquisition history includes major purchases like Red Hat and Apptio, reflecting its commitment to expanding its technology portfolio.

Industry News

Unlock the Power of Google’s Gemini: Ask Questions with Videos and On-Screen Content!

Bysupport March 3, 2025March 3, 2025

Google has enhanced its AI assistant, Gemini, with new features unveiled at the Mobile World Congress 2025 in Barcelona. A key addition is the “Screenshare” feature, allowing users to share their smartphone screens with Gemini and ask questions about displayed content, enhancing the shopping experience. Another notable feature is real-time video search, enabling users to record videos and receive immediate assistance based on what’s being filmed. These features will be available to Gemini Advanced users on the Google One AI Premium plan for Android devices later this month, promising a more intuitive interaction with technology.

Unlocking the Power of the Hottest AI Models: Applications, Benefits, and How to Use Them Effectively

Industry News

Discover the Top AI Models: Their Functions and How to Harness Their Power

Bysupport March 30, 2025March 30, 2025

AI models are evolving rapidly, with significant innovations from major companies like Google and startups such as OpenAI and Anthropic. This article examines advanced AI models released in 2024 and 2025, highlighting their unique features and applications. Notable releases include Google Gemini 2.5 for web and code development, the ChatGPT-4o Image Generator for text and image creation, and Stability AI’s Stable Virtual Camera for generating 3D scenes. Other models include OpenAI’s GPT 4.5 “Orion” and Anthropic’s Claude Sonnet 3.7. With over a million AI models available, staying updated on advancements is crucial for effective utilization.

Industry News

Rep. Jim Jordan Grills Big Tech: Did Biden Attempt to Censor AI?

Bysupport March 15, 2025March 15, 2025

House Judiciary Chair Jim Jordan has launched an inquiry into AI censorship, sending letters to 16 major tech companies, including Google and OpenAI, to investigate potential collusion with the Biden administration to suppress lawful speech in AI products. This inquiry reflects growing concerns among conservatives about AI’s role in public discourse. Jordan referenced a December report alleging government efforts to control AI to limit expression. Companies like OpenAI and Anthropic are adjusting their AI models in response to political pressures, while others, like Google, have restricted political responses. The situation underscores tensions between Silicon Valley and political influences on free speech.

How OpenAI’s Bot Overwhelmed a Seven-Person Company’s Website Like a DDoS Attack

Incident Overview: DDoS Attack by OpenAI Bot

The Impact on Triplegangers

Understanding Robots.txt and Its Limitations

Mitigating Future Risks

Legal Implications and Industry Concerns

A Call for Action: Protecting Online Businesses

Adam Neumann’s Flow Secures $100M+ Investment, Doubling Valuation to $2.5B

Apple to Unveil iPhone 18 in Two Exciting Phases in 2026: What to Expect!

Unlock the Power of Google’s Gemini: Ask Questions with Videos and On-Screen Content!

Discover the Top AI Models: Their Functions and How to Harness Their Power

Rep. Jim Jordan Grills Big Tech: Did Biden Attempt to Censor AI?

MuchBetter Unveils Prepaid Corporate Mastercard: Revolutionizing Business Expense Management

Nvidia Launches Powerful GeForce RTX 5060 Graphics Card for Desktops and Laptops: Revolutionizing Gaming Performance

Rainbow Six Siege X Launching June 10: Exciting Enhancements and a Commitment to Reduced Toxicity!

Join Our Newsletter

Recent Post

MuchBetter Unveils Prepaid Corporate Mastercard: Revolutionizing Business Expense…

Nvidia Launches Powerful GeForce RTX 5060 Graphics Card…

Rainbow Six Siege X Launching June 10: Exciting…

Newsletter

Subscribe to our MailChimp newsletter
and stay up to date with all events coming straight in your mailbox:

Incident Overview: DDoS Attack by OpenAI Bot

The Impact on Triplegangers

Understanding Robots.txt and Its Limitations

Mitigating Future Risks

Legal Implications and Industry Concerns

A Call for Action: Protecting Online Businesses

Similar Posts

Join Our Newsletter

Recent Post

Newsletter

Subscribe to our MailChimp newsletter and stay up to date with all events coming straight in your mailbox:

Subscribe to our MailChimp newsletter
and stay up to date with all events coming straight in your mailbox: