        ![line drawing of a lock and password field](/sites/default/files/styles/blog_hero_image_mobile/public/media/image/2025-11/managing_automated_bots.png?itok=BX1XYjCG) 

 

 

 Image

        ![Headshot for Greg OToole](/sites/default/files/styles/post_content_attribution_headshot/public/media/image/2025-07/Greg%20OToole.png?h=e45acf4d&itok=DDNUjN5p) 

 

 

 

 [Greg O'Toole](/people/contributors/greg-otoole) Web Technologist, Drupal Builder &amp; Open Source Advocate Acquia

 

 

 

## Collection

 [Drupal](/index%2ephp/blog/series/drupal) 

 

 

Own the Bots with Acquia Edge

 

[Learn More](https://docs.acquia.com/acquia-cloud-platform/add-ons/edge/overview)

 

 

 

 

Website Operations

# Managing Automated Bots and Search URLs on Acquia Cloud

 November 20, 2025 3 minute read

 [Share this blog post on LinkedIn

 ](https://www.linkedin.com/sharing/share-offsite/?url=https://www.acquia.com/index%2ephp/markdownify/node/58576) [Share this blog post on Twitter

 ](<https://twitter.com/intent/tweet?text=Managing Automated Bots and Search URLs on Acquia Cloud&url=https://www.acquia.com/index%2ephp/markdownify/node/58576&via=acquia>) [Share this blog post via Email

 ](<mailto:?subject=Managing Automated Bots and Search URLs on Acquia Cloud&body=https://www.acquia.com/index%2ephp/markdownify/node/58576>) 

 

AI crawlers are hungry for data, but website owners are barring them from internal /search pages. Learn the critical reasons why—from protecting server resources and preventing SEO dilution to securing private user data and blocking competitive scraping.

 

        ![line drawing of a lock and password field](/sites/default/files/styles/blog_hero_image_mobile/public/media/image/2025-11/managing_automated_bots.png?itok=BX1XYjCG) 

 

 

## Collection :

 [Drupal](/index%2ephp/blog/series/drupal) 

 

We are seeing a major influx across the web of LLM assistants, AI agents, data scrapers, and other sources of bot traffic, Whether or not the traffic is considered valuable or malicious, these bots will try to access your web applications. There are several practical reasons for blocking them from crawling your sites' search URLs (e.g., /search?q=…). Here are the most common:

## Protecting Server Resources

- **High resource cost**: Every AI bot request can trigger a database query or full-text index lookup. Unchecked, these crawls can spike CPU and database load, slowing the site for real users.
- **Crawl loops**: Search URLs can generate infinite variations (?q=a, ?q=b, etc.). A bot might follow and index all of them, creating an endless crawl that burns bandwidth and database cycles.
- **Search service exhaustion**: Some search services have query limits. Bots can quickly exhaust these, leading to outages or overage charges.

## SEO and Content Quality

- **Duplicate content**: Search result pages typically contain snippets of content already available on product or article pages. Indexing them can dilute SEO signals and clutter search-engine indexes.
- **Low-quality pages**: Many search result pages aren’t meaningful landing pages. Blocking them helps keep only valuable, well-curated URLs in public indexes.

## Privacy, Security, and Compliance

- **Sensitive queries**: Site searches sometimes reveal patterns of user interest or internal keywords that shouldn’t be exposed to third parties.
- **Data ownership**: Some owners prefer not to let AI training sets include their internal search results or user-generated queries.

## Reduce Scraping, Competitor Insight

- **Competitive intelligence**: Search URLs can expose product availability, pricing, or trends that competitors might mine.
- **Content theft**: Blocking bots makes it harder for automated tools to copy a site’s full content catalog through search listings.

## How It’s Done

Start by knowing [good bots from questionable and/or bad bots](https://www.acquia.com/blog/ai-bots-knowing-good-vs-bad "Good vs Bad Bots"). Then make a plan. For starters, you might add to the robots.txt the following to manage GPTBot by specifying the user agent, or all bots using the \*:

```plaintext
# Don't allow GPTBot to your search URLs.
User-agent: GPTBot
Disallow: /search

# Don't allow any bots to access your search URLs.
User-agent: *
Disallow: /search
```

This approach is limited to good bots that will respect robots.txt. Your team may augment your strategy and [use WAF/bot-management tools](https://www.acquia.com/blog/ai-bots-importance-web-application-firewalls-your-business "WAF and Bot Management") (Acquia Edge, Cloudflare, or Akamai) for a more robust approach to manage, block or rate-limit AI bots.

**Bottom line:** Search URLs are often dynamically generated, resource-intensive, and low value for indexing, so many site owners proactively block AI and other non-human crawlers from them. Then focus on [optimizing your landing pages](https://www.acquia.com/blog/acquia-seo-content-insights-powered-conductor "content optimization") for the highest quality content for your organization.



 

 Image

        ![Headshot for Greg OToole](/sites/default/files/styles/post_content_attribution_headshot/public/media/image/2025-07/Greg%20OToole.png?h=e45acf4d&itok=DDNUjN5p) 

 

 

 

 [Greg O'Toole](/people/contributors/greg-otoole) Web Technologist, Drupal Builder &amp; Open Source Advocate Acquia

 

Greg, a veteran of web technology since 1994, has helped countless enterprise customers align their business goals with strategic web solutions. His expertise in digital transformation helps companies deliver digital experiences that drive results.

 

 

 

 

 [Share this blog post on LinkedIn

 ](https://www.linkedin.com/sharing/share-offsite/?url=https://www.acquia.com/index%2ephp/markdownify/node/58576) [Share this blog post on Twitter

 ](<https://twitter.com/intent/tweet?text=Managing Automated Bots and Search URLs on Acquia Cloud&url=https://www.acquia.com/index%2ephp/markdownify/node/58576&via=acquia>) [Share this blog post via Email

 ](<mailto:?subject=Managing Automated Bots and Search URLs on Acquia Cloud&body=https://www.acquia.com/index%2ephp/markdownify/node/58576>) 

 

 

 

## Keep Reading

 [View More Resources](/index%2ephp/resources) 

  [Image

        ![](/sites/default/files/styles/card_tablet/public/media/image/2025-07/Blog%20Graphic-Bots_Good%20vs%20Bad.png?h=4362216e&itok=pnZDPExv) 

 

 

 

 Blog Article 4 min 

######  AI Bots - Good vs. Bad

 A look at how to distinguish between "good" and "bad" AI bots, and tactical recommendations on how a WAF can play a pivotal management role. 

 Digital Security &amp; Governance  

 AI Insights 

 ](/blog/ai-bots-knowing-good-vs-bad)



  [Image

        ![](/sites/default/files/styles/card_tablet/public/media/image/2025-07/Blog%20Graphic-Bots_High%20Intent%20Conversions.png?h=4362216e&itok=1iwCPqwY) 

 

 

 

 Blog Article 4 min 

######  Use AI Traffic to Your Benefit

 Record-high search impressions but falling CTRs stem from SERP’s zero-click features. Focus on high-intent users, content, and AI-driven SEO. 

 Digital Security &amp; Governance  

 AI Insights 

 ](/blog/ai-bots-strategically-turning-your-high-search-impression-low-click-traffic-high-intent)