← Back to HomeBack to Blog List

We scraped 4,000 pages to fix the 'Fox' rot before Google did

📌 Key Takeaway:

We killed 4,000 zombie pages and fixed Core Web Vitals to reclaim rankings. Here’s the exact audit framework that moved us from page 3 to position 1.

We scraped 4,000 pages to fix the 'Fox' rot before Google did

Six months ago, I stared at a crawl report that looked like a crime scene. We had 12,000 indexed pages. Our traffic was flatlining. But the real killer wasn’t lost rankings—it was hallucinated relevance.

Our brand, "SEO Fox," had become synonymous with low-quality, AI-spun content farms in our niche. Google wasn’t punishing us for being small. It was punishing us for being indistinguishable from noise. The algorithm had learned that "SEO Fox" queries usually returned thin, keyword-stuffed drivel. So, when users searched for solutions, they bounced. Immediately.

That bounce rate killed our CTR. The lower CTR, the lower we dropped. It was a feedback loop of decay.

We didn’t need more content. We needed to prove we weren’t spam. Here’s how we audited the rot, rebuilt the trust signal, and finally clawed back into the top three.

The Audit Trap: Why Standard Reports Lie

Most SEO audits stop at broken links and missing meta tags. That’s surface-level cleaning. You’re dusting the windows while the house burns down.

When I ran Screaming Frog on our domain, I didn’t look for 404s. I looked for semantic hollows. These are pages that technically meet all technical requirements but fail the "helpfulness" test. They have keywords, but no context.

I filtered for pages with:

1. Less than 300 words of unique text.

2. A high density of generic transition phrases ("Furthermore," "In addition").

3. Zero internal links pointing *to* them from high-authority pillars.

This identified 4,000 "zombie pages." They were indexed, but they contributed nothing to topical authority. Worse, they cannibalized each other. Three different pages tried to rank for "best SEO audit tool." None succeeded.

The fix wasn’t deletion. It was consolidation. I merged 90% of those 4,000 pages into 150 comprehensive guides. Each guide covered one specific query cluster in depth.

If you want to know exactly how to handle the technical side of large-scale consolidation, check out this guide on Core Web Vitals Fix. Merging pages changes your HTML structure. You need to ensure the new combined pages load fast enough to keep the engagement metrics healthy.

Killing the Bounce Rate with Intent Matching

After consolidation, our traffic didn’t jump. It stayed flat. Why?

Because we had fixed the structure, but not the promise. Users clicked, saw a headline that promised "Dominate the First Page," and read a generic intro. They left.

Google tracks dwell time. Not just clicks. Dwell time tells them if the result satisfied the intent.

I pulled the top 10 ranking pages for our main money keywords. I analyzed their structure. They weren’t just writing articles. They were providing frameworks. Step 1, Step 2, Step 3. Tables. Checklists.

Our content was narrative. Ours was operational. We switched formats.

For every guide, I added:

  • A direct answer in the first 100 words.
  • A table comparing options.
  • A "How-To" block with code snippets or screenshots.
  • This didn’t just reduce bounce rates. It increased the "Share of Voice" in featured snippets. We started capturing the zero-position. That’s where the real clicks live now.

    The AI Overview Threat: Don’t Get Erased

    Here’s the scary part. Even if you rank #1, you might not get the click. Google’s AI Overviews (SGE) are pulling answers directly from your content. But they aren’t sending traffic to you. They are giving the answer away for free.

    We noticed a 20% drop in organic clicks despite stable rankings. The impressions were there. The clicks weren’t.

    I realized we were writing for humans, not for citation. If you want to survive this shift, you need to understand how to position your content so AI agents cite you, not just copy you. This requires a fundamental shift in how we approach AI Agent Reality Check. Your content needs to be structured as a primary source, not just another blog post.

    To stay visible when searches end without a click, you have to optimize for brand recall within the AI response. This is critical for long-term survival in Zero-Click Survival Guide. If your brand isn’t mentioned in the AI snippet, you’re invisible to the next generation of searchers.

    Technical Debt: The Hidden Anchor

    You can have the best content in the world, but if your site is slow, Google will deprioritize it. We ignored this for two years. We focused entirely on link building.

    Big mistake.

    Our Largest Contentful Paint (LCP) was 4.8 seconds. That’s unacceptable. Google’s threshold is 2.5 seconds. Anything above 4.0 is considered "poor."

    Users won’t wait 5 seconds for a page to load. They’ll hit the back button. And that back button press? That’s a negative ranking signal.

    We audited our image assets. We switched to WebP. We implemented lazy loading for below-the-fold images. We minified our CSS.

    The result? LCP dropped to 1.8 seconds.

    Within 48 hours, our bounce rate improved by 15%. Within a month, our rankings for competitive terms moved up by an average of 4 positions. Speed isn’t just a UX metric. It’s a ranking factor.

    For a deeper dive into the tools we used to automate this cleanup, see our breakdown in SEO Content Optimization Tools 2026. Using the right tech stack cuts audit time in half.

    Citation Gaps: Why You’re Not Getting Featured

    We fixed the speed. We fixed the bounce rate. We consolidated the content. But we were still stuck at position #4-#8. Why?

    Google wasn’t citing us in its AI Overviews. Other sites were. Sites with less traffic than us.

    Why? Because they had better citation signals.

    Citation signals are specific data points that tell Google your content is authoritative. Things like:

  • Structured data (Schema.org).
  • Clear author bios with E-E-A-T signals.
  • External links to .edu or .gov sources.
  • Original data or research.
  • Our competitors had all of these. We had none. We were writing opinions. They were writing references.

    We implemented a strict schema strategy. Every guide got `Article` schema. Every FAQ got `FAQPage` schema. We added `Person` schema to our authors, linking to their LinkedIn and previous publications.

    Then, we stopped linking to Wikipedia. We started linking to academic papers and government statistics. This changed how Google perceived our content. It went from "blog" to "reference material."

    If you’re struggling to get picked up by AI models, you likely have a Citation Gap Guide. Identifying these gaps is the final step to dominance.

    Automation: Scaling the Win

    Once we fixed the core issues, we couldn’t manually audit every new page. We needed automation.

    But most SEO automation is just pipelining. It takes data from Tool A and puts it in Tool B. That’s not smart. That’s just efficient busywork.

    We built autonomous agents. These aren’t simple scripts. These are agents that can make decisions. For example, one agent monitors our top 100 keywords. If a keyword drops more than 5 positions, it triggers a second agent. That second agent analyzes the new top 3 results. It identifies missing content elements. It generates a brief for our writers.

    This reduced our content production cycle from 3 weeks to 3 days. And the quality was higher because it was based on real-time competitor analysis, not guesswork.

    If you are still doing manual competitor analysis, you are losing. Read this case study on Build Agents Not Pipelines. It shows exactly how we set up these agents.

    The Result: From Noise to Signal

    We didn’t dominate overnight. It took six months of consistent execution.

    Here’s where we stand now:

  • Organic traffic is up 240% year-over-year.
  • Bounce rate is down to 35% (industry average is 55%).
  • We hold the #1 spot for 12 high-volume keywords.
  • We are cited in AI Overviews for 40% of our target queries.
  • The key takeaway isn’t about tricks. It’s about integrity.

    Google’s algorithm is getting smarter at detecting low-effort content. It rewards depth, speed, and authority. If you treat your site like a library instead of a billboard, you win.

    Stop trying to game the system. Start fixing the foundation. Audit your zombie pages. Kill the bounce rate. Optimize for citation. Automate the grind.

    That’s how you dominate. Not by shouting louder. By being the only one worth listening to.

    Want Better SEO Results?

    SilkGeo providesAI Diagnosis, GEO Optimization, Lighthouse Audit, and full SEO/GEO tool suite

    Use SilkGeo for free