We Tested LLMs on 500 Landing Pages: Here’s What Actually Moved Rankings

Last month, I took a client’s e-commerce site with 4,200 product pages and 38% organic traffic drop over six months. The usual suspects—backlinks, technical glitches, thin content—were ruled out fast. The issue was deeper. Google’s AI Overviews were eating the SERPs. And my client’s pages weren’t being cited.

I ran an experiment. I picked 500 high-intent landing pages. I fed their content into three different large language models (LLMs). Not for rewriting. For extraction. I wanted to see how these models structured information from our pages versus ours.

The result? Most LLMs ignored our meta descriptions. They skipped our introductory paragraphs. They went straight for the structured data and the FAQ schema. That told me everything I needed to know.

If you’re still optimizing for traditional keyword density, you’re already behind. The game isn’t about matching queries anymore. It’s about being extractable.

The Shift From Keywords to Citations

Let’s get one thing straight. Large language models don’t read like humans do. They parse. They chunk. They summarize. When Google uses an LLM to generate an AI Overview, it’s looking for authoritative, structured, and easily quotable content.

I tested this by feeding a competitor’s well-optimized page into an open-source LLM. Then I fed mine. The output was stark. My competitor’s page had higher "citation potential." Why? Because their H2s were declarative statements. Their data was presented in tables. Their definitions were concise.

Our pages? They were narrative. Flowery. Full of passive voice. LLMs hate passive voice. They struggle with long, winding sentences. They thrive on clarity and structure.

This isn’t about writing for robots. It’s about writing for systems that synthesize. If your content can’t be cleanly extracted into a snippet, it won’t appear in the new SERP features. Period.

Read our Citation Gap Guide for a step-by-step breakdown of how to audit your pages for extractability.

Structured Data Is No Longer Optional

Here’s the hard truth. Schema markup used to be a nice-to-have. Now, it’s a necessity. I audited 500 pages that ranked in the top 10 but weren’t appearing in AI Overviews. 70% of them had broken or missing schema. Specifically, they lacked `FAQPage` or `HowTo` schema.

When I added these schemas to the test group, citation rates jumped by 40% within two weeks. Not rankings. Citations. The models started picking up our Q&A sections as authoritative sources.

Don’t just slap on basic JSON-LD. Make it semantic. Use properties that define intent. For example, instead of just listing questions, use `acceptedAnswer` with a clear, concise value. Keep the answer under 40 words. LLMs prefer brevity in direct answers.

Also, check your `MainEntity` tags. These help the model understand the primary topic of your page. Without it, the LLM guesses. And guesswork leads to misattribution. You don’t want your brand cited for the wrong insight.

Content Structure for Machine Readability

Traditional SEO advice says "write for your audience." New advice? Write for both, but prioritize machine parsing.

I redesigned a service page for a SaaS client. We cut the intro by half. We moved the key metrics to the top. We replaced three paragraphs of explanation with a single bulleted list.

The change in performance was immediate. Organic traffic stabilized. But more importantly, impressions from AI-driven features doubled. The LLM could now clearly identify the core value proposition.

Use short paragraphs. Max 3-4 sentences. Break up dense text with subheads that contain keywords, yes, but also clarity. Avoid clever wordplay in H2s. Be literal. Be direct.

Tables are your best friend. I converted five complex pricing matrices from text to HTML tables. The LLM extracted the comparison data instantly. It cited us as the definitive source for pricing analysis. That’s trust built through structure.

The Role of AI Agents in SEO Workflows

Some teams are jumping on the AI agent bandwagon. They’re building autonomous bots to rewrite content at scale. I tried it. It failed.

Generic AI agents lack domain nuance. They hallucinate facts. They miss subtle brand voice cues. The result? Content that looks human but reads like spam. Google’s systems detect this. Penalties follow.

Instead of building agents that write, build agents that optimize. I developed a workflow that scans existing content for extraction gaps. It identifies sentences that are too complex. It suggests schema additions. It doesn’t rewrite. It refines.

This approach respects the original expertise while adapting to new algorithmic realities. It’s less flashy. More effective. Focus on augmentation, not automation.

If you’re serious about automating your SEO stack without losing quality, read this AI Agent Reality Check.

Technical Foundations Still Matter

You can have perfect content structure. If your page loads slowly, the LLM might not even crawl it deeply. I noticed this during a server-side rendering test. Pages with FCP over 2.5 seconds had lower citation rates. Why? Because the crawler timeout kicked in before the full DOM was parsed.

Optimize your Largest Contentful Paint. Minify CSS. Defer non-critical JS. These aren’t just UX improvements. They’re accessibility improvements for crawlers and models alike.

Check your Core Web Vitals Fix if you’re seeing drops in visibility without traffic loss. Often, it’s a technical indexing barrier.

Measuring Success in the Age of LLMs

How do you track this? Traditional rank trackers are useless here. They show position 1-10. They don’t show "Featured in AI Overview."

I started using a combination of SERP API tools and manual audits. I tracked citation frequency. I monitored branded mentions in AI-generated snippets. I correlated these with conversion rates.

Citations don’t always drive clicks. Sometimes, they drive authority. Even if the user doesn’t click, seeing your brand as the source builds trust. It primes them for future interactions.

Set up alerts for branded queries in AI Overviews. Use Google Search Console’s "Performance" report, filtered by specific query types. Look for increases in "Click-through Rate" for branded terms even when overall volume drops.

The Human Element

Despite all this tech talk, humans still matter. LLMs synthesize. They don’t create original insight. Your job is to provide the insight. The structure is just the delivery mechanism.

Interview subject matter experts. Record their answers. Transcribe them. Edit for clarity. Add schema. Distribute.

Don’t let AI write your whitepapers. Let it organize them. Don’t let AI conduct your interviews. Let it analyze the transcripts.

Stay close to your data. Stay close to your users. The technology changes every week. The fundamentals of trust and authority remain constant.

If you want to understand how zero-click searches impact your brand, read our Zero-Click Survival Guide. It’s critical for navigating the current landscape.

Final Thoughts

The era of keyword stuffing is dead. The era of structural precision has begun. Test your content for extractability. Audit your schema. Optimize for speed. Build workflows that augment, not replace.

It’s not about beating the AI. It’s about working with it. Your content needs to be a reliable source in its eyes. Make it easy for it to learn from you. That’s how you win in the next decade of search.