SCHEMA MARKUP FOR AEO: THE STRUCTURED DATA FOUNDATİON FOR MACHİNES AND ANSWER ENGİNES

Schema Markup for AEO: The Structured Data Foundation for Machines and Answer Engines

When search engines and AI assistants read a page, they are really interpreting raw text. Schema markup, or structured data, is a standard layer that describes your content to these systems in a machine-readable language. In the AEO (Answer Engine Optimization) era, well-built structured data clarifies a piece of content's author, date, and entity relationships. Yet the field is full of myths: some sources present schema as a magic ranking button, while others claim it guarantees AI visibility. Drawing on verified Google sources and controlled studies, we will separate fact from hype one by one.

What Schema Markup Is and Why It Is the Foundation of Structured Data

Schema markup is standardized structured data that describes a page's content to search engines in a machine-readable way using the shared vocabulary called schema.org. Google uses it for two purposes: to understand more clearly what the page is about and to make the page eligible for search features such as rich results. The schema.org vocabulary is jointly supported by Google, Microsoft (Bing), Yandex, and Yahoo!, making it a single universal standard. The vocabulary contains hundreds of types and over a thousand properties and is used by millions of domains. The correct framing is this: structured data is not the strategy itself but the packaging layer that lets machines parse good content correctly.

JSON-LD, Microdata, and RDFa: The Three Formats and Why Google Recommends JSON-LD

There are three valid ways to embed schema.org data in a page: JSON-LD, Microdata, and RDFa. Google's official wording is clear: all three formats are equally fine for Google as long as the markup is valid and properly implemented. Google recommends JSON-LD, but only because it is the easiest to implement and maintain at scale and the least prone to user error. Contrary to a common misconception, JSON-LD as a format does not earn higher rankings or more rich results. JSON-LD is placed inside a script tag and can appear in either the head or the body of the page. What matters is that the markup is present in the page's raw HTML; external .json files are not crawled. For details, Google Search Central documentation is the primary source.

The Most Important Schema Types for Content Sites

For a blog, the most appropriate type is BlogPosting, for news content NewsArticle, and for general articles Article. None of them has required properties, but Google recommends author (Person or Organization), datePublished, dateModified, headline, and image. Organization markup defines the brand identity; name, url, logo, and sameAs (links to social and authoritative profiles) are the most valuable fields, and adding it to the homepage alone is sufficient. BreadcrumbList describes the site hierarchy and is one of the types that still earns a rich result in 2026. The WebSite type defines the site name. For businesses serving a local area, the LocalBusiness type provides a strong local signal with name, address, phone, and opening hours. Verifying the recommended properties of Article and other types against Google's official gallery is a good habit.

A typical JSON-LD block for a blog post looks like this:

<script type="application/ld+json">
{
  "@context": "https://schema.org",
  "@type": "BlogPosting",
  "headline": "Schema Markup and Structured Data for AEO",
  "datePublished": "2026-06-15",
  "dateModified": "2026-06-15",
  "author": {
    "@type": "Person",
    "name": "Özkan Göçer",
    "url": "https://www.ogocer.com/en/"
  },
  "publisher": {
    "@type": "Organization",
    "name": "Ogocer"
  }
}
</script>

Which Types Still Earn Rich Results in 2026: The End of FAQ and HowTo

Many outdated sources still present FAQ and HowTo markup as a source of rich results, yet the situation has changed completely. The table below summarizes what has been removed and what is still valid.

Status Type / feature Date
Removed HowTo rich result Mobile August 2023, desktop September 13, 2023
Removed FAQ rich result (no longer in Search) May 7, 2026 (report and test June 2026, API August 2026)
Removed Sitelinks search box (WebSite + SearchAction) November 21, 2024
Retired Book Actions, Course Info, ClaimReview, Estimated Salary, Learning Video, Special Announcement, Vehicle Listing June 2025
Still earns rich results Product, Review, AggregateRating, Recipe, Article, Video, Breadcrumb, Organization, LocalBusiness, Event 2026

FAQPage is still a valid schema.org type and leaving it on the page does no harm, but it no longer produces a rich result in Google Search. You can verify the dates from the primary sources, the FAQ document and the HowTo announcement.

Structured Data Is Not a Ranking Factor

A common and costly misconception is that adding schema directly pushes a page higher. Google clearly rejects this: structured data is not a general ranking factor. Even a manual action for a structured data violation does not affect ranking; it only removes rich result eligibility. Google spokespeople including John Mueller and Danny Sullivan have confirmed this repeatedly since 2019. The benefits are indirect: a potential click-through-rate increase via rich results, better entity understanding, and eligibility for AI search features. Google's structured data policies clarify the distinction. In other words, schema does not replace classic SEO work; it is a technical complement to it.

How AI Answer Engines Actually Use Schema

The most critical AEO question is this: do systems like ChatGPT, Perplexity, Gemini, Claude, and Google AI Mode read structured data? Controlled tests by Ahrefs and SearchVIU in October 2025 produced a surprising result. The engines in question extracted only the visible HTML content when fetching a page live; none of them found data placed solely inside JSON-LD, for example a price present only in the schema. The practical reason is this: as of 2026, the major AI crawlers (GPTBot, ClaudeBot, PerplexityBot, OAI-SearchBot, and others) do not execute JavaScript and only read the raw HTML returned by the server. Therefore schema injected later via JavaScript is never seen by these bots. Bot behavior may change over time, but this is the picture today.

A broader model shows that schema plays different roles across the training, indexing, index-based search, and live-fetch stages. Structured data can contribute indirectly especially in index-based systems such as Google AI Overviews and Bing with Copilot, because Google AI Mode uses the same index and markup as organic search. Another important fact is this: there is no special schema type or separate AI optimization for AI Overviews or ChatGPT.

Does Schema Increase AI Citations? Examining the Evidence

The claim that adding schema gets you cited more in AI is the most frequently repeated AEO myth. The most rigorous evidence is Ahrefs' difference-in-differences study: between August 2025 and March 2026, 1,885 pages that added schema were compared against 4,000 matched control pages. The result was striking, because adding schema produced no meaningful citation increase on any platform. A 2.4 percent increase for Google AI Mode and 2.2 percent for ChatGPT were at the level of statistical noise; for Google AI Overviews a small but significant decline of 4.6 percent was measured. Kurt Fischman's academic preprint reached a similar conclusion: generic schema has no independent effect on citations, and the main determinant is organic ranking position. Ahrefs' study shares the findings in detail.

Frequently circulated figures such as 73 percent, 3.2x, 317 percent, or 90 percent of AI answers rely on structured data have no verifiable primary source; most are uncontrolled agency observations, and that 90 percent figure actually belongs to a completely different topic, the accuracy rate of AI Overviews. The frequent presence of schema on cited pages is a correlation, not causation: authoritative publishers already use schema and get cited a lot. Likewise, the narrative that only 12 percent of sites use structured data so early movers win is false; that figure refers to registered domains, while about 54 percent of active sites already use JSON-LD.

Entity SEO, @id, and E-E-A-T Signals

The most valuable use of structured data is clarifying entities. Within JSON-LD, @id gives a node a unique identifier and lets you reference the same entity from different places, while schema.org's url property indicates the entity's actual web address, and the two should be used together. The recommended @id format is the canonical URL plus an identifier, for example the homepage address and an organization tag. An important limit is this: search engines process structured data page by page and do not automatically merge entities sharing the same @id across pages, so it must be reinforced with HTML internal links.

On the author authority side, Person markup clarifies who wrote the content via name, url, jobTitle, and especially sameAs, that is, by linking to authoritative profiles. Still, markup alone does not increase E-E-A-T. Google states that E-E-A-T is not a direct ranking score and cannot be raised simply by adding markup. John Mueller calls structured data an extremely light signal. Real authority is determined by user-visible author information, a consistent publishing history, and off-site recognition.

Implementation, Consistency, and the Most Common Errors

In implementation, the most common mistake is syntax: a single missing comma or unclosed bracket makes the entire schema block invisible to Google's parser. The second common mistake is inconsistency. The data in JSON-LD must match the content visible on the page exactly; Google can disqualify markup that does not represent visible content from rich results, and AI systems understand the content worse when the markup deviates from the visible text. For example, if dateModified in the schema says 2023 while the text references 2026 data, trust is eroded.

For WordPress users, plugins such as Yoast or Rank Math generate basic schema automatically. The right approach is not to turn this off and write by hand, but to audit the generated markup and keep it aligned with the visible content. The critical technical rule is clear: because AI bots do not execute JavaScript, structured data should be embedded in the raw HTML on the server side, not injected only in the browser via JavaScript.

Validation Tools and a Practical AEO Strategy

There are two main validation tools with different purposes. The Rich Results Test tests only Google-specific rich result eligibility and no longer supports FAQ after June 2026. The Schema Markup Validator validates general schema.org syntax and continues to check all types, including FAQPage. The URL Inspection tool in Search Console also shows how Google sees the structured data on the live page.

A practical minimum set for a small business is this: Organization on the homepage (with sameAs), WebSite for the site name, BreadcrumbList for navigation, Article or BlogPosting with a Person author on content pages, and LocalBusiness if you serve a local area. Since FAQ markup no longer earns a rich result, building the question-and-answer structure with visible headings and direct answers delivers the same AEO benefit, because AI engines read the visible text during live retrieval.

Conclusion

In short, schema markup is an indispensable infrastructure for AEO, but not the magic it is hyped to be. Well-built structured data clarifies your content for machines and answer engines, makes it eligible for classic rich results, and strengthens your entity relationships. Ranking or AI citation, however, is won mainly through accuracy, authority, and clear, extractable answers. We have already covered what is AEO and the steps of content optimization for AEO; in the era of AI search, applying all three together is the most solid path. For a broader view, you can explore our AEO services.

FAQ

Frequently Asked Questions

Quick answers for readers who skipped to the end.

What is schema markup and why does it matter for SEO?
Schema markup is standardized structured data that describes a page's content to search engines in a machine-readable way using the schema.org vocabulary. Google uses it for two purposes: to better understand page content and to make the page eligible for search features such as rich results. schema.org is the single official vocabulary jointly supported by Google, Microsoft (Bing), Yandex, and Yahoo!. It matters because it helps machines parse content correctly, but it is not a direct ranking factor.
What is the difference between JSON-LD, Microdata, and RDFa, and which does Google recommend?
All three are valid ways to embed schema.org data in a page. Google treats all three equally well; per its official wording, all three formats are fine for Google as long as the markup is valid and properly implemented. Google recommends JSON-LD, but only because it is the easiest to implement and maintain at scale and the least prone to user error. The claim that JSON-LD earns higher rankings or more rich results is false.
Are schema.org and Google rich results the same thing?
No. schema.org is the vocabulary that describes content, a hierarchy of types and properties. Rich results are the enhanced displays Google may show in search based on that structured data. A type can be a valid schema.org type yet still produce no rich result in Google. For example, FAQPage is still a valid schema.org type, but as of May 7, 2026 it no longer appears as a rich result in Google Search.
Should I use Article or BlogPosting for a blog post?
There are three valid Article types: Article, NewsArticle, and BlogPosting. None of them has required properties. For a blog post, the most specific type, BlogPosting, is appropriate; use NewsArticle for news content and Article for general articles. Google's general rule is to use the most specific applicable type. Recommended properties include author (Person or Organization), author.name, author.url, datePublished, dateModified (in ISO 8601 format), headline, and image.
Does FAQPage schema still work in 2026?
FAQPage is still a valid schema.org type and leaving it on the page does no harm, but it no longer earns a rich result in Google Search. Google officially announced that FAQ rich results stopped appearing as of May 7, 2026. In June 2026 the FAQ search appearance, the rich result report, and Rich Results Test support were removed, and Search Console API support was removed in August 2026. The feature had already been restricted to authoritative government and health sites since August 2023, and that exception ended on May 7, 2026.
Why does HowTo schema no longer show rich results?
Google started this process with its announcement on August 8, 2023, as part of simplifying search results. HowTo rich results were removed first on mobile in August 2023 and then fully on desktop on September 13, 2023. Today you can leave HowTo markup on your page, but it produces no special rich result formatting on any device. Many outdated SEO articles still list HowTo as an active rich result, so caution is warranted.
Is structured data a direct Google ranking factor?
No. Google clearly states that structured data is not a general ranking factor and does not directly make a page rank higher. Even a manual action for a structured data violation does not affect ranking; it only removes rich result eligibility. Google spokespeople including John Mueller and Danny Sullivan have confirmed this repeatedly since 2019. The benefits are indirect: a potential increase in click-through rate via rich results, better entity understanding, and eligibility for AI search features.
Does schema markup increase visibility in answers like AI Overviews and ChatGPT?
It does not guarantee it, and there is no solid causal evidence that it increases it. Ahrefs' controlled study comparing 1,885 pages against 4,000 control pages found that adding schema produced no meaningful citation increase on any platform (AI Overviews down 4.6 percent; AI Mode up 2.4 percent and ChatGPT up 2.2 percent, the latter two at noise level). The frequent presence of schema on cited pages is a correlation, not causation. Visibility is determined mainly by accuracy, authority, and a clear, extractable answer structure.
Can search and AI engines read schema injected with JavaScript?
There is an important distinction here. Google can read JSON-LD that is dynamically injected with JavaScript. However, AI crawlers (GPTBot, OAI-SearchBot, ChatGPT-User, ClaudeBot, PerplexityBot, and others) do not execute JavaScript as of 2026 and only read the raw HTML returned by the server. Therefore schema injected via JavaScript is never seen by these bots. The safest approach is to embed the schema statically in the raw HTML on the server side.
Does Person schema directly increase author E-E-A-T?
No, it does not increase it directly. Person markup helps search engines understand who wrote a piece of content and disambiguate the author entity, especially by linking to official profiles via sameAs. However, Google states that E-E-A-T is not a direct ranking factor or a score and cannot be raised simply by adding markup. John Mueller describes structured data as an extremely light signal. The real determinants of authority are visible author information, a consistent publishing history, and off-site recognition.
What happens if there is an inconsistency between structured data and the visible content?
Inconsistency is harmful. The content in JSON-LD must match the content visible on the page; Google's policy requires that markup accurately represents the content actually visible on the page. Markup that contradicts the visible content can lose rich result eligibility. In addition, AI systems understand the content worse when the markup deviates from the visible text. For example, if dateModified in the schema says 2023 while the text references 2026 data, trust drops. For this reason, markup and visible content should always be kept in sync.
Which tools do I use to validate my schema markup?
There are two main tools. The Rich Results Test checks Google-specific rich result eligibility, but it no longer supports FAQ after June 2026. The Schema Markup Validator performs general, non-Google-specific schema.org syntax validation and continues to validate all types, including FAQPage. The URL Inspection tool in Search Console also shows how Google sees the structured data on the live page. None of these tools check cross-page @id reference consistency; that must be verified manually.
Do I have an early-adopter advantage because few sites use structured data?
This claim is misleading, and the frequently circulated 12 percent figure is misinterpreted. That number is the share of all registered domains, not of active sites. According to current data, JSON-LD usage among sites is around 54 percent. In other words, structured data is already widespread; the narrative that very few sites use it so there is a big advantage does not reflect reality. Use schema not for a competitive edge but to describe content correctly.
If FAQ markup no longer earns rich results, how do I structure clear Q&A?
Since FAQPage markup has lost its rich result advantage, it is more valuable to build the question-and-answer structure at the visible content level. Write questions as clear headings, for example H2 or H3, and provide direct, extractable answers immediately below them. The first sentence of the answer should directly answer the question. This structure makes content understandable for both users and AI engines, because AI systems read the visible text during live retrieval. You achieve the same AEO benefit without FAQ schema, using clean and well-headed visible content.
Should I create an llms.txt file; is it required for AEO?
It is not required for Google. Google's 2026 guidance states that llms.txt is not needed for AI Overviews and AI Mode and that its systems do not consult the file. Therefore presenting llms.txt as a Google AEO strategy is misleading. That said, the situation may differ in some third-party and agent scenarios; some tools may support llms.txt. In short, it has no proven effect for Google visibility but may be considered in niche agent scenarios.
What is the minimum viable AEO schema set for a small business?
A practical minimum set includes: Organization on the homepage (with name, url, logo, and sameAs linking to social and authoritative profiles), WebSite for the site name, BreadcrumbList for site hierarchy which still earns a rich result, and Article or BlogPosting on content pages (with author as a Person entity, plus datePublished and dateModified). LocalBusiness can be added for businesses serving a local area. The most important rule: all markup must accurately represent and stay consistent with the content visible on the page. Think of schema not as the strategy itself but as the packaging layer that makes good content machine-readable.
Summarize:
Özkan Göçer profile photo

Özkan Göçer

Growth Engineer & Digital Marketing Specialist

Özkan Göçer is a Growth Engineer and Digital Marketing Specialist with over 15 years of field experience and 200+ completed projects. He shares advanced optimization strategies that help content get cited as a primary source by AI platforms like AI Overview, ChatGPT, and Perplexity.


Scroll to top