Schema markup
Definition
Schema markup is structured data added to web pages using the schema.org vocabulary that tells search engines and AI systems exactly what the content represents — a product, an article, a recipe, an FAQ, a person. It powers rich results in Google, drives entity understanding in knowledge graphs, and increasingly determines whether content is cited in AI Overviews and LLM-generated answers.
What it is
Schema markup uses the schema.org vocabulary — an open standard maintained by Google, Microsoft, Yahoo, and Yandex — to describe things on the web in a format machines can parse. It defines entities (people, products, businesses, articles), their properties (names, dates, ratings, prices), and the relationships between them.
The recommended format is JSON-LD, embedded in a <script type="application/ld+json"> tag in the page head. JSON-LD is preferred over Microdata and RDFa because it separates structured data from HTML content, making it easier to maintain at scale.
How to implement it
The five highest-impact schema types for most sites:
-
Organization — defines your business as a known entity. The single highest-leverage implementation in 2026 because it powers knowledge panel accuracy and AI citation reliability, even though it produces no visible SERP feature.
-
Article or BlogPosting — for editorial content. Defines headline, author, publish date, and modified date. Required for Top Stories carousels and AI Overview citation.
-
FAQPage — for question-answer content. The single most impactful schema for AEO and GEO. Note that as of January 2026, Google restricted FAQPage rich results to government and health domains in some queries, but the markup still feeds AI Overviews and voice assistants regardless of rich result eligibility.
-
Product — for ecommerce. Combined with Offer and Review schemas, triggers product rich snippets showing price, availability, and ratings. High-converting placement.
-
LocalBusiness — for any business with a physical location. Powers local pack inclusion and Google Business Profile alignment.
20–40%
Higher click-through rate on pages with rich result eligibility
Industry studies, 2026
5
Schema types covering 90% of business needs (Organization, Article, FAQ, Product, LocalBusiness)
Industry analysis, 2026
March 2026
Date Google's core update reduced FAQ rich results but increased schema's role in AI Mode citation
Google Search Central
Common mistakes
The four mistakes that consistently break implementations:
Schema does not match visible content. If your FAQ schema contains questions and answers that are not actually rendered on the page, Google can demote or manually action the markup. The March 2026 core update specifically penalized FAQ, Review, and HowTo schema implemented as SERP manipulation rather than as accurate descriptions of genuine page content.
Stale schema. Product schema with outdated prices, expired events, or out-of-stock items left as available creates user friction and trust penalties. Schema must update whenever the underlying content changes.
Over-marking up everything. Adding schema to elements that do not need it dilutes the impact of your most important structured data and can confuse search engines.
Skipping validation. Errors are silent. They do not break the page; they just prevent rich results from appearing. Quarterly validation catches drift introduced by template changes or content updates.
How to validate it
Three free tools cover validation:
Google Rich Results Test — checks whether a URL or code snippet qualifies for specific rich results and shows previews. Use this for Google-specific markup (FAQ, Product, Recipe, etc.).
Schema.org Validator — validates against the full schema.org specification. Catches errors Google's tool ignores. Use this for thorough syntax checking.
Google Search Console — the Enhancements report shows ongoing schema performance and flags errors at scale. Set up alerts for validation failures so issues surface before they tank rich result eligibility.
Frequently asked questions
Is schema markup a ranking factor?
Not directly. Google has explicitly stated that structured data is not a ranking signal. But the indirect effects are substantial — schema enables rich results that improve click-through rates, builds entity understanding in the Knowledge Graph, and increasingly determines whether AI systems cite your content. The downstream impact on visibility is real and measurable.
Which schema format should I use?
JSON-LD. Google strongly prefers it, and it is easier to maintain than inline Microdata or RDFa because it separates structured data from page HTML. Most modern CMS platforms generate JSON-LD by default.
Do I need schema markup for AI search?
Yes. Both Google and Microsoft have publicly confirmed they use schema markup for their AI features, and ChatGPT has confirmed it uses structured data for product results. Clean entity schema is becoming as important for AI citation as for traditional SERP rich results.
What changed in Google's January and March 2026 updates?
Google deprecated several lesser-used schema types (Practice Problem, SpecialAnnouncement, Q&A) in January 2026 and reduced rich result eligibility for FAQ and HowTo on non-primary content in March 2026. The headline shift: schema is moving from a SERP display trigger to an AI trust signal. Accurate entity schema now matters more than ever, even when no rich result is displayed.
How long does schema take to show results?
Implementation is immediate; effect appears once Google re-crawls the page. Rich result eligibility typically shows in Search Console within 1–2 weeks. AI citation effects are harder to attribute but typically take 2–4 weeks as AI systems re-index the updated structured data.
Featured snippets
A featured snippet is a highlighted answer box that appears at the top of Google search results — above the standard organic listings — pulled directly from a web page. It answers the user's query in 40–60 words without requiring a click, and the same content often feeds Google AI Overviews and voice assistant responses.
llms.txt
llms.txt is a proposed web standard — a markdown-formatted file placed at the root of a website — that gives LLMs and AI tools a curated index of a site's most important content. Modeled on robots.txt and sitemap.xml but designed for LLM comprehension rather than search crawlers, llms.txt is in the early adoption phase as of 2026, with no major AI platform officially committed to consuming it.