-
Breaking Case Study: AI does not read schema; Schema dos not help – Mark williams Cook
As shared on Linkedin, X, BlueSky by LudvigHoel and Mark Williams Cook (the Tafferboy) and Barry Schwartz , j0udini
From Mark Williams-Cook on LinkedIn:
LLMs work by "tokenising" content. That means taking common sequences of characters found in text and minting a unique "token" for that set. The LLM then takes billions of sample "windows" of sets of these tokens to build a prediction on what comes next.The image below is some example schema that has a colour change applied which represents that set of characters is a unique token as made by the GPT-4o model. What you will notice is that the schema gets "destroyed". For instance, the schema "@type": "Organization", gets broken down so there are separate tokens for "type" and "Organization", which means that in terms of tokenisation the regular words "type" and "Organization" are not distinguishable from schema.
From SE Roundtable
There are a lot of folks in the community saying that implementing structured data / schema on your pages will help you with AI Search visibility. But few have really tested it until now. And those few tests show that adding structured data / schema does not help with your visibility in AI search, at least not yet.
The first to test this was Mark Williams-Cook who posted on LinkedIn an experiment he conducted where he posted a "visual explanation of why your favourite LLM does not use schema in their core training data." He explained how when the LLMs process the page, it actually "destroys" the schema markup and thus does not use it.
from:
https://www.seroundtable.com/structured-data-schema-ai-search-visibility-40099.html
Log in to reply.