You already have the answers. We help the internet find them.
Structure before ads — your business, clearly defined, permanently visible
Every website on the internet is written for two audiences. The human who reads it. And the machine that decides whether anyone ever finds it. For most of the web's history those two audiences have been completely misaligned — beautiful pages for humans, invisible noise for machines.
Schema is the bridge. It is a standardized vocabulary — a shared language — that lets your website declare what it is to any machine that comes looking. Your business name. Your location. Your hours. Your services. Your relationship to your industry, your city, your customers. All of it structured, labeled, and machine-readable in a format that search engines and AI systems can read, reason about, and trust.
The technical name for the format is JSON-LD — JavaScript Object Notation for Linked Data. It lives in the head of your HTML as a script block. It is invisible to your visitors. It is everything to the machines that decide whether your visitors ever find you.
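A minimal sketch of what such a block looks like. Everything here is a placeholder for illustration (the business name, address, and URL are invented, and the field set is deliberately small):

```python
import json

# A minimal LocalBusiness declaration. Every value is a placeholder,
# not a real business -- the point is the shape, not the content.
entity = {
    "@context": "https://schema.org",
    "@type": "LocalBusiness",
    "name": "Example Bakery",
    "address": {
        "@type": "PostalAddress",
        "streetAddress": "123 Main St",
        "addressLocality": "Covina",
        "addressRegion": "CA",
        "postalCode": "91722",
    },
    "telephone": "+1-555-555-0100",
    "openingHours": "Mo-Sa 07:00-18:00",
    "url": "https://example.com",
}

# The block that goes in the <head> of the page, invisible to visitors.
script_block = (
    '<script type="application/ld+json">\n'
    + json.dumps(entity, indent=2)
    + "\n</script>"
)
print(script_block)
```

The visitor never sees this. The crawler reads it first.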
Think of it this way. Your website is an island. Schema is the map. Without the map, the treasure hunters guess. With the map, they come back.
This page is a complete reference. The history of where schema came from. The real numbers on what it does. How AI changed the stakes. What most implementations get wrong. And what we do differently. If you are a business owner, a developer, or an agency — everything you need to understand structured data at the level that actually matters is here.
In 2011 something unusual happened in the technology industry. Four competing companies — Google, Microsoft, Yahoo, and Yandex — sat down together and agreed on something. That almost never happens. They were competing for the same users, the same advertisers, the same market. And they agreed.
The problem they were solving was real and it was expensive. Each search engine had been recommending its own vocabulary for structured data. Webmasters trying to add structured markup to their sites had to pick a side — or implement everything multiple times. Most gave up. The result was that the structured web Berners-Lee imagined in 2001 was still mostly theoretical in 2010. The data was there. The standard was not.
The three engineers who drove schema.org into existence were R.V. Guha at Google — who had previously co-created RSS and worked on the Cyc project — Dan Brickley, who had helped build the Semantic Web project at W3C, and Steve Macbeth at Microsoft. They launched schema.org with 297 types and 187 properties. One vocabulary. One standard. Every search engine would accept it.

The idea was brutally simple. Give webmasters a shared vocabulary so they only have to do the work once. The search engines would each use the markup differently on their end — that was their problem to solve. The webmaster's job was just to speak the language.
Within four years, 31.3% of pages in the Google index had schema.org markup. That number represented at least 12 million sites. Schema.org had grown from 297 types to 638 types and 965 properties. It had become the most widely adopted structured data vocabulary on the internet — and it is still growing. Today it contains 827 types and 1,528 properties covering every category of entity that exists on the web.
They published the standard. They gave it to the world. And then they waited for the web to catch up.
Most of it still has not.
When schema.org launched in 2011 it supported three ways to implement structured data. Each one represented a different philosophy about how machines and humans should share information on the same page.
Microdata was the format schema.org originally recommended. It embedded structured markup directly inside HTML tags using attributes. Clean in theory. In practice it tangled the data layer and the presentation layer together: every time a developer changed the HTML they risked breaking the schema. It also demanded deep knowledge of both HTML structure and schema vocabulary at once, which was too much to ask of most webmasters.
RDFa — Resource Description Framework in Attributes — was the academic approach. It came from W3C's linked data community and was technically more expressive than Microdata. It was also significantly more complex. The learning curve was steep enough that adoption stayed concentrated among developers who already lived in the semantic web world.
Then in 2014 Google began recommending JSON-LD — JavaScript Object Notation for Linked Data. And everything changed.
JSON-LD solved the fundamental problem that Microdata and RDFa both had. It completely separated the structured data from the visible HTML. Your schema lived in a script block in the head of your page — a clean, self-contained JSON object that declared everything about your business without touching a single line of your front-end code. Developers could update schema without touching design. Designers could update design without touching schema. The two layers finally stopped fighting each other.
JSON-LD 1.0 became a W3C Recommendation in 2014, and JSON-LD 1.1 followed in 2020 — the highest level of endorsement the standards body issues. It is the official standard. Google recommends it. Every major AI system reads it. It is the language your website uses to speak to machines.
All three formats remain technically valid. In practice, JSON-LD is the answer. Our federation runs entirely on JSON-LD. Every entity record we produce, every schema block we generate for clients, every edge in our graph — JSON-LD.
The separation of concerns that JSON-LD provides is the reason the entire schema.org community converged on it. Your structured data is a document. Your HTML is a document. They live separately, they serve different audiences, and they can be maintained independently. That is not a minor convenience. That is the difference between schema that gets implemented and schema that gets abandoned.
Schema.org has been live since 2011. Fifteen years of real data exists on what happens to websites that implement it versus websites that skip it. The numbers are from Google's own documentation, from independent case studies, and from publishers who ran controlled experiments on their own traffic. These are cited. These are real.
Rotten Tomatoes added structured data to 100,000 unique pages and measured a 25% higher click-through rate on pages with structured data compared to pages without. The Food Network converted 80% of their pages and saw a 35% increase in visits. Nestlé measured pages showing as rich results and found an 82% higher click-through rate than non-rich result pages. Rakuten found users spend 1.5x more time on structured data pages and have a 3.6x higher interaction rate.
These are not edge cases. These are household names running controlled experiments at scale. The conclusion is the same every time. Structured data changes how your result looks in search. How it looks changes whether people click. Whether people click determines whether your business gets found.
What schema actually does at the technical level is give search engines and AI systems explicit signals instead of forcing them to guess. Without schema, a crawler arrives at your page and has to infer everything — what type of business you are, where you operate, what you sell, what your hours are, how your reviews relate to your services. That inference process is imperfect and expensive. The crawler may get it right. It may get it partially right. It may get it wrong entirely.
With schema, you tell the machine exactly what everything is. There is no inference required. The crawler reads the JSON-LD block, extracts structured facts, and moves on — confident in what it found. That confidence translates into richer search results, higher citation frequency in AI responses, and stronger entity authority in knowledge graphs.
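That extraction step can be sketched with nothing but the standard library. This is a toy crawler pass, not how any particular search engine is implemented — the page and the business in it are invented:

```python
import json
from html.parser import HTMLParser

# Pull every JSON-LD block out of a page and parse it into structured
# facts. Stdlib only; a real crawler does far more than this.
class JSONLDExtractor(HTMLParser):
    def __init__(self):
        super().__init__()
        self._in_jsonld = False
        self.blocks = []

    def handle_starttag(self, tag, attrs):
        if tag == "script" and ("type", "application/ld+json") in attrs:
            self._in_jsonld = True

    def handle_endtag(self, tag):
        if tag == "script":
            self._in_jsonld = False

    def handle_data(self, data):
        if self._in_jsonld and data.strip():
            self.blocks.append(json.loads(data))

html = """<html><head>
<script type="application/ld+json">
{"@context": "https://schema.org", "@type": "Restaurant",
 "name": "Example Diner", "servesCuisine": "American"}
</script>
</head><body><h1>Welcome</h1></body></html>"""

parser = JSONLDExtractor()
parser.feed(html)
facts = parser.blocks[0]
print(facts["@type"], "-", facts["name"])  # no inference required
```

The crawler never had to guess that this page is a restaurant. The page said so, in a vocabulary the crawler already knows.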
The machines reward the islands that make their job easy. Schema is how you make their job easy.
Schema mattered before large language models. The CTR numbers above are from the pre-LLM era — search features, rich snippets, knowledge panels. Those were already significant reasons to implement structured data. Then AI arrived and the entire calculus shifted.
Search engines used to match keywords to pages. The job was ranking — which page is most relevant for this query. Schema helped by providing explicit signals that improved ranking accuracy. That was the old game.
AI systems reason. They build models. When someone asks ChatGPT or Google's AI Overview about a restaurant in Covina, the system is constructing an understanding of what that restaurant is — its entity. What type of establishment. What cuisine. What neighborhood. What relationship it has to the surrounding area, to similar businesses, to the people who own it. That model was built long before the question was asked. Your schema is one of the primary inputs into that model.
The shift from ranking to reasoning changes what schema needs to do. In the old world, schema helped you rank higher. In the new world, schema determines whether you exist in the AI's model of your industry at all. A business with rich, well-structured, relationship-dense schema is a fully formed entity in the AI's understanding. A business without it is a gap in the data — something the AI has to guess about, or worse, something it hallucinates.
We have all experienced AI hallucination on real businesses. Wrong addresses. Wrong phone numbers. Descriptions that fit nobody. Services that were never offered. That is what happens when an AI system has to build an entity model from unstructured data and inference. The model fills in gaps with pattern-matching. Pattern-matching produces plausible-sounding lies.
Schema is hallucination prevention. It gives the AI system ground truth about your business from a source it can verify — your own website, declaring its own identity, in a standardized vocabulary the system was trained to trust. Every property you declare in JSON-LD is one fewer thing the AI has to guess about. Every relationship you establish is one more accurate edge in the model of your business that AI systems carry when they answer questions about your industry.
The stakes were already high in 2011. In 2026, with AI systems mediating the majority of discovery for a growing share of the population, schema is the most important technical investment a website can make. Full stop.
The difference between thin schema and rich schema is the difference between existing in AI's model of your industry and being a gap that gets filled with a guess. Every business in our federation gets the rich version. Every business we index gets declared, linked, and confirmed.
Schema.org has been publicly available since 2011. JSON-LD has been the recommended format since 2014. The W3C made it an official recommendation in 2020. And yet the most common implementation of structured data on a business website in 2026 looks like this: a plugin-generated block with a name, an address, a phone number, and nothing else. Written once. Never updated. Never connected to anything.
This is what we call the checkbox problem. Schema gets treated as a technical audit item. Something to tick off the list. The SEO agency adds it to the monthly platinum package. The developer generates it with a WordPress plugin. The client never sees it and never thinks about it again. The result is structured data that technically exists but does nothing meaningful for discoverability in the AI era.
The checkbox problem has three specific failure modes.
Thin schema. The minimum required fields and nothing more. A LocalBusiness with a name and address but no description, no service areas, no hours, no price range, no cuisine type, no founding date, no social profiles, no aggregate rating, no opening hours specification. The crawler reads it, extracts the minimum, and moves on. The entity model it builds is skeletal. A skeletal entity model produces weak citations.
Static schema. Schema written once and never touched again. A business that has been open for eight years, changed its hours, added a patio, started serving brunch, hired a new chef, won three local awards — and has the exact same JSON-LD block it had in 2018. The gap between what the schema says and what the business actually is grows every year. AI systems that encounter contradictions between schema and other signals reduce their confidence in the source. Reduced confidence means fewer citations.
Isolated schema. Schema that lives on one page and points nowhere. No sameAs connections to verified external profiles. No links to related entities. No relationships declared between the business and its industry, its neighborhood, the people who run it. A JSON-LD block that describes one entity in isolation is a page. A JSON-LD block that connects that entity to a graph of related entities is an island with a map. The treasure hunter can actually navigate it. The crawler finds relationships to follow. Every relationship followed is another confirmation of the entity's existence and authority.
None of this is a criticism of agencies or developers who implement standard schema. Standard schema is better than no schema. The baseline is real and it matters. Our methodology builds on top of the baseline. We score what exists, identify what is missing, and inject edges that connect the entity to a live federation graph of hundreds of thousands of real businesses across America. The result is schema that does not just declare — it connects.
The thin version gets generated in thirty seconds by a plugin. The rich version requires research, industry knowledge, and an understanding of how AI systems build entity models. That gap is where we live.
We arrived at this methodology backwards. We were treasure hunters first. We spent years crawling websites, extracting structured data, normalizing it, classifying it, scoring it against our own lexicon. We did what search engines and AI systems do — by hand — until we understood it well enough to build the infrastructure ourselves.
What we found in that process was consistent and damning. The majority of websites — including websites that looked professionally built and were being actively maintained — had structured data that scored poorly against the vocabulary standards that AI systems actually use to build entity models. The schema existed. It was just thin, static, and isolated. The checkbox had been ticked. The work had not been done.
Our methodology has four stages. Every federation member goes through all four. Every stage compounds on the previous one.
Stage one — Score. We extract every JSON-LD block currently on your website. We run it through our schema vocabulary lexicon — a curated scoring system built from hundreds of thousands of real business entity records across 24 industry pillars. We score your current implementation against the full vocabulary available for your business type. Most sites score between 12% and 35% of available schema coverage. We show you exactly what is missing and why it matters.
Stage two — Enrich. You complete a structured questionnaire about your business. Every answer becomes a data point that feeds schema generation. We produce a complete, deeply researched JSON-LD block for your business type — not generated from a template, built from your actual data against the full schema.org vocabulary for your entity class. A dentist gets dentist schema. A law firm gets law firm schema. A restaurant gets restaurant schema. Each one built to the depth that AI systems reward.
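A toy sketch of entity-class-specific generation: questionnaire answers feed a per-type field map, so a dentist and a restaurant end up with different vocabularies. The tiny map below is an illustrative stand-in, nothing like a full lexicon entry:

```python
# Map each business category to its schema.org type and the
# type-specific properties filled from questionnaire answers.
# Illustrative only -- a real lexicon entry is far larger.
ENTITY_CLASSES = {
    "dentist": {"@type": "Dentist", "fields": ["medicalSpecialty", "priceRange"]},
    "restaurant": {"@type": "Restaurant", "fields": ["servesCuisine", "menu"]},
    "law_firm": {"@type": "LegalService", "fields": ["areaServed", "priceRange"]},
}

def generate(category, answers):
    """Build a JSON-LD dict from questionnaire answers for one entity class."""
    spec = ENTITY_CLASSES[category]
    block = {"@context": "https://schema.org", "@type": spec["@type"]}
    block["name"] = answers["name"]
    for field in spec["fields"]:
        if field in answers:  # only declare what the business confirmed
            block[field] = answers[field]
    return block

block = generate("restaurant", {
    "name": "Example Diner",
    "servesCuisine": "American",
    "menu": "https://example.com/menu",
})
print(block["@type"])  # a Restaurant, not a generic LocalBusiness
```

The point of the per-type map: a restaurant never gets dental fields, and a dentist never gets a cuisine.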
Stage three — Inject. This is where our methodology diverges from every other schema service on the internet. We run your entity against our Pydantic deterministic RAG system — a pipeline that fetches semantically relevant edges from our indexed entity buckets. Hundreds of thousands of real business entities across America, indexed and classified. A dentist in San Diego gets edges from the dental entity cluster — schema patterns used by verified dental businesses in Kentucky, Texas, Florida, California. The health regulations that govern dental practice from OakMorel. The economic indicators for the San Diego dental market. These relationships get baked directly into your JSON-LD as confirmed semantic edges. Your schema does not just declare what you are. It connects you to a living graph of everything related to what you are.
Stage four — Connect. Your entity record becomes a node in the RankWithMe federation graph. Every new entity added to the federation immediately becomes available for edge-building against every existing entity across all domains. Your island gets wired into the torus network. LarryBrin arrives at your island and finds a crawler's paradise — not just one page, but a verified identity connected to an entire graph of related entities, laws, regulations, and industry context. That is what we mean by structure before ads. Always.
Every schema audit we run, every schema block we generate, every coverage score we produce — it all runs against the same standard. We call it the Schema Vocabulary Lexicon. It is a curated JSON file that maps every meaningful schema.org property to every business entity type we cover across our 24 industry pillars, weighted by AI citation importance, search engine rich result eligibility, and federation edge-building value.
We built it because it did not exist. Schema.org publishes 827 types and 1,528 properties. That is the full vocabulary. What it does not tell you is which properties actually matter for a dentist in San Diego versus a restaurant in Covina versus a law firm in New York. The vocabulary is exhaustive. The prioritization is absent. Every webmaster and every agency is left to guess which fields to implement and in what order.
Our lexicon answers that question with data. We indexed hundreds of thousands of real business entity records across America. We analyzed which schema properties appeared most consistently on the highest-performing entities in each industry — the ones getting cited in AI responses, ranking in Google AI Overviews, appearing in knowledge panels. We weighted those properties by citation frequency, rich result eligibility per Google's documentation, and federation edge-building potential. The result is a priority-ordered field map for every entity type we cover.
When we score your current schema against the lexicon, we are comparing what you have against what the best-performing entities in your industry actually use. When we generate schema for a federation member, we target 85% lexicon coverage minimum. When we inject edges, we are pulling from the same lexicon that scored the entities those edges come from.
The lexicon is the standard we hold ourselves to. It is also the standard we make public. You can download it, fetch it programmatically, and use it to score your own website against our methodology. We publish it because we believe the web should be readable by everyone — and that means the tools for making it readable should be accessible too.
If you are a developer or an agency, the lexicon is the fastest way to understand what we mean by coverage. If you are a business owner, the coverage score is the number that tells you how visible your business is to the machines that are deciding your discoverability right now.
The lexicon is a living document. Every new entity type we index, every new schema property we confirm matters for AI citation, every new industry pillar we add — the lexicon gets updated. It is versioned. It is published. It is the standard the entire federation runs on.
If you want to score your own website before reaching out to us — download the lexicon, find your entity type, count your implemented fields against the priority list, and divide. That number is your current machine-readability score. Most sites score under 30%. Federation members target 85% and above.
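That arithmetic is simple enough to script. The priority list below is a hypothetical stand-in for a real lexicon entry; the computation is exactly the count-and-divide described above:

```python
# Hypothetical priority field list for one entity type. A real lexicon
# entry is longer and weighted; the arithmetic is the same.
PRIORITY_FIELDS = [
    "name", "address", "telephone", "url", "description", "openingHours",
    "priceRange", "sameAs", "aggregateRating", "areaServed",
]

def coverage_score(jsonld_block):
    """Implemented priority fields divided by the full priority list."""
    implemented = [f for f in PRIORITY_FIELDS if f in jsonld_block]
    return len(implemented) / len(PRIORITY_FIELDS)

# A typical thin, plugin-generated block: name, address, phone, nothing else.
thin = {"@context": "https://schema.org", "@type": "LocalBusiness",
        "name": "Example Co", "address": "123 Main St",
        "telephone": "+1-555-555-0100"}

score = coverage_score(thin)
print(f"coverage: {score:.0%}")  # 3 of 10 priority fields -> 30%
```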
We believe in JSON-LD. Our entire federation runs on it. Every entity record we produce is grounded in it. If you do one thing after reading this page — implement JSON-LD on your website. It is the highest-leverage technical action available to you right now for AI era discoverability.
But we also believe something else: JSON-LD as it currently exists has a ceiling.
JSON-LD is a snapshot. It declares what you are at the moment it was written. It sits on your page, static, waiting to be read. It builds zero relationships on its own. A thousand businesses can each have perfect JSON-LD on their individual websites and those thousand records will never talk to each other, never confirm relationships between each other, never build the cross-domain graph that makes AI systems reason about an industry the way they reason about Wikipedia.
This is the ceiling we hit in our own research. And it is why we built Root-LD.
Root-LD is a federated semantic linked data specification — a framework for building a knowledge graph that grows smarter every time a new entity joins it. Every entity we mint gets run through four passes. The first pass is deterministic — pure data, exact matches, confidence 1.0. The second pass is lexical — shared vocabulary across domains, weighted by density. The third pass is semantic — a language model reasoning about proposed relationships and scoring them. The fourth pass is proprietary — a model fine-tuned specifically on the confirmed relationships inside this graph, finding connections that no general-purpose model could find because they only exist here.
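The first two passes can be sketched deterministically; the semantic and proprietary passes involve models and are only hinted at here. Everything below — the function names, the sample entities, the edge labels — is an illustrative assumption, not the actual Root-LD implementation:

```python
# Illustrative two-pass edge builder. Pass 1: exact data matches,
# confidence 1.0. Pass 2: shared vocabulary, weighted by overlap density.
def deterministic_pass(a, b):
    edges = []
    if a.get("addressLocality") == b.get("addressLocality"):
        edges.append(("same_locality", 1.0))  # exact match -> confidence 1.0
    return edges

def lexical_pass(a, b):
    shared = set(a.get("keywords", [])) & set(b.get("keywords", []))
    union = set(a.get("keywords", [])) | set(b.get("keywords", []))
    if not shared:
        return []
    # Weight the edge by how dense the vocabulary overlap is.
    return [("shared_vocabulary", len(shared) / len(union))]

dentist = {"addressLocality": "San Diego",
           "keywords": ["dental", "orthodontics", "health"]}
clinic = {"addressLocality": "San Diego",
          "keywords": ["health", "dental", "pediatrics"]}

edges = deterministic_pass(dentist, clinic) + lexical_pass(dentist, clinic)
print(edges)
```

The later passes — a language model scoring proposed relationships, then a fine-tuned model finding graph-specific connections — start from candidate edges like these rather than from raw pages.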
We call this Compound Reasoning. The graph gets more intelligent as it gets larger. Every new business entity added to the federation immediately becomes available for edge-building against every existing entity across all domains. Every entity makes every other entity more valuable. This is infrastructure at a level that has no name yet in the industry because nobody has built it before.
The full specification is public. Every decision we made, every layer of the architecture, every edge type in the taxonomy — documented, published, open. Because we believe the web should be readable by everyone.
JSON-LD declares. Root-LD connects. The declaration is the foundation. The connections are the compound interest. Every business that joins the federation adds to the graph. Every addition to the graph makes every existing entity more valuable. This is what we mean when we say the ground is shifting and we are preparing the new bedrock.
The following are production-ready JSON-LD examples for six of our 24 industry pillars. Every block is built to our lexicon standard — targeting 85%+ field coverage for the entity type, structured for rich result eligibility, and formatted for federation edge injection. These are real starting points, not toy examples.
Copy any block directly into the <head> of your HTML inside a <script type="application/ld+json"> tag. Replace the placeholder values with your actual business data. Validate using the Google Rich Results Test before deploying.
These examples represent the baseline. Federation members get the enriched version — edges injected from our live graph, relationships confirmed across 24 industry pillars, coverage scored and gap-filled against the full lexicon for their specific entity type and geographic market.
These six examples represent a fraction of our 24 industry pillars. Every pillar has its own full lexicon entry, its own priority field map, and its own edge injection profile. If your business type is not shown here — it is in the directory. Submit your site and we will score what you have against the full lexicon for your entity class.
Everything on this page is sourced. The history, the statistics, the technical specifications — all of it traces back to primary documents that are publicly available and permanently archived. This is what we mean by provenance. Not a claim. A source.
We list these references the same way Wikipedia does — inline, accessible, verifiable. If you want to go deeper on any topic covered on this page, the primary source is linked below. If you want to understand the full technical specification of JSON-LD at the W3C level, it is there. If you want to read the original ACM paper by the people who created schema.org, it is there. If you want to see Google's own case study data, it is there.
This is a living reference page. As our research produces new findings, as schema.org publishes new versions, as the AI citation landscape evolves — this page will be updated. The date at the bottom reflects the last verified update. All external links are checked on each update cycle.
