Platform Research Insights Glossary Tools Compare FAQ Request Demo

Glossary

PROGEOLAB glossary: explore all 32 entries.

AI Content Paradox

The AI Content Paradox describes the counterintuitive outcome where blocking AI crawlers makes a company's AI representation worse, not absent. When AI sys

2026-04-20

AI Readiness Score

The AI Readiness Score is a composite metric that measures how well-prepared a website is for AI discovery and citation. The PROGEOLAB AI Readiness Index s

2026-04-20

Entity Disambiguation (AI)

Entity disambiguation in the AI context is the process by which AI systems determine which real-world entity a website, content piece, or mention refers to

2026-04-20

Four-UA Comparison (Methodology)

The Four-UA Comparison is an audit methodology that tests website accessibility using four different user agent identities: an honest unknown research bot,

2026-04-20

GEO Maturity Model

The GEO Maturity Model is a framework that classifies organizations into six levels of AI visibility readiness, from Level 0 (Invisible — unreachable by an

2026-04-20

GEO Visibility Gap

The GEO Visibility Gap is the difference between what a website shows to human browsers and what it shows to AI crawlers. A company is in the GEO Visibilit

2026-04-20

robots.txt AI Policy

A robots.txt AI policy is a set of User-agent directives in a website's robots.txt file that specifically addresses AI crawlers. This includes naming indiv

2026-04-20

sameAs (Wikidata Link)

In JSON-LD structured data, the sameAs property declares that two URLs refer to the same real-world entity. Linking to a Wikidata entity (e.g., wikidata.or

2026-04-20

Soft 404

A soft 404 occurs when a web server returns an HTTP 200 (success) status code for a URL that does not actually contain the requested content. Instead of a

2026-04-20

Text-to-HTML Ratio

The text-to-HTML ratio measures the proportion of a webpage that is human-readable text versus HTML markup, JavaScript, CSS, and other code. For AI crawler

2026-04-20

Training vs Retrieval Crawler

AI crawlers fall into two fundamental categories: training crawlers that ingest content into model weights (GPTBot, Google-Extended, CCBot) and retrieval c

2026-04-20

WAF Bot Management

WAF (Web Application Firewall) bot management is the security layer that controls which automated clients can access a website. WAFs operate independently

2026-04-20

AI Crawler

An AI crawler is an automated web bot operated by an AI company to collect web content for training or retrieving information for AI models. Major AI crawl

2026-03-27

AI Referral Traffic

AI referral traffic consists of website visits that originate from AI-powered platforms such as ChatGPT, Perplexity, Google AI Overviews, and Microsoft Cop

2026-03-27

AI Visibility

AI visibility measures how often and how prominently a brand appears in AI-generated answers across platforms like ChatGPT, Perplexity, Google AI Overviews

2026-03-27

Answer Engine Optimization (AEO)

Answer Engine Optimization is a strategy focused on structuring content so it appears as direct answers in AI-powered search experiences, voice assistants,

2026-03-27

Citation Decay

Citation decay describes the natural decline in how often AI systems cite a particular piece of content over time. Research shows that 40 to 60 percent of

2026-03-27

Citation Rate

Citation rate measures how frequently AI search engines reference and link to a specific domain or page when generating answers. It is typically expressed

2026-03-27

ClaudeBot

ClaudeBot is Anthropic's web crawler, used to gather information for Claude AI models. Like GPTBot, it can be controlled via robots.txt. ClaudeBot behavior

2026-03-27

Content Architecture for AI

Content architecture for AI is the practice of structuring an entire website — not just individual pages — so that AI crawlers can efficiently parse, extra

2026-03-27

Content Freshness Signal

Content freshness signals indicate to AI systems that a piece of content is current and up-to-date. AI engines weigh recency when selecting sources for cit

2026-03-27

Data Void

A data void occurs when LLMs have weak, insufficient, or no reliable training data on a specific topic or query. Data voids represent strategic opportuniti

2026-03-27

DefinedTerm Schema

DefinedTerm Schema is structured data markup that identifies a word or phrase along with its formal definition. Used in glossary entries and knowledge base

2026-03-27

FAQPage Schema

FAQPage Schema is a structured data markup type from Schema.org that identifies question-and-answer content on a web page. Implementing FAQPage Schema in J

2026-03-27

Generative Engine Optimization (GEO)

Generative Engine Optimization is the practice of optimizing digital content and web presence so that AI-powered search platforms — including ChatGPT, Perp

2026-03-27

GPTBot

GPTBot is OpenAI's web crawler, identified by the user-agent string

2026-03-27

LLM Bot Telemetry

LLM bot telemetry is the analysis of web server access logs to understand how AI crawlers — such as GPTBot (OpenAI), ClaudeBot (Anthropic), Google-Extended

2026-03-27

llms.txt

llms.txt is an emerging standard for communicating website structure and purpose to LLM crawlers. Similar to how robots.txt tells search engines what to cr

2026-03-27

Schema Markup for AI Visibility

Schema markup for AI visibility involves implementing structured data types that help AI systems parse, understand, and cite web content. Key Schema types

2026-03-27

Semantic Recall

Semantic recall is the ability of an LLM to retrieve and cite content based on meaning, context, and intent rather than exact keyword matching. When a user

2026-03-27

Share of Voice (AI Search)

Share of voice in AI search measures the proportion of AI-generated responses in which a brand is mentioned or cited compared to competitors, for a given s

2026-03-27

Unified Visibility Score

A unified visibility score combines three optimization dimensions into a single composite metric: SEO (traditional search engine ranking performance), GEO

2026-03-27