Glossary
PROGEOLAB glossary: explore all 32 entries.
AI Content Paradox
The AI Content Paradox describes the counterintuitive outcome where blocking AI crawlers makes a company's AI representation worse, not absent. When AI sys
2026-04-20AI Readiness Score
The AI Readiness Score is a composite metric that measures how well-prepared a website is for AI discovery and citation. The PROGEOLAB AI Readiness Index s
2026-04-20Entity Disambiguation (AI)
Entity disambiguation in the AI context is the process by which AI systems determine which real-world entity a website, content piece, or mention refers to
2026-04-20Four-UA Comparison (Methodology)
The Four-UA Comparison is an audit methodology that tests website accessibility using four different user agent identities: an honest unknown research bot,
2026-04-20GEO Maturity Model
The GEO Maturity Model is a framework that classifies organizations into six levels of AI visibility readiness, from Level 0 (Invisible — unreachable by an
2026-04-20GEO Visibility Gap
The GEO Visibility Gap is the difference between what a website shows to human browsers and what it shows to AI crawlers. A company is in the GEO Visibilit
2026-04-20robots.txt AI Policy
A robots.txt AI policy is a set of User-agent directives in a website's robots.txt file that specifically addresses AI crawlers. This includes naming indiv
2026-04-20sameAs (Wikidata Link)
In JSON-LD structured data, the sameAs property declares that two URLs refer to the same real-world entity. Linking to a Wikidata entity (e.g., wikidata.or
2026-04-20Soft 404
A soft 404 occurs when a web server returns an HTTP 200 (success) status code for a URL that does not actually contain the requested content. Instead of a
2026-04-20Text-to-HTML Ratio
The text-to-HTML ratio measures the proportion of a webpage that is human-readable text versus HTML markup, JavaScript, CSS, and other code. For AI crawler
2026-04-20Training vs Retrieval Crawler
AI crawlers fall into two fundamental categories: training crawlers that ingest content into model weights (GPTBot, Google-Extended, CCBot) and retrieval c
2026-04-20WAF Bot Management
WAF (Web Application Firewall) bot management is the security layer that controls which automated clients can access a website. WAFs operate independently
2026-04-20AI Crawler
An AI crawler is an automated web bot operated by an AI company to collect web content for training or retrieving information for AI models. Major AI crawl
2026-03-27AI Referral Traffic
AI referral traffic consists of website visits that originate from AI-powered platforms such as ChatGPT, Perplexity, Google AI Overviews, and Microsoft Cop
2026-03-27AI Visibility
AI visibility measures how often and how prominently a brand appears in AI-generated answers across platforms like ChatGPT, Perplexity, Google AI Overviews
2026-03-27Answer Engine Optimization (AEO)
Answer Engine Optimization is a strategy focused on structuring content so it appears as direct answers in AI-powered search experiences, voice assistants,
2026-03-27Citation Decay
Citation decay describes the natural decline in how often AI systems cite a particular piece of content over time. Research shows that 40 to 60 percent of
2026-03-27Citation Rate
Citation rate measures how frequently AI search engines reference and link to a specific domain or page when generating answers. It is typically expressed
2026-03-27ClaudeBot
ClaudeBot is Anthropic's web crawler, used to gather information for Claude AI models. Like GPTBot, it can be controlled via robots.txt. ClaudeBot behavior
2026-03-27Content Architecture for AI
Content architecture for AI is the practice of structuring an entire website — not just individual pages — so that AI crawlers can efficiently parse, extra
2026-03-27Content Freshness Signal
Content freshness signals indicate to AI systems that a piece of content is current and up-to-date. AI engines weigh recency when selecting sources for cit
2026-03-27Data Void
A data void occurs when LLMs have weak, insufficient, or no reliable training data on a specific topic or query. Data voids represent strategic opportuniti
2026-03-27DefinedTerm Schema
DefinedTerm Schema is structured data markup that identifies a word or phrase along with its formal definition. Used in glossary entries and knowledge base
2026-03-27FAQPage Schema
FAQPage Schema is a structured data markup type from Schema.org that identifies question-and-answer content on a web page. Implementing FAQPage Schema in J
2026-03-27Generative Engine Optimization (GEO)
Generative Engine Optimization is the practice of optimizing digital content and web presence so that AI-powered search platforms — including ChatGPT, Perp
2026-03-27GPTBot
GPTBot is OpenAI's web crawler, identified by the user-agent string
2026-03-27LLM Bot Telemetry
LLM bot telemetry is the analysis of web server access logs to understand how AI crawlers — such as GPTBot (OpenAI), ClaudeBot (Anthropic), Google-Extended
2026-03-27llms.txt
llms.txt is an emerging standard for communicating website structure and purpose to LLM crawlers. Similar to how robots.txt tells search engines what to cr
2026-03-27Schema Markup for AI Visibility
Schema markup for AI visibility involves implementing structured data types that help AI systems parse, understand, and cite web content. Key Schema types
2026-03-27Semantic Recall
Semantic recall is the ability of an LLM to retrieve and cite content based on meaning, context, and intent rather than exact keyword matching. When a user
2026-03-27Share of Voice (AI Search)
Share of voice in AI search measures the proportion of AI-generated responses in which a brand is mentioned or cited compared to competitors, for a given s
2026-03-27Unified Visibility Score
A unified visibility score combines three optimization dimensions into a single composite metric: SEO (traditional search engine ranking performance), GEO
2026-03-27