The text-to-HTML ratio measures the proportion of a webpage that is human-readable text versus HTML markup, JavaScript, CSS, and other code. For AI crawlers that do not execute JavaScript, a low text ratio means less extractable content. Pages with ratios below 5% are effectively invisible to most AI systems.