Skip to main content
  • Home
  • Features
  • How It Works
  • Pricing
  • Blog
  • FAQ
Log In Start Free

SEO & GEO Glossary: 50 Terms Every Marketer Should Know

Published by SiteCrawlIQ Team

SEO & GEO Glossary: 50 Terms Every Marketer Should Know

The search optimization landscape has expanded significantly with the rise of AI-powered search. This glossary covers the essential terminology across traditional SEO, GEO (Generative Engine Optimization), and AEO (Answer Engine Optimization) - the three pillars of modern search visibility.

A

AEO (Answer Engine Optimization)


The practice of optimizing content to appear as direct answers in search features like Google's Featured Snippets, People Also Ask boxes, and voice assistant responses. AEO focuses on question-answer formatting and concise, factual content structure.

AI Crawlers


Automated bots used by AI companies to index web content for their language models. Major AI crawlers include GPTBot (OpenAI), ClaudeBot (Anthropic), PerplexityBot (Perplexity), and Google-Extended (Google Gemini). Blocking these crawlers in robots.txt prevents your content from being cited by AI search engines.

AI Overview


Google's AI-generated summary that appears at the top of search results for many queries. AI Overviews synthesize information from multiple sources and currently appear in 47%+ of search results. Formerly known as SGE (Search Generative Experience).

Alt Text


Descriptive text associated with an image that helps search engines understand image content and provides accessibility for screen reader users. Every meaningful image should have unique, descriptive alt text.

Anchor Text


The clickable text in a hyperlink. Descriptive anchor text helps search engines understand the topic of the linked page. "Click here" provides no SEO value; "SEO audit guide" does.

B

Backlink


A link from another website to yours. Backlinks remain one of the strongest traditional SEO ranking factors, signaling authority and trustworthiness to search engines.

Bounce Rate


The percentage of visitors who leave a page without taking any action. High bounce rates (above 70% for content pages) may indicate poor content relevance, slow load times, or user experience issues.

Breadcrumb Navigation


A secondary navigation element showing the user's location within a site hierarchy (Home > Blog > SEO Guide). BreadcrumbList schema markup helps search engines display this hierarchy in search results.

C

Canonical Tag


An HTML element (rel="canonical") that tells search engines which version of a page is the "primary" one when multiple versions exist. Essential for preventing duplicate content issues caused by URL parameters, pagination, or content syndication.

Citability


A measure of how likely content is to be cited by AI-powered search engines. Content with high citability typically features answer-first structure, factual density, clear headings, lists/tables, and freshness signals. SiteCrawlIQ scores citability on a 0-100 scale.

CLS (Cumulative Layout Shift)


A Core Web Vitals metric measuring visual stability. It quantifies how much page elements shift during loading. Good: under 0.1. Caused by images without dimensions, dynamically injected content, and late-loading ads.

Content Freshness


A ranking signal based on how recently content was created or updated. AI platforms cite content that is 25.7% fresher than what traditional search surfaces. Indicated by publish dates, "last updated" timestamps, and current data.

Core Web Vitals


Google's set of metrics for measuring real-world user experience: LCP (loading), INP (interactivity), and CLS (visual stability). A confirmed ranking factor that affects both traditional rankings and AI Overview eligibility.

Crawl Budget


The number of pages a search engine will crawl on your site within a given time period. Large sites with inefficient URL structures (faceted navigation, URL parameters) can exhaust crawl budget before important pages are indexed.

D-E

Duplicate Content


Identical or near-identical content appearing at multiple URLs. Causes ranking signal dilution and can result in the wrong version of a page being indexed. Resolved with canonical tags, 301 redirects, or noindex directives.

E-E-A-T (Experience, Expertise, Authoritativeness, Trustworthiness)


Google's framework for evaluating content quality. Not a direct ranking factor but a guiding principle for quality raters. Demonstrated through author credentials, original research, expert quotes, and transparent sourcing.

F

Featured Snippet


A special search result block that appears above organic results, directly answering a query. Also called "position zero." Optimized through concise, answer-first content formatting with clear headings.

G

GEO (Generative Engine Optimization)


The practice of optimizing content to be discovered, cited, and recommended by AI-powered search engines (ChatGPT, Perplexity, Google AI Overviews, Claude). GEO encompasses AI crawler access, schema markup, content citability, llms.txt, and structured content formatting. The GEO market is valued at $1.48 billion in 2026.

Google Search Console


Google's free tool for monitoring your site's search performance, index coverage, and technical issues. Essential for tracking keyword rankings, impressions, clicks, and Core Web Vitals field data.

H-I

Heading Hierarchy


The structured use of HTML heading tags (H1 through H6) to organize page content. Proper hierarchy (H1 > H2 > H3) helps search engines understand content structure. Pages with correct heading hierarchy are 2.8x more likely to be cited by AI engines.

HTTPS


The secure version of HTTP that encrypts data between browser and server. Required for all pages - Google uses HTTPS as a ranking signal and Chrome marks HTTP pages as "Not Secure."

INP (Interaction to Next Paint)


A Core Web Vitals metric measuring page interactivity. It quantifies the time from a user interaction (click, tap, keypress) to the visual response. Good: under 200ms. Replaced FID (First Input Delay) in 2024.

Index Coverage


The number of your pages that search engines have added to their index (and thus can show in search results). Monitored through Google Search Console's index coverage report.

Internal Link


A hyperlink from one page of your website to another page on the same website. Internal links distribute link equity, help crawlers discover pages, and establish content relationships.

J-K

JSON-LD (JavaScript Object Notation for Linked Data)


The preferred format for implementing structured data on web pages. JSON-LD is embedded in a script tag in the page's HTML head and describes content types (Organization, Article, Product, FAQ, etc.) in a machine-readable format.

Knowledge Panel


A Google Search feature showing a summary of information about an entity (person, business, organization) on the right side of search results. Triggered by consistent entity signals including Organization schema, Wikipedia entries, and sameAs links.

Keyword Cannibalization


When multiple pages on the same site target the same keyword, splitting ranking potential. Resolved by consolidating competing pages or differentiating their target keywords.

L

LCP (Largest Contentful Paint)


A Core Web Vitals metric measuring perceived loading speed. It marks the time when the largest visible content element (hero image, headline text block) finishes rendering. Good: under 2.5 seconds.

Link Equity


The ranking value passed from one page to another through hyperlinks. Also called "link juice." Internal linking distributes link equity from high-authority pages to deeper pages.

llms.txt


A plain-text file placed at a website's root (yoursite.com/llms.txt) that provides AI language models with a structured description of the site. Analogous to robots.txt for AI search. Includes business description, key pages, pricing, and contact information.

M-N

Meta Description


An HTML meta tag providing a summary of the page's content. Displayed as the snippet in search results. Should be 120-160 characters, unique per page, and include target keywords naturally.

Mobile-First Indexing


Google's approach of using the mobile version of a page for indexing and ranking. Since 2023, all Google indexing is mobile-first, meaning your site's mobile experience directly determines your rankings.

Noindex


A robots meta tag or HTTP header directive telling search engines not to include a page in their index. Used for pages that should exist but not appear in search results (admin pages, thank-you pages, thin tag archives).

O-P

OpenGraph Tags


HTML meta tags that control how content appears when shared on social media platforms (Facebook, LinkedIn). Key tags: og:title, og:description, og:image, og:url. Also used by AI engines as content signals.

Orphan Page


A page that exists on your site but has no internal links pointing to it. Search engines struggle to discover orphan pages, and they receive no internal link equity. Common for older blog posts or deprecated product pages.

PageRank


Google's original algorithm for measuring page importance based on the quantity and quality of links pointing to it. While the public PageRank metric was retired, the underlying algorithm still influences Google's ranking system.

R-S

Rich Results


Enhanced search results that display extra information beyond the standard title, URL, and description. Examples: star ratings, FAQ dropdowns, recipe cards, event dates. Triggered by valid structured data (JSON-LD).

robots.txt


A text file at your website's root that tells search engine crawlers which pages they can and cannot access. In 2026, robots.txt is critical for both traditional SEO (Googlebot, Bingbot) and GEO (GPTBot, ClaudeBot).

Schema Markup


Structured data vocabulary (schema.org) that helps search engines understand the content and context of web pages. Implemented via JSON-LD format. Key types: Organization, WebSite, Article, Product, FAQPage, BreadcrumbList.

SERP (Search Engine Results Page)


The page displayed by a search engine in response to a query. Modern SERPs include organic results, ads, featured snippets, AI Overviews, People Also Ask, knowledge panels, and other features.

Sitemap (XML)


An XML file listing all the important URLs on your site, helping search engines discover and crawl your content efficiently. Should include only canonical, indexable URLs and be submitted to Search Console.

Structured Data


Machine-readable code (typically JSON-LD) added to web pages to help search engines understand content context. Powers rich results in Google and provides entity context for AI engines.

T-W

Title Tag


An HTML element specifying the title of a web page. Displayed as the clickable headline in search results. Should be under 60 characters, unique per page, and include target keywords near the beginning.

TTFB (Time to First Byte)


The time between a browser requesting a page and receiving the first byte of the response. Reflects server performance. Target: under 200ms. A slow TTFB delays all subsequent loading metrics.

Twitter Card


HTML meta tags controlling how content appears when shared on X (formerly Twitter). Types: summary, summary_large_image, app, player. Similar to OpenGraph tags but platform-specific.

URL Parameter


A key-value pair appended to a URL after a question mark (e.g., ?sort=price&color=blue). Can create duplicate content issues if not handled with canonical tags or parameter handling configuration.

Voice Search


Search queries spoken to virtual assistants (Siri, Google Assistant, Alexa). Voice queries tend to be longer and more conversational. Optimized through question-answer content structure and Featured Snippet targeting.

Web Vitals


See Core Web Vitals. The broader Web Vitals initiative by Google encompasses Core Web Vitals (LCP, INP, CLS) plus additional metrics like TTFB and FCP (First Contentful Paint).

Key Takeaways

  • Modern search optimization spans three disciplines: SEO (traditional search), GEO (AI search), and AEO (answer engines)

  • Technical terms like Core Web Vitals, JSON-LD, and canonical tags have direct ranking impact - they're not academic concepts

  • AI-specific terms (llms.txt, citability, AI crawlers, GEO) represent the fastest-growing area of search optimization

  • Understanding these terms is the foundation for executing the strategies that drive organic visibility
  • FAQ

    What's the difference between SEO, GEO, and AEO?

    SEO optimizes for traditional search engine rankings (Google, Bing). GEO optimizes for AI-powered search engines (ChatGPT, Perplexity, AI Overviews). AEO optimizes for direct-answer features (Featured Snippets, People Also Ask, voice search). All three overlap significantly in practice.

    Which terms are most important for beginners to learn first?

    Start with the technical foundations: title tags, meta descriptions, canonical tags, sitemap, robots.txt, and Core Web Vitals. Then learn the GEO-specific terms: llms.txt, AI crawlers, citability, and JSON-LD schema.

    How does SiteCrawlIQ relate to these terms?

    SiteCrawlIQ audits your site for virtually every concept in this glossary - from Core Web Vitals and schema validation to llms.txt presence and content citability. The tool translates these technical concepts into actionable recommendations.

    Are there new terms I should watch for in 2026?

    Yes. "Multi-modal SEO" (optimizing for image/video AI search), "entity graphs" (connected structured data across pages), and "AI brand safety" (controlling how AI engines represent your brand) are emerging concepts gaining traction.

    ---

    Put these terms into practice. Run a free site audit at [SiteCrawlIQ](https://sitecrawliq.com) and see how your site scores across SEO, GEO, and performance metrics.

    See Your Site's Real SEO Data

    Stop guessing and start with real crawl data. SiteCrawlIQ combines traditional SEO auditing with GEO readiness scoring, structured data validation, and Core Web Vitals monitoring. Our hybrid crawler renders JavaScript pages, checks your llms.txt file, validates schema markup, and scores your content for AI engine citability. Get a comprehensive health score across seven weighted categories, plus a prioritized action plan generated by GPT-5 analysis of your actual crawl data.

    Start Your Free Audit
    • Features
    • Pricing
    • How It Works
    • Blog
    • FAQ
    • Help Center
    • API Docs
    • Privacy Policy
    • Terms of Service
    • What Is GEO?
    • SEO vs GEO
    • Audit Checklist
    • AI Crawler Guide

    SiteCrawlIQ - AI-powered SEO and GEO audit platform.

    SiteCrawlIQ, Inc. | support@sitecrawliq.com | 1-800-555-1234