Ecommerce Growth BlueprintEcommerce Growth BlueprintPowered by Little Shell, LLC

SEO + GEO Glossary

Plain-English definitions of the SEO and Generative Engine Optimization terms that show up in our audits. Use the quick-jump links to find a specific term, or scroll through everything.

AI Search & GEO

The emerging discipline of being visible inside ChatGPT, Perplexity, Gemini, and Microsoft Copilot answers.

Generative Engine Optimization (GEO)

#

The practice of optimizing a website so it gets cited by AI search assistants like ChatGPT, Perplexity, Gemini, and Microsoft Copilot when answering user questions. GEO is the AI-search equivalent of traditional SEO.

While SEO optimizes for ranking in Google's blue links, GEO optimizes for being quoted inside AI-generated answers. The key signals are different — FAQ schema, clear publisher entity, llms.txt, and citation-friendly content matter more than traditional ranking factors like backlinks.

AI search visibility

#

How often and prominently a website appears in answers generated by AI assistants (ChatGPT, Perplexity, Gemini, Microsoft Copilot, Apple Intelligence).

Unlike traditional Google search where you rank for a query, AI search visibility means being cited inside the AI's answer. AI assistants pick which sources to cite based on signals like schema markup, content structure, brand entity quality, and crawl accessibility.

llms.txt

#

A markdown file placed at the root of a website (`/llms.txt`) that tells LLM crawlers which pages on the site matter most and what they cover. Similar to robots.txt but specifically for AI crawlers.

Originally proposed by Jeremy Howard, llms.txt is an emerging convention adopted by companies like Anthropic and Vercel. It gives AI assistants a curated index of important content so they cite the right URLs when answering questions about a brand or product.

AI crawlers (GPTBot, ClaudeBot, PerplexityBot)

#

User-agents operated by AI companies that crawl the web to populate their search indexes and answer queries in real time. Major ones include OpenAI's GPTBot and ChatGPT-User, Anthropic's ClaudeBot, Perplexity's PerplexityBot, and Google's Google-Extended.

Each AI crawler can be allowed or blocked individually via robots.txt. Sites that want to be cited in AI search results should explicitly allow these crawlers — blocking them removes the site from AI search visibility entirely.

SEO Fundamentals

The core concepts that decide whether your store can be found at all in Google search.

Canonical tag

#

An HTML tag (`<link rel="canonical">`) that tells search engines which URL is the official version of a page when multiple URLs lead to the same content. Prevents duplicate-content penalties.

Common case: Shopify product variants (?variant=12345) create unique URLs for the same product. Without a canonical tag, Google sees them as duplicate competing pages, splitting ranking authority. The canonical tag tells Google to consolidate the ranking signal back to the parent URL.

Keyword cannibalization

#

When two or more pages on the same site target identical or very similar keywords, causing them to compete with each other in search results. Google can't decide which one is the right answer, so it ranks neither well.

Common in ecommerce when similar products use overlapping titles (e.g., "Vanilla Bourbon" and "Vanilla Spice" both targeting "vanilla candle"). Fix by deliberately differentiating each page's keyword focus, consolidating into one page, or using canonical tags.

Long-tail keyword

#

A specific, multi-word search query — usually three or more words — that has lower search volume but higher buyer intent than broad terms. Example: "vanilla bourbon soy candle 8oz" vs. just "candle."

Long-tail keywords are easier to rank for (less competition) and convert better (more specific intent). For new ecommerce stores, long-tail SEO is the primary path to organic traffic in the first 12 months while brand awareness builds.

E-E-A-T (Experience, Expertise, Authoritativeness, Trustworthiness)

#

Google's framework for evaluating content quality, with particular weight for "Your Money or Your Life" topics like health and finance. Sites with clear authorship, expertise signals, and trust markers rank higher.

E-E-A-T is improved by named human authors, About pages, founder profiles, professional credentials, transparent contact info, and structured data that establishes the publisher's identity. AI assistants weight these signals heavily too when choosing what to cite.

H1 tag

#

The primary heading on a webpage, marked with the `<h1>` HTML tag. Google reads H1 as the strongest single signal of what the page is about, so every page should have exactly one.

Common mistake: having two H1 tags on a page (often from a theme bug or an announcement bar). This confuses Google about the page topic and can suppress ranking by 15–30%. Modern best practice is one H1 per page, containing the primary keyword.

Meta description

#

A 140–160-character HTML tag (`<meta name="description">`) that summarizes a page's content. Doesn't directly affect ranking but heavily influences click-through rate by appearing as the description snippet in Google search results.

Auto-generated meta descriptions usually pull the first sentence of page content, which is rarely optimized for the search snippet. Custom meta descriptions written for each page significantly lift click-through rate from search results.

Alt text

#

A descriptive caption attached to an HTML image (`<img alt="...">`) that helps search engines understand non-text content and provides accessibility for screen-reader users.

Beyond accessibility, alt text is a real ranking signal for Google Image Search and is one of the few content cues AI search engines use to understand non-text content on a page. Descriptive, specific alt text ("Vanilla Bourbon candle on a wooden table at dusk") beats generic alt text ("product photo").

Core Web Vitals

Google's set of page-speed and stability metrics. Real ranking factors for mobile.

Largest Contentful Paint (LCP)

#

A Core Web Vital that measures how long it takes for the largest visible content element on a page (usually a hero image or headline) to render. Google considers LCP "good" under 2.5 seconds.

Pages with LCP over 4 seconds typically trigger a mobile ranking penalty. Most LCP problems come from oversized, unoptimized hero images. Compressing images to under 200KB and serving them at the right dimensions is the most common fix.

Cumulative Layout Shift (CLS)

#

A Core Web Vital that measures how much the page layout jumps around during loading. Lower is better — Google considers CLS under 0.1 "good."

High CLS scores come from images without dimensions, late-loading fonts, and ads that get inserted after page load. Setting explicit width and height on all images is the simplest fix.

Interaction to Next Paint (INP)

#

A Core Web Vital introduced in March 2024 (replacing First Input Delay) that measures the responsiveness of a page to user interactions. Google considers INP under 200ms "good."

INP is heavily affected by JavaScript execution. Sites with too many tracking scripts, chat widgets, or unoptimized frontend code tend to have poor INP. Auditing and removing third-party scripts is the most common path to improvement.

Schema.org & Structured Data

Machine-readable tags that tell search engines and AI assistants what a page is about.

Schema.org structured data

#

A standardized vocabulary of tags (usually rendered as JSON-LD inside `<script>` tags) that tells search engines what a page is about in machine-readable form. Common types: Product, Article, FAQPage, Organization.

Structured data unlocks Google's "rich results" — star ratings, prices, breadcrumbs, FAQ accordions, and other enhanced search appearances. It's also one of the strongest signals AI search assistants use to decide what to cite in their answers.

FAQPage schema

#

A Schema.org type that marks up Q&A content on a page so search engines can display individual questions and answers directly in search results (as expandable FAQ snippets) and AI assistants can cite specific answers.

One of the highest-leverage AI search additions any site can make. AI assistants disproportionately cite Q&A content marked up with FAQPage schema — adding it to your homepage FAQ section can measurably improve AI citation rates within 30–60 days.

HowTo schema

#

A Schema.org type that marks up step-by-step instructions so search engines can display them as a "Things to do" rich result and AI assistants can cite individual steps.

Particularly powerful when paired with action plans, tutorials, or how-to guides. Each HowToStep becomes independently quotable, which is exactly what AI search assistants need when answering "how do I…?" questions about your product or category.

Ecommerce SEO

Issues and opportunities specific to online stores — especially Shopify, WooCommerce, and marketplace sellers.

Shopify variant URL

#

A URL Shopify generates for each product variant (e.g., ?variant=12345 for size or color). Without a canonical tag, these URLs are indexed as separate competing pages, splitting ranking authority across variants of the same product.

The fix is a one-line theme edit: adding a canonical link tag in theme.liquid that points all variant URLs back to the parent product URL. This consolidates ranking signals into a single strong page instead of several weak ones.

Merchant listings / Product rich results

#

Google search results that display product information (price, availability, ratings, shipping) directly under the page title. Powered by Product schema markup combined with merchant feed data.

To qualify, Product schema must include offers, aggregateRating (or review), image, brand, and shipping/return policy fields. Stores that get this right typically see substantial click-through-rate lifts on product page search results.

Want these terms applied to your store?

The Ecommerce Growth Blueprint diagnoses these issues on your actual store and gives you step-by-step instructions to fix them. $97 one-time, delivered in minutes.

Start your audit

Or preview a sample audit first.