重要前提
安装AI Skills的关键前提是:必须科学上网,且开启TUN模式,这一点至关重要,直接决定安装能否顺利完成,在此郑重提醒三遍:科学上网,科学上网,科学上网。查看完整安装教程 →
geo-citability by zubair-trabzada/geo-seo-claude
npx skills add https://github.com/zubair-trabzada/geo-seo-claude --skill geo-citabilityAI 语言模型会引用符合特定结构标准的段落。普林斯顿大学、佐治亚理工学院和印度理工学院德里分校(2024年)的研究发现,经过 GEO 优化的内容在 AI 生成的回答中能获得高出 30-115% 的可见度。关键发现是:AI 系统优先提取和引用那些长度为 134-167 个单词、自包含(无需上下文即可理解)、事实丰富(包含具体统计数据、日期或命名实体)且在开头 1-2 句话内直接回答问题的段落。
这与传统的 SEO 文案写作有根本区别,后者优化的是关键词密度和用户参与度指标。GEO 可引用性优化的是可提取性——即 AI 系统从您的内容中提取段落并将其作为直接答案呈现的难易程度。
此项衡量内容是否包含清晰、可引用的答案段落,以便 AI 系统能够逐字提取。
评分标准:
| 分数 | 标准 |
|---|---|
| 90-100 | 每个主要部分都以 1-2 句直接答案开头。使用 "X 是..." 或 "X 指的是..." 等模式。每个部分的前 40-60 个单词可以独立作为一个完整的答案。 |
| 70-89 | 大多数部分有清晰的答案开头。存在一些定义模式。答案可识别,但可能需要少量上下文。 |
| 50-69 | 部分部分有类似答案的开头,但许多将答案埋藏在段落中间或末尾。很少有明确的定义模式。 |
| 30-49 | 答案通常埋藏在长段落中。没有一致的定义模式。内容是叙述驱动的,而非答案驱动的。 |
广告位招租
在这里展示您的产品或服务
触达数万 AI 开发者,精准高效
| 没有可识别的答案块。内容完全是叙述性、对话式或碎片化的。AI 难以提取任何可引用的段落。 |
需要关注的点:
高可引用性示例:
Content delivery networks (CDNs) are distributed server systems that cache and serve
web content from locations geographically close to end users. A CDN reduces latency
by 50-70% on average by serving assets from edge servers rather than a single origin
server. The three largest CDN providers as of 2025 are Cloudflare (serving approximately
20% of all websites), Amazon CloudFront, and Akamai Technologies.
单词数:58。自包含:是。事实:3 个具体数据点。定义模式:是。
低可引用性示例:
If you've ever wondered why some websites load faster than others, the answer might
surprise you. There's this amazing technology that has been around for a while now.
It's changed the way we think about web performance. Let me explain how it works and
why you should care about it for your business.
单词数:52。自包含:否(未明确主题)。事实:0。定义模式:否。
此项衡量单个段落能否在不依赖周围内容的情况下被提取和理解。
评分标准:
| 分数 | 标准 |
|---|---|
| 90-100 | 80%+ 的内容块是完全自包含的。每个段落都明确命名其主题。不依赖指代先前内容的代词。段落内包含具体事实。 |
| 70-89 | 60-79% 的内容块是自包含的。大多数段落命名了其主题。偶尔有需要上下文的代词指代。 |
| 50-69 | 40-59% 的内容块是自包含的。混合使用明确主语和代词。有些段落需要阅读前面的部分。 |
| 30-49 | 20-39% 的内容块是自包含的。严重依赖代词和上下文指代。大多数段落需要周围文本。 |
| 0-29 | 低于 20% 自包含。内容读起来是连续的叙述,提取任何段落都会失去意义。 |
每个段落的自包含性检查清单:
此项衡量有助于 AI 系统解析和分割内容的结构格式。
评分标准:
| 分数 | 标准 |
|---|---|
| 90-100 | 清晰的 H1 > H2 > H3 层级结构。信息性内容使用基于问题的标题。短段落(2-4 句)。使用表格进行比较。使用有序列表表示流程。使用无序列表表示功能/选项。 |
| 70-89 | 良好的标题层级结构,有少量跳跃。部分基于问题的标题。主要是短段落。使用了一些表格和列表。 |
| 50-69 | 存在标题层级结构但不一致。很少有基于问题的标题。长短段落混合。有限的表格/列表使用。 |
| 30-49 | 标题结构最少。没有基于问题的标题。长段落占主导。很少使用表格/列表。 |
| 0-29 | 没有标题结构或层级结构严重断裂。大段文字。没有表格或列表。 |
AI 可引用性的结构最佳实践:
此项衡量是否存在具体的、可验证的数据点,这是 AI 系统在选择引用来源时优先考虑的因素。
评分标准:
| 分数 | 标准 |
|---|---|
| 90-100 | 每 500 个单词包含 5+ 个具体统计数据。所有主张都有命名来源或日期支持。使用确切数字(而非 "许多" 或 "几个")。包含百分比、金额、时间范围和命名研究。 |
| 70-89 | 每 500 个单词包含 3-4 个统计数据。大多数主张有来源支持。主要是具体数字,偶尔有模糊的量化词。 |
| 50-69 | 每 500 个单词包含 1-2 个统计数据。部分主张有来源支持。具体和模糊数字混合。 |
| 30-49 | 每 500 个单词少于 1 个统计数据。很少有主张有来源支持。主要是模糊的量化词。 |
| 0-29 | 没有统计数据。没有主张有来源支持。所有量化词都是模糊的("许多"、"大多数"、"很多")。 |
什么算作统计数据:
什么不算:
此项衡量内容是否提供了 AI 系统在其他地方找不到的信息,使其成为必要的引用来源。
评分标准:
| 分数 | 标准 |
|---|---|
| 90-100 | 包含第一方研究、专有数据、原始调查或独特数据集。提供在其他任何页面都找不到的分析或见解。有清晰的方法论描述。 |
| 70-89 | 包含一些原始见解或对现有数据的独特分析。提供带有原创示例的独特视角。 |
| 50-69 | 主要是综合现有信息,但增加了一些独特的评论或示例。 |
| 30-49 | 主要是衍生内容,以最少的原创贡献重述常识。 |
| 0-29 | 完全是衍生的。所有信息都可以在更高权威的来源上找到(通常是逐字逐句的)。 |
独特内容的信号:
对于每个内容块,计算:
块可引用性分数 = (答案 * 0.30) + (自包含 * 0.25) + (结构 * 0.20) + (统计 * 0.15) + (独特 * 0.10)
对于每个得分低于 60 的块,生成具体的重写建议:
生成一个名为 GEO-CITABILITY-SCORE.md 的文件:
# AI 可引用性分析:[页面标题]
**URL:** [URL]
**分析日期:** [日期]
**总体可引用性分数: [X]/100**
**可引用性覆盖率:** [X]% 的内容块得分高于 70
---
## 分数摘要
| 类别 | 分数 | 权重 | 加权分数 |
|---|---|---|---|
| 答案块质量 | [X]/100 | 30% | [X] |
| 段落自包含性 | [X]/100 | 25% | [X] |
| 结构可读性 | [X]/100 | 20% | [X] |
| 统计密度 | [X]/100 | 15% | [X] |
| 独特性和原始数据 | [X]/100 | 10% | [X] |
| **总体** | | | **[X]/100** |
---
## 最强的内容块
### 1. "[标题]" -- 分数: [X]/100
> [块的前 2 句话]
**为什么有效:** [解释]
### 2. "[标题]" -- 分数: [X]/100
> [块的前 2 句话]
**为什么有效:** [解释]
---
## 最弱的内容块(重写优先级)
### 1. "[标题]" -- 分数: [X]/100
**当前开头:**
> [现有的前 2 句话]
**问题:** [具体问题 -- 答案埋藏、没有事实等]
**建议重写:**
> [使用答案优先模式和事实重写的开头 2-3 句话]
**额外改进:**
- [添加比较 X, Y, Z 的表格]
- [包含关于 ... 的统计数据]
- [将长段落拆分为 2-3 个较短的段落]
---
## 快速改进的格式建议
1. **[具体建议]** -- 预期可引用性提升: +[X] 分
2. **[具体建议]** -- 预期可引用性提升: +[X] 分
3. **[具体建议]** -- 预期可引用性提升: +[X] 分
4. **[具体建议]** -- 预期可引用性提升: +[X] 分
5. **[具体建议]** -- 预期可引用性提升: +[X] 分
---
## 各部分分数
| 部分标题 | 单词数 | 答案质量 | 自包含 | 结构 | 统计 | 独特 | 总体 |
|---|---|---|---|---|---|---|---|
| [H2 标题] | [N] | [X] | [X] | [X] | [X] | [X] | [X] |
| AI 系统 | 引用偏好 |
|---|---|
| ChatGPT (搜索) | 偏好具有明确定义、命名来源和近期日期的段落。倾向于每次回答引用 2-4 个来源。 |
| Perplexity | 非常偏好事实密集、包含统计数据的段落。每次回答引用 4-8 个来源。高度重视时效性。 |
| Claude | 偏好结构良好、全面的段落。重视细微差别和准确性胜过简洁性。 |
| Gemini (AI 概述) | 偏好简洁的答案块(40-60 个单词)。重视已经在有机搜索结果前 10 名中的内容。 |
| Copilot (Bing) | 与 Gemini 类似。偏好来自高权威域名、具有明确事实主张的段落。 |
每周安装量
55
仓库
GitHub 星标数
3.7K
首次出现
Feb 27, 2026
安全审计
安装于
codex54
opencode54
kimi-cli52
gemini-cli52
amp52
cline52
AI language models cite passages that meet specific structural criteria. Research from Princeton, Georgia Tech, and IIT Delhi (2024) found that GEO-optimized content achieves 30-115% higher visibility in AI-generated responses. The key finding: AI systems preferentially extract and cite passages that are 134-167 words long , self-contained (understandable without surrounding context), fact-rich (containing specific statistics, dates, or named entities), and directly answer a question in the first 1-2 sentences.
This is fundamentally different from traditional SEO copywriting, which optimizes for keyword density and user engagement metrics. GEO citability optimizes for extractability -- the ease with which an AI system can pull a passage from your content and present it as a direct answer.
This measures whether content contains clear, quotable answer passages that AI systems can extract verbatim.
Scoring Criteria:
| Score | Criteria |
|---|---|
| 90-100 | Every major section opens with a 1-2 sentence direct answer. Uses "X is..." or "X refers to..." patterns. First 40-60 words of each section can stand alone as a complete answer. |
| 70-89 | Most sections have clear answer openings. Some definition patterns present. Answers are identifiable but may need minor context. |
| 50-69 | Some sections have answer-like openings but many bury the answer in the middle or end of paragraphs. Few explicit definition patterns. |
| 30-49 | Answers are generally buried in long paragraphs. No consistent definition patterns. Content is narrative-driven rather than answer-driven. |
| 0-29 | No identifiable answer blocks. Content is entirely narrative, conversational, or fragmented. AI would struggle to extract any quotable passage. |
What to look for:
High-citability example:
Content delivery networks (CDNs) are distributed server systems that cache and serve
web content from locations geographically close to end users. A CDN reduces latency
by 50-70% on average by serving assets from edge servers rather than a single origin
server. The three largest CDN providers as of 2025 are Cloudflare (serving approximately
20% of all websites), Amazon CloudFront, and Akamai Technologies.
Word count: 58. Self-contained: Yes. Facts: 3 specific data points. Definition pattern: Yes.
Low-citability example:
If you've ever wondered why some websites load faster than others, the answer might
surprise you. There's this amazing technology that has been around for a while now.
It's changed the way we think about web performance. Let me explain how it works and
why you should care about it for your business.
Word count: 52. Self-contained: No (no topic identified). Facts: 0. Definition pattern: No.
This measures whether individual passages can be extracted and understood without needing the surrounding content.
Scoring Criteria:
| Score | Criteria |
|---|---|
| 90-100 | 80%+ of content blocks are fully self-contained. Each passage names its subject explicitly. No reliance on pronouns referencing earlier content. Contains specific facts within the passage. |
| 70-89 | 60-79% of content blocks are self-contained. Most passages name their subject. Occasional pronoun references that require context. |
| 50-69 | 40-59% of content blocks are self-contained. Mixed use of explicit subjects and pronouns. Some passages require reading prior sections. |
| 30-49 | 20-39% of content blocks are self-contained. Heavy reliance on pronouns and contextual references. Most passages need surrounding text. |
| 0-29 | Under 20% self-contained. Content reads as a continuous narrative where extracting any paragraph loses meaning. |
Self-containment checklist for each passage:
This measures the structural formatting that helps AI systems parse and segment content.
Scoring Criteria:
| Score | Criteria |
|---|---|
| 90-100 | Clean H1 > H2 > H3 hierarchy. Question-based headings for informational content. Short paragraphs (2-4 sentences). Tables for comparisons. Ordered lists for processes. Unordered lists for features/options. |
| 70-89 | Good heading hierarchy with minor skips. Some question-based headings. Mostly short paragraphs. Some use of tables and lists. |
| 50-69 | Heading hierarchy present but inconsistent. Few question-based headings. Mix of short and long paragraphs. Limited tables/lists. |
| 30-49 | Minimal heading structure. No question-based headings. Long paragraphs dominate. Rare use of tables/lists. |
| 0-29 | No heading structure or severely broken hierarchy. Wall-of-text paragraphs. No tables or lists. |
Structural best practices for AI citability:
This measures the presence of specific, verifiable data points that AI systems prioritize when selecting citation sources.
Scoring Criteria:
| Score | Criteria |
|---|---|
| 90-100 | 5+ specific statistics per 500 words. All claims backed by named sources or dates. Uses exact numbers (not "many" or "several"). Includes percentages, dollar amounts, timeframes, and named studies. |
| 70-89 | 3-4 statistics per 500 words. Most claims have sources. Mostly specific numbers with occasional vague quantifiers. |
| 50-69 | 1-2 statistics per 500 words. Some claims sourced. Mix of specific and vague numbers. |
| 30-49 | Less than 1 statistic per 500 words. Few sourced claims. Predominantly vague quantifiers. |
| 0-29 | No statistics. No sourced claims. All quantifiers are vague ("many," "most," "a lot"). |
What counts as a statistic:
What does NOT count:
This measures whether the content provides information that AI systems cannot find elsewhere, making it a necessary citation source.
Scoring Criteria:
| Score | Criteria |
|---|---|
| 90-100 | Contains first-party research, proprietary data, original surveys, or unique datasets. Presents analysis or insights not found on any other page. Clear methodological descriptions. |
| 70-89 | Contains some original insights or unique analysis of existing data. Offers a distinct perspective with original examples. |
| 50-69 | Mostly synthesizes existing information but adds some unique commentary or examples. |
| 30-49 | Largely derivative content that restates common knowledge with minimal original contribution. |
| 0-29 | Entirely derivative. All information is available (often verbatim) on higher-authority sources. |
Signals of unique content:
For each content block, calculate:
Block Citability Score = (Answer * 0.30) + (SelfContain * 0.25) + (Structure * 0.20) + (Stats * 0.15) + (Unique * 0.10)
For each block scoring below 60, generate a specific rewrite suggestion:
Generate a file called GEO-CITABILITY-SCORE.md:
# AI Citability Analysis: [Page Title]
**URL:** [URL]
**Analysis Date:** [Date]
**Overall Citability Score: [X]/100**
**Citability Coverage:** [X]% of content blocks score above 70
---
## Score Summary
| Category | Score | Weight | Weighted |
|---|---|---|---|
| Answer Block Quality | [X]/100 | 30% | [X] |
| Passage Self-Containment | [X]/100 | 25% | [X] |
| Structural Readability | [X]/100 | 20% | [X] |
| Statistical Density | [X]/100 | 15% | [X] |
| Uniqueness & Original Data | [X]/100 | 10% | [X] |
| **Overall** | | | **[X]/100** |
---
## Strongest Content Blocks
### 1. "[Heading]" -- Score: [X]/100
> [First 2 sentences of the block]
**Why it works:** [Explanation]
### 2. "[Heading]" -- Score: [X]/100
> [First 2 sentences of the block]
**Why it works:** [Explanation]
---
## Weakest Content Blocks (Rewrite Priority)
### 1. "[Heading]" -- Score: [X]/100
**Current opening:**
> [First 2 sentences as they exist]
**Problem:** [Specific issue -- buried answer, no facts, etc.]
**Suggested rewrite:**
> [Rewritten opening 2-3 sentences with answer-first pattern and facts]
**Additional improvements:**
- [Add table comparing X, Y, Z]
- [Include statistic about ...]
- [Split long paragraph into 2-3 shorter ones]
---
## Quick Win Reformatting Recommendations
1. **[Specific recommendation]** -- Expected citability lift: +[X] points
2. **[Specific recommendation]** -- Expected citability lift: +[X] points
3. **[Specific recommendation]** -- Expected citability lift: +[X] points
4. **[Specific recommendation]** -- Expected citability lift: +[X] points
5. **[Specific recommendation]** -- Expected citability lift: +[X] points
---
## Per-Section Scores
| Section Heading | Words | Answer Quality | Self-Contained | Structure | Stats | Unique | Overall |
|---|---|---|---|---|---|---|---|
| [H2 heading] | [N] | [X] | [X] | [X] | [X] | [X] | [X] |
| AI System | Citation Preference |
|---|---|
| ChatGPT (Search) | Prefers passages with explicit definitions, named sources, and recent dates. Tends to cite 2-4 sources per response. |
| Perplexity | Heavily favors fact-dense passages with statistics. Cites 4-8 sources per response. Values recency highly. |
| Claude | Prefers well-structured, comprehensive passages. Values nuance and accuracy over brevity. |
| Gemini (AI Overviews) | Prefers concise answer blocks (40-60 words). Values content already ranking in top 10 organic results. |
| Copilot (Bing) | Similar to Gemini. Prefers passages from high-authority domains with clear factual claims. |
Weekly Installs
55
Repository
GitHub Stars
3.7K
First Seen
Feb 27, 2026
Security Audits
Gen Agent Trust HubPassSocketPassSnykWarn
Installed on
codex54
opencode54
kimi-cli52
gemini-cli52
amp52
cline52
专业SEO审计工具:全面网站诊断、技术SEO优化与页面分析指南
72,700 周安装