geo-technical by zubair-trabzada/geo-seo-claude

Install: npx skills add https://github.com/zubair-trabzada/geo-seo-claude --skill geo-technical
Technical SEO forms the foundation of both traditional search visibility and AI search citation. A technically broken site cannot be crawled, indexed, or cited by any platform. This skill audits 8 categories of technical health with specific attention to GEO requirements — most critically, server-side rendering (AI crawlers do not execute JavaScript) and AI crawler access (many sites inadvertently block AI crawlers in robots.txt).
Fetch https://[domain]/robots.txt and review:
- User-agent, Allow, Disallow directives
- Sitemap: https://[domain]/sitemap.xml reference

Check robots.txt for directives targeting these AI crawlers:
| Crawler | User-Agent | Platform |
|---|---|---|
| GPTBot | GPTBot | ChatGPT / OpenAI |
| Google-Extended | Google-Extended | Gemini / Google AI training |
| Googlebot | Googlebot | Google Search + AI Overviews |
| Bingbot | bingbot | Bing Copilot + ChatGPT (via Bing) |
| PerplexityBot | PerplexityBot | Perplexity AI |
| ClaudeBot | ClaudeBot | Anthropic Claude |
| Amazonbot | Amazonbot | Alexa / Amazon AI |
| CCBot | CCBot | Common Crawl (used by many AI models) |
| FacebookBot | FacebookExternalHit | Meta AI |
| Bytespider | Bytespider | TikTok / ByteDance AI |
| Applebot-Extended | Applebot-Extended | Apple Intelligence |
Scoring for AI crawler access:
Important nuance: Blocking Google-Extended does NOT block Googlebot. Google-Extended only controls AI training data usage, not search indexing. However, blocking Google-Extended may reduce presence in AI Overviews. Recommend allowing Google-Extended unless there is a specific data licensing concern.
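The crawler table above can be audited mechanically. Below is a minimal sketch using Python's stdlib robots.txt parser; the sample robots.txt and the helper function are illustrative, not part of the skill:

```python
from urllib.robotparser import RobotFileParser

# AI crawler user-agents from the table above
AI_CRAWLERS = ["GPTBot", "Google-Extended", "Googlebot", "bingbot",
               "PerplexityBot", "ClaudeBot", "Amazonbot", "CCBot",
               "FacebookExternalHit", "Bytespider", "Applebot-Extended"]

def check_ai_access(robots_txt: str, path: str = "/") -> dict:
    """Return {user_agent: allowed?} for each AI crawler."""
    rp = RobotFileParser()
    rp.parse(robots_txt.splitlines())
    return {ua: rp.can_fetch(ua, path) for ua in AI_CRAWLERS}

# Example: a robots.txt that blocks GPTBot but allows everyone else
sample = """User-agent: GPTBot
Disallow: /

User-agent: *
Allow: /
"""
access = check_ai_access(sample)
```

Here `access["GPTBot"]` comes back `False` while the crawlers that fall through to the `*` group come back `True`, which is exactly the Allowed/Blocked column the audit report asks for.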
Check:
- XML sitemap present (/sitemap.xml, /sitemap_index.xml)
- <lastmod> dates (should be present and accurate)
- No <meta name="robots" content="noindex"> on pages that SHOULD be indexed
- No X-Robots-Tag: noindex HTTP headers

Category Scoring:
| Check | Points |
|---|---|
| robots.txt valid and complete | 3 |
| AI crawlers allowed | 5 |
| XML sitemap present and valid | 3 |
| Crawl depth within 3 clicks | 2 |
| No erroneous noindex directives | 2 |
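The sitemap and lastmod checks can be sketched with the stdlib XML parser. The sample sitemap below is hypothetical:

```python
import xml.etree.ElementTree as ET

NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}

def sitemap_entries(xml_text: str):
    """Return (loc, lastmod) pairs; lastmod is None when the tag is missing."""
    root = ET.fromstring(xml_text)
    return [(u.findtext("sm:loc", namespaces=NS),
             u.findtext("sm:lastmod", namespaces=NS))
            for u in root.findall("sm:url", NS)]

sample = """<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url><loc>https://example.com/</loc><lastmod>2024-05-01</lastmod></url>
  <url><loc>https://example.com/blog</loc></url>
</urlset>"""

entries = sitemap_entries(sample)
# URLs with no <lastmod> fail the "present and accurate" check
missing = [loc for loc, lastmod in entries if lastmod is None]
```

Real sitemaps may be index files pointing at child sitemaps; a full audit would recurse into those as well.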
Check:
- <link rel="canonical" href="..."> tag on every page
- URL parameters creating duplicate pages (e.g. ?sort=price)
- Pagination: rel="next" / rel="prev" (note: Google ignores these as of 2019, but Bing still uses them)
- rel="canonical" on paginated pages pointing to a view-all page or the first page
- International sites: <link rel="alternate" hreflang="xx"> tags
- Index bloat (compare indexed pages against a site:domain.com estimate)

Category Scoring:
| Check | Points |
|---|---|
| Canonical tags correct on all pages | 3 |
| No duplicate content issues | 3 |
| Pagination handled correctly | 2 |
| Hreflang correct (if applicable) | 2 |
| No index bloat | 2 |
Check HTTP response headers for:
| Header | Required Value | Purpose |
|---|---|---|
| Strict-Transport-Security | max-age=31536000; includeSubDomains | Forces HTTPS |
| Content-Security-Policy | Appropriate policy | Prevents XSS |
| X-Content-Type-Options | nosniff | Prevents MIME sniffing |
| X-Frame-Options | DENY or SAMEORIGIN | Prevents clickjacking |
| Referrer-Policy | strict-origin-when-cross-origin or stricter | Controls referrer data |
| Permissions-Policy | Appropriate restrictions | Controls browser features |
Category Scoring:
| Check | Points |
|---|---|
| HTTPS enforced with valid cert | 4 |
| HSTS header present | 2 |
| X-Content-Type-Options | 1 |
| X-Frame-Options | 1 |
| Referrer-Policy | 1 |
| Content-Security-Policy | 1 |
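The header checks map directly onto the scoring table. A minimal sketch; the example header dict is made up, and HTTPS enforcement (4 points) is scored separately from the headers themselves:

```python
# Points per security header, matching the category scoring table
REQUIRED = {
    "strict-transport-security": 2,
    "x-content-type-options": 1,
    "x-frame-options": 1,
    "referrer-policy": 1,
    "content-security-policy": 1,
}

def score_security_headers(headers: dict) -> int:
    """Sum the points for each required header present (case-insensitive)."""
    present = {k.lower() for k in headers}
    return sum(pts for name, pts in REQUIRED.items() if name in present)

# Hypothetical response headers from a site that sets only HSTS and nosniff
example = {
    "Strict-Transport-Security": "max-age=31536000; includeSubDomains",
    "X-Content-Type-Options": "nosniff",
}
```

A fuller check would also validate the header values (e.g. that HSTS max-age is at least a year), not just their presence.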
Check:
- Clean, readable URLs: /blog/seo-guide, not /blog?id=12345
- Logical hierarchy: /category/subcategory/page
- robots.txt Disallow for parameter variations

Category Scoring:
| Check | Points |
|---|---|
| Clean, readable URLs | 2 |
| Logical hierarchy | 2 |
| No redirect chains (max 1 hop) | 2 |
| Parameter handling configured | 2 |
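These URL checks are simple string heuristics. A sketch, with made-up rule names; real audits would also follow redirects to count hops:

```python
from urllib.parse import urlparse, parse_qs

def url_issues(url: str) -> list[str]:
    """Flag URL-structure problems from the checklist above (illustrative heuristics)."""
    issues = []
    parts = urlparse(url)
    if parse_qs(parts.query):
        issues.append("query parameters")
    segments = [s for s in parts.path.split("/") if s]
    if len(segments) > 3:
        issues.append("hierarchy deeper than 3 levels")
    if any(s != s.lower() or "_" in s for s in segments):
        issues.append("uppercase or underscores")
    return issues
```

For example, `/blog/seo-guide` passes cleanly while `/blog?id=12345` is flagged for query parameters.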
As of July 2024, Google crawls ALL sites exclusively with mobile Googlebot. There is no desktop crawling. If your site does not work on mobile, it does not work for Google. Period.
Check for <meta name="viewport" content="width=device-width, initial-scale=1">

Category Scoring:
| Check | Points |
|---|---|
| Viewport meta tag correct | 3 |
| Responsive layout (no horizontal scroll) | 3 |
| Tap targets appropriately sized | 2 |
| Font sizes legible | 2 |
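The viewport check is the one item here that is trivially machine-checkable from raw HTML. A regex sketch (a real parser would be more robust against unusual attribute ordering or quoting):

```python
import re

def has_valid_viewport(html: str) -> bool:
    """Check raw HTML for a responsive viewport meta tag."""
    m = re.search(r'<meta[^>]+name=["\']viewport["\'][^>]*>', html, re.I)
    # The tag must exist AND declare a device-width layout
    return bool(m) and "width=device-width" in m.group(0)
```

Tap-target size, horizontal scroll, and font legibility need a rendered page, so they are scored from a browser-based check instead.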
Core Web Vitals use the 75th percentile of real user data (field data) as the benchmark. Lab data is useful for debugging but field data determines the ranking signal.
| Metric | Good | Needs Improvement | Poor | Notes |
|---|---|---|---|---|
| LCP (Largest Contentful Paint) | < 2.5s | 2.5s - 4.0s | > 4.0s | Measures loading — time until largest visible element renders |
| INP (Interaction to Next Paint) | < 200ms | 200ms - 500ms | > 500ms | Replaced FID in March 2024. Measures ALL interactions, not just first |
| CLS (Cumulative Layout Shift) | < 0.1 | 0.1 - 0.25 | > 0.25 | Measures visual stability — unexpected layout movements |
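The thresholds in the table translate into a small classifier. A sketch; values are passed in each metric's own unit (seconds for LCP, milliseconds for INP, unitless for CLS), matching the table:

```python
# (good, poor) boundaries from the Core Web Vitals table above
THRESHOLDS = {
    "LCP": (2.5, 4.0),   # seconds
    "INP": (200, 500),   # milliseconds
    "CLS": (0.1, 0.25),  # unitless
}

def rate(metric: str, value: float) -> str:
    """Classify a 75th-percentile field value as good / needs improvement / poor."""
    good, poor = THRESHOLDS[metric]
    if value < good:
        return "good"
    if value <= poor:
        return "needs improvement"
    return "poor"
```

For instance an LCP of 1.8s rates "good", an INP of 350ms rates "needs improvement", and a CLS of 0.3 rates "poor".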
When real user data is unavailable, estimate from page characteristics:
Optimization levers:
- Preload the LCP resource with <link rel="preload">
- Break long tasks (>50ms) into smaller chunks with requestIdleCallback or scheduler.yield()
- Use content-visibility: auto for off-screen content
- Set width and height attributes on images and videos
- Reserve space for ads and embeds with aspect-ratio or explicit dimensions
- Use font-display: swap with size-adjusted fallback fonts

Category Scoring:
| Check | Points |
|---|---|
| LCP < 2.5s | 5 |
| INP < 200ms | 5 |
| CLS < 0.1 | 5 |
AI crawlers (GPTBot, PerplexityBot, ClaudeBot, etc.) do NOT execute JavaScript. They fetch the raw HTML and parse it. If your content is rendered client-side by React, Vue, Angular, or any other JavaScript framework, AI crawlers see an empty page.
Even Googlebot, which does execute JavaScript, deprioritizes JS-rendered content due to the additional crawl budget required. Google processes JS rendering in a separate "rendering queue" that can delay indexing by days or weeks.
Fetch the raw HTML with curl -s [URL] and check whether the main content is present without JavaScript execution.

| Framework | SSR Solution |
|---|---|
| React | Next.js (SSR/SSG), Remix, Gatsby (SSG) |
| Vue | Nuxt.js (SSR/SSG) |
| Angular | Angular Universal |
| Svelte | SvelteKit |
| Generic | Prerender.io (prerendering service), Rendertron |
Category Scoring:
| Check | Points |
|---|---|
| Main content in raw HTML | 8 |
| Meta tags + structured data in raw HTML | 4 |
| Internal links in raw HTML | 3 |
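The "main content in raw HTML" check can be approximated by looking for known page content in the un-executed HTML, i.e. what a non-JS AI crawler sees. A sketch; the two sample pages are made up:

```python
import re

def visible_in_raw_html(html: str, phrases: list[str]) -> dict:
    """Check whether each phrase appears in the raw HTML (script bodies excluded)."""
    # Drop <script> bodies so serialized JSON payloads don't count as rendered content
    stripped = re.sub(r"<script.*?</script>", "", html, flags=re.S | re.I)
    return {p: p in stripped for p in phrases}

# Client-side rendered: the content only exists after JS runs
csr_page = '<div id="root"></div><script>render("Pricing Guide")</script>'
# Server-side rendered: the content is in the HTML itself
ssr_page = '<main><h1>Pricing Guide</h1></main>'
```

An AI crawler would find "Pricing Guide" only on the SSR page; on the CSR page it sees an empty root div, which is exactly the failure mode this category penalizes.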
Checks:
- Measure TTFB: curl -o /dev/null -s -w 'TTFB: %{time_starttransfer}s\n' [URL]
- Lazy-load below-the-fold images with loading="lazy"
- Load scripts with async or defer; minimize render-blocking resources in <head>
- Cache-Control headers on static resources (images, CSS, JS): max-age=31536000 (1 year) with content-hashed filenames for immutable assets, no-cache with validation (ETag or Last-Modified) for HTML
- Detect a CDN via response headers: CF-Ray (Cloudflare), X-Cache (AWS CloudFront), X-Served-By (Fastly)

Category Scoring:
| Check | Points |
|---|---|
| TTFB < 800ms | 3 |
| Page weight < 2MB | 2 |
| Images optimized (format, size, lazy) | 3 |
| JS bundles reasonable (< 200KB compressed) | 2 |
| Compression enabled (gzip/brotli) | 2 |
| Cache headers on static resources | 2 |
| CDN in use | 1 |
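The caching policy above reduces to a simple rule: long-lived immutable caching for content-hashed static assets, revalidation for HTML. A sketch with an illustrative extension list:

```python
def cache_control_for(path: str) -> str:
    """Pick a Cache-Control policy per the static-resource caching checklist."""
    # Content-hashed static assets are safe to cache for a year
    if path.endswith((".js", ".css", ".woff2", ".png", ".webp", ".avif")):
        return "public, max-age=31536000, immutable"
    # HTML: always revalidate via ETag / Last-Modified
    return "no-cache"
```

The `immutable` hint assumes filenames change when content changes (e.g. app.abc123.js); without content hashing, a year-long max-age would serve stale assets.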
IndexNow is an open protocol that allows websites to notify search engines instantly when content is created, updated, or deleted. Supported by Bing, Yandex, Seznam, and Naver. Google does NOT support IndexNow but monitors the protocol.
ChatGPT uses Bing's index. Bing Copilot uses Bing's index. Faster Bing indexing means faster AI visibility on two major platforms.
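An IndexNow ping is a small JSON POST. A hedged sketch of building (not sending) one with the stdlib; the host, key, and URLs are placeholders, and the shared api.indexnow.org endpoint is assumed per the public protocol:

```python
import json
import urllib.request

def indexnow_ping(host: str, key: str, urls: list[str],
                  endpoint: str = "https://api.indexnow.org/indexnow") -> urllib.request.Request:
    """Build the POST request that notifies IndexNow-enabled engines of changed URLs."""
    body = json.dumps({
        "host": host,
        "key": key,  # must match the key file hosted on the domain
        "urlList": urls,
    }).encode()
    return urllib.request.Request(
        endpoint, data=body,
        headers={"Content-Type": "application/json; charset=utf-8"},
        method="POST")

req = indexnow_ping("example.com", "your-indexnow-key",
                    ["https://example.com/blog/new-post"])
# urllib.request.urlopen(req) would actually send the ping (not executed here)
```

One ping reaches all participating engines; they share submitted URLs with each other.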
Verify the key file at https://[domain]/.well-known/indexnow-key.txt or similar.

| Category | Max Points | Weight |
|---|---|---|
| Crawlability | 15 | Core foundation |
| Indexability | 12 | Core foundation |
| Security | 10 | Trust signal |
| URL Structure | 8 | Crawl efficiency |
| Mobile Optimization | 10 | Google requirement |
| Core Web Vitals | 15 | Ranking signal |
| Server-Side Rendering | 15 | GEO critical |
| Page Speed & Server | 15 | Performance |
| Total | 100 | |
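The per-category Pass/Warn/Fail status used in the report follows fixed percentage bands. A one-function sketch:

```python
def category_status(score: int, max_points: int) -> str:
    """Pass = 80%+ of category points, Warn = 50-79%, Fail = <50%."""
    pct = 100 * score / max_points
    if pct >= 80:
        return "Pass"
    if pct >= 50:
        return "Warn"
    return "Fail"
```

So a Crawlability score of 12/15 is a Pass, 8/15 is a Warn, and 7/15 is a Fail.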
Generate GEO-TECHNICAL-AUDIT.md with:
# GEO Technical SEO Audit — [Domain]
Date: [Date]
## Technical Score: XX/100
## Score Breakdown
| Category | Score | Status |
|---|---|---|
| Crawlability | XX/15 | Pass/Warn/Fail |
| Indexability | XX/12 | Pass/Warn/Fail |
| Security | XX/10 | Pass/Warn/Fail |
| URL Structure | XX/8 | Pass/Warn/Fail |
| Mobile Optimization | XX/10 | Pass/Warn/Fail |
| Core Web Vitals | XX/15 | Pass/Warn/Fail |
| Server-Side Rendering | XX/15 | Pass/Warn/Fail |
| Page Speed & Server | XX/15 | Pass/Warn/Fail |
Status: Pass = 80%+ of category points, Warn = 50-79%, Fail = <50%
## AI Crawler Access
| Crawler | User-Agent | Status | Recommendation |
|---|---|---|---|
| GPTBot | GPTBot | Allowed/Blocked | [Action] |
| Googlebot | Googlebot | Allowed/Blocked | [Action] |
[Continue for all AI crawlers]
## Critical Issues (fix immediately)
[List with specific page URLs and what is wrong]
## Warnings (fix this month)
[List with details]
## Recommendations (optimize this quarter)
[List with details]
## Detailed Findings
[Per-category breakdown with evidence]
Weekly Installs: 56
Repository: zubair-trabzada/geo-seo-claude
GitHub Stars: 3.8K
First Seen: Feb 27, 2026
Security Audits: Gen Agent Trust Hub (Pass), Socket (Pass), Snyk (Warn)
Installed on: codex (55), opencode (55), kimi-cli (53), gemini-cli (53), amp (53), cline (53)