# validate-data by anthropics/knowledge-work-plugins
```shell
npx skills add https://github.com/anthropics/knowledge-work-plugins --skill validate-data
```
If you see unfamiliar placeholders or need to check which tools are connected, see CONNECTORS.md.
Reviews an analysis for accuracy, methodology, and potential biases before it is shared with stakeholders, and generates a confidence assessment and improvement suggestions.
/validate-data <analysis to review>
The analysis can be:
Examine:
Work through the checklist below — data quality, calculation, reasonableness, and presentation checks.
Systematically review against the detailed pitfall catalog below (join explosion, survivorship bias, incomplete period comparison, denominator shifting, average of averages, timezone mismatches, selection bias).
Where possible, spot-check:
Apply the result sanity-checking techniques below (magnitude checks, cross-validation, red-flag detection).
If the analysis includes charts:
Review whether:
Provide specific, actionable suggestions:
Rate the analysis on a 3-level scale:
- **Ready to share** -- Analysis is methodologically sound, calculations verified, caveats noted. Minor suggestions for improvement but nothing blocking.
- **Share with noted caveats** -- Analysis is largely correct but has specific limitations or assumptions that must be communicated to stakeholders. List the required caveats.
- **Needs revision** -- Specific errors, methodological issues, or missing analyses were found and should be addressed before sharing. List the required changes in priority order.
## Validation Report
### Overall Assessment: [Ready to share | Share with noted caveats | Needs revision]
### Methodology Review
[Findings about approach, data selection, definitions]
### Issues Found
1. [Severity: High/Medium/Low] [Issue description and impact]
2. ...
### Calculation Spot-Checks
- [Metric]: [Verified / Discrepancy found]
- ...
### Visualization Review
[Any issues with charts or visual presentation]
### Suggested Improvements
1. [Improvement and why it matters]
2. ...
### Required Caveats for Stakeholders
- [Caveat that must be communicated]
- ...
Run through this checklist before sharing any analysis with stakeholders.
### Join Explosion
The problem: A many-to-many join silently multiplies rows, inflating counts and sums.
How to detect:
```sql
-- Check row count before and after join
SELECT COUNT(*) FROM table_a;  -- 1,000
SELECT COUNT(*) FROM table_a a JOIN table_b b ON a.id = b.a_id;  -- 3,500 (uh oh)
```
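The fan-out is easy to reproduce. Here is a minimal sketch using Python's built-in `sqlite3` module with hypothetical toy tables (names and row counts are illustrative only):

```python
import sqlite3

# In-memory database with a one-to-many relationship: each table_a row
# matches multiple table_b rows, so the join fans out.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE table_a (id INTEGER PRIMARY KEY);
    CREATE TABLE table_b (id INTEGER PRIMARY KEY, a_id INTEGER);
    INSERT INTO table_a (id) VALUES (1), (2), (3);
    INSERT INTO table_b (id, a_id) VALUES (1, 1), (2, 1), (3, 2), (4, 2), (5, 3);
""")

rows_before = conn.execute("SELECT COUNT(*) FROM table_a").fetchone()[0]
rows_after = conn.execute(
    "SELECT COUNT(*) FROM table_a a JOIN table_b b ON a.id = b.a_id"
).fetchone()[0]

# COUNT(DISTINCT a.id) recovers the true entity count despite the fan-out.
distinct_after = conn.execute(
    "SELECT COUNT(DISTINCT a.id) FROM table_a a JOIN table_b b ON a.id = b.a_id"
).fetchone()[0]
```

Comparing `rows_before` (3) to `rows_after` (5) is exactly the before/after check above; `distinct_after` shows the fix.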
How to prevent: Use COUNT(DISTINCT a.id) instead of COUNT(*) when counting entities through joins.

### Survivorship Bias
The problem: Analyzing only entities that exist today, ignoring those that were deleted, churned, or failed.
Examples: Measuring average customer lifetime using only current customers, or computing a success rate from only the projects that survived long enough to be logged.
How to prevent: Ask "who is NOT in this dataset?" before drawing conclusions.
### Incomplete Period Comparison
The problem: Comparing a partial period to a full period.
Examples: Comparing this month's revenue-to-date against last month's full-month revenue, or an in-progress quarter against a completed one.
How to prevent: Always filter to complete periods, or compare same-day-of-month / same-number-of-days.
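One way to enforce the "complete periods only" rule is a small filter applied before any aggregation. A minimal sketch (the function name and record shape are hypothetical):

```python
from datetime import date

def complete_months(records, today):
    """Keep only records from months that have fully elapsed as of `today`.

    `records` is a list of (date, value) pairs. The current month is
    dropped so a partial period is never compared against a full one.
    """
    cutoff = date(today.year, today.month, 1)  # first day of the current month
    return [(d, v) for d, v in records if d < cutoff]

# Hypothetical data: March is complete, April is partial as of April 10.
records = [
    (date(2024, 3, 15), 100),
    (date(2024, 3, 20), 50),
    (date(2024, 4, 5), 30),   # dropped: April is still in progress
]
kept = complete_months(records, today=date(2024, 4, 10))
```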
### Denominator Shifting
The problem: The denominator changes between periods, making rates incomparable.
Examples: A churn rate whose user-base definition changed mid-year, or a conversion rate computed over all visitors one month and over signed-in visitors the next.
How to prevent: Use consistent definitions across all compared periods. Note any definition changes.
### Average of Averages
The problem: Averaging pre-computed averages gives wrong results when group sizes differ.
Example: Averaging per-region average order values when one region has ten times the orders of another.
How to prevent: Always aggregate from raw data. Never average pre-aggregated averages.
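The error is easy to demonstrate with two hypothetical groups of different sizes:

```python
# Group A: 4 values averaging 10; Group B: 2 values averaging 100.
group_a = [10, 10, 10, 10]
group_b = [100, 100]

# Naive average of the two group averages -- ignores group sizes.
naive = (sum(group_a) / len(group_a) + sum(group_b) / len(group_b)) / 2

# Correct: aggregate from the raw data (equivalently, a size-weighted mean).
true_avg = sum(group_a + group_b) / (len(group_a) + len(group_b))
```

Here `naive` is 55.0 while `true_avg` is 40.0: the smaller group is overweighted whenever group sizes differ.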
### Timezone Mismatches
The problem: Different data sources use different timezones, causing misalignment.
Examples: An events table stored in UTC joined against a billing system in local time, so late-evening activity lands on different calendar days in each source.
How to prevent: Standardize all timestamps to a single timezone (UTC recommended) before analysis. Document the timezone used.
### Selection Bias
The problem: Segments are defined by the outcome you're measuring, creating circular logic.
Examples: Comparing "engaged" users (defined by high usage) against everyone else and concluding that engagement drives retention.
How to prevent: Define segments based on pre-treatment characteristics, not outcomes.
For any key number in your analysis, verify it passes the "smell test":
| Metric Type | Sanity Check |
|---|---|
| User counts | Does this match known MAU/DAU figures? |
| Revenue | Is this in the right order of magnitude vs. known ARR? |
| Conversion rates | Is this between 0% and 100%? Does it match dashboard figures? |
| Growth rates | Is 50%+ MoM growth realistic, or is there a data issue? |
| Averages | Is the average reasonable given what you know about the distribution? |
| Percentages | Do segment percentages sum to ~100%? |
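Several of these checks are mechanical enough to automate. A minimal sketch of a red-flag helper (the function name, metric keys, and thresholds are illustrative; tune them to your own baselines):

```python
def sanity_flags(metrics):
    """Return red-flag messages for a dict of common metrics.

    Only checks the keys that are present, so it can be applied to
    any subset of an analysis's headline numbers.
    """
    flags = []

    rate = metrics.get("conversion_rate_pct")
    if rate is not None and not (0 <= rate <= 100):
        flags.append("conversion rate outside 0-100%")

    growth = metrics.get("mom_growth_pct")
    if growth is not None and growth > 50:
        flags.append("MoM growth above 50% -- verify against raw data")

    segments = metrics.get("segment_pcts")
    if segments is not None and abs(sum(segments) - 100) > 2:
        flags.append("segment percentages do not sum to ~100%")

    return flags

# Hypothetical metrics that should trip all three checks.
flags = sanity_flags({
    "conversion_rate_pct": 140,
    "mom_growth_pct": 75,
    "segment_pcts": [40, 35, 10],
})
```

An analysis whose metrics return an empty list still needs human review; the helper only catches the mechanical failures.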
Every non-trivial analysis should include:
## Analysis: [Title]
### Question
[The specific question being answered]
### Data Sources
- Table: [schema.table_name] (as of [date])
- Table: [schema.other_table] (as of [date])
- File: [filename] (source: [where it came from])
### Definitions
- [Metric A]: [Exactly how it's calculated]
- [Segment X]: [Exactly how membership is determined]
- [Time period]: [Start date] to [end date], [timezone]
### Methodology
1. [Step 1 of the analysis approach]
2. [Step 2]
3. [Step 3]
### Assumptions and Limitations
- [Assumption 1 and why it's reasonable]
- [Limitation 1 and its potential impact on conclusions]
### Key Findings
1. [Finding 1 with supporting evidence]
2. [Finding 2 with supporting evidence]
### SQL Queries
[All queries used, with comments]
### Caveats
- [Things the reader should know before acting on this]
For any code (SQL, Python) that may be reused:
"""
Analysis: Monthly Cohort Retention
Author: [Name]
Date: [Date]
Data Source: events table, users table
Last Validated: [Date] -- results matched dashboard within 2%
Purpose:
Calculate monthly user retention cohorts based on first activity date.
Assumptions:
- "Active" means at least one event in the month
- Excludes test/internal accounts (user_type != 'internal')
- Uses UTC dates throughout
Output:
Cohort retention matrix with cohort_month rows and months_since_signup columns.
Values are retention rates (0-100%).
"""
/validate-data Review this quarterly revenue analysis before I send it to the exec team: [analysis]
/validate-data Check my churn analysis -- I'm comparing Q4 churn rates to Q3 but Q4 has a shorter measurement window
/validate-data Here's a SQL query and its results for our conversion funnel. Does the logic look right? [query + results]