ena-database by davila7/claude-code-templates
npx skills add https://github.com/davila7/claude-code-templates --skill ena-database欧洲核苷酸档案库(ENA)是一个全面的公共存储库,用于存储核苷酸序列数据及相关元数据。通过 REST API 和 FTP 访问和查询 DNA/RNA 序列、原始读数、基因组组装和功能注释,适用于基因组学和生物信息学流程。
此技能应在以下情况下使用:
ENA 将数据组织成层次化的对象类型:
研究/项目 - 分组相关数据并控制发布日期。研究是引用存档数据的主要单位。
样本 - 代表产生测序文库的生物材料单位。在提交大多数数据类型之前,必须注册样本。
原始读数 - 包括:
组装 - 处于不同完成水平的基因组、转录组、宏基因组或宏转录组组装。
序列 - 存储在 EMBL 核苷酸序列数据库中的已组装和注释序列,包括编码/非编码区域和功能注释。
分析 - 序列数据计算分析的结果。
分类学记录 - 分类学信息,包括谱系和等级。
ENA 提供多个 REST API 用于数据访问。请查阅 references/api_reference.md 以获取详细的端点文档。
广告位招租
在这里展示您的产品或服务
触达数万 AI 开发者,精准高效
主要 API:
ENA Portal API - 跨所有 ENA 数据类型的高级搜索功能
ENA Browser API - 直接检索记录和元数据
ENA Taxonomy REST API - 查询分类学信息
ENA Cross Reference Service - 从外部数据库访问相关记录
CRAM Reference Registry - 检索参考序列
速率限制:所有 API 的速率限制为每秒 50 个请求。超过此限制将返回 HTTP 429(请求过多)。
基于浏览器的搜索:
程序化查询:
API 查询模式示例:
import requests
# 搜索特定研究中的样本
base_url = "https://www.ebi.ac.uk/ena/portal/api/search"
params = {
"result": "sample",
"query": "study_accession=PRJEB1234",
"format": "json",
"limit": 100
}
response = requests.get(base_url, params=params)
samples = response.json()
元数据格式:
序列数据:
下载方法:
通过登录号检索原始测序读数:
# 使用 Browser API 下载运行文件
accession = "ERR123456"
url = f"https://www.ebi.ac.uk/ena/browser/api/xml/{accession}"
搜索研究中的所有样本:
# 使用 Portal API 列出样本
study_id = "PRJNA123456"
url = f"https://www.ebi.ac.uk/ena/portal/api/search?result=sample&query=study_accession={study_id}&format=tsv"
查找特定生物体的组装:
# 按分类学搜索组装
organism = "Escherichia coli"
url = f"https://www.ebi.ac.uk/ena/portal/api/search?result=assembly&query=tax_tree({organism})&format=json"
获取分类谱系:
# 查询分类学 API
taxon_id = "562" # E. coli
url = f"https://www.ebi.ac.uk/ena/taxonomy/rest/tax-id/{taxon_id}"
批量下载模式:
BLAST 集成:与 EBI 的 NCBI BLAST 服务(REST/SOAP API)集成,针对 ENA 序列进行序列相似性搜索。
速率限制:
数据引用:
API 响应处理:
性能:
此技能包含用于处理 ENA 的详细参考文档:
api_reference.md - 全面的 API 端点文档,包括:
在构建复杂的 API 查询、调试 API 响应或需要特定参数详细信息时,请加载此参考文档。
每周安装次数
119
代码仓库
GitHub 星标数
22.6K
首次出现
2026 年 1 月 21 日
安全审计
安装于
claude-code101
opencode93
gemini-cli88
cursor87
antigravity84
codex78
The European Nucleotide Archive (ENA) is a comprehensive public repository for nucleotide sequence data and associated metadata. Access and query DNA/RNA sequences, raw reads, genome assemblies, and functional annotations through REST APIs and FTP for genomics and bioinformatics pipelines.
This skill should be used when:
ENA organizes data into hierarchical object types:
Studies/Projects - Group related data and control release dates. Studies are the primary unit for citing archived data.
Samples - Represent units of biomaterial from which sequencing libraries were produced. Samples must be registered before submitting most data types.
Raw Reads - Consist of:
Assemblies - Genome, transcriptome, metagenome, or metatranscriptome assemblies at various completion levels.
Sequences - Assembled and annotated sequences stored in the EMBL Nucleotide Sequence Database, including coding/non-coding regions and functional annotations.
Analyses - Results from computational analyses of sequence data.
Taxonomy Records - Taxonomic information including lineage and rank.
ENA provides multiple REST APIs for data access. Consult references/api_reference.md for detailed endpoint documentation.
Key APIs:
ENA Portal API - Advanced search functionality across all ENA data types
ENA Browser API - Direct retrieval of records and metadata
ENA Taxonomy REST API - Query taxonomic information
ENA Cross Reference Service - Access related records from external databases
CRAM Reference Registry - Retrieve reference sequences
Rate Limiting : All APIs have a rate limit of 50 requests per second. Exceeding this returns HTTP 429 (Too Many Requests).
Browser-Based Search:
Programmatic Queries:
Example API Query Pattern:
import requests
# Search for samples from a specific study
base_url = "https://www.ebi.ac.uk/ena/portal/api/search"
params = {
"result": "sample",
"query": "study_accession=PRJEB1234",
"format": "json",
"limit": 100
}
response = requests.get(base_url, params=params)
samples = response.json()
Metadata Formats:
Sequence Data:
Download Methods:
Retrieve raw sequencing reads by accession:
# Download run files using Browser API
accession = "ERR123456"
url = f"https://www.ebi.ac.uk/ena/browser/api/xml/{accession}"
Search for all samples in a study:
# Use Portal API to list samples
study_id = "PRJNA123456"
url = f"https://www.ebi.ac.uk/ena/portal/api/search?result=sample&query=study_accession={study_id}&format=tsv"
Find assemblies for a specific organism:
# Search assemblies by taxonomy
organism = "Escherichia coli"
url = f"https://www.ebi.ac.uk/ena/portal/api/search?result=assembly&query=tax_tree({organism})&format=json"
Get taxonomic lineage:
# Query taxonomy API
taxon_id = "562" # E. coli
url = f"https://www.ebi.ac.uk/ena/taxonomy/rest/tax-id/{taxon_id}"
Bulk Download Pattern:
BLAST Integration: Integrate with EBI's NCBI BLAST service (REST/SOAP API) for sequence similarity searches against ENA sequences.
Rate Limiting:
Data Citation:
API Response Handling:
Performance:
This skill includes detailed reference documentation for working with ENA:
api_reference.md - Comprehensive API endpoint documentation including:
Load this reference when constructing complex API queries, debugging API responses, or needing specific parameter details.
Weekly Installs
119
Repository
GitHub Stars
22.6K
First Seen
Jan 21, 2026
Security Audits
Gen Agent Trust HubPassSocketPassSnykWarn
Installed on
claude-code101
opencode93
gemini-cli88
cursor87
antigravity84
codex78
Lark Skill Maker 教程:基于飞书CLI创建AI技能,自动化工作流与API调用指南
30,100 周安装
OpenClaw 环境安全审计员:一键扫描密钥泄露,审计沙箱配置,保障AI技能运行安全
135 周安装
OpenViking 记忆插件指南:AI助手长期记忆管理与自动上下文注入
135 周安装
ByteRover CLI - 上下文工程平台,为AI编码智能体自动管理项目知识库
135 周安装
Symfony API Platform序列化指南:合约设计、安全防护与渐进式披露
135 周安装
PostgreSQL只读查询技能 - 安全连接AI助手执行数据库查询,支持SSL加密与权限控制
135 周安装
Next.js服务端与客户端组件选择指南:TypeScript最佳实践与性能优化
135 周安装