PubChem 数据库 Python 使用指南：化学结构搜索、性质检索与生物活性数据分析

pubchem-database by davila7/claude-code-templates

190 周安装量

24,100 GitHub Stars

GitHub

安装命令

npx skills add https://github.com/davila7/claude-code-templates --skill pubchem-database

科研工具生物信息学数据处理

🇨🇳中文介绍

PubChem 数据库

概述

PubChem 是世界上最大的免费化学数据库，包含超过 1.1 亿种化合物和超过 2.7 亿条生物活性数据。可通过名称、CID 或 SMILES 查询化学结构，检索分子性质，执行相似性和子结构搜索，使用 PUG-REST API 和 PubChemPy 访问生物活性数据。

何时使用此技能

此技能应在以下情况下使用：

通过名称、结构（SMILES/InChI）或分子式搜索化合物
检索分子性质（分子量、LogP、TPSA、氢键描述符）
执行相似性搜索以查找结构相关的化合物
进行子结构搜索以寻找特定的化学基团
从筛选实验中访问生物活性数据
在化学标识符格式之间转换（CID、SMILES、InChI）
批量处理多个化合物进行类药性筛选或性质分析

核心功能

1. 化学结构搜索

使用多种标识符类型搜索化合物：

按化学名称搜索：

import pubchempy as pcp
compounds = pcp.get_compounds('aspirin', 'name')
compound = compounds[0]

按 CID（化合物 ID）搜索：

compound = pcp.Compound.from_cid(2244)  # 阿司匹林

按 SMILES 搜索：

compound = pcp.get_compounds('CC(=O)OC1=CC=CC=C1C(=O)O', 'smiles')[0]

按 InChI 搜索：

广告位招租

在这里展示您的产品或服务

触达数万 AI 开发者，精准高效

联系我们

8. 生物活性数据访问

从实验中检索生物活性数据：

import requests
import json

# 获取化合物的生物测定摘要
cid = 2244  # 阿司匹林
url = f"https://pubchem.ncbi.nlm.nih.gov/rest/pug/compound/cid/{cid}/assaysummary/JSON"

response = requests.get(url)
if response.status_code == 200:
    data = response.json()
    # 处理生物测定信息
    table = data.get('Table', {})
    rows = table.get('Row', [])
    print(f"Found {len(rows)} bioassay records")

对于更复杂的生物活性查询，使用 scripts/bioactivity_query.py 辅助脚本，它提供：

具有活性结果筛选的生物测定摘要
测定目标识别
按生物目标搜索化合物
特定测定的活性化合物列表

9. 综合化合物注释

通过 PUG-View 访问详细的化合物信息：

import requests

cid = 2244
url = f"https://pubchem.ncbi.nlm.nih.gov/rest/pug_view/data/compound/{cid}/JSON"

response = requests.get(url)
if response.status_code == 200:
    annotations = response.json()
    # 包含广泛的数据，包括：
    # - 化学和物理性质
    # - 药物和医药信息
    # - 药理学和生物化学
    # - 安全性和危害
    # - 毒性
    # - 文献引用
    # - 专利

获取特定部分：

# 仅获取药物信息
url = f"https://pubchem.ncbi.nlm.nih.gov/rest/pug_view/data/compound/{cid}/JSON?heading=Drug and Medication Information"

安装 PubChemPy 以进行基于 Python 的访问：

uv pip install pubchempy

对于直接 API 访问和生物活性查询：

uv pip install requests

数据分析可选：

uv pip install pandas

此技能包含用于常见 PubChem 任务的 Python 脚本：

scripts/compound_search.py

提供用于搜索和检索化合物信息的实用函数：

search_by_name(name, max_results=10)：按名称搜索化合物
search_by_smiles(smiles)：按 SMILES 字符串搜索
get_compound_by_cid(cid)：按 CID 检索化合物
get_compound_properties(identifier, namespace, properties)：获取特定性质
similarity_search(smiles, threshold, max_records)：执行相似性搜索
substructure_search(smiles, max_records)：执行子结构搜索
get_synonyms(identifier, namespace)：获取所有同义词
batch_search(identifiers, namespace, properties)：批量搜索多个化合物
download_structure(identifier, namespace, format, filename)：下载结构
print_compound_info(compound)：打印格式化的化合物信息

from scripts.compound_search import search_by_name, get_compound_properties

# 搜索化合物
compounds = search_by_name('ibuprofen')

# 获取特定性质
props = get_compound_properties('aspirin', 'name', ['MolecularWeight', 'XLogP'])

scripts/bioactivity_query.py

提供用于检索生物活性数据的函数：

get_bioassay_summary(cid)：获取化合物的生物测定摘要
get_compound_bioactivities(cid, activity_outcome)：获取筛选后的生物活性
get_assay_description(aid)：获取详细的测定信息
get_assay_targets(aid)：获取测定的生物目标
search_assays_by_target(target_name, max_results)：按目标查找测定
get_active_compounds_in_assay(aid, max_results)：获取活性化合物
get_compound_annotations(cid, section)：获取 PUG-View 注释
summarize_bioactivities(cid)：生成生物活性摘要统计
find_compounds_by_bioactivity(target, threshold, max_compounds)：按目标查找化合物

from scripts.bioactivity_query import get_bioassay_summary, summarize_bioactivities

# 获取生物活性摘要
summary = summarize_bioactivities(2244)  # 阿司匹林
print(f"Total assays: {summary['total_assays']}")
print(f"Active: {summary['active']}, Inactive: {summary['inactive']}")

API 速率限制和最佳实践

每秒最多 5 个请求
每分钟最多 400 个请求
每分钟最多 300 秒运行时间

对重复查询使用 CID：CID 比名称或结构更高效
本地缓存结果：存储频繁访问的数据
批量请求：尽可能合并多个查询
实现延迟：在请求之间添加 0.2-0.3 秒延迟
优雅地处理错误：检查 HTTP 错误和缺失数据
使用 PubChemPy：高级抽象处理许多边缘情况
利用异步模式：用于大型相似性/子结构搜索
指定 MaxRecords：限制结果以避免超时

from pubchempy import BadRequestError, NotFoundError, TimeoutError

try:
    compound = pcp.get_compounds('query', 'name')[0]
except NotFoundError:
    print("Compound not found")
except BadRequestError:
    print("Invalid request format")
except TimeoutError:
    print("Request timed out - try reducing scope")
except IndexError:
    print("No results returned")

工作流程 1：化学标识符转换管道

在不同化学标识符之间转换：

import pubchempy as pcp

# 从任何标识符类型开始
compound = pcp.get_compounds('caffeine', 'name')[0]

# 提取所有标识符格式
identifiers = {
    'CID': compound.cid,
    'Name': compound.iupac_name,
    'SMILES': compound.canonical_smiles,
    'InChI': compound.inchi,
    'InChIKey': compound.inchikey,
    'Formula': compound.molecular_formula
}

工作流程 2：类药性质筛选

使用 Lipinski 五规则筛选化合物：

import pubchempy as pcp

def check_drug_likeness(compound_name):
    compound = pcp.get_compounds(compound_name, 'name')[0]

    # Lipinski 五规则
    rules = {
        'MW <= 500': compound.molecular_weight <= 500,
        'LogP <= 5': compound.xlogp <= 5 if compound.xlogp else None,
        'HBD <= 5': compound.h_bond_donor_count <= 5,
        'HBA <= 10': compound.h_bond_acceptor_count <= 10
    }

    violations = sum(1 for v in rules.values() if v is False)
    return rules, violations

rules, violations = check_drug_likeness('aspirin')
print(f"Lipinski violations: {violations}")

工作流程 3：寻找相似药物候选物

识别与已知药物结构相似的化合物：

import pubchempy as pcp

# 从已知药物开始
reference_drug = pcp.get_compounds('imatinib', 'name')[0]
reference_smiles = reference_drug.canonical_smiles

# 查找相似化合物
similar = pcp.get_compounds(
    reference_smiles,
    'smiles',
    searchtype='similarity',
    Threshold=85,
    MaxRecords=20
)

# 按类药性质筛选
candidates = []
for comp in similar:
    if comp.molecular_weight and 200 <= comp.molecular_weight <= 600:
        if comp.xlogp and -1 <= comp.xlogp <= 5:
            candidates.append(comp)

print(f"Found {len(candidates)} drug-like candidates")

工作流程 4：批量化合物性质比较

比较多个化合物的性质：

import pubchempy as pcp
import pandas as pd

compound_list = ['aspirin', 'ibuprofen', 'naproxen', 'celecoxib']

properties_list = []
for name in compound_list:
    try:
        compound = pcp.get_compounds(name, 'name')[0]
        properties_list.append({
            'Name': name,
            'CID': compound.cid,
            'Formula': compound.molecular_formula,
            'MW': compound.molecular_weight,
            'LogP': compound.xlogp,
            'TPSA': compound.tpsa,
            'HBD': compound.h_bond_donor_count,
            'HBA': compound.h_bond_acceptor_count
        })
    except Exception as e:
        print(f"Error processing {name}: {e}")

df = pd.DataFrame(properties_list)
print(df.to_string(index=False))

工作流程 5：基于子结构的虚拟筛选

筛选包含特定药效团的化合物：

import pubchempy as pcp

# 定义药效团（例如，磺酰胺基团）
pharmacophore_smiles = 'S(=O)(=O)N'

# 搜索包含此子结构的化合物
hits = pcp.get_compounds(
    pharmacophore_smiles,
    'smiles',
    searchtype='substructure',
    MaxRecords=100
)

# 进一步按性质筛选
filtered_hits = [
    comp for comp in hits
    if comp.molecular_weight and comp.molecular_weight < 500
]

print(f"Found {len(filtered_hits)} compounds with desired substructure")

有关详细的 API 文档，包括完整的性质列表、URL 模式、高级查询选项和更多示例，请查阅 references/api_reference.md。此综合参考包括：

完整的 PUG-REST API 端点文档
可用分子性质的完整列表
异步请求处理模式
PubChemPy API 参考
用于注释的 PUG-View API
常见工作流程和用例
官方 PubChem 文档的链接

未找到化合物：

尝试替代名称或同义词
如果已知，使用 CID
检查拼写和化学名称格式

减少 MaxRecords 参数
在请求之间添加延迟
使用 CID 而不是名称以加快查询速度

并非所有性质都适用于所有化合物
在访问前检查性质是否存在：if compound.xlogp:
某些性质仅适用于特定类型的化合物

超出速率限制：

在请求之间实现延迟（0.2-0.3 秒）
尽可能使用批量操作
考虑在本地缓存结果

相似性/子结构搜索挂起：

这些是异步操作，可能需要 15-30 秒
PubChemPy 会自动处理轮询
如果超时，减少 MaxRecords

🇺🇸English

PubChem Database

Overview

PubChem is the world's largest freely available chemical database with 110M+ compounds and 270M+ bioactivities. Query chemical structures by name, CID, or SMILES, retrieve molecular properties, perform similarity and substructure searches, access bioactivity data using PUG-REST API and PubChemPy.

When to Use This Skill

This skill should be used when:

Searching for chemical compounds by name, structure (SMILES/InChI), or molecular formula
Retrieving molecular properties (MW, LogP, TPSA, hydrogen bonding descriptors)
Performing similarity searches to find structurally related compounds
Conducting substructure searches for specific chemical motifs
Accessing bioactivity data from screening assays
Converting between chemical identifier formats (CID, SMILES, InChI)
Batch processing multiple compounds for drug-likeness screening or property analysis

Core Capabilities

1. Chemical Structure Search

Search for compounds using multiple identifier types:

By Chemical Name :

import pubchempy as pcp
compounds = pcp.get_compounds('aspirin', 'name')
compound = compounds[0]

By CID (Compound ID) :

compound = pcp.Compound.from_cid(2244)  # Aspirin

By SMILES :

compound = pcp.get_compounds('CC(=O)OC1=CC=CC=C1C(=O)O', 'smiles')[0]

By InChI :

compound = pcp.get_compounds('InChI=1S/C9H8O4/...', 'inchi')[0]

By Molecular Formula :

compounds = pcp.get_compounds('C9H8O4', 'formula')
# Returns all compounds matching this formula

2. Property Retrieval

Retrieve molecular properties for compounds using either high-level or low-level approaches:

Using PubChemPy (Recommended) :

import pubchempy as pcp

# Get compound object with all properties
compound = pcp.get_compounds('caffeine', 'name')[0]

# Access individual properties
molecular_formula = compound.molecular_formula
molecular_weight = compound.molecular_weight
iupac_name = compound.iupac_name
smiles = compound.canonical_smiles
inchi = compound.inchi
xlogp = compound.xlogp  # Partition coefficient
tpsa = compound.tpsa    # Topological polar surface area

Get Specific Properties :

# Request only specific properties
properties = pcp.get_properties(
    ['MolecularFormula', 'MolecularWeight', 'CanonicalSMILES', 'XLogP'],
    'aspirin',
    'name'
)
# Returns list of dictionaries

Batch Property Retrieval :

import pandas as pd

compound_names = ['aspirin', 'ibuprofen', 'paracetamol']
all_properties = []

for name in compound_names:
    props = pcp.get_properties(
        ['MolecularFormula', 'MolecularWeight', 'XLogP'],
        name,
        'name'
    )
    all_properties.extend(props)

df = pd.DataFrame(all_properties)

Available Properties : MolecularFormula, MolecularWeight, CanonicalSMILES, IsomericSMILES, InChI, InChIKey, IUPACName, XLogP, TPSA, HBondDonorCount, HBondAcceptorCount, RotatableBondCount, Complexity, Charge, and many more (see references/api_reference.md for complete list).

3. Similarity Search

Find structurally similar compounds using Tanimoto similarity:

import pubchempy as pcp

# Start with a query compound
query_compound = pcp.get_compounds('gefitinib', 'name')[0]
query_smiles = query_compound.canonical_smiles

# Perform similarity search
similar_compounds = pcp.get_compounds(
    query_smiles,
    'smiles',
    searchtype='similarity',
    Threshold=85,  # Similarity threshold (0-100)
    MaxRecords=50
)

# Process results
for compound in similar_compounds[:10]:
    print(f"CID {compound.cid}: {compound.iupac_name}")
    print(f"  MW: {compound.molecular_weight}")

Note : Similarity searches are asynchronous for large queries and may take 15-30 seconds to complete. PubChemPy handles the asynchronous pattern automatically.

4. Substructure Search

Find compounds containing a specific structural motif:

import pubchempy as pcp

# Search for compounds containing pyridine ring
pyridine_smiles = 'c1ccncc1'

matches = pcp.get_compounds(
    pyridine_smiles,
    'smiles',
    searchtype='substructure',
    MaxRecords=100
)

print(f"Found {len(matches)} compounds containing pyridine")

Common Substructures :

Benzene ring: c1ccccc1
Pyridine: c1ccncc1
Phenol: c1ccc(O)cc1
Carboxylic acid: C(=O)O

5. Format Conversion

Convert between different chemical structure formats:

import pubchempy as pcp

compound = pcp.get_compounds('aspirin', 'name')[0]

# Convert to different formats
smiles = compound.canonical_smiles
inchi = compound.inchi
inchikey = compound.inchikey
cid = compound.cid

# Download structure files
pcp.download('SDF', 'aspirin', 'name', 'aspirin.sdf', overwrite=True)
pcp.download('JSON', '2244', 'cid', 'aspirin.json', overwrite=True)

6. Structure Visualization

Generate 2D structure images:

import pubchempy as pcp

# Download compound structure as PNG
pcp.download('PNG', 'caffeine', 'name', 'caffeine.png', overwrite=True)

# Using direct URL (via requests)
import requests

cid = 2244  # Aspirin
url = f"https://pubchem.ncbi.nlm.nih.gov/rest/pug/compound/cid/{cid}/PNG?image_size=large"
response = requests.get(url)

with open('structure.png', 'wb') as f:
    f.write(response.content)

7. Synonym Retrieval

Get all known names and synonyms for a compound:

import pubchempy as pcp

synonyms_data = pcp.get_synonyms('aspirin', 'name')

if synonyms_data:
    cid = synonyms_data[0]['CID']
    synonyms = synonyms_data[0]['Synonym']

    print(f"CID {cid} has {len(synonyms)} synonyms:")
    for syn in synonyms[:10]:  # First 10
        print(f"  - {syn}")

8. Bioactivity Data Access

Retrieve biological activity data from assays:

import requests
import json

# Get bioassay summary for a compound
cid = 2244  # Aspirin
url = f"https://pubchem.ncbi.nlm.nih.gov/rest/pug/compound/cid/{cid}/assaysummary/JSON"

response = requests.get(url)
if response.status_code == 200:
    data = response.json()
    # Process bioassay information
    table = data.get('Table', {})
    rows = table.get('Row', [])
    print(f"Found {len(rows)} bioassay records")

For more complex bioactivity queries , use the scripts/bioactivity_query.py helper script which provides:

Bioassay summaries with activity outcome filtering
Assay target identification
Search for compounds by biological target
Active compound lists for specific assays

9. Comprehensive Compound Annotations

Access detailed compound information through PUG-View:

import requests

cid = 2244
url = f"https://pubchem.ncbi.nlm.nih.gov/rest/pug_view/data/compound/{cid}/JSON"

response = requests.get(url)
if response.status_code == 200:
    annotations = response.json()
    # Contains extensive data including:
    # - Chemical and Physical Properties
    # - Drug and Medication Information
    # - Pharmacology and Biochemistry
    # - Safety and Hazards
    # - Toxicity
    # - Literature references
    # - Patents

Get Specific Section :

# Get only drug information
url = f"https://pubchem.ncbi.nlm.nih.gov/rest/pug_view/data/compound/{cid}/JSON?heading=Drug and Medication Information"

Installation Requirements

Install PubChemPy for Python-based access:

uv pip install pubchempy

For direct API access and bioactivity queries:

uv pip install requests

Optional for data analysis:

uv pip install pandas

Helper Scripts

This skill includes Python scripts for common PubChem tasks:

scripts/compound_search.py

Provides utility functions for searching and retrieving compound information:

Key Functions :

search_by_name(name, max_results=10): Search compounds by name
search_by_smiles(smiles): Search by SMILES string
get_compound_by_cid(cid): Retrieve compound by CID
get_compound_properties(identifier, namespace, properties): Get specific properties
similarity_search(smiles, threshold, max_records): Perform similarity search
substructure_search(smiles, max_records): Perform substructure search
get_synonyms(identifier, namespace): Get all synonyms
batch_search(identifiers, namespace, properties): Batch search multiple compounds

Usage :

from scripts.compound_search import search_by_name, get_compound_properties

# Search for a compound
compounds = search_by_name('ibuprofen')

# Get specific properties
props = get_compound_properties('aspirin', 'name', ['MolecularWeight', 'XLogP'])

scripts/bioactivity_query.py

Provides functions for retrieving biological activity data:

Key Functions :

get_bioassay_summary(cid): Get bioassay summary for compound
get_compound_bioactivities(cid, activity_outcome): Get filtered bioactivities
get_assay_description(aid): Get detailed assay information
get_assay_targets(aid): Get biological targets for assay
search_assays_by_target(target_name, max_results): Find assays by target
get_active_compounds_in_assay(aid, max_results): Get active compounds
get_compound_annotations(cid, section): Get PUG-View annotations
summarize_bioactivities(cid): Generate bioactivity summary statistics

Usage :

from scripts.bioactivity_query import get_bioassay_summary, summarize_bioactivities

# Get bioactivity summary
summary = summarize_bioactivities(2244)  # Aspirin
print(f"Total assays: {summary['total_assays']}")
print(f"Active: {summary['active']}, Inactive: {summary['inactive']}")

API Rate Limits and Best Practices

Rate Limits :

Maximum 5 requests per second
Maximum 400 requests per minute
Maximum 300 seconds running time per minute

Best Practices :

Use CIDs for repeated queries : CIDs are more efficient than names or structures
Cache results locally : Store frequently accessed data
Batch requests : Combine multiple queries when possible
Implement delays : Add 0.2-0.3 second delays between requests
Handle errors gracefully : Check for HTTP errors and missing data
Use PubChemPy : Higher-level abstraction handles many edge cases
Leverage asynchronous pattern : For large similarity/substructure searches
Specify MaxRecords : Limit results to avoid timeouts

Error Handling :

from pubchempy import BadRequestError, NotFoundError, TimeoutError

try:
    compound = pcp.get_compounds('query', 'name')[0]
except NotFoundError:
    print("Compound not found")
except BadRequestError:
    print("Invalid request format")
except TimeoutError:
    print("Request timed out - try reducing scope")
except IndexError:
    print("No results returned")

Common Workflows

Workflow 1: Chemical Identifier Conversion Pipeline

Convert between different chemical identifiers:

import pubchempy as pcp

# Start with any identifier type
compound = pcp.get_compounds('caffeine', 'name')[0]

# Extract all identifier formats
identifiers = {
    'CID': compound.cid,
    'Name': compound.iupac_name,
    'SMILES': compound.canonical_smiles,
    'InChI': compound.inchi,
    'InChIKey': compound.inchikey,
    'Formula': compound.molecular_formula
}

Workflow 2: Drug-Like Property Screening

Screen compounds using Lipinski's Rule of Five:

import pubchempy as pcp

def check_drug_likeness(compound_name):
    compound = pcp.get_compounds(compound_name, 'name')[0]

    # Lipinski's Rule of Five
    rules = {
        'MW <= 500': compound.molecular_weight <= 500,
        'LogP <= 5': compound.xlogp <= 5 if compound.xlogp else None,
        'HBD <= 5': compound.h_bond_donor_count <= 5,
        'HBA <= 10': compound.h_bond_acceptor_count <= 10
    }

    violations = sum(1 for v in rules.values() if v is False)
    return rules, violations

rules, violations = check_drug_likeness('aspirin')
print(f"Lipinski violations: {violations}")

Workflow 3: Finding Similar Drug Candidates

Identify structurally similar compounds to a known drug:

import pubchempy as pcp

# Start with known drug
reference_drug = pcp.get_compounds('imatinib', 'name')[0]
reference_smiles = reference_drug.canonical_smiles

# Find similar compounds
similar = pcp.get_compounds(
    reference_smiles,
    'smiles',
    searchtype='similarity',
    Threshold=85,
    MaxRecords=20
)

# Filter by drug-like properties
candidates = []
for comp in similar:
    if comp.molecular_weight and 200 <= comp.molecular_weight <= 600:
        if comp.xlogp and -1 <= comp.xlogp <= 5:
            candidates.append(comp)

print(f"Found {len(candidates)} drug-like candidates")

Workflow 4: Batch Compound Property Comparison

Compare properties across multiple compounds:

import pubchempy as pcp
import pandas as pd

compound_list = ['aspirin', 'ibuprofen', 'naproxen', 'celecoxib']

properties_list = []
for name in compound_list:
    try:
        compound = pcp.get_compounds(name, 'name')[0]
        properties_list.append({
            'Name': name,
            'CID': compound.cid,
            'Formula': compound.molecular_formula,
            'MW': compound.molecular_weight,
            'LogP': compound.xlogp,
            'TPSA': compound.tpsa,
            'HBD': compound.h_bond_donor_count,
            'HBA': compound.h_bond_acceptor_count
        })
    except Exception as e:
        print(f"Error processing {name}: {e}")

df = pd.DataFrame(properties_list)
print(df.to_string(index=False))

Workflow 5: Substructure-Based Virtual Screening

Screen for compounds containing specific pharmacophores:

import pubchempy as pcp

# Define pharmacophore (e.g., sulfonamide group)
pharmacophore_smiles = 'S(=O)(=O)N'

# Search for compounds containing this substructure
hits = pcp.get_compounds(
    pharmacophore_smiles,
    'smiles',
    searchtype='substructure',
    MaxRecords=100
)

# Further filter by properties
filtered_hits = [
    comp for comp in hits
    if comp.molecular_weight and comp.molecular_weight < 500
]

print(f"Found {len(filtered_hits)} compounds with desired substructure")

Reference Documentation

For detailed API documentation, including complete property lists, URL patterns, advanced query options, and more examples, consult references/api_reference.md. This comprehensive reference includes:

Complete PUG-REST API endpoint documentation
Full list of available molecular properties
Asynchronous request handling patterns
PubChemPy API reference
PUG-View API for annotations
Common workflows and use cases
Links to official PubChem documentation

Troubleshooting

Compound Not Found :

Try alternative names or synonyms
Use CID if known
Check spelling and chemical name format

Timeout Errors :

Reduce MaxRecords parameter
Add delays between requests
Use CIDs instead of names for faster queries

Empty Property Values :

Not all properties are available for all compounds
Check if property exists before accessing: if compound.xlogp:
Some properties only available for certain compound types

Rate Limit Exceeded :

Implement delays (0.2-0.3 seconds) between requests
Use batch operations where possible
Consider caching results locally

Similarity/Substructure Search Hangs :

These are asynchronous operations that may take 15-30 seconds
PubChemPy handles polling automatically
Reduce MaxRecords if timing out

Additional Resources

PubChem Home: https://pubchem.ncbi.nlm.nih.gov/
PUG-REST Documentation: https://pubchem.ncbi.nlm.nih.gov/docs/pug-rest
PUG-REST Tutorial: https://pubchem.ncbi.nlm.nih.gov/docs/pug-rest-tutorial
PubChemPy Documentation: https://pubchempy.readthedocs.io/
PubChemPy GitHub: https://github.com/mcs07/PubChemPy

Weekly Installs

131

Repository

davila7/claude-…emplates

GitHub Stars

22.6K

First Seen

Jan 21, 2026

Security Audits

Gen Agent Trust HubPass SocketPass SnykWarn

Installed on

claude-code110

opencode104

gemini-cli99

cursor95

codex89

antigravity87

智能OCR文字识别工具 - 支持100+语言，高精度提取图片/PDF/手写文本

1,000 周安装

download_structure(identifier, namespace, format, filename): Download structures

print_compound_info(compound): Print formatted compound information

find_compounds_by_bioactivity(target, threshold, max_compounds): Find compounds by target

PubChem 数据库 Python 使用指南：化学结构搜索、性质检索与生物活性数据分析

🇨🇳中文介绍

PubChem 数据库

概述

何时使用此技能

核心功能

1. 化学结构搜索

相关 Skills

2. 性质检索

3. 相似性搜索

4. 子结构搜索

5. 格式转换

6. 结构可视化

7. 同义词检索

8. 生物活性数据访问

9. 综合化合物注释

安装要求

辅助脚本

scripts/compound_search.py

scripts/bioactivity_query.py

API 速率限制和最佳实践

常见工作流程

工作流程 1：化学标识符转换管道

工作流程 2：类药性质筛选

工作流程 3：寻找相似药物候选物

工作流程 4：批量化合物性质比较

工作流程 5：基于子结构的虚拟筛选

参考文档

故障排除

其他资源

🇺🇸English

PubChem Database

Overview

When to Use This Skill

Core Capabilities

1. Chemical Structure Search

2. Property Retrieval

3. Similarity Search

4. Substructure Search

5. Format Conversion

6. Structure Visualization

7. Synonym Retrieval

8. Bioactivity Data Access

9. Comprehensive Compound Annotations

Installation Requirements

Helper Scripts

scripts/compound_search.py

scripts/bioactivity_query.py

API Rate Limits and Best Practices

Common Workflows

Workflow 1: Chemical Identifier Conversion Pipeline

Workflow 2: Drug-Like Property Screening

Workflow 3: Finding Similar Drug Candidates

Workflow 4: Batch Compound Property Comparison

Workflow 5: Substructure-Based Virtual Screening

Reference Documentation

Troubleshooting

Additional Resources

最新 Skills