alicloud-ai-search-dashvector by cinience/alicloud-skills
npx skills add https://github.com/cinience/alicloud-skills --skill alicloud-ai-search-dashvectorCategory: provider
使用 DashVector 管理集合并执行带可选过滤器和稀疏向量的向量相似性搜索。
安装 SDK(建议在虚拟环境中安装以避免 PEP 668 限制):
python3 -m venv .venv . .venv/bin/activate python -m pip install dashvector
通过环境变量提供凭证和端点:
DASHVECTOR_API_KEYDASHVECTOR_ENDPOINT(集群端点)name (str)dimension (int)metric (str: cosine | | )广告位招租
在这里展示您的产品或服务
触达数万 AI 开发者,精准高效
dotproducteuclideanfields_schema (可选的字段类型字典)docs 列表,包含 {id, vector, fields} 或元组sparse_vector 和多向量集合vector 或 id(必须提供其一;如果都为空,则仅应用过滤器)topk (int)filter (类 SQL 的 where 子句)output_fields (字段名列表)include_vector (bool)import os
import dashvector
from dashvector import Doc
client = dashvector.Client(
api_key=os.getenv("DASHVECTOR_API_KEY"),
endpoint=os.getenv("DASHVECTOR_ENDPOINT"),
)
# 1) 创建一个集合
ret = client.create(
name="docs",
dimension=768,
metric="cosine",
fields_schema={"title": str, "source": str, "chunk": int},
)
assert ret
# 2) 更新/插入文档
collection = client.get(name="docs")
ret = collection.upsert(
[
Doc(id="1", vector=[0.01] * 768, fields={"title": "Intro", "source": "kb", "chunk": 0}),
Doc(id="2", vector=[0.02] * 768, fields={"title": "FAQ", "source": "kb", "chunk": 1}),
]
)
assert ret
# 3) 查询
ret = collection.query(
vector=[0.01] * 768,
topk=5,
filter="source = 'kb' AND chunk >= 0",
output_fields=["title", "source", "chunk"],
include_vector=False,
)
for doc in ret:
print(doc.id, doc.fields)
python skills/ai/search/alicloud-ai-search-dashvector/scripts/quickstart.py
环境变量:
DASHVECTOR_API_KEYDASHVECTOR_ENDPOINTDASHVECTOR_COLLECTION (可选)DASHVECTOR_DIMENSION (可选)可选参数:--collection, --dimension, --topk, --filter。
upsert 以实现幂等的数据摄取。dimension 与您的嵌入模型输出大小一致。sparse_vector={token_id: weight, ...}。DASHVECTOR_API_KEYmkdir -p output/alicloud-ai-search-dashvector
for f in skills/ai/search/alicloud-ai-search-dashvector/scripts/*.py; do
python3 -m py_compile "$f"
done
echo "py_compile_ok" > output/alicloud-ai-search-dashvector/validate.txt
通过标准:命令以 0 退出且生成 output/alicloud-ai-search-dashvector/validate.txt 文件。
output/alicloud-ai-search-dashvector/ 目录下。DashVector Python SDK: Client.create, Collection.upsert, Collection.query
来源列表: references/sources.md
每周安装量
244
代码仓库
GitHub 星标数
363
首次出现
Feb 26, 2026
安全审计
安装于
gemini-cli241
github-copilot241
codex241
kimi-cli241
amp241
cursor241
Category: provider
Use DashVector to manage collections and perform vector similarity search with optional filters and sparse vectors.
Install SDK (recommended in a venv to avoid PEP 668 limits):
python3 -m venv .venv . .venv/bin/activate python -m pip install dashvector
Provide credentials and endpoint via environment variables:
DASHVECTOR_API_KEYDASHVECTOR_ENDPOINT (cluster endpoint)name (str)dimension (int)metric (str: cosine | dotproduct | euclidean)fields_schema (optional dict of field types)docs list of {id, vector, fields} or tuplessparse_vector and multi-vector collectionsvector or id (one required; if both empty, only filter is applied)topk (int)filter (SQL-like where clause)output_fields (list of field names)include_vector (bool)import os
import dashvector
from dashvector import Doc
client = dashvector.Client(
api_key=os.getenv("DASHVECTOR_API_KEY"),
endpoint=os.getenv("DASHVECTOR_ENDPOINT"),
)
# 1) Create a collection
ret = client.create(
name="docs",
dimension=768,
metric="cosine",
fields_schema={"title": str, "source": str, "chunk": int},
)
assert ret
# 2) Upsert docs
collection = client.get(name="docs")
ret = collection.upsert(
[
Doc(id="1", vector=[0.01] * 768, fields={"title": "Intro", "source": "kb", "chunk": 0}),
Doc(id="2", vector=[0.02] * 768, fields={"title": "FAQ", "source": "kb", "chunk": 1}),
]
)
assert ret
# 3) Query
ret = collection.query(
vector=[0.01] * 768,
topk=5,
filter="source = 'kb' AND chunk >= 0",
output_fields=["title", "source", "chunk"],
include_vector=False,
)
for doc in ret:
print(doc.id, doc.fields)
python skills/ai/search/alicloud-ai-search-dashvector/scripts/quickstart.py
Environment variables:
DASHVECTOR_API_KEYDASHVECTOR_ENDPOINTDASHVECTOR_COLLECTION (optional)DASHVECTOR_DIMENSION (optional)Optional args: --collection, --dimension, --topk, --filter.
upsert for idempotent ingestion.dimension aligned to your embedding model output size.sparse_vector={token_id: weight, ...} when upserting/querying.DASHVECTOR_API_KEYmkdir -p output/alicloud-ai-search-dashvector
for f in skills/ai/search/alicloud-ai-search-dashvector/scripts/*.py; do
python3 -m py_compile "$f"
done
echo "py_compile_ok" > output/alicloud-ai-search-dashvector/validate.txt
Pass criteria: command exits 0 and output/alicloud-ai-search-dashvector/validate.txt is generated.
output/alicloud-ai-search-dashvector/.DashVector Python SDK: Client.create, Collection.upsert, Collection.query
Source list: references/sources.md
Weekly Installs
244
Repository
GitHub Stars
363
First Seen
Feb 26, 2026
Security Audits
Gen Agent Trust HubPassSocketPassSnykPass
Installed on
gemini-cli241
github-copilot241
codex241
kimi-cli241
amp241
cursor241
Azure 配额管理指南:服务限制、容量验证与配额增加方法
79,700 周安装