document-chat-interface by qodex-ai/ai-agent-skills
npx skills add https://github.com/qodex-ai/ai-agent-skills --skill document-chat-interface
Build intelligent chat interfaces that allow users to query and interact with documents using natural language, transforming static documents into interactive knowledge sources.
A document chat interface combines three capabilities:
Document Source
↓
Document Processor
├→ Extract text
├→ Process content
└→ Generate embeddings
↓
Vector Database
↓
Chat Interface ← User Question
├→ Retrieve relevant content
├→ Maintain conversation history
└→ Generate response
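The three stages above can be sketched end-to-end with no ML dependencies, substituting bag-of-words overlap for real embeddings (purely illustrative; a production system would use dense vectors and an LLM to generate the final response):

```python
from collections import Counter
from typing import List

def embed(text: str) -> Counter:
    # Stand-in for a real embedding model: a bag-of-words count vector.
    return Counter(text.lower().split())

def similarity(a: Counter, b: Counter) -> int:
    # Word-overlap count; a real system would use cosine similarity
    # on dense embedding vectors.
    return sum((a & b).values())

def answer(question: str, chunks: List[str]) -> str:
    # Retrieve the most relevant chunk; a real system would then pass
    # it to an LLM to generate the response.
    q = embed(question)
    return max(chunks, key=lambda c: similarity(q, embed(c)))
```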
See examples/document_processors.py for implementations:
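The referenced file isn't reproduced here; as a rough sketch, a document processor might look like the following (all names and defaults are illustrative, not taken from the repository):

```python
from dataclasses import dataclass
from typing import List

@dataclass
class Chunk:
    text: str
    source: str
    page: int

def process_document(text: str, source: str, chunk_size: int = 500) -> List[Chunk]:
    """Split raw document text into fixed-size chunks with provenance."""
    chunks = []
    for i in range(0, len(text), chunk_size):
        chunks.append(Chunk(
            text=text[i:i + chunk_size],
            source=source,
            page=i // chunk_size + 1,  # crude page proxy for plain text
        ))
    return chunks
```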
See examples/text_processor.py for implementations:
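The text processor file is likewise not shown; a common technique it might implement is sliding-window chunking with overlap, sketched here (the function name and defaults are assumptions):

```python
from typing import List

def split_with_overlap(text: str, chunk_size: int = 200, overlap: int = 50) -> List[str]:
    """Sliding-window splitter: consecutive chunks share `overlap` characters
    so retrieval doesn't lose context at chunk boundaries."""
    if overlap >= chunk_size:
        raise ValueError("overlap must be smaller than chunk_size")
    step = chunk_size - overlap
    return [text[i:i + chunk_size] for i in range(0, max(len(text) - overlap, 1), step)]
```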
See examples/conversation_manager.py for implementations:
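The conversation manager file is not shown either; one minimal approach is a rolling window over (question, answer) turns under a character budget (a sketch, not the repository's implementation):

```python
from typing import List, Tuple

class ConversationManager:
    """Keeps a rolling window of (question, answer) turns so the rendered
    history stays within a rough character budget."""

    def __init__(self, max_chars: int = 2000):
        self.max_chars = max_chars
        self.history: List[Tuple[str, str]] = []

    def add_turn(self, question: str, answer: str) -> None:
        self.history.append((question, answer))
        # Drop the oldest turns until the serialized history fits the budget.
        while len(self.render()) > self.max_chars and len(self.history) > 1:
            self.history.pop(0)

    def render(self) -> str:
        return "\n".join(f"User: {q}\nAssistant: {a}" for q, a in self.history)
```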
from typing import Dict, List

def format_response_with_citations(response: str, sources: List[Dict]) -> str:
    """Add source citations to the response."""
    formatted = response + "\n\n**Sources:**\n"
    for i, source in enumerate(sources, 1):
        formatted += f"[{i}] Page {source['page']} of {source['source']}\n"
        if 'excerpt' in source:
            formatted += f"    \"{source['excerpt'][:100]}...\"\n"
    return formatted
def generate_follow_up_questions(context: str, response: str) -> List[str]:
    """Suggest follow-up questions to the user."""
    prompt = f"""
    Based on this Q&A, generate 3 relevant follow-up questions:
    Context: {context[:500]}
    Response: {response[:500]}
    """
    # `llm` is assumed to be an LLM client defined elsewhere in the project.
    follow_ups = llm.generate(prompt)
    return follow_ups
def handle_query_failure(question: str, error: Exception) -> str:
    """Return a user-facing message for retrieval and context errors."""
    if isinstance(error, NoRelevantDocuments):
        return (
            "I couldn't find information about that in the documents. "
            "Try asking about different topics like: "
            + ", ".join(get_main_topics())
        )
    elif isinstance(error, ContextTooLarge):
        return (
            "The answer requires too much context. "
            "Can you be more specific about what you'd like to know?"
        )
    else:
        return f"I encountered an issue: {str(error)[:100]}"
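The handler above references `NoRelevantDocuments`, `ContextTooLarge`, and `get_main_topics` without defining them; one plausible minimal definition (hypothetical, not from the repository):

```python
from typing import List

class NoRelevantDocuments(Exception):
    """Raised when retrieval returns nothing above the similarity threshold."""

class ContextTooLarge(Exception):
    """Raised when the retrieved chunks exceed the model's context budget."""

def get_main_topics() -> List[str]:
    # Placeholder: in a real system these would come from the index,
    # e.g. cluster labels or document titles.
    return ["installation", "configuration", "API usage"]
```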
from langchain.document_loaders import PyPDFLoader
from langchain.text_splitter import CharacterTextSplitter
from langchain.embeddings import OpenAIEmbeddings
from langchain.vectorstores import Chroma
from langchain.chat_models import ChatOpenAI
from langchain.chains import ConversationalRetrievalChain

# Load document
loader = PyPDFLoader("document.pdf")
documents = loader.load()

# Split into chunks
splitter = CharacterTextSplitter(chunk_size=1000)
chunks = splitter.split_documents(documents)

# Create embeddings
embeddings = OpenAIEmbeddings()
vectorstore = Chroma.from_documents(chunks, embeddings)

# Create chat chain
llm = ChatOpenAI(model="gpt-4")
qa = ConversationalRetrievalChain.from_llm(
    llm=llm,
    retriever=vectorstore.as_retriever(),
    return_source_documents=True,
)

# Chat interface
chat_history = []
while True:
    question = input("You: ")
    result = qa({"question": question, "chat_history": chat_history})
    print(f"Assistant: {result['answer']}")
    chat_history.append((question, result['answer']))
from llama_index import VectorStoreIndex, SimpleDirectoryReader
from llama_index.llms import OpenAI
from llama_index.memory import ChatMemoryBuffer

# Load documents
documents = SimpleDirectoryReader("./docs").load_data()

# Create index
index = VectorStoreIndex.from_documents(documents)

# Create chat engine with memory
chat_engine = index.as_chat_engine(
    chat_mode="context",
    memory=ChatMemoryBuffer.from_defaults(token_limit=3900),
    llm=OpenAI(model="gpt-4"),
)

# Chat loop
while True:
    question = input("You: ")
    response = chat_engine.chat(question)
    print(f"Assistant: {response}")
from sentence_transformers import SentenceTransformer
import faiss
import numpy as np

# Load and embed documents (`load_documents` and `llm` are assumed to be
# defined elsewhere: a PDF-to-text-chunks loader and an LLM client).
model = SentenceTransformer('all-MiniLM-L6-v2')
documents = load_documents("document.pdf")
embeddings = model.encode(documents)

# Create FAISS index
dimension = embeddings.shape[1]
index = faiss.IndexFlatL2(dimension)
index.add(np.array(embeddings).astype('float32'))

# Chat function
def chat(question):
    # Embed the question
    q_embedding = model.encode(question)

    # Retrieve the k nearest chunks
    k = 5
    distances, indices = index.search(
        np.array([q_embedding]).astype('float32'), k
    )

    # Assemble the retrieved context
    context = " ".join([documents[i] for i in indices[0]])

    # Generate the response
    response = llm.generate(
        f"Context: {context}\nQuestion: {question}\nAnswer:"
    )
    return response
def compare_documents(question: str, documents: List[str]) -> str:
    """Ask the same question of several documents and synthesize the answers."""
    results = []
    for doc in documents:
        # query_document is assumed: runs the question against one document.
        response = query_document(doc, question)
        results.append({
            "document": doc,
            "answer": response,
        })

    # Compare and synthesize with the LLM
    comparison = llm.generate(
        f"Compare these answers: {results}"
    )
    return comparison
class DocumentExplorer:
    def __init__(self, documents):
        self.documents = documents

    def browse_by_topic(self, topic):
        """Find documents by topic"""
        pass

    def get_related_documents(self, doc_id):
        """Find similar documents"""
        pass

    def get_key_terms(self, document):
        """Extract key terms and concepts"""
        pass
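The stubs above leave the retrieval strategy open; here is a minimal keyword-overlap version of `browse_by_topic`, assuming documents are dicts with `id` and `text` keys (an assumption for illustration; a real implementation would rank by embedding similarity):

```python
from typing import Dict, List

def browse_by_topic(documents: List[Dict[str, str]], topic: str) -> List[str]:
    """Rank documents by how many topic words appear in their text."""
    topic_words = set(topic.lower().split())
    scored = []
    for doc in documents:
        words = set(doc["text"].lower().split())
        score = len(topic_words & words)
        if score > 0:
            scored.append((score, doc["id"]))
    scored.sort(reverse=True)
    return [doc_id for _, doc_id in scored]
```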
Weekly Installs
51
Repository
GitHub Stars
5
First Seen
Jan 22, 2026
Security Audits
Gen Agent Trust Hub: Pass · Socket: Pass · Snyk: Warn
Installed on
opencode: 34
codex: 34
gemini-cli: 33
github-copilot: 31
cursor: 31
claude-code: 30