LLVM IR 完整指南：生成、优化、降级与调试 LLVM 工具链

llvm by mohitmishra786/low-level-dev-skills

267 周安装量

32 GitHub Stars

GitHub

安装命令

npx skills add https://github.com/mohitmishra786/low-level-dev-skills --skill llvm

开发性能优化 C++

🇨🇳中文介绍

LLVM IR 与工具链

目的

引导智能体完成 LLVM IR 的完整流程：生成 IR、使用 opt 运行优化过程、使用 llc 降级为汇编，以及检查 IR 以进行调试或性能分析。

触发词

"显示这个函数的 LLVM IR"
"如何运行 LLVM 优化过程？"
"这条 LLVM IR 指令是什么意思？"
"如何编写自定义的 LLVM 过程？"
"为什么 LLVM 中没有发生自动向量化？"

工作流程

1. 生成 LLVM IR

# 生成文本格式 IR (.ll)
clang -O0 -emit-llvm -S src.c -o src.ll

# 生成位码 (.bc)
clang -O2 -emit-llvm -c src.c -o src.bc

# 将位码反汇编为文本
llvm-dis src.bc -o src.ll

2. 使用 `opt` 运行优化过程

# 应用特定过程
opt -passes='mem2reg,instcombine,simplifycfg' src.ll -S -o out.ll

# 标准优化流水线
opt -passes='default<O2>' src.ll -S -o out.ll
opt -passes='default<O3>' src.ll -S -o out.ll

# 列出可用过程
opt --print-passes 2>&1 | less

# 打印过程前后的 IR
opt -passes='instcombine' --print-before=instcombine --print-after=instcombine src.ll -S -o out.ll 2>&1 | less

广告位招租

在这里展示您的产品或服务

触达数万 AI 开发者，精准高效

联系我们

相关 Skills

find-skills 技能搜索工具 - Vercel Labs 开源智能体技能包管理器

776,000 周安装

Vercel React 最佳实践指南 | 58条Next.js性能优化规则与代码重构

261,300 周安装

Vercel Web界面规范检查工具 - 自动检测代码是否符合Web设计指南

210,800 周安装

agent-browser 浏览器自动化工具 - Vercel Labs 命令行网页操作与测试

140,500 周安装

结构	含义
`alloca`	栈分配（SSA 之前；`mem2reg` 会将其提升到寄存器）
`load`/`store`	内存访问
`getelementptr` (GEP)	指针运算 / 字段访问
`phi`	SSA φ-节点：合并来自前驱基本块的值
`call`/`invoke`	函数调用（`invoke` 包含异常边）
`icmp`/`fcmp`	整数/浮点数比较
`br`	分支（条件或无条件的）
`ret`	返回
`bitcast`	重新解释位（在代码生成中是无操作）
`ptrtoint`/`inttoptr`	指针↔整数转换（尽可能避免）

过程	效果
`mem2reg`	将 alloca 提升为 SSA 寄存器
`instcombine`	指令合并 / 窥孔优化
`simplifycfg`	控制流图清理，移除死代码块
`loop-vectorize`	自动向量化
`slp-vectorize`	超字级并行（直线代码向量化）
`inline`	函数内联
`gvn`	全局值编号（公共子表达式消除）
`licm`	循环不变代码外提
`loop-unroll`	循环展开
`argpromotion`	将指针参数提升为值
`sroa`	聚合体的标量替换

工具	用途
`llvm-dis`	位码 → 文本 IR
`llvm-as`	文本 IR → 位码
`llvm-link`	链接多个位码文件
`llvm-lto`	独立链接时优化
`llvm-nm`	位码/目标文件中的符号
`llvm-objdump`	反汇编目标文件
`llvm-profdata`	合并/显示配置文件引导优化数据
`llvm-cov`	覆盖率报告
`llvm-mca`	机器代码分析器（吞吐量/延迟）

🇺🇸English

LLVM IR and Tooling

Purpose

Guide agents through the LLVM IR pipeline: generating IR, running optimisation passes with opt, lowering to assembly with llc, and inspecting IR for debugging or performance work.

Triggers

"Show me the LLVM IR for this function"
"How do I run an LLVM optimisation pass?"
"What does this LLVM IR instruction mean?"
"How do I write a custom LLVM pass?"
"Why isn't auto-vectorisation happening in LLVM?"

Workflow

1. Generate LLVM IR

# Emit textual IR (.ll)
clang -O0 -emit-llvm -S src.c -o src.ll

# Emit bitcode (.bc)
clang -O2 -emit-llvm -c src.c -o src.bc

# Disassemble bitcode to text
llvm-dis src.bc -o src.ll

2. Run optimisation passes with `opt`

# Apply a specific pass
opt -passes='mem2reg,instcombine,simplifycfg' src.ll -S -o out.ll

# Standard optimisation pipelines
opt -passes='default<O2>' src.ll -S -o out.ll
opt -passes='default<O3>' src.ll -S -o out.ll

# List available passes
opt --print-passes 2>&1 | less

# Print IR before and after a pass
opt -passes='instcombine' --print-before=instcombine --print-after=instcombine src.ll -S -o out.ll 2>&1 | less

3. Lower IR to assembly with `llc`

# Compile IR to object file
llc -filetype=obj src.ll -o src.o

# Compile to assembly
llc -filetype=asm -masm-syntax=intel src.ll -o src.s

# Target a specific CPU
llc -mcpu=skylake -mattr=+avx2 src.ll -o src.s

# Show available targets
llc --version

4. Inspect IR

Key IR constructs to understand:

Construct	Meaning
`alloca`	Stack allocation (pre-SSA; `mem2reg` promotes to registers)
`load`/`store`	Memory access
`getelementptr` (GEP)	Pointer arithmetic / field access
`phi`	SSA φ-node: merges values from predecessor blocks
`call`/

5. Key passes

Pass	Effect
`mem2reg`	Promote alloca to SSA registers
`instcombine`	Instruction combining / peephole
`simplifycfg`	CFG cleanup, dead block removal
`loop-vectorize`	Auto-vectorisation
`slp-vectorize`	Superword-level parallelism (straight-line vectorisation)
`inline`	Function inlining

6. Debugging missed optimisations

# Why was a loop not vectorised?
clang -O2 -Rpass-missed=loop-vectorize -Rpass-analysis=loop-vectorize src.c

# Dump pass pipeline
clang -O2 -mllvm -debug-pass=Structure src.c -o /dev/null 2>&1 | less

# Print IR after each pass (very verbose)
opt -passes='default<O2>' -print-after-all src.ll -S 2>&1 | less

7. Useful llvm tools

Tool	Purpose
`llvm-dis`	Bitcode → textual IR
`llvm-as`	Textual IR → bitcode
`llvm-link`	Link multiple bitcode files
`llvm-lto`	Standalone LTO
`llvm-nm`	Symbols in bitcode/object
`llvm-objdump`	Disassemble objects

For binutils equivalents, see skills/binaries/binutils.

Related skills

Use skills/compilers/clang for source-level Clang flags
Use skills/binaries/linkers-lto for LTO at link time
Use skills/profilers/linux-perf combined with llvm-mca for micro-architectural analysis

Weekly Installs

267

Repository

mohitmishra786/…v-skills

GitHub Stars

First Seen

Feb 20, 2026

Security Audits

Gen Agent Trust HubPass SocketPass SnykPass

Installed on

opencode266

gemini-cli266

github-copilot266

codex266

amp266

kimi-cli266

LLVM IR 完整指南：生成、优化、降级与调试 LLVM 工具链

🇨🇳中文介绍

LLVM IR 与工具链

目的

触发词

工作流程

1. 生成 LLVM IR

2. 使用 `opt` 运行优化过程

相关 Skills

3. 使用 `llc` 将 IR 降级为汇编

4. 检查 IR

5. 关键过程

6. 调试未触发的优化

7. 有用的 llvm 工具

相关技能

🇺🇸English

LLVM IR and Tooling

Purpose

Triggers

Workflow

1. Generate LLVM IR

2. Run optimisation passes with `opt`

3. Lower IR to assembly with `llc`

4. Inspect IR

5. Key passes

6. Debugging missed optimisations

7. Useful llvm tools

Related skills

最新 Skills

LLVM IR 完整指南：生成、优化、降级与调试 LLVM 工具链

🇨🇳中文介绍

LLVM IR 与工具链

目的

触发词

工作流程

1. 生成 LLVM IR

2. 使用 opt 运行优化过程

相关 Skills

3. 使用 llc 将 IR 降级为汇编

4. 检查 IR

5. 关键过程

6. 调试未触发的优化

7. 有用的 llvm 工具

相关技能

🇺🇸English

LLVM IR and Tooling

Purpose

Triggers

Workflow

1. Generate LLVM IR

2. Run optimisation passes with opt

3. Lower IR to assembly with llc

4. Inspect IR

5. Key passes

6. Debugging missed optimisations

7. Useful llvm tools

Related skills

最新 Skills

2. 使用 `opt` 运行优化过程

3. 使用 `llc` 将 IR 降级为汇编

2. Run optimisation passes with `opt`

3. Lower IR to assembly with `llc`