Email preprocessing for LLM consumption
Parse, clean, and structure raw emails into formats optimized for language models. Written in Rust. Available for Node.js and Python.
Get Started View on GitHub llms.txt
What it does
- HTML to Markdown — converts HTML email bodies to clean Markdown, preserving semantic structure and stripping tracking pixels.
- Reply detection — identifies and removes quoted content across Gmail, Outlook, Apple Mail, and non-standard clients.
- Signature stripping — removes email signatures using heuristic pattern matching. No ML, no training data required.
- CTA extraction — surfaces calls-to-action by position and structure, returning structured data your LLM can act on.
Quick install
npm install langmail
pip install langmail
cargo add langmail
AI coding agents
Drop this prompt into Claude Code, Cursor, or any other coding agent to have it integrate langmail into your project:
Integrate the langmail library into this project. langmail is an
email preprocessing library that prepares raw email content for LLM
consumption — parsing, cleaning, and rendering it into context strings.
Before writing any code, fetch the current API reference at:
https://langmail.dev/llms.txt
Use it to determine the correct package name, install method, and API
for this project's language and runtime, then implement accordingly.