llms.txt Benchmark
Baseline performance measurements for the zerodep llms.txt parser. No competing zero-dependency Python library exists for comparison, so this benchmark establishes a baseline for future optimization.
Test Environment
- CPU: x86_64 Linux
- Python: 3.12
- Tool: pytest-benchmark 5.2.3 (mean values reported)
- Reference: N/A (no competing library)
- Last Updated: 2026-04-27
Implementation
| Implementation | File/Package | Description |
|---|---|---|
| zerodep | llmstxt.py | Regex-split parser + urllib.parse for URL handling |
Data Sizes Tested
| Label | Description |
|---|---|
| Small | H1, blockquote, 1 section with 3 entries (~10 lines) |
| Medium | H1, blockquote, details, 4 sections with 6-10 entries each, Optional section (~50 lines) |
| Large | H1, blockquote, details, 10 sections × 50 entries each + Optional (~600 lines) |
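Inputs of roughly these shapes can be generated rather than written by hand. The sketch below is illustrative only: `make_llms_txt` is a hypothetical helper, not part of zerodep, and the titles, URLs, and the choice of 8 entries per Medium section are placeholder assumptions.

```python
def make_llms_txt(sections: int, entries: int,
                  details: bool = True, optional: bool = True) -> str:
    """Build a synthetic llms.txt document (hypothetical helper, not part of zerodep)."""
    # H1 title followed by the blockquote summary.
    lines = ["# Example Project", "", "> One-line summary of the project.", ""]
    if details:
        lines += ["Free-form details paragraph.", ""]
    for s in range(sections):
        lines += [f"## Section {s}", ""]
        lines += [
            f"- [Doc {s}-{e}](https://example.com/s{s}/e{e}.md): Entry {e}"
            for e in range(entries)
        ]
        lines.append("")
    if optional:
        lines += ["## Optional", "",
                  "- [Extra](https://example.com/extra.md): Skippable context", ""]
    return "\n".join(lines)


# Approximations of the three labels in the table above.
small = make_llms_txt(sections=1, entries=3, details=False, optional=False)
medium = make_llms_txt(sections=4, entries=8)
large = make_llms_txt(sections=10, entries=50)
```

The generated line counts only approximate the figures in the table; the exact fixtures used by the benchmark may differ.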
Parse Performance (Mean)
| Data Size | zerodep |
|---|---|
| Small | ~7 us |
| Medium | ~12 us |
| Large | ~1,050 us |
Key Takeaways
- Microsecond-level parsing: small and medium files parse in under 15 us.
- Linear scaling: performance scales linearly with entry count.
- Regex-split approach: splitting on H2 headers gives O(n) parsing with minimal overhead.
- Zero pip dependencies: uses only `re`, `dataclasses`, and `urllib.parse` from the standard library.
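To make the regex-split idea concrete, here is a minimal sketch of splitting on H2 headers using only `re`, `dataclasses`, and `urllib.parse`. It is not the zerodep implementation: the names `Entry`, `Section`, and `parse_sections`, the entry pattern, and the decision to keep only http(s) links are all assumptions made for illustration.

```python
import re
from dataclasses import dataclass, field
from urllib.parse import urlparse


@dataclass
class Entry:
    title: str
    url: str
    note: str = ""


@dataclass
class Section:
    name: str
    entries: list[Entry] = field(default_factory=list)


# Matches "- [title](url)" with an optional ": note" suffix on the same line.
_ENTRY = re.compile(
    r"^-[ \t]*\[([^\]]+)\]\(([^)\s]+)\)(?::[ \t]*(.*))?$", re.MULTILINE
)


def parse_sections(text: str) -> list[Section]:
    # With a capture group, re.split returns
    # [preamble, name1, body1, name2, body2, ...].
    parts = re.split(r"^##[ \t]+(.+)$", text, flags=re.MULTILINE)
    sections = []
    for name, body in zip(parts[1::2], parts[2::2]):
        section = Section(name.strip())
        for title, url, note in _ENTRY.findall(body):
            # Illustrative choice: keep only absolute http(s) links.
            if urlparse(url).scheme in ("http", "https"):
                section.entries.append(Entry(title, url, note.strip()))
        sections.append(section)
    return sections
```

The split and the entry scan each pass over the text once, which is the shape of work that the O(n) claim above describes.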
Run It Yourself
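As a rough stand-in for a local measurement, the standard-library `timeit` module can time a parse call. This is a sketch under stated assumptions, not the project's benchmark suite: `parse` below is a placeholder workload to be replaced with the real parser call, `mean_microseconds` is a hypothetical helper, and its numbers are not directly comparable to the pytest-benchmark means reported above.

```python
import re
import timeit


def parse(text: str) -> list[str]:
    # Placeholder workload: swap in the real parser call here.
    return re.split(r"^##[ \t]+", text, flags=re.MULTILINE)


def mean_microseconds(func, arg, number: int = 1000, repeat: int = 5) -> float:
    """Time per call in microseconds, averaged over the fastest of `repeat` runs."""
    timer = timeit.Timer(lambda: func(arg))
    best = min(timer.repeat(repeat=repeat, number=number))
    return best / number * 1e6


# Small synthetic document with 10 H2 sections (placeholder content).
doc = "# Title\n\n> Summary\n\n" + "".join(
    f"## Section {i}\n\n- [Doc](https://example.com/{i}.md): note\n\n"
    for i in range(10)
)

print(f"{mean_microseconds(parse, doc):.2f} us per call")
```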
Latest CI Results
Updated automatically on each release via Benchmark CI.