Examples
Jupyter notebook → LLM-ready Markdown
Cell addresses, dependency hints and figure placeholders from a pandas + matplotlib notebook.
sales-analysis.ipynb · QC 100%Out-of-order notebook, caught and flagged
Execution-order warnings and stale-state detection on a messy notebook.
out-of-order.ipynb · QC 100%Word document → Markdown with heading semantics
Styles map to real headings; the table survives as GFM.
q1-operations-review.docx · QC 100%CSV → typed table for LLM context
Column types annotated, truncation explicit, pipes escaped.
sales.csv · QC 100%Web article → clean Markdown
Navigation and ads stripped; the article, its table and figures kept.
article.html · QC 100%Slide deck → outlined Markdown with speaker notes
Slides become titled sections; the table survives as GFM, images become figures, and speaker notes surface.
quarterly-kickoff.pptx · QC 100%Email thread → readable message with honest truncation
Decoded headers, the newest message intact, the quoted reply chain truncated explicitly, attachments listed.
team-thread.eml · QC 100%PDF → Markdown, with honest limits
Per-page text extraction with the layout-lossy disclaimer attached.
vendor-report.pdf · QC 80%