docs: v1.1.0 structured extraction — README + website #3

Merged
NameetP merged 1 commit into main from docs/v1.1.0-structured-extraction on Mar 12, 2026

Conversation

@NameetP (Owner) commented on Mar 12, 2026

Summary

  • README — new Structured Extraction section covering JSON output, key-value extraction, table extraction, date/amount/rate normalization, and schema-guided mapping with examples
  • Website — v1.1.0 bump, two new feature cards, new Structured Extraction section with JSON output examples, pipeline step 6, updated FAQ + footer
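
To illustrate the date/amount/rate normalization named in the summary, here is a minimal sketch. The function names, the list of accepted date formats, and the output field names are assumptions made for illustration and are not taken from the project's code; the actual implementation may differ.

```python
import json
import re
from datetime import datetime
from decimal import Decimal, InvalidOperation

# Hypothetical helpers for illustration only; not the project's actual API.
DATE_FORMATS = ("%Y-%m-%d", "%m/%d/%Y", "%B %d, %Y", "%b %d, %Y")


def normalize_date(text):
    """Return an ISO 8601 date string, or None if no assumed format matches."""
    for fmt in DATE_FORMATS:
        try:
            return datetime.strptime(text.strip(), fmt).date().isoformat()
        except ValueError:
            continue
    return None


def normalize_amount(text):
    """Strip currency symbols and thousands separators; return a Decimal or None."""
    cleaned = re.sub(r"[^\d.\-]", "", text)
    try:
        return Decimal(cleaned)
    except InvalidOperation:
        return None


def normalize_rate(text):
    """Convert a percentage string such as '4.5%' to a decimal fraction, or None."""
    match = re.search(r"(-?\d+(?:\.\d+)?)\s*%", text)
    return Decimal(match.group(1)) / 100 if match else None


# Assumed raw key-value pairs as they might come out of a document.
raw = {"issue_date": "March 12, 2026", "total": "$1,250.00", "rate": "4.5%"}

record = {
    "issue_date": normalize_date(raw["issue_date"]),
    "total": str(normalize_amount(raw["total"])),
    "rate": str(normalize_rate(raw["rate"])),
}
print(json.dumps(record, indent=2))
```

Decimal is used rather than float so that monetary values keep their exact digits when serialized as strings.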

Test plan

  • 225 tests pass
  • Design system pre-commit hook passes (no violations)
  • Visual check on site preview

🤖 Generated with Claude Code

README:
- Updated pipeline diagram to show JSON output
- Added --schema CLI option to options table
- New "Structured Extraction" section with JSON, KV, table, normalization,
  and schema-guided extraction docs + examples
- Updated MCP server and project structure sections

Website (site/index.html):
- Version bump to v1.1.0
- Two new feature cards: structured extraction + value normalization
- New "Structured Extraction" section with JSON output examples
- Pipeline step 6 (Structure) added
- FAQ and footer updated

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

NameetP merged commit 98f2263 into main on Mar 12, 2026
1 of 2 checks passed
NameetP deleted the docs/v1.1.0-structured-extraction branch on March 12, 2026 13:13
NameetP added a commit that referenced this pull request Mar 12, 2026
Co-authored-by: Nameet Potnis <nameetpotnis@Nameets-MacBook-Pro.local>
Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
NameetP pushed a commit that referenced this pull request Mar 21, 2026
…dataloader-bench)

- Add Docling table overlay for docs the classifier missed (TEDS 0.704 → 0.884)
- Add TOC table filter to prevent false table injection
- Add split bold heading detection for multi-span bold patterns
- Add horizontal-line table detection (h_lines >= 4)
- Add garbage table filter in fast.py for PyMuPDF 1.27 regression
- Update README with #3 ranking (0.867 overall)

Benchmark: 0.8671 overall, 0.910 NID, 0.884 TEDS, 0.739 MHS

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
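
The horizontal-line rule listed in the commit above (h_lines >= 4) can be sketched roughly as follows. Only the threshold of 4 comes from the commit message; the segment representation, the tolerance values, and both function names are hypothetical and chosen for illustration.

```python
def count_horizontal_lines(segments, y_tolerance=1.0, min_length=20.0):
    """Count distinct horizontal rules among (x0, y0, x1, y1) line segments.

    A segment counts as horizontal when its endpoints differ in y by at most
    y_tolerance and it spans at least min_length in x. Segments whose y
    positions fall within y_tolerance of a cluster's first line are merged.
    """
    ys = sorted(
        (y0 + y1) / 2
        for x0, y0, x1, y1 in segments
        if abs(y1 - y0) <= y_tolerance and abs(x1 - x0) >= min_length
    )
    count = 0
    cluster_start = None
    for y in ys:
        if cluster_start is None or y - cluster_start > y_tolerance:
            count += 1
            cluster_start = y
    return count


def looks_like_ruled_table(segments, min_h_lines=4):
    """Flag a region as a table candidate when it has enough horizontal rules."""
    return count_horizontal_lines(segments) >= min_h_lines


# Four horizontal rules plus one vertical border (assumed page coordinates).
ruled = [(50, y, 400, y) for y in (100, 120, 140, 160)] + [(50, 100, 50, 160)]
print(looks_like_ruled_table(ruled))
```

A rule like this is only a candidate filter; underlines and page separators would also count as horizontal lines, which is presumably why a minimum count is required.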