@jvallesm
Collaborator

This commit

  • Adds the char count as the file length for text files
  • Adds coverage in the integration tests for this data
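
For illustration only: the pull request does not show how the count is computed. The following is a minimal Python sketch, assuming "char count" means the number of Unicode code points in UTF-8 decoded text rather than the byte size of the file. The helper name `text_length` is hypothetical and not taken from artifact-backend.

```python
def text_length(data: bytes, encoding: str = "utf-8") -> int:
    """Return the number of characters (Unicode code points) in the raw
    contents of a text file. Hypothetical helper, not the project's API."""
    return len(data.decode(encoding))


# Multi-byte characters make the byte size and the character count differ.
sample = "naïve café".encode("utf-8")
byte_count = len(sample)           # size of the file in bytes
char_count = text_length(sample)   # length in characters
```

Under this assumption the reported length of a text file reflects what a reader sees, not how many bytes the encoding happens to use.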

@jvallesm jvallesm self-assigned this Sep 25, 2025
@jvallesm jvallesm merged commit c92684f into main Sep 25, 2025
8 checks passed
@jvallesm jvallesm deleted the jvalles/page-extraction branch September 25, 2025 14:52
jvallesm pushed a commit that referenced this pull request Oct 7, 2025
🤖 I have created a release *beep* *boop*
---


## [0.33.0](v0.32.0...v0.33.0) (2025-09-30)


### Features

* **chunking:** page by chunk when page delimiters are present ([#262](#262)) ([13b766e](13b766e))
* **conversion:** add char length to text files ([#261](#261)) ([c92684f](c92684f))
* **conversion:** extract pages in conversion step ([#260](#260)) ([925dd1c](925dd1c))


### Bug Fixes

* **reprocess:** clean up data from previous file processing ([#259](#259)) ([56f0368](56f0368))
* **usage:** add missing error filtering for users/admin ([#257](#257)) ([c42ac14](c42ac14))


### Miscellaneous

* **blob:** make file blob paths more descriptive ([#263](#263)) ([c23ad31](c23ad31))

---
This PR was generated with [Release Please](https://github.com/googleapis/release-please). See [documentation](https://github.com/googleapis/release-please#release-please).

<!-- CURSOR_SUMMARY -->
---

> [!NOTE]
> Prepare 0.33.0 release by updating CHANGELOG and bumping manifest version.
> 
> - **Release 0.33.0**
>   - Update `CHANGELOG.md` with:
>     - **Features**: chunking by page when delimiters exist; conversion adds char length to text files; extract pages during conversion.
>     - **Bug Fixes**: clean up stale data on reprocess; add missing error filtering for users/admin.
>     - **Misc**: more descriptive file blob paths.
>   - Bump version in `release-please/manifest.json` from `0.32.0` to `0.33.0`.
> 
> <sup>Written by [Cursor Bugbot](https://cursor.com/dashboard?tab=bugbot) for commit c4e8bbd. This will update automatically on new commits. Configure [here](https://cursor.com/dashboard?tab=bugbot).</sup>
<!-- /CURSOR_SUMMARY -->
donch1989 pushed a commit to instill-ai/instill-core that referenced this pull request Oct 8, 2025
Because
- The version of the artifact-backend service is not updated in the instill-core repository.

This commit
- updates the `ARTIFACT_BACKEND_VERSION` in the `.env` file to `0.33.0`.
- updates the `artifactBackend.image.tag` in the Helm chart `values.yaml` file to `0.33.0`.

## Changes in artifact-backend
- chore(main): release 0.33.0 (instill-ai/artifact-backend#258)
- chore(blob): make file blob paths more descriptive (instill-ai/artifact-backend#263)
- feat(chunking): page by chunk when page delimiters are present (instill-ai/artifact-backend#262)
- feat(conversion): add char length to text files (instill-ai/artifact-backend#261)
- feat(conversion): extract pages in conversion step (instill-ai/artifact-backend#260)

Co-authored-by: jvallesm <3977183+jvallesm@users.noreply.github.com>