Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

fix chunk method "Table" losing content when the Excel file has multi… #4123

Merged
merged 1 commit into from
Dec 19, 2024

Conversation

ly0303521
Copy link
Contributor

@ly0303521 ly0303521 commented Dec 19, 2024

…ple sheets

What problem does this PR solve?

discussed in #4102

  • In excel_parser.py, total means the total number of rows in Excel, but it return in the first iterate, that lead to the wrong to_page
  • In table.py, it when Excel file has multiple sheets, it will be divided into multiple parts, every part size is 3000, data may be empty, because it has recorded in the last iterate.

Type of change

  • Bug Fix (non-breaking change which fixes an issue)

@KevinHuSh KevinHuSh added the ci Continue Integration label Dec 19, 2024
@KevinHuSh KevinHuSh merged commit 101b8ff into infiniflow:main Dec 19, 2024
2 checks passed
learnerLj pushed a commit to learnerLj/ragflow that referenced this pull request Dec 19, 2024
infiniflow#4123)

…ple sheets

### What problem does this PR solve?
discussed in infiniflow#4102
- In excel_parser.py, `total` means the total number of rows in Excel,
but it return in the first iterate, that lead to the wrong `to_page`
- In table.py, it when Excel file has multiple sheets, it will be
divided into multiple parts, every part size is 3000, `data` may be
empty, because it has recorded in the last iterate.
### Type of change

- [x] Bug Fix (non-breaking change which fixes an issue)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
ci Continue Integration
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants