Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Feature/parallel doc loading #3

Merged
merged 2 commits into from
Mar 19, 2025
Merged

Conversation

ahmed-ashraff
Copy link
Contributor

Feature/Parallel Document Loading

Description

This feature introduces parallel document loading to improve performance. By utilizing concurrency, the document loading process is now significantly faster.

Key Changes

  • Implemented parallel processing for document loading.
  • Refactored the loader logic to handle concurrency safely.
  • Ensured synchronization to prevent race conditions.

Performance Improvement

  • Achieved a 5.01x speedup compared to the previous sequential implementation on my D partition.

Testing

  • Added unit tests to verify correct document loading behavior.
  • Conducted performance benchmarks to validate the speedup.

Impact

This enhancement greatly improves efficiency, especially for large data directories.

@ahmed-ashraff ahmed-ashraff merged commit a4d9844 into master Mar 19, 2025
2 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant