Feature/faster stream parsing #188
base: master
Fixed bug with webpack removing require in aws-sdk-sync.ts
Added WriteOptions.onCheckpoint - called when checkpoint is called in a toLeo
Added ReadOptions.parser to provide a custom parser
Added ReadOptions.hooks to allow extending the fromLeo stream
Added sdk.streams.tmpDir, which defaults to `/tmp/rstreams`
split((value) => {
  try {
    let obj = JSONparse(value);
    if (obj.size == null) {
This is taken care of in JSONparse, so it shouldn't be needed here.
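To illustrate the point of this comment, here is a minimal sketch of a parse wrapper that fills in a missing `size` itself, so callers do not need to repeat the null check. The name `parseWithSize` is hypothetical, and defaulting `size` to the byte length of the serialized line is an assumption; the actual `JSONparse` in this PR may compute it differently.

```javascript
// Hypothetical stand-in for the PR's JSONparse; not the library's implementation.
// Assumption: a missing `size` defaults to the byte length of the raw line.
function parseWithSize(line) {
  const obj = JSON.parse(line);
  if (obj.size == null) {
    // Covers both `undefined` and `null`, matching the `== null` check in the diff.
    obj.size = Buffer.byteLength(line);
  }
  return obj;
}

// Callers can then rely on `size` always being present.
const parsed = parseWithSize('{"id":"a"}');
const sized = parseWithSize('{"id":"b","size":5}');
```

With the default applied in one place, the `if (obj.size == null)` branch inside the `split` callback becomes redundant, which is what the reviewer is pointing out.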
lib/stream/leo-stream.js
Outdated
file.range = `bytes=${fileOffset}-${fileEnd}`;
logger.debug(file.bucket, file.key, file.range);
s3Stream = ls.fromS3(file);
let localFile = path.resolve(ls.tmpDir, `s3/${item.s3.key.replace(".gz", "")}-${fileOffset}-${fileEnd}.jsonl.gz`);
Review all localFile locations. They should have the queue
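One possible reading of this comment is that each local cache path should be namespaced by the queue being read, so that two queues pulling the same S3 key and byte range do not share a file. The sketch below assumes that reading; `localRangePath` and its parameters are hypothetical and not part of the PR.

```javascript
const path = require("path");

// Hypothetical helper: builds the local path for one byte range of an S3 file,
// with the queue name as a directory level (an assumed interpretation of the review comment).
function localRangePath(tmpDir, queue, s3Key, fileOffset, fileEnd) {
  // Strip only a trailing ".gz"; the diff's replace(".gz", "") removes the first match anywhere in the key.
  const base = s3Key.replace(/\.gz$/, "");
  return path.resolve(tmpDir, "s3", queue, `${base}-${fileOffset}-${fileEnd}.jsonl.gz`);
}

const example = localRangePath("/tmp/rstreams", "my-queue", "bus/a/b.gz", 0, 99);
```

Centralizing the path construction in one function would also make it easier to review every `localFile` location, since each call site would go through the same code.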
Passing s3 credentials to the task
removed merge file V2
Better support for FastJson - handle wildcards '*' and arrays
…-webpack Feature/faster stream parsing webpack
// If there are multiple files then merge them into the one desired file
if (!error && subFiles.length > 1) {
  try {
    execSync(`cat ${subFiles.map(p => basename(p).replace(/\\/g, "/")).join(" ")} >> ${task.filePath}`, {
Check warning
Code scanning / CodeQL
Shell command built from environment values
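One way to address this kind of warning is to avoid the shell entirely and append the files with `fs` calls, so that file names are never interpreted as shell syntax. This is a minimal sketch and not the fix adopted in the PR; `mergeFilesSync` is a hypothetical name, and it reads each part fully into memory, which may be unsuitable for very large files.

```javascript
const fs = require("fs");
const os = require("os");
const path = require("path");

// Hypothetical helper: appends each part file to the target, in order, without spawning a shell.
// Paths go straight to fs calls, so spaces or metacharacters in a name are treated as plain text.
function mergeFilesSync(partPaths, targetPath) {
  for (const part of partPaths) {
    // appendFileSync creates the target if it does not exist, similar to `>>`.
    fs.appendFileSync(targetPath, fs.readFileSync(part));
  }
}

// Usage: merge two parts, one with a name that would be unsafe in a shell string.
const dir = fs.mkdtempSync(path.join(os.tmpdir(), "merge-"));
const partA = path.join(dir, "part a.jsonl");
const partB = path.join(dir, "part;b.jsonl");
fs.writeFileSync(partA, '{"n":1}\n');
fs.writeFileSync(partB, '{"n":2}\n');
const target = path.join(dir, "merged.jsonl");
mergeFilesSync([partA, partB], target);
const merged = fs.readFileSync(target, "utf8");
```

For large files, a streaming variant (for example, piping each part into a single write stream opened in append mode) would keep memory use bounded at the cost of asynchronous control flow.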
Added 16.x and 18.x node ci/cd
Bug/faster stream parsing fixes
…ip data to prevent a write-after-destroy error
Fix config builder when an object isn't actually a ref
Fixing mem leak, uncaught exceptions, etc