czirker commented Sep 27, 2023

No description provided.

Fixed bug with webpack removing require in aws-sdk-sync.ts
Added WriteOptions.onCheckpoint - called when checkpoint is called in a toLeo
Added ReadOptions.parser to provide a custom parser
Added ReadOptions.hooks to allow extending the fromLeo stream
Added sdk.streams.tmpDir which defaults to `/tmp/rstreams`
czirker requested a review from jgrantr September 27, 2023 16:15
split((value) => {
try {
let obj = JSONparse(value);
if (obj.size == null) {

This is taken care of in JSONparse, so it shouldn't be needed here.
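
To illustrate the idea behind this comment, here is a minimal sketch of a parse helper that fills in a missing `size` itself, so callers need no fallback check. `parseWithSize` and the `StreamRecord` shape are hypothetical stand-ins, and defaulting to the line's byte length is an assumption; the PR's actual `JSONparse` is not shown in this excerpt and may behave differently.

```typescript
// Hypothetical record shape; only `size` matters for this sketch.
interface StreamRecord {
  size?: number | null;
  [key: string]: unknown;
}

// Hypothetical stand-in for the PR's JSONparse (not its real implementation):
// parses one line and, as an assumption, defaults `size` to the line's
// UTF-8 byte length when the record does not carry one.
function parseWithSize(line: string): StreamRecord {
  const obj = JSON.parse(line) as StreamRecord;
  if (obj.size == null) {
    obj.size = Buffer.byteLength(line, "utf8");
  }
  return obj;
}
```

With a helper like this, the `if (obj.size == null)` branch in the calling `split` callback becomes redundant, which appears to be what the comment is pointing out.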

file.range = `bytes=${fileOffset}-${fileEnd}`;
logger.debug(file.bucket, file.key, file.range);
s3Stream = ls.fromS3(file);
let localFile = path.resolve(ls.tmpDir, `s3/${item.s3.key.replace(".gz", "")}-${fileOffset}-${fileEnd}.jsonl.gz`);

Review all localFile locations. They should all include the queue in the path.
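
One possible reading of this note, sketched under assumptions: a single helper builds every local range-file path and takes the queue as an argument, so no call site can leave it out. `localRangeFile` and its parameter names are hypothetical, and the directory layout (`s3/<queue>/...`) is an illustration rather than the layout the PR settled on.

```typescript
import * as path from "path";

// Hypothetical helper (not from the PR): builds the local path for one byte
// range of an S3 file, with the queue as a directory segment so that the
// same key read for two different queues maps to two different files.
function localRangeFile(
  tmpDir: string,
  queue: string,
  key: string,
  start: number,
  end: number
): string {
  // Same ".gz" stripping as the line under review.
  const base = key.replace(".gz", "");
  return path.resolve(tmpDir, "s3", queue, `${base}-${start}-${end}.jsonl.gz`);
}
```

A call site would then pass `ls.tmpDir`, the queue, `item.s3.key`, `fileOffset`, and `fileEnd` instead of assembling the template string inline.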

czirker and others added 5 commits September 28, 2023 17:04
Passing s3 credentials to the task
removed merge file V2
Better support for FastJson - handle wildcards '*' and arrays
…-webpack

Feature/faster stream parsing webpack
// If there are multiple files then merge them into the one desired file
if (!error && subFiles.length > 1) {
try {
execSync(`cat ${subFiles.map(p => basename(p).replace(/\\/g, "/")).join(" ")} >> ${task.filePath}`, {

Check warning

Code scanning / CodeQL

Shell command built from environment values

This shell command depends on an uncontrolled absolute path.
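
A common way to address this class of warning is to avoid the shell entirely. The sketch below appends files with Node streams instead of `cat ... >>`; `appendFiles` is a hypothetical name, not code from this PR, and it assumes the parts can simply be appended byte for byte in order, as the original command does.

```typescript
import * as fs from "fs";
import { pipeline } from "stream/promises";

// Hypothetical replacement for the `cat a b >> target` shell call.
// No shell is spawned, so file names are never parsed as shell syntax.
async function appendFiles(parts: string[], target: string): Promise<void> {
  for (const part of parts) {
    // flags: "a" opens the target in append mode, mirroring `>>`.
    await pipeline(
      fs.createReadStream(part),
      fs.createWriteStream(target, { flags: "a" })
    );
  }
}
```

Because each part is awaited before the next one starts, the output order matches the order of `parts`. Unlike `execSync`, this is asynchronous, so the caller would need to await it.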