
Conversation

@omeganon

Thanks for contributing to Logstash! If you haven't already signed our CLA, here's a handy link: https://www.elastic.co/contributor-agreement/

This change adds an optional new 'date_prefix' configuration option that
allows for the creation of a date-based prefix in the S3 path. It works
in combination with the existing 'prefix' option if desired. Any value
for this option is expected to be a time definition suitable for passing
through strftime (e.g. '%Y-%m-%d/'). The value of this option is
inserted between 'prefix' and the filename generated by Logstash. This
is particularly useful if you have large volumes of logs that you wish
to keep for long periods of time (archival purposes). For example, a
5-minute time_file setting results in 156,960 files per sending host
over an 18-month retention period (288 files per day for roughly 545
days). That number of files is more convenient to work with,
particularly in GUI-based tools, when it is distributed into smaller
groups. This also allows for more granular loading of files into
Elasticsearch.
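
For illustration, a minimal sketch of how the proposed option would be configured; the bucket name is hypothetical, and `date_prefix` is the new option this PR introduces, not an existing setting:

```
output {
  s3 {
    bucket      => "my-archive-bucket"   # hypothetical bucket name
    prefix      => "logs/"               # existing static prefix
    date_prefix => "%Y-%m-%d/"           # proposed: strftime format inserted after 'prefix'
    time_file   => 5                     # rotate every 5 minutes
  }
}
```

A file rotated on 15 Dec 2016 would then land under `logs/2016-12-15/` rather than all objects accumulating directly under `logs/`.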

@joshuaspence (Contributor)

+1

elasticsearch-bot pushed a commit that referenced this pull request Dec 15, 2016
**Motivation**
One of the most requested features was a way to add dynamic prefixes, using the fieldref
syntax, for the files on the bucket, along with the pipeline changes needed to support the shared delegator.
The S3 output has always been single-threaded for writes, with multiple workers processing the uploads; the code was only thread-safe when used in the `:single` concurrency mode.

This PR addresses a few problems and provides shorter, more structured code:
- The plugin now uses the V2 version of the AWS SDK; this makes sure we receive the latest updates and changes.
- We now use S3's `upload_file` instead of reading chunks ourselves; this method is more efficient and will use multipart uploads with threads if the file is too big.
- You can now use the `fieldref` syntax in the prefix to dynamically change the upload target based on the events the plugin receives (see the sketch after this list).
- The upload queue is now a bounded list; this is necessary to allow back pressure to be communicated back to the pipeline, and the size is configurable by the user.
- If the queue is full, the plugin will start the upload in the current thread.
- The plugin is now thread-safe and supports the `shared` concurrency model.
- The rotation strategy can be selected; the recommended one is `size_and_time`, which checks both configured limits (`size` and `time` are also available).
- The `restore` option will now use a separate thread pool with an unbounded queue.
- The `restore` option will not block the launch of Logstash and will use fewer resources than the real-time path.
- The plugin now uses `multi_receive_encode`, which optimizes the writes to the files.
- Rotate operations are now batched to reduce the number of IO calls.
- Empty files will not be uploaded by any rotation strategy.
- We now use Concurrent-Ruby for the implementation of the Java executor.
- If you have finer-grained permissions on prefixes or want a faster boot, you can disable the credentials check with `validate_credentials_on_root_bucket`.
- The credentials check will no longer fail if we can't delete the test file.
- We now have a full suite of integration tests for all the defined rotation strategies.
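
A minimal sketch of a config exercising these options; the bucket name and values are hypothetical, `%{host}` assumes events carry a `host` field, and option names such as `upload_queue_size` should be checked against the v4 plugin docs:

```
output {
  s3 {
    bucket            => "my-archive-bucket"        # hypothetical bucket
    region            => "us-east-1"
    # fieldref + date interpolation: each distinct host/day gets its own prefix
    prefix            => "logs/%{host}/%{+YYYY-MM-dd}/"
    rotation_strategy => "size_and_time"            # rotate on whichever limit is hit first
    size_file         => 10485760                   # 10 MB
    time_file         => 5                          # minutes
    upload_queue_size => 4                          # bounded queue; when full, upload runs in the current thread
  }
}
```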

Fixes: #4 #81 #44 #59 #50

Fixes #102
@barqshasbite

The prefix setting fully supports logstash interpolation now, which can accomplish what this PR is trying to do. So, this PR can be closed.

@kares (Contributor) commented Feb 7, 2020

> The prefix setting fully supports logstash interpolation now, which can accomplish what this PR is trying to do. So, this PR can be closed.

thanks for the PR, ^^ makes sense - we do not want too many prefix options (that would be confusing) ...
ideally one should be enough; if anyone feels `prefix` cannot accomplish this, let us know.
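
For illustration, a sketch of achieving what this PR proposed using interpolation in `prefix` alone (Joda-style date pattern):

```
prefix => "logs/%{+YYYY-MM-dd}/"
```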

@kares closed this Feb 7, 2020