`file_input` operator

The file_input operator reads logs from files. It will place the lines read into the message field of the new entry.

Configuration Fields

Field	Default	Description
`id`	`file_input`	A unique identifier for the operator.
`output`	Next in pipeline	The connected operator(s) that will receive all outbound entries.
`include`	required	A list of file glob patterns that match the file paths to be read.
`exclude`	[]	A list of file glob patterns to exclude from reading.
`poll_interval`	200ms	The duration between filesystem polls.
`multiline`		A `multiline` configuration block. See below for details.
`force_flush_period`	`500ms`	Time since last read of data from file, after which currently buffered log should be send to pipeline. Takes duration as value. Zero means waiting for new data forever.
`encoding`	`utf-8`	The encoding of the file being read. See the list of supported encodings below for available options.
`include_file_name`	`true`	Whether to add the file name as the attribute `log.file.name`.
`include_file_path`	`false`	Whether to add the file path as the attribute `log.file.path`.
`include_file_name_resolved`	`false`	Whether to add the file name after symlinks resolution as the attribute `log.file.name_resolved`.
`include_file_path_resolved`	`false`	Whether to add the file path after symlinks resolution as the attribute `log.file.path_resolved`.
`start_at`	`end`	At startup, where to start reading logs from the file. Options are `beginning` or `end`. This setting will be ignored if previously read file offsets are retrieved from a persistence mechanism.
`fingerprint_size`	`1kb`	The number of bytes with which to identify a file. The first bytes in the file are used as the fingerprint. Decreasing this value at any point will cause existing fingerprints to forgotten, meaning that all files will be read from the beginning (one time).
`max_log_size`	`1MiB`	The maximum size of a log entry to read before failing. Protects against reading large amounts of data into memory
`max_concurrent_files`	1024	The maximum number of log files from which logs will be read concurrently (minimum = 2). If the number of files matched in the `include` pattern exceeds half of this number, then files will be processed in batches. One batch will be processed per `poll_interval`.
`attributes`	{}	A map of `key: value` pairs to add to the entry's attributes.
`resource`	{}	A map of `key: value` pairs to add to the entry's resource.

Note that by default, no logs will be read unless the monitored file is actively being written to because start_at defaults to end.

include and exclude fields use github.com/bmatcuk/doublestar for expression language. For reference documentation see here.

`multiline` configuration

If set, the multiline configuration block instructs the file_input operator to split log entries on a pattern other than newlines.

The multiline configuration block must contain exactly one of line_start_pattern or line_end_pattern. These are regex patterns that match either the beginning of a new log entry, or the end of a log entry.

If using multiline, last log can sometimes be not flushed due to waiting for more content. In order to forcefully flush last buffered log after certain period of time, use force_flush_period option.

Also refer to recombine operator for merging events with greater control.

File rotation

When files are rotated and its new names are no longer captured in include pattern (i.e. tailing symlink files), it could result in data loss. To avoid the data loss, choose move/create rotation method and set max_concurrent_files higher than the twice of the number of files to tail.

Supported encodings

Key	Description
`nop`	No encoding validation. Treats the file as a stream of raw bytes
`utf-8`	UTF-8 encoding
`utf-16le`	UTF-16 encoding with little-endian byte order
`utf-16be`	UTF-16 encoding with little-endian byte order
`ascii`	ASCII encoding
`big5`	The Big5 Chinese character encoding

Other less common encodings are supported on a best-effort basis. See https://www.iana.org/assignments/character-sets/character-sets.xhtml for other encodings available.

Example Configurations

Simple file input

Configuration:

- type: file_input
  include:
    - ./test.log

`./test.log`	Output bodies
`log1 log2 log3`	{ "body": "log1" }, { "body": "log2" }, { "body": "log3" }

Multiline file input

Configuration:

- type: file_input
  include:
    - ./test.log
  multiline:
    line_start_pattern: 'START '

`./test.log`	Output bodies
`START log1 log2 START log3 log4`	{ "body": "START log1\nlog2\n" }, { "body": "START log3\nlog4\n" }

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

file_input.md

file_input.md

`file_input` operator

Configuration Fields

`multiline` configuration

File rotation

Supported encodings

Example Configurations

Simple file input

Multiline file input

Files

file_input.md

Latest commit

History

file_input.md

File metadata and controls

file_input operator

Configuration Fields

multiline configuration

File rotation

Supported encodings

Example Configurations

Simple file input

Multiline file input

`file_input` operator

`multiline` configuration