互联网技术圈互联网技术圈互联网技术圈

» »

« Msgpack codec plugin Netflow codec plugin »

Multiline codec plugin

Plugin version: v3.0.10
Released on: 2018-06-29
Changelog

For other versions, see theVersioned plugin docs.

Getting Help

For questions about the plugin, open a topic in the Discuss forums. For bugs or feature requests, open an issue in Github.For the list of Elastic supported plugins, please consult the Elastic Support Matrix.

Description

The multiline codec will collapse multiline messages and merge them into asingle event.

If you are using a Logstash input plugin that supports multiplehosts, such as the beats input plugin, you should not usethe multiline codec to handle multiline events. Doing so may result in themixing of streams and corrupted event data. In this situation, you need tohandle multiline events before sending the event data to Logstash.

The original goal of this codec was to allow joining of multiline messagesfrom files into a single event. For example, joining Java exception andstacktrace messages into a single event.

The config looks like this:

input {  stdin {    codec => multiline {      pattern => "pattern, a regexp"      negate => "true" or "false"      what => "previous" or "next"    }  }}

The pattern should match what you believe to be an indicator that the fieldis part of a multi-line event.

The what must be previous or next and indicates the relationto the multi-line event.

The negate can be true or false (defaults to false). If true, amessage not matching the pattern will constitute a match of the multilinefilter and the what will be applied. (vice-versa is also true)

For example, Java stack traces are multiline and usually have the messagestarting at the far-left, with each subsequent line indented. Do this:

input {  stdin {    codec => multiline {      pattern => "^\s"      what => "previous"    }  }}

This says that any line starting with whitespace belongs to the previous line.

Another example is to merge lines not starting with a date up to the previousline..

input {  file {    path => "/var/log/someapp.log"    codec => multiline {      # Grok pattern names are valid! :)      pattern => "^%{TIMESTAMP_ISO8601} "      negate => true      what => "previous"    }  }}

This says that any line not starting with a timestamp should be merged with the previous line.

One more common example is C line continuations (backslash). Here’s how to do that:

input {  stdin {    codec => multiline {      pattern => "\\$"      what => "next"    }  }}

This says that any line ending with a backslash should be combined with thefollowing line.

Multiline Codec Configuration Options

Setting	Input type	Required
`auto_flush_interval`	number	No
`charset`	string, one of ["ASCII-8BIT", "UTF-8", "US-ASCII", "Big5", "Big5-HKSCS", "Big5-UAO", "CP949", "Emacs-Mule", "EUC-JP", "EUC-KR", "EUC-TW", "GB2312", "GB18030", "GBK", "ISO-8859-1", "ISO-8859-2", "ISO-8859-3", "ISO-8859-4", "ISO-8859-5", "ISO-8859-6", "ISO-8859-7", "ISO-8859-8", "ISO-8859-9", "ISO-8859-10", "ISO-8859-11", "ISO-8859-13", "ISO-8859-14", "ISO-8859-15", "ISO-8859-16", "KOI8-R", "KOI8-U", "Shift_JIS", "UTF-16BE", "UTF-16LE", "UTF-32BE", "UTF-32LE", "Windows-31J", "Windows-1250", "Windows-1251", "Windows-1252", "IBM437", "IBM737", "IBM775", "CP850", "IBM852", "CP852", "IBM855", "CP855", "IBM857", "IBM860", "IBM861", "IBM862", "IBM863", "IBM864", "IBM865", "IBM866", "IBM869", "Windows-1258", "GB1988", "macCentEuro", "macCroatian", "macCyrillic", "macGreek", "macIceland", "macRoman", "macRomania", "macThai", "macTurkish", "macUkraine", "CP950", "CP951", "IBM037", "stateless-ISO-2022-JP", "eucJP-ms", "CP51932", "EUC-JIS-2004", "GB12345", "ISO-2022-JP", "ISO-2022-JP-2", "CP50220", "CP50221", "Windows-1256", "Windows-1253", "Windows-1255", "Windows-1254", "TIS-620", "Windows-874", "Windows-1257", "MacJapanese", "UTF-7", "UTF8-MAC", "UTF-16", "UTF-32", "UTF8-DoCoMo", "SJIS-DoCoMo", "UTF8-KDDI", "SJIS-KDDI", "ISO-2022-JP-KDDI", "stateless-ISO-2022-JP-KDDI", "UTF8-SoftBank", "SJIS-SoftBank", "BINARY", "CP437", "CP737", "CP775", "IBM850", "CP857", "CP860", "CP861", "CP862", "CP863", "CP864", "CP865", "CP866", "CP869", "CP1258", "Big5-HKSCS:2008", "ebcdic-cp-us", "eucJP", "euc-jp-ms", "EUC-JISX0213", "eucKR", "eucTW", "EUC-CN", "eucCN", "CP936", "ISO2022-JP", "ISO2022-JP2", "ISO8859-1", "ISO8859-2", "ISO8859-3", "ISO8859-4", "ISO8859-5", "ISO8859-6", "CP1256", "ISO8859-7", "CP1253", "ISO8859-8", "CP1255", "ISO8859-9", "CP1254", "ISO8859-10", "ISO8859-11", "CP874", "ISO8859-13", "CP1257", "ISO8859-14", "ISO8859-15", "ISO8859-16", "CP878", "MacJapan", "ASCII", "ANSI_X3.4-1968", "646", "CP65000", "CP65001", "UTF-8-MAC", "UTF-8-HFS", "UCS-2BE", "UCS-4BE", "UCS-4LE", "CP932", "csWindows31J", "SJIS", "PCK", "CP1250", "CP1251", "CP1252", "external", "locale"]	No
`max_bytes`	bytes	No
`max_lines`	number	No
`multiline_tag`	string	No
`negate`	boolean	No
`pattern`	string	Yes
`patterns_dir`	array	No
`what`	string, one of `["previous", "next"]`	Yes

`auto_flush_interval`

Value type is number
There is no default value for this setting.

The accumulation of multiple lines will be converted to an event when either amatching new line is seen or there has been no new data appended for this manyseconds. No default. If unset, no auto_flush. Units: seconds

`charset`

Value can be any of: ASCII-8BIT, UTF-8, US-ASCII, Big5, Big5-HKSCS, Big5-UAO, CP949, Emacs-Mule, EUC-JP, EUC-KR, EUC-TW, GB2312, GB18030, GBK, ISO-8859-1, ISO-8859-2, ISO-8859-3, ISO-8859-4, ISO-8859-5, ISO-8859-6, ISO-8859-7, ISO-8859-8, ISO-8859-9, ISO-8859-10, ISO-8859-11, ISO-8859-13, ISO-8859-14, ISO-8859-15, ISO-8859-16, KOI8-R, KOI8-U, Shift_JIS, UTF-16BE, UTF-16LE, UTF-32BE, UTF-32LE, Windows-31J, Windows-1250, Windows-1251, Windows-1252, IBM437, IBM737, IBM775, CP850, IBM852, CP852, IBM855, CP855, IBM857, IBM860, IBM861, IBM862, IBM863, IBM864, IBM865, IBM866, IBM869, Windows-1258, GB1988, macCentEuro, macCroatian, macCyrillic, macGreek, macIceland, macRoman, macRomania, macThai, macTurkish, macUkraine, CP950, CP951, IBM037, stateless-ISO-2022-JP, eucJP-ms, CP51932, EUC-JIS-2004, GB12345, ISO-2022-JP, ISO-2022-JP-2, CP50220, CP50221, Windows-1256, Windows-1253, Windows-1255, Windows-1254, TIS-620, Windows-874, Windows-1257, MacJapanese, UTF-7, UTF8-MAC, UTF-16, UTF-32, UTF8-DoCoMo, SJIS-DoCoMo, UTF8-KDDI, SJIS-KDDI, ISO-2022-JP-KDDI, stateless-ISO-2022-JP-KDDI, UTF8-SoftBank, SJIS-SoftBank, BINARY, CP437, CP737, CP775, IBM850, CP857, CP860, CP861, CP862, CP863, CP864, CP865, CP866, CP869, CP1258, Big5-HKSCS:2008, ebcdic-cp-us, eucJP, euc-jp-ms, EUC-JISX0213, eucKR, eucTW, EUC-CN, eucCN, CP936, ISO2022-JP, ISO2022-JP2, ISO8859-1, ISO8859-2, ISO8859-3, ISO8859-4, ISO8859-5, ISO8859-6, CP1256, ISO8859-7, CP1253, ISO8859-8, CP1255, ISO8859-9, CP1254, ISO8859-10, ISO8859-11, CP874, ISO8859-13, CP1257, ISO8859-14, ISO8859-15, ISO8859-16, CP878, MacJapan, ASCII, ANSI_X3.4-1968, 646, CP65000, CP65001, UTF-8-MAC, UTF-8-HFS, UCS-2BE, UCS-4BE, UCS-4LE, CP932, csWindows31J, SJIS, PCK, CP1250, CP1251, CP1252, external, locale
Default value is "UTF-8"

The character encoding used in this input. Examples include UTF-8and cp1252

This setting is useful if your log files are in Latin-1 (aka cp1252)or in another character set other than UTF-8.

This only affects "plain" format logs since JSON is UTF-8 already.

`max_bytes`

Value type is bytes
Default value is "10 MiB"

The accumulation of events can make logstash exit with an out of memory errorif event boundaries are not correctly defined. This settings make sure to flushmultiline events after reaching a number of bytes, it is used in combinationmax_lines.

`max_lines`

Value type is number
Default value is 500

`multiline_tag`

Value type is string
Default value is "multiline"

Tag multiline events with a given tag. This tag will only be addedto events that actually have multiple lines in them.

`negate`

Value type is boolean
Default value is false

Negate the regexp pattern (if not matched).

`pattern`

This is a required setting.
Value type is string
There is no default value for this setting.

The regular expression to match.

`patterns_dir`

Value type is array
Default value is []

Logstash ships by default with a bunch of patterns, so you don’tnecessarily need to define this yourself unless you are adding additionalpatterns.

Pattern files are plain text with format:

NAME PATTERN

For example:

NUMBER \d+

`what`

This is a required setting.
Value can be any of: previous, next
There is no default value for this setting.

If the pattern matched, does event belong to the next or previous event?

« Msgpack codec plugin Netflow codec plugin »