FreeBSD Manual Pages
IPFIXCOL2-JSON-OUTPUT(7) IPFIXcol collector IPFIXCOL2-JSON-OUTPUT(7) NAME ipfixcol2-json-output - JSON (output plugin) DESCRIPTION Special version of JSON output plugin for Apache Kafka. The plugin is copy of the original plugin so it offers the same formatting op- tions, however, it provides only Kafka output. See "Kafka notes" section below for more information. The plugin converts IPFIX flow records into JSON format and sends them to Apache Kafka. Some elements (timestamps, IP/MAC addresses, TCP flags, etc.) can be converted into human readable format. Others are represented as plain numbers. Each flow record is converted individually and send to outputs where at least one output method must be configured. All fields in an IPFIX record are formatted as: <field name>:<value>, where the field name represents a name of a corresponding Information Element as "<Enter- prise Name>:<Information Element Name>" and value is a number, string, boolean or null (if conversion from IPFIX to JSON fails or it is a zero-length field). The Enterprise Name is a name of an organization that manages the field, such as IANA, CISCO, etc. If the field name and type is unknown and "ignoreUnknown" option is disabled, the field name use format 'enXX:idYY' where XX is an Enter- prise Number and YY is an Information Element ID. See notes for in- structions on how to add missing definitions of Information Elements. Flow records with structured data types (basicList, subTemplateList and subTemplateMultiList) are supported. See the particular section below for information about the formatting of these types. EXAMPLE OUTPUT The following output is for illustration purpose only. Available IPFIX fields depend on a structure of records that your exporter sends. See documentation of the exporter, if you want to change available IPFIX fields. { "@type": "ipfix.entry", "iana:octetDeltaCount": 3568720, "iana:packetDeltaCount": 2383, "iana:flowStartMilliseconds": "2018-05-11T19:44:29.006Z", "iana:flowEndMilliseconds": "2018-05-11T19:44:30.364Z", "iana:ingressInterface": 1, "iana:ipVersion": 4, "iana:sourceIPv4Address": "192.168.0.100", "iana:destinationIPv4Address": "192.168.0.101", "iana:ipClassOfService": 0, "iana:ipTTL": 62, "iana:protocolIdentifier": "TCP", "iana:tcpControlBits": ".AP.S.", "iana:sourceTransportPort": 47200, "iana:destinationTransportPort": 20810, "iana:egressInterface": 0, "iana:samplingInterval": 0, "iana:samplingAlgorithm": 0 } The plugin can produce multiple record types, which are always distin- guished by "@type" key: ipfix.entry Flow record captured by the exporter as shown in the example above. This record is always converted and it is the most common type. ipfix.optionsEntry Record containing additional information provided by the ex- porter to the collector. Usually it is used for information about the exporter configuration (such as packet sampling) and flow export statistics. Periodicity and content of the record always depends on the particular exporter. By default, record conversion is disabled, see ignoreOptions parameter in the con- figuration section. ipfix.template and ipfix.optionsTemplate Records describing structure of flow records (i.e. ipfix.entry) and options records (i.e. ipfix.optionsEntry), respectively. These types are usually useful only for exporter analytics. Pe- riodicity of these records usually depends on the exporter and used transport protocol. For example, for connection over UDP it corresponds to "template refresh interval" configured on the ex- porter. However, for TCP and SCTP sessions, the template records are usually sent only once immediately after session start. By default, template record conversion is disabled, see template- Info parameter in the configuration section. EXAMPLE CONFIGURATION Below you see a complex configuration with all available output options enabled. Don't forget to remove (or comment) outputs that you don't want to use! <output> <name>JSON output</name> <plugin>json-kafka</plugin> <params> <tcpFlags>formatted</tcpFlags> <timestamp>formatted</timestamp> <protocol>formatted</protocol> <ignoreUnknown>true</ignoreUnknown> <ignoreOptions>true</ignoreOptions> <nonPrintableChar>true</nonPrintableChar> <octetArrayAsUint>true</octetArrayAsUint> <numericNames>false</numericNames> <splitBiflow>false</splitBiflow> <detailedInfo>false</detailedInfo> <templateInfo>false</templateInfo> <outputs> <kafka> <name>Send to Kafka</name> <brokers>127.0.0.1</brokers> <topic>ipfix</topic> <blocking>false</blocking> <partition>unassigned</partition> <!-- Zero or more additional properties --> <property> <key>compression.codec</key> <value>lz4</value> </property> </kafka> </outputs> </params> </output> PARAMETERS Formatting parameters: tcpFlags Convert TCP flags to common textual representation (formatted, e.g. ".A..S.") or to a number (raw). [values: formatted/raw, de- fault: formatted] timestamp Convert timestamp to ISO 8601 textual representation (all time- stamps in UTC and milliseconds, e.g. "2018-01-22T09:29:57.828Z") or to a unix timestamp (all timestamps in milliseconds). [val- ues: formatted/unix, default: formatted] protocol Convert protocol identification to formatted style (e.g. instead 6 writes "TCP") or to a number. [values: formatted/raw, de- fault: formatted] ignoreUnknown Skip unknown Information Elements (i.e. record fields with un- known name and data type). If disabled, data of unknown ele- ments are formatted as unsigned integer or hexadecimal values. For more information, see octetArrayAsUint option. [values: true/false, default: true] ignoreOptions Skip non-flow records used for reporting metadata about IPFIX Exporting and Metering Processes (i.e. records described by Op- tions Templates). [values: true/false, default: true] nonPrintableChar Ignore non-printable characters (newline, tab, control charac- ters, etc.) in IPFIX strings. If disabled, these characters are escaped on output. [values: true/false, default: true] octetArrayAsUint Converter each IPFIX field with octetArray type (including IPFIX fields with unknown definitions) as unsigned integer if the size of the field is less or equal to 8 bytes. Fields with the size above the limit are always converted as string representing hexadecimal value, which is typically in network byte order (e.g. "0x71E1"). Keep on mind, that there might be octetArray fields with variable length that might be interpreted differ- ently based on their size. If disabled, octetArray fields are never interpreted as unsigned integers. [values: true/false, default: true] numericNames Use only short identification of Information Elements (i.e. "enXX:idYY"). If enabled, the short version is used even if the definition of the field is known. This option can help to create a shorter JSON records with key identifiers which are indepen- dent on the internal configuration. [values: true/false, de- fault: false] splitBiflow In case of Biflow records, split the record to two unidirec- tional flow records. Non-biflow records are unaffected. [values: true/false, default: false] detailedInfo Add additional information about the IPFIX message (such as ex- port time, Observation Domain ID, IP address of the exporter, etc.) to which each record belongs. Additional fields starts with "ipfix:" prefix. [values: true/false, default: false] templateInfo Convert Template and Options Template records. See the particu- lar section below for information about the formatting of these records. [values: true/false, default: false] ---- Output types: At least one output must be configured. Multiple kafka outputs can be used at the same time if the outputs are not in colli- sion with each other. kafka Send data to Kafka i.e. Kafka producer. name Identification name of the output. Used only for read- ability. brokers Initial list of brokers as a CSV list of broker "host" or "host:port". topic Kafka topic to produce to. partition Partition number to produce to. If the value is "unas- signed", then the default random distribution is used. [default: "unassigned"] brokerVersion Older broker versions (before 0.10.0) provide no way for a client to query for supported protocol features making it impossible for the client to know what features it may use. As a workaround a user may set this property to the expected broker version and the client will automatically adjust its feature set. [default: <empty>] blocking Enable blocking on produce. If disabled and a cluster is down or not able to retrieve records fast enough, some flow records may be dropped. On the other hand, if en- abled, no records are dropped. However, if the cluster is slow or not accessible at all, the plugin waits (i.e. blocks) until data are send. This can significantly slow down or block(!) the whole collector and other output plugins [true/false, default: false] performanceTuning By default, the connection provided by librdkafka is not optimized for high throughput required for transport of JSON records. This option adds optional library parame- ters, which reduces messaging overhead and significantly improves throughput. In particular, Kafka message capac- ity is increased and maximal buffering interval is pro- longed. These options can be overwritten by user defined properties. [true/false, default: true] property Additional configuration properties of librdkafka library as key/value pairs. Multiple <property> parameters, which can improve performance, can be defined. See the project website for the full list of supported options. Keep on mind that some options might not be available in all versions of the library. NOTES If one or more Information Element definitions are missing, you can easily add them. All definitions are provided by libfds <https://github.com/CESNET/libfds/> library. See the project website for help. If a flow record contains multiple occurrences of the same Information Element, their values are stored into a single name/value pair as JSON array. Order of the values in the array corresponds to their order in the flow record. For higher performance, it is advisable to use non-formatted conversion of IPFIX data types. In that case, you should prefer, for example, timestamps as numbers over ISO 8601 strings and numeric identifiers of fields as they are usually shorted. STRUCTURED DATA TYPES Flow records can be extended with structured data types (as described in RFC6313). Each of these types are formatted as JSON objects with "@type" field which helps to distinguish its formatting. Moreover, as the standard describes, the semantic of the list is also included. Converted basicList contains "fieldID" with the Information Element identifier of zero or more Information Element(s) contained in the list. All values are stored as a JSON array in "data" field. { "example:blField": { "@type": "basicList", "semantic": "anyOf", "fieldID": "iana:octetDeltaCount", "data": [23, 34, 23] } } Converted subTemplateList contains only additional "data" field with array of zero or more JSON objects. As all nested JSON object are de- scribed by the same IPFIX Template, it's guaranteed the their structure is also the same. The "semantic" field indicates the relationship among the different JSON objects. { "example:stlField": { "@type": "subTemplateList", "semantic": "allOf", "data": [ {"keyA.1": "value1", "keyA.2": "value2"}, {"keyB.1": "value1", "keyB.2": "value2"}, ] } } Converted subTemplateMultiList is similar to the previous type, how- ever, sub-records can be even more nested. The "data" field contains a JSON array with zero or more nested JSON arrays. Each nested array contains zero or more JSON objects and it's guaranteed that JSON ob- jects in the same array have the same structure. The "semantic" field indicates top-level relationship among the nested arrays. { "example:stmlField": { "@type": "subTemplateMultiList", "semantic": "allOf", "data" : [ [ {"keyA.1": "value1", "keyA.2": "value2"}, {"keyB.1": "value1", "keyB.2": "value2"}, ], [ {"idA.1": "something", "idB.1": 123} ] ] } } Keep on mind that all structures can be nested in each other. TEMPLATE AND OPTIONS TEMPLATE RECORDS These records are converted only if <templateInfo> option is enabled. Template record describes structure of flow records, and its "@type" is "ipfix.template". Converted Template record contains "ipfix:tem- plateId" field, which is unique to the Transport Session and Observa- tion Domain, and "ipfix:fields", which is an array of JSON objects specifying fields of flow records. { "@type": "ipfix.template", "ipfix:templateId": 49171, "ipfix:fields": [ { "ipfix:elementId": 16399, "ipfix:enterpriseId": 6871, "ipfix:fieldLength": 1 }, { "ipfix:elementId": 184, "ipfix:enterpriseId": 29305, "ipfix:fieldLength": 4 }, { ... } ] } Options Template record describes structure of additional information for the collector, such as sampling configuration, export statistics, etc. The description is converted to a record with "@type" equal to "ipfix.optionsEntry". Converted Options Template record is similar to the previous type, however, it contains additional "ipfix:scopeCount" field, which specifies number of scope fields in the record. A scope count of N specifies that the first N field specifiers are scope fields. { "@type": "ipfix.optionsTemplate", "ipfix:templateId": 53252, "ipfix:scopeCount": 1, "ipfix:fields": [ { "ipfix:elementId": 130, "ipfix:enterpriseId": 0, "ipfix:fieldLength": 4 }, { "ipfix:elementId": 103, "ipfix:enterpriseId": 6871, "ipfix:fieldLength": 4 }, { ... } ] } KAFKA NOTES The plugin depends on librdkafka which unfortunately redefines some common symbols (e.g. thrd_create) when built without support for C11 threads. This is usually not a problem, however, expected return codes of thread functions differs from common glibc/musl implementation. As the JSON plugin and librdkafka library are loaded after collector start, the common version of symbols are used and the redefined ones are ignored. Later when librdkafka tries to create its internal pro- cessing threads, it fails as thrd_create() returned unexpected return code. This issue can be resolved by using RTLD_DEEPBIND flag of dlopen() function when loading the plugin. However, this flag is a GNU extension, which is not supported by all C libraries, and it can also break some other standard functions (e.g. C++ iostream). Affected versions of the library are available in official package repositories of multiple distributions. Therefore, we created this spe- cial version of JSON output which instructs the collector core to use the RTLD_DEEPBIND flag. This issue should be fixes since librdkafka v1.4.2. For more informa- tion see librdkafka issue #2681 on Github. AUTHOR Luk Hutk (lukas.hutak@cesnet.cz), Petr Velan (petr.velan@cesnet.cz) COPYRIGHT Copyright 2020 CESNET, z.s.p.o. 2.2 2020-07-10 IPFIXCOL2-JSON-OUTPUT(7)
NAME | DESCRIPTION | EXAMPLE OUTPUT | EXAMPLE CONFIGURATION | PARAMETERS | NOTES | STRUCTURED DATA TYPES | TEMPLATE AND OPTIONS TEMPLATE RECORDS | KAFKA NOTES | AUTHOR | COPYRIGHT
Want to link to this manual page? Use this URL:
<https://man.freebsd.org/cgi/man.cgi?query=ipfixcol2-json-kafka-output&sektion=7&manpath=FreeBSD+Ports+14.3.quarterly>
