Apache Kafka data input configuration fields
Summarize
Summary of Apache Kafka data input configuration fields
This guide outlines the configuration fields for setting up Apache Kafka data inputs in ServiceNow's Health Log Analytics application. Proper configuration allows efficient log data streaming from Apache Kafka into the ServiceNow instance, enhancing data analysis and operational insights.
Show less
Key Features
- Name: Required field for naming the new data input.
- Description: Field to describe the purpose of the data input.
- Execute on: Option to select a specific MID Server or MID Server cluster for log data ingestion.
- MID Server: Required when using a specific MID Server; supports basic authentication only.
- MID Server Cluster: Required when using a specific cluster, supporting failover configurations.
- Application service: Required field to bind log data to an application service, which must be operational.
- Query Settings: Includes fields for defining the starting date and time for log data reading.
- Transport: Fields for specifying Kafka node names, topics, credentials, and group IDs.
- Advanced configuration: Allows for timeout settings, character encoding, and event batching options.
Key Outcomes
By effectively configuring the Apache Kafka data input, customers can:
- Stream log data seamlessly into ServiceNow for improved monitoring and analytics.
- Utilize failover capabilities to ensure continuous log data processing in case of MID Server failures.
- Set parameters that optimize log ingestion performance, reducing system congestion.
Successful setup and configuration of these fields will enhance the operational efficiency of log management within the ServiceNow platform.
Description of the fields on the Apache Kafka data input configuration form.
Basic configuration
| Field | Description |
|---|---|
| Name | Name of the new data input. This field is required. |
| Description | Description of the data input. |
| Execute on | Option to determine whether to use a specific MID Server or a MID Server cluster. This feature is supported in the Health Log Analytics application, Version 26.0.17 - February 2023 and later, available from the ServiceNow Store. |
| MID |
(Only when the Execute on field is set to Specific MID Server) MID Server to which log data from Apache Kafka is pulled.Note: This field is required.
|
| MID Server Cluster | (Only when the Execute on field is set to Specific MID Server Cluster) The MID Server cluster to which the log data is pulled.The data input runs on a single MID Server in the cluster until that MID Server fails. The system then moves all the data input tasks to the next available MID Server in the cluster according to the configured order. This feature is supported in the Health Log Analytics application, Version 26.0.17 - February 2023 and later, available from the ServiceNow Store. Note: For more information about MID Server clusters, see Configure a MID Server cluster.
This field is required. |
| Application service | The application service to which to bind the log data. Note: This field is required.If no relevant
application service exists, Create an application service and add CIs to it. Set the status of the new application service to
Operational. |
| Field | Description |
|---|---|
| Status | Status of the data input. |
| Transport | Protocol used to stream the log data. This data input uses Apache Kafka to stream log data to your instance. |
| Sources count | The number of log sources this data input has created. |
| Disabled since | The time when the data input stopped or failed. |
| Last log time | The time when the last log streamed in the data input. |
| Error message | The streaming error. This field is populated automatically. It displays only when a streaming error has occurred. |
| Field | Description | Example |
|---|---|---|
| From | Starting date and time for reading the data. Data older than this date and
time is not read. Note:
Setting this value to a past date might require the system
to read large amounts of data, causing congestion. This field is required. |
Now -1 week |
| Field | Description | Example |
|---|---|---|
| Kafka node names | A comma-separated list in the format HOST:PORT,HOST:PORT. The list does not have to include all the Apache Kafka Cluster servers. | 123.4.5.6:9092,123.3.4.5:9093 |
| Topics | A comma-separated list of topics to which the data input must
subscribe. This field is required. |
FirstTopic,SecondTopic,ThirdTopic |
| Kafka credentials | Reference to the Apache Kafka credentials. You can display the Kafka SSL credentials form by selecting the information icon (
For a description of the fields on the Kafka SSL credentials form, see Kafka SSL credentials fields. |
None |
| Group Id | The name of the Apache Kafka Consumer Group. | logs |
Advanced configuration
| Field | Description | Default value |
|---|---|---|
| Timeout | The time, in milliseconds, spent waiting in the poll if data is not available in the topics. | 500 |
| Node discovery timeout | The time, in milliseconds, before node discovery times out. | 30 |
| Default timezone | The default timezone if the log doesn't include timezone information. | GMT |
| Sub sample drop ratio | The number of events to batch together, out of which one will be discarded. This setting is used to reduce the number of fetched events. | -1 |
| Sub sample receive ratio | The number of events to batch together, out of which all but one will be discarded. This setting is used to decrease the number of received events. | -1 |
| Max length in bytes | The maximum length, in bytes, of events. | 32766 |
| Character encoding | The character encoding for this data input. | UTF-8 |
| Drop if queue is full | Option for selecting to discard logs if there is a load on the MID Server. | False |