/
Rubrik collector

Rubrik collector

Overview

Rubrik is a software-defined data management platform for physical, virtual and hybrid environments, that simplifies and unifies backup, data protection and instant recovery onsite, in the cloud and at the data centre.

Devo collector features

Feature

Details

Feature

Details

Allow parallel downloading (multipod)

not allowed

Running environments

collector server

Populated Devo events

table

Flattening preprocessing

no

Data sources

Data source

Description

API endpoint

Collector service name

Devo table

Available from release

Data source

Description

API endpoint

Collector service name

Devo table

Available from release

Audit Logs

Returns all corresponding the audit logs.

{tenant}.my.rubrik.com/api/graphql

audit_logs

cloud.rubrik.audit

v1.1.0

Events

Returns all the corresponding event logs.

{tenant}.my.rubrik.com/api/graphql

events

cloud.rubrik.events

v1.1.0

For more information on how the events are parsed, visit our page.

Flattening preprocessing

Data source

Collector service

Optional

Flattening details

Data source

Collector service

Optional

Flattening details

Audit Logs

audit_logs

yes

not required

Events

events

yes

not required

Minimum configuration required for basic pulling

Although this collector supports advanced configuration, the fields required to retrieve data with basic configuration are defined below.

This minimum configuration refers exclusively to those specific parameters of this integration. There are more required parameters related to the generic behavior of the collector. Check setting sections for details.

Setting

Details

Setting

Details

tenant

The tenant name to be added in the base url.

client_id

Client ID for authentication.

client_secret

Client Secret for authentication.

See the Accepted authentication methods section to verify what settings are required based on the desired authentication method.

Accepted authentication methods

Authentication method

Client Id

Client Secret

Base Url

Authentication method

Client Id

Client Secret

Base Url

Basic Authentication

REQUIRED

REQUIRED

REQUIRED

Run the collector

Once the data source is configured, you can either send us the required information if you want us to host and manage the collector for you (Cloud collector), or deploy and host the collector in your own machine using a Docker image (On-premise collector).

Collector services detail

This section is intended to explain how to proceed with specific actions for services.

Audit logs

Used Graphql Query

query AuditLogListQuery( $after: String $first: Int $filters: UserAuditFilter $sortBy: UserAuditSortField $sortOrder: SortOrder ) { userAuditConnection( after: $after first: $first filters: $filters sortBy: $sortBy sortOrder: $sortOrder ) { edges { node { userNote userName id message time severity status cluster { id name timezone __typename } orgId orgName __typename } cursor __typename } pageInfo { endCursor hasNextPage hasPreviousPage __typename } __typename } }

Events

Used Graphql Query

query EventSeriesListQuery( $after: String $filters: ActivitySeriesFilter $first: Int $sortBy: ActivitySeriesSortField $sortOrder: SortOrder ) { activitySeriesConnection( after: $after first: $first filters: $filters sortBy: $sortBy sortOrder: $sortOrder ) { edges { cursor node { ...EventSeriesFragment cluster { id name status timezone __typename } activityConnection(first: 1) { nodes { id message __typename } __typename } __typename } __typename } pageInfo { endCursor hasNextPage hasPreviousPage __typename } __typename } } fragment EventSeriesFragment on ActivitySeries { id fid activitySeriesId lastUpdated lastActivityType lastActivityStatus objectId objectName objectType severity progress isCancelable isPolarisEventSeries location effectiveThroughput dataTransferred logicalSize organizations { id name __typename } clusterUuid clusterName __typename }

This collector uses persistent storage to download events in an orderly fashion and avoid duplicates. In case you want to re-ingest historical data or recreate the persistence, you can restart the persistence of this collector by following these steps:

  1. Edit the configuration file.

  2. Change the value of the start_time_value parameter to a different one.

  3. Save the changes.

  4. Restart the collector.

The collector will detect this change and will restart the persistence using the parameters of the configuration file or the default configuration in case it has not been provided.

Note that this action clears the persistence and cannot be recovered in any way. Resetting persistence could result in duplicate or lost events.

This collector has different security layers that detect both an invalid configuration and abnormal operation. This table will help you detect and resolve the most common errors.

Error type

Error ID

Error message

Cause

Solution

Error type

Error ID

Error message

Cause

Solution

ApiError

496

Connection Timeout Error - Verify that endpoint and URL parameters are correct.Exception: {exception}

Some exception occurred while making the API request.

Reach out to the developer with the exact error message.

 

495

Failed to parse {resp_type} object from response: {res.content}.Exception: {exception}

Some exception occurred while making the API request.

Reach out to the developer with the exact error message.

 

498

Max Retries Error - Request attempts with {retries} retries failed. \n{reason}

Some exception occurred while making the API request.

Reach out to the developer with the exact error message.

 

497

Connection Error - Ensure the URL parameters are correct and that host has access to server.\n Exception: {exception}

Some exception occurred while making the API request.

Reach out to the developer with the exact error message.

Collector operations

This section is intended to explain how to proceed with specific operations of this collector.

Initialization

The initialization module is in charge of setup and running the input (pulling logic) and output (delivering logic) services and validating the given configuration.

A successful run has the following output messages for the initializer module:

INFO MainThread -> (CollectorMultithreadingQueue) standard_queue_multithreading -> max_size_in_messages: 10000, max_size_in_mb: 1024, max_wrap_size_in_items: 100 WARNING MainThread -> [INTERNAL LOGIC] DevoSender::_validate_kwargs_for_method__init__ -> The <address> does not appear to be an IP address and cannot be verified: collector-us.devo.io WARNING MainThread -> [OUTPUT] OutputLookupSenders -> <threshold_for_using_gzip_in_transport_layer> setting has been modified from 1.1 to 1.0 due to this configuration increases the Lookup sender performance. WARNING MainThread -> [INTERNAL LOGIC] DevoSender::_validate_kwargs_for_method__init__ -> The <address> does not appear to be an IP address and cannot be verified: collector-us.devo.io INFO MainThread -> [OUTPUT] OutputMultithreadingController(threatquotient_collector) -> Starting thread INFO MainThread -> [OUTPUT] DevoSender(standard_senders,devo_sender_0) -> Starting thread INFO MainThread -> [OUTPUT] DevoSenderManagerMonitor(standard_senders,devo_1) -> Starting thread (every 600 seconds) INFO MainThread -> [OUTPUT] DevoSenderManager(standard_senders,manager,devo_1)(devo_1) -> Starting thread INFO MainThread -> [OUTPUT] DevoSender(lookup_senders,devo_sender_0) -> Starting thread INFO MainThread -> [OUTPUT] DevoSenderManagerMonitor(lookup_senders,devo_1) -> Starting thread (every 600 seconds) INFO MainThread -> [OUTPUT] DevoSenderManager(lookup_senders,manager,devo_1)(devo_1) -> Starting thread INFO MainThread -> InitVariables Started INFO MainThread -> start_time_value initialized INFO MainThread -> verify_host_ssl_cert initialized INFO MainThread -> event_fetch_limit_in_items initialized INFO MainThread -> InitVariables Terminated INFO MainThread -> [INPUT] InputMultithreadingController(threatquotient_collector) - Starting thread (executing_period=300s) INFO MainThread -> [INPUT] InputThread(threatquotient_collector,threatquotient_data_puller#111) - Starting thread (execution_period=600s) INFO MainThread -> [INPUT] ServiceThread(threatquotient_collector,threatquotient_data_puller#111,events#predefined) - Starting thread (execution_period=600s) INFO MainThread -> [SETUP] ThreatQuotientDataPullerSetup(threatquotient_collector,threatquotient_data_puller#111,events#predefined) - Starting thread INFO MainThread -> [INPUT] ThreatQuotientDataPuller(threatquotient_collector,threatquotient_data_puller#111,events#predefined) - Starting thread

Events delivery and Devo ingestion

The event delivery module is in charge of receiving the events from the internal queues where all events are injected by the pullers and delivering them using the selected compatible delivery method.

A successful run has the following output messages for the initializer module:

Sender services

The Integrations Factory Collector SDK has 3 different senders services depending on the event type to delivery (internal, standard, and lookup). This collector uses the following Sender Services:

Sender services

Description

Sender services

Description

internal_senders

In charge of delivering internal metrics to Devo such as logging traces or metrics.

standard_senders

In charge of delivering pulled events to Devo.

Sender statistics

Each service displays its own performance statistics that allow checking how many events have been delivered to Devo by type:

Logging trace

Description

Logging trace

Description

Number of available senders: 1

Displays the number of concurrent senders available for the given Sender Service.

sender manager internal queue size: 0

Displays the items available in the internal sender queue.

Total number of messages sent: 44, messages sent since "2022-06-28 10:39:22.511671+00:00": 21 (elapsed 0.007 seconds)

Displays the number of events from the last time and following the given example, the following conclusions can be obtained:

  • 44 events were sent to Devo since the collector started.

  • The last checkpoint timestamp was 2022-06-28 10:39:22.511671+00:00.

  • 21 events where sent to Devo between the last UTC checkpoint and now.

  • Those 21 events required 0.007 seconds to be delivered.

To check the memory usage of this collector, look for the following log records in the collector which are displayed every 5 minutes by default, always after running the memory-free process.

  • The used memory is displayed by running processes and the sum of both values will give the total used memory for the collector.

  • The global pressure of the available memory is displayed in the global value.

  • All metrics (Global, RSS, VMS) include the value before freeing and after previous -> after freeing memory

Sometimes it is necessary to activate the debug mode of the collector's logging. This debug mode increases the verbosity of the log and allows you to print execution traces that are very helpful in resolving incidents or detecting bottlenecks in heavy download processes.

  • To enable this option you just need to edit the configuration file and change the debug_status parameter from false to true and restart the collector.

  • To disable this option, you just need to update the configuration file and change the debug_status parameter from true to false and restart the collector.

For more information, visit the configuration and parameterization section corresponding to the chosen deployment mode.

Change log

Release

Released on

Release type

Details

Recommendations

Release

Released on

Release type

Details

Recommendations

v1.3.1

Sep 4, 2024

IMPROVEMENTS

Improvements

  • Changed the devo tag for the events service to handle latest json response.

Recommended version

v1.3.0

Mar 6, 2024

IMPROVEMENTS

Improvements

  • Better error reporting for access token generation exceptions (e.g. client ID does not exist, IPs must be whitelisted, etc.)

  • Updated DCSDK to 1.12.4.

Update

v1.2.0

Mar 6, 2024

IMPROVEMENTS

Improvements

  • Updated page size to 1000 to increase performance.

  • Added `override_filters_object_types` to allow users to fetch specific object types from the API for the `events` service (default is to fetch all object types).

  • Updated `DCSDK` to 1.12.3.

Update

v1.1.2

Mar 6, 2024

BUG FIX

  • Fixed the regex check for Devo tag in the schemas.

Update

v1.1.1

Mar 3, 2024

BUG FIX

  • Fixed the `requirements.txt`

Update

v1.1.0

Feb 28, 2024

FIRST RELEASE

Released the first version of the Rubrik collector.

Initial version