Skip to end of metadata
Go to start of metadata

You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 3 Current »

Overview

Workday is a service that automates the sourcing, aggregation, normalization, and data management of security data across your organization into a security data lake stored in your account. A security data lake helps make your organization’s security data broadly accessible to your preferred security analytics solutions to power use cases such as threat detection, investigation, and incident response.

Workday has adopted the Open Cybersecurity Schema Framework (OCSF), an open standard. With OCSF support, the service can normalize and combine security data from AWS and a broad range of enterprise security data sources.

Devo collector features

Feature

Details

Allow parallel downloading (multipod)

not allowed

Running environments

  • collector server

  • on-premise

Populated Devo events

table

Flattening preprocessing

no

Allowed source events obfuscation

yes

Data sources

Data source

Description

API endpoint

Collector service name

Devo table

Available from release

Workday

Provides the ability to view log entries in the User Activity report data source

/activityLogging

activity_logging

app.workday.user_activity.activity

v1.0.0

For more information on how the events are parsed, visit our page.

Minimum configuration required for basic pulling

Although this collector supports advanced configuration, the fields required to retrieve data with basic configuration are defined below.

This minimum configuration refers exclusively to those specific parameters of this integration. There are more required parameters related to the generic behavior of the collector. Check setting sections for details.

Setting

Details

client_id

Client ID

client_secret

Client Secret

refresh_token

The refresh token

base_url

The base URL

Please note that the tenant is used as part of the API URL. For instance, in the URL https://wd5-services1.myworkday.com/ccx/api/v1/acme, the base API URL is https://wd5-services1.myworkday.com and the tenant is “acme.”

tenant

The Workday tenant

Please note that the tenant is used as part of the API URL. For instance, in the URL https://wd5-services1.myworkday.com/ccx/api/v1/acme, the base API URL is https://wd5-services1.myworkday.com and the tenant is “acme.”

See the Accepted authentication methods section to verify what settings are required based on the desired authentication method.

Accepted authentication methods

Authentication method

Client ID

Client Secret

client_id / client_secret

Required

Required

Run the collector

Once the data source is configured, you can either send us the required information if you want us to host and manage the collector for you (Cloud collector), or deploy and host the collector in your own machine using a Docker image (On-premise collector).

Collector services detail

This section is intended to explain how to proceed with specific actions for services.

Events service

 Verify data collection

Internal process and deduplication method

All Workday events are fetched via the activityLogging  endpoint: {base_url}/ccx/api/privacy/v1/{tenant}/activityLogging

The collector continually pulls new events since the last recorded timestamp. A unique hash value is computed for each event and used for deduplication purposes to ensure events are not fetched multiple times in subsequent pulls.

Devo categorization and destination

All events of this service are ingested into the table app.workday.user_activity.activity.

Setup output

A successful run has the following output messages for the setup module:

2023-06-19T00:22:09.263    INFO InputProcess::MainThread -> WorkdayPullerSetup(unknown,workday#10001,activity_logging#predefined) -> Starting thread
2023-06-19T00:22:09.264    INFO InputProcess::MainThread -> WorkdayPuller(workday,10001,activity_logging,predefined) - Starting thread
2023-06-19T00:22:09.264 WARNING InputProcess::WorkdayPuller(workday,10001,activity_logging,predefined) -> Waiting until setup will be executed
2023-06-19T00:22:09.651 WARNING InputProcess::WorkdayPullerSetup(unknown,workday#10001,activity_logging#predefined) -> Testing fetch from https://wd5-services1.myworkday.com/ccx/api/privacy/v1/acme/activityLogging.
2023-06-19T00:22:10.067    INFO InputProcess::WorkdayPullerSetup(unknown,workday#10001,activity_logging#predefined) -> Successfully tested fetch from https://wd5-services1.myworkday.com/ccx/api/privacy/v1/acme/activityLogging. Source is pullable.
2023-06-19T00:22:10.068    INFO InputProcess::WorkdayPullerSetup(unknown,workday#10001,activity_logging#predefined) -> Setup for module <WorkdayPuller> has been successfully executed

Puller output

2023-06-19T00:28:13.500    INFO InputProcess::WorkdayPuller(workday,10001,activity_logging,predefined) -> WorkdayPuller(workday,10001,activity_logging,predefined) Starting the execution of pre_pull()
2023-06-19T00:28:13.501    INFO InputProcess::WorkdayPuller(workday,10001,activity_logging,predefined) -> Reading persisted data
2023-06-19T00:28:13.502    INFO InputProcess::WorkdayPuller(workday,10001,activity_logging,predefined) -> Data retrieved from the persistence: {'@persistence_version': 1, 'start_time_in_utc': '2023-06-19T04:23:01Z', 'last_event_time_in_utc': '2023-06-19T04:26:52Z', 'last_ids': ['60ff86dc074ba6762adc31bb7e1cf9de1bb739fef8e6f3138f1220ce39f98261', '5137ae41a8ed84eda7d1b3daf077022e6f138829315961743be50f210971ff56', '9f5c18ae1506a54755a559b2806f61bf234fd4489fcde694804f4b7d51c70ee0'], 'next_offset': 0}
2023-06-19T00:28:13.503    INFO InputProcess::WorkdayPuller(workday,10001,activity_logging,predefined) -> Running the persistence upgrade steps
2023-06-19T00:28:13.503    INFO InputProcess::WorkdayPuller(workday,10001,activity_logging,predefined) -> Running the persistence corrections steps
2023-06-19T00:28:13.503    INFO InputProcess::WorkdayPuller(workday,10001,activity_logging,predefined) -> Running the persistence corrections steps
2023-06-19T00:28:13.504    INFO InputProcess::WorkdayPuller(workday,10001,activity_logging,predefined) -> No changes were detected in the persistence
2023-06-19T00:28:13.504    INFO InputProcess::WorkdayPuller(workday,10001,activity_logging,predefined) -> WorkdayPuller(workday,10001,activity_logging,predefined) Finalizing the execution of pre_pull()
2023-06-19T00:28:13.504    INFO InputProcess::WorkdayPuller(workday,10001,activity_logging,predefined) -> Starting data collection every 60 seconds
2023-06-19T00:28:13.505    INFO InputProcess::WorkdayPuller(workday,10001,activity_logging,predefined) -> Pull Started
2023-06-19T00:28:15.338    INFO InputProcess::WorkdayPuller(workday,10001,activity_logging,predefined) -> Fetching all activity logs subject to the following parameters: {'from': '2023-06-19T04:26:52Z', 'to': '2023-06-19T04:28:13Z', 'limit': 1000, 'offset': 0}
2023-06-19T00:28:15.506    INFO InputProcess::WorkdayPuller(workday,10001,activity_logging,predefined) -> 104 record(s) returned for last batch which is less than limit of 1000. No new records remaining. Finishing pull.
2023-06-19T00:28:15.507    INFO InputProcess::WorkdayPuller(workday,10001,activity_logging,predefined) -> (Partial) Statistics for this pull cycle (@devo_pulling_id=1687148893500):Number of requests made: 1; Number of events received: 104; Number of duplicated events filtered out: 3; Number of events generated and sent: 101; Average of events per second: 50.447.
2023-06-19T00:28:15.507    INFO InputProcess::WorkdayPuller(workday,10001,activity_logging,predefined) -> Statistics for this pull cycle (@devo_pulling_id=1687148893500):Number of requests made: 1; Number of events received: 104; Number of duplicated events filtered out: 3; Number of events generated and sent: 101; Average of events per second: 50.442.
2023-06-19T00:28:15.507    INFO InputProcess::WorkdayPuller(workday,10001,activity_logging,predefined) -> The data is up to date!
2023-06-19T00:28:15.508    INFO InputProcess::WorkdayPuller(workday,10001,activity_logging,predefined) -> Data collection completed. Elapsed time: 2.007 seconds. Waiting for 57.993 second(s) until the next one

After a successful collector’s execution (that is, no error logs found), you will see the following log message:

2023-06-19T00:28:15.507    INFO InputProcess::WorkdayPuller(workday,10001,activity_logging,predefined) -> Statistics for this pull cycle (@devo_pulling_id=1687148893500):Number of requests made: 1; Number of events received: 104; Number of duplicated events filtered out: 3; Number of events generated and sent: 101; Average of events per second: 50.442.
2023-06-19T00:28:15.507    INFO InputProcess::WorkdayPuller(workday,10001,activity_logging,predefined) -> The data is up to date!
2023-06-19T00:28:15.508    INFO InputProcess::WorkdayPuller(workday,10001,activity_logging,predefined) -> Data collection completed. Elapsed time: 2.007 seconds. Waiting for 57.993 second(s) until the next one
 Restart the persistence

This collector uses persistent storage to download events in an orderly fashion and avoid duplicates. In case you want to re-ingest historical data or recreate the persistence, you can restart the persistence of this collector by following these steps:

  1. Edit the configuration file.

  2. Change the value of the reset_persistence_auth parameter to a different one.

  3. Save the changes.

  4. Restart the collector.

The collector will detect this change and will restart the persistence using the parameters of the configuration file or the default configuration in case it has not been provided.

Note that this action clears the persistence and cannot be recovered in any way. Resetting persistence could result in duplicate or lost events.

 Troubleshooting

This collector has different security layers that detect both an invalid configuration and abnormal operation. This table will help you detect and resolve the most common errors.

Error Type

Error Id

Error Message

Cause

Solution

InitVariablesError

1

Invalid start_time_in_utc: {ini_start_str}. Must be in parseable datetime format.

The configured start_time_in_utc parameter is a non-parseable format.

Update the start_time_in_utc value to have the recommended format as indicated in the guide.

InitVariablesError

2

Invalid start_time_in_utc: {ini_start_str}. Must be in the past..

The configured start_time_in_utc parameter is a future date.

Update the start_time_in_utc value to a past datetime.

SetupError

101

Failed to fetch OAuth token from {token_endpoint}. Exception: {e}.

The provided credentials, base URL, and/or token endpoint is incorrect.

Revisit the configuration steps and ensure that the correct values were specified in the config file.

SetupError

102

Failed to fetch data from {endpoint}. Source is not pullable.

The provided credentials, base URL, and/or token endpoint is incorrect.

Revisit the configuration steps and ensure that the correct values were specified in the config file.

ApiError

401

Error during API call (API provider HTML error response here)

The server returned an HTTP 401 response.

Ensure that the provided credentials are correct and provide read access to the targeted data.

ApiError

429

Error during API call to (API provider HTML error response here)

The server returned an HTTP 429 response.

The collector will attempt to retry requests (default up to 3 times) and respect back-off headers if they exist. If the collector repeatedly encounters this error, adjust the rate limit and/or contact the API provider to ensure that you have enough quota to complete the data pull.

ApiError

500

Error during API call to (API provider HTML error response here)

The server returned an HTTP 500 response.

If the API returns a 500 but successfully completes subsequent runs then you may ignore this error. If the API repeatedly returns a 500 error, ensure the server is reachable and operational.

Collector operations

This section is intended to explain how to proceed with specific operations of this collector.

 Verify collector operations

Initialization

The initialization module is in charge of setup and running the input (pulling logic) and output (delivering logic) services and validating the given configuration.

A successful run has the following output messages for the initializer module:

2023-01-10T15:22:57.146    INFO MainProcess::MainThread -> Loading configuration using the following files: {"full_config": "config.yaml", "job_config_loc": null, "collector_config_loc": null}
2023-01-10T15:22:57.146    INFO MainProcess::MainThread -> Using the default location for "job_config_loc" file: "/etc/devo/job/job_config.json"
2023-01-10T15:22:57.147    INFO MainProcess::MainThread -> "\etc\devo\job" does not exists
2023-01-10T15:22:57.147    INFO MainProcess::MainThread -> Using the default location for "collector_config_loc" file: "/etc/devo/collector/collector_config.json"
2023-01-10T15:22:57.148    INFO MainProcess::MainThread -> "\etc\devo\collector" does not exists
2023-01-10T15:22:57.148    INFO MainProcess::MainThread -> Results of validation of config files parameters: {"config": "config.yaml", "config_validated": True, "job_config_loc": "/etc/devo/job/job_config.json", "job_config_loc_default": True, "job_config_loc_validated": False, "collector_config_loc": "/etc/devo/collector/collector_config.json", "collector_config_loc_default": True, "collector_config_loc_validated": False}
2023-01-10T15:22:57.171 WARNING MainProcess::MainThread -> [WARNING] Illegal global setting has been ignored -> multiprocessing: False

Events delivery and Devo ingestion

The event delivery module is in charge of receiving the events from the internal queues where all events are injected by the pullers and delivering them using the selected compatible delivery method.

A successful run has the following output messages for the initializer module:

2023-01-10T15:23:00.788    INFO OutputProcess::MainThread -> DevoSender(standard_senders,devo_sender_0) -> Starting thread
2023-01-10T15:23:00.789    INFO OutputProcess::MainThread -> DevoSenderManagerMonitor(standard_senders,devo_1) -> Starting thread (every 300 seconds)
2023-01-10T15:23:00.790    INFO OutputProcess::MainThread -> DevoSenderManager(standard_senders,manager,devo_1) -> Starting thread
2023-01-10T15:23:00.842    INFO OutputProcess::MainThread -> global_status: {"output_process": {"process_id": 18804, "process_status": "running", "thread_counter": 21, "thread_names": ["MainThread", "pydevd.Writer", "pydevd.Reader", "pydevd.CommandThread", "pydevd.CheckAliveThread", "DevoSender(standard_senders,devo_sender_0)", "DevoSenderManagerMonitor(standard_senders,devo_1)", "DevoSenderManager(standard_senders,manager,devo_1)", "OutputStandardConsumer(standard_senders_consumer_0)", 

Sender services

The Integrations Factory Collector SDK has 3 different senders services depending on the event type to delivery (internal, standard, and lookup). This collector uses the following Sender Services:

Logging trace

Description

Number of available senders: 1

Displays the number of concurrent senders available for the given Sender Service.

sender manager internal queue size: 0

Displays the items available in the internal sender queue.

This value helps detect bottlenecks and needs to increase the performance of data delivery to Devo. This last can be made by increasing the concurrent senders.

Total number of messages sent: 44, messages sent since "2022-06-28 10:39:22.511671+00:00": 21 (elapsed 0.007 seconds)

Displays the number of events from the last time the collector executed the pull logic. Following the given example, the following conclusions can be obtained:

  • 44 events were sent to Devo since the collector started.

  • The last checkpoint timestamp was 2022-06-28 10:39:22.511671+00:00.

  • 21 events were sent to Devo between the last UTC checkpoint and now.

  • Those 21 events required 0.007 seconds to be delivered.

By default these traces will be shown every 10 minutes.

Sender statistics

Each service displays its own performance statistics that allow checking how many events have been delivered to Devo by type:

Logging trace

Description

Number of available senders: 1

Displays the number of concurrent senders available for the given Sender Service.

sender manager internal queue size: 0

Displays the items available in the internal sender queue.

Standard - Total number of messages sent: 57, messages sent since "2023-01-10 16:09:16.116750+00:00": 0 (elapsed 0.000 seconds

Displays the number of events from the last time and following the given example, the following conclusions can be obtained:

  • 44 events were sent to Devo since the collector started.

  • The last checkpoint timestamp was 2023-01-10 16:09:16.116750+00:00.

  • 21 events where sent to Devo between the last UTC checkpoint and now.

  • Those 21 events required 0.007 seconds to be delivered.

 Check memory usage

To check the memory usage of this collector, look for the following log records in the collector which are displayed every 5 minutes by default, always after running the memory-free process.

  • The used memory is displayed by running processes and the sum of both values will give the total used memory for the collector.

  • The global pressure of the available memory is displayed in the global value.

  • All metrics (Global, RSS, VMS) include the value before freeing and after previous -> after freeing memory

INFO InputProcess::MainThread -> [GC] global: 20.4% -> 20.4%, process: RSS(34.50MiB -> 34.08MiB), VMS(410.52MiB -> 410.02MiB)
INFO OutputProcess::MainThread -> [GC] global: 20.4% -> 20.4%, process: RSS(28.41MiB -> 28.41MiB), VMS(705.28MiB -> 705.28MiB)

Change log

Release

Released on

Release type

Details

Recommendations

v1.1.0

NEW FEATURE

New features

  • Updated to DCSDK from 1.9.0 to 1.11.1

  • Update to Docker Image 1.2.0

Bug fixes

  • Added default tag to the config

Recommended version

v1.0.0

INITIAL RELEASE

-

-

  • No labels