Introduced in version 3.18.
Users with an Administrator role can configure the following run cleanup settings to save system resources:
- You can configure Data360 Analyze to automatically purge runs generated by schedules.
- You can configure Data360 Analyze to automatically purge managed Data Flow Outputs created by scheduled runs.
- You can configure Data360 Analyze to automatically purge node states created by scheduled runs.
- You can delete temporary data after a successful run.
Configure automatic run purging
Automatic run purging is useful if you have a high volume of regular runs and only need to see the most recent.
- From the folders panel, click Settings.
- Select Scheduling.
- From the Details panel, specify which scheduled runs you want to remove from Data360 Analyze by selecting the relevant checkboxes and entering the required number of runs, days, or both. Tip: You can combine the two options (Keep (X) most recent runs and Delete runs older than (X) days). A run is deleted if either option applies to it. For example, if you keep the five most recent runs and also delete runs older than five days, the fifth most recent run is deleted once it is more than five days old.
- Click Apply Changes.
The cleanup settings apply to all runs generated by schedules. When enabled, the cleanup runs every day at midnight. When Keep (X) most recent runs is selected, the purge routine takes place after each run of the schedule.
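The way the two options combine can be illustrated with a short sketch. This is not Data360 Analyze code: the function name `runs_to_purge` and its parameters are hypothetical, and the sketch assumes that a run is purged when either option selects it, as the tip in the steps above describes.

```python
from datetime import datetime, timedelta


def runs_to_purge(run_times, now, keep_most_recent=None, max_age_days=None):
    """Return the run timestamps that either rule would purge.

    Hypothetical model of the documented behavior, not product code.
    """
    newest_first = sorted(run_times, reverse=True)
    purge = set()
    if keep_most_recent is not None:
        # Everything beyond the N most recent runs is purged.
        purge.update(newest_first[keep_most_recent:])
    if max_age_days is not None:
        cutoff = now - timedelta(days=max_age_days)
        # Runs older than the cutoff are purged, even if they are
        # among the N most recent runs.
        purge.update(t for t in newest_first if t < cutoff)
    return sorted(purge)


now = datetime(2024, 1, 10)
# Six scheduled runs, aged 0, 1, 2, 3, 6 and 7 days.
runs = [now - timedelta(days=d) for d in (0, 1, 2, 3, 6, 7)]
purged = runs_to_purge(runs, now, keep_most_recent=5, max_age_days=5)
print([(now - t).days for t in purged])  # ages of the purged runs, oldest first
```

In this example the run that is six days old is among the five most recent runs, but it is still purged because it is older than five days.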
Configure automatic Data Flow Output purging
Data Flows can be configured to publish Data Flow Outputs. Scheduled Data Flows can generate Data Flow Output datasets, which can sometimes be very large. You can configure the cleanup settings to automatically delete these datasets after a certain number of days. However, if the datasets are marked as not managed, they will not be cleaned up automatically. In such cases, it is the user's responsibility to manually clean up the unmanaged datasets.
- From the folders panel, click Settings.
- Select Scheduling.
- From the Details panel, select the checkbox Delete Data Flow outputs older than [x] days and specify how many days the datasets will be retained.
- Click Apply Changes.
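The distinction between managed and unmanaged datasets can be shown with a minimal sketch. The `OutputDataset` class, its fields, and the `outputs_to_delete` function are hypothetical names used only for illustration; they are not part of Data360 Analyze.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta


@dataclass
class OutputDataset:
    # Hypothetical record describing one Data Flow Output dataset.
    name: str
    created: datetime
    managed: bool


def outputs_to_delete(datasets, now, older_than_days):
    """Return the names of datasets the automatic cleanup would remove.

    Only managed datasets older than the cutoff are selected;
    unmanaged datasets are always left for manual cleanup.
    """
    cutoff = now - timedelta(days=older_than_days)
    return [d.name for d in datasets if d.managed and d.created < cutoff]


now = datetime(2024, 1, 31)
datasets = [
    OutputDataset("sales_daily", datetime(2024, 1, 1), managed=True),
    OutputDataset("sales_archive", datetime(2024, 1, 1), managed=False),
    OutputDataset("sales_latest", datetime(2024, 1, 30), managed=True),
]
print(outputs_to_delete(datasets, now, older_than_days=7))
```

Here only `sales_daily` is selected: `sales_archive` is just as old but is not managed, and `sales_latest` is still within the retention period.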
Configure automatic node state purging
Scheduled Data Flows can generate a lot of node state data. This option helps you clean up the node states, including temporary data files, while keeping the run state and managed data sets intact.
- From the folders panel, click Settings.
- Select Scheduling.
- From the Details panel, select the checkbox Delete node states older than [x] days and specify how many days the node states will be retained.
- Click Apply Changes.
Delete temporary run data
You can save system resources by deleting temporary files when a run completes successfully.
- From the folders panel, click Settings.
- Select Scheduling.
- From the Details panel, specify whether you want to delete temporary run data when a run completes successfully. Choose from:
  - Never - Select this option if you have a requirement to retain all temporary run data.
  - Immediately - Select this option if you do not need to keep temporary run data for successfully completed runs.
  - After X days - Select this option if you need to keep all temporary run data for a specified period of time, for example, if you have an audit requirement to retain all run data for 30 days.
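The three options can be compared with a small sketch that works out when the temporary data of a run becomes eligible for deletion. The function `temp_data_delete_time` and the policy strings are hypothetical and do not correspond to any Data360 Analyze API. The sketch also assumes that the setting has no effect on runs that did not complete successfully, because the options above only describe successful runs.

```python
from datetime import datetime, timedelta
from typing import Optional


def temp_data_delete_time(
    policy: str,
    completed_at: datetime,
    succeeded: bool,
    after_days: Optional[int] = None,
) -> Optional[datetime]:
    """Return when temporary run data becomes eligible for deletion, or None.

    Hypothetical model of the three options, not product code.
    """
    if not succeeded or policy == "never":
        # Assumption: the setting only affects successfully completed runs.
        return None
    if policy == "immediately":
        return completed_at
    if policy == "after_days":
        if after_days is None:
            raise ValueError("after_days is required for the 'after_days' policy")
        return completed_at + timedelta(days=after_days)
    raise ValueError(f"unknown policy: {policy!r}")


finished = datetime(2024, 1, 1, 2, 0)
# A 30-day audit requirement, as in the After X days example.
print(temp_data_delete_time("after_days", finished, succeeded=True, after_days=30))
```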