Menu Path : Management>Archive Cleanup

Read Access Rights:

  • Log in to Management Console

Write Access Rights:

  • Organize archive cleanup
  • Organize message defragmentation

The Archive Cleanup page enables you to manage disk space usage by cleaning up archived messages and defragmenting the Error Queue:

Archive Cleanup

Copies of messages that leave Rhapsody (through an output communication point or by deletion from a queue) and all notification events raised in Rhapsody are kept for archival purposes. Users can view or resend these messages. However, it is not advisable that messages be kept indefinitely, as they do take up disk space. The archive cleanup is the mechanism by which messages are cleaned up.

An archive cleanup task can be run manually or scheduled to run every x hours (where x is a whole number greater than one) at y minutes past the hour (0 - 59), daily at a specified time, weekly (at a specified day and time), or monthly (at a specified day and time). Alternatively, you can use a cron expression to schedule an archive cleanup task.

The settings for archive cleanup are set in the Archive Cleanup panel of the Archive Cleanup page:

Manual Cleanup

You can also run an archive cleanup task manually.

Click the Cleanup Now link to start an archive cleanup task manually. A manual archive cleanup task cleans up any messages and logs older than the time specified in the Keep archive data for, Keep logs for and Keep notification events for fields.

Automatic Cleanup

For an automatic cleanup the following settings must be specified:

  • Keep archive data for - the number of hours or days messages will be retained in the archive. After this time, the messages will no longer be available for viewing or resending.
  • Keep logs for - the number of days worth of binary logs to keep with Rhapsody. This is the number of days the log messages will be visible via the Management Console.

    This setting only relates to the binary logs used for visibility in the Management Console, and not the textual logs kept in the <rhapsody>\logs directory.

  • Cleanup notification event history - the number of days to keep notification events (they are kept indefinitely when the checkbox is unchecked). By default, notification events are archived and deleted after 90 days.
  • Cleanup backups - the number of days to keep backups (they are kept indefinitely when the checkbox is unchecked). By default, backups are kept indefinitely. 
  • Cleanup - when the archive cleanup task should run. You can schedule archive cleanup to run according to your requirements, by selecting and completing the cleanup fields. Alternatively, select the Advanced button and enter the appropriate details in the Cron Expression field to schedule archive cleanup using a cron expression.

Select the Apply button to start the automatic archive cleanup process.

Scheduling Archive Cleanup

The archive cleanup task can have a minor impact on engine performance. Each component queue is locked for a small period of time (generally sub-second) while the queue is analyzed for live messages.

Archive Cleanup can be scheduled to run every x hours (where x is a whole number greater than one) at y minutes past the hour (0 - 59), daily at a specified time, weekly (at a specified day and time), or monthly (ar a specified day and time). By default, the archive cleanup process runs every hour (on the hour).

However, you can schedule the archive cleanup task to run according to your requirements, by completing the frequency fields. Alternatively, select the Advanced button and enter the appropriate details in the Cron Expression field to schedule an archive cleanup task using a cron expression. You must use the syntax described in Syntax Rules.

Syntax Rules

Archive cleanup uses the cron expressions as understood by the Quartz Scheduler.

The format for the expression is <second> <minute> <hour> <day-of-month> <month> <day-of-week>, where each field is defined as:

  • <second> - the second at which the action should occur, 0 through 59. Do not set as *, otherwise the action is run every second.
  • <minute> - the minute at which the action should occur, 0 through 59. Do not set as *, otherwise the action is run every minute.
  • <hour> - the hour at which the action should occur, 0 through 23. Use * for every hour.
  • <day-of-month> - the day of the month on which the action should occur, 1 through 31. Use * for every day. Use ? if the day is unknown.
  • <month> - the month in which the action should occur, 1 through 12. Use * for every month.
  • <day-of-week> - the day of the week on which the action should occur. Enter three-letter acronyms for the days of the week. For example, Mon, Tues, Wed, Thu, and so on. Alternatively, you can use 1 through 7, where 1 is Sunday, 2 is Monday and 7 is Saturday. Use * for every day. Use ? if the day is unknown.

Special Characters

The following special characters are also allowed:

Character

Description

* (all values)

Used to select all values within a field. For example, * in the minute field means every minute.

? (no specific value)

Useful when you need to specify something in one of the two fields in which the character is allowed, but not the other. For example, if you want to trigger an event on a particular day of the month (say, the 10th), but don't care what day of the week that happens to be, enter a 10 in the day-of-month field, and ? in the day-of-week field.

,

Used to specify additional values. For example, MON,WED,FRI in the day-of-week field means the days Monday, Wednesday, and Friday.

L (last day of the month)

Can be used in the <day-of-month> to refer to the last day of the month. For example, 0 0 * L * ?.

Example Expressions

The following table lists some examples of basic expressions:

Example

Description

0 0 12 * * ?

Run at 12pm (noon) every day.

0 15 10 ? * *

Run at 10:15am every day.

0 * 14 * * ?

Run every minute starting at 2pm and ending at 2:59pm, every day.

0 0/5 14 * * ?

Run every 5 minutes starting at 2pm and ending at 2:55pm, every day.

0 15 10 15 * ?

Run at 10:15am on the 15th day of every month.

0 10,44 14 ? 3 WED

Run at 2:10pm and at 2:44pm every Wednesday in the month of March.

0 15 10 ? * MON-FRI

Run at 10:15am every Monday, Tuesday, Wednesday, Thursday and Friday.

Support for specifying both a day-of-week and a day-of-month value is not available. You need to use the ? character in one of these fields.

Cleanup Status

The archive cleanup task may detect files that are eligible for clean up but cannot be deleted at that moment in time. Typically, this is because the file has been read at most five minutes prior to Archive Cleanup attempting to delete it. An archive cleanup task can be run again at a later time when it will attempt to delete these files again.

If the number of files that fail to be deleted continues to grow, then you should look in the archive cleanup logs to see which files could not be removed. By default, the logs are stored in log/archiveCleanupLog/cleanup_log.txt, where all successfully and unsuccessfully deleted files are logged during an archive cleanup. The log is a Log4J file which can be configured in the log4j.properties file with the logger name ArchiveCleanup.

In the worst case scenario, files that fail to be deleted during Archive Cleanup will be marked for deletion when the JVM exits. If you encounter issues where the archive cleanup task continually fails to delete a file, contact Rhapsody Support.

Archive cleanup also calculates the number of messages on the Error Queue that are eligible for defragmentation.

Detecting Old Error Messages

Archive Cleanup automatically detects when defragmentation is required on the Error Queue and notifies the user with a count of messages that are eligible for defragmentation and the following notifications:

  • A warning banner on the Archive Cleanup page. This banner is automatically dismissed only when there are no more messages eligible for defragmentation on the Error Queue.
  • An alert in the System Monitor. Refer to Custom Thresholds to configure this alert. The alert is also presented as a warning icon in the navigation menu.

Refer to Error Queue Defragmentation for details on how Rhapsody defragments the Error Queue.

Live Message Scanning During Archive Cleanup

Live message scanning enables archive cleanup to determine which messages are live in the communication point input queue. You can set the input queue size threshold for live messages using the rhapsody.properties file in order to determine how to perform live message scanning during archive cleanup:

Property Description Default Value

QueueService.input.liveMessageScanningThreshold

If the input queue size exceeds the threshold, scanning is performed file by file. If an input queue size is lower than the threshold, scanning is performed message by message. The minimum allowable value is 10.

50

Error Queue Defragmentation

Rhapsody stores message metadata in files, with one file usually containing metadata for multiple messages. A file can only be removed once all of the messages it references are eligible for archive cleanup. Consequently, messages left on the Error Queue prevent not only themselves from being cleaned up, but also other messages that were originally received at the same time. Reducing the file size can alleviate the issue, but greatly increases the overhead required to perform a cleanup.

Rhapsody can defragment messages on the Error Queue by re-writing them into new files in the message store, so that the old files can then be removed by the archive cleanup task. The Error Queue defragmentation task can have a minor impact on engine performance.

When a message has been defragmented, it retains the message data of the original message but not the full event path. A defragmented message only retains the first and last message events.

You cannot:

  • Reprocess defragmented messages at intermediary steps between the first and last message events.
  • Use the Snapshot Load filter to load snapshots of a defragmented message at intermediate steps between the first and last message events.

In Message View, you can view the truncated path of a defragmented message and its message body and properties, and display the details of the original message by clicking its message ID. If the original message has been cleaned up, clicking the message ID displays a message indicating the original message has been cleaned up.


Messages determined to be eligible for defragmentation are normally error messages older than Keep archive data for period plus a configurable duration (two days by default). 

Under the following circumstances, archive cleanup does not mark sufficiently old messages as eligible for defragmentation:

  • When archive cleanup has not yet processed the Error Queue (because all the files that store message metadata reference a least one messages on a live queue).
  • When archive cleanup determines that running the defragmentation task would not yield any benefit (for example, when only one file that stores message metadata exists).

While Error Queue defragmentation functionality can assist you to manage the Error Queue, it is not a substitute for Error Queue management.

Running the Defragmentation Task

To run an Error Queue defragmentation task:

  1. Configure the settings for the defragmentation task. Refer to Configuring the Defragmentation Task for details.
  2. Run the task either:
    • Manually by clicking the Defrag Now link.
    • Automatically after archive cleanup or by scheduling it (both are disabled by default). Refer to Scheduling the Defragmentation Task for details on scheduling the defragmentation task.
  3. Review the status of the defragmentation task. Refer to Defragmentation Task Status for details.

Configuring the Defragmentation Task

You can manage Error Queue defragmentation tasks through the Error Queue defragmentation panel on Archive Cleanup page:

You can set the following configuration options through the Management Console:

Setting

Description Default Value

Defrag threshold

The number of days after the archive period that a message is to be eligible for defragmentation.

In other words, messages determined to be eligible for defragmentation are error messages older than the Keep archive data for period plus the Defrag threshold.

2 (days)

Defrag batch size

The number of messages that can be defragmented before the defragmentation task is paused.

Defragmenting messages in batches and limiting the total number of messages that can be defragmented can reduce the potential performance impact of the defragmentation task on message processing.

1000 (messages)
Pause between batches  The duration the defragmentation task is paused between message batches. 60 (seconds)
Max. messages to defrag The maximum number of messages that can be defragmented in a single run. 10000 (messages)
Run Defrag

You can configure a defragmentation task to run using one of the following options:

  • Scheduled - according to a user-defined schedule.
  • Automatic - automatically after archive cleanup.
  • Off (default) - scheduled and automatic running of the defragmentation task is disabled.

You manually run the defragmentation task by clicking the Defrag Now link.
You can cancel a  running defragmentation task by clicking the Cancel link.

Scheduled (to run at 3:00 am daily)
Schedule

You can schedule a defragmentation task to run on the following basis:

  • Hourly
  • Daily
  • Weekly
  • Monthly
  • Advanced

Refer to Scheduling the Defragmentation Task for details.


You can set the following configuration options through the  rhapsody.properties  file:

# Priority for the thread that runs the message defrag
#CleanupService.messageDefrag.threadPriority=1
# Minimum disk space available to allow defrag
#CleanupService.messageDefrag.minimumSpaceMegabytes=2048

Scheduling the Defragmentation Task

You can schedule to run the defragmentation task according to the following schedule:

Scheduling Frequency

Description

Hourly Every h hours at m minutes past the hour
Daily At time hh:mm
Weekly On day d at time hh:mm
Monthly On day at time hh:mm
Advanced As per a cron expression, for example 0 0 * * * ? to run on the hour every hour.

Refer to Scheduling Archive Cleanup for details on scheduling.

Defragmentation Task Status

The status of any current  defragmentation task and results of the last defragmentation task are displayed on the archive cleanup page in the Management Console:


Error Queue Defragmentation Status Description
Current status Whether the defragmentation task is currently Running or Not Running.
Last run time Whether the defragmentation task has been run since Rhapsody was last started, and when.
Number of messages defragged The total number of messages defragmented by the last defragmentation task.
Stopped prematurely

Whether the last defragmentation task defragmented all eligible messages before:

  • Reaching the maximum number of messages configured for a single run, or
  • The amount of disk space fell below the minimum threshold value.