Symantec logo

Failing back using fast failback synchronization

We recommend that you use the fast failback synchronization method. This procedure assumes that the fast failback feature was enabled on the new Primary when takeover was performed. Failing back to the original Primary using fast failback involves the following steps:

  1. Converting the original Primary to an acting Secondary, as shown in the
    vxprint -l rvgname output, and replaying the DCM or SRL of the original Primary to set bits in the DCM of the new Primary. This is performed automatically when the Primary recovers, unless fast failback was disabled during the takeover.

    It is possible that the Primary and Secondary data volumes are not up-to-date because all updates to the original Primary might not have reached the Secondary before the takeover. The failback process takes care of these writes by replaying the SRL or DCM of the original Primary. After the original Primary detects that a takeover has occurred, the new Primary uses information in the DCM or SRL of the original Primary to set bits in its DCM for any blocks that changed on the original Primary before the takeover. You can use the vxrlink status command to monitor the progress of the DCM replay.

  2. Converting the original Primary to Secondary and synchronizing the data volumes on the original Primary with the data volumes on the new Primary using the vradmin fbsync command. This command replays the failback log to synchronize the data volumes. The blocks that changed on the original Primary are resynchronized with the new Primary after the DCM of the new Primary is replayed. During the resynchronization, the data from the data volumes on the new Primary is transferred to the data volumes on the original Primary.

    This step is not required if the -autofb option was used at the time of the takeover. The data on the original Primary data volumes is inconsistent for the duration of the replay. To keep a consistent copy of the original Primary data, take a snapshot of the data volumes before starting the replay. When using the vradmin fbsync command, you can also specify the cache or the cachesize option so that a space-optimized snapshot of the original Primary data volumes is automatically created. If the RVG on the original Primary has VxVM ISP volumes, then you cannot use the cachesize attribute.

  3. Migrating the Primary Role back to the original Primary and starting replication.

In the following illustration, the original Primary seattle has recovered and is now the acting Secondary. The new Primary london uses information in the DCM or SRL of the original Primary to set bits in its DCM for any blocks that changed on the original Primary before the takeover.

Click the thumbnail above to view full-sized image.

In the following illustration, the fast failback feature was enabled on the new Primary london when takeover was performed.

The original Primary seattle is being resynchronized using the failback log.

Click the thumbnail above to view full-sized image.

Example 1—Failing back to the original Primary using fast failback

In this example, the Primary host seattle has restarted after an unexpected failure. After the failure, the original Primary seattle was taken over by the Secondary host london. Each data volume on the Secondary london has a Data Change Map (DCM) associated with it. As a result, fast failback is enabled on london.

An application is running on london and incoming writes are being logged to its DCM. This example shows how to fail back to the original Primary seattle using the fast failback feature.

 To fail back to the original Primary seattle using fast failback

  1. Examine the original Primary and make sure you want to convert the original Primary to Secondary.
  2. Convert the original Primary to Secondary and synchronize the data volumes in the original Primary RVG hr_rvg with the data volumes on the new Primary RVG hr_rvg on london using the fast failback feature. To synchronize the Secondary using fast failback, type the following command on the new Primary london or the original Primary seattle:

# vradmin -g hrdg [-wait] fbsync hr_rvg \

[cache=cacheobj | cachesize=size]

When the synchronization completes, go to the next step. You can check the status of the synchronization using the vxrlink status command. The -wait option with the vradmin fbsync command can also be used to wait for the completion of the synchronization process.

The cache attribute specifies a name for the precreated cache object, on which the snapshots for the volumes in the specified RVG will be created. For more information on creating the cache object refer to the information provided in the section Preparing the RVG volumes for snapshot operation. The cachesize attribute specifies a default size for the cache object with respect to the source volume. You can specify only one of these attributes at one time with the vradmin fbsync to create one cache object for each snapshot.

The parameters cache and cachesize are optional. If you do not specify either of these parameters then the vradmin fbsync will convert the original Primary to a Secondary and synchronize the data volumes on the original Primary with the data volumes on the new Primary, without creating the snapshots.

This step is not required if the -autofb option was used at the time of takeover

  1. At a convenient time, stop the application on the new Primary.
  2. Migrate the Primary role from the new Primary host london to the original Primary host seattle by typing the following command on any host in the RDS:

# vradmin -g hrdg migrate hr_rvg seattle

Replication from the original Primary seattle to the original Secondary london is started by default.

  1. Restart the application on the original Primary seattle. Because the application was stopped properly before the migration, an application recovery is not required.
Example 2—Failing back to the original Primary using fast failback in a multiple Secondaries setup

In this example, the setup consists of two Secondaries, london and tokyo. The Primary host seattle has restarted after an unexpected failure. After the failure, the original Primary seattle was taken over by the Secondary host london. Each data volume on the Secondary london has a Data Change Map (DCM) associated with it. As a result, fast failback is enabled on london.

If you had created RLINKs between the hosts london and tokyo when setting up the replication, you do not need to manually reconfigure the additional Secondary tokyo as a part of the RDS. It will automatically be added as a Secondary of the new Primary london.

An application is running on london and incoming writes are being logged to its DCM. This example shows how to fail back to the original Primary seattle using the fast failback feature.

  1. Perform step 1 to step 4 described in the example Example 1—Failing back to the original Primary using fast failback.
  2. After migration, you must synchronize the additional secondary tokyo with the original Primary seattle.

    On the original Primary seattle:

# vradmin -g hrdg -c checkpt syncrvg hr_rvg tokyo

The -c option when used with the vradmin syncrvg command automatically starts a checkpoint with the specified name, checkpt, in this example. After the data volumes are synchronized, the checkpoint is ended.

  1. Restart the application on the original Primary seattle. Because the application was stopped properly before the migration, an application recovery is not required.