Migration

Introduction
ChangeUnit methods
ChangeUnit attributes
ChangeUnit example
Best practices

The @ChangeLog annotation has been deprecated in favour of the @ChangeUnit. For more information check this section

# Introduction

A migration is composed by multiple smaller pieces called ChangeUnits, which are processed in order by the Mongock runner.

ChangeUnits are the unit of migration. These refer to the annotated classes where developers write migration logic/scripts.

All classes with the @ChangeUnit annotation will be scanned by Mongock to execute the migration.

A migration is constituted by an ordered list of ChangeUnits. Each of the ChangeUnits represent a unit of migration, which has the following implications:

Each ChangeUnit is wrapped in a transaction:.
- When transactions are possible(transactional environment), Mongock uses the mechanism provided by the database.
- On the other hand, in non-transactional environments, Mongock will try to provide an artificial transactional atmosphere by rolling back the failed change manually.
In targeted operations, such as undo, upgrade, etc., the ChangeUnit is the unit of the operation. For example, when performing an undo operation, it needs to specify the ChangeUnitId until which all the ChangeUnits are reverted(inclusive).
A ChangeUnit has only one migration method, which is marked with the @Execution annotation, and a rollback method, annotated with @RollbackExecution.

# ChangeUnit methods

Every class marked as @ChangeUnit will be marked as a migration class and can contain methods annotated as follow:

@Execution: The main migration method(Mandatory).
@RollbackExecution: This method basically reverts the changes made by the @Execution method. It's mandatory and highly recommended to properly implement it. It can be left empty if developers don't think is required in some scenarios.

It will be triggered in the two following situations:
- When the @Execution method fails in a non-transactional environment.
- In recovery operation like undo.
@BeforeExecution: Optional method that will be executed before the actual migration, meaning this that it won't be part of the transactional and executed in non-transactional context. It's useful to perform DDL operations in database where they are not allowed inside a transaction, like MongoDB, or as preparation for the actual migration.

This method is treated and tracked in the database history like the @Execution method, meaning this that in case of failure, it will force the migration to be aborted, tracked in the database as failed and Mongock will run it again in the next execution.

@RollbackBeforeExecution: Similar to the @RollbackExecution for the @Execution method. It reverts back the changes made by the @BeforeExecution method. It's only mandatory when the method @BeforeExecution is present.

# ChangeUnit attributes

Multiple attributes can be passed to ChangeUnits to configure these. The following table represents the list of attributes that you can set for preparing your migration:

Method	Description	Mandatory?	Default value
id	Returns the ChangeUnit's id that will be stored in the ChangeUnit history table/collection and will the way to identify a ChangeUnit. The combination of this field and the author must be unique among the changeunits.	YES	n/a
order	Returns the ChangeUnit's execution order. The order will be applied treating the value as alphanumeric.	YES	n/a
author	Returns the ChangeUnit's author. The combination of this and the author must be unique among the changeunits.	NO	`default-author`
runAlways	Returns whether the ChangeUnit is runAlways or not. For more information, visit the runner configuration section.	NO	`false`
systemVersion	Returns the ChangeUnit's system version. For more information, visit the runner configuration section.	NO	`0`

# ChangeUnit example

ChangeUnits accept dependency injections directly in the method and at constructor level

The value of @ChangeUnit's order annotation field will be treated as alphanumeric.

@ChangeUnit(id="myMigrationChangeUnitId", order = "001", author = "mongock_test", systemVersion = "1")
public class MyMigrationChangeUnit {
  private final MongoTemplate template;
  
  public MyMigrationChangeUnit(MongoTemplate template) {
    this.template = template;
  }
  //Note this method / annotation is Optional
  @BeforeExecution
  public void before() {
    mongoTemplate.createCollection("clients");
  }
  //Note this method / annotation is Optional
  @RollbackBeforeExecution
  public void rollbackBefore() {
    mongoTemplate.dropCollection("clients");
  }
  @Execution
  public void migrationMethod() {
    getClientDocuments()
      .stream()
      .forEach(clientDocument -> mongoTemplate.save(clientDocument, CLIENTS_COLLECTION_NAME));
  }
  @RollbackExecution
  public void rollback() {
        mongoTemplate.deleteMany(new Document());
  }
}

# Best practices

Use the Operation classes in favour of persisted objects in your ChangeUnits Although Mongock provides a powerful mechanism that allows you to inject any dependency you wish to your ChangeUnits, these are considered the source of truth and should treated like static resources, once executed shouldn't be changed.

With this in mind, imagine the scenario you have the class Client that represents a table in your database. You create a ChangeUnit which uses the field name of the client. One month later, you realise the field is not needed anymore and decide to remove it. If you remove it, the first ChangeUnit's code won't compile. This leaves you with two options: either remove/update the first ChangeUnit or keep the unneeded field name. Neither of which is a good option.

An example for MongoDB would be to use MongoTemplate in favour of using Repository classes directly to perform the migrations.
High Availability Considerations: In a distributed environment where multiple nodes of the application are running, there are a few considerations when building migrations with Mongock:
- Backwards compatible ChangeUnits While the migration process is taking place, the old version of the software is likely to still be running. During this time, it can happen that the old version of the software is dealing with the new version of the data. Scenarios where the data is a mix between old and new versions could also occur. This means the software must still work regardless of the status of the database. It can be a detriment to High Availability if ChangeUnits are non-backward-compatible ChangeUnits.
- 2-stage approach There are certain update operations that can leave code and data in an inconsistent state whilst performing a change. In such scenarios, we recommend to perform the change in two independent deployments. The first one: only provides additions and is compatible with the current deployed code. At this stage, the code would work with the old structure as well as the next change that will be applied. At this point, if the migration was executed, it affects the database but not the code allowing services to be running because the migration didn't produce a breaking change. The next step is required to ensure the new refactored code is also deployed. Once this is done, we have the first part of the data migration done(we only have to remove what's not needed anymore) and the code is able to work with the actual version of the data and the next migration that will be applying. The last stage is to do the new deployment with the data migration(which is compatible with the current code deployed) together with the code reflecting the change. Once again, there are chances that the data migration is done but the service itself(code) doesn't. This is not a problem as the code deployed is also compatible with the new version of the data.
Light ChangeUnits Try to wrap your migration in relatively light ChangeUnits. The concept of light is not universal, but the time to execute a ChangeUnit shouldn't mean a risk to the application's startup.

For example, when using Kubernetes in a transactional environment, if a ChangeUnit can potentially take longer than the Kubernetes initial delay, the services will proably fall into a infinite loop. If there is no ChangeUnit that puts this in risk, the worse case scenario is that the service will be re-started by the Kubernetes agents as it needs more time to acomplish the entire migration, but eventually the migration will finalise.
Try to enforce idempotency in your ChangeUnits (for non-transactional environment). In these cases, a ChangeUnit can be interrupted at any time. Mongock will execute again in the next execution. Although you have the rollback feature, in non-transactional environments it's not guaranteed that it's executed correctly.
ChangeUnit reduces its execution time in every iteration (for non-transactional environment). A ChangeUnit can be interrupted at any time. This means an specific ChangeUnit needs to be re-run. In the undesired scenario where the ChangeUnit's execution time is greater than the interruption time(could be Kubernetes initial delay), that ChangeUnit won't be ever finished. So the ChangeUnit needs to be developed in such a way that every iteration reduces its execution time, so eventually, after some iterations, the ChangeUnit finished.

ChangeLog

From Mongock version 5, @ChangeLog and @ChangeSet annotations are deprecated and shouldn't be used. However, these won't be removed for backwards compatibility.

Please follow one of the recommended approaches depending on your use case:

- For existing changeLogs created prior version 5: Leave it untouched, keeping the deprecated annotation.

- For new migrations from version 5: Use the new @ChangeUnit annotation instead.

Please visit the ChangeLog - version 4 section to access the ChangeLog documentation for Version 4.

For more information about the reason we have adopted this change, please visit our FAQ section.

Migration

Table of Contents

# Introduction

# ChangeUnit methods

# ChangeUnit attributes

# ChangeUnit example

# Best practices

ChangeLog