Upgrading Self-Managed Commerce · Self-Managed Commerce

This section provides generic instructions to upgrade a version of Self-Managed Commerce to a newer version. Depending on your customizations, you might need to perform additional tasks before, during, or after the upgrade.

The upgrade process consists of three phases:

Preparation: Gathers information about your known customizations and decide how you want to mitigate your risks.
Customization Analysis (Optional): Analyzes determine what customizations are made. This optional phase is for organizations if the customizations made to the platform is unknown.
Upgrade: Performs the upgrade to a newer version of Self-Managed Commerce.

Preparation

The preparation phase involves gathering information and mitigating risks before performing the upgrade. The following diagram shows an overview of the preparation process:

Upgrade Preparation

Identify customizations: Create a list of all known Elastic Path platform customizations. To know your Elastic Path platform customizations, see Customization Analysis and reverse engineer the code to find them.
Determine versions: Get the appropriate source code for your upgrade.
Identify modernizations: Do the following:
- Review the documentation and technology stack for the new version of Elastic Path
- Review the Elastic Path Release Notes, for the current version of Elastic Path and for all versions until the version you are upgrading to or other details
Review version-specific upgrade notes: Review the upgrade notes for your target version and all intermediate versions since your current version.
Identify a regression test strategy: Identify customizations that lack written or automated test cases.
Reduce risk where possible: Do the following:
- Address the risks found during the Customization Analysis
- Improve test coverage
- Refactor code
- Provision the resources that require time to arrange ahead of time, such as new hardware or training
- Defer upgrades of independent systems (scope reduction)
Train the team on new version: Create a training plan to train the team on new technologies, if any.
Identify components to upgrade: Identify the minimum set of components or dependencies that must be upgraded. Hosted infrastructure might need to be upgraded for the new Elastic Path platform.

Customization Analysis

Customization analysis helps you derive a complete list of the customizations that are made to the platform in a time-boxed investigation. This section focuses on the completeness of the list, not on the detailed understanding of each customization.

Customization Analysis

Apply reverse engineering patterns: Reverse engineering is a general software engineering practice.
Identify risks in the source code: Record the following problems if you find them while examining the source code:
- No test coverage
- Code smells
- Cross-cutting concerns
Identify risks in the infrastructure: Record the following problems if you find them while examining the environment installation, build, deploy, and release processes:
- No Continuous Integration
- Manual deploy or release process
- Lack of staging environments
Prepare upgrade action list: Arrange the unorganized list of features or customizations for one of the following upgrade actions:
- Upgrade feature as-is
- Problematic feature implementation that requires reimplementation
- Implement a new feature using functionality in a newer version of Self-Managed Commerce
- Remove unused code or feature

Upgrade

The upgrade phase involves the steps to merge the latest release with your customizations, deploy the new environments, and cut over to the upgraded environment. The steps required for this phase are illustrated at a high level by the diagram below:

Upgrade Process

The steps in this phase should be followed starting at the top and making your way down the tree. Each step can be completed as long as the steps with arrows directing to that step are completed. If multiple steps are available, these can be completed in parallel by different team members. Each step is described in more detail in the sections below:

Local environment setup

Review the Configuring Environment steps for your target version. In some cases, the required Java version, Maven version, database version, or IDE configuration may have changed. Ensure that your local environment is setup properly to allow you to build the source code and startup the services locally.

Merge and fix compilation issues

Open a terminal window and change the current directory to your existing ep-commerce source code folder. Make sure you are in your main branch, and have retrieved the latest source code.

Next, create a branch for the upgrade by running the following (replace the version number with the version you are targeting):

git checkout -b upgrade/8.5.x

Then ensure that you have a Git remote configured to retrieve the latest version from Elastic Path. Run the following:

git remote -v

Review the list and see if any of the entries are for git@code.elasticpath.com:ep-commerce/ep-commerce.git. If there is, make note of the remote name. If not, run the following:

git remote add upstream git@code.elasticpath.com:ep-commerce/ep-commerce.git

Next, retrieve the upstream repository by running the following (replace upstream with your name for the ep-commerce repository, if necessary):

git fetch upstream

Then start the merge process as follows (replace the version number with the version you are targeting):

git merge release/8.5.x

You may see a large number of merge conflicts. Many of these are due to the way that Elastic Path releases patches and version upgrades, which do not have a shared ancestor that Git recognizes. Start addressing these conflicts by running the following:

git mergetool

In most cases, you can resolve the conflicts by taking the theirs version. However, if your team has customized platform source, you may need to manually combine parts of the ours version and the theirs version.

Address all conflicts and then confirm that the source compiles by running:

mvn clean install -DskipAllTests -Djacoco.skip=true

Address any compilation issues until you are able to successfully compile. Commit your changes and push your branch to your Git repository so other team members can access it.

note

Many of the steps above will soon be replaced with a new tool called the Self-Managed Commerce Upgrader, which will automate much of this step. This includes fetching the remote source code, initiating the merge, and resolving many of the conflicts between official patches and the release branch.

Upgrade deployment tooling

Ensure that your deployment tooling is able to deploy both your existing Self-Managed Commerce version as well as the target version.

If you are using CloudOps for Kubernetes, review the compatibility matrix to identify a version that can support both your existing and target Self-Managed Commerce versions. If necessary, upgrade CloudOps for Kubernetes to ensure that your build and deployment pipelines function correctly.

Fix Checkstyle and PMD errors

In this step, fix all Checkstyle, PMD, and Jacoco errors. To identify these failures, run a build without any unit tests or integration tests as follows:

mvn clean install -DskipTests=true -DskipITests=true -DskipCucumberTests=true

Fix unit tests

In this step, fix all failing unit tests. To identify these failures, run a build without any Checkstyle, PMD, or integration tests as follows:

mvn clean install -DskipITests=true -DskipCucumberTests=true -Dpmd.skip=true -Dcheckstyle.skip=true -Djacoco.skip=true

Fix local database reset issues

In this step, fix all issues with Liquibase scripts while resetting your local database. To identify these failures, run a local database reset as follows:

cd extensions/database
mvn clean install -Dreset-db

Fix Cortex startup issues

In this step, start Cortex locally and ensure that it is able to startup successfully. Instructions for starting Cortex locally can be found here.

Fix Search Server startup issues

In this step, start Search Server locally and ensure that it is able to startup successfully. Instructions for starting Search Server locally can be found here.

Fix Commerce Manager startup issues

In this step, start Commerce Manager locally and ensure that it is able to startup successfully. Instructions for starting Commerce Manager locally can be found here.

Fix Integration Server startup issues

In this step, start Integration Server locally and ensure that it is able to startup successfully. Instructions for starting Integration Server locally can be found here.

Fix Batch Server startup issues

In this step, start Batch Server locally and ensure that it is able to startup successfully. Instructions for starting Batch Server locally can be found here.

Fix Data Sync Server startup issues

In this step, start Data Sync Server locally and ensure that it is able to startup successfully. Instructions for starting Data Sync Server locally can be found here.

Deploy dev environment

In this step, deploy a dev environment for the target version using your deployment tooling and ensure that all services start successfully and can be accessed by your development team.

If you are using CloudOps for Kubernetes, see Options for Deploying Self-Managed Commerce Environments.

Fix integration tests

In this step, fix all failing integration tests. To identify these failures, run a build without any unit or Cucumber tests as follows:

mvn clean install -DskipTests=true -DskipCucumberTests=true

Fix staging snapshot database upgrade issues

In this step, ensure that your staging database can be upgraded to the target version without issues. To test the upgrade process, follow these sub-steps:

Create a snapshot of your staging database. This can be done using database or AWS-specific tooling.
Restore the snapshot to a new database or schema. This can be done using database or AWS-specific tooling.
Run the Data Population Tool against the restored database using the update-db command to upgrade the schema and data to be compatible with the new target version.
Ensure that the job completes successfully. If not, address any failures in the Liquibase changesets.
Delete the staging database.

An example of how to do this using CloudOps for Kubernetes is as follows:

Create a snapshot of your staging database using the AWS console.
Run the create-and-manage-data-server Jenkins job, setting the snapshotToRestore parameter to the name of the snapshot generated in step 1.
Run the run-data-pop-tool Jenkins job, setting the dataPopToolCommand parameter to update-db, and setting the kubernetesNickname value to the same value specified in step 2.
Run the create-and-manage-database-server Jenkins job, setting the kubernetesNickname value to the same value specified in step 2, and setting the deleteDatabase parameter.

Fix prod snapshot database upgrade issues

In this step, ensure that your production database can be upgraded to the target version without issues. To test the upgrade process, follow these sub-steps:

Create a snapshot of your staging database. This can be done using database or AWS-specific tooling.
Restore the snapshot to a new database or schema. This can be done using database or AWS-specific tooling.
Sanitize the data by clearing sensitive customer and order data with personally-identifiable information.
Run the Data Population Tool against the restored database using the update-db command to upgrade the schema and data to be compatible with the new target version.
Ensure that the job completes successfully. If not, address any failures in the Liquibase changesets.
Delete the prod database.

An example of how to do this using CloudOps for Kubernetes is as follows:

Create a snapshot of your production database using the AWS console.
Run the create-and-manage-data-server Jenkins job, setting the snapshotToRestore parameter to the name of the snapshot generated in step 1.
Sanitize the data by clearing sensitive customer and order data with personally-identifiable information.
Run the run-data-pop-tool Jenkins job, setting the dataPopToolCommand parameter to update-db, and setting the kubernetesNickname value to the same value specified in step 2.
Run the create-and-manage-database-server Jenkins job, setting the kubernetesNickname value to the same value specified in step 2, and setting the deleteDatabase parameter.

Prod database upgrade performance testing

In this step, we repeat the steps described in Fix prod snapshot database upgrade issues, but this time make note of how long the upgrade process takes. This is important because it reflects how long your site will be offline during the production cutover. If the upgrade process is taking too long, review the logs and identify which changeset IDs are taking the most time, and open a support request for assistance.

Storefront updates

In this step, connect a test deployment of your front-end to the dev environment with the target version of Self-Managed Commerce. Ensure that all functionality still works as expected. Unless you are upgrading to a new major release, the Cortex API should be backwards compatible with the previous version.

Fix Cucumber tests

In this step, fix all failing Cucumber tests. To identify these failures, run a build as follows:

mvn clean install -DskipTests=true -DskipITests=true

Fix Selenium tests

In this step, fix all failing Selenium tests. To identify these failures, follow the Failsafe Selenium Tests instructions. To run all Selenium tests, omit the -Dcucumber.options parameter, and make sure to run the tests in both of these folders:

extensions/cm/ext-cm-modules/system-tests/selenium
extensions/cm/ext-cm-modules/ext-system-tests/selenium

Refactor customizations into Extension Point Framework extensions where possible

The Extension Point Framework is a new approach to building customizations that isolates your custom code from the platform source. It also provides guarantees around compatibility of your customizations with future upgrades. Review the compatibility matrix to see which versions of the Extension Point Framework Connectivity Library are compatible with your target version of Self-Managed Commerce. Then review the Extension Points that are available in that version to determine if any of your customizations are candidates to be migrated into Extension Point Framework extensions.

For more information about the Extension Point Framework, see our documentation.

Regression testing

Test the storefront and Commerce Manager using the dev environment to ensure that all expected functionality is working as expected.

Regression bug fixes

In this step, address any broken functionality found during regression testing.

Deploy new staging environment

In this step, we deploy a staging environment for the target version using your deployment tooling. To do this, follow these sub-steps:

Create a snapshot of your staging database. This can be done using database or AWS-specific tooling.
Restore the snapshot to a new database or schema. This can be done using database or AWS-specific tooling.
Run the Data Population Tool against the restored database using the update-db command to upgrade the schema and data to be compatible with the new target version.
Deploy the Self-Managed Commerce services, connecting to the database that was restored.

An example of how to do this using CloudOps for Kubernetes is as follows:

Create a snapshot of your staging database using the AWS console.
Run the create-and-manage-data-server Jenkins job, setting the snapshotToRestore parameter to the name of the snapshot generated in step 1.
Run the run-data-pop-tool Jenkins job, setting the dataPopToolCommand parameter to update-db, and setting the kubernetesNickname value to the same value specified in step 2.
Run the deploy-or-delete-commerce-stack Jenkins job, setting the kubernetesNickname value to the same value specified in step 2.

Deploy new prod environment

In this step, we deploy a prod environment for the target version using your deployment tooling. To do this, follow these sub-steps:

Create a snapshot of your prod database. This can be done using database or AWS-specific tooling.
Restore the snapshot to a new database or schema. This can be done using database or AWS-specific tooling.
Sanitize the data by clearing sensitive customer and order data with personally-identifiable information.
Run the Data Population Tool against the restored database using the update-db command to upgrade the schema and data to be compatible with the new target version.
Deploy the Self-Managed Commerce services, connecting to the database that was restored.

An example of how to do this using CloudOps for Kubernetes is as follows:

Create a snapshot of your prod database using the AWS console.
Run the create-and-manage-data-server Jenkins job, setting the snapshotToRestore parameter to the name of the snapshot generated in step 1.
Sanitize the data by clearing sensitive customer and order data with personally-identifiable information.
Run the run-data-pop-tool Jenkins job, setting the dataPopToolCommand parameter to update-db, and setting the kubernetesNickname value to the same value specified in step 2.
Run the deploy-or-delete-commerce-stack Jenkins job, setting the kubernetesNickname value to the same value specified in step 2.

Acceptance testing

Arrange with your business users to test the storefront and Commerce Manager using the new staging and prod environments to ensure that all expected functionality is working as expected. Also ensure that the business users are aware of the release highlights showing new features in the release.

Acceptance bug fixes

In this step, address any broken functionality reported by business users.

Performance testing

In this step, run a load test on your old and new staging environments and compare the results to ensure that performance has not degraded. To do this, we recommend using our Transactional Data Generator JMeter scripts. These scripts execute common commerce operations against the Cortex APIs and summarize results such as average response times and throughput.

Performance bug fixes

If any Cortex APIs are showing response time or throughput degradation, review the source code around these operations to identify the regression. We also recommend making use of the Query Analyzer to determine if any JPA queries are causing the performance regression.

Security testing

Review the deployment security document to ensure that your new prod environment follows security best practices. You may also want to consider a review or pen testing by your security team.

Disable prod traffic on existing prod environment

As the first step of the production cut over process, display a maintenance page on your front-end to prevent shoppers from transacting through Cortex. Also inform your business users to avoid making any changes through Commerce Manager.

Transfer and upgrade prod database snapshot

In this step, replace the existing (sanitized) prod database snapshot with the latest production database.

An example of how to do this using CloudOps for Kubernetes is as follows:

Create a snapshot of your existing prod database using the AWS console.
Run the create-and-manage-database-server Jenkins job to delete the existing target database, setting the kubernetesNickname value the name of the target prod environment, and setting the deleteDatabase parameter.
Run the create-and-manage-data-server Jenkins job, setting the snapshotToRestore parameter to the name of the snapshot generated in step 1.
Run the run-data-pop-tool Jenkins job, setting the dataPopToolCommand parameter to update-db, and setting the kubernetesNickname value to the same value specified in step 3.

note

If you are using author to live functionality, you may also need to repeat these steps for your author prod environment.

Enable prod traffic on new prod environment

Restart all services on your target prod environment. Configure your front-end to point to the new prod environment Cortex URLs. Remove the maintenance page.

Decommission old environments

In this step, delete the old service environments and databases that are no longer required.

An example of how to do this using CloudOps for Kubernetes is as follows:

Run the deploy-or-delete-commerce-stack Jenkins job for each old environment, setting the kubernetesNickname parameter to the name of environment, and setting the deleteStack parameter.
Run the create-and-manage-database-server Jenkins job for each old database, setting the kubernetesNickname value to the name of environment, and setting the deleteDatabase parameter.