Support Center
A multi-server environment will have more than one Docker Quickstart container running, with a total of more than one server. An external load-balancer will distribute the incoming traffic to one or more servers.
This is in contrast with single-server environments that have only one Zato container with one server.
Docker Quickstart is always the recommended way to run any Zato environment, from development, through testing, to production.
In very specific, niche cases, Zato support may advise that you use a different installation method. Otherwise, if you build your environments without support from Zato Source, always create them using Docker Quickstart.
Use multi-server environments if you would like to:
If you have more than one server in your environment, consider that all of them run independently of each other, which means that:
If you use publish/subscribe, note that message queues are not shared across servers and each server will effectively act as a separate message broker. It means that you should use pub/sub on only one server in the whole environment, unless you specifically want for each server to be an independent message broker.
If you use the scheduler, it must be started in only one container in the whole environment. All jobs will always run in this container's server. It also means that one-time tasks scheduled dynamically should only be created by that one server.
If you don't use either pub/sub or the scheduler, you can add multiple servers with an external load-balancer in front of them without any further considerations.
Most of the mainstream load-balancers will work well with Zato. The most commonly used ones are HAProxy, Envoy, NetScaler and AWS ELB.
There are two ways to implement the standby server:
Which way to choose will depend on your preferences and requirements. Consider that:
In either case, the container with the standby server is not started until it's needed. It's only a question whether the host should be always running in the background or not.
It's recommended to choose the first method initially - not keeping the standby host running - and switch to the other one if you decide that it will be beneficial in your case.
The very act of switching the traffic from the active server to the standby one will be implemented using your own provisioning scripts and CI/CD pipelines.
Each Zato container will give you monitoring endpoints that let you decide whether the components inside the container are running or whether the standby server should be started.
For instance - your monitoring tool will be pinging the active server's monitoring endpoint once in 30 seconds. If 3 pings are missed, it will trigger a CI/CD script. The script will stop a container with the first server, and then the same script will start a new container on the mirror host, turning it into a new active server, then the script will notify your monitoring product, and the load-balancer in front of the servers, with information about which server is the active one and which is the standby now.
In an environment of this type, there's more than one server that actively processes incoming requests. If you use pub/sub, you need to configure the containers to start pub/sub in only one of them. Similarly, if you use the scheduler, you need to configure the containers to start the scheduler on only one of them.
With this configuration, multiple servers manage incoming requests, ensuring High Availability. Moreover, the use of multiple servers enhances the environment's performance.
By default, a starting container will start pub/sub. To prevent it, pass the Zato_Start_PubSub=False
environment variable when the container is starting. If you don't set this flag, or if you set it to True, the container's server will start pub/sub.
Likewise, a starting container will by default start the scheduler. To prevent it, pass the Zato_Start_Scheduler=False
environment variable when the container is starting. If you don't set this flag, or if you set it to True, the container's server will start the scheduler.
For instance - if you have 5 servers, and you set the scheduler's flag to False on 4 of them, only the fifth one will start the scheduler.
There are no limits as to how many active servers you can have in an environment.
Servers don't need to have direct TCP connectivity enabled between them. It means that each can run behind a firewall, with a load-balancer in front of them all, and each server can potentially be installed in a different geographic region.
Note that the servers with pub/sub and the scheduler don't need to accept API requests from the load-balancer. If it suits your requirements better, you can just treat them as servers dedicated to these tasks, without the overhead of handling incoming requests.
Book a demo with an expert who will help you build meaningful systems that match your ambitions
"For me, Zato Source is the only technology partner to help with operational improvements."