SRM implementation with vSphere Replication

It has been way too long since my last post and I figured I would update what has been happening.

Project - SRM implementation with vSphere Replication

I had an opportunity to implement an VMware Site Recovery Manager implementation for a Disaster Recovery initiative at the company I am working at.

The main thing to understand is the SRM itself is not the Disaster Recovery tool itself, it is the workflow engine for the fail over and fail back.  The process for the fail over NEEDS to be fleshed out before this tool is installed.  So perform the fail over manually before SRM is installed and document this, even if this is conceptually.

The disparate storage back-end from the main office to the alternate site made array based replication not ideal (licensing cost didn't help either) so vSphere replication was chosen as the replication mechanism.


The design for SRM is fairly simple and the implementation requirements are also fairly straightforward:

  1. Have two sites with ESXi hosts
  2. Have two vCenters (same edition is critical)
  3. Have a methodology for your SQL installation 
    1. SQL server installed on the same Windows machine as the application 
    2. SQL server installed in a central instance 
  4. Have a good network connection between the sites 
  5. Ideally have a segregated network for Replication Traffic and SRM Management (not critical but good to have)
As we were wanting to do the replication of our Messaging (Exchange 2007) environment IP addressing was ideal to remain the same, so thinking about IP spanning across sites was also considered.  This was the dangerous consideration as determined by Cisco, citing numerous examples of spanning tree issues causing major outages. So other factors entered into the consideration of how this was to be designed.

The basics of the SRM implementation

SRM needs to talk to the Site vCenter, plain and simple.  The two vCenters shouldn't be in linked mode to so to produce less errors in case of a site failure.  So I determined for the second site, since this is a simple straight forward install the vCenter Appliance would be enough and would be quicker to install.  
  1. Download the vCenter appliance OVF
  2. Get a static IP for the appliance
  3. Install the OVF 
  4. Do initial setup using the built in Database
  5. Connect ESXi Hosts to the vCenter 
The infrastructure is setup for the receiving / recovery site. 
  1. Create a Windows Server that the SQL server will reside on 
  2. Install SQL server (it can not be SQL express) 
  3. If there is a policy to put the application server on the same windows box as the SQL server, install SRM on the same VMguest.
  4. During the install there is a section to install vSphere Replication, make sure to install the component for this whether or not you will use it as it will provide functionality in the future.
  5. Connect it to the vCenter and SRM will be available at the VIClient (not the Web Client)
  6. Once the vCenter is connected a "vSphere Plug-in" will be available, install it and go to the "solution"

vSphere Replication

This one wasn't intuitive to begin with but after you figure it out, it is quite simple, in fact too simple and does need some additional components to track and monitor the connection.

The appliance is the piece that performs the replication, it contains the Replication Manager and Replication Server.  Addition replication Servers can provides additional redundancy for the replication and isn't needed for small installs or few VMs to replicate.  
  1. Deploy the VR appliance (link on the Getting Started tab)
  2. The "Reconfigure" link opens a web browser and the link to the admin page of the VRA
  3. Configure, and register.
To initiate the replication, 
  1. Open the inventory tab and select the VM to be replicated.  
  2. Right click and select vSphere Replication
  3. This sets up the replication to the configuration settings desired.  
That's it, other than the Protection Groups and the Recovery plans.  Remember Protection Groups are the bundle of VMs to replicate / protect and the Recovery plan is the order to be recovered.  Once it is determined the application to be protected, initiate smaller protection groups and larger recovery plans.

I hope this helps those that are working through this.  Drop a comment if for questions, or comments.