Time To Recover And Reducing Cost Are Biggest Backup Challenges, Says Double-Take Software

Double-Take® Software (NASDAQ:DBTK) today announced that 57% of IT organisations see reducing recovery time in the event of IT failure and cutting the cost of backup as the two biggest painpoints for backup and disaster recovery.

[UKPRwire, Wed Nov 18 2009] Double-Take® Software (NASDAQ:DBTK) today announced that 57% of IT organisations see reducing recovery time in the event of IT failure and cutting the cost of backup as the two biggest painpoints for backup and disaster recovery. The next most significant difficulties were the ability to roll back to any point in time when recovering workloads and recovery testing.

The findings are the result of a survey into attitudes towards backup and the effect of virtualisation on business continuity strategies, conducted at the Storage Expo show held in London on 14-15 October. Virtualisation is already in place with the majority of those questioned, with 86 per cent of those surveyed having a virtual infrastructure in place within their organisations. Further findings from the survey include:

– Tape backup is the most popular technology involved for recovery of virtual machines, with 60 per cent of organisations relying on tape to protect their virtualisation implementations. 53 per cent of organisations are using disk-to-disk backup products, while proprietary virtualisation products such as VMware’s VMotion are used by 23 per cent
– 17 per cent of organisations are only using tape backup for the backup / recovery of their virtual machines
– The number of respondents that were able to judge their Recovery Point Objectives (RPO) when it came to virtualised environments was much lower than those able to define their Recovery Time Objectives (RTO) – only 45 per cent of those surveyed were able to state their satisfaction level around their RPOs

Commenting on the findings, Ian Masters, UK sales and marketing director, Double-Take Software said, “The growth of virtualisation into a mainstream technology has been due to the cost savings that server consolidation can deliver, but the backup and recovery process is still not well understood. Knowing how to recover workloads back to the optimum point in time involves using the best practices that exist in the physical DR space, as well as knowing how to exploit the benefits of virtualisation. Having an approach in place that can handle both physical and virtual workloads, and carry out recovery onto any platform, is one way to reduce the cost of backup, as well as reducing recovery time.”

“The percentage of organisations that are using only tape for their virtual machine backup and recovery is surprising,” Masters continued. “Tape backup covers a single point in time and was not designed for the demands of virtualisation when it comes to the recovery of workloads. There is therefore a big potential for data to be lost in the event of a failure, and the time taken to recover can be substantial. Disk-to-disk replication of virtual machine workloads offers a better level of protection, as this approach can provide real-time protection for data, as well as the ability to roll back to any point in time using continuous data protection or snapshots.”

The survey, carried out at Storage Expo by Double-Take Software, received 169 responses across a range of vertical markets and company sizes. Double-Take Software white papers on best practice DR and backup strategies for virtualised environments, reducing cost and recovery time and extracting more value through centralised backup are available here: http://www.doubletake.com/uk/resources/whitepapers/Pages/default.aspx “

For the full article, click here!

To learn more about Double-Take Backup, click here!

Double-Take 101: NIC Teaming and Full-Server Technologies

I get this question in one form or another from just about every client I visit. 

If the servers use NIC Teaming, will Double-Take Availability and Double-Take Backup be able to fail over/recover properly?  The user guide says it isn’t supported, but I have teaming on all my servers. Help!

Well, that’s half-right.  As per the user guides (and my own support knowledge), we may have an issue with NIC Teaming on a Target device, but never on a Source machine.  Let me explain.

With Double-Take Software’s System State Manager technology (SSM), during a failover or recovery, we do not overwrite information having to do with hardware systems on the Target device.  That allows SSM to safely move a system from a physical machine to a virtual machine – as just one example of cross-platform failover and recovery.

Normally, specific software solutions running on either the Source or Target servers (or Recovery Server in the case of Double-Take Backup) don’t impact our ability to properly move the system state from one hardware platform to the other. The exception is software-based hardware devices.  This is a tricky thing to describe, but as NIC teaming is one great example, we can use that to explain the problem.

A NIC team is two or more Network Interface Cards (NIC’s) that act as a single unit.  This is done at the OS level (Windows in this case) via software installed and configured to act as a virtualized hardware device.  So even though there are multiple physical devices, they appear to be a single, unified NIC at the Windows level.  This means that the drivers for these systems are used to address both the virtual NIC (the team) and the physical devices (the network cards in the server). Note, none of this refers to the virtual hardware in an ESX or Hyper-V guest machine, here we’re talking about the software systems in Windows that produce a virtual NIC device that Windows and installed applications talk to.

Since the virtual teamed NIC appears as a hardware device to Windows, we do not overwrite that system when SSM produces a failover or recovery event. What this means to you is that on a failover event, if the Target had teamed NIC’s before we began the failover, those teams would not get overwritten. In nearly all cases this results in the team being broken on failover. This doesn’t cause any lasting problems, and you can re-integrate the team from the server console, but it can be annoying to have to do this during an emergency situation.

Therefore, you will want to make sure the Target doesn’t have a NIC Team in place day-to-day.  It can have multiple NIC’s installed, but do not team them.  This will ensure that we can properly fail over or recover without disrupting networking activity during the process.  The server will come up with networking intact on one ore more individual NIC’s, which will let you resume normal operations until you can manually configure a NIC team after the failover event.  This is accomplished by our SSM and other functions applying the network information from the Source to one or more discreet NIC’s on the Target, or by keeping the Target NIC info in place and re-directing clients via DNS.  The Target will have a reduced overall throughput capacity until you re-create the Team after failover, but no other repercussions have been seen.

The Source servers don’t need to be altered. We can easily fail over from a server with a NIC team to one that does not use teamed NIC’s, so leave the production boxes as-is.  This only applies to the Target or Recovery Server, build it and maintain it without the Team configured.

Also, keep in mind that this only applies to systems using our SSM technology. Specifically that would be Double-Take Availability with Full Server Failover or Virtual Recovery Assistant features or Double-Take Backup with Full Server Protection enabled.  Other workflows and features like DTAM do not require any special consideration when it comes to NIC teaming at all.

5 Common Backup and Recovery Mistakes

Fri, Nov 13, 2009 By Brace Rennels

Server backup and recovery isn’t rocket surgery, but you would be surprised by the number of IT organizations that can’t get it right. Here are five of the most common mistakes that I have seen.

1. “Don’t worry, it’s Backup Up”

Famous last words! Is it really? Prove it! Just because you have run a backup procedure, throw in a fresh tape into the drive doesn’t necessarily mean that the server(s) are protected. Over confidence or misconceptions about backup are common mistakes. It’s like having a plan without ever testing. So, test it. If you have data backed up when was the last time you tried to restore it, is the data any good? How long did the process take? Don’t leave anything to chance. Make sure that your backup and recovery procedures are exercised as a part of your business continuity plan and test the integrity of the data that is on the backup media, whether that is disk or tape.

2. “I have everything backed up”

Great genius but do you really need a copy of everything on that server? Specifically a file share server because I can guarantee you that someone has their iTunes database backed up to their file share so they don’t lose their favorite Wilco song. Evaluating what needs to be backed up in order to recover servers to a functional level and is often overlooked as a part of the backup planning process. 2009 through 1020 is the beginning of the “Be Green” “only consume what you need” generation and having everything isn’t always what you need and is likely costing the company money to keep it all archived. Don’t waste tape or disk space backing up files that aren’t necessary to the recovery process. Excluding any multimedia (wmv, wav, mp3), temp files or duplicate files using data deduplication will only help assure the necessary data required to recover what is important to the company is protected.

3. “I can recover from Tape”

Really? Is that where you want it? I know that it is possible but did anyone check the integrity of the data on the tape? Nothing lasts forever and although tape has a great track record there are issues that can cause data on tape to be unrecoverable. I had the opportunity to speak with the CIO of a large gaming facility a few years ago and he was telling me how he was nervous that they had all of this tape archived at an offsite facility but worried about the integrity of the tape and their ability to be able to recover an entire data center if necessary. If the data is kept at an optimum temperature, with low humidity then it can probably last a lifetime but I’d like to meet the person who would want to perform this recovery. Not me nor anyone that I know wants to attempt to recover a datacenter from tape. There is just too high of a probability of something going wrong, from the restore order, data spanning multiple tapes and oh yea, we all remember operator error, right?

For the two other mistakes, click here!

To learn more, register for our upcoming webinar on “Remote Office Backup and Rapid Recovery Solutions…on a budget”.

Double-Take Software Webinar Registration

Double-Take® Software to Host Educational Webinar on Remote Office Backup

Southborough, Mass. – November 12, 2009 Double-Take® Software (NASDAQ: DBTK) today announced it will host an educational webinar titled, “Remote Office Backup and Rapid Recovery Solutions… on a Budget,” on November 19, 2009 at 11:00 a.m. Eastern Time.  Featuring Ryan Hulse, product manager, this webinar will cover Double-Take Software’s newest backup and on demand recovery solution, Double-Take® Backup, and how it solves branch office data protection challenges.
Branch office data protection amplifies traditional backup challenges. Hulse will discuss how Double-Take Backup removes those challenges and reduces costs by simplifying protection and recovery for critical systems while continuously replicating changes to data, applications and the source server’s operating system to protect the entire server.  Hulse will also highlight why Double-Take Backup is ideal for remote office and branch office backup scenarios and explain the solution’s unique capabilities and features, including:
• Continuous server backups
• Any-point-in-time CDP recovery
• Reducing and replacing tape dependencies
• Achieving an RPO of “near zero” data loss
• Elimination of backup windows
• Flexible recovery choices to dissimilar physical and virtual environments
• Complete protection for branch offices
Webinar Details
 
What:  Webinar: “Remote Office Backup and Rapid Recovery Solutions… on a Budget”
When:   Thursday, November 19, 2009; 11:00 a.m. ET
Who:   Ryan Hulse, product manager, Double-Take Software
Where:  http://tinyurl.com/yhvoacl
 
To learn more or register for this Double-Take Software webinar, please click here.
Also, Check out this video to learn why Double-Take Backup is better than Tape Backup!

Park Community Federal Credit Union Selects Double-Take® Software

Double-Take® Availability Provides Protection for Park Community’s Exchange and SQL Servers through Real-Time Replication
 
Southborough, Mass. – November 10, 2009 Double-Take® Software (NASDAQ: DBTK) announced today that Park Community Federal Credit Union has selected Double-Take® Availability to protect and provide high availability for its Exchange and SQL Servers, as well as all other servers in their environment. Following extensive research, Park Community selected Double-Take Availability because it was the only solution that could meet the credit union’s disaster recovery needs: ensuring the availability of applications like SQL Server and Exchange Server and effective protection of large quantities of data. 
 
Double-Take Availability provides cost-effective full-server protection and offers rapid recovery from any system outage. The software continuously captures changes as they happen and replicates those changes in real-time to another server, physical or virtual, at any location, locally or globally. With Double-Take Availability, companies like Park Community can quickly and efficiently implement a disaster recovery plan, which allows them to keep their businesses afloat if disaster or downtime strikes.  
 
“Implementing a disaster recovery plan is critical for any business.  Park Community, like many companies, needed a solution that provides continuous, real-time replication to their offsite DR facility for maximum protection against data loss and application downtime,” said Bob Roudebush, director of solutions engineering at Double-Take Software. “Double-Take Availability provided this level of protection in a cost effective way, using software-based protection that is easy to deploy and manage and leverages existing IT investments.”
 
Without the proper data protection solution in place, one that protects data continuously and also provides failover capabilities for the applications that create that data, system outages or site-wide disasters can result in hours or days of downtime and lost productivity. In fact, the Diffusion Group, a consumer technology research and strategic marketing firm, reported that 60 percent of companies that lose their data go out of business within 6 months of the disaster, and 72 percent of businesses that suffer major data loss go under within 24 months.
 
With a small IT staff and tight budget, Park Community required a solution that was efficient enough to protect a massive amount of data using the organization’s existing WAN infrastructure. Double-Take Availability allowed Park Community to replicate data on 30-plus virtual servers, including all data on one server that maintained one terabyte of business-critical data and over 20 million files. Park Community tried various other solutions, none of which were able to replicate in real-time without failing.  In frequent DR testing, Double-Take Availability allows Park Community to recover seamlessly within their strict recovery time objective (RTO) and recovery point objective (RPO) requirements. 
 
“We originally intended to protect only five or six highly critical servers,” said Eric Hubbard, network engineer at Park Community Federal Credit Union.  “However, because Double-Take Availability is affordable and didn’t require additional bandwidth, we were able to implement this disaster recovery solution across our entire server environment all within budget.”
 
For more information on Double-Take Software, go to www.doubletake.com
For more information on Double-Take Availability, go to http://tinyurl.com/ydnsqk5
For the full article, click here!
 

Double-Take 101: GeoCluster Passive Node Volumes

Following up on my last column on what you could do with GeoCluster (now part of Double-Take Availability); I’d like to take some time to explain an interesting phenomenon that end-users see in GC all the time. While inside the cluster manger GUI, GC disks behave exactly the same as shared-cluster disks, when you create a GC between two or more servers, you can see the disk volumes “attached” to the passive nodes in Windows.  This is much different behavior than a traditional shared-disk cluster, where the volumes only appear on the active node for that disk resource.

The reason this happens is due to the fact that GC needs to replicate data from the owning node for that disk resource to all passive nodes that could potentially become owners if resources are moved or fail over.  In a shared-disk cluster, no replication is performed, so the disks only need to be attached to one node at a time; but GC needs the volumes mounted for that replication to occur.  The end result is that if you open up My Computer or Windows Explorer on a passive node, you will see the volumes mounted at all times.  However, you will not be able to do anything with those volumes except view the contents on the drive.  This is less than even read-only privileges, as you won’t be able to do things necessary to open and read files, like change the Last Access or Last Modified dates.

The technology that allows a volume to be mounted, but not accidentally altered, is part of the GC solution set itself.  During configuration, GC inserts a blocking file system filter driver.  This allows GC to control the flow of read and write I/O to disk.  The system can be manually invoked in regular Double-Take Availability or Backup connections as well, but it is turned on by default for GC connections.  Only the node that currently owns the disk resource can bypass the block to read and write to the disk, replicating changes a the byte-level to all other potential owning nodes.

When an arbitration occurs, and the resources move to another node, the block is removed on that node only so that the cluster can resume normal operations.  Once the failed node is brought back into the cluster, GC will block that node’s volumes, allowing it to be re-synchronized and brought back into quiescence with the cluster as a whole.

The practical upshot of all this is the answer to two of our most common GeoCluster questions.  1) No, you cannot use the passive-node data for any purpose, even though you are able to view it in Windows Explorer. 2) Even though you can see the volumes and directories in Windows Explorer, replication will safely occur and the cluster will not be able to accidentally attach to the wrong node’s copy of the data.

Part 2: GeoCluster for Windows 2008 and Hyper-V

GeoCluster is cost effective and easily deployed as it integrates with the Microsoft cluster resources. This provides several benefits in the event of a node failure as the cluster resources can automatically roll the node, automatically start the application resources and bring the disk resource online. Some solutions state they can provide geographically dispersed clusters but are usually complex, cost prohibitive and not flexible due to the fiber channel connectivity required for back end disk based replication. And if using a hardware specific solution you may also loose the benefit of using the failover clustering console which easily manages all the cluster resource groups. Just because you have the ability to separate the nodes across town doesn’t mean that have eliminated the single point of failure and aren’t susceptible to a regional disasters.

Unless, you use GeoCluster which eliminates the need for shared disk and enables each cluster node to have its own set of data you won’t really have a true geographically dispersed cluster architecture.

With the addition of Hyper-V many data centers are now looking to create virtualized GeoClusters. This combination provides maximum flexibility as you not only have the ability to stretch the nodes across greater distances but GeoCluster provides that quick migration capabilities between Hyper-V hosts, near or far. Because GeoCluster can utilize existing physical, virtual, LAN/WAN IP infrastructure it provides the greatest flexibility and high availability foundation for future adoption of SaaS applications via the Cloud. Whether you use an existing cloud, point to point or satellite technology for communications, Double-Take Software provides the dynamic infrastructure for availability between physical or virtual hosts and geographically disperse cluster nodes for disaster recovery.

For more information on Failover Clustering with Microsoft Hyper-V check out the latest Microsoft blog.

For more information on Double-Take Software and GeoCluster click here.

Double-Take 101: What is a GeoCluster? How does it work with Microsoft Clustering?

Editor’s Note: This was taken from questions submitted to the Double-Take User Group and the email address info@doubletake.com. If you’d like to see your questions answered here in Double-Take 101: just reach out through either the group or via email.

Microsoft Clustering Services (MSCS) and related technologies have provided a stable, highly automated availability solution for many years now. As the technology has progressed through NT4 into Server 2000, 2003 and now 2008; more and better feature-sets have made MSCS an attractive solution for High Availability in a local environment. When looking at clustering, there are some limitations that have persisted, even into the current incarnation of MSCS, Server 2008 Failover Clustering (FC). While they do not detract from the overall usefulness of the clustering, they do create logical restrictions to how a cluster could traditionally be deployed.

The most well-known of these considerations is that a shared-disk architecture can both limit the physical distance between nodes of a cluster, and also impose a single copy of the data, creating a potential single-point-of-failure scenario. With some applications, such as Exchange 2007, application-based technologies have been able to overcome this requirement by altering how the cluster functions. This works for those specific apps, but doesn’t assist with other applications that are cluster-aware, but do not permit for Distributed Failover Clustering (DFC) natively like Exchange does.

GeoCluster, which was first introduced by Double-Take Software in the days of NT4, seeks to address this concern. Now a feature of the Double-Take Availability product line, GeoCluster extends Microsoft Clustering technologies on Server 2003 and 2008 to distribute data to more than one disk resource. This provides a higher level of data-redundancy to match the server-redundancy that the underlying cluster architecture provides.

Built to be configured along-side MSCS or DFC, GeoCluster allows you to configure volumes on each node which show as unified disk resources in the cluster manager. This allows you to use the GeoCluster replicated disks with any cluster-aware application, regardless of vendor. From a day-to-day perspective, the cluster acts identically to what you would see if you had a shared-disk architecture, but the similarities stop there.

GeoCluster uses the Double-Take Replication Engine to allow the owning node for each disk resource to replicate changes to data to all other potential owning nodes throughout the cluster. This ensures that each node which may be called upon to run the application will have a copy of all data that the application needs to run properly. GeoCluster supports the same number of potential nodes as the version of Windows will permit; so as long as the cluster design is valid for your OS, it will work with GeoCluster as well. By leveraging the Double-Take Replication Engine, we can provide byte-level, write-order intact copies, safely replicating data from one node to another even for complex applications and solution sets.

If an application on a node – or the node itself – fails, GeoCluster allows the native cluster technology to arbitrate which node should take over the responsibilities, and automatically begins protecting the data from that new owner to all surviving nodes listed as potential owners. Once the original node is fixed or replaced, the GeoCluster system will make sure a copy of the data is synced to it, and ongoing replication keeps it ready to resume its role once either you manually move the resources back or – if you have configured automatic moves to preferred owners – once the cluster moves the resources back. Safety systems integrated into GeoCluster will prohibit the Cluster from accidentally assigning resources to a node that is not properly protected, allowing for resumption of duties only after the data is intact on the original, repaired, node.

GeoCluster provides an automated and integrated methodology for replication of data between nodes of an MSCS or DFC, while allowing that cluster to function as if it were leveraging a shared disk architecture. As an added benefit, provided all hardware is cleared as Cluster Capable, the hardware for the individual nodes and their disk resources do not have to match. This allows you greater flexibility, more resiliency and extended reliability for highly critical applications that can take advantage of Microsoft Cluster Technologies

As always, you don’t have to take our word for it.  You can find out more right from Microsoft, check out this blog to see what the Microsoft Virtualization Team is doing with partners like Double-Take Software!

Part 1: GeoCluster for Windows 2008 and Hyper-V

Windows 2008 Failover Cluster services introduced some improvements over it’s predecessor MSCS. A 500ms roundtrip UDP heartbeat and residing on the same LAN/Subnet is no longer required which improves the ability to stretch the nodes greater distances for a true geographically dispersed cluster. There is still a shared disk requirement but with products like GeoCluster that allows each node to share and own their own copy of the data, the GeoCluster solution provides the ability to stretch the cluster nodes across farther distances for not only for Double-Take Availability but also disaster recovery.

Click Video below to see the Double-Take Software 101 Video on Failover Clustering and GeoCluster.

For more information on Failover Clustering with Microsoft Hyper-V check out the latest Microsoft blog.

For more information on Double-Take Software and GeoCluster click here.

Double-Take 101: Can I use Availability and Backup together?

Yes! Double-Take Availability and Double-Take Backup work very well together, and were designed from the ground up to be able to inter-operate without a hitch. Since they’re used for very different purposes, you will want to make sure you have the right product for your needs; but where that is both products, you can feel safe using both without worrying about conflicts.

Double-Take Availability is used for failover. It is typically one-to-one, and allows for you to get back up and running in about 30 minutes (less in many cases). You have to pre-define the failover target before you can get started.

Double-Take Backup is a recovery solution, everything replicates to one or more Repository Servers, from which you can restore individual files, directories or volumes at any point in time, or the entire server back to the same or different hardware.

Most of our clients have need for Double-Take Backup on nearly every server in their environment, but only a certain sub-set of servers need Double-Take Availability. When we built the Double-Take Software Suite version 5.2, we took this into account.

First, install and configure all Double-Take Availability solutions that you need. Once your source and target systems are up and running, and your availability jobs are mirrored and in a Protected state, you can install Double-Take Backup by using the activation key via the Double-Take Management Console and then launching the Double-Take Backup wizard from the Repository or some other machine (like your workstation).  The new Unified Installer for Double-Take means you don’t have to install any new code if Double-Take Availability has already been put onto the source servers. The Double-Take Backup wizard will take into account that Double-Take Availability is configured on the source, and take the appropriate action to not interfere. For servers that don’t need Double-Take Availability, just install Double-Take Backup directly, and then run the Double-Take Backup wizard to set up protection to a Repository Server.

One word of caution. If you are using the Double-Take Application Manager (DTAM) wizard, and you fail over for an extended period of time, you will want to re-run the Double-Take Backup wizard, setting up Backup protection from the DTAM target to the Repository Server to resume data protection while you are failed over. You can use the Double-Take Backup key from the original source to set up this protection.

So, for those servers that are hyper-sensitive to downtime, use both Double-Take Availability and Double-Take Backup. For all other servers, use Double-Take Backup alone to provide Rapid Recovery to repaired or replaced physical or virtual hardware.