By Andrew Witts, Infinite Blue, Senior Director of Alliances
Fig 1: Operational Resilience Flow
But this all sounds familiar to what we do, what’s the difference? How does this differ from disaster recovery or business continuity. Here’s differentiator; when we do an impact assessment and figure out a recovery time object for our applications or our processes, what does that mean? It means we need to recover after a disaster within that timeframe for the business to survive. Did you notice the key word there? “After”. Our focus is often how to recover, how quickly we can recover. But like with the heart analogy, wouldn’t it be better to be aware of metrics that could tell us or warn us that things were heading in the wrong direction so we could turn the proverbial ship around before the disaster happened? That sounds like a healthier and more proactive approach. But this is the real world, disaster can and WILL happen. All the seismographs that monitor the faults around the world do not prevent earthquakes. To that end, disaster recovery and business continuity can never go away, but perhaps we can add a layer to it that incorporates operational or maybe even enterprise resilience.
What if instead of just measuring how much time a process can be down before there’s a problem, we measure and monitor how many people are needed? How many people can call out sick before the process performance is degraded? How many products, transactions or cups of coffee can we produce if we down a person, a computer, a booth, a store, a manufacturing plant, or maybe a single source vendor? How will that increase or decrease customer satisfaction or breach SLAs if we’re seeing a trend towards the direction that will lead to a disaster or crisis? People I’ve spoken with in the industry have varying levels of program maturity in their organization. Some components of what has been mentioned here may already be part of your program. I would submit that if that’s the case, then the argument of operational resilience being a good practice has been made as you may already be doing part of it.
We have multiple different disciplines in the field of BCDR. I like to look at it with this example: If I asked an IT person to solve a problem, they might develop a program. If I asked a businessperson to solve the same problem they’d come up with a project. If I asked a first responder to solve the problem, they’d have a response to an emergency. We have different BCDR disciplines for a reason, they were solved by people with different purviews and focus. We have often found them operating independently and siloed with some overlap. I see resilience as a major step towards tying many of the disciplines together cohesively. Ultimately our goal is to ensure the viability and dare I say, “Resilience” of our organizations. I am very curious to see how it develops and if it truly becomes a best practice that is here to stay.