Nutanix, Inc.
High reliability in storage array systems
Last updated:
Abstract:
A storage system for providing zero data loss during failover and failback. A storage system comprises a dual controller that includes a first controller to process a first set of storage I/O operations and a first hardware timer. Redundancy is provided by a second controller that processes a second set of storage I/O operations. The second controller has a corresponding second hardware timer. Both the first controller and second controller each employ respective dual channel drive interfaces. The dual channel drives are arranged in an array. Each controller executes instructions to access at least two different portions of any one of the plurality of dual channel drives. In a failover scenario, when a controller suspects failure of a peer, a lockless signaling protocol is invoked. After a delay for a hardware timer timeout period, the protocol signals an intention to take over resources as soon as mutual exclusion is guaranteed.
Utility
15 Jun 2017
6 Aug 2019