Advanced Micro Devices, Inc.
TOLERATING MEMORY STACK FAILURES IN MULTI-STACK SYSTEMS

Last updated:

Abstract:

Memory management circuitry and processes operate to improve reliability of a group of memory stacks, providing that if a memory stack or a portion thereof fails during the product's lifetime, the system may still recover with no errors or data loss. A front-end controller receives a block of data requested to be written to memory, divides the block into sub-blocks, and creates a new redundant reliability sub-block. The sub-blocks are then written to different memory stacks. When reading data from the memory stacks, the front-end controller detects errors indicating a failure within one of the memory stacks, and recovers corrected data using the reliability sub-block. The front-end controller may monitor errors for signs of a stack failure and disable the failed stack.

Status:
Application
Type:

Utility

Filling date:

31 Oct 2018

Issue date:

30 Apr 2020