[PATCH 0/7] EDAC/i10nm: Add RRL support for Intel Granite Rapids server

Qiuxu Zhuo posted 7 patches 8 months ago
drivers/edac/i10nm_base.c | 477 ++++++++++++++++++++++----------------
drivers/edac/skx_common.c |   1 +
drivers/edac/skx_common.h |  61 ++++-
3 files changed, 330 insertions(+), 209 deletions(-)
[PATCH 0/7] EDAC/i10nm: Add RRL support for Intel Granite Rapids server
Posted by Qiuxu Zhuo 8 months ago
The RRL (Retry Read error Log) registers provide additional error information
during DDR memory correction. This can assist memory DIMM vendors in isolating
problematic cells within the DIMM and help CSPs predict uncorrectable errors.
This patch series is to add RRL support for Intel Granite Rapids servers and
is structured as follows:

Patch 1-3: Bug fixes and cleanup.

Patch 4-6: Refactor the RRL (Retry Read error Log) code of previous generations
           and prepare for Patch 7.

Patch 7  : Add RRL support for Intel Granite Rapids server.

Qiuxu Zhuo (7):
  EDAC/skx_common: Fix general protection fault
  EDAC/{skx_common,i10nm}: Fix the loss of saved RRL for HBM pseudo channel 0
  EDAC/i10nm: Explicitly set the modes of the RRL register sets
  EDAC/{skx_common,i10nm}: Structure the per-channel RRL registers
  EDAC/{skx_common,i10nm}: Refactor enable_retry_rd_err_log()
  EDAC/{skx_common,i10nm}: Refactor show_retry_rd_err_log()
  EDAC/{skx_common,i10nm}: Add RRL support for Intel Granite Rapids server

 drivers/edac/i10nm_base.c | 477 ++++++++++++++++++++++----------------
 drivers/edac/skx_common.c |   1 +
 drivers/edac/skx_common.h |  61 ++++-
 3 files changed, 330 insertions(+), 209 deletions(-)


base-commit: 8ffd015db85fea3e15a77027fda6c02ced4d2444
-- 
2.43.0
RE: [PATCH 0/7] EDAC/i10nm: Add RRL support for Intel Granite Rapids server
Posted by Luck, Tony 7 months, 4 weeks ago
> The RRL (Retry Read error Log) registers provide additional error information
> during DDR memory correction. This can assist memory DIMM vendors in isolating
> problematic cells within the DIMM and help CSPs predict uncorrectable errors.
> This patch series is to add RRL support for Intel Granite Rapids servers and
> is structured as follows:
>
> Patch 1-3: Bug fixes and cleanup.
>
> Patch 4-6: Refactor the RRL (Retry Read error Log) code of previous generations
>            and prepare for Patch 7.
>
> Patch 7  : Add RRL support for Intel Granite Rapids server.

Applied to edac-drivers branch of RAS tree for next merge window.

Thanks

-Tony