[PATCH 00/14] scsi: scsi_error: Introduce new error handle mechanism

JiangJianJun posted 14 patches 1 month ago
Only 0 patches received!
There is a newer version of this series
[PATCH 00/14] scsi: scsi_error: Introduce new error handle mechanism
Posted by JiangJianJun 1 month ago
>Barely half of your emails have made it through for me and they landed in my
>spam folder. So please check your email setup.

I also find it strange, but my colleague can receive it. Maybe i reset email
and send again? 

>Also, was this all tested with libata and libsas attached devices as well ?
>They all depend on scsi EH.

There is currently no tool available for injecting faults into hard drives,
but we have implemented this solution in our company's products. So i just
test with scsi_debug.
Re: [PATCH 00/14] scsi: scsi_error: Introduce new error handle mechanism
Posted by Damien Le Moal 1 month ago
On 9/2/25 2:30 PM, JiangJianJun wrote:
>> Barely half of your emails have made it through for me and they landed in my
>> spam folder. So please check your email setup.
> 
> I also find it strange, but my colleague can receive it. Maybe i reset email
> and send again? 
> 
>> Also, was this all tested with libata and libsas attached devices as well ?
>> They all depend on scsi EH.
> 
> There is currently no tool available for injecting faults into hard drives,
> but we have implemented this solution in our company's products. So i just
> test with scsi_debug.

Use write long command to "destroy" sectors. Then try to read them. That will
generate uncorrectable read errors.

See sg_write_long (sg3utils).


-- 
Damien Le Moal
Western Digital Research
[PATCH 00/14] scsi: scsi_error: Introduce new error handle mechanism
Posted by JiangJianJun 1 month ago
>Use write long command to "destroy" sectors. Then try to read them. That will
>generate uncorrectable read errors.

There is a misunderstanding here, the condition that triggers this error
handler is when the device is slow or unresponsive, or fails to start;
bad blocks refer to data errors rather than faults.
Re: [PATCH 00/14] scsi: scsi_error: Introduce new error handle mechanism
Posted by Damien Le Moal 1 month ago
On 9/2/25 3:03 PM, JiangJianJun wrote:
>> Use write long command to "destroy" sectors. Then try to read them. That will
>> generate uncorrectable read errors.
> 
> There is a misunderstanding here, the condition that triggers this error
> handler is when the device is slow or unresponsive, or fails to start;
> bad blocks refer to data errors rather than faults.

You probably easilly can simulate an unresponsive device using either qemu or
tcmu-runner.


-- 
Damien Le Moal
Western Digital Research