This patch set adds an optional TX CPU memcpy offload path to ntb_transport
and does a small cleanup. The new module parameter:
tx_memcpy_offload=<bool> (default: N)
When enabled, ntb_transport creates a per-QP kernel thread and schedules TX
memcpy_toio() work onto it.
On some platforms (e.g. Renesas R-Car S4), using a DMA engine in the TX
path does not always improve throughput, and memcpy_toio() becomes the
bottleneck on the CPU. Offloading the memcpy_toio() and the subsequent
descriptor writes to a per-QP kthread can raise the achievable throughput
by keeping the data path hot while letting the worker thread perform the
copy.
Koichiro Den (2):
NTB: ntb_transport: Remove unused 'retries' field from ntb_queue_entry
NTB: ntb_transport: Add 'tx_memcpy_offload' module option
drivers/ntb/ntb_transport.c | 115 ++++++++++++++++++++++++++++++++----
1 file changed, 102 insertions(+), 13 deletions(-)
--
2.48.1