From nobody Tue Dec 2 02:04:20 2025 Received: from us-smtp-delivery-124.mimecast.com (us-smtp-delivery-124.mimecast.com [170.10.129.124]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by smtp.subspace.kernel.org (Postfix) with ESMTPS id 8D08A28468D for ; Fri, 21 Nov 2025 02:00:49 +0000 (UTC) Authentication-Results: smtp.subspace.kernel.org; arc=none smtp.client-ip=170.10.129.124 ARC-Seal: i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1763690451; cv=none; b=nCuiTRmcGul2gHhFUJjl5UakR+y9hrg5pD4LcDVGRy0XPxk2mm5+S4XnRGD/odGP7oIcEqpszQDJASvCtsWy+4vlnDumB7+Fw3dV6Q2UZ2t261c+oacYTKvoCLByBifYfZyUepmNJoXF7eabz3tpf8kq+HT5cHvCwOQa7ttTZs0= ARC-Message-Signature: i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1763690451; c=relaxed/simple; bh=bpMj+5iJHb0HItYhdwJPBNL7yd6120JB6BmDfWc4IIw=; h=From:To:Cc:Subject:Date:Message-ID:In-Reply-To:References: MIME-Version; b=C35diUMxmpBeSBmyM1hGNx49PX7G5fddvYCXRgGCpDJLG+s42qMTtAWfZdcRe8EGCWR1+/uAtbE29Q6+g5FtSM5ExD6cwsa0OqoaF+tqu2zMUrhnbFd7HwM0LR976I+ZcztyC2oGAC9TZegPyeY8CGpjdP7s7drFZ1x0QBC4RHI= ARC-Authentication-Results: i=1; smtp.subspace.kernel.org; dmarc=pass (p=quarantine dis=none) header.from=redhat.com; spf=pass smtp.mailfrom=redhat.com; dkim=pass (1024-bit key) header.d=redhat.com header.i=@redhat.com header.b=GsntFpQt; arc=none smtp.client-ip=170.10.129.124 Authentication-Results: smtp.subspace.kernel.org; dmarc=pass (p=quarantine dis=none) header.from=redhat.com Authentication-Results: smtp.subspace.kernel.org; spf=pass smtp.mailfrom=redhat.com Authentication-Results: smtp.subspace.kernel.org; dkim=pass (1024-bit key) header.d=redhat.com header.i=@redhat.com header.b="GsntFpQt" DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=redhat.com; s=mimecast20190719; t=1763690447; h=from:from:reply-to:subject:subject:date:date:message-id:message-id: to:to:cc:cc:mime-version:mime-version: content-transfer-encoding:content-transfer-encoding: in-reply-to:in-reply-to:references:references; bh=8yAOTot2+IdfJy0TEgJ6Iw4zloe0w//2T4DOJDh9HAg=; b=GsntFpQt4idx3sJtb9xJl17RrhyQAITmmWp16KXP+ZUvLZcENJkLf5+sx2zp3L3YvOUJEZ E1U9Afa0/86GIKLczhxdxAzfnrkmzk9BABO2+qSiJaYUf75lNnyc1lCunFK/DcyBqa29Ae BJvcr70yWYU2v8g/FJtgIefPexiq6do= Received: from mx-prod-mc-03.mail-002.prod.us-west-2.aws.redhat.com (ec2-54-186-198-63.us-west-2.compute.amazonaws.com [54.186.198.63]) by relay.mimecast.com with ESMTP with STARTTLS (version=TLSv1.3, cipher=TLS_AES_256_GCM_SHA384) id us-mta-139-_16O6borOS-k_ymtuWLJ8A-1; Thu, 20 Nov 2025 21:00:41 -0500 X-MC-Unique: _16O6borOS-k_ymtuWLJ8A-1 X-Mimecast-MFC-AGG-ID: _16O6borOS-k_ymtuWLJ8A_1763690440 Received: from mx-prod-int-01.mail-002.prod.us-west-2.aws.redhat.com (mx-prod-int-01.mail-002.prod.us-west-2.aws.redhat.com [10.30.177.4]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange X25519 server-signature RSA-PSS (2048 bits) server-digest SHA256) (No client certificate requested) by mx-prod-mc-03.mail-002.prod.us-west-2.aws.redhat.com (Postfix) with ESMTPS id 59DA21956048; Fri, 21 Nov 2025 02:00:40 +0000 (UTC) Received: from localhost (unknown [10.72.116.211]) by mx-prod-int-01.mail-002.prod.us-west-2.aws.redhat.com (Postfix) with ESMTP id 869B130044DB; Fri, 21 Nov 2025 02:00:39 +0000 (UTC) From: Ming Lei To: Jens Axboe , linux-block@vger.kernel.org Cc: Caleb Sander Mateos , Uday Shankar , Stefani Seibold , Andrew Morton , linux-kernel@vger.kernel.org, Ming Lei Subject: [PATCH V4 24/27] selftests: ublk: handle UBLK_U_IO_COMMIT_IO_CMDS Date: Fri, 21 Nov 2025 09:58:46 +0800 Message-ID: <20251121015851.3672073-25-ming.lei@redhat.com> In-Reply-To: <20251121015851.3672073-1-ming.lei@redhat.com> References: <20251121015851.3672073-1-ming.lei@redhat.com> Precedence: bulk X-Mailing-List: linux-kernel@vger.kernel.org List-Id: List-Subscribe: List-Unsubscribe: MIME-Version: 1.0 Content-Transfer-Encoding: quoted-printable X-Scanned-By: MIMEDefang 3.4.1 on 10.30.177.4 Content-Type: text/plain; charset="utf-8" Implement UBLK_U_IO_COMMIT_IO_CMDS to enable efficient batched completion of I/O operations in the batch I/O framework. This completes the batch I/O infrastructure by adding the commit phase that notifies the kernel about completed I/O operations: Key features: - Batch multiple I/O completions into single UBLK_U_IO_COMMIT_IO_CMDS - Dynamic commit buffer allocation and management per thread - Automatic commit buffer preparation before processing events - Commit buffer submission after processing completed I/Os - Integration with existing completion workflows Implementation details: - ublk_batch_prep_commit() allocates and initializes commit buffers - ublk_batch_complete_io() adds completed I/Os to current batch - ublk_batch_commit_io_cmds() submits batched completions to kernel - Modified ublk_process_io() to handle batch commit lifecycle - Enhanced ublk_complete_io() to route to batch or legacy completion The commit buffer stores completion information (tag, result, buffer details) for multiple I/Os, then submits them all at once, significantly reducing syscall overhead compared to individual I/O completions. Signed-off-by: Ming Lei --- tools/testing/selftests/ublk/batch.c | 74 ++++++++++++++++++++++++++-- tools/testing/selftests/ublk/kublk.c | 8 ++- tools/testing/selftests/ublk/kublk.h | 69 +++++++++++++++++--------- 3 files changed, 122 insertions(+), 29 deletions(-) diff --git a/tools/testing/selftests/ublk/batch.c b/tools/testing/selftests= /ublk/batch.c index 01f00c21dfdb..e240d4decedf 100644 --- a/tools/testing/selftests/ublk/batch.c +++ b/tools/testing/selftests/ublk/batch.c @@ -174,7 +174,7 @@ static void ublk_init_batch_cmd(struct ublk_thread *t, = __u16 q_id, cmd->elem_bytes =3D elem_bytes; cmd->nr_elem =3D nr_elem; =20 - user_data =3D build_user_data(buf_idx, _IOC_NR(op), 0, q_id, 0); + user_data =3D build_user_data(buf_idx, _IOC_NR(op), nr_elem, q_id, 0); io_uring_sqe_set_data64(sqe, user_data); =20 t->cmd_inflight +=3D 1; @@ -244,9 +244,11 @@ static void ublk_batch_compl_commit_cmd(struct ublk_th= read *t, =20 if (op =3D=3D _IOC_NR(UBLK_U_IO_PREP_IO_CMDS)) ublk_assert(cqe->res =3D=3D 0); - else if (op =3D=3D _IOC_NR(UBLK_U_IO_COMMIT_IO_CMDS)) - ;//assert(cqe->res =3D=3D t->commit_buf_size); - else + else if (op =3D=3D _IOC_NR(UBLK_U_IO_COMMIT_IO_CMDS)) { + int nr_elem =3D user_data_to_tgt_data(cqe->user_data); + + ublk_assert(cqe->res =3D=3D t->commit_buf_elem_size * nr_elem); + } else ublk_assert(0); =20 ublk_free_commit_buf(t, buf_idx); @@ -263,3 +265,67 @@ void ublk_batch_compl_cmd(struct ublk_thread *t, return; } } + +void ublk_batch_commit_io_cmds(struct ublk_thread *t) +{ + struct io_uring_sqe *sqe; + unsigned short buf_idx; + unsigned short nr_elem =3D t->commit.done; + + /* nothing to commit */ + if (!nr_elem) { + ublk_free_commit_buf(t, t->commit.buf_idx); + return; + } + + ublk_io_alloc_sqes(t, &sqe, 1); + buf_idx =3D t->commit.buf_idx; + sqe->addr =3D (__u64)t->commit.elem; + sqe->len =3D nr_elem * t->commit_buf_elem_size; + + /* commit isn't per-queue command */ + ublk_init_batch_cmd(t, t->commit.q_id, sqe, UBLK_U_IO_COMMIT_IO_CMDS, + t->commit_buf_elem_size, nr_elem, buf_idx); + ublk_setup_commit_sqe(t, sqe, buf_idx); +} + +static void ublk_batch_init_commit(struct ublk_thread *t, + unsigned short buf_idx) +{ + /* so far only support 1:1 queue/thread mapping */ + t->commit.q_id =3D t->idx; + t->commit.buf_idx =3D buf_idx; + t->commit.elem =3D ublk_get_commit_buf(t, buf_idx); + t->commit.done =3D 0; + t->commit.count =3D t->commit_buf_size / + t->commit_buf_elem_size; +} + +void ublk_batch_prep_commit(struct ublk_thread *t) +{ + unsigned short buf_idx =3D ublk_alloc_commit_buf(t); + + ublk_assert(buf_idx !=3D UBLKS_T_COMMIT_BUF_INV_IDX); + ublk_batch_init_commit(t, buf_idx); +} + +void ublk_batch_complete_io(struct ublk_thread *t, struct ublk_queue *q, + unsigned tag, int res) +{ + struct batch_commit_buf *cb =3D &t->commit; + struct ublk_batch_elem *elem =3D (struct ublk_batch_elem *)(cb->elem + + cb->done * t->commit_buf_elem_size); + struct ublk_io *io =3D &q->ios[tag]; + + ublk_assert(q->q_id =3D=3D t->commit.q_id); + + elem->tag =3D tag; + elem->buf_index =3D ublk_batch_io_buf_idx(t, q, tag); + elem->result =3D res; + + if (!ublk_queue_no_buf(q)) + elem->buf_addr =3D (__u64) (uintptr_t) io->buf_addr; + + cb->done +=3D 1; + ublk_assert(cb->done <=3D cb->count); +} diff --git a/tools/testing/selftests/ublk/kublk.c b/tools/testing/selftests= /ublk/kublk.c index e981fcf18475..6565e804679c 100644 --- a/tools/testing/selftests/ublk/kublk.c +++ b/tools/testing/selftests/ublk/kublk.c @@ -852,7 +852,13 @@ static int ublk_process_io(struct ublk_thread *t) return -ENODEV; =20 ret =3D io_uring_submit_and_wait(&t->ring, 1); - reapped =3D ublk_reap_events_uring(t); + if (ublk_thread_batch_io(t)) { + ublk_batch_prep_commit(t); + reapped =3D ublk_reap_events_uring(t); + ublk_batch_commit_io_cmds(t); + } else { + reapped =3D ublk_reap_events_uring(t); + } =20 ublk_dbg(UBLK_DBG_THREAD, "submit result %d, reapped %d stop %d idle %d\n= ", ret, reapped, (t->state & UBLKS_T_STOPPING), diff --git a/tools/testing/selftests/ublk/kublk.h b/tools/testing/selftests= /ublk/kublk.h index 51fad0f4419b..0a355653d64c 100644 --- a/tools/testing/selftests/ublk/kublk.h +++ b/tools/testing/selftests/ublk/kublk.h @@ -182,6 +182,14 @@ struct ublk_batch_elem { __u64 buf_addr; }; =20 +struct batch_commit_buf { + unsigned short q_id; + unsigned short buf_idx; + void *elem; + unsigned short done; + unsigned short count; +}; + struct ublk_thread { struct ublk_dev *dev; unsigned idx; @@ -207,6 +215,7 @@ struct ublk_thread { void *commit_buf; #define UBLKS_T_COMMIT_BUF_INV_IDX ((unsigned short)-1) struct allocator commit_buf_alloc; + struct batch_commit_buf commit; =20 struct io_uring ring; }; @@ -416,30 +425,6 @@ static inline struct ublk_io *ublk_get_io(struct ublk_= queue *q, unsigned tag) return &q->ios[tag]; } =20 -static inline int ublk_complete_io(struct ublk_thread *t, struct ublk_queu= e *q, - unsigned tag, int res) -{ - struct ublk_io *io =3D &q->ios[tag]; - - ublk_mark_io_done(io, res); - - return ublk_queue_io_cmd(t, io); -} - -static inline void ublk_queued_tgt_io(struct ublk_thread *t, struct ublk_q= ueue *q, - unsigned tag, int queued) -{ - if (queued < 0) - ublk_complete_io(t, q, tag, queued); - else { - struct ublk_io *io =3D ublk_get_io(q, tag); - - t->io_inflight +=3D queued; - io->tgt_ios =3D queued; - io->result =3D 0; - } -} - static inline int ublk_completed_tgt_io(struct ublk_thread *t, struct ublk_queue *q, unsigned tag) { @@ -493,6 +478,42 @@ int ublk_batch_alloc_buf(struct ublk_thread *t); /* Free commit buffers and cleanup batch allocator */ void ublk_batch_free_buf(struct ublk_thread *t); =20 +/* Prepare a new commit buffer for batching completed I/O operations */ +void ublk_batch_prep_commit(struct ublk_thread *t); +/* Submit UBLK_U_IO_COMMIT_IO_CMDS with batched completed I/O operations */ +void ublk_batch_commit_io_cmds(struct ublk_thread *t); +/* Add a completed I/O operation to the current batch commit buffer */ +void ublk_batch_complete_io(struct ublk_thread *t, struct ublk_queue *q, + unsigned tag, int res); + +static inline int ublk_complete_io(struct ublk_thread *t, struct ublk_queu= e *q, + unsigned tag, int res) +{ + if (ublk_queue_batch_io(q)) { + ublk_batch_complete_io(t, q, tag, res); + return 0; + } else { + struct ublk_io *io =3D &q->ios[tag]; + + ublk_mark_io_done(io, res); + return ublk_queue_io_cmd(t, io); + } +} + +static inline void ublk_queued_tgt_io(struct ublk_thread *t, struct ublk_q= ueue *q, + unsigned tag, int queued) +{ + if (queued < 0) + ublk_complete_io(t, q, tag, queued); + else { + struct ublk_io *io =3D ublk_get_io(q, tag); + + t->io_inflight +=3D queued; + io->tgt_ios =3D queued; + io->result =3D 0; + } +} + extern const struct ublk_tgt_ops null_tgt_ops; extern const struct ublk_tgt_ops loop_tgt_ops; extern const struct ublk_tgt_ops stripe_tgt_ops; --=20 2.47.0