From nobody Fri Sep 19 02:30:31 2025 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by smtp.lore.kernel.org (Postfix) with ESMTP id 7E424C4332F for ; Wed, 30 Nov 2022 05:37:53 +0000 (UTC) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S232959AbiK3Fhu (ORCPT ); Wed, 30 Nov 2022 00:37:50 -0500 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:43396 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229875AbiK3Fhn (ORCPT ); Wed, 30 Nov 2022 00:37:43 -0500 Received: from mail-qk1-x72c.google.com (mail-qk1-x72c.google.com [IPv6:2607:f8b0:4864:20::72c]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 860491C431; Tue, 29 Nov 2022 21:37:41 -0800 (PST) Received: by mail-qk1-x72c.google.com with SMTP id z17so11511316qki.11; Tue, 29 Nov 2022 21:37:41 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20210112; h=content-transfer-encoding:mime-version:references:in-reply-to :message-id:date:subject:cc:to:from:from:to:cc:subject:date :message-id:reply-to; bh=oKPR57ek3ooMmQ0+P1pqWsdpJjQbw5NulQVkSVNRu1E=; b=EV9IW20OrVRhM9suJigdFA5+18vgpaTzBQ4q+EXeow1qz2b+V1im7z7Z6un2WNmsTw mCjzcFHsC0O2l8XqSsqY6cXPq8fOO6TEHI00P3BZpTGlAzkOc3ZOiVwpuRiAZabZOYvl Lukmglq1AqQDcNY3fJjSBiw7OaJBjC4/xvxGajobULDjHKs+appLj7gV1Zi3/oBpivIB qzwz/UI2wntk8bkfbUiwElEYaV7OMR7MBUYSvkbbvDuKcSwFNkdHKKaWP9QDGFvnN4dQ SzPfMvpWUWg8Yu+b8+x1KJpSWgl8bmqyrtEqG4SbbwOKX8dqKY5e7JecV8COzAh7CMGS JpPw== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=content-transfer-encoding:mime-version:references:in-reply-to :message-id:date:subject:cc:to:from:x-gm-message-state:from:to:cc :subject:date:message-id:reply-to; bh=oKPR57ek3ooMmQ0+P1pqWsdpJjQbw5NulQVkSVNRu1E=; b=KXw+iYTz2YiGPc7mn8xYEbvsvQeLmcMNSRTs+3ay6ymf22zBtMbDF9H2EhYOKTMd6l smYQDEQjpmd0ImwgvKsd5oOviJsl+lYqEpfTy1E6JHUVJ/HXXFG3P5yCyI1lB3k+CGIW zrx9SRKdF03h8wUuUFEoCcIj2BLJdgZOPqCE/+B31NvLHlw7tksh8FyIr4oD1qAcu5OP +hBoUtYhp7SF6pefCeAoOj6aldrZ9mBaZS5PBJ8ABes1KmD/LByJ9n4QxH8zu0LB8xdw DYEQ6hsB7r9473cKdJaJk7PRoYRvYZpey5WosLNSqRWdmzxk2SmkjR+nzbk9P0S6/l3p FryA== X-Gm-Message-State: ANoB5pm4ki/H82HGAyIrr4GMbxQKnb7v4g6HT8+kNnzSqq8rTdc1kKdN EfG7t0H6h7Nvh9kwzYJHLg== X-Google-Smtp-Source: AA0mqf4oXB/zkTlgTxjuRLQfqUfakOmEWU22ocH6K0bhvbU2hXohDbj2ZXGfjq+i/zU0MPFGBLmkkw== X-Received: by 2002:ae9:e115:0:b0:6fc:2903:1dd1 with SMTP id g21-20020ae9e115000000b006fc29031dd1mr31838425qkm.232.1669786660561; Tue, 29 Nov 2022 21:37:40 -0800 (PST) Received: from bytedance.attlocal.net ([130.44.212.155]) by smtp.gmail.com with ESMTPSA id i11-20020ac8764b000000b003a611cb2a95sm321010qtr.9.2022.11.29.21.37.38 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Tue, 29 Nov 2022 21:37:40 -0800 (PST) From: Peilin Ye To: Alexander Viro Cc: Peilin Ye , Cong Wang , Andrew Morton , linux-fsdevel@vger.kernel.org, linux-kernel@vger.kernel.org, Peilin Ye Subject: [PATCH v3] coredump: Use vmsplice_to_pipe() for pipes in dump_emit_page() Date: Tue, 29 Nov 2022 21:37:34 -0800 Message-Id: <20221130053734.2811-1-yepeilin.cs@gmail.com> X-Mailer: git-send-email 2.25.1 In-Reply-To: <20221031210349.3346-1-yepeilin.cs@gmail.com> References: <20221031210349.3346-1-yepeilin.cs@gmail.com> MIME-Version: 1.0 Content-Transfer-Encoding: quoted-printable Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Type: text/plain; charset="utf-8" From: Peilin Ye Currently, there is a copy for each page when dumping VMAs to pipe handlers using dump_emit_page(). For example: fs/binfmt_elf.c:elf_core_dump() fs/coredump.c:dump_user_range() :dump_emit_page() fs/read_write.c:__kernel_write_iter() fs/pipe.c:pipe_write() lib/iov_iter.c:copy_page_from_iter() Use vmsplice_to_pipe() instead of __kernel_write_iter() to avoid this copy for pipe handlers. Tested by dumping a 32-GByte core into a simple handler that splice()s from stdin to disk in a loop, PIPE_DEF_BUFFERS (16) pages at a time. Before After Improved by Time to Completion 40.77 seconds 35.49 seconds 12.95% CPU Usage 92.27% 86.40% 6.36% Suggested-by: Cong Wang Signed-off-by: Peilin Ye --- change in v3: - do not rely on error checking in vmsplice_to_pipe() (Al Viro) - rebase onto linux-next change in v2: - fix warning in net/tls/tls_sw.c (kernel test robot) fs/coredump.c | 10 +++++++++- fs/splice.c | 4 ++-- include/linux/coredump.h | 3 +++ include/linux/splice.h | 3 +++ 4 files changed, 17 insertions(+), 3 deletions(-) diff --git a/fs/coredump.c b/fs/coredump.c index de78bde2991b..7f0981d71881 100644 --- a/fs/coredump.c +++ b/fs/coredump.c @@ -42,6 +42,7 @@ #include #include #include +#include =20 #include #include @@ -586,6 +587,8 @@ void do_coredump(const kernel_siginfo_t *siginfo) goto fail_unlock; } =20 + set_bit(COREDUMP_USE_PIPE, &cprm.flags); + if (cprm.limit =3D=3D 1) { /* See umh_pipe_setup() which sets RLIMIT_CORE =3D 1. * @@ -861,7 +864,12 @@ static int dump_emit_page(struct coredump_params *cprm= , struct page *page) return 0; pos =3D file->f_pos; iov_iter_bvec(&iter, ITER_SOURCE, &bvec, 1, PAGE_SIZE); - n =3D __kernel_write_iter(cprm->file, &iter, &pos); + + if (test_bit(COREDUMP_USE_PIPE, &cprm->flags)) + n =3D vmsplice_to_pipe(file, &iter, 0); + else + n =3D __kernel_write_iter(cprm->file, &iter, &pos); + if (n !=3D PAGE_SIZE) return 0; file->f_pos =3D pos; diff --git a/fs/splice.c b/fs/splice.c index 5969b7a1d353..c9be20f4115e 100644 --- a/fs/splice.c +++ b/fs/splice.c @@ -1234,8 +1234,8 @@ static long vmsplice_to_user(struct file *file, struc= t iov_iter *iter, * as splice-from-memory, where the regular splice is splice-from-file (or * to file). In both cases the output is a pipe, naturally. */ -static long vmsplice_to_pipe(struct file *file, struct iov_iter *iter, - unsigned int flags) +long vmsplice_to_pipe(struct file *file, struct iov_iter *iter, + unsigned int flags) { struct pipe_inode_info *pipe; long ret =3D 0; diff --git a/include/linux/coredump.h b/include/linux/coredump.h index d3eba4360150..3e34009487bf 100644 --- a/include/linux/coredump.h +++ b/include/linux/coredump.h @@ -28,8 +28,11 @@ struct coredump_params { int vma_count; size_t vma_data_size; struct core_vma_metadata *vma_meta; + unsigned long flags; }; =20 +#define COREDUMP_USE_PIPE 0 + /* * These are the only things you should do on a core-file: use only these * functions to write out all the necessary info. diff --git a/include/linux/splice.h b/include/linux/splice.h index a55179fd60fc..38b3560a318b 100644 --- a/include/linux/splice.h +++ b/include/linux/splice.h @@ -10,6 +10,7 @@ #define SPLICE_H =20 #include +#include =20 /* * Flags passed in from splice/tee/vmsplice @@ -81,6 +82,8 @@ extern ssize_t splice_direct_to_actor(struct file *, stru= ct splice_desc *, extern long do_splice(struct file *in, loff_t *off_in, struct file *out, loff_t *off_out, size_t len, unsigned int flags); +extern long vmsplice_to_pipe(struct file *file, struct iov_iter *iter, + unsigned int flags); =20 extern long do_tee(struct file *in, struct file *out, size_t len, unsigned int flags); --=20 2.20.1