From nobody Fri Jan 2 18:51:51 2026 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by smtp.lore.kernel.org (Postfix) with ESMTP id E5A6ECD6101 for ; Mon, 9 Oct 2023 16:11:44 +0000 (UTC) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1377149AbjJIQLo (ORCPT ); Mon, 9 Oct 2023 12:11:44 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:34002 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1376437AbjJIQLm (ORCPT ); Mon, 9 Oct 2023 12:11:42 -0400 Received: from mgamail.intel.com (mgamail.intel.com [192.55.52.120]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id C14469E; Mon, 9 Oct 2023 09:11:39 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=intel.com; i=@intel.com; q=dns/txt; s=Intel; t=1696867899; x=1728403899; h=from:to:cc:subject:date:message-id:mime-version: content-transfer-encoding; bh=HlqQNZO2lBLG5FDVJXSmzDmPTXKU/zhkIq27Tby8kpk=; b=XA3T9q18+VmJQnFWiSDrr3iWxj5RkSVFHRceBX6BymgNdezx8ZmakTgI MYGEbeea9ko3Nfn3IbDAjDAIrnFgH/pE7FGjImn0zKwF7vePIva5fOO/v BOLJ4XEEEQ5aP2Ry9Mzy2cUcUs263u4DBGeHzAMafLaB8LMPofGw78JHY q6sgrCl0xywjJ3Q7HaH3u5/ccrRiq49i70xlweM5m4W6231IXzo7Fhja/ p0xHleTYkygS4znrmHaaiFiCiGkBDqaPnt+jvRXat8Y0mfcXxHZb4dq1e ugWuvOoLkFECZY6tfNEBQDjNCvBKqIhDL2US8wuzV97JLwW/+op/dQUmW w==; X-IronPort-AV: E=McAfee;i="6600,9927,10858"; a="383057642" X-IronPort-AV: E=Sophos;i="6.03,210,1694761200"; d="scan'208";a="383057642" Received: from orsmga004.jf.intel.com ([10.7.209.38]) by fmsmga104.fm.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 09 Oct 2023 09:11:38 -0700 X-ExtLoop1: 1 X-IronPort-AV: E=McAfee;i="6600,9927,10858"; a="876856549" X-IronPort-AV: E=Sophos;i="6.03,210,1694761200"; d="scan'208";a="876856549" Received: from irvmail002.ir.intel.com ([10.43.11.120]) by orsmga004.jf.intel.com with ESMTP; 09 Oct 2023 09:11:34 -0700 Received: from lincoln.igk.intel.com (lincoln.igk.intel.com [10.102.21.235]) by irvmail002.ir.intel.com (Postfix) with ESMTP id 3ECE93635D; Mon, 9 Oct 2023 17:11:33 +0100 (IST) From: Larysa Zaremba To: bpf@vger.kernel.org, Martin KaFai Lau Cc: Larysa Zaremba , Alexei Starovoitov , Daniel Borkmann , Andrii Nakryiko , Song Liu , Yonghong Song , John Fastabend , KP Singh , Stanislav Fomichev , Hao Luo , Jiri Olsa , linux-kernel@vger.kernel.org, Maciej Fijalkowski Subject: [PATCH bpf-next] selftests/bpf: add options and frags to xdp_hw_metadata Date: Mon, 9 Oct 2023 18:05:16 +0200 Message-ID: <20231009160520.20831-1-larysa.zaremba@intel.com> X-Mailer: git-send-email 2.41.0 MIME-Version: 1.0 Content-Transfer-Encoding: quoted-printable Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Type: text/plain; charset="utf-8" This is a follow-up to the commit 9b2b86332a9b ("bpf: Allow to use kfunc XDP hints and frags together"). The are some possible implementations problems that may arise when providing metadata specifically for multi-buffer packets, therefore there must be a possibility to test such option separately. Add an option to use multi-buffer AF_XDP xdp_hw_metadata and mark used XDP program as capable to use frags. As for now, xdp_hw_metadata accepts no options, so add simple option parsing logic and a help message. For quick reference, also add an ingress packet generation command to the help message. The command comes from [0]. Example of output for multi-buffer packet: xsk_ring_cons__peek: 1 0xead018: rx_desc[15]->addr=3D10000000000f000 addr=3Df100 comp_addr=3Df000 rx_hash: 0x5789FCBB with RSS type:0x29 rx_timestamp: 1696856851535324697 (sec:1696856851.5353) XDP RX-time: 1696856843158256391 (sec:1696856843.1583) delta sec:-8.3771 (-8377068.306 usec) AF_XDP time: 1696856843158413078 (sec:1696856843.1584) delta sec:0.0002 (156.687 usec) 0xead018: complete idx=3D23 addr=3Df000 xsk_ring_cons__peek: 1 0xead018: rx_desc[16]->addr=3D100000000008000 addr=3D8100 comp_addr=3D8000 0xead018: complete idx=3D24 addr=3D8000 xsk_ring_cons__peek: 1 0xead018: rx_desc[17]->addr=3D100000000009000 addr=3D9100 comp_addr=3D9000 = EoP 0xead018: complete idx=3D25 addr=3D9000 Metadata is printed for the first packet only. [0] https://lore.kernel.org/all/20230119221536.3349901-18-sdf@google.com/ Signed-off-by: Larysa Zaremba --- .../selftests/bpf/progs/xdp_hw_metadata.c | 2 +- tools/testing/selftests/bpf/xdp_hw_metadata.c | 92 ++++++++++++++++--- 2 files changed, 79 insertions(+), 15 deletions(-) diff --git a/tools/testing/selftests/bpf/progs/xdp_hw_metadata.c b/tools/te= sting/selftests/bpf/progs/xdp_hw_metadata.c index 63d7de6c6bbb..8767d919c881 100644 --- a/tools/testing/selftests/bpf/progs/xdp_hw_metadata.c +++ b/tools/testing/selftests/bpf/progs/xdp_hw_metadata.c @@ -21,7 +21,7 @@ extern int bpf_xdp_metadata_rx_timestamp(const struct xdp= _md *ctx, extern int bpf_xdp_metadata_rx_hash(const struct xdp_md *ctx, __u32 *hash, enum xdp_rss_hash_type *rss_type) __ksym; =20 -SEC("xdp") +SEC("xdp.frags") int rx(struct xdp_md *ctx) { void *data, *data_meta, *data_end; diff --git a/tools/testing/selftests/bpf/xdp_hw_metadata.c b/tools/testing/= selftests/bpf/xdp_hw_metadata.c index 17c980138796..25225720346b 100644 --- a/tools/testing/selftests/bpf/xdp_hw_metadata.c +++ b/tools/testing/selftests/bpf/xdp_hw_metadata.c @@ -26,6 +26,7 @@ #include #include #include +#include #include #include =20 @@ -49,19 +50,29 @@ struct xsk { struct xdp_hw_metadata *bpf_obj; struct xsk *rx_xsk; const char *ifname; +bool use_frags; int ifindex; int rxq; =20 void test__fail(void) { /* for network_helpers.c */ } =20 -static int open_xsk(int ifindex, struct xsk *xsk, __u32 queue_id) +static struct xsk_socket_config gen_socket_config(void) { - int mmap_flags =3D MAP_PRIVATE | MAP_ANONYMOUS | MAP_NORESERVE; - const struct xsk_socket_config socket_config =3D { + struct xsk_socket_config socket_config =3D { .rx_size =3D XSK_RING_PROD__DEFAULT_NUM_DESCS, .tx_size =3D XSK_RING_PROD__DEFAULT_NUM_DESCS, .bind_flags =3D XDP_COPY, }; + + if (use_frags) + socket_config.bind_flags |=3D XDP_USE_SG; + return socket_config; +} + +static int open_xsk(int ifindex, struct xsk *xsk, __u32 queue_id) +{ + int mmap_flags =3D MAP_PRIVATE | MAP_ANONYMOUS | MAP_NORESERVE; + struct xsk_socket_config socket_config =3D gen_socket_config(); const struct xsk_umem_config umem_config =3D { .fill_size =3D XSK_RING_PROD__DEFAULT_NUM_DESCS, .comp_size =3D XSK_RING_CONS__DEFAULT_NUM_DESCS, @@ -263,11 +274,14 @@ static int verify_metadata(struct xsk *rx_xsk, int rx= q, int server_fd, clockid_t verify_skb_metadata(server_fd); =20 for (i =3D 0; i < rxq; i++) { + bool first_seg =3D true; + bool is_eop =3D true; + if (fds[i].revents =3D=3D 0) continue; =20 struct xsk *xsk =3D &rx_xsk[i]; - +peek: ret =3D xsk_ring_cons__peek(&xsk->rx, 1, &idx); printf("xsk_ring_cons__peek: %d\n", ret); if (ret !=3D 1) @@ -276,12 +290,19 @@ static int verify_metadata(struct xsk *rx_xsk, int rx= q, int server_fd, clockid_t rx_desc =3D xsk_ring_cons__rx_desc(&xsk->rx, idx); comp_addr =3D xsk_umem__extract_addr(rx_desc->addr); addr =3D xsk_umem__add_offset_to_addr(rx_desc->addr); - printf("%p: rx_desc[%u]->addr=3D%llx addr=3D%llx comp_addr=3D%llx\n", - xsk, idx, rx_desc->addr, addr, comp_addr); - verify_xdp_metadata(xsk_umem__get_data(xsk->umem_area, addr), - clock_id); + is_eop =3D !(rx_desc->options & XDP_PKT_CONTD); + printf("%p: rx_desc[%u]->addr=3D%llx addr=3D%llx comp_addr=3D%llx%s\n", + xsk, idx, rx_desc->addr, addr, comp_addr, is_eop ? " EoP" : ""); + if (first_seg) { + verify_xdp_metadata(xsk_umem__get_data(xsk->umem_area, addr), + clock_id); + first_seg =3D false; + } + xsk_ring_cons__release(&xsk->rx, 1); refill_rx(xsk, comp_addr); + if (!is_eop) + goto peek; } } =20 @@ -404,6 +425,54 @@ static void timestamping_enable(int fd, int val) error(1, errno, "setsockopt(SO_TIMESTAMPING)"); } =20 +static void print_usage(void) +{ + const char *usage =3D + " Usage: xdp_hw_metadata [OPTIONS] [IFNAME]\n" + " Options:\n" + " -m Enable multi-buffer XDP for larger MTU\n" + " -h Display this help and exit\n\n" + " Generate test packets on the other machine with:\n" + " echo -n xdp | nc -u -q1 9091\n"; + + printf("%s", usage); +} + +static void read_args(int argc, char *argv[]) +{ + char opt; + + while ((opt =3D getopt(argc, argv, "mh")) !=3D -1) { + switch (opt) { + case 'm': + use_frags =3D true; + break; + case 'h': + print_usage(); + exit(0); + case '?': + if (isprint(optopt)) + fprintf(stderr, "Unknown option: -%c\n", optopt); + fallthrough; + default: + print_usage(); + error(-1, opterr, "Command line options error"); + } + } + + if (optind >=3D argc) { + fprintf(stderr, "No device name provided\n"); + print_usage(); + exit(-1); + } + + ifname =3D argv[optind]; + ifindex =3D if_nametoindex(ifname); + + if (!ifname) + error(-1, errno, "Invalid interface name"); +} + int main(int argc, char *argv[]) { clockid_t clock_id =3D CLOCK_TAI; @@ -413,13 +482,8 @@ int main(int argc, char *argv[]) =20 struct bpf_program *prog; =20 - if (argc !=3D 2) { - fprintf(stderr, "pass device name\n"); - return -1; - } + read_args(argc, argv); =20 - ifname =3D argv[1]; - ifindex =3D if_nametoindex(ifname); rxq =3D rxq_num(ifname); =20 printf("rxq: %d\n", rxq); --=20 2.41.0