From nobody Sun Feb 8 14:10:25 2026 Received: from mail-pj1-f73.google.com (mail-pj1-f73.google.com [209.85.216.73]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by smtp.subspace.kernel.org (Postfix) with ESMTPS id 6740732ABEC for ; Mon, 27 Oct 2025 22:09:23 +0000 (UTC) Authentication-Results: smtp.subspace.kernel.org; arc=none smtp.client-ip=209.85.216.73 ARC-Seal: i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1761602965; cv=none; b=Y2lsL6T6RxhD640pHvOwrUkSBy0cVT2QcoayGs63RtOw3VLFCGcINEw5Rq2oXzA8oiPfzV4xXHhk2cPrEyioh8cMvgIQ6cyoeNLDUeGtD0FWoKcluoJ2Crei4mCbHa8txZ/TJwsfS08FsQYE7RkbAhEgfPNH+7+mx74Cznx85gc= ARC-Message-Signature: i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1761602965; c=relaxed/simple; bh=SpOlLR5RnFo8iIKIWl4mbtRGkAQPWiArijnvyIW/b4g=; h=Date:In-Reply-To:Mime-Version:References:Message-ID:Subject:From: To:Cc:Content-Type; b=g0FMbblCdWzEgGUEv5a9kWtoPwp1UiqpmLN1gKxuwPt4LCoI9i+nBIBzW7biSKg4YsmWaFIlbyg6G2NDFn0t4AcOcTM95XO1Np4q7HwVjY31HyH0C8H+zqyrl/KWn7sLsy85GCGL9/kW45dX12iXkAm7wsddV9Cohejv0Xe6MJI= ARC-Authentication-Results: i=1; smtp.subspace.kernel.org; dmarc=pass (p=reject dis=none) header.from=google.com; spf=pass smtp.mailfrom=flex--xur.bounces.google.com; dkim=pass (2048-bit key) header.d=google.com header.i=@google.com header.b=EsKVfMpE; arc=none smtp.client-ip=209.85.216.73 Authentication-Results: smtp.subspace.kernel.org; dmarc=pass (p=reject dis=none) header.from=google.com Authentication-Results: smtp.subspace.kernel.org; spf=pass smtp.mailfrom=flex--xur.bounces.google.com Authentication-Results: smtp.subspace.kernel.org; dkim=pass (2048-bit key) header.d=google.com header.i=@google.com header.b="EsKVfMpE" Received: by mail-pj1-f73.google.com with SMTP id 98e67ed59e1d1-32ee62ed6beso8342050a91.2 for ; Mon, 27 Oct 2025 15:09:23 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20230601; t=1761602963; x=1762207763; darn=vger.kernel.org; h=cc:to:from:subject:message-id:references:mime-version:in-reply-to :date:from:to:cc:subject:date:message-id:reply-to; bh=MABumZwH6isM2C1P4GdxLdhOL8tfk2ZitqwVmPx6KfQ=; b=EsKVfMpEx53mqbQAb8irOVds+7ejjDWe8PHSK2szJOvI9p0z3NWJErRPjbuJB0JxIv KQclbYWBMwRgSMNnDP79knMm5ndHucUZP5HjeoTpxaN2PU/sI/OuPqhJ4XJ+GQIpPDCy lzHq3iPImnnzZIajrqmYkWgRtEoAnCeDUElK8XUTFqK1cQ3YeVPUFt55NJ5YEZ1k8uok IQwMa2xYwIfe7JI/Cq2AZuO9o0bXam7BHN2mYlHkU9Q63mmzQyOR3bAphNkE9ycPz+no hdCOcbBPkqSkbn0+Ah0r2n2EsjNY7K7eiGORgnuSoCHp3y6KZ2WmQ3NJeISNPpegtu/l 5Jxg== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1761602963; x=1762207763; h=cc:to:from:subject:message-id:references:mime-version:in-reply-to :date:x-gm-message-state:from:to:cc:subject:date:message-id:reply-to; bh=MABumZwH6isM2C1P4GdxLdhOL8tfk2ZitqwVmPx6KfQ=; b=RXspPAQ5elHix1bAhh7smhC6F+pzhZEQG0lW2r8irRROsN+C3bB3asjMIfIE5CetVi wbCucSa2S96Th8Hm3oTvdqRye4w5xvcZYzmJ/vIN5OaCVkIqS/rrn7FTRU5AKsUX87Wm k+ANSHtmyQStHgY9SQDhHgezMrTS1KhB5qdKFXQvKRnfO0CzCFe+tJKpC6N11aL3ekbT F7G5f+hXCP9wCtR/551EwgjYsUwTH9qD/KdWUMFbJhFeM+5q9kL/JJAae8411h0wfUGY HukjoXF4R0WYze0MWrUY70kbN7AmFgKcOfwqPbRF7NDW7Clzo8ul8ztyR5om1Z1mag67 vtEQ== X-Gm-Message-State: AOJu0Yy0/KxqB0hEvf9Kp28hnnFyhoaMQGS0CNa12YOY6mtTbJgUIVF9 0NlN//SkbrFZZruUd8qKe4U0fYJQUzL2nBw4TxfChOWQG+Qpd8UopP+l/MdlSHp5oUCBIw== X-Google-Smtp-Source: AGHT+IF0SDbxWzAiNDxQIpBAoHx/djb9jE9MDJkbzGhBOaTqIZMZmehEEKhvBV1cY6HmFg3/3gSb9Lg= X-Received: from pjbnm10.prod.google.com ([2002:a17:90b:19ca:b0:33b:b0fe:e54d]) (user=xur job=prod-delivery.src-stubby-dispatcher) by 2002:a17:90b:2e50:b0:32e:64ca:e84a with SMTP id 98e67ed59e1d1-340279f1b38mr1606499a91.12.1761602962615; Mon, 27 Oct 2025 15:09:22 -0700 (PDT) Date: Mon, 27 Oct 2025 22:09:14 +0000 In-Reply-To: <20251027220914.2665978-1-xur@google.com> Precedence: bulk X-Mailing-List: linux-kernel@vger.kernel.org List-Id: List-Subscribe: List-Unsubscribe: Mime-Version: 1.0 References: <20251027220914.2665978-1-xur@google.com> X-Mailer: git-send-email 2.51.1.838.g19442a804e-goog Message-ID: <20251027220914.2665978-3-xur@google.com> Subject: [PATCH v4 2/2] kbuild: distributed build support for Clang ThinLTO From: xur@google.com To: Masahiro Yamada , Nathan Chancellor , Nicolas Schier , Nick Desaulniers , Bill Wendling , Justin Stitt , Miguel Ojeda , Thomas Gleixner , Alice Ryhl , Sami Tolvanen , "Mike Rapoport (Microsoft)" , Rafael Aquini , Michael Ellerman , Stafford Horne , Christophe Leroy , Piotr Gorski , Rong Xu , Teresa Johnson Cc: linux-kernel@vger.kernel.org, linux-kbuild@vger.kernel.org, llvm@lists.linux.dev Content-Transfer-Encoding: quoted-printable Content-Type: text/plain; charset="utf-8" From: Rong Xu Add distributed ThinLTO build support for the Linux kernel. This new mode offers several advantages: (1) Increased flexibility in handling user-specified build options. (2) Improved user-friendliness for developers. (3) Greater convenience for integrating with objtool and livepatch. Note that "distributed" in this context refers to a term that differentiates in-process ThinLTO builds by invoking backend compilation through the linker, not necessarily building in distributed environments. Distributed ThinLTO is enabled via the `CONFIG_LTO_CLANG_THIN_DIST` Kconfig option. For example: > make LLVM=3D1 defconfig > scripts/config -e LTO_CLANG_THIN_DIST > make LLVM=3D1 oldconfig > make LLVM=3D1 vmlinux -j <..> The build flow proceeds in four stages: 1. Perform FE compilation, mirroring the in-process ThinLTO mode. 2. Thin-link the generated IR files and object files. 3. Find all IR files and perform BE compilation, using the flags stored in the .*.o.cmd files. 4. Link the BE results to generate the final vmlinux.o. NOTE: This patch currently implements the build for the main kernel image (vmlinux) only. Kernel module support is planned for a subsequent patch. Tested on the following arch: x86, arm64, loongarch, and riscv. The earlier implementation details can be found here: https://discourse.llvm.org/t/rfc-distributed-thinlto-build-for-kernel/85934 Signed-off-by: Rong Xu Co-developed-by: Masahiro Yamada Signed-off-by: Masahiro Yamada --- .gitignore | 2 ++ Makefile | 9 +++++---- arch/Kconfig | 19 ++++++++++++++++++ scripts/Makefile.lib | 7 +++++++ scripts/Makefile.thinlto | 40 ++++++++++++++++++++++++++++++++++++++ scripts/Makefile.vmlinux_a | 37 +++++++++++++++++++++++++++++++++++ scripts/mod/modpost.c | 15 +++++++++++--- 7 files changed, 122 insertions(+), 7 deletions(-) create mode 100644 scripts/Makefile.thinlto diff --git a/.gitignore b/.gitignore index 86a1ba0d90353..d34f28ca55400 100644 --- a/.gitignore +++ b/.gitignore @@ -55,6 +55,7 @@ *.zst Module.symvers dtbs-list +builtin.order modules.order =20 # @@ -66,6 +67,7 @@ modules.order /vmlinux.32 /vmlinux.map /vmlinux.symvers +/vmlinux.thinlto-index /vmlinux.unstripped /vmlinux-gdb.py /vmlinuz diff --git a/Makefile b/Makefile index 89a25bac2bbab..a608a082599d9 100644 --- a/Makefile +++ b/Makefile @@ -1000,10 +1000,10 @@ export CC_FLAGS_SCS endif =20 ifdef CONFIG_LTO_CLANG -ifdef CONFIG_LTO_CLANG_THIN -CC_FLAGS_LTO :=3D -flto=3Dthin -fsplit-lto-unit -else +ifdef CONFIG_LTO_CLANG_FULL CC_FLAGS_LTO :=3D -flto +else +CC_FLAGS_LTO :=3D -flto=3Dthin -fsplit-lto-unit endif CC_FLAGS_LTO +=3D -fvisibility=3Dhidden =20 @@ -1572,6 +1572,7 @@ endif # CONFIG_MODULES CLEAN_FILES +=3D vmlinux.symvers modules-only.symvers \ modules.builtin modules.builtin.modinfo modules.nsdeps \ modules.builtin.ranges vmlinux.o.map vmlinux.unstripped \ + vmlinux.thinlto-index builtin.order \ compile_commands.json rust/test \ rust-project.json .vmlinux.objs .vmlinux.export.c \ .builtin-dtbs-list .builtin-dtb.S @@ -2014,7 +2015,7 @@ clean: $(clean-dirs) $(call cmd,rmfiles) @find . $(RCS_FIND_IGNORE) \ \( -name '*.[aios]' -o -name '*.rsi' -o -name '*.ko' -o -name '.*.cmd' \ - -o -name '*.ko.*' \ + -o -name '*.ko.*' -o -name '*.o.thinlto.bc' \ -o -name '*.dtb' -o -name '*.dtbo' \ -o -name '*.dtb.S' -o -name '*.dtbo.S' \ -o -name '*.dt.yaml' -o -name 'dtbs-list' \ diff --git a/arch/Kconfig b/arch/Kconfig index 74ff011335322..f323ff2f12f87 100644 --- a/arch/Kconfig +++ b/arch/Kconfig @@ -864,6 +864,25 @@ config LTO_CLANG_THIN https://clang.llvm.org/docs/ThinLTO.html =20 If unsure, say Y. + +config LTO_CLANG_THIN_DIST + bool "Clang ThinLTO in distributed mode (EXPERIMENTAL)" + depends on HAS_LTO_CLANG && ARCH_SUPPORTS_LTO_CLANG_THIN + select LTO_CLANG + help + This option enables Clang's ThinLTO in distributed build mode. + In this mode, the linker performs the thin-link, generating + ThinLTO index files. Subsequently, the build system explicitly + invokes ThinLTO backend compilation using these index files + and pre-linked IR objects. The resulting native object files + are with the .thinlto-native.o suffix. + + This build mode offers improved visibility into the ThinLTO + process through explicit subcommand exposure. It also makes + final native object files directly available, benefiting + tools like objtool and kpatch. Additionally, it provides + crucial granular control over back-end options, enabling + module-specific compiler options, and simplifies debugging. endchoice =20 config ARCH_SUPPORTS_AUTOFDO_CLANG diff --git a/scripts/Makefile.lib b/scripts/Makefile.lib index 1d581ba5df66f..65470adaa89ca 100644 --- a/scripts/Makefile.lib +++ b/scripts/Makefile.lib @@ -239,6 +239,12 @@ ifdef CONFIG_LTO_CLANG cmd_ld_single =3D $(if $(objtool-enabled)$(is-single-obj-m), ; $(LD) $(ld_= flags) -r -o $(tmp-target) $@; mv $(tmp-target) $@) endif =20 +ifdef CONFIG_LTO_CLANG_THIN_DIST +# Save the _c_flags, sliently. +quiet_cmd_save_c_flags =3D + cmd_save_c_flags =3D printf '\n%s\n' 'saved_c_flags_$@ :=3D $(_c_fla= gs)' >> $(dot-target).cmd +endif + quiet_cmd_cc_o_c =3D CC $(quiet_modtag) $@ cmd_cc_o_c =3D $(CC) $(c_flags) -c -o $@ $< \ $(cmd_ld_single) \ @@ -246,6 +252,7 @@ quiet_cmd_cc_o_c =3D CC $(quiet_modtag) $@ =20 define rule_cc_o_c $(call cmd_and_fixdep,cc_o_c) + $(call cmd,save_c_flags) $(call cmd,checksrc) $(call cmd,checkdoc) $(call cmd,gen_objtooldep) diff --git a/scripts/Makefile.thinlto b/scripts/Makefile.thinlto new file mode 100644 index 0000000000000..03349ac69de5f --- /dev/null +++ b/scripts/Makefile.thinlto @@ -0,0 +1,40 @@ +PHONY :=3D __default +__default: + +include include/config/auto.conf +include $(srctree)/scripts/Kbuild.include +include $(srctree)/scripts/Makefile.lib + +native-objs :=3D $(patsubst %.o,%.thinlto-native.o,$(call read-file, vmlin= ux.thinlto-index)) + +__default: $(native-objs) + +# Generate .thinlto-native.o (obj) from .o (bitcode) and .thinlto.bc (summ= ary) files +# ------------------------------------------------------------------------= --- +quiet_cmd_cc_o_bc =3D CC $(quiet_modtag) $@ + be_flags =3D $(shell sed -n '/saved_c_flags_/s/.*:=3D //p' \ + $(dir $(<)).$(notdir $(<)).cmd) + cmd_cc_o_bc =3D \ + $(CC) $(be_flags) -x ir -fno-lto -Wno-unused-command-line-argument \ + -fthinlto-index=3D$(word 2, $^) -c -o $@ $< + +targets +=3D $(native-objs) +$(native-objs): %.thinlto-native.o: %.o %.o.thinlto.bc FORCE + $(call if_changed,cc_o_bc) + +# Add FORCE to the prerequisites of a target to force it to be always rebu= ilt. +# ------------------------------------------------------------------------= --- + +PHONY +=3D FORCE +FORCE: + +# Read all saved command lines and dependencies for the $(targets) we +# may be building above, using $(if_changed{,_dep}). As an +# optimization, we don't need to read them if the target does not +# exist, we will rebuild anyway in that case. + +existing-targets :=3D $(wildcard $(sort $(targets))) + +-include $(foreach f,$(existing-targets),$(dir $(f)).$(notdir $(f)).cmd) + +.PHONY: $(PHONY) diff --git a/scripts/Makefile.vmlinux_a b/scripts/Makefile.vmlinux_a index 9774f02b43b2f..382cebe1ee571 100644 --- a/scripts/Makefile.vmlinux_a +++ b/scripts/Makefile.vmlinux_a @@ -21,6 +21,41 @@ targets +=3D built-in-fixup.a built-in-fixup.a: $(KBUILD_VMLINUX_OBJS) scripts/head-object-list.txt FORCE $(call if_changed,ar_builtin_fixup) =20 +ifdef CONFIG_LTO_CLANG_THIN_DIST + +quiet_cmd_builtin.order =3D GEN $@ + cmd_builtin.order =3D $(AR) t $< > $@ + +targets +=3D builtin.order +builtin.order: built-in-fixup.a FORCE + $(call if_changed,builtin.order) + +quiet_cmd_ld_thinlto_index =3D LD $@ + cmd_ld_thinlto_index =3D \ + $(LD) $(KBUILD_LDFLAGS) -r --thinlto-index-only=3D$@ @$< + +targets +=3D vmlinux.thinlto-index +vmlinux.thinlto-index: builtin.order FORCE + $(call if_changed,ld_thinlto_index) + +quiet_cmd_ar_vmlinux.a =3D GEN $@ + cmd_ar_vmlinux.a =3D \ + rm -f $@; \ + while read -r obj; do \ + if grep -q $${obj} $(word 2, $^); then \ + echo $${obj%.o}.thinlto-native.o; \ + else \ + echo $${obj}; \ + fi; \ + done < $< | xargs $(AR) cDPrS --thin $@ + +targets +=3D vmlinux.a +vmlinux.a: builtin.order vmlinux.thinlto-index FORCE + $(Q)$(MAKE) -f $(srctree)/scripts/Makefile.thinlto + $(call if_changed,ar_vmlinux.a) + +else + # vmlinux.a # ------------------------------------------------------------------------= --- =20 @@ -28,6 +63,8 @@ targets +=3D vmlinux.a vmlinux.a: built-in-fixup.a FORCE $(call if_changed,copy) =20 +endif + # Add FORCE to the prerequisites of a target to force it to be always rebu= ilt. # ------------------------------------------------------------------------= --- =20 diff --git a/scripts/mod/modpost.c b/scripts/mod/modpost.c index 47c8aa2a69392..38f62eb4bd8ba 100644 --- a/scripts/mod/modpost.c +++ b/scripts/mod/modpost.c @@ -1473,13 +1473,22 @@ static void extract_crcs_for_object(const char *obj= ect, struct module *mod) char cmd_file[PATH_MAX]; char *buf, *p; const char *base; - int dirlen, ret; + int dirlen, baselen_without_suffix, ret; =20 base =3D get_basename(object); dirlen =3D base - object; =20 - ret =3D snprintf(cmd_file, sizeof(cmd_file), "%.*s.%s.cmd", - dirlen, object, base); + baselen_without_suffix =3D strlen(object) - dirlen - strlen(".o"); + + /* + * When CONFIG_LTO_CLANG_THIN_DIST=3Dy, the ELF is *.thinlto-native.o + * but the symbol CRCs are recorded in *.o.cmd file. + */ + if (strends(object, ".thinlto-native.o")) + baselen_without_suffix -=3D strlen(".thinlto-native"); + + ret =3D snprintf(cmd_file, sizeof(cmd_file), "%.*s.%.*s.o.cmd", + dirlen, object, baselen_without_suffix, base); if (ret >=3D sizeof(cmd_file)) { error("%s: too long path was truncated\n", cmd_file); return; --=20 2.51.1.838.g19442a804e-goog