[v1] LLMinus: LLM-Assisted Merge Conflict Resolution

[RFC 0/5] LLMinus: LLM-Assisted Merge Conflict Resolution

Posted by Sasha Levin 1 month, 2 weeks ago

At the 2025 Maintainer's Summit, there was discussion around the various hats
Linus wears in the community. One of them is the hat of the person who merges
commits into master and resolves conflicts.

Linus made an interesting observation: he enjoys doing merges in C and has
become exceptionally good at it through decades of experience - he can "do them
in his sleep". But he also observed that merges in Rust are more difficult as
he's not familiar enough with the language. He tries to resolve them himself,
then refers back to linux-next's resolution. When his resolution doesn't match,
he uses it as a teaching moment.

This observation points to something fundamental about merge conflict
resolution: it is the epitome of understanding code. To resolve a conflict, one
must understand why the divergence occurred, what the developers on each side
were trying to accomplish, and then unify the divergence in a way that makes
the final code equal to or better than the sum of both parts.

LLMinus is a tool designed to support a maintainer's decision making around
merge conflict resolution by learning from past merges and by investigating
the different branches to understand the underlying reason behind a conflict.

LLMinus learns from the kernel's git history, extracting cases where manual
conflict resolution was required. For each historical merge, it captures what
each branch changed and how the conflict was resolved.  These resolutions are
converted into semantic embeddings, creating a searchable knowledge base of
past merge patterns.
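
As a rough illustration of the "searchable knowledge base" idea, the sketch
below ranks stored resolutions by cosine similarity to a query vector. The
vectors are tiny hand-written stand-ins for real embeddings (the series uses
fastembed for those), and Resolution, cosine and find_similar are hypothetical
names rather than the tool's actual interface.

```python
import math
from dataclasses import dataclass


@dataclass
class Resolution:
    """One historical merge resolution plus its embedding (toy data)."""
    commit: str              # placeholder identifier, not a real commit
    summary: str
    embedding: list[float]


def cosine(a: list[float], b: list[float]) -> float:
    """Cosine similarity; returns 0.0 if either vector has zero length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def find_similar(query: list[float], store: list[Resolution], top_k: int = 3):
    """Return the top_k stored resolutions, most similar first."""
    ranked = sorted(store, key=lambda r: cosine(query, r.embedding), reverse=True)
    return ranked[:top_k]


# Hypothetical knowledge base with three-dimensional toy embeddings.
store = [
    Resolution("merge-a", "renumber overlapping enum IDs", [0.9, 0.1, 0.0]),
    Resolution("merge-b", "adapt caller to refactored API", [0.1, 0.9, 0.2]),
    Resolution("merge-c", "keep both added Kconfig entries", [0.7, 0.2, 0.6]),
]
best = find_similar([1.0, 0.0, 0.1], store, top_k=2)
```

A linear scan like this is enough for a sketch; whether the real tool needs an
index on top of it depends on how many historical merges are stored.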

When a maintainer encounters a conflict, LLMinus finds semantically similar
historical resolutions and constructs a prompt for an LLM. The prompt includes
the current conflict and similar past resolutions, and guides the LLM to
investigate thoroughly before attempting resolution.
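
The prompt assembly described above might look roughly like the following. The
section headings, the wording of the instructions and the build_prompt name
are all assumptions made for illustration; the actual prompt used by the tool
is not shown in this cover letter.

```python
def build_prompt(conflict: str, similar: list[dict], instructions: str = "") -> str:
    """Assemble a prompt from the current conflict and similar past resolutions.

    `similar` is a list of dicts with "subject" and "resolution" keys
    (a hypothetical shape chosen for this sketch).
    """
    parts = [
        "You are helping a maintainer resolve a merge conflict.",
        "Investigate both branches before proposing a resolution.",
        "",
        "## Current conflict",
        conflict.strip(),
    ]
    # Only include maintainer notes when the pull request carried any.
    if instructions.strip():
        parts += ["", "## Notes from the submitting maintainer", instructions.strip()]
    for i, past in enumerate(similar, start=1):
        parts += [
            "",
            f"## Similar past resolution {i}: {past['subject']}",
            past["resolution"].strip(),
        ]
    return "\n".join(parts) + "\n"


prompt = build_prompt(
    conflict="CONFLICT (content): Merge conflict in include/linux/mm.h",
    similar=[{"subject": "toy example", "resolution": "kept both additions"}],
    instructions="The exact ID numbers are not important.",
)
```

Since the tool works with any LLM that takes a prompt on stdin, a string like
this could simply be piped to whichever command-line client is in use.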

The "LLMinus pull" command integrates directly with lore.kernel.org:

    LLMinus pull <message-id>

This fetches the pull request email, executes the pull, and - if conflicts
arise - invokes the LLM with full context including any conflict resolution
instructions the submitting maintainer provided.
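
A minimal sketch of the first two steps, assuming the pull request body follows
the usual git request-pull layout ("are available in the Git repository at:"
followed by a URL and a ref). The function names are hypothetical, the commit
placeholders in the sample body are not real hashes, and the real tool may
parse the email differently.

```python
import re


def lore_raw_url(message_id: str) -> str:
    """Build the raw-message URL on lore.kernel.org for a Message-ID."""
    return f"https://lore.kernel.org/all/{message_id.strip()}/raw"


def extract_pull_target(body: str):
    """Return (url, ref) from a git-request-pull style body, or None."""
    match = re.search(
        r"are available in the git repository at:\s*\n\s*"
        r"((?:git|https?|ssh)://\S+)[ \t]+(\S+)",
        body,
        re.IGNORECASE,
    )
    return match.groups() if match else None


# Sample body; PLACEHOLDER stands in for commit IDs that are not reproduced here.
sample = """The following changes since commit PLACEHOLDER:

  Example subject

are available in the Git repository at:

  git://git.kernel.org/pub/scm/linux/kernel/git/riscv/linux tags/riscv-for-linus-6.19-mw2

for you to fetch changes up to PLACEHOLDER:
"""
target = extract_pull_target(sample)
```

The resulting pair maps directly onto the arguments of "git pull <url> <ref>".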

In the immediate term, I'm hoping to turn LLMinus into a tool that is useful
for Linus Torvalds, Mark Brown, and other maintainers who pull from sub-trees
to help understand and review conflicts and support their decision making.

To support the effort of improving the tool, I plan to use LLMinus in my
linus-next work, auditing every conflict resolution it suggests against what
Linus actually does.  This serves two purposes: using Linus's resolutions to
continuously improve the tooling, and potentially spotting issues in merges
that warrant a second look. I will track divergences and build statistics on
how well the tool performs, ideally reaching parity with Linus in the future.
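
One simple way to track such divergences, sketched under the assumption that a
"match" means the resolved file contents are identical after ignoring trailing
whitespace and blank lines. That definition, and the names below, are
illustrative only; a real audit would likely want something more forgiving.

```python
def normalize(text: str) -> list[str]:
    """Drop blank lines and trailing whitespace so formatting noise is ignored."""
    return [line.rstrip() for line in text.splitlines() if line.strip()]


def agreement_stats(pairs: dict[str, tuple[str, str]]) -> dict:
    """pairs maps a merge identifier to (suggested, actual) resolved content."""
    diverged = sorted(
        name
        for name, (suggested, actual) in pairs.items()
        if normalize(suggested) != normalize(actual)
    )
    total = len(pairs)
    matches = total - len(diverged)
    return {
        "total": total,
        "matches": matches,
        "match_rate": matches / total if total else 0.0,
        "diverged": diverged,   # candidates for a second look
    }


# Toy data: identifiers and contents are made up for the example.
stats = agreement_stats({
    "merge-1": ("int a;\nint b;\n", "int a;   \n\nint b;\n"),
    "merge-2": ("int a;\n", "int b;\n"),
})
```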

Another point raised at the summit was the value of linux-next's "fs-next"
branch - filesystem maintainers benefit from having their own integration
branch focused on fs/ issues. Currently, creating similar branches for other
subsystems would overwhelm the linux-next maintainer with additional merge
work. LLMinus could change this equation, enabling more subsystem-specific
integration branches without proportionally increasing human effort.

Here is "LLMinus pull 98b74397-05bc-dbee-cab4-3f40d643eaac@kernel.org" on top
of v6.19-rc1:

    === Fetching Pull Request ===

    Fetching: https://lore.kernel.org/all/98b74397-05bc-dbee-cab4-3f40d643eaac@kernel.org/raw
    Subject: [GIT PULL] RISC-V updates for the v6.19 merge window (part two)
    From: Paul Walmsley <pjw@kernel.org>
    Date: Thu, 11 Dec 2025 19:36:25 -0700 (MST)
    Git URL: git://git.kernel.org/pub/scm/linux/kernel/git/riscv/linux tags/riscv-for-linus-6.19-mw2

    === Executing Git Pull ===

    Executing: git pull git://git.kernel.org/pub/scm/linux/kernel/git/riscv/linux tags/riscv-for-linus-6.19-mw2
    Auto-merging Documentation/admin-guide/kernel-parameters.txt
    Auto-merging Documentation/devicetree/bindings/riscv/extensions.yaml
    Auto-merging arch/riscv/Kconfig
    Auto-merging arch/riscv/include/asm/hwcap.h
    CONFLICT (content): Merge conflict in arch/riscv/include/asm/hwcap.h
    Auto-merging arch/riscv/include/asm/pgtable.h
    Auto-merging arch/riscv/kernel/cpufeature.c
    Auto-merging include/linux/mm.h
    CONFLICT (content): Merge conflict in include/linux/mm.h
    Auto-merging tools/testing/selftests/riscv/hwprobe/which-cpus.c
    Automatic merge failed; fix conflicts and then commit the result.

    === Merge Conflicts Detected ===

    Found 2 conflict region(s) to resolve
    Looking for similar historical conflicts...
    Found 3 similar historical resolutions


And the resulting merge commit (where one part matches the resolution
instructions, and one part differs):

    Merge tags/riscv-for-linus-6.19-mw2 RISC-V updates for the v6.19 merge window (part two)

    Second set of RISC-V updates for v6.19-rc1

    - Add support for control flow integrity for userspace processes.
      This is based on the standard RISC-V ISA extensions Zicfiss and
      Zicfilp

    - Add probing and userspace reporting support for the standard RISC-V
      ISA extensions Zilsd and Zclsd, which implement load/store dual
      instructions on RV32

    - Abstract the register saving code in setup_sigcontext() so it can be
      used for stateful RISC-V ISA extensions beyond the vector extension

    - Add the SBI extension ID and some initial data structure definitions
      for the RISC-V standard SBI debug trigger extension

    - Clean up some code slightly: change some page table functions to
      avoid atomic operations on !SMP and to avoid unnecessary casts to
      atomic_long_t; and use the existing RISCV_FULL_BARRIER macro in
      place of some open-coded "fence rw,rw" instructions

    Merge conflict resolution:

    # Merge Conflict Resolution: riscv-for-linus-6.19-mw2

    ## Summary

    This merge integrates RISC-V CFI (Control Flow Integrity) support for userspace
    processes along with additional ISA extension probing support. The resolution
    followed the maintainer's guidance from Paul Walmsley's pull request email.

    ## Conflicts Resolved

    ### 1. arch/riscv/include/asm/hwcap.h - ISA Extension ID Renumbering

    **Conflict:** Both branches added new RISC-V ISA extensions with overlapping IDs.
    - HEAD added SVRSW60T59B at 100 and ZALASR at 101
    - MERGE_HEAD added ZALASR at 100, plus ZILSD, ZCLSD, ZICFILP, ZICFISS for CFI

    **Resolution:** Kept all extensions with renumbered IDs to avoid duplicates:
    - SVRSW60T59B: 100 (from HEAD)
    - ZALASR: 101 (bumped from 100)
    - ZILSD: 102 (bumped from 101)
    - ZCLSD: 103 (bumped from 102)
    - ZICFILP: 104 (bumped from 103)
    - ZICFISS: 105 (bumped from 104)

    As the maintainer noted, the exact numbers are not important - they just need
    to be unique and below RISCV_ISA_EXT_MAX (128).

    ### 2. include/linux/mm.h - VM_SHADOW_STACK for RISC-V CFI

    **Conflict:** The VMA flags code was significantly refactored by commit
    9ea35a25d51b ("mm: introduce VMA flags bitmap type"). The incoming RISC-V
    CFI changes used the old-style #define syntax which conflicted with the
    new enum-based DECLARE_VMA_BIT_ALIAS approach.

    **Resolution:** Following the maintainer's guidance:

    a) In the enum section (line 362), added RISC-V CFI to the x86 shadow stack
       condition to share the same bit alias:
       ```c
       #if defined(CONFIG_X86_USER_SHADOW_STACK) || defined(CONFIG_RISCV_USER_CFI)
       ```

    b) In the VM_SHADOW_STACK macro definition (line 463-464), added RISC-V CFI
       to enable the flag:
       ```c
       #if defined(CONFIG_X86_USER_SHADOW_STACK) || defined(CONFIG_ARM64_GCS) || \
           defined(CONFIG_RISCV_USER_CFI)
       ```

    This follows the same pattern used for x86 and ARM64 shadow stacks, where
    x86 and RISC-V share HIGH_ARCH_5 (bit 37) and ARM64 GCS uses HIGH_ARCH_6.

    ## Rationale

    The resolution is cleaner than the maintainer's suggested diff because it:
    1. Properly integrates with the new VMA flags enum system
    2. Maintains consistency with how x86 and ARM64 shadow stacks are handled
    3. Doesn't leave any remnants of the old-style macros

    ## Testing Considerations

    - RISC-V CFI requires the Zicfiss and Zicfilp ISA extensions
    - The Kconfig prevents enabling CFI on no-MMU systems for bisectability
    - Full testing should include both hardware emulation and QEMU

    Link: https://lore.kernel.org/all/98b74397-05bc-dbee-cab4-3f40d643eaac@kernel.org/
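
The constraint stated in the hwcap.h resolution above (IDs must be unique and
below RISCV_ISA_EXT_MAX) is easy to check mechanically. The following is a
sketch of such a check using the numbers from the commit message; it is not
part of the posted series, and check_ext_ids is a hypothetical helper.

```python
RISCV_ISA_EXT_MAX = 128  # limit quoted in the resolution notes

# IDs as listed in the resolution.
resolved_ids = {
    "SVRSW60T59B": 100,
    "ZALASR": 101,
    "ZILSD": 102,
    "ZCLSD": 103,
    "ZICFILP": 104,
    "ZICFISS": 105,
}


def check_ext_ids(ids: dict[str, int], limit: int = RISCV_ISA_EXT_MAX) -> list[str]:
    """Return a list of problems: duplicate IDs, or IDs at or above the limit."""
    problems = []
    seen: dict[int, str] = {}
    for name, value in ids.items():
        if value >= limit:
            problems.append(f"{name}={value} is not below {limit}")
        if value in seen:
            problems.append(f"{name} and {seen[value]} both use {value}")
        else:
            seen[value] = name
    return problems


problems = check_ext_ids(resolved_ids)
```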

The tool is available in tools/llminus and can be built with:

    cd tools/llminus && cargo build --release

A few notes:

 - My Rust knowledge is questionable, and a lot of the code was written with
   the help of an LLM. It's best if we don't dig too deep into actual code
   review at this point and focus on the concept itself.

 - The tool will work with any LLM that can take a prompt via stdin, but will
   work even better with tools that allow the LLM to run other tools as part of
   its investigative work.

 - There's no GPU support just yet, so creating the embeddings for the entire
   history takes quite a while...

Sasha Levin (5):
  LLMinus: Add skeleton project with learn command
  LLMinus: Add vectorize command with fastembed
  LLMinus: Add find command for similarity search
  LLMinus: Add resolve command for LLM-assisted conflict resolution
  LLMinus: Add pull command for LLM-assisted kernel pull request merging

 tools/llminus/.gitignore  |    1 +
 tools/llminus/Cargo.toml  |   20 +
 tools/llminus/src/main.rs | 2289 +++++++++++++++++++++++++++++++++++++
 3 files changed, 2310 insertions(+)
 create mode 100644 tools/llminus/.gitignore
 create mode 100644 tools/llminus/Cargo.toml
 create mode 100644 tools/llminus/src/main.rs

-- 
2.51.0

Re: [RFC 0/5] LLMinus: LLM-Assisted Merge Conflict Resolution

Posted by Sasha Levin 1 month, 2 weeks ago

On Fri, Dec 19, 2025 at 01:16:24PM -0500, Sasha Levin wrote:
>Another point raised at the summit was the value of linux-next's "fs-next"
>branch - filesystem maintainers benefit from having their own integration
>branch focused on fs/ issues. Currently, creating similar branches for other
>subsystems would overwhelm the linux-next maintainer with additional merge
>work. LLMinus could change this equation, enabling more subsystem-specific
>integration branches without proportionally increasing human effort.

I've been toying with this one for the past day.

I've started by letting LLMinus learn merge conflict resolutions on linux-next,
so that it could use them as references later. At that point, very little is
needed to have an LLM resolve different variations of the same conflict (that
just appear different because we mix-and-match various trees).

I assigned categories to the various trees used by -next (see
https://gist.github.com/sashalevin/163df4ae1163e0e22a97edc40e14b7f5) and built
a simple wrapper script to generate per-category integration branches, letting
LLMinus resolve conflicts whenever we hit one.

The resulting branches were pushed to
https://git.kernel.org/pub/scm/linux/kernel/git/sashal/linux-next.git/refs/ .
Each category has a %s-next branch, and a larger all-next branch which merges
all of them together and is the equivalent of linux-next.
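
The wrapper script itself is not included in this mail. A rough sketch of what
such a script could plan is shown below: it only builds the git command lines
and runs nothing. The category names and tree refs are made up for the
example, and the step where a failed merge is handed to LLMinus is left out.

```python
def plan_integration_branches(categories: dict[str, list[str]], base: str) -> list[list[str]]:
    """Return git argv lists that build one <category>-next branch per category
    and an all-next branch merging them together. Nothing is executed here."""
    commands = []
    for category, trees in categories.items():
        branch = "%s-next" % category
        # -B creates the branch, or resets it if it already exists.
        commands.append(["git", "checkout", "-B", branch, base])
        for tree in trees:
            # A failing merge is where conflict resolution would kick in.
            commands.append(["git", "merge", "--no-ff", tree])
    commands.append(["git", "checkout", "-B", "all-next", base])
    for category in categories:
        commands.append(["git", "merge", "--no-ff", "%s-next" % category])
    return commands


# Hypothetical categories and refs, for illustration only.
plan = plan_integration_branches(
    {"fs": ["example-vfs/for-next", "example-ext4/dev"], "mm": ["example-mm/for-next"]},
    base="v6.19-rc1",
)
```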

Please let me know what you think!

-- 
Thanks,
Sasha

Re: [RFC 0/5] LLMinus: LLM-Assisted Merge Conflict Resolution

Posted by Mark Brown 1 month, 2 weeks ago

On Sun, Dec 21, 2025 at 11:10:11AM -0500, Sasha Levin wrote:

> I assigned categories to the various trees used by -next (see
> https://gist.github.com/sashalevin/163df4ae1163e0e22a97edc40e14b7f5) and built
> a simple wrapper script to generate per-category integration branches, letting
> LLMinus resolve conflicts whenever we hit one.

Those categories appear to be a bit randomly assigned FWIW.  I'm not
clear who would want the various intermediate merges either, I suppose
that having some of the trees pulled into multiple places might help
shake out some of the issues due to things getting sent to Linus in a
different order but OTOH it will increase the total number of merges
done and tested which is itself a cost.  We could also shake out
ordering issues by doing something like randomise the ordering.  I think
I'd want some demand or use case for doing more intermediate merges
rather than just doing a bunch of them for the sake of it.

This seems like a very separate experiment to your LLM merge thing.

> The resulting branches were pushed to
> https://git.kernel.org/pub/scm/linux/kernel/git/sashal/linux-next.git/refs/ .
> Each category has a %s-next branch, and a larger all-next branch which merges
> all of them together and is the equivalent of linux-next.

> Please let me know what you think!

It might be easier to tell what it's done if you ran this with the same
inputs as the last -next (it's on Christmas break at the minute),
there's quite large differences in the end result but most if not all of
that is that the input trees you're using are fresher than the last
-next.  Though I think even with the same base there'd be a bit of a
needle in a haystack thing finding interesting cases, probably it'd be
more useful to find and highlight specific cases where it does something
interesting.

Re: [RFC 0/5] LLMinus: LLM-Assisted Merge Conflict Resolution

Posted by Sasha Levin 1 month, 2 weeks ago

On Mon, Dec 22, 2025 at 02:50:55PM +0000, Mark Brown wrote:
>On Sun, Dec 21, 2025 at 11:10:11AM -0500, Sasha Levin wrote:
>
>> I assigned categories to the various trees used by -next (see
>> https://gist.github.com/sashalevin/163df4ae1163e0e22a97edc40e14b7f5) and built
>> a simple wrapper script to generate per-category integration branches, letting
>> LLMinus resolve conflicts whenever we hit one.
>
>Those categories appear to be a bit randomly assigned FWIW.  I'm not

There should be some sense there :) but yes, we could fine tune it as we go.

>clear who would want the various intermediate merges either, I suppose
>that having some of the trees pulled into multiple places might help
>shake out some of the issues due to things getting sent to Linus in a
>different order but OTOH it will increase the total number of merges
>done and tested which is itself a cost.  We could also shake out
>ordering issues by doing something like randomise the ordering.  I think
>I'd want some demand or use case for doing more intermediate merges
>rather than just doing a bunch of them for the sake of it.

My thinking around it was to enable faster per-subsystem tests than what we
currently do. For example, we can quickly build mm-next and run mm focused
tests on it.

Since creating these per-subsystem trees is fairly cheap and can happen even a
few times a day, we can help identify issues way earlier during the process.

>This seems like a very separate experiment to your LLM merge thing.

Right, just going off on a tangent based on the Maintainer's summit feedback of
how useful fs-next is.

>> The resulting branches were pushed to
>> https://git.kernel.org/pub/scm/linux/kernel/git/sashal/linux-next.git/refs/ .
>> Each category has a %s-next branch, and a larger all-next branch which merges
>> all of them together and is the equivalent of linux-next.
>
>> Please let me know what you think!
>
>It might be easier to tell what it's done if you ran this with the same
>inputs as the last -next (it's on Christmas break at the minute),
>there's quite large differences in the end result but most if not all of
>that is that the input trees you're using are fresher than the last
>-next.  Though I think even with the same base there'd be a bit of a
>needle in a haystack thing finding interesting cases, probably it'd be
>more useful to find and highlight specific cases where it does something
>interesting.

Yup, I figured I'd wait until break is over and compare the trees once the next
linux-next is released.

-- 
Thanks,
Sasha

Re: [RFC 0/5] LLMinus: LLM-Assisted Merge Conflict Resolution

Posted by Mark Brown 1 month, 2 weeks ago

On Tue, Dec 23, 2025 at 07:36:18AM -0500, Sasha Levin wrote:
> On Mon, Dec 22, 2025 at 02:50:55PM +0000, Mark Brown wrote:
> > On Sun, Dec 21, 2025 at 11:10:11AM -0500, Sasha Levin wrote:

> > clear who would want the various intermediate merges either, I suppose
> > that having some of the trees pulled into multiple places might help
> > shake out some of the issues due to things getting sent to Linus in a
> > different order but OTOH it will increase the total number of merges
> > done and tested which is itself a cost.  We could also shake out
> > ordering issues by doing something like randomise the ordering.  I think
> > I'd want some demand or use case for doing more intermediate merges
> > rather than just doing a bunch of them for the sake of it.

> My thinking around it was to enable faster per-subsystem tests than what we
> currently do. For example, we can quickly build mm-next and run mm focused
> tests on it.

If we start putting everything into intermediate merges then inevitably
some of those merges are going to be later in the process and will get
generated later in the process, meaning they're nearer to the production
of the full -next.  I'm also not clear that we have enough trees that
would update multiple times a day.

> Since creating these per-subsystem trees is fairly cheap and can happen even
> a few times a day, we can help identify issues way earlier during the process.

To be clear unless things are super prone to conflicts the big cost with
adding stuff to -next isn't generally doing the merges, it's build
testing the results.  To that end the main potential advantage I can see
in doing submerges would be if we could parallelise the build testing
portion of things.  That would need some consideration of the complexity
of the scripting, the build machines and the cognitive load involved,
and if we were doing that the considerations for constructing submerges
would be a bit different.  It has crossed my mind, but it'd be non
trivial to do and not intending to produce intermediate merges that are
useful to anyone else.

> > This seems like a very separate experiment to your LLM merge thing.

> Right, just going off on a tangent based on the Maintainer's summit feedback of
> how useful fs-next is.

A key part of this is that the filesystem people had the need, capacity
and desire to test a specific merge.  It's not that the merge started
happening then the filesystem people saw it and realised that it'd be
really useful, they wanted and asked for the merge because it filled a
specific need they had identified.  If there's other situations like
that that's a very different, much more clearly valuable, prospect than
producing intermediate merges and hoping they're useful.

With my testing hat on there's costs to adding extra trees to test, and
with producing those trees more often.  You need capacity to both run
the tests and triage the results, and an audience that is going to care
about the results.  If you're adding a merged tree you generally either
want to be able to drop individual testing of the component trees or to
have some reason to believe that that specific merge is likely to be
where relevant issues are introduced.  For example the reason I
generally recommend that people doing CI cover -next as well as their
specific trees is that you can catch issues from other trees that are
going to impact your testing (eg, breaking the platforms you test)
before they end up coming into your tree via Linus' tree, keeping your
baseline stable.  With that goal you're actively looking to see as many
trees as possible integrated.

My guess would be that many areas of the kernel already have workflows
that meet whatever needs they have for integration trees and have no
need to do something centrally, if there are areas where there's a need
then by all means but I think they should be something that people
actively want.

Re: [RFC 0/5] LLMinus: LLM-Assisted Merge Conflict Resolution

Posted by Sasha Levin 1 month ago

On Tue, Dec 23, 2025 at 05:47:58PM +0000, Mark Brown wrote:
>On Tue, Dec 23, 2025 at 07:36:18AM -0500, Sasha Levin wrote:
>> On Mon, Dec 22, 2025 at 02:50:55PM +0000, Mark Brown wrote:
>> > On Sun, Dec 21, 2025 at 11:10:11AM -0500, Sasha Levin wrote:
>
>> > clear who would want the various intermediate merges either, I suppose
>> > that having some of the trees pulled into multiple places might help
>> > shake out some of the issues due to things getting sent to Linus in a
>> > different order but OTOH it will increase the total number of merges
>> > done and tested which is itself a cost.  We could also shake out
>> > ordering issues by doing something like randomise the ordering.  I think
>> > I'd want some demand or use case for doing more intermediate merges
>> > rather than just doing a bunch of them for the sake of it.
>
>> My thinking around it was to enable faster per-subsystem tests than what we
>> currently do. For example, we can quickly build mm-next and run mm focused
>> tests on it.
>
>If we start putting everything into intermediate merges then inevitably
>some of those merges are going to be later in the process and will get
>generated later in the process, meaning they're nearer to the production
>of the full -next.  I'm also not clear that we have enough trees that
>would update multiple times a day.

I've left the script running over the holiday break, and the rate of changes is
surprisingly high (especially given it was a holiday in most of the world!).

>> Since creating these per-subsystem trees is fairly cheap and can happen even
>a few times a day, we can help identify issues way earlier during the process.
>
>To be clear unless things are super prone to conflicts the big cost with
>adding stuff to -next isn't generally doing the merges, it's build
>testing the results.  To that end the main potential advantage I can see
>in doing submerges would be if we could parallelise the build testing
>portion of things.  That would need some consideration of the complexity
>of the scripting, the build machines and the cognitive load involved,
>and if we were doing that the considerations for constructing submerges
>would be a bit different.  It has crossed my mind, but it'd be non
>trivial to do and not intending to produce intermediate merges that are
>useful to anyone else.

The way I have it working is that I will only recreate a sub-tree if any of the
-next trees that are part of it were rebased, otherwise I will just merge new
changes on top of the existing tree.
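
One way to detect the "was rebased" condition is to test whether a tree's
previous tip is still an ancestor of its current tip, which
"git merge-base --is-ancestor" reports through its exit status. The sketch
below assumes that approach; the function names are hypothetical, and treating
a tree that disappeared as a reason to recreate is also an assumption.

```python
import subprocess


def is_ancestor(old: str, new: str, repo: str = ".") -> bool:
    """True if commit `old` is reachable from `new` (git exits with status 0)."""
    result = subprocess.run(
        ["git", "-C", repo, "merge-base", "--is-ancestor", old, new],
        check=False,
    )
    return result.returncode == 0


def needs_recreate(previous_tips: dict, current_tips: dict, ancestor_check=is_ancestor) -> bool:
    """A tree counts as rebased when its previous tip is no longer an ancestor
    of its current tip; any rebased (or vanished) tree forces a recreate."""
    for tree, old_tip in previous_tips.items():
        new_tip = current_tips.get(tree)
        if new_tip is None or not ancestor_check(old_tip, new_tip):
            return True
    return False
```

When needs_recreate() returns False, merging the new tips on top of the
existing sub-tree is sufficient. Passing ancestor_check in as a parameter
keeps the decision logic testable without a repository.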

I will also do a build test only after I merged everything into the sub-tree.
If we hit a build error, I will bisect between the last known good point and
HEAD.
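
The bisection step can be sketched as a binary search over the commits added
since the last known good point. This is a simplification: it treats history
as a linear list and assumes that once the build breaks it stays broken,
whereas git bisect works on the real commit graph. The names are hypothetical.

```python
def first_bad(commits: list[str], builds_ok) -> str:
    """Find the first commit that fails to build.

    `commits` runs from just after the last known good point up to HEAD,
    oldest first, and HEAD (the last entry) is known to fail.
    """
    if not commits:
        raise ValueError("need at least one commit after the known good point")
    lo, hi = 0, len(commits) - 1      # invariant: commits[hi] fails
    while lo < hi:
        mid = (lo + hi) // 2
        if builds_ok(commits[mid]):
            lo = mid + 1              # breakage was introduced after mid
        else:
            hi = mid                  # mid fails; first failure is mid or earlier
    return commits[lo]


# Toy history where the build breaks at "c4".
history = ["c1", "c2", "c3", "c4", "c5"]
culprit = first_bad(history, lambda c: c in {"c1", "c2", "c3"})
```

Building only once per sub-tree and bisecting on failure keeps the number of
builds logarithmic in the number of new commits in the failing case.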

Between the above, as well as tracking "known-broken" trees, the volume of
build tests is not that scary.

-- 
Thanks,
Sasha

Re: [RFC 0/5] LLMinus: LLM-Assisted Merge Conflict Resolution

Posted by Mark Brown 1 month ago

On Mon, Jan 05, 2026 at 01:00:20PM -0500, Sasha Levin wrote:
> On Tue, Dec 23, 2025 at 05:47:58PM +0000, Mark Brown wrote:

> > To be clear unless things are super prone to conflicts the big cost with
> > adding stuff to -next isn't generally doing the merges, it's build
> > testing the results.  To that end the main potential advantage I can see
> > in doing submerges would be if we could parallelise the build testing
> > portion of things.  That would need some consideration of the complexity
> > of the scripting, the build machines and the cognitive load involved,
> > and if we were doing that the considerations for constructing submerges
> > would be a bit different.  It has crossed my mind, but it'd be non
> > trivial to do and not intending to produce intermediate merges that are
> > useful to anyone else.

> The way I have it working is that I will only recreate a sub-tree if any of the
> -next trees that are part of it were rebased, otherwise I will just merge new
> changes on top of the existing tree.

> I will also do a build test only after I merged everything into the sub-tree.
> If we hit a build error, I will bisect between the last known good point and
> HEAD.

Yeah, there's already optimisations along those lines in there which
help a lot.

> Between the above, as well as tracking "known-broken" trees, the volume of
> build tests is not that scary.

Most of the time it's fine like you say, otherwise it'd be completely
unsustainable.  Some of the time, between the number of trees that decide
to simultaneously make changes that trigger full allmodconfig rebuilds,
generate messy conflicts or whatever else, things blow up in your face.
There's an interlock in the scripts that stops releases going out after
3am or something which I am pretty confident is in there due to bitter
experience.
