Create a document to summarize hard-earned knowledge about RSB-related
mitigations, with references, and replace the overly verbose yet
incomplete comments with a reference to the document.
Signed-off-by: Josh Poimboeuf <jpoimboe@kernel.org>
---
Documentation/admin-guide/hw-vuln/index.rst | 1 +
Documentation/admin-guide/hw-vuln/rsb.rst | 241 ++++++++++++++++++++
arch/x86/kernel/cpu/bugs.c | 64 ++----
3 files changed, 255 insertions(+), 51 deletions(-)
create mode 100644 Documentation/admin-guide/hw-vuln/rsb.rst
diff --git a/Documentation/admin-guide/hw-vuln/index.rst b/Documentation/admin-guide/hw-vuln/index.rst
index ff0b440ef2dc..451874b8135d 100644
--- a/Documentation/admin-guide/hw-vuln/index.rst
+++ b/Documentation/admin-guide/hw-vuln/index.rst
@@ -22,3 +22,4 @@ are configurable at compile, boot or run time.
srso
gather_data_sampling
reg-file-data-sampling
+ rsb
diff --git a/Documentation/admin-guide/hw-vuln/rsb.rst b/Documentation/admin-guide/hw-vuln/rsb.rst
new file mode 100644
index 000000000000..97bf75993d5d
--- /dev/null
+++ b/Documentation/admin-guide/hw-vuln/rsb.rst
@@ -0,0 +1,241 @@
+.. SPDX-License-Identifier: GPL-2.0
+
+=======================
+RSB-related mitigations
+=======================
+
+.. warning::
+ Please keep this document up-to-date, otherwise you will be
+ volunteered to update it and convert it to a very long comment in
+ bugs.c!
+
+Since 2018 there have been many Spectre CVEs related to the Return Stack
+Buffer (RSB). Information about these CVEs and how to mitigate them is
+scattered amongst a myriad of microarchitecture-specific documents.
+
+This document attempts to consolidate all the relevant information in
+once place and clarify the reasoning behind the current RSB-related
+mitigations.
+
+At a high level, there are two classes of RSB attacks: RSB poisoning
+(Intel and AMD) and RSB underflow (Intel only). They must each be
+considered individually for each attack vector (and microarchitecture
+where applicable).
+
+----
+
+RSB poisoning (Intel and AMD)
+=============================
+
+SpectreRSB
+~~~~~~~~~~
+
+RSB poisoning is a technique used by Spectre-RSB [#spectre-rsb]_ where
+an attacker poisons an RSB entry to cause a victim's return instruction
+to speculate to an attacker-controlled address. This can happen when
+there are unbalanced CALLs/RETs after a context switch or VMEXIT.
+
+* All attack vectors can potentially be mitigated by flushing out any
+ poisoned RSB entries using an RSB filling sequence
+ [#intel-rsb-filling]_ [#amd-rsb-filling]_ when transitioning between
+ untrusted and trusted domains. But this has a performance impact and
+ should be avoided whenever possible.
+
+* On context switch, the user->user mitigation requires ensuring the
+ RSB gets filled or cleared whenever IBPB gets written [#cond-ibpb]_
+ during a context switch:
+
+ * AMD:
+ IBPB (or SBPB [#amd-sbpb]_ if used) automatically clears the RSB
+ if IBPB_RET is set in CPUID [#amd-ibpb-rsb]_. Otherwise the RSB
+ filling sequence [#amd-rsb-filling]_ must be always be done in
+ addition to IBPB.
+
+ * Intel:
+ IBPB automatically clears the RSB:
+
+ "Software that executed before the IBPB command cannot control
+ the predicted targets of indirect branches executed after the
+ command on the same logical processor. The term indirect branch
+ in this context includes near return instructions, so these
+ predicted targets may come from the RSB." [#intel-ibpb-rsb]_
+
+* On context switch, user->kernel attacks are mitigated by SMEP, as user
+ space can only insert its own return addresses into the RSB:
+
+ * AMD:
+ "Finally, branches that are predicted as 'ret' instructions get
+ their predicted targets from the Return Address Predictor (RAP).
+ AMD recommends software use a RAP stuffing sequence (mitigation
+ V2-3 in [2]) and/or Supervisor Mode Execution Protection (SMEP)
+ to ensure that the addresses in the RAP are safe for
+ speculation. Collectively, we refer to these mitigations as "RAP
+ Protection"." [#amd-smep-rsb]_
+
+ * Intel:
+ "On processors with enhanced IBRS, an RSB overwrite sequence may
+ not suffice to prevent the predicted target of a near return
+ from using an RSB entry created in a less privileged predictor
+ mode. Software can prevent this by enabling SMEP (for
+ transitions from user mode to supervisor mode) and by having
+ IA32_SPEC_CTRL.IBRS set during VM exits." [#intel-smep-rsb]_
+
+* On VMEXIT, guest->host attacks are mitigated by eIBRS (and PBRSB
+ mitigation if needed):
+
+ * AMD:
+ "When Automatic IBRS is enabled, the internal return address
+ stack used for return address predictions is cleared on VMEXIT."
+ [#amd-eibrs-vmexit]_
+
+ * Intel:
+ "On processors with enhanced IBRS, an RSB overwrite sequence may
+ not suffice to prevent the predicted target of a near return
+ from using an RSB entry created in a less privileged predictor
+ mode. Software can prevent this by enabling SMEP (for
+ transitions from user mode to supervisor mode) and by having
+ IA32_SPEC_CTRL.IBRS set during VM exits. Processors with
+ enhanced IBRS still support the usage model where IBRS is set
+ only in the OS/VMM for OSes that enable SMEP. To do this, such
+ processors will ensure that guest behavior cannot control the
+ RSB after a VM exit once IBRS is set, even if IBRS was not set
+ at the time of the VM exit." [#intel-eibrs-vmexit]_
+
+ Note that some Intel CPUs are susceptible to Post-barrier Return
+ Stack Buffer Predictions (PBRSB)[#intel-pbrsb]_, where the last CALL
+ from the guest can be used to predict the first unbalanced RET. In
+ this case the PBRSB mitigation is needed in addition to eIBRS.
+
+AMD Retbleed / SRSO / Branch Type Confusion
+~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
+
+On AMD, poisoned RSB entries can also be created by the AMD Retbleed
+variant [#retbleed-paper]_ and/or Speculative Return Stack Overflow
+[#amd-srso]_ (Inception [#inception-paper]_). These attacks are made
+possible by Branch Type Confusion [#amd-btc]_. The kernel protects
+itself by replacing every RET in the kernel with a branch to a single
+safe RET.
+
+----
+
+RSB underflow (Intel only)
+==========================
+
+Intel Retbleed
+~~~~~~~~~~~~~~
+
+Some Intel Skylake-generation CPUs are susceptible to the Intel variant
+of Retbleed [#retbleed-paper]_ (Return Stack Buffer Underflow
+[#intel-rsbu]_). If a RET is executed when the RSB buffer is empty due
+to mismatched CALLs/RETs or returning from a deep call stack, the branch
+predictor can fall back to using the Branch Target Buffer (BTB). If a
+user forces a BTB collision then the RET can speculatively branch to a
+user-controlled address.
+
+* Note that RSB filling doesn't fully mitigate this issue. If there
+ are enough unbalanced RETs, the RSB may still underflow and fall back
+ to using a poisoned BTB entry.
+
+* On context switch, user->user underflow attacks are mitigated by the
+ conditional IBPB [#cond-ibpb]_ on context switch which clears the BTB:
+
+ * "The indirect branch predictor barrier (IBPB) is an indirect branch
+ control mechanism that establishes a barrier, preventing software
+ that executed before the barrier from controlling the predicted
+ targets of indirect branches executed after the barrier on the same
+ logical processor." [#intel-ibpb-btb]_
+
+ .. note::
+ I wasn't able to find any offical documentation from Intel
+ explicitly stating that IBPB clears the BTB. However, it's
+ broadly known to be true and relied upon in several mitigations.
+
+* On context switch and VMEXIT, user->kernel and guest->host underflows
+ are mitigated by IBRS or eIBRS:
+
+ * "Enabling IBRS (including enhanced IBRS) will mitigate the "RSBU"
+ attack demonstrated by the researchers. As previously documented,
+ Intel recommends the use of enhanced IBRS, where supported. This
+ includes any processor that enumerates RRSBA but not RRSBA_DIS_S."
+ [#intel-rsbu]_
+
+ As an alternative to classic IBRS, call depth tracking can be used to
+ track kernel returns and fill the RSB when it gets close to being
+ empty.
+
+Restricted RSB Alternate (RRSBA)
+~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
+
+Some newer Intel CPUs have Restricted RSB Alternate (RRSBA) behavior,
+which, similar to the Intel variant of Retbleed described above, also
+falls back to using the BTB on RSB underflow. The only difference is
+that the predicted targets are restricted to the current domain.
+
+* "Restricted RSB Alternate (RRSBA) behavior allows alternate branch
+ predictors to be used by near RET instructions when the RSB is
+ empty. When eIBRS is enabled, the predicted targets of these
+ alternate predictors are restricted to those belonging to the
+ indirect branch predictor entries of the current prediction domain.
+ [#intel-eibrs-rrsba]_
+
+When a CPU with RRSBA is vulnerable to Branch History Injection
+[#bhi-paper]_ [#intel-bhi]_, an RSB underflow could be used for an
+intra-mode BTI attack. This is mitigated by clearing the BHB on
+kernel entry.
+
+However if the kernel uses retpolines instead of eIBRS, it needs to
+disable RRSBA:
+
+* "Where software is using retpoline as a mitigation for BHI or
+ intra-mode BTI, and the processor both enumerates RRSBA and
+ enumerates RRSBA_DIS controls, it should disable this behavior. "
+ [#intel-retpoline-rrsba]_
+
+----
+
+References
+==========
+
+.. [#spectre-rsb] `Spectre Returns! Speculation Attacks using the Return Stack Buffer <https://arxiv.org/pdf/1807.07940.pdf>`_
+
+.. [#intel-rsb-filling] "Empty RSB Mitigation on Skylake-generation" in `Retpoline: A Branch Target Injection Mitigation <https://www.intel.com/content/www/us/en/developer/articles/technical/software-security-guidance/technical-documentation/retpoline-branch-target-injection-mitigation.html#inpage-nav-5-1>`_
+
+.. [#amd-rsb-filling] "Mitigation V2-3" in `Software Techniques for Managing Speculation <https://www.amd.com/content/dam/amd/en/documents/processor-tech-docs/programmer-references/software-techniques-for-managing-speculation.pdf>`_
+
+.. [#cond-ibpb] Whether IBPB is written depends on whether the prev and/or next task is protected from Spectre attacks. It typically requires opting in per task or system-wide. For more details see the documentation for the ``spectre_v2_user`` cmdline option in Documentation/admin-guide/kernel-parameters.txt.
+
+.. [#amd-sbpb] IBPB without flushing of branch type predictions. Only exists for AMD.
+
+.. [#amd-ibpb-rsb] "Function 8000_0008h -- Processor Capacity Parameters and Extended Feature Identification" in `AMD64 Architecture Programmer's Manual Volume 3: General-Purpose and System Instructions <https://www.amd.com/content/dam/amd/en/documents/processor-tech-docs/programmer-references/24594.pdf>`_. SBPB behaves the same way according to `this email <https://lore.kernel.org/5175b163a3736ca5fd01cedf406735636c99a>`_.
+
+.. [#intel-ibpb-rsb] "Introduction" in `Post-barrier Return Stack Buffer Predictions / CVE-2022-26373 / INTEL-SA-00706 <https://www.intel.com/content/www/us/en/developer/articles/technical/software-security-guidance/advisory-guidance/post-barrier-return-stack-buffer-predictions.html>`_
+
+.. [#amd-smep-rsb] "Existing Mitigations" in `Technical Guidance for Mitigating Branch Type Confusion <https://www.amd.com/content/dam/amd/en/documents/resources/technical-guidance-for-mitigating-branch-type-confusion.pdf>`_
+
+.. [#intel-smep-rsb] "Enhanced IBRS" in `Indirect Branch Restricted Speculation <https://www.intel.com/content/www/us/en/developer/articles/technical/software-security-guidance/technical-documentation/indirect-branch-restricted-speculation.html>`_
+
+.. [#amd-eibrs-vmexit] "Extended Feature Enable Register (EFER)" in `AMD64 Architecture Programmer's Manual Volume 2: System Programming <https://www.amd.com/content/dam/amd/en/documents/processor-tech-docs/programmer-references/24593.pdf>`_
+
+.. [#intel-eibrs-vmexit] "Enhanced IBRS" in `Indirect Branch Restricted Speculation <https://www.intel.com/content/www/us/en/developer/articles/technical/software-security-guidance/technical-documentation/indirect-branch-restricted-speculation.html>`_
+
+.. [#intel-pbrsb] `Post-barrier Return Stack Buffer Predictions / CVE-2022-26373 / INTEL-SA-00706 <https://www.intel.com/content/www/us/en/developer/articles/technical/software-security-guidance/advisory-guidance/post-barrier-return-stack-buffer-predictions.html>`_
+
+.. [#retbleed-paper] `Retbleed: Arbitrary Speculative Code Execution with Return Instruction <https://comsec.ethz.ch/wp-content/files/retbleed_sec22.pdf>`_
+
+.. [#amd-btc] `Technical Guidance for Mitigating Branch Type Confusion <https://www.amd.com/content/dam/amd/en/documents/resources/technical-guidance-for-mitigating-branch-type-confusion.pdf>`_
+
+.. [#amd-srso] `Technical Update Regarding Speculative Return Stack Overflow <https://www.amd.com/content/dam/amd/en/documents/corporate/cr/speculative-return-stack-overflow-whitepaper.pdf>`_
+
+.. [#inception-paper] `Inception: Exposing New Attack Surfaces with Training in Transient Execution <https://comsec.ethz.ch/wp-content/files/inception_sec23.pdf>`_
+
+.. [#intel-rsbu] `Return Stack Buffer Underflow / Return Stack Buffer Underflow / CVE-2022-29901, CVE-2022-28693 / INTEL-SA-00702 <https://www.intel.com/content/www/us/en/developer/articles/technical/software-security-guidance/advisory-guidance/return-stack-buffer-underflow.html>`_
+
+.. [#intel-ibpb-btb] `Indirect Branch Predictor Barrier' <https://www.intel.com/content/www/us/en/developer/articles/technical/software-security-guidance/technical-documentation/indirect-branch-predictor-barrier.html>`_
+
+.. [#intel-eibrs-rrsba] "Guidance for RSBU" in `Return Stack Buffer Underflow / Return Stack Buffer Underflow / CVE-2022-29901, CVE-2022-28693 / INTEL-SA-00702 <https://www.intel.com/content/www/us/en/developer/articles/technical/software-security-guidance/advisory-guidance/return-stack-buffer-underflow.html>`_
+
+.. [#bhi-paper] `Branch History Injection: On the Effectiveness of Hardware Mitigations Against Cross-Privilege Spectre-v2 Attacks <http://download.vusec.net/papers/bhi-spectre-bhb_sec22.pdf>`_
+
+.. [#intel-bhi] `Branch History Injection and Intra-mode Branch Target Injection / CVE-2022-0001, CVE-2022-0002 / INTEL-SA-00598 <https://www.intel.com/content/www/us/en/developer/articles/technical/software-security-guidance/technical-documentation/branch-history-injection.html>`_
+
+.. [#intel-retpoline-rrsba] "Retpoline" in `Branch History Injection and Intra-mode Branch Target Injection / CVE-2022-0001, CVE-2022-0002 / INTEL-SA-00598 <https://www.intel.com/content/www/us/en/developer/articles/technical/software-security-guidance/technical-documentation/branch-history-injection.html>`_
diff --git a/arch/x86/kernel/cpu/bugs.c b/arch/x86/kernel/cpu/bugs.c
index 680c779e9711..e78bb781c091 100644
--- a/arch/x86/kernel/cpu/bugs.c
+++ b/arch/x86/kernel/cpu/bugs.c
@@ -1594,25 +1594,25 @@ static void __init spec_ctrl_disable_kernel_rrsba(void)
static void __init spectre_v2_select_rsb_mitigation(enum spectre_v2_mitigation mode)
{
/*
- * Similar to context switches, there are two types of RSB attacks
- * after VM exit:
+ * WARNING! There are many subtleties to consider when changing *any*
+ * code related to RSB-related mitigations. Before doing so, carefully
+ * read the following document, and update if necessary:
*
- * 1) RSB underflow
+ * Documentation/admin-guide/hw-vuln/rsb.rst
*
- * 2) Poisoned RSB entry
+ * In an overly simplified nutshell:
*
- * When retpoline is enabled, both are mitigated by filling/clearing
- * the RSB.
+ * - User->user RSB attacks are conditionally mitigated during
+ * context switch by cond_mitigation -> __write_ibpb().
*
- * When IBRS is enabled, while #1 would be mitigated by the IBRS branch
- * prediction isolation protections, RSB still needs to be cleared
- * because of #2. Note that SMEP provides no protection here, unlike
- * user-space-poisoned RSB entries.
+ * - User->kernel and guest->host attacks are mitigated by eIBRS or
+ * RSB filling.
*
- * eIBRS should protect against RSB poisoning, but if the EIBRS_PBRSB
- * bug is present then a LITE version of RSB protection is required,
- * just a single call needs to retire before a RET is executed.
+ * Though, depending on config, note that other alternative
+ * mitigations may end up getting used instead, e.g., IBPB on
+ * entry/vmexit, call depth tracking, or return thunks.
*/
+
switch (mode) {
case SPECTRE_V2_NONE:
break;
@@ -1832,44 +1832,6 @@ static void __init spectre_v2_select_mitigation(void)
spectre_v2_enabled = mode;
pr_info("%s\n", spectre_v2_strings[mode]);
- /*
- * If Spectre v2 protection has been enabled, fill the RSB during a
- * context switch. In general there are two types of RSB attacks
- * across context switches, for which the CALLs/RETs may be unbalanced.
- *
- * 1) RSB underflow
- *
- * Some Intel parts have "bottomless RSB". When the RSB is empty,
- * speculated return targets may come from the branch predictor,
- * which could have a user-poisoned BTB or BHB entry.
- *
- * AMD has it even worse: *all* returns are speculated from the BTB,
- * regardless of the state of the RSB.
- *
- * When IBRS or eIBRS is enabled, the "user -> kernel" attack
- * scenario is mitigated by the IBRS branch prediction isolation
- * properties, so the RSB buffer filling wouldn't be necessary to
- * protect against this type of attack.
- *
- * The "user -> user" attack scenario is mitigated by RSB filling.
- *
- * 2) Poisoned RSB entry
- *
- * If the 'next' in-kernel return stack is shorter than 'prev',
- * 'next' could be tricked into speculating with a user-poisoned RSB
- * entry.
- *
- * The "user -> kernel" attack scenario is mitigated by SMEP and
- * eIBRS.
- *
- * The "user -> user" scenario, also known as SpectreBHB, requires
- * RSB clearing.
- *
- * So to mitigate all cases, unconditionally fill RSB on context
- * switches.
- *
- * FIXME: Is this pointless for retbleed-affected AMD?
- */
spectre_v2_select_rsb_mitigation(mode);
/*
--
2.48.1
On 02/04/2025 7:19 pm, Josh Poimboeuf wrote: > Create a document to summarize hard-earned knowledge about RSB-related > mitigations, with references, and replace the overly verbose yet > incomplete comments with a reference to the document. > > Signed-off-by: Josh Poimboeuf <jpoimboe@kernel.org> Excellent. I've been trying to do something like this for a while, and not managing to get this far. > diff --git a/Documentation/admin-guide/hw-vuln/rsb.rst b/Documentation/admin-guide/hw-vuln/rsb.rst > new file mode 100644 > index 000000000000..97bf75993d5d > --- /dev/null > +++ b/Documentation/admin-guide/hw-vuln/rsb.rst > @@ -0,0 +1,241 @@ > +.. SPDX-License-Identifier: GPL-2.0 > + > +======================= > +RSB-related mitigations > +======================= > + > +.. warning:: > + Please keep this document up-to-date, otherwise you will be > + volunteered to update it and convert it to a very long comment in > + bugs.c! > + > +Since 2018 there have been many Spectre CVEs related to the Return Stack > +Buffer (RSB). 2017. I, and many others, spent the whole second half of 2017 tied up in all of this. I'd drop the Spectre, or swap it for Speculation-related. Simply "Spectre" CVEs tend to focus on the conditional and indirect predictors. > Information about these CVEs and how to mitigate them is > +scattered amongst a myriad of microarchitecture-specific documents. You should note that the AMD terms RAS and RAP are the same thing, considering that you link to the documents. > + > +This document attempts to consolidate all the relevant information in > +once place and clarify the reasoning behind the current RSB-related > +mitigations. > + > +At a high level, there are two classes of RSB attacks: RSB poisoning > +(Intel and AMD) and RSB underflow (Intel only). They must each be > +considered individually for each attack vector (and microarchitecture > +where applicable). > + One question before getting further. This seems to be focused on (mis)prediction of RETs ? That's fine, but it wants spelling out, because it is distinct from the other class of issues when RET instructions execute in bad ways. When lecturing, my go-to example is Spectre-v1.1 / BCBS (store which clobbers or aliases the return address), because an astounding number of things can go wrong in different ways from there. Next, before diving into the specifics, it's incredibly relevant to have a section briefly describing how the RSB typically works. It's key to understanding the rest of the documents, and there will definitely be people reading the document who don't know it. The salient points are (on CPUs since ~Nehalem era. Confirm with Intel and AMD. You can spot it in the optimisation guides, because it's where the phrase such as "only taken branches consume prediction resource" began appearing): * Branch prediction is **prior** to instruction decode, and guesses at the location, type, and target of all near branches. * The RSB is a prediction structure used by branches predicted as CALLs or RETs. * When a CALL is predicted, the predicted return-address is pushed on the RSB * When a RET is predicted, the RSB is popped * Later, decode will cross-check the prediction with the instruction stream. It can issue corrections to the predictor state, and restart prediction/fetch from the point that things appeared to go wrong. This can include editing transient state in the RSB. For the observant reader, yes, the RSB is filled using predicted targets. This is why the SRSO vuln is so evil. So, with the behaviour summarised, next some properties (disclaimer: varies by vendor) * It is logically a stack, but has finite capacity. Executing more RET instructions than CALLs will underflow it. ** AMD reuses the -1'th entry and doesn't move the pointer ** Intel may fall back to a prediction from a different predictor * It is a structure shared across all security domains in Core/Thread. Guest & Host is most relevant to the doc, but SMM/ACM/SEAM/XuCode are all included. ** Some AMD CPUs dynamically re-partition the RSB(RAS) when a sibling thread goes idle ** Some Intel CPUs only have a 32-bit wide RSB, and reuse the upper bits of the location for the predicted target ** Some Intel CPUs hardwire bit 47(?) which causes the kernel to follow userspace predictions. > +---- > + > +RSB poisoning (Intel and AMD) > +============================= > + > +SpectreRSB > +~~~~~~~~~~ > + > +RSB poisoning is a technique used by Spectre-RSB [#spectre-rsb]_ where > +an attacker poisons an RSB entry to cause a victim's return instruction > +to speculate to an attacker-controlled address. This can happen when > +there are unbalanced CALLs/RETs after a context switch or VMEXIT. > + > +* All attack vectors can potentially be mitigated by flushing out any > + poisoned RSB entries using an RSB filling sequence > + [#intel-rsb-filling]_ [#amd-rsb-filling]_ when transitioning between > + untrusted and trusted domains. But this has a performance impact and > + should be avoided whenever possible. More importantly, 32-entry RSB stuffing loops are only applicable to pre-eIBRS and pre-ERAPS hardware. They are known unsafe to use on newer microarchitectures, inc Gracemont (128 entries) and Zen5 (64 entries). > + > +* On context switch, the user->user mitigation requires ensuring the > + RSB gets filled or cleared whenever IBPB gets written [#cond-ibpb]_ > + during a context switch: > + > + * AMD: > + IBPB (or SBPB [#amd-sbpb]_ if used) automatically clears the RSB > + if IBPB_RET is set in CPUID [#amd-ibpb-rsb]_. Otherwise the RSB > + filling sequence [#amd-rsb-filling]_ must be always be done in > + addition to IBPB. Honestly, I dislike this way of characterising it. IBPB was very clearly spec'd to flush all indirect prediction structures, and some AMD CPUs have an errata where this isn't true and has to be filled in by the OS. > + > + * Intel: > + IBPB automatically clears the RSB: > + > + "Software that executed before the IBPB command cannot control > + the predicted targets of indirect branches executed after the > + command on the same logical processor. The term indirect branch > + in this context includes near return instructions, so these > + predicted targets may come from the RSB." [#intel-ibpb-rsb]_ > + > +* On context switch, user->kernel attacks are mitigated by SMEP, as user > + space can only insert its own return addresses into the RSB: It's more subtle than this (see the 32-bit wide prediction). A user/supervisor split address space limits the ranges of addresses that userspace can insert into the predictor. There is a corner case at the canonical boundary. Userspace can insert the first non-canonincal address, and on some CPUs, this is interpreted as the first high canonical address. Guard pages on both sides of the canonical boundary mitigate this. In the unbalanced case for user->kernel, a bad prediction really is made (coming out of the RSB), and it's only the SMEP #PF at instruction fetch which prevents you speculatively executing in userspace. In the 32-bit width case, the kernel predicts to {high_kern:low_user} target. > + > + * AMD: > + "Finally, branches that are predicted as 'ret' instructions get > + their predicted targets from the Return Address Predictor (RAP). > + AMD recommends software use a RAP stuffing sequence (mitigation > + V2-3 in [2]) and/or Supervisor Mode Execution Protection (SMEP) > + to ensure that the addresses in the RAP are safe for > + speculation. Collectively, we refer to these mitigations as "RAP > + Protection"." [#amd-smep-rsb]_ > + > + * Intel: > + "On processors with enhanced IBRS, an RSB overwrite sequence may > + not suffice to prevent the predicted target of a near return > + from using an RSB entry created in a less privileged predictor > + mode. Software can prevent this by enabling SMEP (for > + transitions from user mode to supervisor mode) and by having > + IA32_SPEC_CTRL.IBRS set during VM exits." [#intel-smep-rsb]_ > + > +* On VMEXIT, guest->host attacks are mitigated by eIBRS (and PBRSB > + mitigation if needed): > + > + * AMD: > + "When Automatic IBRS is enabled, the internal return address > + stack used for return address predictions is cleared on VMEXIT." > + [#amd-eibrs-vmexit]_ > + > + * Intel: > + "On processors with enhanced IBRS, an RSB overwrite sequence may > + not suffice to prevent the predicted target of a near return > + from using an RSB entry created in a less privileged predictor > + mode. Software can prevent this by enabling SMEP (for > + transitions from user mode to supervisor mode) and by having > + IA32_SPEC_CTRL.IBRS set during VM exits. Processors with > + enhanced IBRS still support the usage model where IBRS is set > + only in the OS/VMM for OSes that enable SMEP. To do this, such > + processors will ensure that guest behavior cannot control the > + RSB after a VM exit once IBRS is set, even if IBRS was not set > + at the time of the VM exit." [#intel-eibrs-vmexit]_ > + > + Note that some Intel CPUs are susceptible to Post-barrier Return > + Stack Buffer Predictions (PBRSB)[#intel-pbrsb]_, where the last CALL > + from the guest can be used to predict the first unbalanced RET. In > + this case the PBRSB mitigation is needed in addition to eIBRS. > + > +AMD Retbleed / SRSO / Branch Type Confusion > +~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ > + > +On AMD, poisoned RSB entries can also be created by the AMD Retbleed > +variant [#retbleed-paper]_ and/or Speculative Return Stack Overflow > +[#amd-srso]_ (Inception [#inception-paper]_). These attacks are made > +possible by Branch Type Confusion [#amd-btc]_. The kernel protects > +itself by replacing every RET in the kernel with a branch to a single > +safe RET. BTC and SRSO are unrelated things. "predicted branch types" is an inherent property of the predictor. BTC is specifically the case where decode doesn't halt, and still issues the wrong uops into the pipeline to execute. SRSO goes entirely wrong at the BP/IF stages (BP racing ahead while IF is stalled unable to fetch anything), so the damage is done by the time Decode sees the instruction stream. You also need to cover the AMD going-idle issue where the other thread's RSB entries magically appear in my RSB, and my head/tail pointer is reset to a random position. > + > +---- > + > +RSB underflow (Intel only) > +========================== Well, not really. AMD can underflow too. It just picks a fixed entry and keeps on reusing that. (Great for the alarming number of programming languages which consider recursion a virtue.) > + > +Intel Retbleed > +~~~~~~~~~~~~~~ > + > +Some Intel Skylake-generation CPUs are susceptible to the Intel variant > +of Retbleed [#retbleed-paper]_ (Return Stack Buffer Underflow > +[#intel-rsbu]_). If a RET is executed when the RSB buffer is empty due > +to mismatched CALLs/RETs or returning from a deep call stack, the branch > +predictor can fall back to using the Branch Target Buffer (BTB). If a > +user forces a BTB collision then the RET can speculatively branch to a > +user-controlled address. > + > +* Note that RSB filling doesn't fully mitigate this issue. If there > + are enough unbalanced RETs, the RSB may still underflow and fall back > + to using a poisoned BTB entry. > + > +* On context switch, user->user underflow attacks are mitigated by the > + conditional IBPB [#cond-ibpb]_ on context switch which clears the BTB: > + > + * "The indirect branch predictor barrier (IBPB) is an indirect branch > + control mechanism that establishes a barrier, preventing software > + that executed before the barrier from controlling the predicted > + targets of indirect branches executed after the barrier on the same > + logical processor." [#intel-ibpb-btb]_ > + > + .. note:: > + I wasn't able to find any offical documentation from Intel > + explicitly stating that IBPB clears the BTB. However, it's > + broadly known to be true and relied upon in several mitigations. Part of this is because when the vendors say the BTB, they're translating their internal names into what academia calls them. "Flush the BTB" isn't a helpful statement anyway. See AMD's IBPB vs SBPB which controls whether the branch types predictions remain intact. Given how many rounds of Intel microcode there have been making IBPB scrub more, it clearly wasn't scrubbing everything in the first place. > + > +* On context switch and VMEXIT, user->kernel and guest->host underflows > + are mitigated by IBRS or eIBRS: > + > + * "Enabling IBRS (including enhanced IBRS) will mitigate the "RSBU" > + attack demonstrated by the researchers. Yeah, except it doesn't. Intra-mode BTI is a thing, and that will leak your secrets too. > As previously documented, > + Intel recommends the use of enhanced IBRS, where supported. This > + includes any processor that enumerates RRSBA but not RRSBA_DIS_S." > + [#intel-rsbu]_ > + > + As an alternative to classic IBRS, call depth tracking can be used to legacy IBRS. Please don't invent yet another term for it :), and "classic" further implies there might be a time when anyone looks back fondly on it. > + track kernel returns and fill the RSB when it gets close to being > + empty. > + > +Restricted RSB Alternate (RRSBA) > +~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ This needs to discuss RSBA and RRSBA. They're distinct. To a first approximation, RRSBA == RSBA + eIBRS. The "restricted" nature is "mode-tagged predictions". Except there's a narrow range of CPUs around Icelake which have eIBRS and do not suffer (R)RSBA. > + > +Some newer Intel CPUs have Restricted RSB Alternate (RRSBA) behavior, > +which, similar to the Intel variant of Retbleed described above, also > +falls back to using the BTB on RSB underflow. The only difference is > +that the predicted targets are restricted to the current domain. > + > +* "Restricted RSB Alternate (RRSBA) behavior allows alternate branch > + predictors to be used by near RET instructions when the RSB is > + empty. When eIBRS is enabled, the predicted targets of these > + alternate predictors are restricted to those belonging to the > + indirect branch predictor entries of the current prediction domain. > + [#intel-eibrs-rrsba]_ > + > +When a CPU with RRSBA is vulnerable to Branch History Injection > +[#bhi-paper]_ [#intel-bhi]_, an RSB underflow could be used for an > +intra-mode BTI attack. This is mitigated by clearing the BHB on > +kernel entry. > + > +However if the kernel uses retpolines instead of eIBRS, it needs to > +disable RRSBA: > + > +* "Where software is using retpoline as a mitigation for BHI or > + intra-mode BTI, and the processor both enumerates RRSBA and > + enumerates RRSBA_DIS controls, it should disable this behavior. " > + [#intel-retpoline-rrsba]_ IIRC, not all CPUs which suffer RRSBA have the RRSBA_DIS_* controls. But, I think that's enough nitpicking for now. I really think some Intel and AMD architects ought to weigh in too. ~Andrew
On Wed, Apr 02, 2025 at 11:19:23AM -0700, Josh Poimboeuf wrote: > + Note that some Intel CPUs are susceptible to Post-barrier Return > + Stack Buffer Predictions (PBRSB)[#intel-pbrsb]_, where the last CALL > + from the guest can be used to predict the first unbalanced RET. In > + this case the PBRSB mitigation is needed in addition to eIBRS. I get Sphinx unreferenced footnote warning: Documentation/admin-guide/hw-vuln/rsb.rst:221: WARNING: Footnote [#] is not referenced. [ref.footnote] To fix that, I have to separate the footnote: ---- >8 ---- diff --git a/Documentation/admin-guide/hw-vuln/rsb.rst b/Documentation/admin-guide/hw-vuln/rsb.rst index 97bf75993d5d43..dd727048b00204 100644 --- a/Documentation/admin-guide/hw-vuln/rsb.rst +++ b/Documentation/admin-guide/hw-vuln/rsb.rst @@ -102,7 +102,7 @@ there are unbalanced CALLs/RETs after a context switch or VMEXIT. at the time of the VM exit." [#intel-eibrs-vmexit]_ Note that some Intel CPUs are susceptible to Post-barrier Return - Stack Buffer Predictions (PBRSB)[#intel-pbrsb]_, where the last CALL + Stack Buffer Predictions (PBRSB) [#intel-pbrsb]_, where the last CALL from the guest can be used to predict the first unbalanced RET. In this case the PBRSB mitigation is needed in addition to eIBRS. Thanks. -- An old man doll... just what I always wanted! - Clara
On Thu, Apr 03, 2025 at 09:57:29AM +0700, Bagas Sanjaya wrote: > On Wed, Apr 02, 2025 at 11:19:23AM -0700, Josh Poimboeuf wrote: > > + Note that some Intel CPUs are susceptible to Post-barrier Return > > + Stack Buffer Predictions (PBRSB)[#intel-pbrsb]_, where the last CALL > > + from the guest can be used to predict the first unbalanced RET. In > > + this case the PBRSB mitigation is needed in addition to eIBRS. > > I get Sphinx unreferenced footnote warning: > > Documentation/admin-guide/hw-vuln/rsb.rst:221: WARNING: Footnote [#] is not referenced. [ref.footnote] > > To fix that, I have to separate the footnote: > > ---- >8 ---- > diff --git a/Documentation/admin-guide/hw-vuln/rsb.rst b/Documentation/admin-guide/hw-vuln/rsb.rst > index 97bf75993d5d43..dd727048b00204 100644 > --- a/Documentation/admin-guide/hw-vuln/rsb.rst > +++ b/Documentation/admin-guide/hw-vuln/rsb.rst > @@ -102,7 +102,7 @@ there are unbalanced CALLs/RETs after a context switch or VMEXIT. > at the time of the VM exit." [#intel-eibrs-vmexit]_ > > Note that some Intel CPUs are susceptible to Post-barrier Return > - Stack Buffer Predictions (PBRSB)[#intel-pbrsb]_, where the last CALL > + Stack Buffer Predictions (PBRSB) [#intel-pbrsb]_, where the last CALL > from the guest can be used to predict the first unbalanced RET. In > this case the PBRSB mitigation is needed in addition to eIBRS. Fixed, thanks! -- Josh
* Josh Poimboeuf <jpoimboe@kernel.org> wrote: > Create a document to summarize hard-earned knowledge about RSB-related > mitigations, with references, and replace the overly verbose yet > incomplete comments with a reference to the document. Just a few nits: > +RSB poisoning (Intel and AMD) > +============================= > + > +SpectreRSB > +~~~~~~~~~~ > >+ >+RSB poisoning is a technique used by Spectre-RSB [#spectre-rsb]_ where >+an attacker poisons an RSB entry to cause a victim's return instruction >+to speculate to an attacker-controlled address. This can happen when >+there are unbalanced CALLs/RETs after a context switch or VMEXIT. s/Spectre-RSB /SpectreRSB Which is the name the title just a few lines above uses, and which appears to be broadly the in-tree consensus spelling as well. > + > +AMD Retbleed / SRSO / Branch Type Confusion > +~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ Nit: the underline is one character too long. :-) > +On AMD, poisoned RSB entries can also be created by the AMD Retbleed > +variant [#retbleed-paper]_ and/or Speculative Return Stack Overflow > +[#amd-srso]_ (Inception [#inception-paper]_). These attacks are made > +possible by Branch Type Confusion [#amd-btc]_. The kernel protects > +itself by replacing every RET in the kernel with a branch to a single > +safe RET. s/Retbleed /RETBleed Seems to be the consensus spelling in-tree. (There's a few more cases in this document as well.) > + * WARNING! There are many subtleties to consider when changing *any* > + * code related to RSB-related mitigations. Before doing so, carefully > + * read the following document, and update if necessary: > * > + * Documentation/admin-guide/hw-vuln/rsb.rst > * > + * In an overly simplified nutshell: > * > + * - User->user RSB attacks are conditionally mitigated during > + * context switch by cond_mitigation -> __write_ibpb(). s/during context switch /during context switches > * > + * - User->kernel and guest->host attacks are mitigated by eIBRS or > + * RSB filling. > * > + * Though, depending on config, note that other alternative > + * mitigations may end up getting used instead, e.g., IBPB on > + * entry/vmexit, call depth tracking, or return thunks. > */ s/__write_ibpb() /write_ibpb() as per the discussion under patch #1. Thanks, Ingo
On Wed, Apr 02, 2025 at 09:58:59PM +0200, Ingo Molnar wrote: > > * Josh Poimboeuf <jpoimboe@kernel.org> wrote: > > > Create a document to summarize hard-earned knowledge about RSB-related > > mitigations, with references, and replace the overly verbose yet > > incomplete comments with a reference to the document. > > Just a few nits: Ack, thanks! -- Josh
© 2016 - 2025 Red Hat, Inc.