Add a new memory page allocator that implements the cache coloring mechanism.
The allocation algorithm enforces equal frequency distribution of cache
partitions, following the coloring configuration of a domain. This allows
for an even utilization of cache sets for every domain.
Pages are stored in a color-indexed array of lists. Those lists are filled
by a simple init function which computes the color of each page.
When a domain requests a page, the allocator extracts the page from the list
with the maximum number of free pages among those that the domain can access,
given its coloring configuration.
The allocator can only handle requests of order-0 pages. This allows for
easier implementation and since cache coloring targets only embedded systems,
it's assumed not to be a major problem.
The buddy allocator must coexist with the colored one because the Xen heap
isn't colored. For this reason a new Kconfig option and a command line
parameter are added to let the user set the amount of memory reserved for
the buddy allocator. Even when cache coloring is enabled, this memory
isn't managed by the colored allocator.
Colored heap information is dumped in the dump_heap() debug-key function.
Based on original work from: Luca Miccio <lucmiccio@gmail.com>
Signed-off-by: Marco Solieri <marco.solieri@minervasys.tech>
Signed-off-by: Carlo Nonato <carlo.nonato@minervasys.tech>
---
v10:
- stated explicit dependency on CONFIG_LLC_COLORING for buddy-alloc-size
- fix for MISRA rule 20.7 parenthesis
v9:
- added ASSERT(order == 0) when freeing a colored page
- moved buddy_alloc_size initialization logic in Kconfig
v8:
- requests that uses MEMF_* flags that can't be served are now going to fail
- free_color_heap_page() is called directly from free_heap_pages()
v7:
- requests to alloc_color_heap_page() now fail if MEMF_bits is used
v6:
- colored allocator functions are now static
v5:
- Carlo Nonato as the new author
- the colored allocator balances color usage for each domain and it searches
linearly only in the number of colors (FIXME removed)
- addedd scrub functionality
- removed stub functions (still requires some macro definition)
- addr_to_color turned to mfn_to_color for easier operations
- removed BUG_ON in init_color_heap_pages() in favor of panic()
- only non empty page lists are logged in dump_color_heap()
v4:
- moved colored allocator code after buddy allocator because it now has
some dependencies on buddy functions
- buddy_alloc_size is now used only by the colored allocator
- fixed a bug that allowed the buddy to merge pages when they were colored
- free_color_heap_page() now calls mark_page_free()
- free_color_heap_page() uses of the frametable array for faster searches
- added FIXME comment for the linear search in free_color_heap_page()
- removed alloc_color_domheap_page() to let the colored allocator exploit
some more buddy allocator code
- alloc_color_heap_page() now allocs min address pages first
- reduced the mess in end_boot_allocator(): use the first loop for
init_color_heap_pages()
- fixed page_list_add_prev() (list.h) since it was doing the opposite of
what it was supposed to do
- fixed page_list_add_prev() (non list.h) to check also for next existence
- removed unused page_list_add_next()
- moved p2m code in another patch
---
docs/misc/cache-coloring.rst | 37 ++++++
docs/misc/xen-command-line.pandoc | 14 +++
xen/arch/arm/include/asm/mm.h | 5 +
xen/common/Kconfig | 8 ++
xen/common/llc-coloring.c | 13 +++
xen/common/page_alloc.c | 188 +++++++++++++++++++++++++++++-
xen/include/xen/llc-coloring.h | 4 +
7 files changed, 265 insertions(+), 4 deletions(-)
diff --git a/docs/misc/cache-coloring.rst b/docs/misc/cache-coloring.rst
index c14b6ee2f4..25f015c2de 100644
--- a/docs/misc/cache-coloring.rst
+++ b/docs/misc/cache-coloring.rst
@@ -11,6 +11,9 @@ To compile LLC coloring support set ``CONFIG_LLC_COLORING=y``.
If needed, change the maximum number of colors with
``CONFIG_LLC_COLORS_ORDER=<n>``.
+If needed, change the buddy allocator reserved size with
+``CONFIG_BUDDY_ALLOCATOR_SIZE=<n>``.
+
Runtime configuration is done via `Command line parameters`_.
For DomUs follow `DomUs configuration`_.
@@ -108,6 +111,8 @@ Specific documentation is available at `docs/misc/xen-command-line.pandoc`.
+----------------------+-------------------------------+
| ``dom0-llc-colors`` | Dom0 color configuration |
+----------------------+-------------------------------+
+| ``buddy-alloc-size`` | Buddy allocator reserved size |
++----------------------+-------------------------------+
Colors selection format
***********************
@@ -195,6 +200,17 @@ the ``llc-colors`` option. For example:
**Note:** If no color configuration is provided for a domain, the default one,
which corresponds to all available colors is used instead.
+Colored allocator and buddy allocator
+*************************************
+
+The colored allocator distributes pages based on color configurations of
+domains so that each domains only gets pages of its own colors.
+The colored allocator is meant as an alternative to the buddy allocator because
+its allocation policy is by definition incompatible with the generic one. Since
+the Xen heap is not colored yet, we need to support the coexistence of the two
+allocators and some memory must be left for the buddy one. Buddy memory
+reservation is configured via Kconfig or via command-line.
+
Known issues and limitations
****************************
@@ -205,3 +221,24 @@ In the domain configuration, "xen,static-mem" allows memory to be statically
allocated to the domain. This isn't possible when LLC coloring is enabled,
because that memory can't be guaranteed to use only colors assigned to the
domain.
+
+Cache coloring is intended only for embedded systems
+####################################################
+
+The current implementation aims to satisfy the need of predictability in
+embedded systems with small amount of memory to be managed in a colored way.
+Given that, some shortcuts are taken in the development. Expect worse
+performances on larger systems.
+
+Colored allocator can only make use of order-0 pages
+####################################################
+
+The cache coloring technique relies on memory mappings and on the smallest
+mapping granularity to achieve the maximum number of colors (cache partitions)
+possible. This granularity is what is normally called a page and, in Xen
+terminology, the order-0 page is the smallest one. The fairly simple
+colored allocator currently implemented, makes use only of such pages.
+It must be said that a more complex one could, in theory, adopt higher order
+pages if the colors selection contained adjacent colors. Two subsequent colors,
+for example, can be represented by an order-1 page, four colors correspond to
+an order-2 page, etc.
diff --git a/docs/misc/xen-command-line.pandoc b/docs/misc/xen-command-line.pandoc
index bfdc8b0002..3a70c49c05 100644
--- a/docs/misc/xen-command-line.pandoc
+++ b/docs/misc/xen-command-line.pandoc
@@ -270,6 +270,20 @@ and not running softirqs. Reduce this if softirqs are not being run frequently
enough. Setting this to a high value may cause boot failure, particularly if
the NMI watchdog is also enabled.
+### buddy-alloc-size (arm64)
+> `= <size>`
+
+> Default: `64M`
+
+Amount of memory reserved for the buddy allocator when colored allocator is
+active. This option is available only when `CONFIG_LLC_COLORING` is enabled.
+The colored allocator is meant as an alternative to the buddy allocator,
+because its allocation policy is by definition incompatible with the generic
+one. Since the Xen heap systems is not colored yet, we need to support the
+coexistence of the two allocators for now. This parameter, which is optional
+and for expert only, it's used to set the amount of memory reserved to the
+buddy allocator.
+
### cet
= List of [ shstk=<bool>, ibt=<bool> ]
diff --git a/xen/arch/arm/include/asm/mm.h b/xen/arch/arm/include/asm/mm.h
index 5abd4b0d1c..c1a5ac7bee 100644
--- a/xen/arch/arm/include/asm/mm.h
+++ b/xen/arch/arm/include/asm/mm.h
@@ -145,6 +145,11 @@ struct page_info
#else
#define PGC_static 0
#endif
+#ifdef CONFIG_LLC_COLORING
+/* Page is cache colored */
+#define _PGC_colored PG_shift(4)
+#define PGC_colored PG_mask(1, 4)
+#endif
/* ... */
/* Page is broken? */
#define _PGC_broken PG_shift(7)
diff --git a/xen/common/Kconfig b/xen/common/Kconfig
index b4ec6893be..bbd4738d4e 100644
--- a/xen/common/Kconfig
+++ b/xen/common/Kconfig
@@ -537,4 +537,12 @@ config LLC_COLORS_ORDER
The default value corresponds to an 8 MiB 16-ways LLC, which should be
more than what's needed in the general case.
+config BUDDY_ALLOCATOR_SIZE
+ int "Buddy allocator reserved memory size (MiB)" if LLC_COLORING
+ default "0" if !LLC_COLORING
+ default "64"
+ help
+ Amount of memory reserved for the buddy allocator to serve Xen heap,
+ working alongside the colored one.
+
endmenu
diff --git a/xen/common/llc-coloring.c b/xen/common/llc-coloring.c
index 14e056fb10..6ba9fb72ca 100644
--- a/xen/common/llc-coloring.c
+++ b/xen/common/llc-coloring.c
@@ -28,6 +28,9 @@ static unsigned int __ro_after_init default_colors[NR_LLC_COLORS];
static unsigned int __initdata dom0_colors[NR_LLC_COLORS];
static unsigned int __initdata dom0_num_colors;
+#define mfn_color_mask (max_nr_colors - 1)
+#define mfn_to_color(mfn) (mfn_x(mfn) & mfn_color_mask)
+
/*
* Parse the coloring configuration given in the buf string, following the
* syntax below.
@@ -311,6 +314,16 @@ int __init domain_set_llc_colors_from_str(struct domain *d, const char *str)
return 0;
}
+unsigned int page_to_llc_color(const struct page_info *pg)
+{
+ return mfn_to_color(page_to_mfn(pg));
+}
+
+unsigned int get_max_nr_llc_colors(void)
+{
+ return max_nr_colors;
+}
+
/*
* Local variables:
* mode: C
diff --git a/xen/common/page_alloc.c b/xen/common/page_alloc.c
index 34cd473150..27431a9bd2 100644
--- a/xen/common/page_alloc.c
+++ b/xen/common/page_alloc.c
@@ -159,8 +159,12 @@
#define PGC_static 0
#endif
-#define PGC_no_buddy_merge PGC_static
-#define PGC_preserved (PGC_extra | PGC_static | PGC_broken)
+#ifndef PGC_colored
+#define PGC_colored 0
+#endif
+
+#define PGC_no_buddy_merge (PGC_static | PGC_colored)
+#define PGC_preserved (PGC_extra | PGC_static | PGC_broken | PGC_colored)
#ifndef PGT_TYPE_INFO_INITIALIZER
#define PGT_TYPE_INFO_INITIALIZER 0
@@ -1457,6 +1461,8 @@ static bool mark_page_free(struct page_info *pg, mfn_t mfn)
return pg_offlined;
}
+static void free_color_heap_page(struct page_info *pg, bool need_scrub);
+
/* Free 2^@order set of pages. */
static void free_heap_pages(
struct page_info *pg, unsigned int order, bool need_scrub)
@@ -1481,6 +1487,15 @@ static void free_heap_pages(
pg[i].count_info |= PGC_need_scrub;
poison_one_page(&pg[i]);
}
+
+ if ( pg->count_info & PGC_colored )
+ {
+ ASSERT(order == 0);
+
+ free_color_heap_page(pg, need_scrub);
+ spin_unlock(&heap_lock);
+ return;
+ }
}
avail[node][zone] += 1 << order;
@@ -1945,6 +1960,155 @@ static unsigned long avail_heap_pages(
return free_pages;
}
+/*************************
+ * COLORED SIDE-ALLOCATOR
+ *
+ * Pages are grouped by LLC color in lists which are globally referred to as the
+ * color heap. Lists are populated in end_boot_allocator().
+ * After initialization there will be N lists where N is the number of
+ * available colors on the platform.
+ */
+static struct page_list_head *__ro_after_init _color_heap;
+#define color_heap(color) (&_color_heap[color])
+
+static unsigned long *__ro_after_init free_colored_pages;
+
+/* Memory required for buddy allocator to work with colored one */
+static unsigned long __initdata buddy_alloc_size =
+ MB(CONFIG_BUDDY_ALLOCATOR_SIZE);
+size_param("buddy-alloc-size", buddy_alloc_size);
+
+#ifdef CONFIG_LLC_COLORING
+#define domain_num_llc_colors(d) ((d)->num_llc_colors)
+#define domain_llc_color(d, i) ((d)->llc_colors[i])
+#else
+#define domain_num_llc_colors(d) 0
+#define domain_llc_color(d, i) 0
+#endif
+
+static void free_color_heap_page(struct page_info *pg, bool need_scrub)
+{
+ unsigned int color;
+
+ color = page_to_llc_color(pg);
+ free_colored_pages[color]++;
+ /*
+ * Head insertion allows re-using cache-hot pages in configurations without
+ * sharing of colors.
+ */
+ page_list_add(pg, color_heap(color));
+}
+
+static struct page_info *alloc_color_heap_page(unsigned int memflags,
+ const struct domain *d)
+{
+ struct page_info *pg = NULL;
+ unsigned int i, color = 0;
+ unsigned long max = 0;
+ bool need_tlbflush = false;
+ uint32_t tlbflush_timestamp = 0;
+ bool need_scrub;
+
+ if ( memflags & ~(MEMF_no_refcount | MEMF_no_owner | MEMF_no_tlbflush |
+ MEMF_no_icache_flush | MEMF_no_scrub) )
+ return NULL;
+
+ spin_lock(&heap_lock);
+
+ for ( i = 0; i < domain_num_llc_colors(d); i++ )
+ {
+ unsigned long free = free_colored_pages[domain_llc_color(d, i)];
+
+ if ( free > max )
+ {
+ color = domain_llc_color(d, i);
+ pg = page_list_first(color_heap(color));
+ max = free;
+ }
+ }
+
+ if ( !pg )
+ {
+ spin_unlock(&heap_lock);
+ return NULL;
+ }
+
+ need_scrub = pg->count_info & PGC_need_scrub;
+ pg->count_info = PGC_state_inuse | (pg->count_info & PGC_preserved);
+ free_colored_pages[color]--;
+ page_list_del(pg, color_heap(color));
+
+ if ( !(memflags & MEMF_no_tlbflush) )
+ accumulate_tlbflush(&need_tlbflush, pg, &tlbflush_timestamp);
+
+ init_free_page_fields(pg);
+
+ spin_unlock(&heap_lock);
+
+ if ( !(memflags & MEMF_no_scrub) )
+ {
+ if ( need_scrub )
+ scrub_one_page(pg);
+ else
+ check_one_page(pg);
+ }
+
+ if ( need_tlbflush )
+ filtered_flush_tlb_mask(tlbflush_timestamp);
+
+ flush_page_to_ram(mfn_x(page_to_mfn(pg)),
+ !(memflags & MEMF_no_icache_flush));
+
+ return pg;
+}
+
+static void __init init_color_heap_pages(struct page_info *pg,
+ unsigned long nr_pages)
+{
+ unsigned int i;
+ bool need_scrub = opt_bootscrub == BOOTSCRUB_IDLE;
+
+ if ( buddy_alloc_size >= PAGE_SIZE )
+ {
+ unsigned long buddy_pages = min(PFN_DOWN(buddy_alloc_size), nr_pages);
+
+ init_heap_pages(pg, buddy_pages);
+ nr_pages -= buddy_pages;
+ buddy_alloc_size -= buddy_pages << PAGE_SHIFT;
+ pg += buddy_pages;
+ }
+
+ if ( !_color_heap )
+ {
+ unsigned int max_nr_colors = get_max_nr_llc_colors();
+
+ _color_heap = xmalloc_array(struct page_list_head, max_nr_colors);
+ free_colored_pages = xzalloc_array(unsigned long, max_nr_colors);
+ if ( !_color_heap || !free_colored_pages )
+ panic("Can't allocate colored heap. Buddy reserved size is too low");
+
+ for ( i = 0; i < max_nr_colors; i++ )
+ INIT_PAGE_LIST_HEAD(color_heap(i));
+ }
+
+ for ( i = 0; i < nr_pages; i++ )
+ {
+ pg[i].count_info = PGC_colored;
+ free_color_heap_page(&pg[i], need_scrub);
+ }
+}
+
+static void dump_color_heap(void)
+{
+ unsigned int color;
+
+ printk("Dumping color heap info\n");
+ for ( color = 0; color < get_max_nr_llc_colors(); color++ )
+ if ( free_colored_pages[color] > 0 )
+ printk("Color heap[%u]: %lu pages\n",
+ color, free_colored_pages[color]);
+}
+
void __init end_boot_allocator(void)
{
unsigned int i;
@@ -1964,7 +2128,13 @@ void __init end_boot_allocator(void)
for ( i = nr_bootmem_regions; i-- > 0; )
{
struct bootmem_region *r = &bootmem_region_list[i];
- if ( r->s < r->e )
+
+ if ( r->s >= r->e )
+ continue;
+
+ if ( llc_coloring_enabled )
+ init_color_heap_pages(mfn_to_page(_mfn(r->s)), r->e - r->s);
+ else
init_heap_pages(mfn_to_page(_mfn(r->s)), r->e - r->s);
}
nr_bootmem_regions = 0;
@@ -2460,7 +2630,14 @@ struct page_info *alloc_domheap_pages(
if ( memflags & MEMF_no_owner )
memflags |= MEMF_no_refcount;
- if ( !dma_bitsize )
+ /* Only domains are supported for coloring */
+ if ( d && llc_coloring_enabled )
+ {
+ /* Colored allocation must be done on 0 order */
+ if ( order || (pg = alloc_color_heap_page(memflags, d)) == NULL )
+ return NULL;
+ }
+ else if ( !dma_bitsize )
memflags &= ~MEMF_no_dma;
else if ( (dma_zone = bits_to_zone(dma_bitsize)) < zone_hi )
pg = alloc_heap_pages(dma_zone + 1, zone_hi, order, memflags, d);
@@ -2686,6 +2863,9 @@ static void cf_check dump_heap(unsigned char key)
continue;
printk("Node %d has %lu unscrubbed pages\n", i, node_need_scrub[i]);
}
+
+ if ( llc_coloring_enabled )
+ dump_color_heap();
}
static __init int cf_check register_heap_trigger(void)
diff --git a/xen/include/xen/llc-coloring.h b/xen/include/xen/llc-coloring.h
index 76d3cbae0e..90daf214cd 100644
--- a/xen/include/xen/llc-coloring.h
+++ b/xen/include/xen/llc-coloring.h
@@ -34,6 +34,10 @@ int domain_set_llc_colors(struct domain *d,
const struct xen_domctl_set_llc_colors *config);
int domain_set_llc_colors_from_str(struct domain *d, const char *str);
+struct page_info;
+unsigned int page_to_llc_color(const struct page_info *pg);
+unsigned int get_max_nr_llc_colors(void);
+
#endif /* __COLORING_H__ */
/*
--
2.43.0
On 19.11.2024 15:13, Carlo Nonato wrote: > --- a/docs/misc/xen-command-line.pandoc > +++ b/docs/misc/xen-command-line.pandoc > @@ -270,6 +270,20 @@ and not running softirqs. Reduce this if softirqs are not being run frequently > enough. Setting this to a high value may cause boot failure, particularly if > the NMI watchdog is also enabled. > > +### buddy-alloc-size (arm64) > +> `= <size>` > + > +> Default: `64M` > + > +Amount of memory reserved for the buddy allocator when colored allocator is > +active. This option is available only when `CONFIG_LLC_COLORING` is enabled. > +The colored allocator is meant as an alternative to the buddy allocator, > +because its allocation policy is by definition incompatible with the generic > +one. Since the Xen heap systems is not colored yet, we need to support the > +coexistence of the two allocators for now. This parameter, which is optional > +and for expert only, it's used to set the amount of memory reserved to the > +buddy allocator. Every time I see this, and I further see the title of patch 12, I'm confused, expecting that what's being said here needs adjusting (or even undoing) there. The issue is that patch 12's title says just "Xen" when, if I'm not mistaken, it really only means the Xen image. > --- a/xen/common/Kconfig > +++ b/xen/common/Kconfig > @@ -537,4 +537,12 @@ config LLC_COLORS_ORDER > The default value corresponds to an 8 MiB 16-ways LLC, which should be > more than what's needed in the general case. > > +config BUDDY_ALLOCATOR_SIZE > + int "Buddy allocator reserved memory size (MiB)" if LLC_COLORING > + default "0" if !LLC_COLORING > + default "64" > + help > + Amount of memory reserved for the buddy allocator to serve Xen heap, > + working alongside the colored one. As the description has nothing in this regard: Why again is it that this can't simply have "depends on LLC_COLORING"? Even if it ends up with a value of 0, in an LLC_COLORING=n (or LLC_COLORING entirely absent) .config I'd find it at least irritating to see this setting to be there. > @@ -1945,6 +1960,155 @@ static unsigned long avail_heap_pages( > return free_pages; > } > > +/************************* > + * COLORED SIDE-ALLOCATOR > + * > + * Pages are grouped by LLC color in lists which are globally referred to as the > + * color heap. Lists are populated in end_boot_allocator(). > + * After initialization there will be N lists where N is the number of > + * available colors on the platform. > + */ > +static struct page_list_head *__ro_after_init _color_heap; > +#define color_heap(color) (&_color_heap[color]) > + > +static unsigned long *__ro_after_init free_colored_pages; > + > +/* Memory required for buddy allocator to work with colored one */ > +static unsigned long __initdata buddy_alloc_size = > + MB(CONFIG_BUDDY_ALLOCATOR_SIZE); > +size_param("buddy-alloc-size", buddy_alloc_size); > + > +#ifdef CONFIG_LLC_COLORING According to the (updated) command line doc, this #ifdef needs to move up so the command line option indeed is unrecognized when !LLC_COLORING. Question then is whether the variable is actually needed. With the variable also moved into the #ifdef, the need for BUDDY_ALLOCATOR_SIZE also goes away when !LLC_COLORING (see the comment on the common/Kconfig change). Of course you'll then need "#ifndef buddy_alloc_size" in init_color_heap_pages(), around the respective if() there. > +#define domain_num_llc_colors(d) ((d)->num_llc_colors) > +#define domain_llc_color(d, i) ((d)->llc_colors[i]) > +#else > +#define domain_num_llc_colors(d) 0 > +#define domain_llc_color(d, i) 0 > +#endif > + > +static void free_color_heap_page(struct page_info *pg, bool need_scrub) > +{ > + unsigned int color; > + > + color = page_to_llc_color(pg); > + free_colored_pages[color]++; > + /* > + * Head insertion allows re-using cache-hot pages in configurations without > + * sharing of colors. > + */ > + page_list_add(pg, color_heap(color)); > +} Iirc it was me who had asked to keep this and further functions outside of #ifdef-s, for the compiler's DCE to take care of. However, with all that Misra work that has been going on since then, I now wonder what Misra thinks of this: When PGC_colored is 0, functions like the above are unreachable, and any code or data the compiler doesn't manage to eliminate would be dead. Imo if the code is to remain as is, correctness Misra-wise may want mentioning in the description (this isn't the only place we have such, so an overall position towards this is going to be needed). > +static void __init init_color_heap_pages(struct page_info *pg, > + unsigned long nr_pages) > +{ > + unsigned int i; > + bool need_scrub = opt_bootscrub == BOOTSCRUB_IDLE; > + > + if ( buddy_alloc_size >= PAGE_SIZE ) > + { > + unsigned long buddy_pages = min(PFN_DOWN(buddy_alloc_size), nr_pages); > + > + init_heap_pages(pg, buddy_pages); > + nr_pages -= buddy_pages; > + buddy_alloc_size -= buddy_pages << PAGE_SHIFT; > + pg += buddy_pages; > + } > + > + if ( !_color_heap ) > + { > + unsigned int max_nr_colors = get_max_nr_llc_colors(); > + > + _color_heap = xmalloc_array(struct page_list_head, max_nr_colors); > + free_colored_pages = xzalloc_array(unsigned long, max_nr_colors); At this point, xvmalloc() and friends want using by all new code, unless explicitly justified otherwise. > + if ( !_color_heap || !free_colored_pages ) > + panic("Can't allocate colored heap. Buddy reserved size is too low"); > + > + for ( i = 0; i < max_nr_colors; i++ ) > + INIT_PAGE_LIST_HEAD(color_heap(i)); While for this loop i being unsigned int is okay, I fear that ... > + } > + > + for ( i = 0; i < nr_pages; i++ ) > + { > + pg[i].count_info = PGC_colored; > + free_color_heap_page(&pg[i], need_scrub); > + } ... for this loop it isn't. Jan
Hi Jan, On Thu, Nov 28, 2024 at 12:43 PM Jan Beulich <jbeulich@suse.com> wrote: > > On 19.11.2024 15:13, Carlo Nonato wrote: > > --- a/docs/misc/xen-command-line.pandoc > > +++ b/docs/misc/xen-command-line.pandoc > > @@ -270,6 +270,20 @@ and not running softirqs. Reduce this if softirqs are not being run frequently > > enough. Setting this to a high value may cause boot failure, particularly if > > the NMI watchdog is also enabled. > > > > +### buddy-alloc-size (arm64) > > +> `= <size>` > > + > > +> Default: `64M` > > + > > +Amount of memory reserved for the buddy allocator when colored allocator is > > +active. This option is available only when `CONFIG_LLC_COLORING` is enabled. > > +The colored allocator is meant as an alternative to the buddy allocator, > > +because its allocation policy is by definition incompatible with the generic > > +one. Since the Xen heap systems is not colored yet, we need to support the > > +coexistence of the two allocators for now. This parameter, which is optional > > +and for expert only, it's used to set the amount of memory reserved to the > > +buddy allocator. > > Every time I see this, and I further see the title of patch 12, I'm confused, > expecting that what's being said here needs adjusting (or even undoing) there. > The issue is that patch 12's title says just "Xen" when, if I'm not mistaken, > it really only means the Xen image. I'll change the patch 12 title to reflect what you said. > > --- a/xen/common/Kconfig > > +++ b/xen/common/Kconfig > > @@ -537,4 +537,12 @@ config LLC_COLORS_ORDER > > The default value corresponds to an 8 MiB 16-ways LLC, which should be > > more than what's needed in the general case. > > > > +config BUDDY_ALLOCATOR_SIZE > > + int "Buddy allocator reserved memory size (MiB)" if LLC_COLORING > > + default "0" if !LLC_COLORING > > + default "64" > > + help > > + Amount of memory reserved for the buddy allocator to serve Xen heap, > > + working alongside the colored one. > > As the description has nothing in this regard: Why again is it that this > can't simply have "depends on LLC_COLORING"? Even if it ends up with a > value of 0, in an LLC_COLORING=n (or LLC_COLORING entirely absent) .config > I'd find it at least irritating to see this setting to be there. Quoting you from v8 (6 months ago, a lot of time ago I know, link here https://patchew.org/Xen/20240502165533.319988-1-carlo.nonato@minervasys.tech/20240502165533.319988-10-carlo.nonato@minervasys.tech/#5c16f53f-3bb0-49d6-b174-b0e8c6ceff4c@suse.com): > > +/* Memory required for buddy allocator to work with colored one */ > > +#ifdef CONFIG_LLC_COLORING > > +static unsigned long __initdata buddy_alloc_size = > > + MB(CONFIG_BUDDY_ALLOCATOR_SIZE); > > I think it would be quite nice if this and ... > > > +size_param("buddy-alloc-size", buddy_alloc_size); > > + > > +#define domain_num_llc_colors(d) (d)->num_llc_colors > > +#define domain_llc_color(d, i) (d)->llc_colors[i] > > +#else > > +static unsigned long __initdata buddy_alloc_size; > > ... this were folded. Which I think would be possible if the Kconfig > addition went like this: > > config BUDDY_ALLOCATOR_SIZE > int "Buddy allocator reserved memory size (MiB)" if LLC_COLORING > default "0" if !LLC_COLORING > default "64" But I know that... > > @@ -1945,6 +1960,155 @@ static unsigned long avail_heap_pages( > > return free_pages; > > } > > > > +/************************* > > + * COLORED SIDE-ALLOCATOR > > + * > > + * Pages are grouped by LLC color in lists which are globally referred to as the > > + * color heap. Lists are populated in end_boot_allocator(). > > + * After initialization there will be N lists where N is the number of > > + * available colors on the platform. > > + */ > > +static struct page_list_head *__ro_after_init _color_heap; > > +#define color_heap(color) (&_color_heap[color]) > > + > > +static unsigned long *__ro_after_init free_colored_pages; > > + > > +/* Memory required for buddy allocator to work with colored one */ > > +static unsigned long __initdata buddy_alloc_size = > > + MB(CONFIG_BUDDY_ALLOCATOR_SIZE); > > +size_param("buddy-alloc-size", buddy_alloc_size); > > + > > +#ifdef CONFIG_LLC_COLORING > > According to the (updated) command line doc, this #ifdef needs to move up > so the command line option indeed is unrecognized when !LLC_COLORING. > Question then is whether the variable is actually needed. With the variable > also moved into the #ifdef, the need for BUDDY_ALLOCATOR_SIZE also goes > away when !LLC_COLORING (see the comment on the common/Kconfig change). Of > course you'll then need "#ifndef buddy_alloc_size" in > init_color_heap_pages(), around the respective if() there. buddy-alloc-size must not be parsed when !LLC_COLORING. I'll change the code accordingly to what you said. > > +#define domain_num_llc_colors(d) ((d)->num_llc_colors) > > +#define domain_llc_color(d, i) ((d)->llc_colors[i]) > > +#else > > +#define domain_num_llc_colors(d) 0 > > +#define domain_llc_color(d, i) 0 > > +#endif > > + > > +static void free_color_heap_page(struct page_info *pg, bool need_scrub) > > +{ > > + unsigned int color; > > + > > + color = page_to_llc_color(pg); > > + free_colored_pages[color]++; > > + /* > > + * Head insertion allows re-using cache-hot pages in configurations without > > + * sharing of colors. > > + */ > > + page_list_add(pg, color_heap(color)); > > +} > > Iirc it was me who had asked to keep this and further functions outside of > #ifdef-s, for the compiler's DCE to take care of. However, with all that > Misra work that has been going on since then, I now wonder what Misra > thinks of this: When PGC_colored is 0, functions like the above are > unreachable, and any code or data the compiler doesn't manage to eliminate > would be dead. Imo if the code is to remain as is, correctness Misra-wise > may want mentioning in the description (this isn't the only place we have > such, so an overall position towards this is going to be needed). For the time being, I'll add a note that can be updated when the overall position is found. > > +static void __init init_color_heap_pages(struct page_info *pg, > > + unsigned long nr_pages) > > +{ > > + unsigned int i; > > + bool need_scrub = opt_bootscrub == BOOTSCRUB_IDLE; > > + > > + if ( buddy_alloc_size >= PAGE_SIZE ) > > + { > > + unsigned long buddy_pages = min(PFN_DOWN(buddy_alloc_size), nr_pages); > > + > > + init_heap_pages(pg, buddy_pages); > > + nr_pages -= buddy_pages; > > + buddy_alloc_size -= buddy_pages << PAGE_SHIFT; > > + pg += buddy_pages; > > + } > > + > > + if ( !_color_heap ) > > + { > > + unsigned int max_nr_colors = get_max_nr_llc_colors(); > > + > > + _color_heap = xmalloc_array(struct page_list_head, max_nr_colors); > > + free_colored_pages = xzalloc_array(unsigned long, max_nr_colors); > > At this point, xvmalloc() and friends want using by all new code, unless > explicitly justified otherwise. Yes. > > + if ( !_color_heap || !free_colored_pages ) > > + panic("Can't allocate colored heap. Buddy reserved size is too low"); > > + > > + for ( i = 0; i < max_nr_colors; i++ ) > > + INIT_PAGE_LIST_HEAD(color_heap(i)); > > While for this loop i being unsigned int is okay, I fear that ... > > > + } > > + > > + for ( i = 0; i < nr_pages; i++ ) > > + { > > + pg[i].count_info = PGC_colored; > > + free_color_heap_page(&pg[i], need_scrub); > > + } > > ... for this loop it isn't. Ok. > Jan Thanks. - Carlo
On 29.11.2024 10:37, Carlo Nonato wrote: > On Thu, Nov 28, 2024 at 12:43 PM Jan Beulich <jbeulich@suse.com> wrote: >> On 19.11.2024 15:13, Carlo Nonato wrote: >>> --- a/xen/common/Kconfig >>> +++ b/xen/common/Kconfig >>> @@ -537,4 +537,12 @@ config LLC_COLORS_ORDER >>> The default value corresponds to an 8 MiB 16-ways LLC, which should be >>> more than what's needed in the general case. >>> >>> +config BUDDY_ALLOCATOR_SIZE >>> + int "Buddy allocator reserved memory size (MiB)" if LLC_COLORING >>> + default "0" if !LLC_COLORING >>> + default "64" >>> + help >>> + Amount of memory reserved for the buddy allocator to serve Xen heap, >>> + working alongside the colored one. >> >> As the description has nothing in this regard: Why again is it that this >> can't simply have "depends on LLC_COLORING"? Even if it ends up with a >> value of 0, in an LLC_COLORING=n (or LLC_COLORING entirely absent) .config >> I'd find it at least irritating to see this setting to be there. > > Quoting you from v8 (6 months ago, a lot of time ago I know, link here > https://patchew.org/Xen/20240502165533.319988-1-carlo.nonato@minervasys.tech/20240502165533.319988-10-carlo.nonato@minervasys.tech/#5c16f53f-3bb0-49d6-b174-b0e8c6ceff4c@suse.com): >>> +/* Memory required for buddy allocator to work with colored one */ >>> +#ifdef CONFIG_LLC_COLORING >>> +static unsigned long __initdata buddy_alloc_size = >>> + MB(CONFIG_BUDDY_ALLOCATOR_SIZE); >> >> I think it would be quite nice if this and ... >> >>> +size_param("buddy-alloc-size", buddy_alloc_size); >>> + >>> +#define domain_num_llc_colors(d) (d)->num_llc_colors >>> +#define domain_llc_color(d, i) (d)->llc_colors[i] >>> +#else >>> +static unsigned long __initdata buddy_alloc_size; >> >> ... this were folded. Which I think would be possible if the Kconfig >> addition went like this: >> >> config BUDDY_ALLOCATOR_SIZE >> int "Buddy allocator reserved memory size (MiB)" if LLC_COLORING >> default "0" if !LLC_COLORING >> default "64" Hmm, I see. I didn't consider the negative effect back at the time, it seems. The main goal was anyway to get rid of the multiple instances of buddy_alloc_size. I'm sure that can be easily achieved also when going back to the original form of the Kconfig option (at least partly by following what I said in the initial reply to this v10 patch, I think). Jan
© 2016 - 2024 Red Hat, Inc.