From nobody Fri Oct 3 15:33:21 2025 Received: from szxga05-in.huawei.com (szxga05-in.huawei.com [45.249.212.191]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by smtp.subspace.kernel.org (Postfix) with ESMTPS id E706D30BF59 for ; Fri, 29 Aug 2025 10:14:36 +0000 (UTC) Authentication-Results: smtp.subspace.kernel.org; arc=none smtp.client-ip=45.249.212.191 ARC-Seal: i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1756462479; cv=none; b=HyZ52C723JBaiDeoFYvVB30HJ/+k7lSJuv+q/jHs70SiSvaAOq/2ky222DrIej1tZEXRWyEFmOvaw0pOfPzGp0DMkn/8IBCS0kFyRAvKxRJ9DF3Ed73LsHN7Nkfi2H3b4zPXiYNTDy4T4H+oi6J0gB7IyIssAcQB7OYL1lSiDoU= ARC-Message-Signature: i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1756462479; c=relaxed/simple; bh=SMGgIJw+/Qy2fL1oakEZWD7YQjuvN0t2/KUcGxTpuFg=; h=From:To:CC:Subject:Date:Message-ID:In-Reply-To:References: MIME-Version:Content-Type; b=RMqylqmdrkDwXcq4PehtcbOcuiLeUNhW5FwoiIiXXAQ84DctJhGG3lqAS0RdbQ3YPLObHNr3XHr7LINMAEtYPa+Ufr8ue67tEm1Djnt90AHcrqWzhgiZ3fQdbnmeXizQf58/Lkhll4reOt4DM7y6qkqtwAiOWHhfntPXKW2c3w8= ARC-Authentication-Results: i=1; smtp.subspace.kernel.org; dmarc=pass (p=quarantine dis=none) header.from=huawei.com; spf=pass smtp.mailfrom=huawei.com; arc=none smtp.client-ip=45.249.212.191 Authentication-Results: smtp.subspace.kernel.org; dmarc=pass (p=quarantine dis=none) header.from=huawei.com Authentication-Results: smtp.subspace.kernel.org; spf=pass smtp.mailfrom=huawei.com Received: from mail.maildlp.com (unknown [172.19.163.44]) by szxga05-in.huawei.com (SkyGuard) with ESMTP id 4cCvF03DvTz24j0G; Fri, 29 Aug 2025 18:11:32 +0800 (CST) Received: from dggemv712-chm.china.huawei.com (unknown [10.1.198.32]) by mail.maildlp.com (Postfix) with ESMTPS id 9817A1402CA; Fri, 29 Aug 2025 18:14:32 +0800 (CST) Received: from kwepemn100008.china.huawei.com (7.202.194.111) by dggemv712-chm.china.huawei.com (10.1.198.32) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.2.1544.11; Fri, 29 Aug 2025 18:14:32 +0800 Received: from localhost.huawei.com (10.90.31.46) by kwepemn100008.china.huawei.com (7.202.194.111) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.2.1544.11; Fri, 29 Aug 2025 18:14:31 +0800 From: Yushan Wang To: , , , CC: , , , , , , , , , Subject: [PATCH v3 7/9] drivers/perf: hisi: Add support for L3C PMU v3 Date: Fri, 29 Aug 2025 18:14:25 +0800 Message-ID: <20250829101427.2557899-8-wangyushan12@huawei.com> X-Mailer: git-send-email 2.33.0 In-Reply-To: <20250829101427.2557899-1-wangyushan12@huawei.com> References: <20250829101427.2557899-1-wangyushan12@huawei.com> Precedence: bulk X-Mailing-List: linux-kernel@vger.kernel.org List-Id: List-Subscribe: List-Unsubscribe: MIME-Version: 1.0 Content-Transfer-Encoding: quoted-printable X-ClientProxiedBy: kwepems100001.china.huawei.com (7.221.188.238) To kwepemn100008.china.huawei.com (7.202.194.111) Content-Type: text/plain; charset="utf-8" From: Yicong Yang This patch adds support for L3C PMU v3. The v3 L3C PMU supports an extended events space which can be controlled in up to 2 extra address spaces with separate overflow interrupts. The layout of the control/event registers are kept the same. The extended events with original ones together cover the monitoring job of all transactions on L3C. The extended events is specified with `ext=3D[1|2]` option for the driver to distinguish, like below: perf stat -e hisi_sccl0_l3c0_0/event=3D,ext=3D1/ Currently only event option using config bit [7, 0]. There's still plenty unused space. Make ext using config [16, 17] and reserve bit [15, 8] for event option for future extension. With the capability of extra counters, number of counters for HiSilicon uncore PMU could reach up to 24, the usedmap is extended accordingly. The hw_perf_event::event_base is initialized to the base MMIO address of the event and will be used for later control, overflow handling and counts readout. We still make use of the Uncore PMU framework for handling the events and interrupt migration on CPU hotplug. The framework's cpuhp callback will handle the event migration and interrupt migration of orginial event, if PMU supports extended events then the interrupt of extended events is migrated to the same CPU choosed by the framework. A new HID of HISI0215 is used for this version of L3C PMU. Acked-by: Jonathan Cameron Signed-off-by: Yicong Yang Co-developed-by: Yushan Wang Signed-off-by: Yushan Wang --- drivers/perf/hisilicon/hisi_uncore_l3c_pmu.c | 352 +++++++++++++++++-- drivers/perf/hisilicon/hisi_uncore_pmu.h | 2 +- 2 files changed, 324 insertions(+), 30 deletions(-) diff --git a/drivers/perf/hisilicon/hisi_uncore_l3c_pmu.c b/drivers/perf/hi= silicon/hisi_uncore_l3c_pmu.c index 7928b9bb3e7e..bbd81a43047d 100644 --- a/drivers/perf/hisilicon/hisi_uncore_l3c_pmu.c +++ b/drivers/perf/hisilicon/hisi_uncore_l3c_pmu.c @@ -39,6 +39,7 @@ =20 /* L3C has 8-counters */ #define L3C_NR_COUNTERS 0x8 +#define L3C_MAX_EXT 2 =20 #define L3C_PERF_CTRL_EN 0x10000 #define L3C_TRACETAG_EN BIT(31) @@ -55,24 +56,81 @@ #define L3C_V1_NR_EVENTS 0x59 #define L3C_V2_NR_EVENTS 0xFF =20 +HISI_PMU_EVENT_ATTR_EXTRACTOR(ext, config, 17, 16); HISI_PMU_EVENT_ATTR_EXTRACTOR(tt_req, config1, 10, 8); HISI_PMU_EVENT_ATTR_EXTRACTOR(datasrc_cfg, config1, 15, 11); HISI_PMU_EVENT_ATTR_EXTRACTOR(datasrc_skt, config1, 16, 16); HISI_PMU_EVENT_ATTR_EXTRACTOR(tt_core, config2, 15, 0); =20 +struct hisi_l3c_pmu { + struct hisi_pmu l3c_pmu; + + /* MMIO and IRQ resources for extension events */ + void __iomem *ext_base[L3C_MAX_EXT]; + int ext_irq[L3C_MAX_EXT]; + int ext_num; +}; + +#define to_hisi_l3c_pmu(_l3c_pmu) \ + container_of(_l3c_pmu, struct hisi_l3c_pmu, l3c_pmu) + +/* + * The hardware counter idx used in counter enable/disable, + * interrupt enable/disable and status check, etc. + */ +#define L3C_HW_IDX(_cntr_idx) ((_cntr_idx) % L3C_NR_COUNTERS) + +/* Range of ext counters in used mask. */ +#define L3C_CNTR_EXT_L(_ext) (((_ext) + 1) * L3C_NR_COUNTERS) +#define L3C_CNTR_EXT_H(_ext) (((_ext) + 2) * L3C_NR_COUNTERS) + +struct hisi_l3c_pmu_ext { + bool support_ext; +}; + +static bool support_ext(struct hisi_l3c_pmu *pmu) +{ + struct hisi_l3c_pmu_ext *l3c_pmu_ext =3D pmu->l3c_pmu.dev_info->private; + + return l3c_pmu_ext->support_ext; +} + static int hisi_l3c_pmu_get_event_idx(struct perf_event *event) { struct hisi_pmu *l3c_pmu =3D to_hisi_pmu(event->pmu); + struct hisi_l3c_pmu *hisi_l3c_pmu =3D to_hisi_l3c_pmu(l3c_pmu); unsigned long *used_mask =3D l3c_pmu->pmu_events.used_mask; - u32 num_counters =3D l3c_pmu->num_counters; + int ext =3D hisi_get_ext(event); int idx; =20 - idx =3D find_first_zero_bit(used_mask, num_counters); - if (idx =3D=3D num_counters) + /* + * For an L3C PMU that supports extension events, we can monitor + * maximum 2 * num_counters to 3 * num_counters events, depending on + * the number of ext regions supported by hardware. Thus use bit + * [0, num_counters - 1] for normal events and bit + * [ext * num_counters, (ext + 1) * num_counters - 1] for extension + * events. The idx allocation will keep unchanged for normal events and + * we can also use the idx to distinguish whether it's an extension + * event or not. + * + * Since normal events and extension events locates on the different + * address space, save the base address to the event->hw.event_base. + */ + if (ext && !support_ext(hisi_l3c_pmu)) + return -EOPNOTSUPP; + + if (ext) + event->hw.event_base =3D (unsigned long)hisi_l3c_pmu->ext_base[ext - 1]; + else + event->hw.event_base =3D (unsigned long)l3c_pmu->base; + + ext -=3D 1; + idx =3D find_next_zero_bit(used_mask, L3C_CNTR_EXT_H(ext), L3C_CNTR_EXT_L= (ext)); + + if (idx >=3D L3C_CNTR_EXT_H(ext)) return -EAGAIN; =20 set_bit(idx, used_mask); - event->hw.event_base =3D (unsigned long)l3c_pmu->base; =20 return idx; } @@ -143,7 +201,7 @@ static void hisi_l3c_pmu_write_ds(struct perf_event *ev= ent, u32 ds_cfg) { struct hw_perf_event *hwc =3D &event->hw; u32 reg, reg_idx, shift, val; - int idx =3D hwc->idx; + int idx =3D L3C_HW_IDX(hwc->idx); =20 /* * Select the appropriate datasource register(L3C_DATSRC_TYPE0/1). @@ -264,12 +322,24 @@ static void hisi_l3c_pmu_disable_filter(struct perf_e= vent *event) } } =20 +static int hisi_l3c_pmu_check_filter(struct perf_event *event) +{ + struct hisi_pmu *l3c_pmu =3D to_hisi_pmu(event->pmu); + struct hisi_l3c_pmu *hisi_l3c_pmu =3D to_hisi_l3c_pmu(l3c_pmu); + int ext =3D hisi_get_ext(event); + + if (ext < 0 || ext > hisi_l3c_pmu->ext_num) + return -EINVAL; + + return 0; +} + /* * Select the counter register offset using the counter index */ static u32 hisi_l3c_pmu_get_counter_offset(int cntr_idx) { - return (L3C_CNTR0_LOWER + (cntr_idx * 8)); + return L3C_CNTR0_LOWER + L3C_HW_IDX(cntr_idx) * 8; } =20 static u64 hisi_l3c_pmu_read_counter(struct hisi_pmu *l3c_pmu, @@ -290,6 +360,8 @@ static void hisi_l3c_pmu_write_evtype(struct hisi_pmu *= l3c_pmu, int idx, struct hw_perf_event *hwc =3D &l3c_pmu->pmu_events.hw_events[idx]->hw; u32 reg, reg_idx, shift, val; =20 + idx =3D L3C_HW_IDX(idx); + /* * Select the appropriate event select register(L3C_EVENT_TYPE0/1). * There are 2 event select registers for the 8 hardware counters. @@ -304,34 +376,70 @@ static void hisi_l3c_pmu_write_evtype(struct hisi_pmu= *l3c_pmu, int idx, /* Write event code to L3C_EVENT_TYPEx Register */ val =3D hisi_l3c_pmu_event_readl(hwc, reg); val &=3D ~(L3C_EVTYPE_NONE << shift); - val |=3D (type << shift); + val |=3D type << shift; hisi_l3c_pmu_event_writel(hwc, reg, val); } =20 static void hisi_l3c_pmu_start_counters(struct hisi_pmu *l3c_pmu) { + struct hisi_l3c_pmu *hisi_l3c_pmu =3D to_hisi_l3c_pmu(l3c_pmu); + unsigned long *used_mask =3D l3c_pmu->pmu_events.used_mask; + unsigned long used_cntr =3D find_first_bit(used_mask, l3c_pmu->num_counte= rs); u32 val; + int i; =20 /* - * Set perf_enable bit in L3C_PERF_CTRL register to start counting - * for all enabled counters. + * Check if any counter belongs to the normal range (instead of ext + * range). If so, enable it. */ - val =3D readl(l3c_pmu->base + L3C_PERF_CTRL); - val |=3D L3C_PERF_CTRL_EN; - writel(val, l3c_pmu->base + L3C_PERF_CTRL); + if (used_cntr < L3C_NR_COUNTERS) { + val =3D readl(l3c_pmu->base + L3C_PERF_CTRL); + val |=3D L3C_PERF_CTRL_EN; + writel(val, l3c_pmu->base + L3C_PERF_CTRL); + } + + /* If not, do enable it on ext ranges. */ + for (i =3D 0; i < hisi_l3c_pmu->ext_num; i++) { + /* Find used counter in this ext range, skip the range if not. */ + used_cntr =3D find_next_bit(used_mask, L3C_CNTR_EXT_H(i), L3C_CNTR_EXT_L= (i)); + if (used_cntr >=3D L3C_CNTR_EXT_H(i)) + continue; + + val =3D readl(hisi_l3c_pmu->ext_base[i] + L3C_PERF_CTRL); + val |=3D L3C_PERF_CTRL_EN; + writel(val, hisi_l3c_pmu->ext_base[i] + L3C_PERF_CTRL); + } } =20 static void hisi_l3c_pmu_stop_counters(struct hisi_pmu *l3c_pmu) { + struct hisi_l3c_pmu *hisi_l3c_pmu =3D to_hisi_l3c_pmu(l3c_pmu); + unsigned long *used_mask =3D l3c_pmu->pmu_events.used_mask; + unsigned long used_cntr =3D find_first_bit(used_mask, l3c_pmu->num_counte= rs); u32 val; + int i; =20 /* - * Clear perf_enable bit in L3C_PERF_CTRL register to stop counting - * for all enabled counters. + * Check if any counter belongs to the normal range (instead of ext + * range). If so, stop it. */ - val =3D readl(l3c_pmu->base + L3C_PERF_CTRL); - val &=3D ~(L3C_PERF_CTRL_EN); - writel(val, l3c_pmu->base + L3C_PERF_CTRL); + if (used_cntr < L3C_NR_COUNTERS) { + val =3D readl(l3c_pmu->base + L3C_PERF_CTRL); + val &=3D ~L3C_PERF_CTRL_EN; + writel(val, l3c_pmu->base + L3C_PERF_CTRL); + } + + /* If not, do stop it on ext ranges. */ + for (i =3D 0; i < hisi_l3c_pmu->ext_num; i++) { + /* Find used counter in this ext range, skip the range if not. */ + used_cntr =3D find_next_bit(used_mask, L3C_CNTR_EXT_H(i), L3C_CNTR_EXT_L= (i)); + if (used_cntr >=3D L3C_CNTR_EXT_H(i)) + continue; + + val =3D readl(hisi_l3c_pmu->ext_base[i] + L3C_PERF_CTRL); + val &=3D ~L3C_PERF_CTRL_EN; + writel(val, hisi_l3c_pmu->ext_base[i] + L3C_PERF_CTRL); + } } =20 static void hisi_l3c_pmu_enable_counter(struct hisi_pmu *l3c_pmu, @@ -341,7 +449,7 @@ static void hisi_l3c_pmu_enable_counter(struct hisi_pmu= *l3c_pmu, =20 /* Enable counter index in L3C_EVENT_CTRL register */ val =3D hisi_l3c_pmu_event_readl(hwc, L3C_EVENT_CTRL); - val |=3D (1 << hwc->idx); + val |=3D 1 << L3C_HW_IDX(hwc->idx); hisi_l3c_pmu_event_writel(hwc, L3C_EVENT_CTRL, val); } =20 @@ -352,7 +460,7 @@ static void hisi_l3c_pmu_disable_counter(struct hisi_pm= u *l3c_pmu, =20 /* Clear counter index in L3C_EVENT_CTRL register */ val =3D hisi_l3c_pmu_event_readl(hwc, L3C_EVENT_CTRL); - val &=3D ~(1 << hwc->idx); + val &=3D ~(1 << L3C_HW_IDX(hwc->idx)); hisi_l3c_pmu_event_writel(hwc, L3C_EVENT_CTRL, val); } =20 @@ -363,7 +471,7 @@ static void hisi_l3c_pmu_enable_counter_int(struct hisi= _pmu *l3c_pmu, =20 val =3D hisi_l3c_pmu_event_readl(hwc, L3C_INT_MASK); /* Write 0 to enable interrupt */ - val &=3D ~(1 << hwc->idx); + val &=3D ~(1 << L3C_HW_IDX(hwc->idx)); hisi_l3c_pmu_event_writel(hwc, L3C_INT_MASK, val); } =20 @@ -374,20 +482,34 @@ static void hisi_l3c_pmu_disable_counter_int(struct h= isi_pmu *l3c_pmu, =20 val =3D hisi_l3c_pmu_event_readl(hwc, L3C_INT_MASK); /* Write 1 to mask interrupt */ - val |=3D (1 << hwc->idx); + val |=3D 1 << L3C_HW_IDX(hwc->idx); hisi_l3c_pmu_event_writel(hwc, L3C_INT_MASK, val); } =20 static u32 hisi_l3c_pmu_get_int_status(struct hisi_pmu *l3c_pmu) { - return readl(l3c_pmu->base + L3C_INT_STATUS); + struct hisi_l3c_pmu *hisi_l3c_pmu =3D to_hisi_l3c_pmu(l3c_pmu); + u32 ext_int, status, status_ext =3D 0; + int i; + + status =3D readl(l3c_pmu->base + L3C_INT_STATUS); + + if (!support_ext(hisi_l3c_pmu)) + return status; + + for (i =3D 0; i < hisi_l3c_pmu->ext_num; i++) { + ext_int =3D readl(hisi_l3c_pmu->ext_base[i] + L3C_INT_STATUS); + status_ext |=3D ext_int << (L3C_NR_COUNTERS * i); + } + + return status | (status_ext << L3C_NR_COUNTERS); } =20 static void hisi_l3c_pmu_clear_int_status(struct hisi_pmu *l3c_pmu, int id= x) { struct hw_perf_event *hwc =3D &l3c_pmu->pmu_events.hw_events[idx]->hw; =20 - hisi_l3c_pmu_event_writel(hwc, L3C_INT_CLEAR, 1 << idx); + hisi_l3c_pmu_event_writel(hwc, L3C_INT_CLEAR, 1 << L3C_HW_IDX(idx)); } =20 static int hisi_l3c_pmu_init_data(struct platform_device *pdev, @@ -424,6 +546,50 @@ static int hisi_l3c_pmu_init_data(struct platform_devi= ce *pdev, return 0; } =20 +static int hisi_l3c_pmu_init_ext(struct hisi_pmu *l3c_pmu, struct platform= _device *pdev) +{ + struct hisi_l3c_pmu *hisi_l3c_pmu =3D to_hisi_l3c_pmu(l3c_pmu); + int ret, irq, ext_num, i; + char *irqname; + + /* HiSilicon L3C PMU supporting ext should have more than 1 irq resources= . */ + ext_num =3D platform_irq_count(pdev); + if (ext_num < L3C_MAX_EXT) + return -ENODEV; + + /* + * The number of ext supported equals the number of irq - 1, since one + * of the irqs belongs to the normal part of PMU. + */ + hisi_l3c_pmu->ext_num =3D ext_num - 1; + + for (i =3D 0; i < hisi_l3c_pmu->ext_num; i++) { + hisi_l3c_pmu->ext_base[i] =3D devm_platform_ioremap_resource(pdev, i + 1= ); + if (IS_ERR(hisi_l3c_pmu->ext_base[i])) + return PTR_ERR(hisi_l3c_pmu->ext_base[i]); + + irq =3D platform_get_irq(pdev, i + 1); + if (irq < 0) + return irq; + + irqname =3D devm_kasprintf(&pdev->dev, GFP_KERNEL, "%s ext%d", + dev_name(&pdev->dev), i + 1); + if (!irqname) + return -ENOMEM; + + ret =3D devm_request_irq(&pdev->dev, irq, hisi_uncore_pmu_isr, + IRQF_NOBALANCING | IRQF_NO_THREAD, + irqname, l3c_pmu); + if (ret < 0) + return dev_err_probe(&pdev->dev, ret, + "Fail to request EXT IRQ: %d.\n", irq); + + hisi_l3c_pmu->ext_irq[i] =3D irq; + } + + return 0; +} + static struct attribute *hisi_l3c_pmu_v1_format_attr[] =3D { HISI_PMU_FORMAT_ATTR(event, "config:0-7"), NULL, @@ -448,6 +614,19 @@ static const struct attribute_group hisi_l3c_pmu_v2_fo= rmat_group =3D { .attrs =3D hisi_l3c_pmu_v2_format_attr, }; =20 +static struct attribute *hisi_l3c_pmu_v3_format_attr[] =3D { + HISI_PMU_FORMAT_ATTR(event, "config:0-7"), + HISI_PMU_FORMAT_ATTR(ext, "config:16-17"), + HISI_PMU_FORMAT_ATTR(tt_req, "config1:8-10"), + HISI_PMU_FORMAT_ATTR(tt_core, "config2:0-15"), + NULL +}; + +static const struct attribute_group hisi_l3c_pmu_v3_format_group =3D { + .name =3D "format", + .attrs =3D hisi_l3c_pmu_v3_format_attr, +}; + static struct attribute *hisi_l3c_pmu_v1_events_attr[] =3D { HISI_PMU_EVENT_ATTR(rd_cpipe, 0x00), HISI_PMU_EVENT_ATTR(wr_cpipe, 0x01), @@ -483,6 +662,26 @@ static const struct attribute_group hisi_l3c_pmu_v2_ev= ents_group =3D { .attrs =3D hisi_l3c_pmu_v2_events_attr, }; =20 +static struct attribute *hisi_l3c_pmu_v3_events_attr[] =3D { + HISI_PMU_EVENT_ATTR(rd_spipe, 0x18), + HISI_PMU_EVENT_ATTR(rd_hit_spipe, 0x19), + HISI_PMU_EVENT_ATTR(wr_spipe, 0x1a), + HISI_PMU_EVENT_ATTR(wr_hit_spipe, 0x1b), + HISI_PMU_EVENT_ATTR(io_rd_spipe, 0x1c), + HISI_PMU_EVENT_ATTR(io_rd_hit_spipe, 0x1d), + HISI_PMU_EVENT_ATTR(io_wr_spipe, 0x1e), + HISI_PMU_EVENT_ATTR(io_wr_hit_spipe, 0x1f), + HISI_PMU_EVENT_ATTR(cycles, 0x7f), + HISI_PMU_EVENT_ATTR(l3c_ref, 0xbc), + HISI_PMU_EVENT_ATTR(l3c2ring, 0xbd), + NULL +}; + +static const struct attribute_group hisi_l3c_pmu_v3_events_group =3D { + .name =3D "events", + .attrs =3D hisi_l3c_pmu_v3_events_attr, +}; + static const struct attribute_group *hisi_l3c_pmu_v1_attr_groups[] =3D { &hisi_l3c_pmu_v1_format_group, &hisi_l3c_pmu_v1_events_group, @@ -499,16 +698,41 @@ static const struct attribute_group *hisi_l3c_pmu_v2_= attr_groups[] =3D { NULL }; =20 +static const struct attribute_group *hisi_l3c_pmu_v3_attr_groups[] =3D { + &hisi_l3c_pmu_v3_format_group, + &hisi_l3c_pmu_v3_events_group, + &hisi_pmu_cpumask_attr_group, + &hisi_pmu_identifier_group, + NULL +}; + +static struct hisi_l3c_pmu_ext hisi_l3c_pmu_support_ext =3D { + .support_ext =3D true, +}; + +static struct hisi_l3c_pmu_ext hisi_l3c_pmu_not_support_ext =3D { + .support_ext =3D false, +}; + static const struct hisi_pmu_dev_info hisi_l3c_pmu_v1 =3D { .attr_groups =3D hisi_l3c_pmu_v1_attr_groups, .counter_bits =3D 48, .check_event =3D L3C_V1_NR_EVENTS, + .private =3D &hisi_l3c_pmu_not_support_ext, }; =20 static const struct hisi_pmu_dev_info hisi_l3c_pmu_v2 =3D { .attr_groups =3D hisi_l3c_pmu_v2_attr_groups, .counter_bits =3D 64, .check_event =3D L3C_V2_NR_EVENTS, + .private =3D &hisi_l3c_pmu_not_support_ext, +}; + +static const struct hisi_pmu_dev_info hisi_l3c_pmu_v3 =3D { + .attr_groups =3D hisi_l3c_pmu_v3_attr_groups, + .counter_bits =3D 64, + .check_event =3D L3C_V2_NR_EVENTS, + .private =3D &hisi_l3c_pmu_support_ext, }; =20 static const struct hisi_uncore_ops hisi_uncore_l3c_ops =3D { @@ -526,11 +750,14 @@ static const struct hisi_uncore_ops hisi_uncore_l3c_o= ps =3D { .clear_int_status =3D hisi_l3c_pmu_clear_int_status, .enable_filter =3D hisi_l3c_pmu_enable_filter, .disable_filter =3D hisi_l3c_pmu_disable_filter, + .check_filter =3D hisi_l3c_pmu_check_filter, }; =20 static int hisi_l3c_pmu_dev_probe(struct platform_device *pdev, struct hisi_pmu *l3c_pmu) { + struct hisi_l3c_pmu *hisi_l3c_pmu =3D to_hisi_l3c_pmu(l3c_pmu); + struct hisi_l3c_pmu_ext *l3c_pmu_dev_ext; int ret; =20 ret =3D hisi_l3c_pmu_init_data(pdev, l3c_pmu); @@ -549,27 +776,47 @@ static int hisi_l3c_pmu_dev_probe(struct platform_dev= ice *pdev, l3c_pmu->dev =3D &pdev->dev; l3c_pmu->on_cpu =3D -1; =20 + l3c_pmu_dev_ext =3D l3c_pmu->dev_info->private; + if (l3c_pmu_dev_ext->support_ext) { + ret =3D hisi_l3c_pmu_init_ext(l3c_pmu, pdev); + if (ret) + return ret; + /* + * The extension events have their own counters with the + * same number of the normal events counters. So we can + * have at maximum num_counters * ext events monitored. + */ + l3c_pmu->num_counters +=3D hisi_l3c_pmu->ext_num * L3C_NR_COUNTERS; + } + return 0; } =20 static int hisi_l3c_pmu_probe(struct platform_device *pdev) { + struct hisi_l3c_pmu *hisi_l3c_pmu; struct hisi_pmu *l3c_pmu; char *name; int ret; =20 - l3c_pmu =3D devm_kzalloc(&pdev->dev, sizeof(*l3c_pmu), GFP_KERNEL); - if (!l3c_pmu) + hisi_l3c_pmu =3D devm_kzalloc(&pdev->dev, sizeof(*hisi_l3c_pmu), GFP_KERN= EL); + if (!hisi_l3c_pmu) return -ENOMEM; =20 + l3c_pmu =3D &hisi_l3c_pmu->l3c_pmu; platform_set_drvdata(pdev, l3c_pmu); =20 ret =3D hisi_l3c_pmu_dev_probe(pdev, l3c_pmu); if (ret) return ret; =20 - name =3D devm_kasprintf(&pdev->dev, GFP_KERNEL, "hisi_sccl%d_l3c%d", - l3c_pmu->topo.sccl_id, l3c_pmu->topo.ccl_id); + if (l3c_pmu->topo.sub_id >=3D 0) + name =3D devm_kasprintf(&pdev->dev, GFP_KERNEL, "hisi_sccl%d_l3c%d_%d", + l3c_pmu->topo.sccl_id, l3c_pmu->topo.ccl_id, + l3c_pmu->topo.sub_id); + else + name =3D devm_kasprintf(&pdev->dev, GFP_KERNEL, "hisi_sccl%d_l3c%d", + l3c_pmu->topo.sccl_id, l3c_pmu->topo.ccl_id); if (!name) return -ENOMEM; =20 @@ -604,6 +851,7 @@ static void hisi_l3c_pmu_remove(struct platform_device = *pdev) static const struct acpi_device_id hisi_l3c_pmu_acpi_match[] =3D { { "HISI0213", (kernel_ulong_t)&hisi_l3c_pmu_v1 }, { "HISI0214", (kernel_ulong_t)&hisi_l3c_pmu_v2 }, + { "HISI0215", (kernel_ulong_t)&hisi_l3c_pmu_v3 }, {} }; MODULE_DEVICE_TABLE(acpi, hisi_l3c_pmu_acpi_match); @@ -618,14 +866,60 @@ static struct platform_driver hisi_l3c_pmu_driver =3D= { .remove =3D hisi_l3c_pmu_remove, }; =20 +static int hisi_l3c_pmu_online_cpu(unsigned int cpu, struct hlist_node *no= de) +{ + struct hisi_pmu *l3c_pmu =3D hlist_entry_safe(node, struct hisi_pmu, node= ); + struct hisi_l3c_pmu *hisi_l3c_pmu =3D to_hisi_l3c_pmu(l3c_pmu); + int ret, i; + + ret =3D hisi_uncore_pmu_online_cpu(cpu, node); + if (ret) + return ret; + + /* Avoid L3C pmu not supporting ext from ext irq migrating. */ + if (!support_ext(hisi_l3c_pmu)) + return 0; + + for (i =3D 0; i < hisi_l3c_pmu->ext_num; i++) + WARN_ON(irq_set_affinity(hisi_l3c_pmu->ext_irq[i], + cpumask_of(l3c_pmu->on_cpu))); + + return 0; +} + +static int hisi_l3c_pmu_offline_cpu(unsigned int cpu, struct hlist_node *n= ode) +{ + struct hisi_pmu *l3c_pmu =3D hlist_entry_safe(node, struct hisi_pmu, node= ); + struct hisi_l3c_pmu *hisi_l3c_pmu =3D to_hisi_l3c_pmu(l3c_pmu); + int ret, i; + + ret =3D hisi_uncore_pmu_offline_cpu(cpu, node); + if (ret) + return ret; + + /* If failed to find any available CPU, skip irq migration. */ + if (l3c_pmu->on_cpu < 0) + return 0; + + /* Avoid L3C pmu not supporting ext from ext irq migrating. */ + if (!support_ext(hisi_l3c_pmu)) + return 0; + + for (i =3D 0; i < hisi_l3c_pmu->ext_num; i++) + WARN_ON(irq_set_affinity(hisi_l3c_pmu->ext_irq[i], + cpumask_of(l3c_pmu->on_cpu))); + + return 0; +} + static int __init hisi_l3c_pmu_module_init(void) { int ret; =20 ret =3D cpuhp_setup_state_multi(CPUHP_AP_PERF_ARM_HISI_L3_ONLINE, "AP_PERF_ARM_HISI_L3_ONLINE", - hisi_uncore_pmu_online_cpu, - hisi_uncore_pmu_offline_cpu); + hisi_l3c_pmu_online_cpu, + hisi_l3c_pmu_offline_cpu); if (ret) { pr_err("L3C PMU: Error setup hotplug, ret =3D %d\n", ret); return ret; diff --git a/drivers/perf/hisilicon/hisi_uncore_pmu.h b/drivers/perf/hisili= con/hisi_uncore_pmu.h index 8649be6f716a..3ffe6acda653 100644 --- a/drivers/perf/hisilicon/hisi_uncore_pmu.h +++ b/drivers/perf/hisilicon/hisi_uncore_pmu.h @@ -24,7 +24,7 @@ #define pr_fmt(fmt) "hisi_pmu: " fmt =20 #define HISI_PMU_V2 0x30 -#define HISI_MAX_COUNTERS 0x10 +#define HISI_MAX_COUNTERS 0x18 #define to_hisi_pmu(p) (container_of(p, struct hisi_pmu, pmu)) =20 #define HISI_PMU_ATTR(_name, _func, _config) \ --=20 2.33.0