From nobody Tue Nov 26 04:36:34 2024 Delivered-To: importer@patchew.org Authentication-Results: mx.zohomail.com; dkim=pass; spf=pass (zohomail.com: domain of gnu.org designates 209.51.188.17 as permitted sender) smtp.mailfrom=qemu-devel-bounces+importer=patchew.org@nongnu.org; dmarc=pass(p=none dis=none) header.from=linux.alibaba.com ARC-Seal: i=1; a=rsa-sha256; t=1709730232; cv=none; d=zohomail.com; s=zohoarc; b=PfYqvG2JnkCCO/+9sFcBTQc2mhBPb2wIdb8uHK1B3NPZcV+SZl0HoD9KT4GmwSEktDReXlV2IyTqGk4SHwfxbYIf467HMK1FlOiyvLlz+Hv7w/22c0+uuuLeYp/U5ySjS7pq0LS5ofcwBswFLBeTbHrVT8Nbs+I+GnRwbqDNj0o= ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=zohomail.com; s=zohoarc; t=1709730232; h=Content-Transfer-Encoding:Cc:Cc:Date:Date:From:From:List-Subscribe:List-Post:List-Id:List-Archive:List-Help:List-Unsubscribe:MIME-Version:Message-ID:Sender:Subject:Subject:To:To:Message-Id:Reply-To; bh=XRTjDwnWYw3xObuy6D1KWY71Kr1BmmZ+OPqApDe0go0=; b=BAe+K3cUD2F4YdMmz3cnqcDeHsRRmKG+2ewUQ6KODg8bSqG5CWZATYM6N1R5dcZFMmNRYuzKf1g3z8iZIKnUsx2+TohkiWBb/MKi217kpOwlyThi2juGj2SZGO8xC+05RQin++lnIkqpV0ELCl8pB9cGWjNQIInFQ8TQwTYyNM4= ARC-Authentication-Results: i=1; mx.zohomail.com; dkim=pass; spf=pass (zohomail.com: domain of gnu.org designates 209.51.188.17 as permitted sender) smtp.mailfrom=qemu-devel-bounces+importer=patchew.org@nongnu.org; dmarc=pass header.from= (p=none dis=none) Return-Path: Received: from lists.gnu.org (lists.gnu.org [209.51.188.17]) by mx.zohomail.com with SMTPS id 1709730232831463.80491746391203; Wed, 6 Mar 2024 05:03:52 -0800 (PST) Received: from localhost ([::1] helo=lists1p.gnu.org) by lists.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1rhqvb-000183-Vo; Wed, 06 Mar 2024 08:03:24 -0500 Received: from eggs.gnu.org ([2001:470:142:3::10]) by lists.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1rhnef-000631-CO; Wed, 06 Mar 2024 04:33:41 -0500 Received: from out30-124.freemail.mail.aliyun.com ([115.124.30.124]) by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1rhnea-0008HS-Qi; Wed, 06 Mar 2024 04:33:41 -0500 Received: from localhost.localdomain(mailfrom:eric.huang@linux.alibaba.com fp:SMTPD_---0W1xFe7I_1709717605) by smtp.aliyun-inc.com; Wed, 06 Mar 2024 17:33:26 +0800 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=linux.alibaba.com; s=default; t=1709717607; h=From:To:Subject:Date:Message-ID:MIME-Version; bh=XRTjDwnWYw3xObuy6D1KWY71Kr1BmmZ+OPqApDe0go0=; b=pVYBPEcxJfgmChVABEXtc8/iUEckGvQbPoVcc/sHWAUkq9pmbWpDECYMojqIOC1HVpm5Q2XSAwirmpt1EyFJgC8VNrmXpjSlrPfveWbEh6s2SaxisWUZT3AAa5hv7UQ6WAKsAW+kt5vSxi14XI2qO1XhtvKY+rv23s6A1Z6UND4= X-Alimail-AntiSpam: AC=PASS; BC=-1|-1; BR=01201311R131e4; CH=green; DM=||false|; DS=||; FP=0|-1|-1|-1|0|-1|-1|-1; HT=ay29a033018046050; MF=eric.huang@linux.alibaba.com; NM=1; PH=DS; RN=10; SR=0; TI=SMTPD_---0W1xFe7I_1709717605; From: Huang Tao To: qemu-devel@nongnu.org Cc: qemu-riscv@nongnu.org, zhiwei_liu@linux.alibaba.com, dbarboza@ventanamicro.com, liwei1518@gmail.com, bin.meng@windriver.com, alistair.francis@wdc.com, palmer@dabbelt.com, Huang Tao , Christoph Muellner Subject: [PATCH] target/riscv: Implement dynamic establishment of custom decoder Date: Wed, 6 Mar 2024 17:33:08 +0800 Message-ID: <20240306093308.35188-1-eric.huang@linux.alibaba.com> X-Mailer: git-send-email 2.41.0 MIME-Version: 1.0 Content-Transfer-Encoding: quoted-printable Received-SPF: pass (zohomail.com: domain of gnu.org designates 209.51.188.17 as permitted sender) client-ip=209.51.188.17; envelope-from=qemu-devel-bounces+importer=patchew.org@nongnu.org; helo=lists.gnu.org; Received-SPF: pass client-ip=115.124.30.124; envelope-from=eric.huang@linux.alibaba.com; helo=out30-124.freemail.mail.aliyun.com X-Spam_score_int: -174 X-Spam_score: -17.5 X-Spam_bar: ----------------- X-Spam_report: (-17.5 / 5.0 requ) BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, ENV_AND_HDR_SPF_MATCH=-0.5, RCVD_IN_DNSWL_NONE=-0.0001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, T_SCC_BODY_TEXT_LINE=-0.01, UNPARSEABLE_RELAY=0.001, USER_IN_DEF_DKIM_WL=-7.5, USER_IN_DEF_SPF_WL=-7.5 autolearn=ham autolearn_force=no X-Spam_action: no action X-Mailman-Approved-At: Wed, 06 Mar 2024 08:03:17 -0500 X-BeenThere: qemu-devel@nongnu.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: qemu-devel-bounces+importer=patchew.org@nongnu.org Sender: qemu-devel-bounces+importer=patchew.org@nongnu.org X-ZohoMail-DKIM: pass (identity @linux.alibaba.com) X-ZM-MESSAGEID: 1709730234193100004 Content-Type: text/plain; charset="utf-8" In this patch, we modify the decoder to be a freely composable data structure instead of a hardcoded one. It can be dynamically builded up according to the extensions. This approach has several benefits: 1. Provides support for heterogeneous cpu architectures. As we add decoder = in CPUArchState, each cpu can have their own decoder, and the decoders can = be different due to cpu's features. 2. Improve the decoding efficiency. We run the guard_func to see if the dec= oder can be added to the dynamic_decoder when building up the decoder. Theref= ore, there is no need to run the guard_func when decoding each instruction. I= t can improve the decoding efficiency 3. For vendor or dynamic cpus, it allows them to customize their own decoder functions to improve decoding efficiency, especially when vendor-defined instruction sets increase. Because of dynamic building up, it can skip t= he other decoder guard functions when decoding. Signed-off-by: Huang Tao Suggested-by: Christoph Muellner Co-authored-by: LIU Zhiwei --- target/riscv/cpu.c | 20 ++++++++++++++++++++ target/riscv/cpu.h | 2 ++ target/riscv/cpu_decoder.h | 34 ++++++++++++++++++++++++++++++++++ target/riscv/translate.c | 28 ++++++++++++---------------- 4 files changed, 68 insertions(+), 16 deletions(-) create mode 100644 target/riscv/cpu_decoder.h diff --git a/target/riscv/cpu.c b/target/riscv/cpu.c index 5ff0192c52..cadcd51b4f 100644 --- a/target/riscv/cpu.c +++ b/target/riscv/cpu.c @@ -38,6 +38,7 @@ #include "kvm/kvm_riscv.h" #include "tcg/tcg-cpu.h" #include "tcg/tcg.h" +#include "cpu_decoder.h" =20 /* RISC-V CPU definitions */ static const char riscv_single_letter_exts[] =3D "IEMAFDQCBPVH"; @@ -1102,6 +1103,23 @@ static void riscv_cpu_satp_mode_finalize(RISCVCPU *c= pu, Error **errp) } #endif =20 +static void riscv_cpu_finalize_dynamic_decoder(RISCVCPU *cpu) +{ + CPURISCVState *env =3D &cpu->env; + decode_fn *dynamic_decoder; + dynamic_decoder =3D g_new0(decode_fn, decoder_table_size); + int j =3D 0; + for (size_t i =3D 0; i < decoder_table_size; ++i) { + if (decoder_table[i].guard_func && + decoder_table[i].guard_func(&cpu->cfg)) { + dynamic_decoder[j] =3D decoder_table[i].decode_fn; + j++; + } + } + + env->decoder =3D dynamic_decoder; +} + void riscv_cpu_finalize_features(RISCVCPU *cpu, Error **errp) { Error *local_err =3D NULL; @@ -1127,6 +1145,8 @@ void riscv_cpu_finalize_features(RISCVCPU *cpu, Error= **errp) return; } } + + riscv_cpu_finalize_dynamic_decoder(cpu); } =20 static void riscv_cpu_realize(DeviceState *dev, Error **errp) diff --git a/target/riscv/cpu.h b/target/riscv/cpu.h index 5d291a7092..42bfed065c 100644 --- a/target/riscv/cpu.h +++ b/target/riscv/cpu.h @@ -30,6 +30,7 @@ #include "qemu/int128.h" #include "cpu_bits.h" #include "cpu_cfg.h" +#include "cpu_decoder.h" #include "qapi/qapi-types-common.h" #include "cpu-qom.h" =20 @@ -433,6 +434,7 @@ struct CPUArchState { uint64_t kvm_timer_state; uint64_t kvm_timer_frequency; #endif /* CONFIG_KVM */ + const decode_fn *decoder; }; =20 /* diff --git a/target/riscv/cpu_decoder.h b/target/riscv/cpu_decoder.h new file mode 100644 index 0000000000..549414ce4c --- /dev/null +++ b/target/riscv/cpu_decoder.h @@ -0,0 +1,34 @@ +/* + * QEMU RISC-V CPU Decoder + * + * Copyright (c) 2023-2024 Alibaba Group + * + * This program is free software; you can redistribute it and/or modify it + * under the terms and conditions of the GNU General Public License, + * version 2 or later, as published by the Free Software Foundation. + * + * This program is distributed in the hope it will be useful, but WITHOUT + * ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or + * FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License f= or + * more details. + * + * You should have received a copy of the GNU General Public License along= with + * this program. If not, see . + */ + +#ifndef RISCV_CPU_DECODER_H +#define RISCV_CPU_DECODER_H + +struct DisasContext; +struct RISCVCPUConfig; +typedef struct RISCVDecoder { + bool (*guard_func)(const struct RISCVCPUConfig *); + bool (*decode_fn)(struct DisasContext *, uint32_t); +} RISCVDecoder; + +typedef bool (*decode_fn)(struct DisasContext *, uint32_t); + +extern const size_t decoder_table_size; + +extern const RISCVDecoder decoder_table[]; +#endif diff --git a/target/riscv/translate.c b/target/riscv/translate.c index 177418b2b9..23c5321bce 100644 --- a/target/riscv/translate.c +++ b/target/riscv/translate.c @@ -115,6 +115,7 @@ typedef struct DisasContext { bool frm_valid; /* TCG of the current insn_start */ TCGOp *insn_start; + const decode_fn *decoder; } DisasContext; =20 static inline bool has_ext(DisasContext *ctx, uint32_t ext) @@ -1118,21 +1119,16 @@ static inline int insn_len(uint16_t first_word) return (first_word & 3) =3D=3D 3 ? 4 : 2; } =20 +const RISCVDecoder decoder_table[] =3D { + { always_true_p, decode_insn32 }, + { has_xthead_p, decode_xthead}, + { has_XVentanaCondOps_p, decode_XVentanaCodeOps}, +}; + +const size_t decoder_table_size =3D ARRAY_SIZE(decoder_table); + static void decode_opc(CPURISCVState *env, DisasContext *ctx, uint16_t opc= ode) { - /* - * A table with predicate (i.e., guard) functions and decoder functions - * that are tested in-order until a decoder matches onto the opcode. - */ - static const struct { - bool (*guard_func)(const RISCVCPUConfig *); - bool (*decode_func)(DisasContext *, uint32_t); - } decoders[] =3D { - { always_true_p, decode_insn32 }, - { has_xthead_p, decode_xthead }, - { has_XVentanaCondOps_p, decode_XVentanaCodeOps }, - }; - ctx->virt_inst_excp =3D false; ctx->cur_insn_len =3D insn_len(opcode); /* Check for compressed insn */ @@ -1153,9 +1149,8 @@ static void decode_opc(CPURISCVState *env, DisasConte= xt *ctx, uint16_t opcode) ctx->base.pc_next + 2)); ctx->opcode =3D opcode32; =20 - for (size_t i =3D 0; i < ARRAY_SIZE(decoders); ++i) { - if (decoders[i].guard_func(ctx->cfg_ptr) && - decoders[i].decode_func(ctx, opcode32)) { + for (size_t i =3D 0; i < decoder_table_size; ++i) { + if (ctx->decoder[i](ctx, opcode32)) { return; } } @@ -1199,6 +1194,7 @@ static void riscv_tr_init_disas_context(DisasContextB= ase *dcbase, CPUState *cs) ctx->itrigger =3D FIELD_EX32(tb_flags, TB_FLAGS, ITRIGGER); ctx->zero =3D tcg_constant_tl(0); ctx->virt_inst_excp =3D false; + ctx->decoder =3D env->decoder; } =20 static void riscv_tr_tb_start(DisasContextBase *db, CPUState *cpu) --=20 2.41.0