From: Mateja Marjanovic
To: qemu-devel@nongnu.org
Cc: arikalo@wavecomp.com, amarkovic@wavecomp.com, aurelien@aurel32.net
Date: Fri, 1 Mar 2019 14:08:38 +0100
Message-Id: <1551445720-19622-3-git-send-email-mateja.marjanovic@rt-rk.com>
In-Reply-To: <1551445720-19622-1-git-send-email-mateja.marjanovic@rt-rk.com>
References: <1551445720-19622-1-git-send-email-mateja.marjanovic@rt-rk.com>
Subject: [Qemu-devel] [PATCH v2 2/4] target/mips: Optimize support for MSA instructions ILVOD.

From: Mateja Marjanovic

Optimize support for MSA instructions ILVOD.B, ILVOD.H, ILVOD.W, and
ILVOD.D. Optimization is done by eliminating loops and explicitly
assigning the desired values to individual data elements.

Performance measurement is done by executing the instructions a large
number of times on a computer with Intel Core i7-3770 CPU @ 3.40GHz×8.

Measured time before optimization:

  ILVOD.B: 118.42 ms
  ILVOD.H:  93.45 ms
  ILVOD.W: 119.89 ms
  ILVOD.D:  46.00 ms

Measured time after optimization:

  ILVOD.B:  60.08 ms
  ILVOD.H:  41.52 ms
  ILVOD.W:  36.44 ms
  ILVOD.D:  36.40 ms

Signed-off-by: Mateja Marjanovic
---
 target/mips/msa_helper.c | 58 ++++++++++++++++++++++++++++++++++++++++++------
 1 file changed, 51 insertions(+), 7 deletions(-)

diff --git a/target/mips/msa_helper.c b/target/mips/msa_helper.c
index 49332e6..2426338 100644
--- a/target/mips/msa_helper.c
+++ b/target/mips/msa_helper.c
@@ -1198,13 +1198,6 @@ MSA_FN_DF(ilvl_df)
 MSA_FN_DF(ilvr_df)
 #undef MSA_DO
 
-#define MSA_DO(DF)                          \
-    do {                                    \
-        pwx->DF[2*i]   = pwt->DF[2*i+1];    \
-        pwx->DF[2*i+1] = pws->DF[2*i+1];    \
-    } while (0)
-MSA_FN_DF(ilvod_df)
-#undef MSA_DO
 #undef MSA_LOOP_COND
 
 #define MSA_LOOP_COND(DF) \
@@ -1274,6 +1267,57 @@ void helper_msa_ilvev_df(CPUMIPSState *env, uint32_t df, uint32_t wd,
     }
 }
 
+void helper_msa_ilvod_df(CPUMIPSState *env, uint32_t df, uint32_t wd,
+                         uint32_t ws, uint32_t wt)
+{
+    wr_t *pwd = &(env->active_fpu.fpr[wd].wr);
+    wr_t *pws = &(env->active_fpu.fpr[ws].wr);
+    wr_t *pwt = &(env->active_fpu.fpr[wt].wr);
+
+    switch (df) {
+    case DF_BYTE:
+        pwd->b[0]  = pwt->b[1];
+        pwd->b[1]  = pws->b[1];
+        pwd->b[2]  = pwt->b[3];
+        pwd->b[3]  = pws->b[3];
+        pwd->b[4]  = pwt->b[5];
+        pwd->b[5]  = pws->b[5];
+        pwd->b[6]  = pwt->b[7];
+        pwd->b[7]  = pws->b[7];
+        pwd->b[8]  = pwt->b[9];
+        pwd->b[9]  = pws->b[9];
+        pwd->b[10] = pwt->b[11];
+        pwd->b[11] = pws->b[11];
+        pwd->b[12] = pwt->b[13];
+        pwd->b[13] = pws->b[13];
+        pwd->b[14] = pwt->b[15];
+        pwd->b[15] = pws->b[15];
+        break;
+    case DF_HALF:
+        pwd->h[0] = pwt->h[1];
+        pwd->h[1] = pws->h[1];
+        pwd->h[2] = pwt->h[3];
+        pwd->h[3] = pws->h[3];
+        pwd->h[4] = pwt->h[5];
+        pwd->h[5] = pws->h[5];
+        pwd->h[6] = pwt->h[7];
+        pwd->h[7] = pws->h[7];
+        break;
+    case DF_WORD:
+        pwd->w[0] = pwt->w[1];
+        pwd->w[1] = pws->w[1];
+        pwd->w[2] = pwt->w[3];
+        pwd->w[3] = pws->w[3];
+        break;
+    case DF_DOUBLE:
+        pwd->d[0] = pwt->d[1];
+        pwd->d[1] = pws->d[1];
+        break;
+    default:
+        assert(0);
+    }
+}
+
 void helper_msa_sldi_df(CPUMIPSState *env, uint32_t df, uint32_t wd,
                         uint32_t ws, uint32_t n)
 {
-- 
2.7.4
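
Reviewer note (not part of the patch): the new helper writes straight into
the destination register, whereas the removed macro wrote through pwx. The
sketch below is a standalone check, for the word format only, that the two
forms agree even when wd aliases ws or wt. It assumes pwx pointed at a
temporary that was copied to the destination afterwards, which this patch
does not show. The type vec128_t and the two function names are hypothetical
stand-ins, not QEMU code.

```c
#include <assert.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical stand-in for QEMU's wr_t: one 128-bit MSA register viewed
 * as bytes, halfwords, words, or doublewords. */
typedef union {
    int8_t  b[16];
    int16_t h[8];
    int32_t w[4];
    int64_t d[2];
} vec128_t;

/* Loop form for word elements. Results go to a temporary first; that the
 * removed macro behaved this way is an assumption of this sketch. */
static void ilvod_w_loop(vec128_t *wd, const vec128_t *ws, const vec128_t *wt)
{
    vec128_t tmp;
    for (int i = 0; i < 2; i++) {
        tmp.w[2 * i]     = wt->w[2 * i + 1];  /* even slot: odd element of wt */
        tmp.w[2 * i + 1] = ws->w[2 * i + 1];  /* odd slot: odd element of ws */
    }
    *wd = tmp;
}

/* Unrolled form, mirroring the DF_WORD case of the patch: no temporary,
 * each destination element is assigned directly. */
static void ilvod_w_unrolled(vec128_t *wd, const vec128_t *ws,
                             const vec128_t *wt)
{
    wd->w[0] = wt->w[1];
    wd->w[1] = ws->w[1];
    wd->w[2] = wt->w[3];
    wd->w[3] = ws->w[3];
}
```

In-place use looks safe in this ordering because only odd source elements
are ever read, and each odd destination element is written after the
corresponding odd element of wt has been consumed. The same argument should
carry over to the byte, halfword, and doubleword cases, though only the word
case is exercised here.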