1
Just my fp16 work, plus some small stuff for the sbsa-ref board;
1
The following changes since commit 7e7eb9f852a46b51a71ae9d82590b2e4d28827ee:
2
but my rule of thumb is to send a pullreq once I get over about
3
30 patches...
4
2
5
-- PMM
3
Merge remote-tracking branch 'remotes/armbru/tags/pull-qapi-2021-01-28' into staging (2021-01-28 22:43:18 +0000)
6
7
The following changes since commit 2f4c51c0f384d7888a04b4815861e6d5fd244d75:
8
9
Merge remote-tracking branch 'remotes/kraxel/tags/usb-20200831-pull-request' into staging (2020-08-31 19:39:13 +0100)
10
4
11
are available in the Git repository at:
5
are available in the Git repository at:
12
6
13
https://git.linaro.org/people/pmaydell/qemu-arm.git tags/pull-target-arm-20200901
7
https://git.linaro.org/people/pmaydell/qemu-arm.git tags/pull-target-arm-20210129
14
8
15
for you to fetch changes up to 3f462bf0f6ea6382dd1502d4eb1fcd33c8e774f5:
9
for you to fetch changes up to 11749122e1a86866591306d43603d2795a3dea1a:
16
10
17
hw/arm/sbsa-ref : Add embedded controller in secure memory (2020-09-01 14:01:34 +0100)
11
hw/arm/stellaris: Remove board-creation reset of STELLARIS_SYS (2021-01-29 10:47:29 +0000)
18
12
19
----------------------------------------------------------------
13
----------------------------------------------------------------
20
target-arm queue:
14
target-arm queue:
21
* Implement fp16 support for AArch32 VFP and Neon
15
* Implement ID_PFR2
22
* hw/arm/sbsa-ref: add "reg" property to DT cpu nodes
16
* Conditionalize DBGDIDR
23
* hw/arm/sbsa-ref : Add embedded controller in secure memory
17
* rename xlnx-zcu102.canbusN properties
18
* provide powerdown/reset mechanism for secure firmware on 'virt' board
19
* hw/misc: Fix arith overflow in NPCM7XX PWM module
20
* target/arm: Replace magic value by MMU_DATA_LOAD definition
21
* configure: fix preadv errors on Catalina macOS with new XCode
22
* Various configure and other cleanups in preparation for iOS support
23
* hvf: Add hypervisor entitlement to output binaries (needed for Big Sur)
24
* Implement pvpanic-pci device
25
* Convert the CMSDK timer devices to the Clock framework
24
26
25
----------------------------------------------------------------
27
----------------------------------------------------------------
26
Graeme Gregory (2):
28
Alexander Graf (1):
27
hw/misc/sbsa_ec : Add an embedded controller for sbsa-ref
29
hvf: Add hypervisor entitlement to output binaries
28
hw/arm/sbsa-ref : Add embedded controller in secure memory
29
30
30
Leif Lindholm (1):
31
Hao Wu (1):
31
hw/arm/sbsa-ref: add "reg" property to DT cpu nodes
32
hw/misc: Fix arith overflow in NPCM7XX PWM module
32
33
33
Peter Maydell (44):
34
Joelle van Dyne (7):
34
target/arm: Remove local definitions of float constants
35
configure: cross-compiling with empty cross_prefix
35
target/arm: Use correct ID register check for aa32_fp16_arith
36
osdep: build with non-working system() function
36
target/arm: Implement VFP fp16 for VFP_BINOP operations
37
darwin: remove redundant dependency declaration
37
target/arm: Implement VFP fp16 VMLA, VMLS, VNMLS, VNMLA, VNMUL
38
darwin: fix cross-compiling for Darwin
38
target/arm: Macroify trans functions for VFMA, VFMS, VFNMA, VFNMS
39
configure: cross compile should use x86_64 cpu_family
39
target/arm: Implement VFP fp16 for fused-multiply-add
40
darwin: detect CoreAudio for build
40
target/arm: Macroify uses of do_vfp_2op_sp() and do_vfp_2op_dp()
41
darwin: remove 64-bit build detection on 32-bit OS
41
target/arm: Implement VFP fp16 for VABS, VNEG, VSQRT
42
target/arm: Implement VFP fp16 for VMOV immediate
43
target/arm: Implement VFP fp16 VCMP
44
target/arm: Implement VFP fp16 VLDR and VSTR
45
target/arm: Implement VFP fp16 VCVT between float and integer
46
target/arm: Make VFP_CONV_FIX macros take separate float type and float size
47
target/arm: Use macros instead of open-coding fp16 conversion helpers
48
target/arm: Implement VFP fp16 VCVT between float and fixed-point
49
target/arm: Implement VFP vp16 VCVT-with-specified-rounding-mode
50
target/arm: Implement VFP fp16 VSEL
51
target/arm: Implement VFP fp16 VRINT*
52
target/arm: Implement new VFP fp16 insn VINS
53
target/arm: Implement new VFP fp16 insn VMOVX
54
target/arm: Implement VFP fp16 VMOV between gp and halfprec registers
55
target/arm: Implement FP16 for Neon VADD, VSUB, VABD, VMUL
56
target/arm: Implement fp16 for Neon VRECPE, VRSQRTE using gvec
57
target/arm: Implement fp16 for Neon VABS, VNEG of floats
58
target/arm: Implement fp16 for VCEQ, VCGE, VCGT comparisons
59
target/arm: Implement fp16 for VACGE, VACGT
60
target/arm: Implement fp16 for Neon VMAX, VMIN
61
target/arm: Implement fp16 for Neon VMAXNM, VMINNM
62
target/arm: Implement fp16 for Neon VMLA, VMLS operations
63
target/arm: Implement fp16 for Neon VFMA, VMFS
64
target/arm: Implement fp16 for Neon fp compare-vs-0
65
target/arm: Implement fp16 for Neon VRECPS
66
target/arm: Implement fp16 for Neon VRSQRTS
67
target/arm: Implement fp16 for Neon pairwise fp ops
68
target/arm: Implement fp16 for Neon float-integer VCVT
69
target/arm: Convert Neon VCVT fixed-point to gvec
70
target/arm: Implement fp16 for Neon VCVT fixed-point
71
target/arm: Implement fp16 for Neon VCVT with rounding modes
72
target/arm: Implement fp16 for Neon VRINT-with-specified-rounding-mode
73
target/arm: Implement fp16 for Neon VRINTX
74
target/arm/vec_helper: Handle oprsz less than 16 bytes in indexed operations
75
target/arm/vec_helper: Add gvec fp indexed multiply-and-add operations
76
target/arm: Implement fp16 for Neon VMUL, VMLA, VMLS
77
target/arm: Enable FP16 in '-cpu max'
78
42
79
target/arm/cpu.h | 7 +-
43
Maxim Uvarov (3):
80
target/arm/helper.h | 133 ++++++-
44
hw: gpio: implement gpio-pwr driver for qemu reset/poweroff
81
target/arm/neon-dp.decode | 8 +-
45
arm-virt: refactor gpios creation
82
target/arm/vfp-uncond.decode | 27 +-
46
arm-virt: add secure pl061 for reset/power down
83
target/arm/vfp.decode | 34 +-
84
hw/arm/sbsa-ref.c | 43 ++-
85
hw/misc/sbsa_ec.c | 98 +++++
86
target/arm/cpu.c | 3 +-
87
target/arm/cpu64.c | 10 +-
88
target/arm/helper-a64.c | 11 -
89
target/arm/translate-sve.c | 4 -
90
target/arm/vec_helper.c | 431 ++++++++++++++++++++-
91
target/arm/vfp_helper.c | 244 +++++-------
92
hw/misc/meson.build | 2 +
93
target/arm/translate-neon.c.inc | 755 +++++++++++++------------------------
94
target/arm/translate-vfp.c.inc | 810 ++++++++++++++++++++++++++++++++++++----
95
16 files changed, 1819 insertions(+), 801 deletions(-)
96
create mode 100644 hw/misc/sbsa_ec.c
97
47
48
Mihai Carabas (4):
49
hw/misc/pvpanic: split-out generic and bus dependent code
50
hw/misc/pvpanic: add PCI interface support
51
pvpanic : update pvpanic spec document
52
tests/qtest: add a test case for pvpanic-pci
53
54
Paolo Bonzini (1):
55
arm: rename xlnx-zcu102.canbusN properties
56
57
Peter Maydell (26):
58
configure: Move preadv check to meson.build
59
ptimer: Add new ptimer_set_period_from_clock() function
60
clock: Add new clock_has_source() function
61
tests: Add a simple test of the CMSDK APB timer
62
tests: Add a simple test of the CMSDK APB watchdog
63
tests: Add a simple test of the CMSDK APB dual timer
64
hw/timer/cmsdk-apb-timer: Rename CMSDKAPBTIMER struct to CMSDKAPBTimer
65
hw/timer/cmsdk-apb-timer: Add Clock input
66
hw/timer/cmsdk-apb-dualtimer: Add Clock input
67
hw/watchdog/cmsdk-apb-watchdog: Add Clock input
68
hw/arm/armsse: Rename "MAINCLK" property to "MAINCLK_FRQ"
69
hw/arm/armsse: Wire up clocks
70
hw/arm/mps2: Inline CMSDK_APB_TIMER creation
71
hw/arm/mps2: Create and connect SYSCLK Clock
72
hw/arm/mps2-tz: Create and connect ARMSSE Clocks
73
hw/arm/musca: Create and connect ARMSSE Clocks
74
hw/arm/stellaris: Convert SSYS to QOM device
75
hw/arm/stellaris: Create Clock input for watchdog
76
hw/timer/cmsdk-apb-timer: Convert to use Clock input
77
hw/timer/cmsdk-apb-dualtimer: Convert to use Clock input
78
hw/watchdog/cmsdk-apb-watchdog: Convert to use Clock input
79
tests/qtest/cmsdk-apb-watchdog-test: Test clock changes
80
hw/arm/armsse: Use Clock to set system_clock_scale
81
arm: Don't set freq properties on CMSDK timer, dualtimer, watchdog, ARMSSE
82
arm: Remove frq properties on CMSDK timer, dualtimer, watchdog, ARMSSE
83
hw/arm/stellaris: Remove board-creation reset of STELLARIS_SYS
84
85
Philippe Mathieu-Daudé (1):
86
target/arm: Replace magic value by MMU_DATA_LOAD definition
87
88
Richard Henderson (2):
89
target/arm: Implement ID_PFR2
90
target/arm: Conditionalize DBGDIDR
91
92
docs/devel/clocks.rst | 16 +++
93
docs/specs/pci-ids.txt | 1 +
94
docs/specs/pvpanic.txt | 13 ++-
95
docs/system/arm/virt.rst | 2 +
96
configure | 78 ++++++++------
97
meson.build | 34 ++++++-
98
include/hw/arm/armsse.h | 14 ++-
99
include/hw/arm/virt.h | 2 +
100
include/hw/clock.h | 15 +++
101
include/hw/misc/pvpanic.h | 24 ++++-
102
include/hw/pci/pci.h | 1 +
103
include/hw/ptimer.h | 22 ++++
104
include/hw/timer/cmsdk-apb-dualtimer.h | 5 +-
105
include/hw/timer/cmsdk-apb-timer.h | 34 ++-----
106
include/hw/watchdog/cmsdk-apb-watchdog.h | 5 +-
107
include/qemu/osdep.h | 12 +++
108
include/qemu/typedefs.h | 1 +
109
target/arm/cpu.h | 1 +
110
hw/arm/armsse.c | 48 ++++++---
111
hw/arm/mps2-tz.c | 14 ++-
112
hw/arm/mps2.c | 28 ++++-
113
hw/arm/musca.c | 13 ++-
114
hw/arm/stellaris.c | 170 +++++++++++++++++++++++--------
115
hw/arm/virt.c | 111 ++++++++++++++++----
116
hw/arm/xlnx-zcu102.c | 4 +-
117
hw/core/ptimer.c | 34 +++++++
118
hw/gpio/gpio_pwr.c | 70 +++++++++++++
119
hw/misc/npcm7xx_pwm.c | 23 ++++-
120
hw/misc/pvpanic-isa.c | 94 +++++++++++++++++
121
hw/misc/pvpanic-pci.c | 94 +++++++++++++++++
122
hw/misc/pvpanic.c | 85 ++--------------
123
hw/timer/cmsdk-apb-dualtimer.c | 53 +++++++---
124
hw/timer/cmsdk-apb-timer.c | 55 +++++-----
125
hw/watchdog/cmsdk-apb-watchdog.c | 29 ++++--
126
target/arm/helper.c | 27 +++--
127
target/arm/kvm64.c | 2 +
128
tests/qtest/cmsdk-apb-dualtimer-test.c | 130 +++++++++++++++++++++++
129
tests/qtest/cmsdk-apb-timer-test.c | 75 ++++++++++++++
130
tests/qtest/cmsdk-apb-watchdog-test.c | 131 ++++++++++++++++++++++++
131
tests/qtest/npcm7xx_pwm-test.c | 4 +-
132
tests/qtest/pvpanic-pci-test.c | 94 +++++++++++++++++
133
tests/qtest/xlnx-can-test.c | 30 +++---
134
MAINTAINERS | 3 +
135
accel/hvf/entitlements.plist | 8 ++
136
hw/arm/Kconfig | 1 +
137
hw/gpio/Kconfig | 3 +
138
hw/gpio/meson.build | 1 +
139
hw/i386/Kconfig | 2 +-
140
hw/misc/Kconfig | 12 ++-
141
hw/misc/meson.build | 4 +-
142
scripts/entitlement.sh | 13 +++
143
tests/qtest/meson.build | 6 +-
144
52 files changed, 1432 insertions(+), 319 deletions(-)
145
create mode 100644 hw/gpio/gpio_pwr.c
146
create mode 100644 hw/misc/pvpanic-isa.c
147
create mode 100644 hw/misc/pvpanic-pci.c
148
create mode 100644 tests/qtest/cmsdk-apb-dualtimer-test.c
149
create mode 100644 tests/qtest/cmsdk-apb-timer-test.c
150
create mode 100644 tests/qtest/cmsdk-apb-watchdog-test.c
151
create mode 100644 tests/qtest/pvpanic-pci-test.c
152
create mode 100644 accel/hvf/entitlements.plist
153
create mode 100755 scripts/entitlement.sh
154
diff view generated by jsdifflib
1
The aa32_fp16_arith feature check function currently looks at the
1
From: Richard Henderson <richard.henderson@linaro.org>
2
AArch64 ID_AA64PFR0 register. This is (as the comment notes) not
3
correct. The bogus check was put in mostly to allow testing of the
4
fp16 variants of the VCMLA instructions and it was something of
5
a mistake that we allowed them to exist in master.
6
2
7
Switch the feature check function to testing VMFR1.FPHP, which is
3
This was defined at some point before ARMv8.4, and will
8
what it ought to be.
4
shortly be used by new processor descriptions.
9
5
10
This will remove emulation of the VCMLA and VCADD insns from
6
Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
11
AArch32 code running on an AArch64 '-cpu max' using system emulation.
7
Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
12
(They were never enabled for aarch32 linux-user and system-emulation.)
8
Message-id: 20210120204400.1056582-1-richard.henderson@linaro.org
13
Since we weren't advertising their existence via the AArch32 ID
14
register, well-behaved guests wouldn't have been using them anyway.
15
16
Once we have implemented all the AArch32 support for the FP16 extension
17
we will advertise it in the MVFR1 ID register field, which will reenable
18
these insns along with all the others.
19
20
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
9
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
21
Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
22
Message-id: 20200828183354.27913-3-peter.maydell@linaro.org
23
---
10
---
24
target/arm/cpu.h | 7 +------
11
target/arm/cpu.h | 1 +
25
1 file changed, 1 insertion(+), 6 deletions(-)
12
target/arm/helper.c | 4 ++--
13
target/arm/kvm64.c | 2 ++
14
3 files changed, 5 insertions(+), 2 deletions(-)
26
15
27
diff --git a/target/arm/cpu.h b/target/arm/cpu.h
16
diff --git a/target/arm/cpu.h b/target/arm/cpu.h
28
index XXXXXXX..XXXXXXX 100644
17
index XXXXXXX..XXXXXXX 100644
29
--- a/target/arm/cpu.h
18
--- a/target/arm/cpu.h
30
+++ b/target/arm/cpu.h
19
+++ b/target/arm/cpu.h
31
@@ -XXX,XX +XXX,XX @@ static inline bool isar_feature_aa32_predinv(const ARMISARegisters *id)
20
@@ -XXX,XX +XXX,XX @@ struct ARMCPU {
32
21
uint32_t id_mmfr4;
33
static inline bool isar_feature_aa32_fp16_arith(const ARMISARegisters *id)
22
uint32_t id_pfr0;
34
{
23
uint32_t id_pfr1;
35
- /*
24
+ uint32_t id_pfr2;
36
- * This is a placeholder for use by VCMA until the rest of
25
uint32_t mvfr0;
37
- * the ARMv8.2-FP16 extension is implemented for aa32 mode.
26
uint32_t mvfr1;
38
- * At which point we can properly set and check MVFR1.FPHP.
27
uint32_t mvfr2;
39
- */
28
diff --git a/target/arm/helper.c b/target/arm/helper.c
40
- return FIELD_EX64(id->id_aa64pfr0, ID_AA64PFR0, FP) == 1;
29
index XXXXXXX..XXXXXXX 100644
41
+ return FIELD_EX32(id->mvfr1, MVFR1, FPHP) >= 3;
30
--- a/target/arm/helper.c
42
}
31
+++ b/target/arm/helper.c
43
32
@@ -XXX,XX +XXX,XX @@ void register_cp_regs_for_features(ARMCPU *cpu)
44
static inline bool isar_feature_aa32_vfp_simd(const ARMISARegisters *id)
33
.access = PL1_R, .type = ARM_CP_CONST,
34
.accessfn = access_aa64_tid3,
35
.resetvalue = 0 },
36
- { .name = "MVFR4_EL1_RESERVED", .state = ARM_CP_STATE_AA64,
37
+ { .name = "ID_PFR2", .state = ARM_CP_STATE_BOTH,
38
.opc0 = 3, .opc1 = 0, .crn = 0, .crm = 3, .opc2 = 4,
39
.access = PL1_R, .type = ARM_CP_CONST,
40
.accessfn = access_aa64_tid3,
41
- .resetvalue = 0 },
42
+ .resetvalue = cpu->isar.id_pfr2 },
43
{ .name = "MVFR5_EL1_RESERVED", .state = ARM_CP_STATE_AA64,
44
.opc0 = 3, .opc1 = 0, .crn = 0, .crm = 3, .opc2 = 5,
45
.access = PL1_R, .type = ARM_CP_CONST,
46
diff --git a/target/arm/kvm64.c b/target/arm/kvm64.c
47
index XXXXXXX..XXXXXXX 100644
48
--- a/target/arm/kvm64.c
49
+++ b/target/arm/kvm64.c
50
@@ -XXX,XX +XXX,XX @@ bool kvm_arm_get_host_cpu_features(ARMHostCPUFeatures *ahcf)
51
ARM64_SYS_REG(3, 0, 0, 1, 0));
52
err |= read_sys_reg32(fdarray[2], &ahcf->isar.id_pfr1,
53
ARM64_SYS_REG(3, 0, 0, 1, 1));
54
+ err |= read_sys_reg32(fdarray[2], &ahcf->isar.id_pfr2,
55
+ ARM64_SYS_REG(3, 0, 0, 3, 4));
56
err |= read_sys_reg32(fdarray[2], &ahcf->isar.id_dfr0,
57
ARM64_SYS_REG(3, 0, 0, 1, 2));
58
err |= read_sys_reg32(fdarray[2], &ahcf->isar.id_mmfr0,
45
--
59
--
46
2.20.1
60
2.20.1
47
61
48
62
diff view generated by jsdifflib
1
Macroify the uses of do_vfp_2op_sp() and do_vfp_2op_dp(); this will
1
From: Richard Henderson <richard.henderson@linaro.org>
2
make it easier to add the halfprec support.
3
2
3
Only define the register if it exists for the cpu.
4
5
Signed-off-by: Richard Henderson <richard.henderson@linaro.org>
6
Message-id: 20210120031656.737646-1-richard.henderson@linaro.org
7
Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
4
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
8
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
5
Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
6
Message-id: 20200828183354.27913-8-peter.maydell@linaro.org
7
---
9
---
8
target/arm/translate-vfp.c.inc | 49 ++++++++++------------------------
10
target/arm/helper.c | 21 +++++++++++++++------
9
1 file changed, 14 insertions(+), 35 deletions(-)
11
1 file changed, 15 insertions(+), 6 deletions(-)
10
12
11
diff --git a/target/arm/translate-vfp.c.inc b/target/arm/translate-vfp.c.inc
13
diff --git a/target/arm/helper.c b/target/arm/helper.c
12
index XXXXXXX..XXXXXXX 100644
14
index XXXXXXX..XXXXXXX 100644
13
--- a/target/arm/translate-vfp.c.inc
15
--- a/target/arm/helper.c
14
+++ b/target/arm/translate-vfp.c.inc
16
+++ b/target/arm/helper.c
15
@@ -XXX,XX +XXX,XX @@ static bool trans_VMOV_imm_dp(DisasContext *s, arg_VMOV_imm_dp *a)
17
@@ -XXX,XX +XXX,XX @@ static void define_debug_regs(ARMCPU *cpu)
16
return true;
18
*/
17
}
19
int i;
18
20
int wrps, brps, ctx_cmps;
19
-static bool trans_VMOV_reg_sp(DisasContext *s, arg_VMOV_reg_sp *a)
21
- ARMCPRegInfo dbgdidr = {
20
-{
22
- .name = "DBGDIDR", .cp = 14, .crn = 0, .crm = 0, .opc1 = 0, .opc2 = 0,
21
- return do_vfp_2op_sp(s, tcg_gen_mov_i32, a->vd, a->vm);
23
- .access = PL0_R, .accessfn = access_tda,
22
-}
24
- .type = ARM_CP_CONST, .resetvalue = cpu->isar.dbgdidr,
23
+#define DO_VFP_2OP(INSN, PREC, FN) \
25
- };
24
+ static bool trans_##INSN##_##PREC(DisasContext *s, \
26
+
25
+ arg_##INSN##_##PREC *a) \
27
+ /*
26
+ { \
28
+ * The Arm ARM says DBGDIDR is optional and deprecated if EL1 cannot
27
+ return do_vfp_2op_##PREC(s, FN, a->vd, a->vm); \
29
+ * use AArch32. Given that bit 15 is RES1, if the value is 0 then
30
+ * the register must not exist for this cpu.
31
+ */
32
+ if (cpu->isar.dbgdidr != 0) {
33
+ ARMCPRegInfo dbgdidr = {
34
+ .name = "DBGDIDR", .cp = 14, .crn = 0, .crm = 0,
35
+ .opc1 = 0, .opc2 = 0,
36
+ .access = PL0_R, .accessfn = access_tda,
37
+ .type = ARM_CP_CONST, .resetvalue = cpu->isar.dbgdidr,
38
+ };
39
+ define_one_arm_cp_reg(cpu, &dbgdidr);
28
+ }
40
+ }
29
41
30
-static bool trans_VMOV_reg_dp(DisasContext *s, arg_VMOV_reg_dp *a)
42
/* Note that all these register fields hold "number of Xs minus 1". */
31
-{
43
brps = arm_num_brps(cpu);
32
- return do_vfp_2op_dp(s, tcg_gen_mov_i64, a->vd, a->vm);
44
@@ -XXX,XX +XXX,XX @@ static void define_debug_regs(ARMCPU *cpu)
33
-}
45
34
+DO_VFP_2OP(VMOV_reg, sp, tcg_gen_mov_i32)
46
assert(ctx_cmps <= brps);
35
+DO_VFP_2OP(VMOV_reg, dp, tcg_gen_mov_i64)
47
36
48
- define_one_arm_cp_reg(cpu, &dbgdidr);
37
-static bool trans_VABS_sp(DisasContext *s, arg_VABS_sp *a)
49
define_arm_cp_regs(cpu, debug_cp_reginfo);
38
-{
50
39
- return do_vfp_2op_sp(s, gen_helper_vfp_abss, a->vd, a->vm);
51
if (arm_feature(&cpu->env, ARM_FEATURE_LPAE)) {
40
-}
41
+DO_VFP_2OP(VABS, sp, gen_helper_vfp_abss)
42
+DO_VFP_2OP(VABS, dp, gen_helper_vfp_absd)
43
44
-static bool trans_VABS_dp(DisasContext *s, arg_VABS_dp *a)
45
-{
46
- return do_vfp_2op_dp(s, gen_helper_vfp_absd, a->vd, a->vm);
47
-}
48
-
49
-static bool trans_VNEG_sp(DisasContext *s, arg_VNEG_sp *a)
50
-{
51
- return do_vfp_2op_sp(s, gen_helper_vfp_negs, a->vd, a->vm);
52
-}
53
-
54
-static bool trans_VNEG_dp(DisasContext *s, arg_VNEG_dp *a)
55
-{
56
- return do_vfp_2op_dp(s, gen_helper_vfp_negd, a->vd, a->vm);
57
-}
58
+DO_VFP_2OP(VNEG, sp, gen_helper_vfp_negs)
59
+DO_VFP_2OP(VNEG, dp, gen_helper_vfp_negd)
60
61
static void gen_VSQRT_sp(TCGv_i32 vd, TCGv_i32 vm)
62
{
63
gen_helper_vfp_sqrts(vd, vm, cpu_env);
64
}
65
66
-static bool trans_VSQRT_sp(DisasContext *s, arg_VSQRT_sp *a)
67
-{
68
- return do_vfp_2op_sp(s, gen_VSQRT_sp, a->vd, a->vm);
69
-}
70
-
71
static void gen_VSQRT_dp(TCGv_i64 vd, TCGv_i64 vm)
72
{
73
gen_helper_vfp_sqrtd(vd, vm, cpu_env);
74
}
75
76
-static bool trans_VSQRT_dp(DisasContext *s, arg_VSQRT_dp *a)
77
-{
78
- return do_vfp_2op_dp(s, gen_VSQRT_dp, a->vd, a->vm);
79
-}
80
+DO_VFP_2OP(VSQRT, sp, gen_VSQRT_sp)
81
+DO_VFP_2OP(VSQRT, dp, gen_VSQRT_dp)
82
83
static bool trans_VCMP_sp(DisasContext *s, arg_VCMP_sp *a)
84
{
85
--
52
--
86
2.20.1
53
2.20.1
87
54
88
55
diff view generated by jsdifflib
1
Set the MVFR1 ID register FPHP and SIMDHP fields to indicate
1
From: Paolo Bonzini <pbonzini@redhat.com>
2
that our "-cpu max" has v8.2-FP16.
3
2
3
The properties to attach a CANBUS object to the xlnx-zcu102 machine have
4
a period in them. We want to use periods in properties for compound QAPI types,
5
and besides the "xlnx-zcu102." prefix is both unnecessary and different
6
from any other machine property name. Remove it.
7
8
Signed-off-by: Paolo Bonzini <pbonzini@redhat.com>
9
Message-id: 20210118162537.779542-1-pbonzini@redhat.com
10
Reviewed-by: Vikram Garhwal <fnu.vikram@xilinx.com>
4
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
11
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
5
Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
6
Message-id: 20200828183354.27913-46-peter.maydell@linaro.org
7
---
12
---
8
target/arm/cpu.c | 3 ++-
13
hw/arm/xlnx-zcu102.c | 4 ++--
9
target/arm/cpu64.c | 10 ++++------
14
tests/qtest/xlnx-can-test.c | 30 +++++++++++++++---------------
10
2 files changed, 6 insertions(+), 7 deletions(-)
15
2 files changed, 17 insertions(+), 17 deletions(-)
11
16
12
diff --git a/target/arm/cpu.c b/target/arm/cpu.c
17
diff --git a/hw/arm/xlnx-zcu102.c b/hw/arm/xlnx-zcu102.c
13
index XXXXXXX..XXXXXXX 100644
18
index XXXXXXX..XXXXXXX 100644
14
--- a/target/arm/cpu.c
19
--- a/hw/arm/xlnx-zcu102.c
15
+++ b/target/arm/cpu.c
20
+++ b/hw/arm/xlnx-zcu102.c
16
@@ -XXX,XX +XXX,XX @@ static void arm_max_initfn(Object *obj)
21
@@ -XXX,XX +XXX,XX @@ static void xlnx_zcu102_machine_instance_init(Object *obj)
17
cpu->isar.id_isar6 = t;
22
s->secure = false;
18
23
/* Default to virt (EL2) being disabled */
19
t = cpu->isar.mvfr1;
24
s->virt = false;
20
- t = FIELD_DP32(t, MVFR1, FPHP, 2); /* v8.0 FP support */
25
- object_property_add_link(obj, "xlnx-zcu102.canbus0", TYPE_CAN_BUS,
21
+ t = FIELD_DP32(t, MVFR1, FPHP, 3); /* v8.2-FP16 */
26
+ object_property_add_link(obj, "canbus0", TYPE_CAN_BUS,
22
+ t = FIELD_DP32(t, MVFR1, SIMDHP, 2); /* v8.2-FP16 */
27
(Object **)&s->canbus[0],
23
cpu->isar.mvfr1 = t;
28
object_property_allow_set_link,
24
29
0);
25
t = cpu->isar.mvfr2;
30
26
diff --git a/target/arm/cpu64.c b/target/arm/cpu64.c
31
- object_property_add_link(obj, "xlnx-zcu102.canbus1", TYPE_CAN_BUS,
32
+ object_property_add_link(obj, "canbus1", TYPE_CAN_BUS,
33
(Object **)&s->canbus[1],
34
object_property_allow_set_link,
35
0);
36
diff --git a/tests/qtest/xlnx-can-test.c b/tests/qtest/xlnx-can-test.c
27
index XXXXXXX..XXXXXXX 100644
37
index XXXXXXX..XXXXXXX 100644
28
--- a/target/arm/cpu64.c
38
--- a/tests/qtest/xlnx-can-test.c
29
+++ b/target/arm/cpu64.c
39
+++ b/tests/qtest/xlnx-can-test.c
30
@@ -XXX,XX +XXX,XX @@ static void aarch64_max_initfn(Object *obj)
40
@@ -XXX,XX +XXX,XX @@ static void test_can_bus(void)
31
u = FIELD_DP32(u, ID_DFR0, PERFMON, 5); /* v8.4-PMU */
41
uint8_t can_timestamp = 1;
32
cpu->isar.id_dfr0 = u;
42
33
43
QTestState *qts = qtest_init("-machine xlnx-zcu102"
34
- /*
44
- " -object can-bus,id=canbus0"
35
- * FIXME: We do not yet support ARMv8.2-fp16 for AArch32 yet,
45
- " -machine xlnx-zcu102.canbus0=canbus0"
36
- * so do not set MVFR1.FPHP. Strictly speaking this is not legal,
46
- " -machine xlnx-zcu102.canbus1=canbus0"
37
- * but it is also not legal to enable SVE without support for FP16,
47
+ " -object can-bus,id=canbus"
38
- * and enabling SVE in system mode is more useful in the short term.
48
+ " -machine canbus0=canbus"
39
- */
49
+ " -machine canbus1=canbus"
40
+ u = cpu->isar.mvfr1;
50
);
41
+ u = FIELD_DP32(u, MVFR1, FPHP, 3); /* v8.2-FP16 */
51
42
+ u = FIELD_DP32(u, MVFR1, SIMDHP, 2); /* v8.2-FP16 */
52
/* Configure the CAN0 and CAN1. */
43
+ cpu->isar.mvfr1 = u;
53
@@ -XXX,XX +XXX,XX @@ static void test_can_loopback(void)
44
54
uint32_t status = 0;
45
#ifdef CONFIG_USER_ONLY
55
46
/* For usermode -cpu max we can use a larger and more efficient DCZ
56
QTestState *qts = qtest_init("-machine xlnx-zcu102"
57
- " -object can-bus,id=canbus0"
58
- " -machine xlnx-zcu102.canbus0=canbus0"
59
- " -machine xlnx-zcu102.canbus1=canbus0"
60
+ " -object can-bus,id=canbus"
61
+ " -machine canbus0=canbus"
62
+ " -machine canbus1=canbus"
63
);
64
65
/* Configure the CAN0 in loopback mode. */
66
@@ -XXX,XX +XXX,XX @@ static void test_can_filter(void)
67
uint8_t can_timestamp = 1;
68
69
QTestState *qts = qtest_init("-machine xlnx-zcu102"
70
- " -object can-bus,id=canbus0"
71
- " -machine xlnx-zcu102.canbus0=canbus0"
72
- " -machine xlnx-zcu102.canbus1=canbus0"
73
+ " -object can-bus,id=canbus"
74
+ " -machine canbus0=canbus"
75
+ " -machine canbus1=canbus"
76
);
77
78
/* Configure the CAN0 and CAN1. */
79
@@ -XXX,XX +XXX,XX @@ static void test_can_sleepmode(void)
80
uint8_t can_timestamp = 1;
81
82
QTestState *qts = qtest_init("-machine xlnx-zcu102"
83
- " -object can-bus,id=canbus0"
84
- " -machine xlnx-zcu102.canbus0=canbus0"
85
- " -machine xlnx-zcu102.canbus1=canbus0"
86
+ " -object can-bus,id=canbus"
87
+ " -machine canbus0=canbus"
88
+ " -machine canbus1=canbus"
89
);
90
91
/* Configure the CAN0. */
92
@@ -XXX,XX +XXX,XX @@ static void test_can_snoopmode(void)
93
uint8_t can_timestamp = 1;
94
95
QTestState *qts = qtest_init("-machine xlnx-zcu102"
96
- " -object can-bus,id=canbus0"
97
- " -machine xlnx-zcu102.canbus0=canbus0"
98
- " -machine xlnx-zcu102.canbus1=canbus0"
99
+ " -object can-bus,id=canbus"
100
+ " -machine canbus0=canbus"
101
+ " -machine canbus1=canbus"
102
);
103
104
/* Configure the CAN0. */
47
--
105
--
48
2.20.1
106
2.20.1
49
107
50
108
diff view generated by jsdifflib
1
Convert the neon floating-point vector operations VFMA and VFMS
1
From: Maxim Uvarov <maxim.uvarov@linaro.org>
2
to use a gvec helper, and use this to implement the fp16 case.
3
2
4
This is the last use of do_3same_fp() so we can now delete
3
Implement gpio-pwr driver to allow reboot and poweroff machine.
5
that function.
4
This is simple driver with just 2 gpios lines. Current use case
5
is to reboot and poweroff virt machine in secure mode. Secure
6
pl066 gpio chip is needed for that.
6
7
8
Signed-off-by: Maxim Uvarov <maxim.uvarov@linaro.org>
9
Reviewed-by: Hao Wu <wuhaotsh@google.com>
10
Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
7
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
11
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
8
Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
9
Message-id: 20200828183354.27913-32-peter.maydell@linaro.org
10
---
12
---
11
target/arm/helper.h | 6 +++
13
hw/gpio/gpio_pwr.c | 70 +++++++++++++++++++++++++++++++++++++++++++++
12
target/arm/vec_helper.c | 33 +++++++++++-
14
hw/gpio/Kconfig | 3 ++
13
target/arm/translate-neon.c.inc | 92 +--------------------------------
15
hw/gpio/meson.build | 1 +
14
3 files changed, 40 insertions(+), 91 deletions(-)
16
3 files changed, 74 insertions(+)
17
create mode 100644 hw/gpio/gpio_pwr.c
15
18
16
diff --git a/target/arm/helper.h b/target/arm/helper.h
19
diff --git a/hw/gpio/gpio_pwr.c b/hw/gpio/gpio_pwr.c
17
index XXXXXXX..XXXXXXX 100644
20
new file mode 100644
18
--- a/target/arm/helper.h
21
index XXXXXXX..XXXXXXX
19
+++ b/target/arm/helper.h
22
--- /dev/null
20
@@ -XXX,XX +XXX,XX @@ DEF_HELPER_FLAGS_5(gvec_fmla_s, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, ptr, i32)
23
+++ b/hw/gpio/gpio_pwr.c
21
DEF_HELPER_FLAGS_5(gvec_fmls_h, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, ptr, i32)
24
@@ -XXX,XX +XXX,XX @@
22
DEF_HELPER_FLAGS_5(gvec_fmls_s, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, ptr, i32)
25
+/*
23
26
+ * GPIO qemu power controller
24
+DEF_HELPER_FLAGS_5(gvec_vfma_h, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, ptr, i32)
27
+ *
25
+DEF_HELPER_FLAGS_5(gvec_vfma_s, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, ptr, i32)
28
+ * Copyright (c) 2020 Linaro Limited
29
+ *
30
+ * Author: Maxim Uvarov <maxim.uvarov@linaro.org>
31
+ *
32
+ * Virtual gpio driver which can be used on top of pl061
33
+ * to reboot and shutdown qemu virtual machine. One of use
34
+ * case is gpio driver for secure world application (ARM
35
+ * Trusted Firmware.).
36
+ *
37
+ * This work is licensed under the terms of the GNU GPL, version 2 or later.
38
+ * See the COPYING file in the top-level directory.
39
+ * SPDX-License-Identifier: GPL-2.0-or-later
40
+ */
26
+
41
+
27
+DEF_HELPER_FLAGS_5(gvec_vfms_h, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, ptr, i32)
42
+/*
28
+DEF_HELPER_FLAGS_5(gvec_vfms_s, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, ptr, i32)
43
+ * QEMU interface:
44
+ * two named input GPIO lines:
45
+ * 'reset' : when asserted, trigger system reset
46
+ * 'shutdown' : when asserted, trigger system shutdown
47
+ */
29
+
48
+
30
DEF_HELPER_FLAGS_5(gvec_ftsmul_h, TCG_CALL_NO_RWG,
49
+#include "qemu/osdep.h"
31
void, ptr, ptr, ptr, ptr, i32)
50
+#include "hw/sysbus.h"
32
DEF_HELPER_FLAGS_5(gvec_ftsmul_s, TCG_CALL_NO_RWG,
51
+#include "sysemu/runstate.h"
33
diff --git a/target/arm/vec_helper.c b/target/arm/vec_helper.c
52
+
34
index XXXXXXX..XXXXXXX 100644
53
+#define TYPE_GPIOPWR "gpio-pwr"
35
--- a/target/arm/vec_helper.c
54
+OBJECT_DECLARE_SIMPLE_TYPE(GPIO_PWR_State, GPIOPWR)
36
+++ b/target/arm/vec_helper.c
55
+
37
@@ -XXX,XX +XXX,XX @@ static float32 float32_mulsub_nf(float32 dest, float32 op1, float32 op2,
56
+struct GPIO_PWR_State {
38
return float32_sub(dest, float32_mul(op1, op2, stat), stat);
57
+ SysBusDevice parent_obj;
39
}
58
+};
40
59
+
41
-#define DO_MULADD(NAME, FUNC, TYPE) \
60
+static void gpio_pwr_reset(void *opaque, int n, int level)
42
+/* Fused versions; these have the semantics Neon VFMA/VFMS want */
43
+static float16 float16_muladd_f(float16 dest, float16 op1, float16 op2,
44
+ float_status *stat)
45
+{
61
+{
46
+ return float16_muladd(op1, op2, dest, 0, stat);
62
+ if (level) {
63
+ qemu_system_reset_request(SHUTDOWN_CAUSE_GUEST_RESET);
64
+ }
47
+}
65
+}
48
+
66
+
49
+static float32 float32_muladd_f(float32 dest, float32 op1, float32 op2,
67
+static void gpio_pwr_shutdown(void *opaque, int n, int level)
50
+ float_status *stat)
51
+{
68
+{
52
+ return float32_muladd(op1, op2, dest, 0, stat);
69
+ if (level) {
70
+ qemu_system_reset_request(SHUTDOWN_CAUSE_GUEST_SHUTDOWN);
71
+ }
53
+}
72
+}
54
+
73
+
55
+static float16 float16_mulsub_f(float16 dest, float16 op1, float16 op2,
74
+static void gpio_pwr_init(Object *obj)
56
+ float_status *stat)
57
+{
75
+{
58
+ return float16_muladd(float16_chs(op1), op2, dest, 0, stat);
76
+ DeviceState *dev = DEVICE(obj);
77
+
78
+ qdev_init_gpio_in_named(dev, gpio_pwr_reset, "reset", 1);
79
+ qdev_init_gpio_in_named(dev, gpio_pwr_shutdown, "shutdown", 1);
59
+}
80
+}
60
+
81
+
61
+static float32 float32_mulsub_f(float32 dest, float32 op1, float32 op2,
82
+static const TypeInfo gpio_pwr_info = {
62
+ float_status *stat)
83
+ .name = TYPE_GPIOPWR,
84
+ .parent = TYPE_SYS_BUS_DEVICE,
85
+ .instance_size = sizeof(GPIO_PWR_State),
86
+ .instance_init = gpio_pwr_init,
87
+};
88
+
89
+static void gpio_pwr_register_types(void)
63
+{
90
+{
64
+ return float32_muladd(float32_chs(op1), op2, dest, 0, stat);
91
+ type_register_static(&gpio_pwr_info);
65
+}
92
+}
66
+
93
+
67
+#define DO_MULADD(NAME, FUNC, TYPE) \
94
+type_init(gpio_pwr_register_types)
68
void HELPER(NAME)(void *vd, void *vn, void *vm, void *stat, uint32_t desc) \
95
diff --git a/hw/gpio/Kconfig b/hw/gpio/Kconfig
69
{ \
96
index XXXXXXX..XXXXXXX 100644
70
intptr_t i, oprsz = simd_oprsz(desc); \
97
--- a/hw/gpio/Kconfig
71
@@ -XXX,XX +XXX,XX @@ DO_MULADD(gvec_fmla_s, float32_muladd_nf, float32)
98
+++ b/hw/gpio/Kconfig
72
DO_MULADD(gvec_fmls_h, float16_mulsub_nf, float16)
99
@@ -XXX,XX +XXX,XX @@ config PL061
73
DO_MULADD(gvec_fmls_s, float32_mulsub_nf, float32)
100
config GPIO_KEY
74
101
bool
75
+DO_MULADD(gvec_vfma_h, float16_muladd_f, float16)
102
76
+DO_MULADD(gvec_vfma_s, float32_muladd_f, float32)
103
+config GPIO_PWR
104
+ bool
77
+
105
+
78
+DO_MULADD(gvec_vfms_h, float16_mulsub_f, float16)
106
config SIFIVE_GPIO
79
+DO_MULADD(gvec_vfms_s, float32_mulsub_f, float32)
107
bool
80
+
108
diff --git a/hw/gpio/meson.build b/hw/gpio/meson.build
81
/* For the indexed ops, SVE applies the index per 128-bit vector segment.
82
* For AdvSIMD, there is of course only one such vector segment.
83
*/
84
diff --git a/target/arm/translate-neon.c.inc b/target/arm/translate-neon.c.inc
85
index XXXXXXX..XXXXXXX 100644
109
index XXXXXXX..XXXXXXX 100644
86
--- a/target/arm/translate-neon.c.inc
110
--- a/hw/gpio/meson.build
87
+++ b/target/arm/translate-neon.c.inc
111
+++ b/hw/gpio/meson.build
88
@@ -XXX,XX +XXX,XX @@ DO_3SAME_PAIR(VPADD, padd_u)
112
@@ -XXX,XX +XXX,XX @@
89
DO_3SAME_VQDMULH(VQDMULH, qdmulh)
113
softmmu_ss.add(when: 'CONFIG_E500', if_true: files('mpc8xxx.c'))
90
DO_3SAME_VQDMULH(VQRDMULH, qrdmulh)
114
softmmu_ss.add(when: 'CONFIG_GPIO_KEY', if_true: files('gpio_key.c'))
91
115
+softmmu_ss.add(when: 'CONFIG_GPIO_PWR', if_true: files('gpio_pwr.c'))
92
-static bool do_3same_fp(DisasContext *s, arg_3same *a, VFPGen3OpSPFn *fn,
116
softmmu_ss.add(when: 'CONFIG_MAX7310', if_true: files('max7310.c'))
93
- bool reads_vd)
117
softmmu_ss.add(when: 'CONFIG_PL061', if_true: files('pl061.c'))
94
-{
118
softmmu_ss.add(when: 'CONFIG_PUV3', if_true: files('puv3_gpio.c'))
95
- /*
96
- * FP operations handled elementwise 32 bits at a time.
97
- * If reads_vd is true then the old value of Vd will be
98
- * loaded before calling the callback function. This is
99
- * used for multiply-accumulate type operations.
100
- */
101
- TCGv_i32 tmp, tmp2;
102
- int pass;
103
-
104
- if (!arm_dc_feature(s, ARM_FEATURE_NEON)) {
105
- return false;
106
- }
107
-
108
- /* UNDEF accesses to D16-D31 if they don't exist. */
109
- if (!dc_isar_feature(aa32_simd_r32, s) &&
110
- ((a->vd | a->vn | a->vm) & 0x10)) {
111
- return false;
112
- }
113
-
114
- if ((a->vn | a->vm | a->vd) & a->q) {
115
- return false;
116
- }
117
-
118
- if (!vfp_access_check(s)) {
119
- return true;
120
- }
121
-
122
- TCGv_ptr fpstatus = fpstatus_ptr(FPST_STD);
123
- for (pass = 0; pass < (a->q ? 4 : 2); pass++) {
124
- tmp = neon_load_reg(a->vn, pass);
125
- tmp2 = neon_load_reg(a->vm, pass);
126
- if (reads_vd) {
127
- TCGv_i32 tmp_rd = neon_load_reg(a->vd, pass);
128
- fn(tmp_rd, tmp, tmp2, fpstatus);
129
- neon_store_reg(a->vd, pass, tmp_rd);
130
- tcg_temp_free_i32(tmp);
131
- } else {
132
- fn(tmp, tmp, tmp2, fpstatus);
133
- neon_store_reg(a->vd, pass, tmp);
134
- }
135
- tcg_temp_free_i32(tmp2);
136
- }
137
- tcg_temp_free_ptr(fpstatus);
138
- return true;
139
-}
140
-
141
#define WRAP_FP_GVEC(WRAPNAME, FPST, FUNC) \
142
static void WRAPNAME(unsigned vece, uint32_t rd_ofs, \
143
uint32_t rn_ofs, uint32_t rm_ofs, \
144
@@ -XXX,XX +XXX,XX @@ DO_3S_FP_GVEC(VMAX, gen_helper_gvec_fmax_s, gen_helper_gvec_fmax_h)
145
DO_3S_FP_GVEC(VMIN, gen_helper_gvec_fmin_s, gen_helper_gvec_fmin_h)
146
DO_3S_FP_GVEC(VMLA, gen_helper_gvec_fmla_s, gen_helper_gvec_fmla_h)
147
DO_3S_FP_GVEC(VMLS, gen_helper_gvec_fmls_s, gen_helper_gvec_fmls_h)
148
+DO_3S_FP_GVEC(VFMA, gen_helper_gvec_vfma_s, gen_helper_gvec_vfma_h)
149
+DO_3S_FP_GVEC(VFMS, gen_helper_gvec_vfms_s, gen_helper_gvec_vfms_h)
150
151
WRAP_FP_GVEC(gen_VMAXNM_fp32_3s, FPST_STD, gen_helper_gvec_fmaxnum_s)
152
WRAP_FP_GVEC(gen_VMAXNM_fp16_3s, FPST_STD_F16, gen_helper_gvec_fmaxnum_h)
153
@@ -XXX,XX +XXX,XX @@ static bool trans_VRSQRTS_fp_3s(DisasContext *s, arg_3same *a)
154
return do_3same(s, a, gen_VRSQRTS_fp_3s);
155
}
156
157
-static void gen_VFMA_fp_3s(TCGv_i32 vd, TCGv_i32 vn, TCGv_i32 vm,
158
- TCGv_ptr fpstatus)
159
-{
160
- gen_helper_vfp_muladds(vd, vn, vm, vd, fpstatus);
161
-}
162
-
163
-static bool trans_VFMA_fp_3s(DisasContext *s, arg_3same *a)
164
-{
165
- if (!dc_isar_feature(aa32_simdfmac, s)) {
166
- return false;
167
- }
168
-
169
- if (a->size != 0) {
170
- /* TODO fp16 support */
171
- return false;
172
- }
173
-
174
- return do_3same_fp(s, a, gen_VFMA_fp_3s, true);
175
-}
176
-
177
-static void gen_VFMS_fp_3s(TCGv_i32 vd, TCGv_i32 vn, TCGv_i32 vm,
178
- TCGv_ptr fpstatus)
179
-{
180
- gen_helper_vfp_negs(vn, vn);
181
- gen_helper_vfp_muladds(vd, vn, vm, vd, fpstatus);
182
-}
183
-
184
-static bool trans_VFMS_fp_3s(DisasContext *s, arg_3same *a)
185
-{
186
- if (!dc_isar_feature(aa32_simdfmac, s)) {
187
- return false;
188
- }
189
-
190
- if (a->size != 0) {
191
- /* TODO fp16 support */
192
- return false;
193
- }
194
-
195
- return do_3same_fp(s, a, gen_VFMS_fp_3s, true);
196
-}
197
-
198
static bool do_3same_fp_pair(DisasContext *s, arg_3same *a, VFPGen3OpSPFn *fn)
199
{
200
/* FP operations handled pairwise 32 bits at a time */
201
--
119
--
202
2.20.1
120
2.20.1
203
121
204
122
diff view generated by jsdifflib
1
Implement FP16 support for the Neon insns which use the DO_3S_FP_GVEC
1
From: Maxim Uvarov <maxim.uvarov@linaro.org>
2
macro: VADD, VSUB, VABD, VMUL.
3
2
4
For VABD this requires us to implement a new gvec_fabd_h helper
3
No functional change. Just refactor code to better
5
using the machinery we have already for the other helpers.
4
support secure and normal world gpios.
6
5
6
Signed-off-by: Maxim Uvarov <maxim.uvarov@linaro.org>
7
Reviewed-by: Andrew Jones <drjones@redhat.com>
7
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
8
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
8
Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
9
Message-id: 20200828183354.27913-24-peter.maydell@linaro.org
10
---
9
---
11
target/arm/helper.h | 1 +
10
hw/arm/virt.c | 57 ++++++++++++++++++++++++++++++++-------------------
12
target/arm/vec_helper.c | 6 ++++++
11
1 file changed, 36 insertions(+), 21 deletions(-)
13
target/arm/translate-neon.c.inc | 36 +++++++++++++++++----------------
14
3 files changed, 26 insertions(+), 17 deletions(-)
15
12
16
diff --git a/target/arm/helper.h b/target/arm/helper.h
13
diff --git a/hw/arm/virt.c b/hw/arm/virt.c
17
index XXXXXXX..XXXXXXX 100644
14
index XXXXXXX..XXXXXXX 100644
18
--- a/target/arm/helper.h
15
--- a/hw/arm/virt.c
19
+++ b/target/arm/helper.h
16
+++ b/hw/arm/virt.c
20
@@ -XXX,XX +XXX,XX @@ DEF_HELPER_FLAGS_5(gvec_fmul_h, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, ptr, i32)
17
@@ -XXX,XX +XXX,XX @@ static void virt_powerdown_req(Notifier *n, void *opaque)
21
DEF_HELPER_FLAGS_5(gvec_fmul_s, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, ptr, i32)
18
}
22
DEF_HELPER_FLAGS_5(gvec_fmul_d, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, ptr, i32)
23
24
+DEF_HELPER_FLAGS_5(gvec_fabd_h, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, ptr, i32)
25
DEF_HELPER_FLAGS_5(gvec_fabd_s, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, ptr, i32)
26
27
DEF_HELPER_FLAGS_5(gvec_ftsmul_h, TCG_CALL_NO_RWG,
28
diff --git a/target/arm/vec_helper.c b/target/arm/vec_helper.c
29
index XXXXXXX..XXXXXXX 100644
30
--- a/target/arm/vec_helper.c
31
+++ b/target/arm/vec_helper.c
32
@@ -XXX,XX +XXX,XX @@ static float64 float64_ftsmul(float64 op1, uint64_t op2, float_status *stat)
33
return result;
34
}
19
}
35
20
36
+static float16 float16_abd(float16 op1, float16 op2, float_status *stat)
21
-static void create_gpio(const VirtMachineState *vms)
22
+static void create_gpio_keys(const VirtMachineState *vms,
23
+ DeviceState *pl061_dev,
24
+ uint32_t phandle)
37
+{
25
+{
38
+ return float16_abs(float16_sub(op1, op2, stat));
26
+ gpio_key_dev = sysbus_create_simple("gpio-key", -1,
27
+ qdev_get_gpio_in(pl061_dev, 3));
28
+
29
+ qemu_fdt_add_subnode(vms->fdt, "/gpio-keys");
30
+ qemu_fdt_setprop_string(vms->fdt, "/gpio-keys", "compatible", "gpio-keys");
31
+ qemu_fdt_setprop_cell(vms->fdt, "/gpio-keys", "#size-cells", 0);
32
+ qemu_fdt_setprop_cell(vms->fdt, "/gpio-keys", "#address-cells", 1);
33
+
34
+ qemu_fdt_add_subnode(vms->fdt, "/gpio-keys/poweroff");
35
+ qemu_fdt_setprop_string(vms->fdt, "/gpio-keys/poweroff",
36
+ "label", "GPIO Key Poweroff");
37
+ qemu_fdt_setprop_cell(vms->fdt, "/gpio-keys/poweroff", "linux,code",
38
+ KEY_POWER);
39
+ qemu_fdt_setprop_cells(vms->fdt, "/gpio-keys/poweroff",
40
+ "gpios", phandle, 3, 0);
39
+}
41
+}
40
+
42
+
41
static float32 float32_abd(float32 op1, float32 op2, float_status *stat)
43
+static void create_gpio_devices(const VirtMachineState *vms, int gpio,
44
+ MemoryRegion *mem)
42
{
45
{
43
return float32_abs(float32_sub(op1, op2, stat));
46
char *nodename;
44
@@ -XXX,XX +XXX,XX @@ DO_3OP(gvec_ftsmul_h, float16_ftsmul, float16)
47
DeviceState *pl061_dev;
45
DO_3OP(gvec_ftsmul_s, float32_ftsmul, float32)
48
- hwaddr base = vms->memmap[VIRT_GPIO].base;
46
DO_3OP(gvec_ftsmul_d, float64_ftsmul, float64)
49
- hwaddr size = vms->memmap[VIRT_GPIO].size;
47
50
- int irq = vms->irqmap[VIRT_GPIO];
48
+DO_3OP(gvec_fabd_h, float16_abd, float16)
51
+ hwaddr base = vms->memmap[gpio].base;
49
DO_3OP(gvec_fabd_s, float32_abd, float32)
52
+ hwaddr size = vms->memmap[gpio].size;
50
53
+ int irq = vms->irqmap[gpio];
51
#ifdef TARGET_AARCH64
54
const char compat[] = "arm,pl061\0arm,primecell";
52
diff --git a/target/arm/translate-neon.c.inc b/target/arm/translate-neon.c.inc
55
+ SysBusDevice *s;
53
index XXXXXXX..XXXXXXX 100644
56
54
--- a/target/arm/translate-neon.c.inc
57
- pl061_dev = sysbus_create_simple("pl061", base,
55
+++ b/target/arm/translate-neon.c.inc
58
- qdev_get_gpio_in(vms->gic, irq));
56
@@ -XXX,XX +XXX,XX @@ static bool do_3same_fp(DisasContext *s, arg_3same *a, VFPGen3OpSPFn *fn,
59
+ pl061_dev = qdev_new("pl061");
57
return true;
60
+ s = SYS_BUS_DEVICE(pl061_dev);
61
+ sysbus_realize_and_unref(s, &error_fatal);
62
+ memory_region_add_subregion(mem, base, sysbus_mmio_get_region(s, 0));
63
+ sysbus_connect_irq(s, 0, qdev_get_gpio_in(vms->gic, irq));
64
65
uint32_t phandle = qemu_fdt_alloc_phandle(vms->fdt);
66
nodename = g_strdup_printf("/pl061@%" PRIx64, base);
67
@@ -XXX,XX +XXX,XX @@ static void create_gpio(const VirtMachineState *vms)
68
qemu_fdt_setprop_string(vms->fdt, nodename, "clock-names", "apb_pclk");
69
qemu_fdt_setprop_cell(vms->fdt, nodename, "phandle", phandle);
70
71
- gpio_key_dev = sysbus_create_simple("gpio-key", -1,
72
- qdev_get_gpio_in(pl061_dev, 3));
73
- qemu_fdt_add_subnode(vms->fdt, "/gpio-keys");
74
- qemu_fdt_setprop_string(vms->fdt, "/gpio-keys", "compatible", "gpio-keys");
75
- qemu_fdt_setprop_cell(vms->fdt, "/gpio-keys", "#size-cells", 0);
76
- qemu_fdt_setprop_cell(vms->fdt, "/gpio-keys", "#address-cells", 1);
77
-
78
- qemu_fdt_add_subnode(vms->fdt, "/gpio-keys/poweroff");
79
- qemu_fdt_setprop_string(vms->fdt, "/gpio-keys/poweroff",
80
- "label", "GPIO Key Poweroff");
81
- qemu_fdt_setprop_cell(vms->fdt, "/gpio-keys/poweroff", "linux,code",
82
- KEY_POWER);
83
- qemu_fdt_setprop_cells(vms->fdt, "/gpio-keys/poweroff",
84
- "gpios", phandle, 3, 0);
85
g_free(nodename);
86
+
87
+ /* Child gpio devices */
88
+ create_gpio_keys(vms, pl061_dev, phandle);
58
}
89
}
59
90
60
-/*
91
static void create_virtio_devices(const VirtMachineState *vms)
61
- * For all the functions using this macro, size == 1 means fp16,
92
@@ -XXX,XX +XXX,XX @@ static void machvirt_init(MachineState *machine)
62
- * which is an architecture extension we don't implement yet.
93
if (has_ged && aarch64 && firmware_loaded && virt_is_acpi_enabled(vms)) {
63
- */
94
vms->acpi_dev = create_acpi_ged(vms);
64
-#define DO_3S_FP_GVEC(INSN,FUNC) \
95
} else {
65
- static void gen_##INSN##_3s(unsigned vece, uint32_t rd_ofs, \
96
- create_gpio(vms);
66
- uint32_t rn_ofs, uint32_t rm_ofs, \
97
+ create_gpio_devices(vms, VIRT_GPIO, sysmem);
67
- uint32_t oprsz, uint32_t maxsz) \
68
+#define WRAP_FP_GVEC(WRAPNAME, FPST, FUNC) \
69
+ static void WRAPNAME(unsigned vece, uint32_t rd_ofs, \
70
+ uint32_t rn_ofs, uint32_t rm_ofs, \
71
+ uint32_t oprsz, uint32_t maxsz) \
72
{ \
73
- TCGv_ptr fpst = fpstatus_ptr(FPST_STD); \
74
+ TCGv_ptr fpst = fpstatus_ptr(FPST); \
75
tcg_gen_gvec_3_ptr(rd_ofs, rn_ofs, rm_ofs, fpst, \
76
oprsz, maxsz, 0, FUNC); \
77
tcg_temp_free_ptr(fpst); \
78
- } \
79
+ }
80
+
81
+#define DO_3S_FP_GVEC(INSN,SFUNC,HFUNC) \
82
+ WRAP_FP_GVEC(gen_##INSN##_fp32_3s, FPST_STD, SFUNC) \
83
+ WRAP_FP_GVEC(gen_##INSN##_fp16_3s, FPST_STD_F16, HFUNC) \
84
static bool trans_##INSN##_fp_3s(DisasContext *s, arg_3same *a) \
85
{ \
86
if (a->size != 0) { \
87
- /* TODO fp16 support */ \
88
- return false; \
89
+ if (!dc_isar_feature(aa32_fp16_arith, s)) { \
90
+ return false; \
91
+ } \
92
+ return do_3same(s, a, gen_##INSN##_fp16_3s); \
93
} \
94
- return do_3same(s, a, gen_##INSN##_3s); \
95
+ return do_3same(s, a, gen_##INSN##_fp32_3s); \
96
}
98
}
97
99
98
100
/* connect powerdown request */
99
-DO_3S_FP_GVEC(VADD, gen_helper_gvec_fadd_s)
100
-DO_3S_FP_GVEC(VSUB, gen_helper_gvec_fsub_s)
101
-DO_3S_FP_GVEC(VABD, gen_helper_gvec_fabd_s)
102
-DO_3S_FP_GVEC(VMUL, gen_helper_gvec_fmul_s)
103
+DO_3S_FP_GVEC(VADD, gen_helper_gvec_fadd_s, gen_helper_gvec_fadd_h)
104
+DO_3S_FP_GVEC(VSUB, gen_helper_gvec_fsub_s, gen_helper_gvec_fsub_h)
105
+DO_3S_FP_GVEC(VABD, gen_helper_gvec_fabd_s, gen_helper_gvec_fabd_h)
106
+DO_3S_FP_GVEC(VMUL, gen_helper_gvec_fmul_s, gen_helper_gvec_fmul_h)
107
108
/*
109
* For all the functions using this macro, size == 1 means fp16,
110
--
101
--
111
2.20.1
102
2.20.1
112
103
113
104
diff view generated by jsdifflib
1
Convert the Neon floating-point VMUL, VMLA and VMLS to use gvec,
1
From: Maxim Uvarov <maxim.uvarov@linaro.org>
2
and use this to implement fp16 support.
3
2
3
Add secure pl061 for reset/power down machine from
4
the secure world (Arm Trusted Firmware). Connect it
5
with gpio-pwr driver.
6
7
Signed-off-by: Maxim Uvarov <maxim.uvarov@linaro.org>
8
Reviewed-by: Andrew Jones <drjones@redhat.com>
9
[PMM: Added mention of the new device to the documentation]
4
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
10
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
5
Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
6
Message-id: 20200828183354.27913-45-peter.maydell@linaro.org
7
---
11
---
8
target/arm/translate-neon.c.inc | 114 ++++++++++++++++----------------
12
docs/system/arm/virt.rst | 2 ++
9
1 file changed, 57 insertions(+), 57 deletions(-)
13
include/hw/arm/virt.h | 2 ++
14
hw/arm/virt.c | 56 +++++++++++++++++++++++++++++++++++++++-
15
hw/arm/Kconfig | 1 +
16
4 files changed, 60 insertions(+), 1 deletion(-)
10
17
11
diff --git a/target/arm/translate-neon.c.inc b/target/arm/translate-neon.c.inc
18
diff --git a/docs/system/arm/virt.rst b/docs/system/arm/virt.rst
12
index XXXXXXX..XXXXXXX 100644
19
index XXXXXXX..XXXXXXX 100644
13
--- a/target/arm/translate-neon.c.inc
20
--- a/docs/system/arm/virt.rst
14
+++ b/target/arm/translate-neon.c.inc
21
+++ b/docs/system/arm/virt.rst
15
@@ -XXX,XX +XXX,XX @@ static bool trans_VMLS_2sc(DisasContext *s, arg_2scalar *a)
22
@@ -XXX,XX +XXX,XX @@ The virt board supports:
16
return do_2scalar(s, a, opfn[a->size], accfn[a->size]);
23
- Secure-World-only devices if the CPU has TrustZone:
24
25
- A second PL011 UART
26
+ - A second PL061 GPIO controller, with GPIO lines for triggering
27
+ a system reset or system poweroff
28
- A secure flash memory
29
- 16MB of secure RAM
30
31
diff --git a/include/hw/arm/virt.h b/include/hw/arm/virt.h
32
index XXXXXXX..XXXXXXX 100644
33
--- a/include/hw/arm/virt.h
34
+++ b/include/hw/arm/virt.h
35
@@ -XXX,XX +XXX,XX @@ enum {
36
VIRT_GPIO,
37
VIRT_SECURE_UART,
38
VIRT_SECURE_MEM,
39
+ VIRT_SECURE_GPIO,
40
VIRT_PCDIMM_ACPI,
41
VIRT_ACPI_GED,
42
VIRT_NVDIMM_ACPI,
43
@@ -XXX,XX +XXX,XX @@ struct VirtMachineClass {
44
bool kvm_no_adjvtime;
45
bool no_kvm_steal_time;
46
bool acpi_expose_flash;
47
+ bool no_secure_gpio;
48
};
49
50
struct VirtMachineState {
51
diff --git a/hw/arm/virt.c b/hw/arm/virt.c
52
index XXXXXXX..XXXXXXX 100644
53
--- a/hw/arm/virt.c
54
+++ b/hw/arm/virt.c
55
@@ -XXX,XX +XXX,XX @@ static const MemMapEntry base_memmap[] = {
56
[VIRT_ACPI_GED] = { 0x09080000, ACPI_GED_EVT_SEL_LEN },
57
[VIRT_NVDIMM_ACPI] = { 0x09090000, NVDIMM_ACPI_IO_LEN},
58
[VIRT_PVTIME] = { 0x090a0000, 0x00010000 },
59
+ [VIRT_SECURE_GPIO] = { 0x090b0000, 0x00001000 },
60
[VIRT_MMIO] = { 0x0a000000, 0x00000200 },
61
/* ...repeating for a total of NUM_VIRTIO_TRANSPORTS, each of that size */
62
[VIRT_PLATFORM_BUS] = { 0x0c000000, 0x02000000 },
63
@@ -XXX,XX +XXX,XX @@ static void create_gpio_keys(const VirtMachineState *vms,
64
"gpios", phandle, 3, 0);
17
}
65
}
18
66
19
-/*
67
+#define SECURE_GPIO_POWEROFF 0
20
- * Rather than have a float-specific version of do_2scalar just for
68
+#define SECURE_GPIO_RESET 1
21
- * three insns, we wrap a NeonGenTwoSingleOpFn to turn it into
69
+
22
- * a NeonGenTwoOpFn.
70
+static void create_secure_gpio_pwr(const VirtMachineState *vms,
23
- */
71
+ DeviceState *pl061_dev,
24
-#define WRAP_FP_FN(WRAPNAME, FUNC) \
72
+ uint32_t phandle)
25
- static void WRAPNAME(TCGv_i32 rd, TCGv_i32 rn, TCGv_i32 rm) \
26
- { \
27
- TCGv_ptr fpstatus = fpstatus_ptr(FPST_STD); \
28
- FUNC(rd, rn, rm, fpstatus); \
29
- tcg_temp_free_ptr(fpstatus); \
30
+static bool do_2scalar_fp_vec(DisasContext *s, arg_2scalar *a,
31
+ gen_helper_gvec_3_ptr *fn)
32
+{
73
+{
33
+ /* Two registers and a scalar, using gvec */
74
+ DeviceState *gpio_pwr_dev;
34
+ int vec_size = a->q ? 16 : 8;
35
+ int rd_ofs = neon_reg_offset(a->vd, 0);
36
+ int rn_ofs = neon_reg_offset(a->vn, 0);
37
+ int rm_ofs;
38
+ int idx;
39
+ TCGv_ptr fpstatus;
40
+
75
+
41
+ if (!arm_dc_feature(s, ARM_FEATURE_NEON)) {
76
+ /* gpio-pwr */
42
+ return false;
77
+ gpio_pwr_dev = sysbus_create_simple("gpio-pwr", -1, NULL);
78
+
79
+ /* connect secure pl061 to gpio-pwr */
80
+ qdev_connect_gpio_out(pl061_dev, SECURE_GPIO_RESET,
81
+ qdev_get_gpio_in_named(gpio_pwr_dev, "reset", 0));
82
+ qdev_connect_gpio_out(pl061_dev, SECURE_GPIO_POWEROFF,
83
+ qdev_get_gpio_in_named(gpio_pwr_dev, "shutdown", 0));
84
+
85
+ qemu_fdt_add_subnode(vms->fdt, "/gpio-poweroff");
86
+ qemu_fdt_setprop_string(vms->fdt, "/gpio-poweroff", "compatible",
87
+ "gpio-poweroff");
88
+ qemu_fdt_setprop_cells(vms->fdt, "/gpio-poweroff",
89
+ "gpios", phandle, SECURE_GPIO_POWEROFF, 0);
90
+ qemu_fdt_setprop_string(vms->fdt, "/gpio-poweroff", "status", "disabled");
91
+ qemu_fdt_setprop_string(vms->fdt, "/gpio-poweroff", "secure-status",
92
+ "okay");
93
+
94
+ qemu_fdt_add_subnode(vms->fdt, "/gpio-restart");
95
+ qemu_fdt_setprop_string(vms->fdt, "/gpio-restart", "compatible",
96
+ "gpio-restart");
97
+ qemu_fdt_setprop_cells(vms->fdt, "/gpio-restart",
98
+ "gpios", phandle, SECURE_GPIO_RESET, 0);
99
+ qemu_fdt_setprop_string(vms->fdt, "/gpio-restart", "status", "disabled");
100
+ qemu_fdt_setprop_string(vms->fdt, "/gpio-restart", "secure-status",
101
+ "okay");
102
+}
103
+
104
static void create_gpio_devices(const VirtMachineState *vms, int gpio,
105
MemoryRegion *mem)
106
{
107
@@ -XXX,XX +XXX,XX @@ static void create_gpio_devices(const VirtMachineState *vms, int gpio,
108
qemu_fdt_setprop_string(vms->fdt, nodename, "clock-names", "apb_pclk");
109
qemu_fdt_setprop_cell(vms->fdt, nodename, "phandle", phandle);
110
111
+ if (gpio != VIRT_GPIO) {
112
+ /* Mark as not usable by the normal world */
113
+ qemu_fdt_setprop_string(vms->fdt, nodename, "status", "disabled");
114
+ qemu_fdt_setprop_string(vms->fdt, nodename, "secure-status", "okay");
115
+ }
116
g_free(nodename);
117
118
/* Child gpio devices */
119
- create_gpio_keys(vms, pl061_dev, phandle);
120
+ if (gpio == VIRT_GPIO) {
121
+ create_gpio_keys(vms, pl061_dev, phandle);
122
+ } else {
123
+ create_secure_gpio_pwr(vms, pl061_dev, phandle);
124
+ }
125
}
126
127
static void create_virtio_devices(const VirtMachineState *vms)
128
@@ -XXX,XX +XXX,XX @@ static void machvirt_init(MachineState *machine)
129
create_gpio_devices(vms, VIRT_GPIO, sysmem);
43
}
130
}
44
131
45
-WRAP_FP_FN(gen_VMUL_F_mul, gen_helper_vfp_muls)
132
+ if (vms->secure && !vmc->no_secure_gpio) {
46
-WRAP_FP_FN(gen_VMUL_F_add, gen_helper_vfp_adds)
133
+ create_gpio_devices(vms, VIRT_SECURE_GPIO, secure_sysmem);
47
-WRAP_FP_FN(gen_VMUL_F_sub, gen_helper_vfp_subs)
48
+ /* UNDEF accesses to D16-D31 if they don't exist. */
49
+ if (!dc_isar_feature(aa32_simd_r32, s) &&
50
+ ((a->vd | a->vn | a->vm) & 0x10)) {
51
+ return false;
52
+ }
53
54
-static bool trans_VMUL_F_2sc(DisasContext *s, arg_2scalar *a)
55
-{
56
- static NeonGenTwoOpFn * const opfn[] = {
57
- NULL,
58
- NULL, /* TODO: fp16 support */
59
- gen_VMUL_F_mul,
60
- NULL,
61
- };
62
+ if (!fn) {
63
+ /* Bad size (including size == 3, which is a different insn group) */
64
+ return false;
65
+ }
66
67
- return do_2scalar(s, a, opfn[a->size], NULL);
68
+ if (a->q && ((a->vd | a->vn) & 1)) {
69
+ return false;
70
+ }
134
+ }
71
+
135
+
72
+ if (!vfp_access_check(s)) {
136
/* connect powerdown request */
73
+ return true;
137
vms->powerdown_notifier.notify = virt_powerdown_req;
74
+ }
138
qemu_register_powerdown_notifier(&vms->powerdown_notifier);
139
@@ -XXX,XX +XXX,XX @@ DEFINE_VIRT_MACHINE_AS_LATEST(6, 0)
140
141
static void virt_machine_5_2_options(MachineClass *mc)
142
{
143
+ VirtMachineClass *vmc = VIRT_MACHINE_CLASS(OBJECT_CLASS(mc));
75
+
144
+
76
+ /* a->vm is M:Vm, which encodes both register and index */
145
virt_machine_6_0_options(mc);
77
+ idx = extract32(a->vm, a->size + 2, 2);
146
compat_props_add(mc->compat_props, hw_compat_5_2, hw_compat_5_2_len);
78
+ a->vm = extract32(a->vm, 0, a->size + 2);
147
+ vmc->no_secure_gpio = true;
79
+ rm_ofs = neon_reg_offset(a->vm, 0);
80
+
81
+ fpstatus = fpstatus_ptr(a->size == 1 ? FPST_STD_F16 : FPST_STD);
82
+ tcg_gen_gvec_3_ptr(rd_ofs, rn_ofs, rm_ofs, fpstatus,
83
+ vec_size, vec_size, idx, fn);
84
+ tcg_temp_free_ptr(fpstatus);
85
+ return true;
86
}
148
}
87
149
DEFINE_VIRT_MACHINE(5, 2)
88
-static bool trans_VMLA_F_2sc(DisasContext *s, arg_2scalar *a)
150
89
-{
151
diff --git a/hw/arm/Kconfig b/hw/arm/Kconfig
90
- static NeonGenTwoOpFn * const opfn[] = {
152
index XXXXXXX..XXXXXXX 100644
91
- NULL,
153
--- a/hw/arm/Kconfig
92
- NULL, /* TODO: fp16 support */
154
+++ b/hw/arm/Kconfig
93
- gen_VMUL_F_mul,
155
@@ -XXX,XX +XXX,XX @@ config ARM_VIRT
94
- NULL,
156
select PL011 # UART
95
- };
157
select PL031 # RTC
96
- static NeonGenTwoOpFn * const accfn[] = {
158
select PL061 # GPIO
97
- NULL,
159
+ select GPIO_PWR
98
- NULL, /* TODO: fp16 support */
160
select PLATFORM_BUS
99
- gen_VMUL_F_add,
161
select SMBIOS
100
- NULL,
162
select VIRTIO_MMIO
101
- };
102
+#define DO_VMUL_F_2sc(NAME, FUNC) \
103
+ static bool trans_##NAME##_F_2sc(DisasContext *s, arg_2scalar *a) \
104
+ { \
105
+ static gen_helper_gvec_3_ptr * const opfn[] = { \
106
+ NULL, \
107
+ gen_helper_##FUNC##_h, \
108
+ gen_helper_##FUNC##_s, \
109
+ NULL, \
110
+ }; \
111
+ if (a->size == MO_16 && !dc_isar_feature(aa32_fp16_arith, s)) { \
112
+ return false; \
113
+ } \
114
+ return do_2scalar_fp_vec(s, a, opfn[a->size]); \
115
+ }
116
117
- return do_2scalar(s, a, opfn[a->size], accfn[a->size]);
118
-}
119
-
120
-static bool trans_VMLS_F_2sc(DisasContext *s, arg_2scalar *a)
121
-{
122
- static NeonGenTwoOpFn * const opfn[] = {
123
- NULL,
124
- NULL, /* TODO: fp16 support */
125
- gen_VMUL_F_mul,
126
- NULL,
127
- };
128
- static NeonGenTwoOpFn * const accfn[] = {
129
- NULL,
130
- NULL, /* TODO: fp16 support */
131
- gen_VMUL_F_sub,
132
- NULL,
133
- };
134
-
135
- return do_2scalar(s, a, opfn[a->size], accfn[a->size]);
136
-}
137
+DO_VMUL_F_2sc(VMUL, gvec_fmul_idx)
138
+DO_VMUL_F_2sc(VMLA, gvec_fmla_nf_idx)
139
+DO_VMUL_F_2sc(VMLS, gvec_fmls_nf_idx)
140
141
WRAP_ENV_FN(gen_VQDMULH_16, gen_helper_neon_qdmulh_s16)
142
WRAP_ENV_FN(gen_VQDMULH_32, gen_helper_neon_qdmulh_s32)
143
--
163
--
144
2.20.1
164
2.20.1
145
165
146
166
diff view generated by jsdifflib
1
Convert the Neon VCVT with-specified-rounding-mode instructions
1
From: Hao Wu <wuhaotsh@google.com>
2
to gvec, and use this to implement fp16 support for them.
3
2
3
Fix potential overflow problem when calculating pwm_duty.
4
1. Ensure p->cmr and p->cnr to be from [0,65535], according to the
5
hardware specification.
6
2. Changed duty to uint32_t. However, since MAX_DUTY * (p->cmr+1)
7
can excceed UINT32_MAX, we convert them to uint64_t in computation
8
and converted them back to uint32_t.
9
(duty is guaranteed to be <= MAX_DUTY so it won't overflow.)
10
11
Fixes: CID 1442342
12
Suggested-by: Peter Maydell <peter.maydell@linaro.org>
13
Reviewed-by: Doug Evans <dje@google.com>
14
Signed-off-by: Hao Wu <wuhaotsh@google.com>
15
Message-id: 20210127011142.2122790-1-wuhaotsh@google.com
16
Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
4
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
17
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
5
Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
6
Message-id: 20200828183354.27913-40-peter.maydell@linaro.org
7
---
18
---
8
target/arm/helper.h | 5 ++
19
hw/misc/npcm7xx_pwm.c | 23 +++++++++++++++++++----
9
target/arm/vec_helper.c | 23 +++++++
20
tests/qtest/npcm7xx_pwm-test.c | 4 ++--
10
target/arm/translate-neon.c.inc | 105 ++++++++++++--------------------
21
2 files changed, 21 insertions(+), 6 deletions(-)
11
3 files changed, 66 insertions(+), 67 deletions(-)
12
22
13
diff --git a/target/arm/helper.h b/target/arm/helper.h
23
diff --git a/hw/misc/npcm7xx_pwm.c b/hw/misc/npcm7xx_pwm.c
14
index XXXXXXX..XXXXXXX 100644
24
index XXXXXXX..XXXXXXX 100644
15
--- a/target/arm/helper.h
25
--- a/hw/misc/npcm7xx_pwm.c
16
+++ b/target/arm/helper.h
26
+++ b/hw/misc/npcm7xx_pwm.c
17
@@ -XXX,XX +XXX,XX @@ DEF_HELPER_FLAGS_4(gvec_vcvt_uh, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, i32)
27
@@ -XXX,XX +XXX,XX @@ REG32(NPCM7XX_PWM_PWDR3, 0x50);
18
DEF_HELPER_FLAGS_4(gvec_vcvt_hs, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, i32)
28
#define NPCM7XX_CH_INV BIT(2)
19
DEF_HELPER_FLAGS_4(gvec_vcvt_hu, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, i32)
29
#define NPCM7XX_CH_MOD BIT(3)
20
30
21
+DEF_HELPER_FLAGS_4(gvec_vcvt_rm_ss, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, i32)
31
+#define NPCM7XX_MAX_CMR 65535
22
+DEF_HELPER_FLAGS_4(gvec_vcvt_rm_us, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, i32)
32
+#define NPCM7XX_MAX_CNR 65535
23
+DEF_HELPER_FLAGS_4(gvec_vcvt_rm_sh, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, i32)
24
+DEF_HELPER_FLAGS_4(gvec_vcvt_rm_uh, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, i32)
25
+
33
+
26
DEF_HELPER_FLAGS_4(gvec_frecpe_h, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, i32)
34
/* Offset of each PWM channel's prescaler in the PPR register. */
27
DEF_HELPER_FLAGS_4(gvec_frecpe_s, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, i32)
35
static const int npcm7xx_ppr_base[] = { 0, 0, 8, 8 };
28
DEF_HELPER_FLAGS_4(gvec_frecpe_d, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, i32)
36
/* Offset of each PWM channel's clock selector in the CSR register. */
29
diff --git a/target/arm/vec_helper.c b/target/arm/vec_helper.c
37
@@ -XXX,XX +XXX,XX @@ static uint32_t npcm7xx_pwm_calculate_freq(NPCM7xxPWM *p)
38
39
static uint32_t npcm7xx_pwm_calculate_duty(NPCM7xxPWM *p)
40
{
41
- uint64_t duty;
42
+ uint32_t duty;
43
44
if (p->running) {
45
if (p->cnr == 0) {
46
@@ -XXX,XX +XXX,XX @@ static uint32_t npcm7xx_pwm_calculate_duty(NPCM7xxPWM *p)
47
} else if (p->cmr >= p->cnr) {
48
duty = NPCM7XX_PWM_MAX_DUTY;
49
} else {
50
- duty = NPCM7XX_PWM_MAX_DUTY * (p->cmr + 1) / (p->cnr + 1);
51
+ duty = (uint64_t)NPCM7XX_PWM_MAX_DUTY * (p->cmr + 1) / (p->cnr + 1);
52
}
53
} else {
54
duty = 0;
55
@@ -XXX,XX +XXX,XX @@ static void npcm7xx_pwm_write(void *opaque, hwaddr offset,
56
case A_NPCM7XX_PWM_CNR2:
57
case A_NPCM7XX_PWM_CNR3:
58
p = &s->pwm[npcm7xx_cnr_index(offset)];
59
- p->cnr = value;
60
+ if (value > NPCM7XX_MAX_CNR) {
61
+ qemu_log_mask(LOG_GUEST_ERROR,
62
+ "%s: invalid cnr value: %u", __func__, value);
63
+ p->cnr = NPCM7XX_MAX_CNR;
64
+ } else {
65
+ p->cnr = value;
66
+ }
67
npcm7xx_pwm_update_output(p);
68
break;
69
70
@@ -XXX,XX +XXX,XX @@ static void npcm7xx_pwm_write(void *opaque, hwaddr offset,
71
case A_NPCM7XX_PWM_CMR2:
72
case A_NPCM7XX_PWM_CMR3:
73
p = &s->pwm[npcm7xx_cmr_index(offset)];
74
- p->cmr = value;
75
+ if (value > NPCM7XX_MAX_CMR) {
76
+ qemu_log_mask(LOG_GUEST_ERROR,
77
+ "%s: invalid cmr value: %u", __func__, value);
78
+ p->cmr = NPCM7XX_MAX_CMR;
79
+ } else {
80
+ p->cmr = value;
81
+ }
82
npcm7xx_pwm_update_output(p);
83
break;
84
85
diff --git a/tests/qtest/npcm7xx_pwm-test.c b/tests/qtest/npcm7xx_pwm-test.c
30
index XXXXXXX..XXXXXXX 100644
86
index XXXXXXX..XXXXXXX 100644
31
--- a/target/arm/vec_helper.c
87
--- a/tests/qtest/npcm7xx_pwm-test.c
32
+++ b/target/arm/vec_helper.c
88
+++ b/tests/qtest/npcm7xx_pwm-test.c
33
@@ -XXX,XX +XXX,XX @@ DO_VCVT_FIXED(gvec_vcvt_hs, helper_vfp_toshh_round_to_zero, uint16_t)
89
@@ -XXX,XX +XXX,XX @@ static uint64_t pwm_compute_freq(QTestState *qts, uint32_t ppr, uint32_t csr,
34
DO_VCVT_FIXED(gvec_vcvt_hu, helper_vfp_touhh_round_to_zero, uint16_t)
90
35
91
static uint64_t pwm_compute_duty(uint32_t cnr, uint32_t cmr, bool inverted)
36
#undef DO_VCVT_FIXED
92
{
37
+
93
- uint64_t duty;
38
+#define DO_VCVT_RMODE(NAME, FUNC, TYPE) \
94
+ uint32_t duty;
39
+ void HELPER(NAME)(void *vd, void *vn, void *stat, uint32_t desc) \
95
40
+ { \
96
if (cnr == 0) {
41
+ float_status *fpst = stat; \
97
/* PWM is stopped. */
42
+ intptr_t i, oprsz = simd_oprsz(desc); \
98
@@ -XXX,XX +XXX,XX @@ static uint64_t pwm_compute_duty(uint32_t cnr, uint32_t cmr, bool inverted)
43
+ uint32_t rmode = simd_data(desc); \
99
} else if (cmr >= cnr) {
44
+ uint32_t prev_rmode = get_float_rounding_mode(fpst); \
100
duty = MAX_DUTY;
45
+ TYPE *d = vd, *n = vn; \
101
} else {
46
+ set_float_rounding_mode(rmode, fpst); \
102
- duty = MAX_DUTY * (cmr + 1) / (cnr + 1);
47
+ for (i = 0; i < oprsz / sizeof(TYPE); i++) { \
103
+ duty = (uint64_t)MAX_DUTY * (cmr + 1) / (cnr + 1);
48
+ d[i] = FUNC(n[i], 0, fpst); \
49
+ } \
50
+ set_float_rounding_mode(prev_rmode, fpst); \
51
+ clear_tail(d, oprsz, simd_maxsz(desc)); \
52
+ }
53
+
54
+DO_VCVT_RMODE(gvec_vcvt_rm_ss, helper_vfp_tosls, uint32_t)
55
+DO_VCVT_RMODE(gvec_vcvt_rm_us, helper_vfp_touls, uint32_t)
56
+DO_VCVT_RMODE(gvec_vcvt_rm_sh, helper_vfp_toshh, uint16_t)
57
+DO_VCVT_RMODE(gvec_vcvt_rm_uh, helper_vfp_touhh, uint16_t)
58
+
59
+#undef DO_VCVT_RMODE
60
diff --git a/target/arm/translate-neon.c.inc b/target/arm/translate-neon.c.inc
61
index XXXXXXX..XXXXXXX 100644
62
--- a/target/arm/translate-neon.c.inc
63
+++ b/target/arm/translate-neon.c.inc
64
@@ -XXX,XX +XXX,XX @@ DO_VRINT(VRINTZ, FPROUNDING_ZERO)
65
DO_VRINT(VRINTM, FPROUNDING_NEGINF)
66
DO_VRINT(VRINTP, FPROUNDING_POSINF)
67
68
-static bool do_vcvt(DisasContext *s, arg_2misc *a, int rmode, bool is_signed)
69
-{
70
- /*
71
- * Handle a VCVT* operation by iterating 32 bits at a time,
72
- * with a specified rounding mode in operation.
73
- */
74
- int pass;
75
- TCGv_ptr fpst;
76
- TCGv_i32 tcg_rmode, tcg_shift;
77
-
78
- if (!arm_dc_feature(s, ARM_FEATURE_NEON) ||
79
- !arm_dc_feature(s, ARM_FEATURE_V8)) {
80
- return false;
81
+#define DO_VEC_RMODE(INSN, RMODE, OP) \
82
+ static void gen_##INSN(unsigned vece, uint32_t rd_ofs, \
83
+ uint32_t rm_ofs, \
84
+ uint32_t oprsz, uint32_t maxsz) \
85
+ { \
86
+ static gen_helper_gvec_2_ptr * const fns[4] = { \
87
+ NULL, \
88
+ gen_helper_gvec_##OP##h, \
89
+ gen_helper_gvec_##OP##s, \
90
+ NULL, \
91
+ }; \
92
+ TCGv_ptr fpst; \
93
+ fpst = fpstatus_ptr(vece == 1 ? FPST_STD_F16 : FPST_STD); \
94
+ tcg_gen_gvec_2_ptr(rd_ofs, rm_ofs, fpst, oprsz, maxsz, \
95
+ arm_rmode_to_sf(RMODE), fns[vece]); \
96
+ tcg_temp_free_ptr(fpst); \
97
+ } \
98
+ static bool trans_##INSN(DisasContext *s, arg_2misc *a) \
99
+ { \
100
+ if (!arm_dc_feature(s, ARM_FEATURE_V8)) { \
101
+ return false; \
102
+ } \
103
+ if (a->size == MO_16) { \
104
+ if (!dc_isar_feature(aa32_fp16_arith, s)) { \
105
+ return false; \
106
+ } \
107
+ } else if (a->size != MO_32) { \
108
+ return false; \
109
+ } \
110
+ return do_2misc_vec(s, a, gen_##INSN); \
111
}
104
}
112
105
113
- /* UNDEF accesses to D16-D31 if they don't exist. */
106
if (inverted) {
114
- if (!dc_isar_feature(aa32_simd_r32, s) &&
115
- ((a->vd | a->vm) & 0x10)) {
116
- return false;
117
- }
118
-
119
- if (a->size != 2) {
120
- /* TODO: FP16 will be the size == 1 case */
121
- return false;
122
- }
123
-
124
- if ((a->vd | a->vm) & a->q) {
125
- return false;
126
- }
127
-
128
- if (!vfp_access_check(s)) {
129
- return true;
130
- }
131
-
132
- fpst = fpstatus_ptr(FPST_STD);
133
- tcg_shift = tcg_const_i32(0);
134
- tcg_rmode = tcg_const_i32(arm_rmode_to_sf(rmode));
135
- gen_helper_set_neon_rmode(tcg_rmode, tcg_rmode, cpu_env);
136
- for (pass = 0; pass < (a->q ? 4 : 2); pass++) {
137
- TCGv_i32 tmp = neon_load_reg(a->vm, pass);
138
- if (is_signed) {
139
- gen_helper_vfp_tosls(tmp, tmp, tcg_shift, fpst);
140
- } else {
141
- gen_helper_vfp_touls(tmp, tmp, tcg_shift, fpst);
142
- }
143
- neon_store_reg(a->vd, pass, tmp);
144
- }
145
- gen_helper_set_neon_rmode(tcg_rmode, tcg_rmode, cpu_env);
146
- tcg_temp_free_i32(tcg_rmode);
147
- tcg_temp_free_i32(tcg_shift);
148
- tcg_temp_free_ptr(fpst);
149
-
150
- return true;
151
-}
152
-
153
-#define DO_VCVT(INSN, RMODE, SIGNED) \
154
- static bool trans_##INSN(DisasContext *s, arg_2misc *a) \
155
- { \
156
- return do_vcvt(s, a, RMODE, SIGNED); \
157
- }
158
-
159
-DO_VCVT(VCVTAU, FPROUNDING_TIEAWAY, false)
160
-DO_VCVT(VCVTAS, FPROUNDING_TIEAWAY, true)
161
-DO_VCVT(VCVTNU, FPROUNDING_TIEEVEN, false)
162
-DO_VCVT(VCVTNS, FPROUNDING_TIEEVEN, true)
163
-DO_VCVT(VCVTPU, FPROUNDING_POSINF, false)
164
-DO_VCVT(VCVTPS, FPROUNDING_POSINF, true)
165
-DO_VCVT(VCVTMU, FPROUNDING_NEGINF, false)
166
-DO_VCVT(VCVTMS, FPROUNDING_NEGINF, true)
167
+DO_VEC_RMODE(VCVTAU, FPROUNDING_TIEAWAY, vcvt_rm_u)
168
+DO_VEC_RMODE(VCVTAS, FPROUNDING_TIEAWAY, vcvt_rm_s)
169
+DO_VEC_RMODE(VCVTNU, FPROUNDING_TIEEVEN, vcvt_rm_u)
170
+DO_VEC_RMODE(VCVTNS, FPROUNDING_TIEEVEN, vcvt_rm_s)
171
+DO_VEC_RMODE(VCVTPU, FPROUNDING_POSINF, vcvt_rm_u)
172
+DO_VEC_RMODE(VCVTPS, FPROUNDING_POSINF, vcvt_rm_s)
173
+DO_VEC_RMODE(VCVTMU, FPROUNDING_NEGINF, vcvt_rm_u)
174
+DO_VEC_RMODE(VCVTMS, FPROUNDING_NEGINF, vcvt_rm_s)
175
176
static bool trans_VSWP(DisasContext *s, arg_2misc *a)
177
{
178
--
107
--
179
2.20.1
108
2.20.1
180
109
181
110
diff view generated by jsdifflib
1
Add gvec helpers for doing Neon-style indexed non-fused fp
1
From: Philippe Mathieu-Daudé <f4bug@amsat.org>
2
multiply-and-accumulate operations.
3
2
3
cpu_get_phys_page_debug() uses 'DATA LOAD' MMU access type.
4
5
Signed-off-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
6
Message-id: 20210127232822.3530782-1-f4bug@amsat.org
7
Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
4
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
8
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
5
Message-id: 20200828183354.27913-44-peter.maydell@linaro.org
6
---
9
---
7
target/arm/helper.h | 10 ++++++++++
10
target/arm/helper.c | 2 +-
8
target/arm/vec_helper.c | 27 ++++++++++++++++++++++-----
11
1 file changed, 1 insertion(+), 1 deletion(-)
9
2 files changed, 32 insertions(+), 5 deletions(-)
10
12
11
diff --git a/target/arm/helper.h b/target/arm/helper.h
13
diff --git a/target/arm/helper.c b/target/arm/helper.c
12
index XXXXXXX..XXXXXXX 100644
14
index XXXXXXX..XXXXXXX 100644
13
--- a/target/arm/helper.h
15
--- a/target/arm/helper.c
14
+++ b/target/arm/helper.h
16
+++ b/target/arm/helper.c
15
@@ -XXX,XX +XXX,XX @@ DEF_HELPER_FLAGS_5(gvec_fmul_idx_s, TCG_CALL_NO_RWG,
17
@@ -XXX,XX +XXX,XX @@ hwaddr arm_cpu_get_phys_page_attrs_debug(CPUState *cs, vaddr addr,
16
DEF_HELPER_FLAGS_5(gvec_fmul_idx_d, TCG_CALL_NO_RWG,
18
17
void, ptr, ptr, ptr, ptr, i32)
19
*attrs = (MemTxAttrs) {};
18
20
19
+DEF_HELPER_FLAGS_5(gvec_fmla_nf_idx_h, TCG_CALL_NO_RWG,
21
- ret = get_phys_addr(env, addr, 0, mmu_idx, &phys_addr,
20
+ void, ptr, ptr, ptr, ptr, i32)
22
+ ret = get_phys_addr(env, addr, MMU_DATA_LOAD, mmu_idx, &phys_addr,
21
+DEF_HELPER_FLAGS_5(gvec_fmla_nf_idx_s, TCG_CALL_NO_RWG,
23
attrs, &prot, &page_size, &fi, &cacheattrs);
22
+ void, ptr, ptr, ptr, ptr, i32)
24
23
+
25
if (ret) {
24
+DEF_HELPER_FLAGS_5(gvec_fmls_nf_idx_h, TCG_CALL_NO_RWG,
25
+ void, ptr, ptr, ptr, ptr, i32)
26
+DEF_HELPER_FLAGS_5(gvec_fmls_nf_idx_s, TCG_CALL_NO_RWG,
27
+ void, ptr, ptr, ptr, ptr, i32)
28
+
29
DEF_HELPER_FLAGS_6(gvec_fmla_idx_h, TCG_CALL_NO_RWG,
30
void, ptr, ptr, ptr, ptr, ptr, i32)
31
DEF_HELPER_FLAGS_6(gvec_fmla_idx_s, TCG_CALL_NO_RWG,
32
diff --git a/target/arm/vec_helper.c b/target/arm/vec_helper.c
33
index XXXXXXX..XXXXXXX 100644
34
--- a/target/arm/vec_helper.c
35
+++ b/target/arm/vec_helper.c
36
@@ -XXX,XX +XXX,XX @@ DO_MLA_IDX(gvec_mls_idx_d, uint64_t, -, )
37
38
#undef DO_MLA_IDX
39
40
-#define DO_FMUL_IDX(NAME, TYPE, H) \
41
+#define DO_FMUL_IDX(NAME, ADD, TYPE, H) \
42
void HELPER(NAME)(void *vd, void *vn, void *vm, void *stat, uint32_t desc) \
43
{ \
44
intptr_t i, j, oprsz = simd_oprsz(desc); \
45
@@ -XXX,XX +XXX,XX @@ void HELPER(NAME)(void *vd, void *vn, void *vm, void *stat, uint32_t desc) \
46
for (i = 0; i < oprsz / sizeof(TYPE); i += segment) { \
47
TYPE mm = m[H(i + idx)]; \
48
for (j = 0; j < segment; j++) { \
49
- d[i + j] = TYPE##_mul(n[i + j], mm, stat); \
50
+ d[i + j] = TYPE##_##ADD(d[i + j], \
51
+ TYPE##_mul(n[i + j], mm, stat), stat); \
52
} \
53
} \
54
clear_tail(d, oprsz, simd_maxsz(desc)); \
55
}
56
57
-DO_FMUL_IDX(gvec_fmul_idx_h, float16, H2)
58
-DO_FMUL_IDX(gvec_fmul_idx_s, float32, H4)
59
-DO_FMUL_IDX(gvec_fmul_idx_d, float64, )
60
+#define float16_nop(N, M, S) (M)
61
+#define float32_nop(N, M, S) (M)
62
+#define float64_nop(N, M, S) (M)
63
64
+DO_FMUL_IDX(gvec_fmul_idx_h, nop, float16, H2)
65
+DO_FMUL_IDX(gvec_fmul_idx_s, nop, float32, H4)
66
+DO_FMUL_IDX(gvec_fmul_idx_d, nop, float64, )
67
+
68
+/*
69
+ * Non-fused multiply-accumulate operations, for Neon. NB that unlike
70
+ * the fused ops below they assume accumulate both from and into Vd.
71
+ */
72
+DO_FMUL_IDX(gvec_fmla_nf_idx_h, add, float16, H2)
73
+DO_FMUL_IDX(gvec_fmla_nf_idx_s, add, float32, H4)
74
+DO_FMUL_IDX(gvec_fmls_nf_idx_h, sub, float16, H2)
75
+DO_FMUL_IDX(gvec_fmls_nf_idx_s, sub, float32, H4)
76
+
77
+#undef float16_nop
78
+#undef float32_nop
79
+#undef float64_nop
80
#undef DO_FMUL_IDX
81
82
#define DO_FMLA_IDX(NAME, TYPE, H) \
83
--
26
--
84
2.20.1
27
2.20.1
85
28
86
29
diff view generated by jsdifflib
1
In the gvec helper functions for indexed operations, for AArch32
1
Move the preadv availability check to meson.build. This is what we
2
Neon the oprsz (total size of the vector) can be less than 16 bytes
2
want to be doing for host-OS-feature-checks anyway, but it also fixes
3
if the operation is on a D reg. Since the inner loop in these
3
a problem with building for macOS with the most recent XCode SDK on a
4
helpers always goes from 0 to segment, we must clamp it based
4
Catalina host.
5
on oprsz to avoid processing a full 16 byte segment when asked to
5
6
handle an 8 byte wide vector.
6
On that configuration, 'preadv()' is provided as a weak symbol, so
7
that programs can be built with optional support for it and make a
8
runtime availability check to see whether the preadv() they have is a
9
working one or one which they must not call because it will
10
runtime-assert. QEMU's configure test passes (unless you're building
11
with --enable-werror) because the test program using preadv()
12
compiles, but then QEMU crashes at runtime when preadv() is called,
13
with errors like:
14
15
dyld: lazy symbol binding failed: Symbol not found: _preadv
16
Referenced from: /Users/pm215/src/qemu/./build/x86/tests/test-replication
17
Expected in: /usr/lib/libSystem.B.dylib
18
19
dyld: Symbol not found: _preadv
20
Referenced from: /Users/pm215/src/qemu/./build/x86/tests/test-replication
21
Expected in: /usr/lib/libSystem.B.dylib
22
23
Meson's own function availability check has a special case for macOS
24
which adds '-Wl,-no_weak_imports' to the compiler flags, which forces
25
the test to require the real function, not the macOS-version-too-old
26
stub.
27
28
So this commit fixes the bug where macOS builds on Catalina currently
29
require --disable-werror.
7
30
8
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
31
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
9
Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
32
Acked-by: Paolo Bonzini <pbonzini@redhat.com>
10
Message-id: 20200828183354.27913-43-peter.maydell@linaro.org
33
Reviewed-by: Philippe Mathieu-Daudé <philmd@redhat.com>
34
Message-id: 20210126155846.17109-1-peter.maydell@linaro.org
11
---
35
---
12
target/arm/vec_helper.c | 12 ++++++++----
36
configure | 16 ----------------
13
1 file changed, 8 insertions(+), 4 deletions(-)
37
meson.build | 4 +++-
38
2 files changed, 3 insertions(+), 17 deletions(-)
14
39
15
diff --git a/target/arm/vec_helper.c b/target/arm/vec_helper.c
40
diff --git a/configure b/configure
41
index XXXXXXX..XXXXXXX 100755
42
--- a/configure
43
+++ b/configure
44
@@ -XXX,XX +XXX,XX @@ if compile_prog "" "" ; then
45
iovec=yes
46
fi
47
48
-##########################################
49
-# preadv probe
50
-cat > $TMPC <<EOF
51
-#include <sys/types.h>
52
-#include <sys/uio.h>
53
-#include <unistd.h>
54
-int main(void) { return preadv(0, 0, 0, 0); }
55
-EOF
56
-preadv=no
57
-if compile_prog "" "" ; then
58
- preadv=yes
59
-fi
60
-
61
##########################################
62
# fdt probe
63
64
@@ -XXX,XX +XXX,XX @@ fi
65
if test "$iovec" = "yes" ; then
66
echo "CONFIG_IOVEC=y" >> $config_host_mak
67
fi
68
-if test "$preadv" = "yes" ; then
69
- echo "CONFIG_PREADV=y" >> $config_host_mak
70
-fi
71
if test "$membarrier" = "yes" ; then
72
echo "CONFIG_MEMBARRIER=y" >> $config_host_mak
73
fi
74
diff --git a/meson.build b/meson.build
16
index XXXXXXX..XXXXXXX 100644
75
index XXXXXXX..XXXXXXX 100644
17
--- a/target/arm/vec_helper.c
76
--- a/meson.build
18
+++ b/target/arm/vec_helper.c
77
+++ b/meson.build
19
@@ -XXX,XX +XXX,XX @@ DO_MULADD(gvec_vfms_s, float32_mulsub_f, float32)
78
@@ -XXX,XX +XXX,XX @@ config_host_data.set('HAVE_PTY_H', cc.has_header('pty.h'))
20
#define DO_MUL_IDX(NAME, TYPE, H) \
79
config_host_data.set('HAVE_SYS_IOCCOM_H', cc.has_header('sys/ioccom.h'))
21
void HELPER(NAME)(void *vd, void *vn, void *vm, uint32_t desc) \
80
config_host_data.set('HAVE_SYS_KCOV_H', cc.has_header('sys/kcov.h'))
22
{ \
81
23
- intptr_t i, j, oprsz = simd_oprsz(desc), segment = 16 / sizeof(TYPE); \
82
+config_host_data.set('CONFIG_PREADV', cc.has_function('preadv', prefix: '#include <sys/uio.h>'))
24
+ intptr_t i, j, oprsz = simd_oprsz(desc); \
83
+
25
+ intptr_t segment = MIN(16, oprsz) / sizeof(TYPE); \
84
ignored = ['CONFIG_QEMU_INTERP_PREFIX'] # actually per-target
26
intptr_t idx = simd_data(desc); \
85
arrays = ['CONFIG_AUDIO_DRIVERS', 'CONFIG_BDRV_RW_WHITELIST', 'CONFIG_BDRV_RO_WHITELIST']
27
TYPE *d = vd, *n = vn, *m = vm; \
86
strings = ['HOST_DSOSUF', 'CONFIG_IASL']
28
for (i = 0; i < oprsz / sizeof(TYPE); i += segment) { \
87
@@ -XXX,XX +XXX,XX @@ summary_info += {'PIE': get_option('b_pie')}
29
@@ -XXX,XX +XXX,XX @@ DO_MUL_IDX(gvec_mul_idx_d, uint64_t, )
88
summary_info += {'static build': config_host.has_key('CONFIG_STATIC')}
30
#define DO_MLA_IDX(NAME, TYPE, OP, H) \
89
summary_info += {'malloc trim support': has_malloc_trim}
31
void HELPER(NAME)(void *vd, void *vn, void *vm, void *va, uint32_t desc) \
90
summary_info += {'membarrier': config_host.has_key('CONFIG_MEMBARRIER')}
32
{ \
91
-summary_info += {'preadv support': config_host.has_key('CONFIG_PREADV')}
33
- intptr_t i, j, oprsz = simd_oprsz(desc), segment = 16 / sizeof(TYPE); \
92
+summary_info += {'preadv support': config_host_data.get('CONFIG_PREADV')}
34
+ intptr_t i, j, oprsz = simd_oprsz(desc); \
93
summary_info += {'fdatasync': config_host.has_key('CONFIG_FDATASYNC')}
35
+ intptr_t segment = MIN(16, oprsz) / sizeof(TYPE); \
94
summary_info += {'madvise': config_host.has_key('CONFIG_MADVISE')}
36
intptr_t idx = simd_data(desc); \
95
summary_info += {'posix_madvise': config_host.has_key('CONFIG_POSIX_MADVISE')}
37
TYPE *d = vd, *n = vn, *m = vm, *a = va; \
38
for (i = 0; i < oprsz / sizeof(TYPE); i += segment) { \
39
@@ -XXX,XX +XXX,XX @@ DO_MLA_IDX(gvec_mls_idx_d, uint64_t, -, )
40
#define DO_FMUL_IDX(NAME, TYPE, H) \
41
void HELPER(NAME)(void *vd, void *vn, void *vm, void *stat, uint32_t desc) \
42
{ \
43
- intptr_t i, j, oprsz = simd_oprsz(desc), segment = 16 / sizeof(TYPE); \
44
+ intptr_t i, j, oprsz = simd_oprsz(desc); \
45
+ intptr_t segment = MIN(16, oprsz) / sizeof(TYPE); \
46
intptr_t idx = simd_data(desc); \
47
TYPE *d = vd, *n = vn, *m = vm; \
48
for (i = 0; i < oprsz / sizeof(TYPE); i += segment) { \
49
@@ -XXX,XX +XXX,XX @@ DO_FMUL_IDX(gvec_fmul_idx_d, float64, )
50
void HELPER(NAME)(void *vd, void *vn, void *vm, void *va, \
51
void *stat, uint32_t desc) \
52
{ \
53
- intptr_t i, j, oprsz = simd_oprsz(desc), segment = 16 / sizeof(TYPE); \
54
+ intptr_t i, j, oprsz = simd_oprsz(desc); \
55
+ intptr_t segment = MIN(16, oprsz) / sizeof(TYPE); \
56
TYPE op1_neg = extract32(desc, SIMD_DATA_SHIFT, 1); \
57
intptr_t idx = desc >> (SIMD_DATA_SHIFT + 1); \
58
TYPE *d = vd, *n = vn, *m = vm, *a = va; \
59
--
96
--
60
2.20.1
97
2.20.1
61
98
62
99
diff view generated by jsdifflib
1
Convert the Neon VRINTX insn to use gvec, and use this to implement
1
From: Joelle van Dyne <j@getutm.app>
2
fp16 support for it.
3
2
3
The iOS toolchain does not use the host prefix naming convention. So we
4
need to enable cross-compile options while allowing the PREFIX to be
5
blank.
6
7
Reviewed-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
8
Signed-off-by: Joelle van Dyne <j@getutm.app>
9
Message-id: 20210126012457.39046-3-j@getutm.app
4
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
10
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
5
Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
6
Message-id: 20200828183354.27913-42-peter.maydell@linaro.org
7
---
11
---
8
target/arm/helper.h | 3 +++
12
configure | 6 ++++--
9
target/arm/vec_helper.c | 3 +++
13
1 file changed, 4 insertions(+), 2 deletions(-)
10
target/arm/translate-neon.c.inc | 45 +++------------------------------
11
3 files changed, 9 insertions(+), 42 deletions(-)
12
14
13
diff --git a/target/arm/helper.h b/target/arm/helper.h
15
diff --git a/configure b/configure
14
index XXXXXXX..XXXXXXX 100644
16
index XXXXXXX..XXXXXXX 100755
15
--- a/target/arm/helper.h
17
--- a/configure
16
+++ b/target/arm/helper.h
18
+++ b/configure
17
@@ -XXX,XX +XXX,XX @@ DEF_HELPER_FLAGS_4(gvec_vcvt_rm_uh, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, i32)
19
@@ -XXX,XX +XXX,XX @@ cpu=""
18
DEF_HELPER_FLAGS_4(gvec_vrint_rm_h, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, i32)
20
iasl="iasl"
19
DEF_HELPER_FLAGS_4(gvec_vrint_rm_s, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, i32)
21
interp_prefix="/usr/gnemul/qemu-%M"
20
22
static="no"
21
+DEF_HELPER_FLAGS_4(gvec_vrintx_h, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, i32)
23
+cross_compile="no"
22
+DEF_HELPER_FLAGS_4(gvec_vrintx_s, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, i32)
24
cross_prefix=""
23
+
25
audio_drv_list=""
24
DEF_HELPER_FLAGS_4(gvec_frecpe_h, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, i32)
26
block_drv_rw_whitelist=""
25
DEF_HELPER_FLAGS_4(gvec_frecpe_s, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, i32)
27
@@ -XXX,XX +XXX,XX @@ for opt do
26
DEF_HELPER_FLAGS_4(gvec_frecpe_d, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, i32)
28
optarg=$(expr "x$opt" : 'x[^=]*=\(.*\)')
27
diff --git a/target/arm/vec_helper.c b/target/arm/vec_helper.c
29
case "$opt" in
28
index XXXXXXX..XXXXXXX 100644
30
--cross-prefix=*) cross_prefix="$optarg"
29
--- a/target/arm/vec_helper.c
31
+ cross_compile="yes"
30
+++ b/target/arm/vec_helper.c
32
;;
31
@@ -XXX,XX +XXX,XX @@ DO_2OP(gvec_frsqrte_h, helper_rsqrte_f16, float16)
33
--cc=*) CC="$optarg"
32
DO_2OP(gvec_frsqrte_s, helper_rsqrte_f32, float32)
34
;;
33
DO_2OP(gvec_frsqrte_d, helper_rsqrte_f64, float64)
35
@@ -XXX,XX +XXX,XX @@ $(echo Deprecated targets: $deprecated_targets_list | \
34
36
--target-list-exclude=LIST exclude a set of targets from the default target-list
35
+DO_2OP(gvec_vrintx_h, float16_round_to_int, float16)
37
36
+DO_2OP(gvec_vrintx_s, float32_round_to_int, float32)
38
Advanced options (experts only):
37
+
39
- --cross-prefix=PREFIX use PREFIX for compile tools [$cross_prefix]
38
DO_2OP(gvec_sitos, helper_vfp_sitos, int32_t)
40
+ --cross-prefix=PREFIX use PREFIX for compile tools, PREFIX can be blank [$cross_prefix]
39
DO_2OP(gvec_uitos, helper_vfp_uitos, uint32_t)
41
--cc=CC use C compiler CC [$cc]
40
DO_2OP(gvec_tosizs, helper_vfp_tosizs, float32)
42
--iasl=IASL use ACPI compiler IASL [$iasl]
41
diff --git a/target/arm/translate-neon.c.inc b/target/arm/translate-neon.c.inc
43
--host-cc=CC use C compiler CC [$host_cc] for code run at
42
index XXXXXXX..XXXXXXX 100644
44
@@ -XXX,XX +XXX,XX @@ if has $sdl2_config; then
43
--- a/target/arm/translate-neon.c.inc
45
fi
44
+++ b/target/arm/translate-neon.c.inc
46
echo "strip = [$(meson_quote $strip)]" >> $cross
45
@@ -XXX,XX +XXX,XX @@ static bool trans_VQNEG(DisasContext *s, arg_2misc *a)
47
echo "windres = [$(meson_quote $windres)]" >> $cross
46
return do_2misc(s, a, fn[a->size]);
48
-if test -n "$cross_prefix"; then
47
}
49
+if test "$cross_compile" = "yes"; then
48
50
cross_arg="--cross-file config-meson.cross"
49
-static bool do_2misc_fp(DisasContext *s, arg_2misc *a,
51
echo "[host_machine]" >> $cross
50
- NeonGenOneSingleOpFn *fn)
52
if test "$mingw32" = "yes" ; then
51
-{
52
- int pass;
53
- TCGv_ptr fpst;
54
-
55
- /* Handle a 2-reg-misc operation by iterating 32 bits at a time */
56
- if (!arm_dc_feature(s, ARM_FEATURE_NEON)) {
57
- return false;
58
- }
59
-
60
- /* UNDEF accesses to D16-D31 if they don't exist. */
61
- if (!dc_isar_feature(aa32_simd_r32, s) &&
62
- ((a->vd | a->vm) & 0x10)) {
63
- return false;
64
- }
65
-
66
- if (a->size != 2) {
67
- /* TODO: FP16 will be the size == 1 case */
68
- return false;
69
- }
70
-
71
- if ((a->vd | a->vm) & a->q) {
72
- return false;
73
- }
74
-
75
- if (!vfp_access_check(s)) {
76
- return true;
77
- }
78
-
79
- fpst = fpstatus_ptr(FPST_STD);
80
- for (pass = 0; pass < (a->q ? 4 : 2); pass++) {
81
- TCGv_i32 tmp = neon_load_reg(a->vm, pass);
82
- fn(tmp, tmp, fpst);
83
- neon_store_reg(a->vd, pass, tmp);
84
- }
85
- tcg_temp_free_ptr(fpst);
86
-
87
- return true;
88
-}
89
-
90
#define DO_2MISC_FP_VEC(INSN, HFUNC, SFUNC) \
91
static void gen_##INSN(unsigned vece, uint32_t rd_ofs, \
92
uint32_t rm_ofs, \
93
@@ -XXX,XX +XXX,XX @@ DO_2MISC_FP_VEC(VCVT_FU, gen_helper_gvec_ustoh, gen_helper_gvec_uitos)
94
DO_2MISC_FP_VEC(VCVT_SF, gen_helper_gvec_tosszh, gen_helper_gvec_tosizs)
95
DO_2MISC_FP_VEC(VCVT_UF, gen_helper_gvec_touszh, gen_helper_gvec_touizs)
96
97
+DO_2MISC_FP_VEC(VRINTX_impl, gen_helper_gvec_vrintx_h, gen_helper_gvec_vrintx_s)
98
+
99
static bool trans_VRINTX(DisasContext *s, arg_2misc *a)
100
{
101
if (!arm_dc_feature(s, ARM_FEATURE_V8)) {
102
return false;
103
}
104
- return do_2misc_fp(s, a, gen_helper_rints_exact);
105
+ return trans_VRINTX_impl(s, a);
106
}
107
108
#define DO_VEC_RMODE(INSN, RMODE, OP) \
109
--
53
--
110
2.20.1
54
2.20.1
111
55
112
56
diff view generated by jsdifflib
1
From: Graeme Gregory <graeme@nuviainc.com>
1
From: Joelle van Dyne <j@getutm.app>
2
2
3
Add the previously created sbsa-ec device to the sbsa-ref machine in
3
Build without error on hosts without a working system(). If system()
4
secure memory so the PSCI implementation in ARM-TF can access it, but
4
is called, return -1 with ENOSYS.
5
not expose it to non secure firmware or OS except by via ARM-TF.
6
5
7
Signed-off-by: Graeme Gregory <graeme@nuviainc.com>
6
Signed-off-by: Joelle van Dyne <j@getutm.app>
8
Reviewed-by: Leif Lindholm <leif@nuviainc.com>
7
Message-id: 20210126012457.39046-6-j@getutm.app
9
Tested-by: Leif Lindholm <leif@nuviainc.com>
10
Message-id: 20200826141952.136164-3-graeme@nuviainc.com
11
Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
8
Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
12
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
9
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
13
---
10
---
14
hw/arm/sbsa-ref.c | 14 ++++++++++++++
11
meson.build | 1 +
15
1 file changed, 14 insertions(+)
12
include/qemu/osdep.h | 12 ++++++++++++
13
2 files changed, 13 insertions(+)
16
14
17
diff --git a/hw/arm/sbsa-ref.c b/hw/arm/sbsa-ref.c
15
diff --git a/meson.build b/meson.build
18
index XXXXXXX..XXXXXXX 100644
16
index XXXXXXX..XXXXXXX 100644
19
--- a/hw/arm/sbsa-ref.c
17
--- a/meson.build
20
+++ b/hw/arm/sbsa-ref.c
18
+++ b/meson.build
21
@@ -XXX,XX +XXX,XX @@ enum {
19
@@ -XXX,XX +XXX,XX @@ config_host_data.set('HAVE_DRM_H', cc.has_header('libdrm/drm.h'))
22
SBSA_CPUPERIPHS,
20
config_host_data.set('HAVE_PTY_H', cc.has_header('pty.h'))
23
SBSA_GIC_DIST,
21
config_host_data.set('HAVE_SYS_IOCCOM_H', cc.has_header('sys/ioccom.h'))
24
SBSA_GIC_REDIST,
22
config_host_data.set('HAVE_SYS_KCOV_H', cc.has_header('sys/kcov.h'))
25
+ SBSA_SECURE_EC,
23
+config_host_data.set('HAVE_SYSTEM_FUNCTION', cc.has_function('system', prefix: '#include <stdlib.h>'))
26
SBSA_SMMU,
24
27
SBSA_UART,
25
config_host_data.set('CONFIG_PREADV', cc.has_function('preadv', prefix: '#include <sys/uio.h>'))
28
SBSA_RTC,
26
29
@@ -XXX,XX +XXX,XX @@ static const MemMapEntry sbsa_ref_memmap[] = {
27
diff --git a/include/qemu/osdep.h b/include/qemu/osdep.h
30
[SBSA_CPUPERIPHS] = { 0x40000000, 0x00040000 },
28
index XXXXXXX..XXXXXXX 100644
31
[SBSA_GIC_DIST] = { 0x40060000, 0x00010000 },
29
--- a/include/qemu/osdep.h
32
[SBSA_GIC_REDIST] = { 0x40080000, 0x04000000 },
30
+++ b/include/qemu/osdep.h
33
+ [SBSA_SECURE_EC] = { 0x50000000, 0x00001000 },
31
@@ -XXX,XX +XXX,XX @@ static inline void qemu_thread_jit_write(void) {}
34
[SBSA_UART] = { 0x60000000, 0x00001000 },
32
static inline void qemu_thread_jit_execute(void) {}
35
[SBSA_RTC] = { 0x60010000, 0x00001000 },
33
#endif
36
[SBSA_GPIO] = { 0x60020000, 0x00001000 },
34
37
@@ -XXX,XX +XXX,XX @@ static void *sbsa_ref_dtb(const struct arm_boot_info *binfo, int *fdt_size)
35
+/**
38
return board->fdt;
36
+ * Platforms which do not support system() return ENOSYS
39
}
37
+ */
40
38
+#ifndef HAVE_SYSTEM_FUNCTION
41
+static void create_secure_ec(MemoryRegion *mem)
39
+#define system platform_does_not_support_system
40
+static inline int platform_does_not_support_system(const char *command)
42
+{
41
+{
43
+ hwaddr base = sbsa_ref_memmap[SBSA_SECURE_EC].base;
42
+ errno = ENOSYS;
44
+ DeviceState *dev = qdev_new("sbsa-ec");
43
+ return -1;
45
+ SysBusDevice *s = SYS_BUS_DEVICE(dev);
44
+}
45
+#endif /* !HAVE_SYSTEM_FUNCTION */
46
+
46
+
47
+ memory_region_add_subregion(mem, base,
47
#endif
48
+ sysbus_mmio_get_region(s, 0));
49
+}
50
+
51
static void sbsa_ref_init(MachineState *machine)
52
{
53
unsigned int smp_cpus = machine->smp.cpus;
54
@@ -XXX,XX +XXX,XX @@ static void sbsa_ref_init(MachineState *machine)
55
56
create_pcie(sms);
57
58
+ create_secure_ec(secure_sysmem);
59
+
60
sms->bootinfo.ram_size = machine->ram_size;
61
sms->bootinfo.nb_cpus = smp_cpus;
62
sms->bootinfo.board_id = -1;
63
--
48
--
64
2.20.1
49
2.20.1
65
50
66
51
diff view generated by jsdifflib
1
Convert the Neon VRINT-with-specified-rounding-mode insns to gvec,
1
From: Joelle van Dyne <j@getutm.app>
2
and use this to implement the fp16 versions.
3
2
3
Meson will find CoreFoundation, IOKit, and Cocoa as needed.
4
5
Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
6
Signed-off-by: Joelle van Dyne <j@getutm.app>
7
Message-id: 20210126012457.39046-7-j@getutm.app
4
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
8
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
5
Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
6
Message-id: 20200828183354.27913-41-peter.maydell@linaro.org
7
---
9
---
8
target/arm/helper.h | 4 +-
10
configure | 1 -
9
target/arm/vec_helper.c | 21 +++++++++++
11
1 file changed, 1 deletion(-)
10
target/arm/vfp_helper.c | 17 ---------
11
target/arm/translate-neon.c.inc | 67 +++------------------------------
12
4 files changed, 30 insertions(+), 79 deletions(-)
13
12
14
diff --git a/target/arm/helper.h b/target/arm/helper.h
13
diff --git a/configure b/configure
15
index XXXXXXX..XXXXXXX 100644
14
index XXXXXXX..XXXXXXX 100755
16
--- a/target/arm/helper.h
15
--- a/configure
17
+++ b/target/arm/helper.h
16
+++ b/configure
18
@@ -XXX,XX +XXX,XX @@ DEF_HELPER_3(vfp_sqtoh, f16, i64, i32, ptr)
17
@@ -XXX,XX +XXX,XX @@ Darwin)
19
DEF_HELPER_3(vfp_uqtoh, f16, i64, i32, ptr)
18
fi
20
19
audio_drv_list="coreaudio try-sdl"
21
DEF_HELPER_FLAGS_2(set_rmode, TCG_CALL_NO_RWG, i32, i32, ptr)
20
audio_possible_drivers="coreaudio sdl"
22
-DEF_HELPER_FLAGS_2(set_neon_rmode, TCG_CALL_NO_RWG, i32, i32, env)
21
- QEMU_LDFLAGS="-framework CoreFoundation -framework IOKit $QEMU_LDFLAGS"
23
22
# Disable attempts to use ObjectiveC features in os/object.h since they
24
DEF_HELPER_FLAGS_3(vfp_fcvt_f16_to_f32, TCG_CALL_NO_RWG, f32, f16, ptr, i32)
23
# won't work when we're compiling with gcc as a C compiler.
25
DEF_HELPER_FLAGS_3(vfp_fcvt_f32_to_f16, TCG_CALL_NO_RWG, f16, f32, ptr, i32)
24
QEMU_CFLAGS="-DOS_OBJECT_USE_OBJC=0 $QEMU_CFLAGS"
26
@@ -XXX,XX +XXX,XX @@ DEF_HELPER_FLAGS_4(gvec_vcvt_rm_us, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, i32)
27
DEF_HELPER_FLAGS_4(gvec_vcvt_rm_sh, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, i32)
28
DEF_HELPER_FLAGS_4(gvec_vcvt_rm_uh, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, i32)
29
30
+DEF_HELPER_FLAGS_4(gvec_vrint_rm_h, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, i32)
31
+DEF_HELPER_FLAGS_4(gvec_vrint_rm_s, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, i32)
32
+
33
DEF_HELPER_FLAGS_4(gvec_frecpe_h, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, i32)
34
DEF_HELPER_FLAGS_4(gvec_frecpe_s, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, i32)
35
DEF_HELPER_FLAGS_4(gvec_frecpe_d, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, i32)
36
diff --git a/target/arm/vec_helper.c b/target/arm/vec_helper.c
37
index XXXXXXX..XXXXXXX 100644
38
--- a/target/arm/vec_helper.c
39
+++ b/target/arm/vec_helper.c
40
@@ -XXX,XX +XXX,XX @@ DO_VCVT_RMODE(gvec_vcvt_rm_sh, helper_vfp_toshh, uint16_t)
41
DO_VCVT_RMODE(gvec_vcvt_rm_uh, helper_vfp_touhh, uint16_t)
42
43
#undef DO_VCVT_RMODE
44
+
45
+#define DO_VRINT_RMODE(NAME, FUNC, TYPE) \
46
+ void HELPER(NAME)(void *vd, void *vn, void *stat, uint32_t desc) \
47
+ { \
48
+ float_status *fpst = stat; \
49
+ intptr_t i, oprsz = simd_oprsz(desc); \
50
+ uint32_t rmode = simd_data(desc); \
51
+ uint32_t prev_rmode = get_float_rounding_mode(fpst); \
52
+ TYPE *d = vd, *n = vn; \
53
+ set_float_rounding_mode(rmode, fpst); \
54
+ for (i = 0; i < oprsz / sizeof(TYPE); i++) { \
55
+ d[i] = FUNC(n[i], fpst); \
56
+ } \
57
+ set_float_rounding_mode(prev_rmode, fpst); \
58
+ clear_tail(d, oprsz, simd_maxsz(desc)); \
59
+ }
60
+
61
+DO_VRINT_RMODE(gvec_vrint_rm_h, helper_rinth, uint16_t)
62
+DO_VRINT_RMODE(gvec_vrint_rm_s, helper_rints, uint32_t)
63
+
64
+#undef DO_VRINT_RMODE
65
diff --git a/target/arm/vfp_helper.c b/target/arm/vfp_helper.c
66
index XXXXXXX..XXXXXXX 100644
67
--- a/target/arm/vfp_helper.c
68
+++ b/target/arm/vfp_helper.c
69
@@ -XXX,XX +XXX,XX @@ uint32_t HELPER(set_rmode)(uint32_t rmode, void *fpstp)
70
return prev_rmode;
71
}
72
73
-/* Set the current fp rounding mode in the standard fp status and return
74
- * the old one. This is for NEON instructions that need to change the
75
- * rounding mode but wish to use the standard FPSCR values for everything
76
- * else. Always set the rounding mode back to the correct value after
77
- * modifying it.
78
- * The argument is a softfloat float_round_ value.
79
- */
80
-uint32_t HELPER(set_neon_rmode)(uint32_t rmode, CPUARMState *env)
81
-{
82
- float_status *fp_status = &env->vfp.standard_fp_status;
83
-
84
- uint32_t prev_rmode = get_float_rounding_mode(fp_status);
85
- set_float_rounding_mode(rmode, fp_status);
86
-
87
- return prev_rmode;
88
-}
89
-
90
/* Half precision conversions. */
91
float32 HELPER(vfp_fcvt_f16_to_f32)(uint32_t a, void *fpstp, uint32_t ahp_mode)
92
{
93
diff --git a/target/arm/translate-neon.c.inc b/target/arm/translate-neon.c.inc
94
index XXXXXXX..XXXXXXX 100644
95
--- a/target/arm/translate-neon.c.inc
96
+++ b/target/arm/translate-neon.c.inc
97
@@ -XXX,XX +XXX,XX @@ static bool trans_VRINTX(DisasContext *s, arg_2misc *a)
98
return do_2misc_fp(s, a, gen_helper_rints_exact);
99
}
100
101
-static bool do_vrint(DisasContext *s, arg_2misc *a, int rmode)
102
-{
103
- /*
104
- * Handle a VRINT* operation by iterating 32 bits at a time,
105
- * with a specified rounding mode in operation.
106
- */
107
- int pass;
108
- TCGv_ptr fpst;
109
- TCGv_i32 tcg_rmode;
110
-
111
- if (!arm_dc_feature(s, ARM_FEATURE_NEON) ||
112
- !arm_dc_feature(s, ARM_FEATURE_V8)) {
113
- return false;
114
- }
115
-
116
- /* UNDEF accesses to D16-D31 if they don't exist. */
117
- if (!dc_isar_feature(aa32_simd_r32, s) &&
118
- ((a->vd | a->vm) & 0x10)) {
119
- return false;
120
- }
121
-
122
- if (a->size != 2) {
123
- /* TODO: FP16 will be the size == 1 case */
124
- return false;
125
- }
126
-
127
- if ((a->vd | a->vm) & a->q) {
128
- return false;
129
- }
130
-
131
- if (!vfp_access_check(s)) {
132
- return true;
133
- }
134
-
135
- fpst = fpstatus_ptr(FPST_STD);
136
- tcg_rmode = tcg_const_i32(arm_rmode_to_sf(rmode));
137
- gen_helper_set_neon_rmode(tcg_rmode, tcg_rmode, cpu_env);
138
- for (pass = 0; pass < (a->q ? 4 : 2); pass++) {
139
- TCGv_i32 tmp = neon_load_reg(a->vm, pass);
140
- gen_helper_rints(tmp, tmp, fpst);
141
- neon_store_reg(a->vd, pass, tmp);
142
- }
143
- gen_helper_set_neon_rmode(tcg_rmode, tcg_rmode, cpu_env);
144
- tcg_temp_free_i32(tcg_rmode);
145
- tcg_temp_free_ptr(fpst);
146
-
147
- return true;
148
-}
149
-
150
-#define DO_VRINT(INSN, RMODE) \
151
- static bool trans_##INSN(DisasContext *s, arg_2misc *a) \
152
- { \
153
- return do_vrint(s, a, RMODE); \
154
- }
155
-
156
-DO_VRINT(VRINTN, FPROUNDING_TIEEVEN)
157
-DO_VRINT(VRINTA, FPROUNDING_TIEAWAY)
158
-DO_VRINT(VRINTZ, FPROUNDING_ZERO)
159
-DO_VRINT(VRINTM, FPROUNDING_NEGINF)
160
-DO_VRINT(VRINTP, FPROUNDING_POSINF)
161
-
162
#define DO_VEC_RMODE(INSN, RMODE, OP) \
163
static void gen_##INSN(unsigned vece, uint32_t rd_ofs, \
164
uint32_t rm_ofs, \
165
@@ -XXX,XX +XXX,XX @@ DO_VEC_RMODE(VCVTPS, FPROUNDING_POSINF, vcvt_rm_s)
166
DO_VEC_RMODE(VCVTMU, FPROUNDING_NEGINF, vcvt_rm_u)
167
DO_VEC_RMODE(VCVTMS, FPROUNDING_NEGINF, vcvt_rm_s)
168
169
+DO_VEC_RMODE(VRINTN, FPROUNDING_TIEEVEN, vrint_rm_)
170
+DO_VEC_RMODE(VRINTA, FPROUNDING_TIEAWAY, vrint_rm_)
171
+DO_VEC_RMODE(VRINTZ, FPROUNDING_ZERO, vrint_rm_)
172
+DO_VEC_RMODE(VRINTM, FPROUNDING_NEGINF, vrint_rm_)
173
+DO_VEC_RMODE(VRINTP, FPROUNDING_POSINF, vrint_rm_)
174
+
175
static bool trans_VSWP(DisasContext *s, arg_2misc *a)
176
{
177
TCGv_i64 rm, rd;
178
--
25
--
179
2.20.1
26
2.20.1
180
27
181
28
diff view generated by jsdifflib
1
Implement fp16 for the Neon VCVT insns which convert between
1
From: Joelle van Dyne <j@getutm.app>
2
float and fixed-point.
3
2
3
Add objc to the Meson cross file as well as detection of Darwin.
4
5
Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
6
Signed-off-by: Joelle van Dyne <j@getutm.app>
7
Reviewed-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
8
Message-id: 20210126012457.39046-8-j@getutm.app
4
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
9
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
5
Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
6
Message-id: 20200828183354.27913-39-peter.maydell@linaro.org
7
---
10
---
8
target/arm/helper.h | 5 +++++
11
configure | 4 ++++
9
target/arm/neon-dp.decode | 8 +++++++-
12
1 file changed, 4 insertions(+)
10
target/arm/vec_helper.c | 4 ++++
11
target/arm/translate-neon.c.inc | 5 +++++
12
4 files changed, 21 insertions(+), 1 deletion(-)
13
13
14
diff --git a/target/arm/helper.h b/target/arm/helper.h
14
diff --git a/configure b/configure
15
index XXXXXXX..XXXXXXX 100644
15
index XXXXXXX..XXXXXXX 100755
16
--- a/target/arm/helper.h
16
--- a/configure
17
+++ b/target/arm/helper.h
17
+++ b/configure
18
@@ -XXX,XX +XXX,XX @@ DEF_HELPER_FLAGS_4(gvec_vcvt_uf, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, i32)
18
@@ -XXX,XX +XXX,XX @@ echo "cpp_link_args = [${LDFLAGS:+$(meson_quote $LDFLAGS)}]" >> $cross
19
DEF_HELPER_FLAGS_4(gvec_vcvt_fs, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, i32)
19
echo "[binaries]" >> $cross
20
DEF_HELPER_FLAGS_4(gvec_vcvt_fu, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, i32)
20
echo "c = [$(meson_quote $cc)]" >> $cross
21
21
test -n "$cxx" && echo "cpp = [$(meson_quote $cxx)]" >> $cross
22
+DEF_HELPER_FLAGS_4(gvec_vcvt_sh, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, i32)
22
+test -n "$objcc" && echo "objc = [$(meson_quote $objcc)]" >> $cross
23
+DEF_HELPER_FLAGS_4(gvec_vcvt_uh, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, i32)
23
echo "ar = [$(meson_quote $ar)]" >> $cross
24
+DEF_HELPER_FLAGS_4(gvec_vcvt_hs, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, i32)
24
echo "nm = [$(meson_quote $nm)]" >> $cross
25
+DEF_HELPER_FLAGS_4(gvec_vcvt_hu, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, i32)
25
echo "pkgconfig = [$(meson_quote $pkg_config_exe)]" >> $cross
26
+
26
@@ -XXX,XX +XXX,XX @@ if test "$cross_compile" = "yes"; then
27
DEF_HELPER_FLAGS_4(gvec_frecpe_h, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, i32)
27
if test "$linux" = "yes" ; then
28
DEF_HELPER_FLAGS_4(gvec_frecpe_s, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, i32)
28
echo "system = 'linux'" >> $cross
29
DEF_HELPER_FLAGS_4(gvec_frecpe_d, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, i32)
29
fi
30
diff --git a/target/arm/neon-dp.decode b/target/arm/neon-dp.decode
30
+ if test "$darwin" = "yes" ; then
31
index XXXXXXX..XXXXXXX 100644
31
+ echo "system = 'darwin'" >> $cross
32
--- a/target/arm/neon-dp.decode
32
+ fi
33
+++ b/target/arm/neon-dp.decode
33
case "$ARCH" in
34
@@ -XXX,XX +XXX,XX @@ VMINNM_fp_3s 1111 001 1 0 . 1 . .... .... 1111 ... 1 .... @3same_fp
34
i386|x86_64)
35
# We use size=0 for fp32 and size=1 for fp16 to match the 3-same encodings.
35
echo "cpu_family = 'x86'" >> $cross
36
@2reg_vcvt .... ... . . . 1 ..... .... .... . q:1 . . .... \
37
&2reg_shift vm=%vm_dp vd=%vd_dp size=0 shift=%neon_rshift_i5
38
+@2reg_vcvt_f16 .... ... . . . 11 .... .... .... . q:1 . . .... \
39
+ &2reg_shift vm=%vm_dp vd=%vd_dp size=1 shift=%neon_rshift_i4
40
41
VSHR_S_2sh 1111 001 0 1 . ...... .... 0000 . . . 1 .... @2reg_shr_d
42
VSHR_S_2sh 1111 001 0 1 . ...... .... 0000 . . . 1 .... @2reg_shr_s
43
@@ -XXX,XX +XXX,XX @@ VSHLL_U_2sh 1111 001 1 1 . ...... .... 1010 . 0 . 1 .... @2reg_shll_h
44
VSHLL_U_2sh 1111 001 1 1 . ...... .... 1010 . 0 . 1 .... @2reg_shll_b
45
46
# VCVT fixed<->float conversions
47
-# TODO: FP16 fixed<->float conversions are opc==0b1100 and 0b1101
48
+VCVT_SH_2sh 1111 001 0 1 . ...... .... 1100 0 . . 1 .... @2reg_vcvt_f16
49
+VCVT_UH_2sh 1111 001 1 1 . ...... .... 1100 0 . . 1 .... @2reg_vcvt_f16
50
+VCVT_HS_2sh 1111 001 0 1 . ...... .... 1101 0 . . 1 .... @2reg_vcvt_f16
51
+VCVT_HU_2sh 1111 001 1 1 . ...... .... 1101 0 . . 1 .... @2reg_vcvt_f16
52
+
53
VCVT_SF_2sh 1111 001 0 1 . ...... .... 1110 0 . . 1 .... @2reg_vcvt
54
VCVT_UF_2sh 1111 001 1 1 . ...... .... 1110 0 . . 1 .... @2reg_vcvt
55
VCVT_FS_2sh 1111 001 0 1 . ...... .... 1111 0 . . 1 .... @2reg_vcvt
56
diff --git a/target/arm/vec_helper.c b/target/arm/vec_helper.c
57
index XXXXXXX..XXXXXXX 100644
58
--- a/target/arm/vec_helper.c
59
+++ b/target/arm/vec_helper.c
60
@@ -XXX,XX +XXX,XX @@ DO_VCVT_FIXED(gvec_vcvt_sf, helper_vfp_sltos, uint32_t)
61
DO_VCVT_FIXED(gvec_vcvt_uf, helper_vfp_ultos, uint32_t)
62
DO_VCVT_FIXED(gvec_vcvt_fs, helper_vfp_tosls_round_to_zero, uint32_t)
63
DO_VCVT_FIXED(gvec_vcvt_fu, helper_vfp_touls_round_to_zero, uint32_t)
64
+DO_VCVT_FIXED(gvec_vcvt_sh, helper_vfp_shtoh, uint16_t)
65
+DO_VCVT_FIXED(gvec_vcvt_uh, helper_vfp_uhtoh, uint16_t)
66
+DO_VCVT_FIXED(gvec_vcvt_hs, helper_vfp_toshh_round_to_zero, uint16_t)
67
+DO_VCVT_FIXED(gvec_vcvt_hu, helper_vfp_touhh_round_to_zero, uint16_t)
68
69
#undef DO_VCVT_FIXED
70
diff --git a/target/arm/translate-neon.c.inc b/target/arm/translate-neon.c.inc
71
index XXXXXXX..XXXXXXX 100644
72
--- a/target/arm/translate-neon.c.inc
73
+++ b/target/arm/translate-neon.c.inc
74
@@ -XXX,XX +XXX,XX @@ DO_FP_2SH(VCVT_UF, gen_helper_gvec_vcvt_uf)
75
DO_FP_2SH(VCVT_FS, gen_helper_gvec_vcvt_fs)
76
DO_FP_2SH(VCVT_FU, gen_helper_gvec_vcvt_fu)
77
78
+DO_FP_2SH(VCVT_SH, gen_helper_gvec_vcvt_sh)
79
+DO_FP_2SH(VCVT_UH, gen_helper_gvec_vcvt_uh)
80
+DO_FP_2SH(VCVT_HS, gen_helper_gvec_vcvt_hs)
81
+DO_FP_2SH(VCVT_HU, gen_helper_gvec_vcvt_hu)
82
+
83
static uint64_t asimd_imm_const(uint32_t imm, int cmode, int op)
84
{
85
/*
86
--
36
--
87
2.20.1
37
2.20.1
88
38
89
39
diff view generated by jsdifflib
1
From: Leif Lindholm <leif@nuviainc.com>
1
From: Joelle van Dyne <j@getutm.app>
2
2
3
The sbsa-ref platform uses a minimal device tree to pass amount of memory
3
Reviewed-by: Philippe Mathieu-Daudé <philmd@redhat.com>
4
as well as number of cpus to the firmware. However, when dumping that
4
Signed-off-by: Joelle van Dyne <j@getutm.app>
5
minimal dtb (with -M sbsa-virt,dumpdtb=<file>), the resulting blob
5
Message-id: 20210126012457.39046-9-j@getutm.app
6
generates a warning when decompiled by dtc due to lack of reg property.
7
8
Add a simple reg property per cpu, representing a 64-bit MPIDR_EL1.
9
10
This also ends up being cleaner than having the firmware calculating its
11
own IDs for generating APCI.
12
13
Signed-off-by: Leif Lindholm <leif@nuviainc.com>
14
Reviewed-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
15
Message-id: 20200827124335.30586-1-leif@nuviainc.com
16
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
6
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
17
---
7
---
18
hw/arm/sbsa-ref.c | 29 +++++++++++++++++++++++------
8
configure | 5 ++++-
19
1 file changed, 23 insertions(+), 6 deletions(-)
9
1 file changed, 4 insertions(+), 1 deletion(-)
20
10
21
diff --git a/hw/arm/sbsa-ref.c b/hw/arm/sbsa-ref.c
11
diff --git a/configure b/configure
22
index XXXXXXX..XXXXXXX 100644
12
index XXXXXXX..XXXXXXX 100755
23
--- a/hw/arm/sbsa-ref.c
13
--- a/configure
24
+++ b/hw/arm/sbsa-ref.c
14
+++ b/configure
25
@@ -XXX,XX +XXX,XX @@ static const int sbsa_ref_irqmap[] = {
15
@@ -XXX,XX +XXX,XX @@ if test "$cross_compile" = "yes"; then
26
[SBSA_EHCI] = 11,
16
echo "system = 'darwin'" >> $cross
27
};
17
fi
28
18
case "$ARCH" in
29
+static uint64_t sbsa_ref_cpu_mp_affinity(SBSAMachineState *sms, int idx)
19
- i386|x86_64)
30
+{
20
+ i386)
31
+ uint8_t clustersz = ARM_DEFAULT_CPUS_PER_CLUSTER;
21
echo "cpu_family = 'x86'" >> $cross
32
+ return arm_cpu_mp_affinity(idx, clustersz);
22
;;
33
+}
23
+ x86_64)
34
+
24
+ echo "cpu_family = 'x86_64'" >> $cross
35
/*
25
+ ;;
36
* Firmware on this machine only uses ACPI table to load OS, these limited
26
ppc64le)
37
* device tree nodes are just to let firmware know the info which varies from
27
echo "cpu_family = 'ppc64'" >> $cross
38
@@ -XXX,XX +XXX,XX @@ static void create_fdt(SBSAMachineState *sms)
28
;;
39
g_free(matrix);
40
}
41
42
+ /*
43
+ * From Documentation/devicetree/bindings/arm/cpus.yaml
44
+ * On ARM v8 64-bit systems this property is required
45
+ * and matches the MPIDR_EL1 register affinity bits.
46
+ *
47
+ * * If cpus node's #address-cells property is set to 2
48
+ *
49
+ * The first reg cell bits [7:0] must be set to
50
+ * bits [39:32] of MPIDR_EL1.
51
+ *
52
+ * The second reg cell bits [23:0] must be set to
53
+ * bits [23:0] of MPIDR_EL1.
54
+ */
55
qemu_fdt_add_subnode(sms->fdt, "/cpus");
56
+ qemu_fdt_setprop_cell(sms->fdt, "/cpus", "#address-cells", 2);
57
+ qemu_fdt_setprop_cell(sms->fdt, "/cpus", "#size-cells", 0x0);
58
59
for (cpu = sms->smp_cpus - 1; cpu >= 0; cpu--) {
60
char *nodename = g_strdup_printf("/cpus/cpu@%d", cpu);
61
ARMCPU *armcpu = ARM_CPU(qemu_get_cpu(cpu));
62
CPUState *cs = CPU(armcpu);
63
+ uint64_t mpidr = sbsa_ref_cpu_mp_affinity(sms, cpu);
64
65
qemu_fdt_add_subnode(sms->fdt, nodename);
66
+ qemu_fdt_setprop_u64(sms->fdt, nodename, "reg", mpidr);
67
68
if (ms->possible_cpus->cpus[cs->cpu_index].props.has_node_id) {
69
qemu_fdt_setprop_cell(sms->fdt, nodename, "numa-node-id",
70
@@ -XXX,XX +XXX,XX @@ static void sbsa_ref_init(MachineState *machine)
71
arm_load_kernel(ARM_CPU(first_cpu), machine, &sms->bootinfo);
72
}
73
74
-static uint64_t sbsa_ref_cpu_mp_affinity(SBSAMachineState *sms, int idx)
75
-{
76
- uint8_t clustersz = ARM_DEFAULT_CPUS_PER_CLUSTER;
77
- return arm_cpu_mp_affinity(idx, clustersz);
78
-}
79
-
80
static const CPUArchIdList *sbsa_ref_possible_cpu_arch_ids(MachineState *ms)
81
{
82
unsigned int max_cpus = ms->smp.max_cpus;
83
--
29
--
84
2.20.1
30
2.20.1
85
31
86
32
diff view generated by jsdifflib
1
Convert the Neon VCVT float<->fixed-point insns to a
1
From: Joelle van Dyne <j@getutm.app>
2
gvec style, in preparation for adding fp16 support.
3
2
3
On iOS there is no CoreAudio, so we should not assume Darwin always
4
has it.
5
6
Signed-off-by: Joelle van Dyne <j@getutm.app>
7
Reviewed-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
8
Message-id: 20210126012457.39046-11-j@getutm.app
4
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
9
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
5
Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
6
Message-id: 20200828183354.27913-38-peter.maydell@linaro.org
7
---
10
---
8
target/arm/helper.h | 5 +++++
11
configure | 35 +++++++++++++++++++++++++++++++++--
9
target/arm/vec_helper.c | 20 +++++++++++++++++++
12
1 file changed, 33 insertions(+), 2 deletions(-)
10
target/arm/translate-neon.c.inc | 35 +++++++++++++++++----------------
11
3 files changed, 43 insertions(+), 17 deletions(-)
12
13
13
diff --git a/target/arm/helper.h b/target/arm/helper.h
14
diff --git a/configure b/configure
14
index XXXXXXX..XXXXXXX 100644
15
index XXXXXXX..XXXXXXX 100755
15
--- a/target/arm/helper.h
16
--- a/configure
16
+++ b/target/arm/helper.h
17
+++ b/configure
17
@@ -XXX,XX +XXX,XX @@ DEF_HELPER_FLAGS_4(gvec_tosizs, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, i32)
18
@@ -XXX,XX +XXX,XX @@ fdt="auto"
18
DEF_HELPER_FLAGS_4(gvec_touszh, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, i32)
19
netmap="no"
19
DEF_HELPER_FLAGS_4(gvec_touizs, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, i32)
20
sdl="auto"
20
21
sdl_image="auto"
21
+DEF_HELPER_FLAGS_4(gvec_vcvt_sf, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, i32)
22
+coreaudio="auto"
22
+DEF_HELPER_FLAGS_4(gvec_vcvt_uf, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, i32)
23
virtiofsd="auto"
23
+DEF_HELPER_FLAGS_4(gvec_vcvt_fs, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, i32)
24
virtfs="auto"
24
+DEF_HELPER_FLAGS_4(gvec_vcvt_fu, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, i32)
25
libudev="auto"
26
@@ -XXX,XX +XXX,XX @@ Darwin)
27
QEMU_CFLAGS="-arch x86_64 $QEMU_CFLAGS"
28
QEMU_LDFLAGS="-arch x86_64 $QEMU_LDFLAGS"
29
fi
30
- audio_drv_list="coreaudio try-sdl"
31
+ audio_drv_list="try-coreaudio try-sdl"
32
audio_possible_drivers="coreaudio sdl"
33
# Disable attempts to use ObjectiveC features in os/object.h since they
34
# won't work when we're compiling with gcc as a C compiler.
35
@@ -XXX,XX +XXX,XX @@ EOF
36
fi
37
fi
38
39
+##########################################
40
+# detect CoreAudio
41
+if test "$coreaudio" != "no" ; then
42
+ coreaudio_libs="-framework CoreAudio"
43
+ cat > $TMPC << EOF
44
+#include <CoreAudio/CoreAudio.h>
45
+int main(void)
46
+{
47
+ return (int)AudioGetCurrentHostTime();
48
+}
49
+EOF
50
+ if compile_prog "" "$coreaudio_libs" ; then
51
+ coreaudio=yes
52
+ else
53
+ coreaudio=no
54
+ fi
55
+fi
25
+
56
+
26
DEF_HELPER_FLAGS_4(gvec_frecpe_h, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, i32)
57
##########################################
27
DEF_HELPER_FLAGS_4(gvec_frecpe_s, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, i32)
58
# Sound support libraries probe
28
DEF_HELPER_FLAGS_4(gvec_frecpe_d, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, i32)
59
29
diff --git a/target/arm/vec_helper.c b/target/arm/vec_helper.c
60
@@ -XXX,XX +XXX,XX @@ for drv in $audio_drv_list; do
30
index XXXXXXX..XXXXXXX 100644
61
fi
31
--- a/target/arm/vec_helper.c
62
;;
32
+++ b/target/arm/vec_helper.c
63
33
@@ -XXX,XX +XXX,XX @@ DO_NEON_PAIRWISE(neon_pmax, max)
64
- coreaudio)
34
DO_NEON_PAIRWISE(neon_pmin, min)
65
+ coreaudio | try-coreaudio)
35
66
+ if test "$coreaudio" = "no"; then
36
#undef DO_NEON_PAIRWISE
67
+ if test "$drv" = "try-coreaudio"; then
37
+
68
+ audio_drv_list=$(echo "$audio_drv_list" | sed -e 's/try-coreaudio//')
38
+#define DO_VCVT_FIXED(NAME, FUNC, TYPE) \
69
+ else
39
+ void HELPER(NAME)(void *vd, void *vn, void *stat, uint32_t desc) \
70
+ error_exit "$drv check failed" \
40
+ { \
71
+ "Make sure to have the $drv is available."
41
+ intptr_t i, oprsz = simd_oprsz(desc); \
72
+ fi
42
+ int shift = simd_data(desc); \
73
+ else
43
+ TYPE *d = vd, *n = vn; \
74
coreaudio_libs="-framework CoreAudio"
44
+ float_status *fpst = stat; \
75
+ if test "$drv" = "try-coreaudio"; then
45
+ for (i = 0; i < oprsz / sizeof(TYPE); i++) { \
76
+ audio_drv_list=$(echo "$audio_drv_list" | sed -e 's/try-coreaudio/coreaudio/')
46
+ d[i] = FUNC(n[i], shift, fpst); \
77
+ fi
47
+ } \
78
+ fi
48
+ clear_tail(d, oprsz, simd_maxsz(desc)); \
79
;;
49
+ }
80
50
+
81
dsound)
51
+DO_VCVT_FIXED(gvec_vcvt_sf, helper_vfp_sltos, uint32_t)
52
+DO_VCVT_FIXED(gvec_vcvt_uf, helper_vfp_ultos, uint32_t)
53
+DO_VCVT_FIXED(gvec_vcvt_fs, helper_vfp_tosls_round_to_zero, uint32_t)
54
+DO_VCVT_FIXED(gvec_vcvt_fu, helper_vfp_touls_round_to_zero, uint32_t)
55
+
56
+#undef DO_VCVT_FIXED
57
diff --git a/target/arm/translate-neon.c.inc b/target/arm/translate-neon.c.inc
58
index XXXXXXX..XXXXXXX 100644
59
--- a/target/arm/translate-neon.c.inc
60
+++ b/target/arm/translate-neon.c.inc
61
@@ -XXX,XX +XXX,XX @@ static bool trans_VSHLL_U_2sh(DisasContext *s, arg_2reg_shift *a)
62
}
63
64
static bool do_fp_2sh(DisasContext *s, arg_2reg_shift *a,
65
- NeonGenTwoSingleOpFn *fn)
66
+ gen_helper_gvec_2_ptr *fn)
67
{
68
/* FP operations in 2-reg-and-shift group */
69
- TCGv_i32 tmp, shiftv;
70
- TCGv_ptr fpstatus;
71
- int pass;
72
+ int vec_size = a->q ? 16 : 8;
73
+ int rd_ofs = neon_reg_offset(a->vd, 0);
74
+ int rm_ofs = neon_reg_offset(a->vm, 0);
75
+ TCGv_ptr fpst;
76
77
if (!arm_dc_feature(s, ARM_FEATURE_NEON)) {
78
return false;
79
}
80
81
+ if (a->size != 0) {
82
+ if (!dc_isar_feature(aa32_fp16_arith, s)) {
83
+ return false;
84
+ }
85
+ }
86
+
87
/* UNDEF accesses to D16-D31 if they don't exist. */
88
if (!dc_isar_feature(aa32_simd_r32, s) &&
89
((a->vd | a->vm) & 0x10)) {
90
@@ -XXX,XX +XXX,XX @@ static bool do_fp_2sh(DisasContext *s, arg_2reg_shift *a,
91
return true;
92
}
93
94
- fpstatus = fpstatus_ptr(FPST_STD);
95
- shiftv = tcg_const_i32(a->shift);
96
- for (pass = 0; pass < (a->q ? 4 : 2); pass++) {
97
- tmp = neon_load_reg(a->vm, pass);
98
- fn(tmp, tmp, shiftv, fpstatus);
99
- neon_store_reg(a->vd, pass, tmp);
100
- }
101
- tcg_temp_free_ptr(fpstatus);
102
- tcg_temp_free_i32(shiftv);
103
+ fpst = fpstatus_ptr(a->size ? FPST_STD_F16 : FPST_STD);
104
+ tcg_gen_gvec_2_ptr(rd_ofs, rm_ofs, fpst, vec_size, vec_size, a->shift, fn);
105
+ tcg_temp_free_ptr(fpst);
106
return true;
107
}
108
109
@@ -XXX,XX +XXX,XX @@ static bool do_fp_2sh(DisasContext *s, arg_2reg_shift *a,
110
return do_fp_2sh(s, a, FUNC); \
111
}
112
113
-DO_FP_2SH(VCVT_SF, gen_helper_vfp_sltos)
114
-DO_FP_2SH(VCVT_UF, gen_helper_vfp_ultos)
115
-DO_FP_2SH(VCVT_FS, gen_helper_vfp_tosls_round_to_zero)
116
-DO_FP_2SH(VCVT_FU, gen_helper_vfp_touls_round_to_zero)
117
+DO_FP_2SH(VCVT_SF, gen_helper_gvec_vcvt_sf)
118
+DO_FP_2SH(VCVT_UF, gen_helper_gvec_vcvt_uf)
119
+DO_FP_2SH(VCVT_FS, gen_helper_gvec_vcvt_fs)
120
+DO_FP_2SH(VCVT_FU, gen_helper_gvec_vcvt_fu)
121
122
static uint64_t asimd_imm_const(uint32_t imm, int cmode, int op)
123
{
124
--
82
--
125
2.20.1
83
2.20.1
126
84
127
85
diff view generated by jsdifflib
1
Convert the Neon pairwise fp ops to use a single gvic-style
1
From: Joelle van Dyne <j@getutm.app>
2
helper to do the full operation instead of one helper call
3
for each 32-bit part. This allows us to use the same
4
framework to implement the fp16.
5
2
3
A workaround added in early days of 64-bit OSX forced x86_64 if the
4
host machine had 64-bit support. This creates issues when cross-
5
compiling for ARM64. Additionally, the user can always use --cpu=* to
6
manually set the host CPU and therefore this workaround should be
7
removed.
8
9
Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
10
Signed-off-by: Joelle van Dyne <j@getutm.app>
11
Message-id: 20210126012457.39046-12-j@getutm.app
6
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
12
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
7
Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
8
Message-id: 20200828183354.27913-36-peter.maydell@linaro.org
9
---
13
---
10
target/arm/helper.h | 7 +++++
14
configure | 11 -----------
11
target/arm/vec_helper.c | 45 +++++++++++++++++++++++++++++++++
15
1 file changed, 11 deletions(-)
12
target/arm/translate-neon.c.inc | 42 ++++++++++++------------------
13
3 files changed, 68 insertions(+), 26 deletions(-)
14
16
15
diff --git a/target/arm/helper.h b/target/arm/helper.h
17
diff --git a/configure b/configure
16
index XXXXXXX..XXXXXXX 100644
18
index XXXXXXX..XXXXXXX 100755
17
--- a/target/arm/helper.h
19
--- a/configure
18
+++ b/target/arm/helper.h
20
+++ b/configure
19
@@ -XXX,XX +XXX,XX @@ DEF_HELPER_FLAGS_5(gvec_fcmlas_idx, TCG_CALL_NO_RWG,
21
@@ -XXX,XX +XXX,XX @@ fi
20
DEF_HELPER_FLAGS_5(gvec_fcmlad, TCG_CALL_NO_RWG,
22
# the correct CPU with the --cpu option.
21
void, ptr, ptr, ptr, ptr, i32)
23
case $targetos in
22
24
Darwin)
23
+DEF_HELPER_FLAGS_5(neon_paddh, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, ptr, i32)
25
- # on Leopard most of the system is 32-bit, so we have to ask the kernel if we can
24
+DEF_HELPER_FLAGS_5(neon_pmaxh, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, ptr, i32)
26
- # run 64-bit userspace code.
25
+DEF_HELPER_FLAGS_5(neon_pminh, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, ptr, i32)
27
- # If the user didn't specify a CPU explicitly and the kernel says this is
26
+DEF_HELPER_FLAGS_5(neon_padds, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, ptr, i32)
28
- # 64 bit hw, then assume x86_64. Otherwise fall through to the usual detection code.
27
+DEF_HELPER_FLAGS_5(neon_pmaxs, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, ptr, i32)
29
- if test -z "$cpu" && test "$(sysctl -n hw.optional.x86_64)" = "1"; then
28
+DEF_HELPER_FLAGS_5(neon_pmins, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, ptr, i32)
30
- cpu="x86_64"
29
+
31
- fi
30
DEF_HELPER_FLAGS_4(gvec_frecpe_h, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, i32)
32
HOST_DSOSUF=".dylib"
31
DEF_HELPER_FLAGS_4(gvec_frecpe_s, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, i32)
33
;;
32
DEF_HELPER_FLAGS_4(gvec_frecpe_d, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, i32)
34
SunOS)
33
diff --git a/target/arm/vec_helper.c b/target/arm/vec_helper.c
35
@@ -XXX,XX +XXX,XX @@ OpenBSD)
34
index XXXXXXX..XXXXXXX 100644
36
Darwin)
35
--- a/target/arm/vec_helper.c
37
bsd="yes"
36
+++ b/target/arm/vec_helper.c
38
darwin="yes"
37
@@ -XXX,XX +XXX,XX @@ DO_ABA(gvec_uaba_s, uint32_t)
39
- if [ "$cpu" = "x86_64" ] ; then
38
DO_ABA(gvec_uaba_d, uint64_t)
40
- QEMU_CFLAGS="-arch x86_64 $QEMU_CFLAGS"
39
41
- QEMU_LDFLAGS="-arch x86_64 $QEMU_LDFLAGS"
40
#undef DO_ABA
42
- fi
41
+
43
audio_drv_list="try-coreaudio try-sdl"
42
+#define DO_NEON_PAIRWISE(NAME, OP) \
44
audio_possible_drivers="coreaudio sdl"
43
+ void HELPER(NAME##s)(void *vd, void *vn, void *vm, \
45
# Disable attempts to use ObjectiveC features in os/object.h since they
44
+ void *stat, uint32_t oprsz) \
45
+ { \
46
+ float_status *fpst = stat; \
47
+ float32 *d = vd; \
48
+ float32 *n = vn; \
49
+ float32 *m = vm; \
50
+ float32 r0, r1; \
51
+ \
52
+ /* Read all inputs before writing outputs in case vm == vd */ \
53
+ r0 = float32_##OP(n[H4(0)], n[H4(1)], fpst); \
54
+ r1 = float32_##OP(m[H4(0)], m[H4(1)], fpst); \
55
+ \
56
+ d[H4(0)] = r0; \
57
+ d[H4(1)] = r1; \
58
+ } \
59
+ \
60
+ void HELPER(NAME##h)(void *vd, void *vn, void *vm, \
61
+ void *stat, uint32_t oprsz) \
62
+ { \
63
+ float_status *fpst = stat; \
64
+ float16 *d = vd; \
65
+ float16 *n = vn; \
66
+ float16 *m = vm; \
67
+ float16 r0, r1, r2, r3; \
68
+ \
69
+ /* Read all inputs before writing outputs in case vm == vd */ \
70
+ r0 = float16_##OP(n[H2(0)], n[H2(1)], fpst); \
71
+ r1 = float16_##OP(n[H2(2)], n[H2(3)], fpst); \
72
+ r2 = float16_##OP(m[H2(0)], m[H2(1)], fpst); \
73
+ r3 = float16_##OP(m[H2(2)], m[H2(3)], fpst); \
74
+ \
75
+ d[H4(0)] = r0; \
76
+ d[H4(1)] = r1; \
77
+ d[H4(2)] = r2; \
78
+ d[H4(3)] = r3; \
79
+ }
80
+
81
+DO_NEON_PAIRWISE(neon_padd, add)
82
+DO_NEON_PAIRWISE(neon_pmax, max)
83
+DO_NEON_PAIRWISE(neon_pmin, min)
84
+
85
+#undef DO_NEON_PAIRWISE
86
diff --git a/target/arm/translate-neon.c.inc b/target/arm/translate-neon.c.inc
87
index XXXXXXX..XXXXXXX 100644
88
--- a/target/arm/translate-neon.c.inc
89
+++ b/target/arm/translate-neon.c.inc
90
@@ -XXX,XX +XXX,XX @@ static bool trans_VMINNM_fp_3s(DisasContext *s, arg_3same *a)
91
return do_3same(s, a, gen_VMINNM_fp32_3s);
92
}
93
94
-static bool do_3same_fp_pair(DisasContext *s, arg_3same *a, VFPGen3OpSPFn *fn)
95
+static bool do_3same_fp_pair(DisasContext *s, arg_3same *a,
96
+ gen_helper_gvec_3_ptr *fn)
97
{
98
- /* FP operations handled pairwise 32 bits at a time */
99
- TCGv_i32 tmp, tmp2, tmp3;
100
+ /* FP pairwise operations */
101
TCGv_ptr fpstatus;
102
103
if (!arm_dc_feature(s, ARM_FEATURE_NEON)) {
104
@@ -XXX,XX +XXX,XX @@ static bool do_3same_fp_pair(DisasContext *s, arg_3same *a, VFPGen3OpSPFn *fn)
105
106
assert(a->q == 0); /* enforced by decode patterns */
107
108
- /*
109
- * Note that we have to be careful not to clobber the source operands
110
- * in the "vm == vd" case by storing the result of the first pass too
111
- * early. Since Q is 0 there are always just two passes, so instead
112
- * of a complicated loop over each pass we just unroll.
113
- */
114
- fpstatus = fpstatus_ptr(FPST_STD);
115
- tmp = neon_load_reg(a->vn, 0);
116
- tmp2 = neon_load_reg(a->vn, 1);
117
- fn(tmp, tmp, tmp2, fpstatus);
118
- tcg_temp_free_i32(tmp2);
119
120
- tmp3 = neon_load_reg(a->vm, 0);
121
- tmp2 = neon_load_reg(a->vm, 1);
122
- fn(tmp3, tmp3, tmp2, fpstatus);
123
- tcg_temp_free_i32(tmp2);
124
+ fpstatus = fpstatus_ptr(a->size != 0 ? FPST_STD_F16 : FPST_STD);
125
+ tcg_gen_gvec_3_ptr(vfp_reg_offset(1, a->vd),
126
+ vfp_reg_offset(1, a->vn),
127
+ vfp_reg_offset(1, a->vm),
128
+ fpstatus, 8, 8, 0, fn);
129
tcg_temp_free_ptr(fpstatus);
130
131
- neon_store_reg(a->vd, 0, tmp);
132
- neon_store_reg(a->vd, 1, tmp3);
133
return true;
134
}
135
136
@@ -XXX,XX +XXX,XX @@ static bool do_3same_fp_pair(DisasContext *s, arg_3same *a, VFPGen3OpSPFn *fn)
137
static bool trans_##INSN##_fp_3s(DisasContext *s, arg_3same *a) \
138
{ \
139
if (a->size != 0) { \
140
- /* TODO fp16 support */ \
141
- return false; \
142
+ if (!dc_isar_feature(aa32_fp16_arith, s)) { \
143
+ return false; \
144
+ } \
145
+ return do_3same_fp_pair(s, a, FUNC##h); \
146
} \
147
- return do_3same_fp_pair(s, a, FUNC); \
148
+ return do_3same_fp_pair(s, a, FUNC##s); \
149
}
150
151
-DO_3S_FP_PAIR(VPADD, gen_helper_vfp_adds)
152
-DO_3S_FP_PAIR(VPMAX, gen_helper_vfp_maxs)
153
-DO_3S_FP_PAIR(VPMIN, gen_helper_vfp_mins)
154
+DO_3S_FP_PAIR(VPADD, gen_helper_neon_padd)
155
+DO_3S_FP_PAIR(VPMAX, gen_helper_neon_pmax)
156
+DO_3S_FP_PAIR(VPMIN, gen_helper_neon_pmin)
157
158
static bool do_vector_2sh(DisasContext *s, arg_2reg_shift *a, GVecGen2iFn *fn)
159
{
160
--
46
--
161
2.20.1
47
2.20.1
162
48
163
49
diff view generated by jsdifflib
1
Convert the Neon floating point VMAXNM and VMINNM insns to
1
From: Alexander Graf <agraf@csgraf.de>
2
using a gvec helper and use this to implement the fp16 case.
3
2
3
In macOS 11, QEMU only gets access to Hypervisor.framework if it has the
4
respective entitlement. Add an entitlement template and automatically self
5
sign and apply the entitlement in the build.
6
7
Signed-off-by: Alexander Graf <agraf@csgraf.de>
8
Reviewed-by: Roman Bolshakov <r.bolshakov@yadro.com>
9
Tested-by: Roman Bolshakov <r.bolshakov@yadro.com>
4
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
10
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
5
Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
6
Message-id: 20200828183354.27913-30-peter.maydell@linaro.org
7
---
11
---
8
target/arm/helper.h | 6 ++++++
12
meson.build | 29 +++++++++++++++++++++++++----
9
target/arm/vec_helper.c | 6 ++++++
13
accel/hvf/entitlements.plist | 8 ++++++++
10
target/arm/translate-neon.c.inc | 23 +++++++++++++++--------
14
scripts/entitlement.sh | 13 +++++++++++++
11
3 files changed, 27 insertions(+), 8 deletions(-)
15
3 files changed, 46 insertions(+), 4 deletions(-)
16
create mode 100644 accel/hvf/entitlements.plist
17
create mode 100755 scripts/entitlement.sh
12
18
13
diff --git a/target/arm/helper.h b/target/arm/helper.h
19
diff --git a/meson.build b/meson.build
14
index XXXXXXX..XXXXXXX 100644
20
index XXXXXXX..XXXXXXX 100644
15
--- a/target/arm/helper.h
21
--- a/meson.build
16
+++ b/target/arm/helper.h
22
+++ b/meson.build
17
@@ -XXX,XX +XXX,XX @@ DEF_HELPER_FLAGS_5(gvec_fmax_s, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, ptr, i32)
23
@@ -XXX,XX +XXX,XX @@ foreach target : target_dirs
18
DEF_HELPER_FLAGS_5(gvec_fmin_h, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, ptr, i32)
24
}]
19
DEF_HELPER_FLAGS_5(gvec_fmin_s, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, ptr, i32)
25
endif
20
26
foreach exe: execs
21
+DEF_HELPER_FLAGS_5(gvec_fmaxnum_h, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, ptr, i32)
27
- emulators += {exe['name']:
22
+DEF_HELPER_FLAGS_5(gvec_fmaxnum_s, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, ptr, i32)
28
- executable(exe['name'], exe['sources'],
29
- install: true,
30
+ exe_name = exe['name']
31
+ exe_sign = 'CONFIG_HVF' in config_target
32
+ if exe_sign
33
+ exe_name += '-unsigned'
34
+ endif
23
+
35
+
24
+DEF_HELPER_FLAGS_5(gvec_fminnum_h, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, ptr, i32)
36
+ emulator = executable(exe_name, exe['sources'],
25
+DEF_HELPER_FLAGS_5(gvec_fminnum_s, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, ptr, i32)
37
+ install: not exe_sign,
38
c_args: c_args,
39
dependencies: arch_deps + deps + exe['dependencies'],
40
objects: lib.extract_all_objects(recursive: true),
41
@@ -XXX,XX +XXX,XX @@ foreach target : target_dirs
42
link_depends: [block_syms, qemu_syms] + exe.get('link_depends', []),
43
link_args: link_args,
44
gui_app: exe['gui'])
45
- }
26
+
46
+
27
DEF_HELPER_FLAGS_5(gvec_ftsmul_h, TCG_CALL_NO_RWG,
47
+ if exe_sign
28
void, ptr, ptr, ptr, ptr, i32)
48
+ emulators += {exe['name'] : custom_target(exe['name'],
29
DEF_HELPER_FLAGS_5(gvec_ftsmul_s, TCG_CALL_NO_RWG,
49
+ install: true,
30
diff --git a/target/arm/vec_helper.c b/target/arm/vec_helper.c
50
+ install_dir: get_option('bindir'),
31
index XXXXXXX..XXXXXXX 100644
51
+ depends: emulator,
32
--- a/target/arm/vec_helper.c
52
+ output: exe['name'],
33
+++ b/target/arm/vec_helper.c
53
+ command: [
34
@@ -XXX,XX +XXX,XX @@ DO_3OP(gvec_fmax_s, float32_max, float32)
54
+ meson.current_source_dir() / 'scripts/entitlement.sh',
35
DO_3OP(gvec_fmin_h, float16_min, float16)
55
+ meson.current_build_dir() / exe_name,
36
DO_3OP(gvec_fmin_s, float32_min, float32)
56
+ meson.current_build_dir() / exe['name'],
37
57
+ meson.current_source_dir() / 'accel/hvf/entitlements.plist'
38
+DO_3OP(gvec_fmaxnum_h, float16_maxnum, float16)
58
+ ])
39
+DO_3OP(gvec_fmaxnum_s, float32_maxnum, float32)
59
+ }
60
+ else
61
+ emulators += {exe['name']: emulator}
62
+ endif
63
64
if 'CONFIG_TRACE_SYSTEMTAP' in config_host
65
foreach stp: [
66
diff --git a/accel/hvf/entitlements.plist b/accel/hvf/entitlements.plist
67
new file mode 100644
68
index XXXXXXX..XXXXXXX
69
--- /dev/null
70
+++ b/accel/hvf/entitlements.plist
71
@@ -XXX,XX +XXX,XX @@
72
+<?xml version="1.0" encoding="UTF-8"?>
73
+<!DOCTYPE plist PUBLIC "-//Apple//DTD PLIST 1.0//EN" "http://www.apple.com/DTDs/PropertyList-1.0.dtd">
74
+<plist version="1.0">
75
+<dict>
76
+ <key>com.apple.security.hypervisor</key>
77
+ <true/>
78
+</dict>
79
+</plist>
80
diff --git a/scripts/entitlement.sh b/scripts/entitlement.sh
81
new file mode 100755
82
index XXXXXXX..XXXXXXX
83
--- /dev/null
84
+++ b/scripts/entitlement.sh
85
@@ -XXX,XX +XXX,XX @@
86
+#!/bin/sh -e
87
+#
88
+# Helper script for the build process to apply entitlements
40
+
89
+
41
+DO_3OP(gvec_fminnum_h, float16_minnum, float16)
90
+SRC="$1"
42
+DO_3OP(gvec_fminnum_s, float32_minnum, float32)
91
+DST="$2"
92
+ENTITLEMENT="$3"
43
+
93
+
44
#ifdef TARGET_AARCH64
94
+trap 'rm "$DST.tmp"' exit
45
95
+cp -af "$SRC" "$DST.tmp"
46
DO_3OP(gvec_recps_h, helper_recpsf_f16, float16)
96
+codesign --entitlements "$ENTITLEMENT" --force -s - "$DST.tmp"
47
diff --git a/target/arm/translate-neon.c.inc b/target/arm/translate-neon.c.inc
97
+mv "$DST.tmp" "$DST"
48
index XXXXXXX..XXXXXXX 100644
98
+trap '' exit
49
--- a/target/arm/translate-neon.c.inc
50
+++ b/target/arm/translate-neon.c.inc
51
@@ -XXX,XX +XXX,XX @@ static void gen_VMLS_fp_3s(TCGv_i32 vd, TCGv_i32 vn, TCGv_i32 vm,
52
DO_3S_FP(VMLA, gen_VMLA_fp_3s, true)
53
DO_3S_FP(VMLS, gen_VMLS_fp_3s, true)
54
55
+WRAP_FP_GVEC(gen_VMAXNM_fp32_3s, FPST_STD, gen_helper_gvec_fmaxnum_s)
56
+WRAP_FP_GVEC(gen_VMAXNM_fp16_3s, FPST_STD_F16, gen_helper_gvec_fmaxnum_h)
57
+WRAP_FP_GVEC(gen_VMINNM_fp32_3s, FPST_STD, gen_helper_gvec_fminnum_s)
58
+WRAP_FP_GVEC(gen_VMINNM_fp16_3s, FPST_STD_F16, gen_helper_gvec_fminnum_h)
59
+
60
static bool trans_VMAXNM_fp_3s(DisasContext *s, arg_3same *a)
61
{
62
if (!arm_dc_feature(s, ARM_FEATURE_V8)) {
63
@@ -XXX,XX +XXX,XX @@ static bool trans_VMAXNM_fp_3s(DisasContext *s, arg_3same *a)
64
}
65
66
if (a->size != 0) {
67
- /* TODO fp16 support */
68
- return false;
69
+ if (!dc_isar_feature(aa32_fp16_arith, s)) {
70
+ return false;
71
+ }
72
+ return do_3same(s, a, gen_VMAXNM_fp16_3s);
73
}
74
-
75
- return do_3same_fp(s, a, gen_helper_vfp_maxnums, false);
76
+ return do_3same(s, a, gen_VMAXNM_fp32_3s);
77
}
78
79
static bool trans_VMINNM_fp_3s(DisasContext *s, arg_3same *a)
80
@@ -XXX,XX +XXX,XX @@ static bool trans_VMINNM_fp_3s(DisasContext *s, arg_3same *a)
81
}
82
83
if (a->size != 0) {
84
- /* TODO fp16 support */
85
- return false;
86
+ if (!dc_isar_feature(aa32_fp16_arith, s)) {
87
+ return false;
88
+ }
89
+ return do_3same(s, a, gen_VMINNM_fp16_3s);
90
}
91
-
92
- return do_3same_fp(s, a, gen_helper_vfp_minnums, false);
93
+ return do_3same(s, a, gen_VMINNM_fp32_3s);
94
}
95
96
WRAP_ENV_FN(gen_VRECPS_tramp, gen_helper_recps_f32)
97
--
99
--
98
2.20.1
100
2.20.1
99
101
100
102
diff view generated by jsdifflib
1
From: Graeme Gregory <graeme@nuviainc.com>
1
From: Mihai Carabas <mihai.carabas@oracle.com>
2
2
3
A difference between sbsa platform and the virt platform is PSCI is
3
To ease the PCI device addition in next patches, split the code as follows:
4
handled by ARM-TF in the sbsa platform. This means that the PSCI code
4
- generic code (read/write/setup) is being kept in pvpanic.c
5
there needs to communicate some of the platform power changes down
5
- ISA dependent code moved to pvpanic-isa.c
6
to the qemu code for things like shutdown/reset control.
6
7
7
Also, rename:
8
Space has been left to extend the EC if we find other use cases in
8
- ISA_PVPANIC_DEVICE -> PVPANIC_ISA_DEVICE.
9
future where ARM-TF and qemu need to communicate.
9
- TYPE_PVPANIC -> TYPE_PVPANIC_ISA.
10
10
- MemoryRegion io -> mr.
11
Signed-off-by: Graeme Gregory <graeme@nuviainc.com>
11
- pvpanic_ioport_* in pvpanic_*.
12
Reviewed-by: Leif Lindholm <leif@nuviainc.com>
12
13
Tested-by: Leif Lindholm <leif@nuviainc.com>
13
Update the build system with the new files and config structure.
14
Message-id: 20200826141952.136164-2-graeme@nuviainc.com
14
15
Signed-off-by: Mihai Carabas <mihai.carabas@oracle.com>
15
Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
16
Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
16
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
17
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
17
---
18
---
18
hw/misc/sbsa_ec.c | 98 +++++++++++++++++++++++++++++++++++++++++++++
19
include/hw/misc/pvpanic.h | 23 +++++++++-
19
hw/misc/meson.build | 2 +
20
hw/misc/pvpanic-isa.c | 94 +++++++++++++++++++++++++++++++++++++++
20
2 files changed, 100 insertions(+)
21
hw/misc/pvpanic.c | 85 +++--------------------------------
21
create mode 100644 hw/misc/sbsa_ec.c
22
hw/i386/Kconfig | 2 +-
22
23
hw/misc/Kconfig | 6 ++-
23
diff --git a/hw/misc/sbsa_ec.c b/hw/misc/sbsa_ec.c
24
hw/misc/meson.build | 3 +-
25
tests/qtest/meson.build | 2 +-
26
7 files changed, 130 insertions(+), 85 deletions(-)
27
create mode 100644 hw/misc/pvpanic-isa.c
28
29
diff --git a/include/hw/misc/pvpanic.h b/include/hw/misc/pvpanic.h
30
index XXXXXXX..XXXXXXX 100644
31
--- a/include/hw/misc/pvpanic.h
32
+++ b/include/hw/misc/pvpanic.h
33
@@ -XXX,XX +XXX,XX @@
34
35
#include "qom/object.h"
36
37
-#define TYPE_PVPANIC "pvpanic"
38
+#define TYPE_PVPANIC_ISA_DEVICE "pvpanic"
39
40
#define PVPANIC_IOPORT_PROP "ioport"
41
42
+/* The bit of supported pv event, TODO: include uapi header and remove this */
43
+#define PVPANIC_F_PANICKED 0
44
+#define PVPANIC_F_CRASHLOADED 1
45
+
46
+/* The pv event value */
47
+#define PVPANIC_PANICKED (1 << PVPANIC_F_PANICKED)
48
+#define PVPANIC_CRASHLOADED (1 << PVPANIC_F_CRASHLOADED)
49
+
50
+/*
51
+ * PVPanicState for any device type
52
+ */
53
+typedef struct PVPanicState PVPanicState;
54
+struct PVPanicState {
55
+ MemoryRegion mr;
56
+ uint8_t events;
57
+};
58
+
59
+void pvpanic_setup_io(PVPanicState *s, DeviceState *dev, unsigned size);
60
+
61
static inline uint16_t pvpanic_port(void)
62
{
63
- Object *o = object_resolve_path_type("", TYPE_PVPANIC, NULL);
64
+ Object *o = object_resolve_path_type("", TYPE_PVPANIC_ISA_DEVICE, NULL);
65
if (!o) {
66
return 0;
67
}
68
diff --git a/hw/misc/pvpanic-isa.c b/hw/misc/pvpanic-isa.c
24
new file mode 100644
69
new file mode 100644
25
index XXXXXXX..XXXXXXX
70
index XXXXXXX..XXXXXXX
26
--- /dev/null
71
--- /dev/null
27
+++ b/hw/misc/sbsa_ec.c
72
+++ b/hw/misc/pvpanic-isa.c
28
@@ -XXX,XX +XXX,XX @@
73
@@ -XXX,XX +XXX,XX @@
29
+/*
74
+/*
30
+ * ARM SBSA Reference Platform Embedded Controller
75
+ * QEMU simulated pvpanic device.
31
+ *
76
+ *
32
+ * A device to allow PSCI running in the secure side of sbsa-ref machine
77
+ * Copyright Fujitsu, Corp. 2013
33
+ * to communicate platform power states to qemu.
34
+ *
78
+ *
35
+ * Copyright (c) 2020 Nuvia Inc
79
+ * Authors:
36
+ * Written by Graeme Gregory <graeme@nuviainc.com>
80
+ * Wen Congyang <wency@cn.fujitsu.com>
81
+ * Hu Tao <hutao@cn.fujitsu.com>
37
+ *
82
+ *
38
+ * SPDX-License-Identifer: GPL-2.0-or-later
83
+ * This work is licensed under the terms of the GNU GPL, version 2 or later.
84
+ * See the COPYING file in the top-level directory.
85
+ *
39
+ */
86
+ */
40
+
87
+
41
+#include "qemu/osdep.h"
88
+#include "qemu/osdep.h"
42
+#include "qemu-common.h"
43
+#include "qemu/log.h"
89
+#include "qemu/log.h"
44
+#include "hw/sysbus.h"
90
+#include "qemu/module.h"
45
+#include "sysemu/runstate.h"
91
+#include "sysemu/runstate.h"
46
+
92
+
47
+typedef struct {
93
+#include "hw/nvram/fw_cfg.h"
48
+ SysBusDevice parent_obj;
94
+#include "hw/qdev-properties.h"
49
+ MemoryRegion iomem;
95
+#include "hw/misc/pvpanic.h"
50
+} SECUREECState;
96
+#include "qom/object.h"
51
+
97
+#include "hw/isa/isa.h"
52
+#define TYPE_SBSA_EC "sbsa-ec"
98
+
53
+#define SECURE_EC(obj) OBJECT_CHECK(SECUREECState, (obj), TYPE_SBSA_EC)
99
+OBJECT_DECLARE_SIMPLE_TYPE(PVPanicISAState, PVPANIC_ISA_DEVICE)
54
+
100
+
55
+enum sbsa_ec_powerstates {
101
+/*
56
+ SBSA_EC_CMD_POWEROFF = 0x01,
102
+ * PVPanicISAState for ISA device and
57
+ SBSA_EC_CMD_REBOOT = 0x02,
103
+ * use ioport.
104
+ */
105
+struct PVPanicISAState {
106
+ ISADevice parent_obj;
107
+
108
+ uint16_t ioport;
109
+ PVPanicState pvpanic;
58
+};
110
+};
59
+
111
+
60
+static uint64_t sbsa_ec_read(void *opaque, hwaddr offset, unsigned size)
112
+static void pvpanic_isa_initfn(Object *obj)
61
+{
113
+{
62
+ /* No use for this currently */
114
+ PVPanicISAState *s = PVPANIC_ISA_DEVICE(obj);
63
+ qemu_log_mask(LOG_GUEST_ERROR, "sbsa-ec: no readable registers");
115
+
64
+ return 0;
116
+ pvpanic_setup_io(&s->pvpanic, DEVICE(s), 1);
65
+}
117
+}
66
+
118
+
67
+static void sbsa_ec_write(void *opaque, hwaddr offset,
119
+static void pvpanic_isa_realizefn(DeviceState *dev, Error **errp)
68
+ uint64_t value, unsigned size)
69
+{
120
+{
70
+ if (offset == 0) { /* PSCI machine power command register */
121
+ ISADevice *d = ISA_DEVICE(dev);
71
+ switch (value) {
122
+ PVPanicISAState *s = PVPANIC_ISA_DEVICE(dev);
72
+ case SBSA_EC_CMD_POWEROFF:
123
+ PVPanicState *ps = &s->pvpanic;
73
+ qemu_system_shutdown_request(SHUTDOWN_CAUSE_GUEST_SHUTDOWN);
124
+ FWCfgState *fw_cfg = fw_cfg_find();
74
+ break;
125
+ uint16_t *pvpanic_port;
75
+ case SBSA_EC_CMD_REBOOT:
126
+
76
+ qemu_system_reset_request(SHUTDOWN_CAUSE_GUEST_RESET);
127
+ if (!fw_cfg) {
77
+ break;
128
+ return;
78
+ default:
79
+ qemu_log_mask(LOG_GUEST_ERROR,
80
+ "sbsa-ec: unknown power command");
81
+ }
82
+ } else {
83
+ qemu_log_mask(LOG_GUEST_ERROR, "sbsa-ec: unknown EC register");
84
+ }
129
+ }
130
+
131
+ pvpanic_port = g_malloc(sizeof(*pvpanic_port));
132
+ *pvpanic_port = cpu_to_le16(s->ioport);
133
+ fw_cfg_add_file(fw_cfg, "etc/pvpanic-port", pvpanic_port,
134
+ sizeof(*pvpanic_port));
135
+
136
+ isa_register_ioport(d, &ps->mr, s->ioport);
85
+}
137
+}
86
+
138
+
87
+static const MemoryRegionOps sbsa_ec_ops = {
139
+static Property pvpanic_isa_properties[] = {
88
+ .read = sbsa_ec_read,
140
+ DEFINE_PROP_UINT16(PVPANIC_IOPORT_PROP, PVPanicISAState, ioport, 0x505),
89
+ .write = sbsa_ec_write,
141
+ DEFINE_PROP_UINT8("events", PVPanicISAState, pvpanic.events, PVPANIC_PANICKED | PVPANIC_CRASHLOADED),
90
+ .endianness = DEVICE_NATIVE_ENDIAN,
142
+ DEFINE_PROP_END_OF_LIST(),
91
+ .valid.min_access_size = 4,
92
+ .valid.max_access_size = 4,
93
+};
143
+};
94
+
144
+
95
+static void sbsa_ec_init(Object *obj)
145
+static void pvpanic_isa_class_init(ObjectClass *klass, void *data)
96
+{
97
+ SECUREECState *s = SECURE_EC(obj);
98
+ SysBusDevice *dev = SYS_BUS_DEVICE(obj);
99
+
100
+ memory_region_init_io(&s->iomem, obj, &sbsa_ec_ops, s, "sbsa-ec",
101
+ 0x1000);
102
+ sysbus_init_mmio(dev, &s->iomem);
103
+}
104
+
105
+static void sbsa_ec_class_init(ObjectClass *klass, void *data)
106
+{
146
+{
107
+ DeviceClass *dc = DEVICE_CLASS(klass);
147
+ DeviceClass *dc = DEVICE_CLASS(klass);
108
+
148
+
109
+ /* No vmstate or reset required: device has no internal state */
149
+ dc->realize = pvpanic_isa_realizefn;
110
+ dc->user_creatable = false;
150
+ device_class_set_props(dc, pvpanic_isa_properties);
151
+ set_bit(DEVICE_CATEGORY_MISC, dc->categories);
111
+}
152
+}
112
+
153
+
113
+static const TypeInfo sbsa_ec_info = {
154
+static TypeInfo pvpanic_isa_info = {
114
+ .name = TYPE_SBSA_EC,
155
+ .name = TYPE_PVPANIC_ISA_DEVICE,
115
+ .parent = TYPE_SYS_BUS_DEVICE,
156
+ .parent = TYPE_ISA_DEVICE,
116
+ .instance_size = sizeof(SECUREECState),
157
+ .instance_size = sizeof(PVPanicISAState),
117
+ .instance_init = sbsa_ec_init,
158
+ .instance_init = pvpanic_isa_initfn,
118
+ .class_init = sbsa_ec_class_init,
159
+ .class_init = pvpanic_isa_class_init,
119
+};
160
+};
120
+
161
+
121
+static void sbsa_ec_register_type(void)
162
+static void pvpanic_register_types(void)
122
+{
163
+{
123
+ type_register_static(&sbsa_ec_info);
164
+ type_register_static(&pvpanic_isa_info);
124
+}
165
+}
125
+
166
+
126
+type_init(sbsa_ec_register_type);
167
+type_init(pvpanic_register_types)
168
diff --git a/hw/misc/pvpanic.c b/hw/misc/pvpanic.c
169
index XXXXXXX..XXXXXXX 100644
170
--- a/hw/misc/pvpanic.c
171
+++ b/hw/misc/pvpanic.c
172
@@ -XXX,XX +XXX,XX @@
173
#include "hw/misc/pvpanic.h"
174
#include "qom/object.h"
175
176
-/* The bit of supported pv event, TODO: include uapi header and remove this */
177
-#define PVPANIC_F_PANICKED 0
178
-#define PVPANIC_F_CRASHLOADED 1
179
-
180
-/* The pv event value */
181
-#define PVPANIC_PANICKED (1 << PVPANIC_F_PANICKED)
182
-#define PVPANIC_CRASHLOADED (1 << PVPANIC_F_CRASHLOADED)
183
-
184
-typedef struct PVPanicState PVPanicState;
185
-DECLARE_INSTANCE_CHECKER(PVPanicState, ISA_PVPANIC_DEVICE,
186
- TYPE_PVPANIC)
187
-
188
static void handle_event(int event)
189
{
190
static bool logged;
191
@@ -XXX,XX +XXX,XX @@ static void handle_event(int event)
192
}
193
}
194
195
-#include "hw/isa/isa.h"
196
-
197
-struct PVPanicState {
198
- ISADevice parent_obj;
199
-
200
- MemoryRegion io;
201
- uint16_t ioport;
202
- uint8_t events;
203
-};
204
-
205
/* return supported events on read */
206
-static uint64_t pvpanic_ioport_read(void *opaque, hwaddr addr, unsigned size)
207
+static uint64_t pvpanic_read(void *opaque, hwaddr addr, unsigned size)
208
{
209
PVPanicState *pvp = opaque;
210
return pvp->events;
211
}
212
213
-static void pvpanic_ioport_write(void *opaque, hwaddr addr, uint64_t val,
214
+static void pvpanic_write(void *opaque, hwaddr addr, uint64_t val,
215
unsigned size)
216
{
217
handle_event(val);
218
}
219
220
static const MemoryRegionOps pvpanic_ops = {
221
- .read = pvpanic_ioport_read,
222
- .write = pvpanic_ioport_write,
223
+ .read = pvpanic_read,
224
+ .write = pvpanic_write,
225
.impl = {
226
.min_access_size = 1,
227
.max_access_size = 1,
228
},
229
};
230
231
-static void pvpanic_isa_initfn(Object *obj)
232
+void pvpanic_setup_io(PVPanicState *s, DeviceState *dev, unsigned size)
233
{
234
- PVPanicState *s = ISA_PVPANIC_DEVICE(obj);
235
-
236
- memory_region_init_io(&s->io, OBJECT(s), &pvpanic_ops, s, "pvpanic", 1);
237
+ memory_region_init_io(&s->mr, OBJECT(dev), &pvpanic_ops, s, "pvpanic", size);
238
}
239
-
240
-static void pvpanic_isa_realizefn(DeviceState *dev, Error **errp)
241
-{
242
- ISADevice *d = ISA_DEVICE(dev);
243
- PVPanicState *s = ISA_PVPANIC_DEVICE(dev);
244
- FWCfgState *fw_cfg = fw_cfg_find();
245
- uint16_t *pvpanic_port;
246
-
247
- if (!fw_cfg) {
248
- return;
249
- }
250
-
251
- pvpanic_port = g_malloc(sizeof(*pvpanic_port));
252
- *pvpanic_port = cpu_to_le16(s->ioport);
253
- fw_cfg_add_file(fw_cfg, "etc/pvpanic-port", pvpanic_port,
254
- sizeof(*pvpanic_port));
255
-
256
- isa_register_ioport(d, &s->io, s->ioport);
257
-}
258
-
259
-static Property pvpanic_isa_properties[] = {
260
- DEFINE_PROP_UINT16(PVPANIC_IOPORT_PROP, PVPanicState, ioport, 0x505),
261
- DEFINE_PROP_UINT8("events", PVPanicState, events, PVPANIC_PANICKED | PVPANIC_CRASHLOADED),
262
- DEFINE_PROP_END_OF_LIST(),
263
-};
264
-
265
-static void pvpanic_isa_class_init(ObjectClass *klass, void *data)
266
-{
267
- DeviceClass *dc = DEVICE_CLASS(klass);
268
-
269
- dc->realize = pvpanic_isa_realizefn;
270
- device_class_set_props(dc, pvpanic_isa_properties);
271
- set_bit(DEVICE_CATEGORY_MISC, dc->categories);
272
-}
273
-
274
-static TypeInfo pvpanic_isa_info = {
275
- .name = TYPE_PVPANIC,
276
- .parent = TYPE_ISA_DEVICE,
277
- .instance_size = sizeof(PVPanicState),
278
- .instance_init = pvpanic_isa_initfn,
279
- .class_init = pvpanic_isa_class_init,
280
-};
281
-
282
-static void pvpanic_register_types(void)
283
-{
284
- type_register_static(&pvpanic_isa_info);
285
-}
286
-
287
-type_init(pvpanic_register_types)
288
diff --git a/hw/i386/Kconfig b/hw/i386/Kconfig
289
index XXXXXXX..XXXXXXX 100644
290
--- a/hw/i386/Kconfig
291
+++ b/hw/i386/Kconfig
292
@@ -XXX,XX +XXX,XX @@ config PC
293
imply ISA_DEBUG
294
imply PARALLEL
295
imply PCI_DEVICES
296
- imply PVPANIC
297
+ imply PVPANIC_ISA
298
imply QXL
299
imply SEV
300
imply SGA
301
diff --git a/hw/misc/Kconfig b/hw/misc/Kconfig
302
index XXXXXXX..XXXXXXX 100644
303
--- a/hw/misc/Kconfig
304
+++ b/hw/misc/Kconfig
305
@@ -XXX,XX +XXX,XX @@ config IOTKIT_SYSCTL
306
config IOTKIT_SYSINFO
307
bool
308
309
-config PVPANIC
310
+config PVPANIC_COMMON
311
+ bool
312
+
313
+config PVPANIC_ISA
314
bool
315
depends on ISA_BUS
316
+ select PVPANIC_COMMON
317
318
config AUX
319
bool
127
diff --git a/hw/misc/meson.build b/hw/misc/meson.build
320
diff --git a/hw/misc/meson.build b/hw/misc/meson.build
128
index XXXXXXX..XXXXXXX 100644
321
index XXXXXXX..XXXXXXX 100644
129
--- a/hw/misc/meson.build
322
--- a/hw/misc/meson.build
130
+++ b/hw/misc/meson.build
323
+++ b/hw/misc/meson.build
131
@@ -XXX,XX +XXX,XX @@ specific_ss.add(when: 'CONFIG_MAC_VIA', if_true: files('mac_via.c'))
324
@@ -XXX,XX +XXX,XX @@ softmmu_ss.add(when: 'CONFIG_EMC141X', if_true: files('emc141x.c'))
132
325
softmmu_ss.add(when: 'CONFIG_UNIMP', if_true: files('unimp.c'))
133
specific_ss.add(when: 'CONFIG_MIPS_CPS', if_true: files('mips_cmgcr.c', 'mips_cpc.c'))
326
softmmu_ss.add(when: 'CONFIG_EMPTY_SLOT', if_true: files('empty_slot.c'))
134
specific_ss.add(when: 'CONFIG_MIPS_ITU', if_true: files('mips_itu.c'))
327
softmmu_ss.add(when: 'CONFIG_LED', if_true: files('led.c'))
135
+
328
+softmmu_ss.add(when: 'CONFIG_PVPANIC_COMMON', if_true: files('pvpanic.c'))
136
+specific_ss.add(when: 'CONFIG_SBSA_REF', if_true: files('sbsa_ec.c'))
329
330
# ARM devices
331
softmmu_ss.add(when: 'CONFIG_PL310', if_true: files('arm_l2x0.c'))
332
@@ -XXX,XX +XXX,XX @@ softmmu_ss.add(when: 'CONFIG_IOTKIT_SYSINFO', if_true: files('iotkit-sysinfo.c')
333
softmmu_ss.add(when: 'CONFIG_ARMSSE_CPUID', if_true: files('armsse-cpuid.c'))
334
softmmu_ss.add(when: 'CONFIG_ARMSSE_MHU', if_true: files('armsse-mhu.c'))
335
336
-softmmu_ss.add(when: 'CONFIG_PVPANIC', if_true: files('pvpanic.c'))
337
+softmmu_ss.add(when: 'CONFIG_PVPANIC_ISA', if_true: files('pvpanic-isa.c'))
338
softmmu_ss.add(when: 'CONFIG_AUX', if_true: files('auxbus.c'))
339
softmmu_ss.add(when: 'CONFIG_ASPEED_SOC', if_true: files('aspeed_scu.c', 'aspeed_sdmc.c', 'aspeed_xdma.c'))
340
softmmu_ss.add(when: 'CONFIG_MSF2', if_true: files('msf2-sysreg.c'))
341
diff --git a/tests/qtest/meson.build b/tests/qtest/meson.build
342
index XXXXXXX..XXXXXXX 100644
343
--- a/tests/qtest/meson.build
344
+++ b/tests/qtest/meson.build
345
@@ -XXX,XX +XXX,XX @@ qtests_i386 = \
346
(config_host.has_key('CONFIG_LINUX') and \
347
config_all_devices.has_key('CONFIG_ISA_IPMI_BT') ? ['ipmi-bt-test'] : []) + \
348
(config_all_devices.has_key('CONFIG_WDT_IB700') ? ['wdt_ib700-test'] : []) + \
349
- (config_all_devices.has_key('CONFIG_PVPANIC') ? ['pvpanic-test'] : []) + \
350
+ (config_all_devices.has_key('CONFIG_PVPANIC_ISA') ? ['pvpanic-test'] : []) + \
351
(config_all_devices.has_key('CONFIG_HDA') ? ['intel-hda-test'] : []) + \
352
(config_all_devices.has_key('CONFIG_I82801B11') ? ['i82801b11-test'] : []) + \
353
(config_all_devices.has_key('CONFIG_IOH3420') ? ['ioh3420-test'] : []) + \
137
--
354
--
138
2.20.1
355
2.20.1
139
356
140
357
diff view generated by jsdifflib
1
Implement fp16 versions of the VFP VMLA, VMLS, VNMLS, VNMLA, VNMUL
1
From: Mihai Carabas <mihai.carabas@oracle.com>
2
instructions. (These are all the remaining ones which we implement
3
via do_vfp_3op_[hsd]p().)
4
2
3
Add PCI interface support for PVPANIC device. Create a new file pvpanic-pci.c
4
where the PCI specific routines reside and update the build system with the new
5
files and config structure.
6
7
Signed-off-by: Mihai Carabas <mihai.carabas@oracle.com>
8
Reviewed-by: Gerd Hoffmann <kraxel@redhat.com>
9
Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
10
Signed-off-by: Mihai Carabas <mihai.carabas@oracle.com>
5
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
11
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
6
Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
7
Message-id: 20200828183354.27913-5-peter.maydell@linaro.org
8
---
12
---
9
target/arm/helper.h | 1 +
13
docs/specs/pci-ids.txt | 1 +
10
target/arm/vfp.decode | 5 ++
14
include/hw/misc/pvpanic.h | 1 +
11
target/arm/vfp_helper.c | 5 ++
15
include/hw/pci/pci.h | 1 +
12
target/arm/translate-vfp.c.inc | 84 ++++++++++++++++++++++++++++++++++
16
hw/misc/pvpanic-pci.c | 94 +++++++++++++++++++++++++++++++++++++++
13
4 files changed, 95 insertions(+)
17
hw/misc/Kconfig | 6 +++
18
hw/misc/meson.build | 1 +
19
6 files changed, 104 insertions(+)
20
create mode 100644 hw/misc/pvpanic-pci.c
14
21
15
diff --git a/target/arm/helper.h b/target/arm/helper.h
22
diff --git a/docs/specs/pci-ids.txt b/docs/specs/pci-ids.txt
16
index XXXXXXX..XXXXXXX 100644
23
index XXXXXXX..XXXXXXX 100644
17
--- a/target/arm/helper.h
24
--- a/docs/specs/pci-ids.txt
18
+++ b/target/arm/helper.h
25
+++ b/docs/specs/pci-ids.txt
19
@@ -XXX,XX +XXX,XX @@ DEF_HELPER_3(vfp_maxnumd, f64, f64, f64, ptr)
26
@@ -XXX,XX +XXX,XX @@ PCI devices (other than virtio):
20
DEF_HELPER_3(vfp_minnumh, f16, f16, f16, ptr)
27
1b36:000d PCI xhci usb host adapter
21
DEF_HELPER_3(vfp_minnums, f32, f32, f32, ptr)
28
1b36:000f mdpy (mdev sample device), linux/samples/vfio-mdev/mdpy.c
22
DEF_HELPER_3(vfp_minnumd, f64, f64, f64, ptr)
29
1b36:0010 PCIe NVMe device (-device nvme)
23
+DEF_HELPER_1(vfp_negh, f16, f16)
30
+1b36:0011 PCI PVPanic device (-device pvpanic-pci)
24
DEF_HELPER_1(vfp_negs, f32, f32)
31
25
DEF_HELPER_1(vfp_negd, f64, f64)
32
All these devices are documented in docs/specs.
26
DEF_HELPER_1(vfp_abss, f32, f32)
33
27
diff --git a/target/arm/vfp.decode b/target/arm/vfp.decode
34
diff --git a/include/hw/misc/pvpanic.h b/include/hw/misc/pvpanic.h
28
index XXXXXXX..XXXXXXX 100644
35
index XXXXXXX..XXXXXXX 100644
29
--- a/target/arm/vfp.decode
36
--- a/include/hw/misc/pvpanic.h
30
+++ b/target/arm/vfp.decode
37
+++ b/include/hw/misc/pvpanic.h
31
@@ -XXX,XX +XXX,XX @@ VLDM_VSTM_dp ---- 1101 0.1 l:1 rn:4 .... 1011 imm:8 \
38
@@ -XXX,XX +XXX,XX @@
32
vd=%vd_dp p=1 u=0 w=1
39
#include "qom/object.h"
33
40
34
# 3-register VFP data-processing; bits [23,21:20,6] identify the operation.
41
#define TYPE_PVPANIC_ISA_DEVICE "pvpanic"
35
+VMLA_hp ---- 1110 0.00 .... .... 1001 .0.0 .... @vfp_dnm_s
42
+#define TYPE_PVPANIC_PCI_DEVICE "pvpanic-pci"
36
VMLA_sp ---- 1110 0.00 .... .... 1010 .0.0 .... @vfp_dnm_s
43
37
VMLA_dp ---- 1110 0.00 .... .... 1011 .0.0 .... @vfp_dnm_d
44
#define PVPANIC_IOPORT_PROP "ioport"
38
45
39
+VMLS_hp ---- 1110 0.00 .... .... 1001 .1.0 .... @vfp_dnm_s
46
diff --git a/include/hw/pci/pci.h b/include/hw/pci/pci.h
40
VMLS_sp ---- 1110 0.00 .... .... 1010 .1.0 .... @vfp_dnm_s
41
VMLS_dp ---- 1110 0.00 .... .... 1011 .1.0 .... @vfp_dnm_d
42
43
+VNMLS_hp ---- 1110 0.01 .... .... 1001 .0.0 .... @vfp_dnm_s
44
VNMLS_sp ---- 1110 0.01 .... .... 1010 .0.0 .... @vfp_dnm_s
45
VNMLS_dp ---- 1110 0.01 .... .... 1011 .0.0 .... @vfp_dnm_d
46
47
+VNMLA_hp ---- 1110 0.01 .... .... 1001 .1.0 .... @vfp_dnm_s
48
VNMLA_sp ---- 1110 0.01 .... .... 1010 .1.0 .... @vfp_dnm_s
49
VNMLA_dp ---- 1110 0.01 .... .... 1011 .1.0 .... @vfp_dnm_d
50
51
@@ -XXX,XX +XXX,XX @@ VMUL_hp ---- 1110 0.10 .... .... 1001 .0.0 .... @vfp_dnm_s
52
VMUL_sp ---- 1110 0.10 .... .... 1010 .0.0 .... @vfp_dnm_s
53
VMUL_dp ---- 1110 0.10 .... .... 1011 .0.0 .... @vfp_dnm_d
54
55
+VNMUL_hp ---- 1110 0.10 .... .... 1001 .1.0 .... @vfp_dnm_s
56
VNMUL_sp ---- 1110 0.10 .... .... 1010 .1.0 .... @vfp_dnm_s
57
VNMUL_dp ---- 1110 0.10 .... .... 1011 .1.0 .... @vfp_dnm_d
58
59
diff --git a/target/arm/vfp_helper.c b/target/arm/vfp_helper.c
60
index XXXXXXX..XXXXXXX 100644
47
index XXXXXXX..XXXXXXX 100644
61
--- a/target/arm/vfp_helper.c
48
--- a/include/hw/pci/pci.h
62
+++ b/target/arm/vfp_helper.c
49
+++ b/include/hw/pci/pci.h
63
@@ -XXX,XX +XXX,XX @@ VFP_BINOP(minnum)
50
@@ -XXX,XX +XXX,XX @@ extern bool pci_available;
64
VFP_BINOP(maxnum)
51
#define PCI_DEVICE_ID_REDHAT_PCIE_BRIDGE 0x000e
65
#undef VFP_BINOP
52
#define PCI_DEVICE_ID_REDHAT_MDPY 0x000f
66
53
#define PCI_DEVICE_ID_REDHAT_NVME 0x0010
67
+dh_ctype_f16 VFP_HELPER(neg, h)(dh_ctype_f16 a)
54
+#define PCI_DEVICE_ID_REDHAT_PVPANIC 0x0011
55
#define PCI_DEVICE_ID_REDHAT_QXL 0x0100
56
57
#define FMT_PCIBUS PRIx64
58
diff --git a/hw/misc/pvpanic-pci.c b/hw/misc/pvpanic-pci.c
59
new file mode 100644
60
index XXXXXXX..XXXXXXX
61
--- /dev/null
62
+++ b/hw/misc/pvpanic-pci.c
63
@@ -XXX,XX +XXX,XX @@
64
+/*
65
+ * QEMU simulated PCI pvpanic device.
66
+ *
67
+ * Copyright (C) 2020 Oracle
68
+ *
69
+ * Authors:
70
+ * Mihai Carabas <mihai.carabas@oracle.com>
71
+ *
72
+ * This work is licensed under the terms of the GNU GPL, version 2 or later.
73
+ * See the COPYING file in the top-level directory.
74
+ *
75
+ */
76
+
77
+#include "qemu/osdep.h"
78
+#include "qemu/log.h"
79
+#include "qemu/module.h"
80
+#include "sysemu/runstate.h"
81
+
82
+#include "hw/nvram/fw_cfg.h"
83
+#include "hw/qdev-properties.h"
84
+#include "migration/vmstate.h"
85
+#include "hw/misc/pvpanic.h"
86
+#include "qom/object.h"
87
+#include "hw/pci/pci.h"
88
+
89
+OBJECT_DECLARE_SIMPLE_TYPE(PVPanicPCIState, PVPANIC_PCI_DEVICE)
90
+
91
+/*
92
+ * PVPanicPCIState for PCI device
93
+ */
94
+typedef struct PVPanicPCIState {
95
+ PCIDevice dev;
96
+ PVPanicState pvpanic;
97
+} PVPanicPCIState;
98
+
99
+static const VMStateDescription vmstate_pvpanic_pci = {
100
+ .name = "pvpanic-pci",
101
+ .version_id = 1,
102
+ .minimum_version_id = 1,
103
+ .fields = (VMStateField[]) {
104
+ VMSTATE_PCI_DEVICE(dev, PVPanicPCIState),
105
+ VMSTATE_END_OF_LIST()
106
+ }
107
+};
108
+
109
+static void pvpanic_pci_realizefn(PCIDevice *dev, Error **errp)
68
+{
110
+{
69
+ return float16_chs(a);
111
+ PVPanicPCIState *s = PVPANIC_PCI_DEVICE(dev);
112
+ PVPanicState *ps = &s->pvpanic;
113
+
114
+ pvpanic_setup_io(&s->pvpanic, DEVICE(s), 2);
115
+
116
+ pci_register_bar(dev, 0, PCI_BASE_ADDRESS_SPACE_MEMORY, &ps->mr);
70
+}
117
+}
71
+
118
+
72
float32 VFP_HELPER(neg, s)(float32 a)
119
+static Property pvpanic_pci_properties[] = {
73
{
120
+ DEFINE_PROP_UINT8("events", PVPanicPCIState, pvpanic.events, PVPANIC_PANICKED | PVPANIC_CRASHLOADED),
74
return float32_chs(a);
121
+ DEFINE_PROP_END_OF_LIST(),
75
diff --git a/target/arm/translate-vfp.c.inc b/target/arm/translate-vfp.c.inc
122
+};
76
index XXXXXXX..XXXXXXX 100644
123
+
77
--- a/target/arm/translate-vfp.c.inc
124
+static void pvpanic_pci_class_init(ObjectClass *klass, void *data)
78
+++ b/target/arm/translate-vfp.c.inc
79
@@ -XXX,XX +XXX,XX @@ static bool do_vfp_2op_dp(DisasContext *s, VFPGen2OpDPFn *fn, int vd, int vm)
80
return true;
81
}
82
83
+static void gen_VMLA_hp(TCGv_i32 vd, TCGv_i32 vn, TCGv_i32 vm, TCGv_ptr fpst)
84
+{
125
+{
85
+ /* Note that order of inputs to the add matters for NaNs */
126
+ DeviceClass *dc = DEVICE_CLASS(klass);
86
+ TCGv_i32 tmp = tcg_temp_new_i32();
127
+ PCIDeviceClass *pc = PCI_DEVICE_CLASS(klass);
87
+
128
+
88
+ gen_helper_vfp_mulh(tmp, vn, vm, fpst);
129
+ device_class_set_props(dc, pvpanic_pci_properties);
89
+ gen_helper_vfp_addh(vd, vd, tmp, fpst);
130
+
90
+ tcg_temp_free_i32(tmp);
131
+ pc->realize = pvpanic_pci_realizefn;
132
+ pc->vendor_id = PCI_VENDOR_ID_REDHAT;
133
+ pc->device_id = PCI_DEVICE_ID_REDHAT_PVPANIC;
134
+ pc->revision = 1;
135
+ pc->class_id = PCI_CLASS_SYSTEM_OTHER;
136
+ dc->vmsd = &vmstate_pvpanic_pci;
137
+
138
+ set_bit(DEVICE_CATEGORY_MISC, dc->categories);
91
+}
139
+}
92
+
140
+
93
+static bool trans_VMLA_hp(DisasContext *s, arg_VMLA_sp *a)
141
+static TypeInfo pvpanic_pci_info = {
142
+ .name = TYPE_PVPANIC_PCI_DEVICE,
143
+ .parent = TYPE_PCI_DEVICE,
144
+ .instance_size = sizeof(PVPanicPCIState),
145
+ .class_init = pvpanic_pci_class_init,
146
+ .interfaces = (InterfaceInfo[]) {
147
+ { INTERFACE_CONVENTIONAL_PCI_DEVICE },
148
+ { }
149
+ }
150
+};
151
+
152
+static void pvpanic_register_types(void)
94
+{
153
+{
95
+ return do_vfp_3op_hp(s, gen_VMLA_hp, a->vd, a->vn, a->vm, true);
154
+ type_register_static(&pvpanic_pci_info);
96
+}
155
+}
97
+
156
+
98
static void gen_VMLA_sp(TCGv_i32 vd, TCGv_i32 vn, TCGv_i32 vm, TCGv_ptr fpst)
157
+type_init(pvpanic_register_types);
99
{
158
diff --git a/hw/misc/Kconfig b/hw/misc/Kconfig
100
/* Note that order of inputs to the add matters for NaNs */
159
index XXXXXXX..XXXXXXX 100644
101
@@ -XXX,XX +XXX,XX @@ static bool trans_VMLA_dp(DisasContext *s, arg_VMLA_dp *a)
160
--- a/hw/misc/Kconfig
102
return do_vfp_3op_dp(s, gen_VMLA_dp, a->vd, a->vn, a->vm, true);
161
+++ b/hw/misc/Kconfig
103
}
162
@@ -XXX,XX +XXX,XX @@ config IOTKIT_SYSINFO
104
163
config PVPANIC_COMMON
105
+static void gen_VMLS_hp(TCGv_i32 vd, TCGv_i32 vn, TCGv_i32 vm, TCGv_ptr fpst)
164
bool
106
+{
165
107
+ /*
166
+config PVPANIC_PCI
108
+ * VMLS: vd = vd + -(vn * vm)
167
+ bool
109
+ * Note that order of inputs to the add matters for NaNs.
168
+ default y if PCI_DEVICES
110
+ */
169
+ depends on PCI
111
+ TCGv_i32 tmp = tcg_temp_new_i32();
170
+ select PVPANIC_COMMON
112
+
171
+
113
+ gen_helper_vfp_mulh(tmp, vn, vm, fpst);
172
config PVPANIC_ISA
114
+ gen_helper_vfp_negh(tmp, tmp);
173
bool
115
+ gen_helper_vfp_addh(vd, vd, tmp, fpst);
174
depends on ISA_BUS
116
+ tcg_temp_free_i32(tmp);
175
diff --git a/hw/misc/meson.build b/hw/misc/meson.build
117
+}
176
index XXXXXXX..XXXXXXX 100644
118
+
177
--- a/hw/misc/meson.build
119
+static bool trans_VMLS_hp(DisasContext *s, arg_VMLS_sp *a)
178
+++ b/hw/misc/meson.build
120
+{
179
@@ -XXX,XX +XXX,XX @@ softmmu_ss.add(when: 'CONFIG_ARMSSE_CPUID', if_true: files('armsse-cpuid.c'))
121
+ return do_vfp_3op_hp(s, gen_VMLS_hp, a->vd, a->vn, a->vm, true);
180
softmmu_ss.add(when: 'CONFIG_ARMSSE_MHU', if_true: files('armsse-mhu.c'))
122
+}
181
123
+
182
softmmu_ss.add(when: 'CONFIG_PVPANIC_ISA', if_true: files('pvpanic-isa.c'))
124
static void gen_VMLS_sp(TCGv_i32 vd, TCGv_i32 vn, TCGv_i32 vm, TCGv_ptr fpst)
183
+softmmu_ss.add(when: 'CONFIG_PVPANIC_PCI', if_true: files('pvpanic-pci.c'))
125
{
184
softmmu_ss.add(when: 'CONFIG_AUX', if_true: files('auxbus.c'))
126
/*
185
softmmu_ss.add(when: 'CONFIG_ASPEED_SOC', if_true: files('aspeed_scu.c', 'aspeed_sdmc.c', 'aspeed_xdma.c'))
127
@@ -XXX,XX +XXX,XX @@ static bool trans_VMLS_dp(DisasContext *s, arg_VMLS_dp *a)
186
softmmu_ss.add(when: 'CONFIG_MSF2', if_true: files('msf2-sysreg.c'))
128
return do_vfp_3op_dp(s, gen_VMLS_dp, a->vd, a->vn, a->vm, true);
129
}
130
131
+static void gen_VNMLS_hp(TCGv_i32 vd, TCGv_i32 vn, TCGv_i32 vm, TCGv_ptr fpst)
132
+{
133
+ /*
134
+ * VNMLS: -fd + (fn * fm)
135
+ * Note that it isn't valid to replace (-A + B) with (B - A) or similar
136
+ * plausible looking simplifications because this will give wrong results
137
+ * for NaNs.
138
+ */
139
+ TCGv_i32 tmp = tcg_temp_new_i32();
140
+
141
+ gen_helper_vfp_mulh(tmp, vn, vm, fpst);
142
+ gen_helper_vfp_negh(vd, vd);
143
+ gen_helper_vfp_addh(vd, vd, tmp, fpst);
144
+ tcg_temp_free_i32(tmp);
145
+}
146
+
147
+static bool trans_VNMLS_hp(DisasContext *s, arg_VNMLS_sp *a)
148
+{
149
+ return do_vfp_3op_hp(s, gen_VNMLS_hp, a->vd, a->vn, a->vm, true);
150
+}
151
+
152
static void gen_VNMLS_sp(TCGv_i32 vd, TCGv_i32 vn, TCGv_i32 vm, TCGv_ptr fpst)
153
{
154
/*
155
@@ -XXX,XX +XXX,XX @@ static bool trans_VNMLS_dp(DisasContext *s, arg_VNMLS_dp *a)
156
return do_vfp_3op_dp(s, gen_VNMLS_dp, a->vd, a->vn, a->vm, true);
157
}
158
159
+static void gen_VNMLA_hp(TCGv_i32 vd, TCGv_i32 vn, TCGv_i32 vm, TCGv_ptr fpst)
160
+{
161
+ /* VNMLA: -fd + -(fn * fm) */
162
+ TCGv_i32 tmp = tcg_temp_new_i32();
163
+
164
+ gen_helper_vfp_mulh(tmp, vn, vm, fpst);
165
+ gen_helper_vfp_negh(tmp, tmp);
166
+ gen_helper_vfp_negh(vd, vd);
167
+ gen_helper_vfp_addh(vd, vd, tmp, fpst);
168
+ tcg_temp_free_i32(tmp);
169
+}
170
+
171
+static bool trans_VNMLA_hp(DisasContext *s, arg_VNMLA_sp *a)
172
+{
173
+ return do_vfp_3op_hp(s, gen_VNMLA_hp, a->vd, a->vn, a->vm, true);
174
+}
175
+
176
static void gen_VNMLA_sp(TCGv_i32 vd, TCGv_i32 vn, TCGv_i32 vm, TCGv_ptr fpst)
177
{
178
/* VNMLA: -fd + -(fn * fm) */
179
@@ -XXX,XX +XXX,XX @@ static bool trans_VMUL_dp(DisasContext *s, arg_VMUL_dp *a)
180
return do_vfp_3op_dp(s, gen_helper_vfp_muld, a->vd, a->vn, a->vm, false);
181
}
182
183
+static void gen_VNMUL_hp(TCGv_i32 vd, TCGv_i32 vn, TCGv_i32 vm, TCGv_ptr fpst)
184
+{
185
+ /* VNMUL: -(fn * fm) */
186
+ gen_helper_vfp_mulh(vd, vn, vm, fpst);
187
+ gen_helper_vfp_negh(vd, vd);
188
+}
189
+
190
+static bool trans_VNMUL_hp(DisasContext *s, arg_VNMUL_sp *a)
191
+{
192
+ return do_vfp_3op_hp(s, gen_VNMUL_hp, a->vd, a->vn, a->vm, false);
193
+}
194
+
195
static void gen_VNMUL_sp(TCGv_i32 vd, TCGv_i32 vn, TCGv_i32 vm, TCGv_ptr fpst)
196
{
197
/* VNMUL: -(fn * fm) */
198
--
187
--
199
2.20.1
188
2.20.1
200
189
201
190
diff view generated by jsdifflib
1
Convert the Neon float-point VMAX and VMIN insns over to using
1
From: Mihai Carabas <mihai.carabas@oracle.com>
2
a gvec helper, and use this to implement the fp16 case.
3
2
3
Add pvpanic PCI device support details in docs/specs/pvpanic.txt.
4
5
Signed-off-by: Mihai Carabas <mihai.carabas@oracle.com>
6
Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
4
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
7
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
5
Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
6
Message-id: 20200828183354.27913-29-peter.maydell@linaro.org
7
---
8
---
8
target/arm/helper.h | 6 ++++++
9
docs/specs/pvpanic.txt | 13 ++++++++++++-
9
target/arm/vec_helper.c | 6 ++++++
10
1 file changed, 12 insertions(+), 1 deletion(-)
10
target/arm/translate-neon.c.inc | 5 ++---
11
3 files changed, 14 insertions(+), 3 deletions(-)
12
11
13
diff --git a/target/arm/helper.h b/target/arm/helper.h
12
diff --git a/docs/specs/pvpanic.txt b/docs/specs/pvpanic.txt
14
index XXXXXXX..XXXXXXX 100644
13
index XXXXXXX..XXXXXXX 100644
15
--- a/target/arm/helper.h
14
--- a/docs/specs/pvpanic.txt
16
+++ b/target/arm/helper.h
15
+++ b/docs/specs/pvpanic.txt
17
@@ -XXX,XX +XXX,XX @@ DEF_HELPER_FLAGS_5(gvec_facge_s, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, ptr, i32)
16
@@ -XXX,XX +XXX,XX @@
18
DEF_HELPER_FLAGS_5(gvec_facgt_h, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, ptr, i32)
17
PVPANIC DEVICE
19
DEF_HELPER_FLAGS_5(gvec_facgt_s, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, ptr, i32)
18
==============
20
19
21
+DEF_HELPER_FLAGS_5(gvec_fmax_h, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, ptr, i32)
20
-pvpanic device is a simulated ISA device, through which a guest panic
22
+DEF_HELPER_FLAGS_5(gvec_fmax_s, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, ptr, i32)
21
+pvpanic device is a simulated device, through which a guest panic
22
event is sent to qemu, and a QMP event is generated. This allows
23
management apps (e.g. libvirt) to be notified and respond to the event.
24
25
@@ -XXX,XX +XXX,XX @@ The management app has the option of waiting for GUEST_PANICKED events,
26
and/or polling for guest-panicked RunState, to learn when the pvpanic
27
device has fired a panic event.
28
29
+The pvpanic device can be implemented as an ISA device (using IOPORT) or as a
30
+PCI device.
23
+
31
+
24
+DEF_HELPER_FLAGS_5(gvec_fmin_h, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, ptr, i32)
32
ISA Interface
25
+DEF_HELPER_FLAGS_5(gvec_fmin_s, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, ptr, i32)
33
-------------
34
35
@@ -XXX,XX +XXX,XX @@ bit 1: a guest panic has happened and will be handled by the guest;
36
the host should record it or report it, but should not affect
37
the execution of the guest.
38
39
+PCI Interface
40
+-------------
26
+
41
+
27
DEF_HELPER_FLAGS_5(gvec_ftsmul_h, TCG_CALL_NO_RWG,
42
+The PCI interface is similar to the ISA interface except that it uses an MMIO
28
void, ptr, ptr, ptr, ptr, i32)
43
+address space provided by its BAR0, 1 byte long. Any machine with a PCI bus
29
DEF_HELPER_FLAGS_5(gvec_ftsmul_s, TCG_CALL_NO_RWG,
44
+can enable a pvpanic device by adding '-device pvpanic-pci' to the command
30
diff --git a/target/arm/vec_helper.c b/target/arm/vec_helper.c
45
+line.
31
index XXXXXXX..XXXXXXX 100644
32
--- a/target/arm/vec_helper.c
33
+++ b/target/arm/vec_helper.c
34
@@ -XXX,XX +XXX,XX @@ DO_3OP(gvec_facge_s, float32_acge, float32)
35
DO_3OP(gvec_facgt_h, float16_acgt, float16)
36
DO_3OP(gvec_facgt_s, float32_acgt, float32)
37
38
+DO_3OP(gvec_fmax_h, float16_max, float16)
39
+DO_3OP(gvec_fmax_s, float32_max, float32)
40
+
46
+
41
+DO_3OP(gvec_fmin_h, float16_min, float16)
47
ACPI Interface
42
+DO_3OP(gvec_fmin_s, float32_min, float32)
48
--------------
43
+
49
44
#ifdef TARGET_AARCH64
45
46
DO_3OP(gvec_recps_h, helper_recpsf_f16, float16)
47
diff --git a/target/arm/translate-neon.c.inc b/target/arm/translate-neon.c.inc
48
index XXXXXXX..XXXXXXX 100644
49
--- a/target/arm/translate-neon.c.inc
50
+++ b/target/arm/translate-neon.c.inc
51
@@ -XXX,XX +XXX,XX @@ DO_3S_FP_GVEC(VCGE, gen_helper_gvec_fcge_s, gen_helper_gvec_fcge_h)
52
DO_3S_FP_GVEC(VCGT, gen_helper_gvec_fcgt_s, gen_helper_gvec_fcgt_h)
53
DO_3S_FP_GVEC(VACGE, gen_helper_gvec_facge_s, gen_helper_gvec_facge_h)
54
DO_3S_FP_GVEC(VACGT, gen_helper_gvec_facgt_s, gen_helper_gvec_facgt_h)
55
+DO_3S_FP_GVEC(VMAX, gen_helper_gvec_fmax_s, gen_helper_gvec_fmax_h)
56
+DO_3S_FP_GVEC(VMIN, gen_helper_gvec_fmin_s, gen_helper_gvec_fmin_h)
57
58
/*
59
* For all the functions using this macro, size == 1 means fp16,
60
@@ -XXX,XX +XXX,XX @@ DO_3S_FP_GVEC(VACGT, gen_helper_gvec_facgt_s, gen_helper_gvec_facgt_h)
61
return do_3same_fp(s, a, FUNC, READS_VD); \
62
}
63
64
-DO_3S_FP(VMAX, gen_helper_vfp_maxs, false)
65
-DO_3S_FP(VMIN, gen_helper_vfp_mins, false)
66
-
67
static void gen_VMLA_fp_3s(TCGv_i32 vd, TCGv_i32 vn, TCGv_i32 vm,
68
TCGv_ptr fpstatus)
69
{
70
--
50
--
71
2.20.1
51
2.20.1
72
52
73
53
diff view generated by jsdifflib
1
Convert the Neon float-integer VCVT insns to gvec, and use this
1
From: Mihai Carabas <mihai.carabas@oracle.com>
2
to implement fp16 support for them.
3
2
4
Note that unlike the VFP int<->fp16 VCVT insns we converted
3
Add a test case for pvpanic-pci device. The scenario is the same as pvpanic
5
earlier and which convert to/from a 32-bit integer, these
4
ISA device, but is using the PCI bus.
6
Neon insns convert to/from 16-bit integers. So we can use
7
the existing vfp conversion helpers for the f32<->u32/i32
8
case but need to provide our own for f16<->u16/i16.
9
5
6
Signed-off-by: Mihai Carabas <mihai.carabas@oracle.com>
7
Acked-by: Thomas Huth <thuth@redhat.com>
8
Reviewed-by: Peter Maydell <peter.maydell@linaro.org>
9
Signed-off-by: Mihai Carabas <mihai.carabas@oracle.com>
10
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
10
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
11
Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
12
Message-id: 20200828183354.27913-37-peter.maydell@linaro.org
13
---
11
---
14
target/arm/helper.h | 9 +++++++++
12
tests/qtest/pvpanic-pci-test.c | 94 ++++++++++++++++++++++++++++++++++
15
target/arm/vec_helper.c | 29 +++++++++++++++++++++++++++++
13
tests/qtest/meson.build | 1 +
16
target/arm/translate-neon.c.inc | 15 ++++-----------
14
2 files changed, 95 insertions(+)
17
3 files changed, 42 insertions(+), 11 deletions(-)
15
create mode 100644 tests/qtest/pvpanic-pci-test.c
18
16
19
diff --git a/target/arm/helper.h b/target/arm/helper.h
17
diff --git a/tests/qtest/pvpanic-pci-test.c b/tests/qtest/pvpanic-pci-test.c
20
index XXXXXXX..XXXXXXX 100644
18
new file mode 100644
21
--- a/target/arm/helper.h
19
index XXXXXXX..XXXXXXX
22
+++ b/target/arm/helper.h
20
--- /dev/null
23
@@ -XXX,XX +XXX,XX @@ DEF_HELPER_FLAGS_5(neon_padds, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, ptr, i32)
21
+++ b/tests/qtest/pvpanic-pci-test.c
24
DEF_HELPER_FLAGS_5(neon_pmaxs, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, ptr, i32)
22
@@ -XXX,XX +XXX,XX @@
25
DEF_HELPER_FLAGS_5(neon_pmins, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, ptr, i32)
23
+/*
26
24
+ * QTest testcase for PV Panic PCI device
27
+DEF_HELPER_FLAGS_4(gvec_sstoh, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, i32)
25
+ *
28
+DEF_HELPER_FLAGS_4(gvec_sitos, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, i32)
26
+ * Copyright (C) 2020 Oracle
29
+DEF_HELPER_FLAGS_4(gvec_ustoh, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, i32)
27
+ *
30
+DEF_HELPER_FLAGS_4(gvec_uitos, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, i32)
28
+ * Authors:
31
+DEF_HELPER_FLAGS_4(gvec_tosszh, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, i32)
29
+ * Mihai Carabas <mihai.carabas@oracle.com>
32
+DEF_HELPER_FLAGS_4(gvec_tosizs, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, i32)
30
+ *
33
+DEF_HELPER_FLAGS_4(gvec_touszh, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, i32)
31
+ * This work is licensed under the terms of the GNU GPL, version 2 or later.
34
+DEF_HELPER_FLAGS_4(gvec_touizs, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, i32)
32
+ * See the COPYING file in the top-level directory.
33
+ *
34
+ */
35
+
35
+
36
DEF_HELPER_FLAGS_4(gvec_frecpe_h, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, i32)
36
+#include "qemu/osdep.h"
37
DEF_HELPER_FLAGS_4(gvec_frecpe_s, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, i32)
37
+#include "libqos/libqtest.h"
38
DEF_HELPER_FLAGS_4(gvec_frecpe_d, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, i32)
38
+#include "qapi/qmp/qdict.h"
39
diff --git a/target/arm/vec_helper.c b/target/arm/vec_helper.c
39
+#include "libqos/pci.h"
40
index XXXXXXX..XXXXXXX 100644
40
+#include "libqos/pci-pc.h"
41
--- a/target/arm/vec_helper.c
41
+#include "hw/pci/pci_regs.h"
42
+++ b/target/arm/vec_helper.c
42
+
43
@@ -XXX,XX +XXX,XX @@ static uint32_t float32_acgt(float32 op1, float32 op2, float_status *stat)
43
+static void test_panic_nopause(void)
44
return -float32_lt(float32_abs(op2), float32_abs(op1), stat);
45
}
46
47
+static int16_t vfp_tosszh(float16 x, void *fpstp)
48
+{
44
+{
49
+ float_status *fpst = fpstp;
45
+ uint8_t val;
50
+ if (float16_is_any_nan(x)) {
46
+ QDict *response, *data;
51
+ float_raise(float_flag_invalid, fpst);
47
+ QTestState *qts;
52
+ return 0;
48
+ QPCIBus *pcibus;
53
+ }
49
+ QPCIDevice *dev;
54
+ return float16_to_int16_round_to_zero(x, fpst);
50
+ QPCIBar bar;
51
+
52
+ qts = qtest_init("-device pvpanic-pci,addr=04.0 -action panic=none");
53
+ pcibus = qpci_new_pc(qts, NULL);
54
+ dev = qpci_device_find(pcibus, QPCI_DEVFN(0x4, 0x0));
55
+ qpci_device_enable(dev);
56
+ bar = qpci_iomap(dev, 0, NULL);
57
+
58
+ qpci_memread(dev, bar, 0, &val, sizeof(val));
59
+ g_assert_cmpuint(val, ==, 3);
60
+
61
+ val = 1;
62
+ qpci_memwrite(dev, bar, 0, &val, sizeof(val));
63
+
64
+ response = qtest_qmp_eventwait_ref(qts, "GUEST_PANICKED");
65
+ g_assert(qdict_haskey(response, "data"));
66
+ data = qdict_get_qdict(response, "data");
67
+ g_assert(qdict_haskey(data, "action"));
68
+ g_assert_cmpstr(qdict_get_str(data, "action"), ==, "run");
69
+ qobject_unref(response);
70
+
71
+ qtest_quit(qts);
55
+}
72
+}
56
+
73
+
57
+static uint16_t vfp_touszh(float16 x, void *fpstp)
74
+static void test_panic(void)
58
+{
75
+{
59
+ float_status *fpst = fpstp;
76
+ uint8_t val;
60
+ if (float16_is_any_nan(x)) {
77
+ QDict *response, *data;
61
+ float_raise(float_flag_invalid, fpst);
78
+ QTestState *qts;
62
+ return 0;
79
+ QPCIBus *pcibus;
63
+ }
80
+ QPCIDevice *dev;
64
+ return float16_to_uint16_round_to_zero(x, fpst);
81
+ QPCIBar bar;
82
+
83
+ qts = qtest_init("-device pvpanic-pci,addr=04.0 -action panic=pause");
84
+ pcibus = qpci_new_pc(qts, NULL);
85
+ dev = qpci_device_find(pcibus, QPCI_DEVFN(0x4, 0x0));
86
+ qpci_device_enable(dev);
87
+ bar = qpci_iomap(dev, 0, NULL);
88
+
89
+ qpci_memread(dev, bar, 0, &val, sizeof(val));
90
+ g_assert_cmpuint(val, ==, 3);
91
+
92
+ val = 1;
93
+ qpci_memwrite(dev, bar, 0, &val, sizeof(val));
94
+
95
+ response = qtest_qmp_eventwait_ref(qts, "GUEST_PANICKED");
96
+ g_assert(qdict_haskey(response, "data"));
97
+ data = qdict_get_qdict(response, "data");
98
+ g_assert(qdict_haskey(data, "action"));
99
+ g_assert_cmpstr(qdict_get_str(data, "action"), ==, "pause");
100
+ qobject_unref(response);
101
+
102
+ qtest_quit(qts);
65
+}
103
+}
66
+
104
+
67
#define DO_2OP(NAME, FUNC, TYPE) \
105
+int main(int argc, char **argv)
68
void HELPER(NAME)(void *vd, void *vn, void *stat, uint32_t desc) \
106
+{
69
{ \
107
+ int ret;
70
@@ -XXX,XX +XXX,XX @@ DO_2OP(gvec_frsqrte_h, helper_rsqrte_f16, float16)
71
DO_2OP(gvec_frsqrte_s, helper_rsqrte_f32, float32)
72
DO_2OP(gvec_frsqrte_d, helper_rsqrte_f64, float64)
73
74
+DO_2OP(gvec_sitos, helper_vfp_sitos, int32_t)
75
+DO_2OP(gvec_uitos, helper_vfp_uitos, uint32_t)
76
+DO_2OP(gvec_tosizs, helper_vfp_tosizs, float32)
77
+DO_2OP(gvec_touizs, helper_vfp_touizs, float32)
78
+DO_2OP(gvec_sstoh, int16_to_float16, int16_t)
79
+DO_2OP(gvec_ustoh, uint16_to_float16, uint16_t)
80
+DO_2OP(gvec_tosszh, vfp_tosszh, float16)
81
+DO_2OP(gvec_touszh, vfp_touszh, float16)
82
+
108
+
83
#define WRAP_CMP0_FWD(FN, CMPOP, TYPE) \
109
+ g_test_init(&argc, &argv, NULL);
84
static TYPE TYPE##_##FN##0(TYPE op, float_status *stat) \
110
+ qtest_add_func("/pvpanic-pci/panic", test_panic);
85
{ \
111
+ qtest_add_func("/pvpanic-pci/panic-nopause", test_panic_nopause);
86
diff --git a/target/arm/translate-neon.c.inc b/target/arm/translate-neon.c.inc
112
+
113
+ ret = g_test_run();
114
+
115
+ return ret;
116
+}
117
diff --git a/tests/qtest/meson.build b/tests/qtest/meson.build
87
index XXXXXXX..XXXXXXX 100644
118
index XXXXXXX..XXXXXXX 100644
88
--- a/target/arm/translate-neon.c.inc
119
--- a/tests/qtest/meson.build
89
+++ b/target/arm/translate-neon.c.inc
120
+++ b/tests/qtest/meson.build
90
@@ -XXX,XX +XXX,XX @@ static bool do_2misc_fp(DisasContext *s, arg_2misc *a,
121
@@ -XXX,XX +XXX,XX @@ qtests_i386 = \
91
return true;
122
config_all_devices.has_key('CONFIG_ISA_IPMI_BT') ? ['ipmi-bt-test'] : []) + \
92
}
123
(config_all_devices.has_key('CONFIG_WDT_IB700') ? ['wdt_ib700-test'] : []) + \
93
124
(config_all_devices.has_key('CONFIG_PVPANIC_ISA') ? ['pvpanic-test'] : []) + \
94
-#define DO_2MISC_FP(INSN, FUNC) \
125
+ (config_all_devices.has_key('CONFIG_PVPANIC_PCI') ? ['pvpanic-pci-test'] : []) + \
95
- static bool trans_##INSN(DisasContext *s, arg_2misc *a) \
126
(config_all_devices.has_key('CONFIG_HDA') ? ['intel-hda-test'] : []) + \
96
- { \
127
(config_all_devices.has_key('CONFIG_I82801B11') ? ['i82801b11-test'] : []) + \
97
- return do_2misc_fp(s, a, FUNC); \
128
(config_all_devices.has_key('CONFIG_IOH3420') ? ['ioh3420-test'] : []) + \
98
- }
99
-
100
-DO_2MISC_FP(VCVT_FS, gen_helper_vfp_sitos)
101
-DO_2MISC_FP(VCVT_FU, gen_helper_vfp_uitos)
102
-DO_2MISC_FP(VCVT_SF, gen_helper_vfp_tosizs)
103
-DO_2MISC_FP(VCVT_UF, gen_helper_vfp_touizs)
104
-
105
#define DO_2MISC_FP_VEC(INSN, HFUNC, SFUNC) \
106
static void gen_##INSN(unsigned vece, uint32_t rd_ofs, \
107
uint32_t rm_ofs, \
108
@@ -XXX,XX +XXX,XX @@ DO_2MISC_FP_VEC(VCGE0_F, gen_helper_gvec_fcge0_h, gen_helper_gvec_fcge0_s)
109
DO_2MISC_FP_VEC(VCEQ0_F, gen_helper_gvec_fceq0_h, gen_helper_gvec_fceq0_s)
110
DO_2MISC_FP_VEC(VCLT0_F, gen_helper_gvec_fclt0_h, gen_helper_gvec_fclt0_s)
111
DO_2MISC_FP_VEC(VCLE0_F, gen_helper_gvec_fcle0_h, gen_helper_gvec_fcle0_s)
112
+DO_2MISC_FP_VEC(VCVT_FS, gen_helper_gvec_sstoh, gen_helper_gvec_sitos)
113
+DO_2MISC_FP_VEC(VCVT_FU, gen_helper_gvec_ustoh, gen_helper_gvec_uitos)
114
+DO_2MISC_FP_VEC(VCVT_SF, gen_helper_gvec_tosszh, gen_helper_gvec_tosizs)
115
+DO_2MISC_FP_VEC(VCVT_UF, gen_helper_gvec_touszh, gen_helper_gvec_touizs)
116
117
static bool trans_VRINTX(DisasContext *s, arg_2misc *a)
118
{
119
--
129
--
120
2.20.1
130
2.20.1
121
131
122
132
diff view generated by jsdifflib
1
Implmeent VFP fp16 support for simple binary-operator VFP insns VADD,
1
The ptimer API currently provides two methods for setting the period:
2
VSUB, VMUL, VDIV, VMINNM and VMAXNM:
2
ptimer_set_period(), which takes a period in nanoseconds, and
3
ptimer_set_freq(), which takes a frequency in Hz. Neither of these
4
lines up nicely with the Clock API, because although both the Clock
5
and the ptimer track the frequency using a representation of whole
6
and fractional nanoseconds, conversion via either period-in-ns or
7
frequency-in-Hz will introduce a rounding error.
3
8
4
* make the VFP_BINOP() macro generate float16 helpers as well as
9
Add a new function ptimer_set_period_from_clock() which takes the
5
float32 and float64
10
Clock object directly to avoid the rounding issues. This includes a
6
* implement a do_vfp_3op_hp() function similar to the existing
11
facility for the user to specify that there is a frequency divider
7
do_vfp_3op_sp()
12
between the Clock proper and the timer, as some timer devices like
8
* add decode for the half-precision insn patterns
13
the CMSDK APB dualtimer need this.
9
14
10
Note that the VFP_BINOP macro use creates a couple of unused helper
15
To avoid having to drag in clock.h from ptimer.h we add the Clock
11
functions vfp_maxh and vfp_minh, but they're small so it's not worth
16
type to typedefs.h.
12
splitting the BINOP operations into "needs halfprec" and "no
13
halfprec" groups.
14
17
15
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
18
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
16
Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
19
Reviewed-by: Luc Michel <luc@lmichel.fr>
17
Message-id: 20200828183354.27913-4-peter.maydell@linaro.org
20
Tested-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
21
Reviewed-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
22
Message-id: 20210128114145.20536-2-peter.maydell@linaro.org
23
Message-id: 20210121190622.22000-2-peter.maydell@linaro.org
18
---
24
---
19
target/arm/helper.h | 8 ++++
25
include/hw/ptimer.h | 22 ++++++++++++++++++++++
20
target/arm/vfp-uncond.decode | 3 ++
26
include/qemu/typedefs.h | 1 +
21
target/arm/vfp.decode | 4 ++
27
hw/core/ptimer.c | 34 ++++++++++++++++++++++++++++++++++
22
target/arm/vfp_helper.c | 5 ++
28
3 files changed, 57 insertions(+)
23
target/arm/translate-vfp.c.inc | 86 ++++++++++++++++++++++++++++++++++
24
5 files changed, 106 insertions(+)
25
29
26
diff --git a/target/arm/helper.h b/target/arm/helper.h
30
diff --git a/include/hw/ptimer.h b/include/hw/ptimer.h
27
index XXXXXXX..XXXXXXX 100644
31
index XXXXXXX..XXXXXXX 100644
28
--- a/target/arm/helper.h
32
--- a/include/hw/ptimer.h
29
+++ b/target/arm/helper.h
33
+++ b/include/hw/ptimer.h
30
@@ -XXX,XX +XXX,XX @@ DEF_HELPER_FLAGS_5(probe_access, TCG_CALL_NO_WG, void, env, tl, i32, i32, i32)
34
@@ -XXX,XX +XXX,XX @@ void ptimer_transaction_commit(ptimer_state *s);
31
DEF_HELPER_1(vfp_get_fpscr, i32, env)
35
*/
32
DEF_HELPER_2(vfp_set_fpscr, void, env, i32)
36
void ptimer_set_period(ptimer_state *s, int64_t period);
33
37
34
+DEF_HELPER_3(vfp_addh, f16, f16, f16, ptr)
38
+/**
35
DEF_HELPER_3(vfp_adds, f32, f32, f32, ptr)
39
+ * ptimer_set_period_from_clock - Set counter increment from a Clock
36
DEF_HELPER_3(vfp_addd, f64, f64, f64, ptr)
40
+ * @s: ptimer to configure
37
+DEF_HELPER_3(vfp_subh, f16, f16, f16, ptr)
41
+ * @clk: pointer to Clock object to take period from
38
DEF_HELPER_3(vfp_subs, f32, f32, f32, ptr)
42
+ * @divisor: value to scale the clock frequency down by
39
DEF_HELPER_3(vfp_subd, f64, f64, f64, ptr)
43
+ *
40
+DEF_HELPER_3(vfp_mulh, f16, f16, f16, ptr)
44
+ * If the ptimer is being driven from a Clock, this is the preferred
41
DEF_HELPER_3(vfp_muls, f32, f32, f32, ptr)
45
+ * way to tell the ptimer about the period, because it avoids any
42
DEF_HELPER_3(vfp_muld, f64, f64, f64, ptr)
46
+ * possible rounding errors that might happen if the internal
43
+DEF_HELPER_3(vfp_divh, f16, f16, f16, ptr)
47
+ * representation of the Clock period was converted to either a period
44
DEF_HELPER_3(vfp_divs, f32, f32, f32, ptr)
48
+ * in ns or a frequency in Hz.
45
DEF_HELPER_3(vfp_divd, f64, f64, f64, ptr)
49
+ *
46
+DEF_HELPER_3(vfp_maxh, f16, f16, f16, ptr)
50
+ * If the ptimer should run at the same frequency as the clock,
47
DEF_HELPER_3(vfp_maxs, f32, f32, f32, ptr)
51
+ * pass 1 as the @divisor; if the ptimer should run at half the
48
DEF_HELPER_3(vfp_maxd, f64, f64, f64, ptr)
52
+ * frequency, pass 2, and so on.
49
+DEF_HELPER_3(vfp_minh, f16, f16, f16, ptr)
53
+ *
50
DEF_HELPER_3(vfp_mins, f32, f32, f32, ptr)
54
+ * This function will assert if it is called outside a
51
DEF_HELPER_3(vfp_mind, f64, f64, f64, ptr)
55
+ * ptimer_transaction_begin/commit block.
52
+DEF_HELPER_3(vfp_maxnumh, f16, f16, f16, ptr)
56
+ */
53
DEF_HELPER_3(vfp_maxnums, f32, f32, f32, ptr)
57
+void ptimer_set_period_from_clock(ptimer_state *s, const Clock *clock,
54
DEF_HELPER_3(vfp_maxnumd, f64, f64, f64, ptr)
58
+ unsigned int divisor);
55
+DEF_HELPER_3(vfp_minnumh, f16, f16, f16, ptr)
59
+
56
DEF_HELPER_3(vfp_minnums, f32, f32, f32, ptr)
60
/**
57
DEF_HELPER_3(vfp_minnumd, f64, f64, f64, ptr)
61
* ptimer_set_freq - Set counter frequency in Hz
58
DEF_HELPER_1(vfp_negs, f32, f32)
62
* @s: ptimer to configure
59
diff --git a/target/arm/vfp-uncond.decode b/target/arm/vfp-uncond.decode
63
diff --git a/include/qemu/typedefs.h b/include/qemu/typedefs.h
60
index XXXXXXX..XXXXXXX 100644
64
index XXXXXXX..XXXXXXX 100644
61
--- a/target/arm/vfp-uncond.decode
65
--- a/include/qemu/typedefs.h
62
+++ b/target/arm/vfp-uncond.decode
66
+++ b/include/qemu/typedefs.h
63
@@ -XXX,XX +XXX,XX @@ VSEL 1111 1110 0. cc:2 .... .... 1010 .0.0 .... \
67
@@ -XXX,XX +XXX,XX @@ typedef struct BlockDriverState BlockDriverState;
64
VSEL 1111 1110 0. cc:2 .... .... 1011 .0.0 .... \
68
typedef struct BusClass BusClass;
65
vm=%vm_dp vn=%vn_dp vd=%vd_dp dp=1
69
typedef struct BusState BusState;
66
70
typedef struct Chardev Chardev;
67
+VMAXNM_hp 1111 1110 1.00 .... .... 1001 .0.0 .... @vfp_dnm_s
71
+typedef struct Clock Clock;
68
+VMINNM_hp 1111 1110 1.00 .... .... 1001 .1.0 .... @vfp_dnm_s
72
typedef struct CompatProperty CompatProperty;
69
+
73
typedef struct CoMutex CoMutex;
70
VMAXNM_sp 1111 1110 1.00 .... .... 1010 .0.0 .... @vfp_dnm_s
74
typedef struct CPUAddressSpace CPUAddressSpace;
71
VMINNM_sp 1111 1110 1.00 .... .... 1010 .1.0 .... @vfp_dnm_s
75
diff --git a/hw/core/ptimer.c b/hw/core/ptimer.c
72
73
diff --git a/target/arm/vfp.decode b/target/arm/vfp.decode
74
index XXXXXXX..XXXXXXX 100644
76
index XXXXXXX..XXXXXXX 100644
75
--- a/target/arm/vfp.decode
77
--- a/hw/core/ptimer.c
76
+++ b/target/arm/vfp.decode
78
+++ b/hw/core/ptimer.c
77
@@ -XXX,XX +XXX,XX @@ VNMLS_dp ---- 1110 0.01 .... .... 1011 .0.0 .... @vfp_dnm_d
79
@@ -XXX,XX +XXX,XX @@
78
VNMLA_sp ---- 1110 0.01 .... .... 1010 .1.0 .... @vfp_dnm_s
80
#include "sysemu/qtest.h"
79
VNMLA_dp ---- 1110 0.01 .... .... 1011 .1.0 .... @vfp_dnm_d
81
#include "block/aio.h"
80
82
#include "sysemu/cpus.h"
81
+VMUL_hp ---- 1110 0.10 .... .... 1001 .0.0 .... @vfp_dnm_s
83
+#include "hw/clock.h"
82
VMUL_sp ---- 1110 0.10 .... .... 1010 .0.0 .... @vfp_dnm_s
84
83
VMUL_dp ---- 1110 0.10 .... .... 1011 .0.0 .... @vfp_dnm_d
85
#define DELTA_ADJUST 1
84
86
#define DELTA_NO_ADJUST -1
85
VNMUL_sp ---- 1110 0.10 .... .... 1010 .1.0 .... @vfp_dnm_s
87
@@ -XXX,XX +XXX,XX @@ void ptimer_set_period(ptimer_state *s, int64_t period)
86
VNMUL_dp ---- 1110 0.10 .... .... 1011 .1.0 .... @vfp_dnm_d
88
}
87
88
+VADD_hp ---- 1110 0.11 .... .... 1001 .0.0 .... @vfp_dnm_s
89
VADD_sp ---- 1110 0.11 .... .... 1010 .0.0 .... @vfp_dnm_s
90
VADD_dp ---- 1110 0.11 .... .... 1011 .0.0 .... @vfp_dnm_d
91
92
+VSUB_hp ---- 1110 0.11 .... .... 1001 .1.0 .... @vfp_dnm_s
93
VSUB_sp ---- 1110 0.11 .... .... 1010 .1.0 .... @vfp_dnm_s
94
VSUB_dp ---- 1110 0.11 .... .... 1011 .1.0 .... @vfp_dnm_d
95
96
+VDIV_hp ---- 1110 1.00 .... .... 1001 .0.0 .... @vfp_dnm_s
97
VDIV_sp ---- 1110 1.00 .... .... 1010 .0.0 .... @vfp_dnm_s
98
VDIV_dp ---- 1110 1.00 .... .... 1011 .0.0 .... @vfp_dnm_d
99
100
diff --git a/target/arm/vfp_helper.c b/target/arm/vfp_helper.c
101
index XXXXXXX..XXXXXXX 100644
102
--- a/target/arm/vfp_helper.c
103
+++ b/target/arm/vfp_helper.c
104
@@ -XXX,XX +XXX,XX @@ void vfp_set_fpscr(CPUARMState *env, uint32_t val)
105
#define VFP_HELPER(name, p) HELPER(glue(glue(vfp_,name),p))
106
107
#define VFP_BINOP(name) \
108
+dh_ctype_f16 VFP_HELPER(name, h)(dh_ctype_f16 a, dh_ctype_f16 b, void *fpstp) \
109
+{ \
110
+ float_status *fpst = fpstp; \
111
+ return float16_ ## name(a, b, fpst); \
112
+} \
113
float32 VFP_HELPER(name, s)(float32 a, float32 b, void *fpstp) \
114
{ \
115
float_status *fpst = fpstp; \
116
diff --git a/target/arm/translate-vfp.c.inc b/target/arm/translate-vfp.c.inc
117
index XXXXXXX..XXXXXXX 100644
118
--- a/target/arm/translate-vfp.c.inc
119
+++ b/target/arm/translate-vfp.c.inc
120
@@ -XXX,XX +XXX,XX @@ static bool do_vfp_3op_sp(DisasContext *s, VFPGen3OpSPFn *fn,
121
return true;
122
}
89
}
123
90
124
+static bool do_vfp_3op_hp(DisasContext *s, VFPGen3OpSPFn *fn,
91
+/* Set counter increment interval from a Clock */
125
+ int vd, int vn, int vm, bool reads_vd)
92
+void ptimer_set_period_from_clock(ptimer_state *s, const Clock *clk,
93
+ unsigned int divisor)
126
+{
94
+{
127
+ /*
95
+ /*
128
+ * Do a half-precision operation. Functionally this is
96
+ * The raw clock period is a 64-bit value in units of 2^-32 ns;
129
+ * the same as do_vfp_3op_sp(), except:
97
+ * put another way it's a 32.32 fixed-point ns value. Our internal
130
+ * - it uses the FPST_FPCR_F16
98
+ * representation of the period is 64.32 fixed point ns, so
131
+ * - it doesn't need the VFP vector handling (fp16 is a
99
+ * the conversion is simple.
132
+ * v8 feature, and in v8 VFP vectors don't exist)
133
+ * - it does the aa32_fp16_arith feature test
134
+ */
100
+ */
135
+ TCGv_i32 f0, f1, fd;
101
+ uint64_t raw_period = clock_get(clk);
136
+ TCGv_ptr fpst;
102
+ uint64_t period_frac;
137
+
103
+
138
+ if (!dc_isar_feature(aa32_fp16_arith, s)) {
104
+ assert(s->in_transaction);
139
+ return false;
105
+ s->delta = ptimer_get_count(s);
106
+ s->period = extract64(raw_period, 32, 32);
107
+ period_frac = extract64(raw_period, 0, 32);
108
+ /*
109
+ * divisor specifies a possible frequency divisor between the
110
+ * clock and the timer, so it is a multiplier on the period.
111
+ * We do the multiply after splitting the raw period out into
112
+ * period and frac to avoid having to do a 32*64->96 multiply.
113
+ */
114
+ s->period *= divisor;
115
+ period_frac *= divisor;
116
+ s->period += extract64(period_frac, 32, 32);
117
+ s->period_frac = (uint32_t)period_frac;
118
+
119
+ if (s->enabled) {
120
+ s->need_reload = true;
140
+ }
121
+ }
141
+
142
+ if (s->vec_len != 0 || s->vec_stride != 0) {
143
+ return false;
144
+ }
145
+
146
+ if (!vfp_access_check(s)) {
147
+ return true;
148
+ }
149
+
150
+ f0 = tcg_temp_new_i32();
151
+ f1 = tcg_temp_new_i32();
152
+ fd = tcg_temp_new_i32();
153
+ fpst = fpstatus_ptr(FPST_FPCR_F16);
154
+
155
+ neon_load_reg32(f0, vn);
156
+ neon_load_reg32(f1, vm);
157
+
158
+ if (reads_vd) {
159
+ neon_load_reg32(fd, vd);
160
+ }
161
+ fn(fd, f0, f1, fpst);
162
+ neon_store_reg32(fd, vd);
163
+
164
+ tcg_temp_free_i32(f0);
165
+ tcg_temp_free_i32(f1);
166
+ tcg_temp_free_i32(fd);
167
+ tcg_temp_free_ptr(fpst);
168
+
169
+ return true;
170
+}
122
+}
171
+
123
+
172
static bool do_vfp_3op_dp(DisasContext *s, VFPGen3OpDPFn *fn,
124
/* Set counter frequency in Hz. */
173
int vd, int vn, int vm, bool reads_vd)
125
void ptimer_set_freq(ptimer_state *s, uint32_t freq)
174
{
126
{
175
@@ -XXX,XX +XXX,XX @@ static bool trans_VNMLA_dp(DisasContext *s, arg_VNMLA_dp *a)
176
return do_vfp_3op_dp(s, gen_VNMLA_dp, a->vd, a->vn, a->vm, true);
177
}
178
179
+static bool trans_VMUL_hp(DisasContext *s, arg_VMUL_sp *a)
180
+{
181
+ return do_vfp_3op_hp(s, gen_helper_vfp_mulh, a->vd, a->vn, a->vm, false);
182
+}
183
+
184
static bool trans_VMUL_sp(DisasContext *s, arg_VMUL_sp *a)
185
{
186
return do_vfp_3op_sp(s, gen_helper_vfp_muls, a->vd, a->vn, a->vm, false);
187
@@ -XXX,XX +XXX,XX @@ static bool trans_VNMUL_dp(DisasContext *s, arg_VNMUL_dp *a)
188
return do_vfp_3op_dp(s, gen_VNMUL_dp, a->vd, a->vn, a->vm, false);
189
}
190
191
+static bool trans_VADD_hp(DisasContext *s, arg_VADD_sp *a)
192
+{
193
+ return do_vfp_3op_hp(s, gen_helper_vfp_addh, a->vd, a->vn, a->vm, false);
194
+}
195
+
196
static bool trans_VADD_sp(DisasContext *s, arg_VADD_sp *a)
197
{
198
return do_vfp_3op_sp(s, gen_helper_vfp_adds, a->vd, a->vn, a->vm, false);
199
@@ -XXX,XX +XXX,XX @@ static bool trans_VADD_dp(DisasContext *s, arg_VADD_dp *a)
200
return do_vfp_3op_dp(s, gen_helper_vfp_addd, a->vd, a->vn, a->vm, false);
201
}
202
203
+static bool trans_VSUB_hp(DisasContext *s, arg_VSUB_sp *a)
204
+{
205
+ return do_vfp_3op_hp(s, gen_helper_vfp_subh, a->vd, a->vn, a->vm, false);
206
+}
207
+
208
static bool trans_VSUB_sp(DisasContext *s, arg_VSUB_sp *a)
209
{
210
return do_vfp_3op_sp(s, gen_helper_vfp_subs, a->vd, a->vn, a->vm, false);
211
@@ -XXX,XX +XXX,XX @@ static bool trans_VSUB_dp(DisasContext *s, arg_VSUB_dp *a)
212
return do_vfp_3op_dp(s, gen_helper_vfp_subd, a->vd, a->vn, a->vm, false);
213
}
214
215
+static bool trans_VDIV_hp(DisasContext *s, arg_VDIV_sp *a)
216
+{
217
+ return do_vfp_3op_hp(s, gen_helper_vfp_divh, a->vd, a->vn, a->vm, false);
218
+}
219
+
220
static bool trans_VDIV_sp(DisasContext *s, arg_VDIV_sp *a)
221
{
222
return do_vfp_3op_sp(s, gen_helper_vfp_divs, a->vd, a->vn, a->vm, false);
223
@@ -XXX,XX +XXX,XX @@ static bool trans_VDIV_dp(DisasContext *s, arg_VDIV_dp *a)
224
return do_vfp_3op_dp(s, gen_helper_vfp_divd, a->vd, a->vn, a->vm, false);
225
}
226
227
+static bool trans_VMINNM_hp(DisasContext *s, arg_VMINNM_sp *a)
228
+{
229
+ if (!dc_isar_feature(aa32_vminmaxnm, s)) {
230
+ return false;
231
+ }
232
+ return do_vfp_3op_hp(s, gen_helper_vfp_minnumh,
233
+ a->vd, a->vn, a->vm, false);
234
+}
235
+
236
+static bool trans_VMAXNM_hp(DisasContext *s, arg_VMAXNM_sp *a)
237
+{
238
+ if (!dc_isar_feature(aa32_vminmaxnm, s)) {
239
+ return false;
240
+ }
241
+ return do_vfp_3op_hp(s, gen_helper_vfp_maxnumh,
242
+ a->vd, a->vn, a->vm, false);
243
+}
244
+
245
static bool trans_VMINNM_sp(DisasContext *s, arg_VMINNM_sp *a)
246
{
247
if (!dc_isar_feature(aa32_vminmaxnm, s)) {
248
--
127
--
249
2.20.1
128
2.20.1
250
129
251
130
diff view generated by jsdifflib
1
Implement the VFP fp16 variant of VMOV that transfers a 16-bit
1
Add a function for checking whether a clock has a source. This is
2
value between a general purpose register and a VFP register.
2
useful for devices which have input clocks that must be wired up by
3
3
the board as it allows them to fail in realize rather than ploughing
4
Note that Rt == 15 is UNPREDICTABLE; since this insn is v8 and later
4
on with a zero-period clock.
5
only we have no need to replicate the old "updates CPSR.NZCV"
6
behaviour that the singleprec version of this insn does.
7
5
8
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
6
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
9
Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
7
Reviewed-by: Luc Michel <luc@lmichel.fr>
10
Message-id: 20200828183354.27913-22-peter.maydell@linaro.org
8
Reviewed-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
9
Tested-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
10
Message-id: 20210128114145.20536-3-peter.maydell@linaro.org
11
Message-id: 20210121190622.22000-3-peter.maydell@linaro.org
11
---
12
---
12
target/arm/vfp.decode | 1 +
13
docs/devel/clocks.rst | 16 ++++++++++++++++
13
target/arm/translate-vfp.c.inc | 34 ++++++++++++++++++++++++++++++++++
14
include/hw/clock.h | 15 +++++++++++++++
14
2 files changed, 35 insertions(+)
15
2 files changed, 31 insertions(+)
15
16
16
diff --git a/target/arm/vfp.decode b/target/arm/vfp.decode
17
diff --git a/docs/devel/clocks.rst b/docs/devel/clocks.rst
17
index XXXXXXX..XXXXXXX 100644
18
index XXXXXXX..XXXXXXX 100644
18
--- a/target/arm/vfp.decode
19
--- a/docs/devel/clocks.rst
19
+++ b/target/arm/vfp.decode
20
+++ b/docs/devel/clocks.rst
20
@@ -XXX,XX +XXX,XX @@ VDUP ---- 1110 1 b:1 q:1 0 .... rt:4 1011 . 0 e:1 1 0000 \
21
@@ -XXX,XX +XXX,XX @@ object during device instance init. For example:
21
vn=%vn_dp
22
/* set initial value to 10ns / 100MHz */
22
23
clock_set_ns(clk, 10);
23
VMSR_VMRS ---- 1110 111 l:1 reg:4 rt:4 1010 0001 0000
24
24
+VMOV_half ---- 1110 000 l:1 .... rt:4 1001 . 001 0000 vn=%vn_sp
25
+To enforce that the clock is wired up by the board code, you can
25
VMOV_single ---- 1110 000 l:1 .... rt:4 1010 . 001 0000 vn=%vn_sp
26
+call ``clock_has_source()`` in your device's realize method:
26
27
+
27
VMOV_64_sp ---- 1100 010 op:1 rt2:4 rt:4 1010 00.1 .... vm=%vm_sp
28
+.. code-block:: c
28
diff --git a/target/arm/translate-vfp.c.inc b/target/arm/translate-vfp.c.inc
29
+
30
+ if (!clock_has_source(s->clk)) {
31
+ error_setg(errp, "MyDevice: clk input must be connected");
32
+ return;
33
+ }
34
+
35
+Note that this only checks that the clock has been wired up; it is
36
+still possible that the output clock connected to it is disabled
37
+or has not yet been configured, in which case the period will be
38
+zero. You should use the clock callback to find out when the clock
39
+period changes.
40
+
41
Fetching clock frequency/period
42
-------------------------------
43
44
diff --git a/include/hw/clock.h b/include/hw/clock.h
29
index XXXXXXX..XXXXXXX 100644
45
index XXXXXXX..XXXXXXX 100644
30
--- a/target/arm/translate-vfp.c.inc
46
--- a/include/hw/clock.h
31
+++ b/target/arm/translate-vfp.c.inc
47
+++ b/include/hw/clock.h
32
@@ -XXX,XX +XXX,XX @@ static bool trans_VMSR_VMRS(DisasContext *s, arg_VMSR_VMRS *a)
48
@@ -XXX,XX +XXX,XX @@ void clock_clear_callback(Clock *clk);
33
return true;
49
*/
34
}
50
void clock_set_source(Clock *clk, Clock *src);
35
51
36
+static bool trans_VMOV_half(DisasContext *s, arg_VMOV_single *a)
52
+/**
53
+ * clock_has_source:
54
+ * @clk: the clock
55
+ *
56
+ * Returns true if the clock has a source clock connected to it.
57
+ * This is useful for devices which have input clocks which must
58
+ * be connected by the board/SoC code which creates them. The
59
+ * device code can use this to check in its realize method that
60
+ * the clock has been connected.
61
+ */
62
+static inline bool clock_has_source(const Clock *clk)
37
+{
63
+{
38
+ TCGv_i32 tmp;
64
+ return clk->source != NULL;
39
+
40
+ if (!dc_isar_feature(aa32_fp16_arith, s)) {
41
+ return false;
42
+ }
43
+
44
+ if (a->rt == 15) {
45
+ /* UNPREDICTABLE; we choose to UNDEF */
46
+ return false;
47
+ }
48
+
49
+ if (!vfp_access_check(s)) {
50
+ return true;
51
+ }
52
+
53
+ if (a->l) {
54
+ /* VFP to general purpose register */
55
+ tmp = tcg_temp_new_i32();
56
+ neon_load_reg32(tmp, a->vn);
57
+ tcg_gen_andi_i32(tmp, tmp, 0xffff);
58
+ store_reg(s, a->rt, tmp);
59
+ } else {
60
+ /* general purpose register to VFP */
61
+ tmp = load_reg(s, a->rt);
62
+ tcg_gen_andi_i32(tmp, tmp, 0xffff);
63
+ neon_store_reg32(tmp, a->vn);
64
+ tcg_temp_free_i32(tmp);
65
+ }
66
+
67
+ return true;
68
+}
65
+}
69
+
66
+
70
static bool trans_VMOV_single(DisasContext *s, arg_VMOV_single *a)
67
/**
71
{
68
* clock_set:
72
TCGv_i32 tmp;
69
* @clk: the clock to initialize.
73
--
70
--
74
2.20.1
71
2.20.1
75
72
76
73
diff view generated by jsdifflib
1
Implement the fp16 versions of the VFP VCVT instruction forms which
1
Add a simple test of the CMSDK APB timer, since we're about to do
2
convert between floating point and fixed-point.
2
some refactoring of how it is clocked.
3
3
4
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
4
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
5
Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
5
Reviewed-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
6
Message-id: 20200828183354.27913-16-peter.maydell@linaro.org
6
Reviewed-by: Luc Michel <luc@lmichel.fr>
7
Tested-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
8
Message-id: 20210128114145.20536-4-peter.maydell@linaro.org
9
Message-id: 20210121190622.22000-4-peter.maydell@linaro.org
7
---
10
---
8
target/arm/vfp.decode | 2 ++
11
tests/qtest/cmsdk-apb-timer-test.c | 75 ++++++++++++++++++++++++++++++
9
target/arm/translate-vfp.c.inc | 59 ++++++++++++++++++++++++++++++++++
12
MAINTAINERS | 1 +
10
2 files changed, 61 insertions(+)
13
tests/qtest/meson.build | 1 +
14
3 files changed, 77 insertions(+)
15
create mode 100644 tests/qtest/cmsdk-apb-timer-test.c
11
16
12
diff --git a/target/arm/vfp.decode b/target/arm/vfp.decode
17
diff --git a/tests/qtest/cmsdk-apb-timer-test.c b/tests/qtest/cmsdk-apb-timer-test.c
13
index XXXXXXX..XXXXXXX 100644
18
new file mode 100644
14
--- a/target/arm/vfp.decode
19
index XXXXXXX..XXXXXXX
15
+++ b/target/arm/vfp.decode
20
--- /dev/null
16
@@ -XXX,XX +XXX,XX @@ VJCVT ---- 1110 1.11 1001 .... 1011 11.0 .... @vfp_dm_sd
21
+++ b/tests/qtest/cmsdk-apb-timer-test.c
17
# We assemble bits 18 (op), 16 (u) and 7 (sx) into a single opc field
22
@@ -XXX,XX +XXX,XX @@
18
# for the convenience of the trans_VCVT_fix functions.
23
+/*
19
%vcvt_fix_op 18:1 16:1 7:1
24
+ * QTest testcase for the CMSDK APB timer device
20
+VCVT_fix_hp ---- 1110 1.11 1.1. .... 1001 .1.0 .... \
25
+ *
21
+ vd=%vd_sp imm=%vm_sp opc=%vcvt_fix_op
26
+ * Copyright (c) 2021 Linaro Limited
22
VCVT_fix_sp ---- 1110 1.11 1.1. .... 1010 .1.0 .... \
27
+ *
23
vd=%vd_sp imm=%vm_sp opc=%vcvt_fix_op
28
+ * This program is free software; you can redistribute it and/or modify it
24
VCVT_fix_dp ---- 1110 1.11 1.1. .... 1011 .1.0 .... \
29
+ * under the terms of the GNU General Public License as published by the
25
diff --git a/target/arm/translate-vfp.c.inc b/target/arm/translate-vfp.c.inc
30
+ * Free Software Foundation; either version 2 of the License, or
26
index XXXXXXX..XXXXXXX 100644
31
+ * (at your option) any later version.
27
--- a/target/arm/translate-vfp.c.inc
32
+ *
28
+++ b/target/arm/translate-vfp.c.inc
33
+ * This program is distributed in the hope that it will be useful, but WITHOUT
29
@@ -XXX,XX +XXX,XX @@ static bool trans_VJCVT(DisasContext *s, arg_VJCVT *a)
34
+ * ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or
30
return true;
35
+ * FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License
31
}
36
+ * for more details.
32
37
+ */
33
+static bool trans_VCVT_fix_hp(DisasContext *s, arg_VCVT_fix_sp *a)
38
+
39
+#include "qemu/osdep.h"
40
+#include "libqtest-single.h"
41
+
42
+/* IoTKit/ARMSSE-200 timer0; driven at 25MHz in mps2-an385, so 40ns per tick */
43
+#define TIMER_BASE 0x40000000
44
+
45
+#define CTRL 0
46
+#define VALUE 4
47
+#define RELOAD 8
48
+#define INTSTATUS 0xc
49
+
50
+static void test_timer(void)
34
+{
51
+{
35
+ TCGv_i32 vd, shift;
52
+ g_assert_true(readl(TIMER_BASE + INTSTATUS) == 0);
36
+ TCGv_ptr fpst;
37
+ int frac_bits;
38
+
53
+
39
+ if (!dc_isar_feature(aa32_fp16_arith, s)) {
54
+ /* Start timer: will fire after 40 * 1000 == 40000 ns */
40
+ return false;
55
+ writel(TIMER_BASE + RELOAD, 1000);
41
+ }
56
+ writel(TIMER_BASE + CTRL, 9);
42
+
57
+
43
+ if (!vfp_access_check(s)) {
58
+ /* Step to just past the 500th tick and check VALUE */
44
+ return true;
59
+ clock_step(40 * 500 + 1);
45
+ }
60
+ g_assert_cmpuint(readl(TIMER_BASE + INTSTATUS), ==, 0);
61
+ g_assert_cmpuint(readl(TIMER_BASE + VALUE), ==, 500);
46
+
62
+
47
+ frac_bits = (a->opc & 1) ? (32 - a->imm) : (16 - a->imm);
63
+ /* Just past the 1000th tick: timer should have fired */
64
+ clock_step(40 * 500);
65
+ g_assert_cmpuint(readl(TIMER_BASE + INTSTATUS), ==, 1);
66
+ g_assert_cmpuint(readl(TIMER_BASE + VALUE), ==, 0);
48
+
67
+
49
+ vd = tcg_temp_new_i32();
68
+ /* VALUE reloads at the following tick */
50
+ neon_load_reg32(vd, a->vd);
69
+ clock_step(40);
70
+ g_assert_cmpuint(readl(TIMER_BASE + VALUE), ==, 1000);
51
+
71
+
52
+ fpst = fpstatus_ptr(FPST_FPCR_F16);
72
+ /* Check write-1-to-clear behaviour of INTSTATUS */
53
+ shift = tcg_const_i32(frac_bits);
73
+ writel(TIMER_BASE + INTSTATUS, 0);
74
+ g_assert_cmpuint(readl(TIMER_BASE + INTSTATUS), ==, 1);
75
+ writel(TIMER_BASE + INTSTATUS, 1);
76
+ g_assert_cmpuint(readl(TIMER_BASE + INTSTATUS), ==, 0);
54
+
77
+
55
+ /* Switch on op:U:sx bits */
78
+ /* Turn off the timer */
56
+ switch (a->opc) {
79
+ writel(TIMER_BASE + CTRL, 0);
57
+ case 0:
58
+ gen_helper_vfp_shtoh(vd, vd, shift, fpst);
59
+ break;
60
+ case 1:
61
+ gen_helper_vfp_sltoh(vd, vd, shift, fpst);
62
+ break;
63
+ case 2:
64
+ gen_helper_vfp_uhtoh(vd, vd, shift, fpst);
65
+ break;
66
+ case 3:
67
+ gen_helper_vfp_ultoh(vd, vd, shift, fpst);
68
+ break;
69
+ case 4:
70
+ gen_helper_vfp_toshh_round_to_zero(vd, vd, shift, fpst);
71
+ break;
72
+ case 5:
73
+ gen_helper_vfp_toslh_round_to_zero(vd, vd, shift, fpst);
74
+ break;
75
+ case 6:
76
+ gen_helper_vfp_touhh_round_to_zero(vd, vd, shift, fpst);
77
+ break;
78
+ case 7:
79
+ gen_helper_vfp_toulh_round_to_zero(vd, vd, shift, fpst);
80
+ break;
81
+ default:
82
+ g_assert_not_reached();
83
+ }
84
+
85
+ neon_store_reg32(vd, a->vd);
86
+ tcg_temp_free_i32(vd);
87
+ tcg_temp_free_i32(shift);
88
+ tcg_temp_free_ptr(fpst);
89
+ return true;
90
+}
80
+}
91
+
81
+
92
static bool trans_VCVT_fix_sp(DisasContext *s, arg_VCVT_fix_sp *a)
82
+int main(int argc, char **argv)
93
{
83
+{
94
TCGv_i32 vd, shift;
84
+ int r;
85
+
86
+ g_test_init(&argc, &argv, NULL);
87
+
88
+ qtest_start("-machine mps2-an385");
89
+
90
+ qtest_add_func("/cmsdk-apb-timer/timer", test_timer);
91
+
92
+ r = g_test_run();
93
+
94
+ qtest_end();
95
+
96
+ return r;
97
+}
98
diff --git a/MAINTAINERS b/MAINTAINERS
99
index XXXXXXX..XXXXXXX 100644
100
--- a/MAINTAINERS
101
+++ b/MAINTAINERS
102
@@ -XXX,XX +XXX,XX @@ F: include/hw/rtc/pl031.h
103
F: include/hw/arm/primecell.h
104
F: hw/timer/cmsdk-apb-timer.c
105
F: include/hw/timer/cmsdk-apb-timer.h
106
+F: tests/qtest/cmsdk-apb-timer-test.c
107
F: hw/timer/cmsdk-apb-dualtimer.c
108
F: include/hw/timer/cmsdk-apb-dualtimer.h
109
F: hw/char/cmsdk-apb-uart.c
110
diff --git a/tests/qtest/meson.build b/tests/qtest/meson.build
111
index XXXXXXX..XXXXXXX 100644
112
--- a/tests/qtest/meson.build
113
+++ b/tests/qtest/meson.build
114
@@ -XXX,XX +XXX,XX @@ qtests_npcm7xx = \
115
'npcm7xx_timer-test',
116
'npcm7xx_watchdog_timer-test']
117
qtests_arm = \
118
+ (config_all_devices.has_key('CONFIG_CMSDK_APB_TIMER') ? ['cmsdk-apb-timer-test'] : []) + \
119
(config_all_devices.has_key('CONFIG_PFLASH_CFI02') ? ['pflash-cfi02-test'] : []) + \
120
(config_all_devices.has_key('CONFIG_NPCM7XX') ? qtests_npcm7xx : []) + \
121
['arm-cpu-features',
95
--
122
--
96
2.20.1
123
2.20.1
97
124
98
125
diff view generated by jsdifflib
1
Implement the fp16 versions of the VFP VLDR/VSTR (immediate).
1
Add a simple test of the CMSDK watchdog, since we're about to do some
2
refactoring of how it is clocked.
2
3
3
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
4
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
4
Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
5
Reviewed-by: Luc Michel <luc@lmichel.fr>
5
Message-id: 20200828183354.27913-12-peter.maydell@linaro.org
6
Tested-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
7
Reviewed-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
8
Message-id: 20210128114145.20536-5-peter.maydell@linaro.org
9
Message-id: 20210121190622.22000-5-peter.maydell@linaro.org
10
Reviewed-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
6
---
11
---
7
target/arm/vfp.decode | 3 +--
12
tests/qtest/cmsdk-apb-watchdog-test.c | 79 +++++++++++++++++++++++++++
8
target/arm/translate-vfp.c.inc | 35 ++++++++++++++++++++++++++++++++++
13
MAINTAINERS | 1 +
9
2 files changed, 36 insertions(+), 2 deletions(-)
14
tests/qtest/meson.build | 1 +
15
3 files changed, 81 insertions(+)
16
create mode 100644 tests/qtest/cmsdk-apb-watchdog-test.c
10
17
11
diff --git a/target/arm/vfp.decode b/target/arm/vfp.decode
18
diff --git a/tests/qtest/cmsdk-apb-watchdog-test.c b/tests/qtest/cmsdk-apb-watchdog-test.c
12
index XXXXXXX..XXXXXXX 100644
19
new file mode 100644
13
--- a/target/arm/vfp.decode
20
index XXXXXXX..XXXXXXX
14
+++ b/target/arm/vfp.decode
21
--- /dev/null
15
@@ -XXX,XX +XXX,XX @@ VMOV_single ---- 1110 000 l:1 .... rt:4 1010 . 001 0000 vn=%vn_sp
22
+++ b/tests/qtest/cmsdk-apb-watchdog-test.c
16
VMOV_64_sp ---- 1100 010 op:1 rt2:4 rt:4 1010 00.1 .... vm=%vm_sp
23
@@ -XXX,XX +XXX,XX @@
17
VMOV_64_dp ---- 1100 010 op:1 rt2:4 rt:4 1011 00.1 .... vm=%vm_dp
24
+/*
18
25
+ * QTest testcase for the CMSDK APB watchdog device
19
-# Note that the half-precision variants of VLDR and VSTR are
26
+ *
20
-# not part of this decodetree at all because they have bits [9:8] == 0b01
27
+ * Copyright (c) 2021 Linaro Limited
21
+VLDR_VSTR_hp ---- 1101 u:1 .0 l:1 rn:4 .... 1001 imm:8 vd=%vd_sp
28
+ *
22
VLDR_VSTR_sp ---- 1101 u:1 .0 l:1 rn:4 .... 1010 imm:8 vd=%vd_sp
29
+ * This program is free software; you can redistribute it and/or modify it
23
VLDR_VSTR_dp ---- 1101 u:1 .0 l:1 rn:4 .... 1011 imm:8 vd=%vd_dp
30
+ * under the terms of the GNU General Public License as published by the
24
31
+ * Free Software Foundation; either version 2 of the License, or
25
diff --git a/target/arm/translate-vfp.c.inc b/target/arm/translate-vfp.c.inc
32
+ * (at your option) any later version.
26
index XXXXXXX..XXXXXXX 100644
33
+ *
27
--- a/target/arm/translate-vfp.c.inc
34
+ * This program is distributed in the hope that it will be useful, but WITHOUT
28
+++ b/target/arm/translate-vfp.c.inc
35
+ * ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or
29
@@ -XXX,XX +XXX,XX @@ static bool trans_VMOV_64_dp(DisasContext *s, arg_VMOV_64_dp *a)
36
+ * FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License
30
return true;
37
+ * for more details.
31
}
38
+ */
32
39
+
33
+static bool trans_VLDR_VSTR_hp(DisasContext *s, arg_VLDR_VSTR_sp *a)
40
+#include "qemu/osdep.h"
41
+#include "libqtest-single.h"
42
+
43
+/*
44
+ * lm3s811evb watchdog; at board startup this runs at 200MHz / 16 == 12.5MHz,
45
+ * which is 80ns per tick.
46
+ */
47
+#define WDOG_BASE 0x40000000
48
+
49
+#define WDOGLOAD 0
50
+#define WDOGVALUE 4
51
+#define WDOGCONTROL 8
52
+#define WDOGINTCLR 0xc
53
+#define WDOGRIS 0x10
54
+#define WDOGMIS 0x14
55
+#define WDOGLOCK 0xc00
56
+
57
+static void test_watchdog(void)
34
+{
58
+{
35
+ uint32_t offset;
59
+ g_assert_cmpuint(readl(WDOG_BASE + WDOGRIS), ==, 0);
36
+ TCGv_i32 addr, tmp;
37
+
60
+
38
+ if (!dc_isar_feature(aa32_fp16_arith, s)) {
61
+ writel(WDOG_BASE + WDOGCONTROL, 1);
39
+ return false;
62
+ writel(WDOG_BASE + WDOGLOAD, 1000);
40
+ }
41
+
63
+
42
+ if (!vfp_access_check(s)) {
64
+ /* Step to just past the 500th tick */
43
+ return true;
65
+ clock_step(500 * 80 + 1);
44
+ }
66
+ g_assert_cmpuint(readl(WDOG_BASE + WDOGRIS), ==, 0);
67
+ g_assert_cmpuint(readl(WDOG_BASE + WDOGVALUE), ==, 500);
45
+
68
+
46
+ /* imm8 field is offset/2 for fp16, unlike fp32 and fp64 */
69
+ /* Just past the 1000th tick: timer should have fired */
47
+ offset = a->imm << 1;
70
+ clock_step(500 * 80);
48
+ if (!a->u) {
71
+ g_assert_cmpuint(readl(WDOG_BASE + WDOGRIS), ==, 1);
49
+ offset = -offset;
72
+ g_assert_cmpuint(readl(WDOG_BASE + WDOGVALUE), ==, 0);
50
+ }
51
+
73
+
52
+ /* For thumb, use of PC is UNPREDICTABLE. */
74
+ /* VALUE reloads at following tick */
53
+ addr = add_reg_for_lit(s, a->rn, offset);
75
+ clock_step(80);
54
+ tmp = tcg_temp_new_i32();
76
+ g_assert_cmpuint(readl(WDOG_BASE + WDOGVALUE), ==, 1000);
55
+ if (a->l) {
56
+ gen_aa32_ld16u(s, tmp, addr, get_mem_index(s));
57
+ neon_store_reg32(tmp, a->vd);
58
+ } else {
59
+ neon_load_reg32(tmp, a->vd);
60
+ gen_aa32_st16(s, tmp, addr, get_mem_index(s));
61
+ }
62
+ tcg_temp_free_i32(tmp);
63
+ tcg_temp_free_i32(addr);
64
+
77
+
65
+ return true;
78
+ /* Writing any value to WDOGINTCLR clears the interrupt and reloads */
79
+ clock_step(500 * 80);
80
+ g_assert_cmpuint(readl(WDOG_BASE + WDOGVALUE), ==, 500);
81
+ g_assert_cmpuint(readl(WDOG_BASE + WDOGRIS), ==, 1);
82
+ writel(WDOG_BASE + WDOGINTCLR, 0);
83
+ g_assert_cmpuint(readl(WDOG_BASE + WDOGVALUE), ==, 1000);
84
+ g_assert_cmpuint(readl(WDOG_BASE + WDOGRIS), ==, 0);
66
+}
85
+}
67
+
86
+
68
static bool trans_VLDR_VSTR_sp(DisasContext *s, arg_VLDR_VSTR_sp *a)
87
+int main(int argc, char **argv)
69
{
88
+{
70
uint32_t offset;
89
+ int r;
90
+
91
+ g_test_init(&argc, &argv, NULL);
92
+
93
+ qtest_start("-machine lm3s811evb");
94
+
95
+ qtest_add_func("/cmsdk-apb-watchdog/watchdog", test_watchdog);
96
+
97
+ r = g_test_run();
98
+
99
+ qtest_end();
100
+
101
+ return r;
102
+}
103
diff --git a/MAINTAINERS b/MAINTAINERS
104
index XXXXXXX..XXXXXXX 100644
105
--- a/MAINTAINERS
106
+++ b/MAINTAINERS
107
@@ -XXX,XX +XXX,XX @@ F: hw/char/cmsdk-apb-uart.c
108
F: include/hw/char/cmsdk-apb-uart.h
109
F: hw/watchdog/cmsdk-apb-watchdog.c
110
F: include/hw/watchdog/cmsdk-apb-watchdog.h
111
+F: tests/qtest/cmsdk-apb-watchdog-test.c
112
F: hw/misc/tz-ppc.c
113
F: include/hw/misc/tz-ppc.h
114
F: hw/misc/tz-mpc.c
115
diff --git a/tests/qtest/meson.build b/tests/qtest/meson.build
116
index XXXXXXX..XXXXXXX 100644
117
--- a/tests/qtest/meson.build
118
+++ b/tests/qtest/meson.build
119
@@ -XXX,XX +XXX,XX @@ qtests_npcm7xx = \
120
'npcm7xx_watchdog_timer-test']
121
qtests_arm = \
122
(config_all_devices.has_key('CONFIG_CMSDK_APB_TIMER') ? ['cmsdk-apb-timer-test'] : []) + \
123
+ (config_all_devices.has_key('CONFIG_CMSDK_APB_WATCHDOG') ? ['cmsdk-apb-watchdog-test'] : []) + \
124
(config_all_devices.has_key('CONFIG_PFLASH_CFI02') ? ['pflash-cfi02-test'] : []) + \
125
(config_all_devices.has_key('CONFIG_NPCM7XX') ? qtests_npcm7xx : []) + \
126
['arm-cpu-features',
71
--
127
--
72
2.20.1
128
2.20.1
73
129
74
130
diff view generated by jsdifflib
1
Convert the Neon floating-point VMLA and VMLS insns over to using a
1
Add a simple test of the CMSDK dual timer, since we're about to do
2
gvec helper, and use this to implement the fp16 case.
2
some refactoring of how it is clocked.
3
3
4
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
4
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
5
Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
5
Tested-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
6
Message-id: 20200828183354.27913-31-peter.maydell@linaro.org
6
Reviewed-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
7
Reviewed-by: Luc Michel <luc@lmichel.fr>
8
Message-id: 20210128114145.20536-6-peter.maydell@linaro.org
9
Message-id: 20210121190622.22000-6-peter.maydell@linaro.org
7
---
10
---
8
target/arm/helper.h | 6 +++++
11
tests/qtest/cmsdk-apb-dualtimer-test.c | 130 +++++++++++++++++++++++++
9
target/arm/vec_helper.c | 42 +++++++++++++++++++++++++++++++++
12
MAINTAINERS | 1 +
10
target/arm/translate-neon.c.inc | 33 ++------------------------
13
tests/qtest/meson.build | 1 +
11
3 files changed, 50 insertions(+), 31 deletions(-)
14
3 files changed, 132 insertions(+)
15
create mode 100644 tests/qtest/cmsdk-apb-dualtimer-test.c
12
16
13
diff --git a/target/arm/helper.h b/target/arm/helper.h
17
diff --git a/tests/qtest/cmsdk-apb-dualtimer-test.c b/tests/qtest/cmsdk-apb-dualtimer-test.c
14
index XXXXXXX..XXXXXXX 100644
18
new file mode 100644
15
--- a/target/arm/helper.h
19
index XXXXXXX..XXXXXXX
16
+++ b/target/arm/helper.h
20
--- /dev/null
17
@@ -XXX,XX +XXX,XX @@ DEF_HELPER_FLAGS_5(gvec_fmaxnum_s, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, ptr, i3
21
+++ b/tests/qtest/cmsdk-apb-dualtimer-test.c
18
DEF_HELPER_FLAGS_5(gvec_fminnum_h, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, ptr, i32)
22
@@ -XXX,XX +XXX,XX @@
19
DEF_HELPER_FLAGS_5(gvec_fminnum_s, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, ptr, i32)
23
+/*
20
24
+ * QTest testcase for the CMSDK APB dualtimer device
21
+DEF_HELPER_FLAGS_5(gvec_fmla_h, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, ptr, i32)
25
+ *
22
+DEF_HELPER_FLAGS_5(gvec_fmla_s, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, ptr, i32)
26
+ * Copyright (c) 2021 Linaro Limited
27
+ *
28
+ * This program is free software; you can redistribute it and/or modify it
29
+ * under the terms of the GNU General Public License as published by the
30
+ * Free Software Foundation; either version 2 of the License, or
31
+ * (at your option) any later version.
32
+ *
33
+ * This program is distributed in the hope that it will be useful, but WITHOUT
34
+ * ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or
35
+ * FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License
36
+ * for more details.
37
+ */
23
+
38
+
24
+DEF_HELPER_FLAGS_5(gvec_fmls_h, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, ptr, i32)
39
+#include "qemu/osdep.h"
25
+DEF_HELPER_FLAGS_5(gvec_fmls_s, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, ptr, i32)
40
+#include "libqtest-single.h"
26
+
41
+
27
DEF_HELPER_FLAGS_5(gvec_ftsmul_h, TCG_CALL_NO_RWG,
42
+/* IoTKit/ARMSSE dualtimer; driven at 25MHz in mps2-an385, so 40ns per tick */
28
void, ptr, ptr, ptr, ptr, i32)
43
+#define TIMER_BASE 0x40002000
29
DEF_HELPER_FLAGS_5(gvec_ftsmul_s, TCG_CALL_NO_RWG,
44
+
30
diff --git a/target/arm/vec_helper.c b/target/arm/vec_helper.c
45
+#define TIMER1LOAD 0
31
index XXXXXXX..XXXXXXX 100644
46
+#define TIMER1VALUE 4
32
--- a/target/arm/vec_helper.c
47
+#define TIMER1CONTROL 8
33
+++ b/target/arm/vec_helper.c
48
+#define TIMER1INTCLR 0xc
34
@@ -XXX,XX +XXX,XX @@ DO_3OP(gvec_rsqrts_d, helper_rsqrtsf_f64, float64)
49
+#define TIMER1RIS 0x10
35
#endif
50
+#define TIMER1MIS 0x14
36
#undef DO_3OP
51
+#define TIMER1BGLOAD 0x18
37
52
+
38
+/* Non-fused multiply-add (unlike float16_muladd etc, which are fused) */
53
+#define TIMER2LOAD 0x20
39
+static float16 float16_muladd_nf(float16 dest, float16 op1, float16 op2,
54
+#define TIMER2VALUE 0x24
40
+ float_status *stat)
55
+#define TIMER2CONTROL 0x28
56
+#define TIMER2INTCLR 0x2c
57
+#define TIMER2RIS 0x30
58
+#define TIMER2MIS 0x34
59
+#define TIMER2BGLOAD 0x38
60
+
61
+#define CTRL_ENABLE (1 << 7)
62
+#define CTRL_PERIODIC (1 << 6)
63
+#define CTRL_INTEN (1 << 5)
64
+#define CTRL_PRESCALE_1 (0 << 2)
65
+#define CTRL_PRESCALE_16 (1 << 2)
66
+#define CTRL_PRESCALE_256 (2 << 2)
67
+#define CTRL_32BIT (1 << 1)
68
+#define CTRL_ONESHOT (1 << 0)
69
+
70
+static void test_dualtimer(void)
41
+{
71
+{
42
+ return float16_add(dest, float16_mul(op1, op2, stat), stat);
72
+ g_assert_true(readl(TIMER_BASE + TIMER1RIS) == 0);
73
+
74
+ /* Start timer: will fire after 40000 ns */
75
+ writel(TIMER_BASE + TIMER1LOAD, 1000);
76
+ /* enable in free-running, wrapping, interrupt mode */
77
+ writel(TIMER_BASE + TIMER1CONTROL, CTRL_ENABLE | CTRL_INTEN);
78
+
79
+ /* Step to just past the 500th tick and check VALUE */
80
+ clock_step(500 * 40 + 1);
81
+ g_assert_cmpuint(readl(TIMER_BASE + TIMER1RIS), ==, 0);
82
+ g_assert_cmpuint(readl(TIMER_BASE + TIMER1VALUE), ==, 500);
83
+
84
+ /* Just past the 1000th tick: timer should have fired */
85
+ clock_step(500 * 40);
86
+ g_assert_cmpuint(readl(TIMER_BASE + TIMER1RIS), ==, 1);
87
+ g_assert_cmpuint(readl(TIMER_BASE + TIMER1VALUE), ==, 0);
88
+
89
+ /*
90
+ * We are in free-running wrapping 16-bit mode, so on the following
91
+ * tick VALUE should have wrapped round to 0xffff.
92
+ */
93
+ clock_step(40);
94
+ g_assert_cmpuint(readl(TIMER_BASE + TIMER1VALUE), ==, 0xffff);
95
+
96
+ /* Check that any write to INTCLR clears interrupt */
97
+ writel(TIMER_BASE + TIMER1INTCLR, 1);
98
+ g_assert_cmpuint(readl(TIMER_BASE + TIMER1RIS), ==, 0);
99
+
100
+ /* Turn off the timer */
101
+ writel(TIMER_BASE + TIMER1CONTROL, 0);
43
+}
102
+}
44
+
103
+
45
+static float32 float32_muladd_nf(float32 dest, float32 op1, float32 op2,
104
+static void test_prescale(void)
46
+ float_status *stat)
47
+{
105
+{
48
+ return float32_add(dest, float32_mul(op1, op2, stat), stat);
106
+ g_assert_true(readl(TIMER_BASE + TIMER2RIS) == 0);
107
+
108
+ /* Start timer: will fire after 40 * 256 * 1000 == 1024000 ns */
109
+ writel(TIMER_BASE + TIMER2LOAD, 1000);
110
+ /* enable in periodic, wrapping, interrupt mode, prescale 256 */
111
+ writel(TIMER_BASE + TIMER2CONTROL,
112
+ CTRL_ENABLE | CTRL_INTEN | CTRL_PERIODIC | CTRL_PRESCALE_256);
113
+
114
+ /* Step to just past the 500th tick and check VALUE */
115
+ clock_step(40 * 256 * 501);
116
+ g_assert_cmpuint(readl(TIMER_BASE + TIMER2RIS), ==, 0);
117
+ g_assert_cmpuint(readl(TIMER_BASE + TIMER2VALUE), ==, 500);
118
+
119
+ /* Just past the 1000th tick: timer should have fired */
120
+ clock_step(40 * 256 * 500);
121
+ g_assert_cmpuint(readl(TIMER_BASE + TIMER2RIS), ==, 1);
122
+ g_assert_cmpuint(readl(TIMER_BASE + TIMER2VALUE), ==, 0);
123
+
124
+ /* In periodic mode the tick VALUE now reloads */
125
+ clock_step(40 * 256);
126
+ g_assert_cmpuint(readl(TIMER_BASE + TIMER2VALUE), ==, 1000);
127
+
128
+ /* Check that any write to INTCLR clears interrupt */
129
+ writel(TIMER_BASE + TIMER2INTCLR, 1);
130
+ g_assert_cmpuint(readl(TIMER_BASE + TIMER2RIS), ==, 0);
131
+
132
+ /* Turn off the timer */
133
+ writel(TIMER_BASE + TIMER2CONTROL, 0);
49
+}
134
+}
50
+
135
+
51
+static float16 float16_mulsub_nf(float16 dest, float16 op1, float16 op2,
136
+int main(int argc, char **argv)
52
+ float_status *stat)
53
+{
137
+{
54
+ return float16_sub(dest, float16_mul(op1, op2, stat), stat);
138
+ int r;
139
+
140
+ g_test_init(&argc, &argv, NULL);
141
+
142
+ qtest_start("-machine mps2-an385");
143
+
144
+ qtest_add_func("/cmsdk-apb-dualtimer/dualtimer", test_dualtimer);
145
+ qtest_add_func("/cmsdk-apb-dualtimer/prescale", test_prescale);
146
+
147
+ r = g_test_run();
148
+
149
+ qtest_end();
150
+
151
+ return r;
55
+}
152
+}
56
+
153
diff --git a/MAINTAINERS b/MAINTAINERS
57
+static float32 float32_mulsub_nf(float32 dest, float32 op1, float32 op2,
58
+ float_status *stat)
59
+{
60
+ return float32_sub(dest, float32_mul(op1, op2, stat), stat);
61
+}
62
+
63
+#define DO_MULADD(NAME, FUNC, TYPE) \
64
+void HELPER(NAME)(void *vd, void *vn, void *vm, void *stat, uint32_t desc) \
65
+{ \
66
+ intptr_t i, oprsz = simd_oprsz(desc); \
67
+ TYPE *d = vd, *n = vn, *m = vm; \
68
+ for (i = 0; i < oprsz / sizeof(TYPE); i++) { \
69
+ d[i] = FUNC(d[i], n[i], m[i], stat); \
70
+ } \
71
+ clear_tail(d, oprsz, simd_maxsz(desc)); \
72
+}
73
+
74
+DO_MULADD(gvec_fmla_h, float16_muladd_nf, float16)
75
+DO_MULADD(gvec_fmla_s, float32_muladd_nf, float32)
76
+
77
+DO_MULADD(gvec_fmls_h, float16_mulsub_nf, float16)
78
+DO_MULADD(gvec_fmls_s, float32_mulsub_nf, float32)
79
+
80
/* For the indexed ops, SVE applies the index per 128-bit vector segment.
81
* For AdvSIMD, there is of course only one such vector segment.
82
*/
83
diff --git a/target/arm/translate-neon.c.inc b/target/arm/translate-neon.c.inc
84
index XXXXXXX..XXXXXXX 100644
154
index XXXXXXX..XXXXXXX 100644
85
--- a/target/arm/translate-neon.c.inc
155
--- a/MAINTAINERS
86
+++ b/target/arm/translate-neon.c.inc
156
+++ b/MAINTAINERS
87
@@ -XXX,XX +XXX,XX @@ DO_3S_FP_GVEC(VACGE, gen_helper_gvec_facge_s, gen_helper_gvec_facge_h)
157
@@ -XXX,XX +XXX,XX @@ F: include/hw/timer/cmsdk-apb-timer.h
88
DO_3S_FP_GVEC(VACGT, gen_helper_gvec_facgt_s, gen_helper_gvec_facgt_h)
158
F: tests/qtest/cmsdk-apb-timer-test.c
89
DO_3S_FP_GVEC(VMAX, gen_helper_gvec_fmax_s, gen_helper_gvec_fmax_h)
159
F: hw/timer/cmsdk-apb-dualtimer.c
90
DO_3S_FP_GVEC(VMIN, gen_helper_gvec_fmin_s, gen_helper_gvec_fmin_h)
160
F: include/hw/timer/cmsdk-apb-dualtimer.h
91
-
161
+F: tests/qtest/cmsdk-apb-dualtimer-test.c
92
-/*
162
F: hw/char/cmsdk-apb-uart.c
93
- * For all the functions using this macro, size == 1 means fp16,
163
F: include/hw/char/cmsdk-apb-uart.h
94
- * which is an architecture extension we don't implement yet.
164
F: hw/watchdog/cmsdk-apb-watchdog.c
95
- */
165
diff --git a/tests/qtest/meson.build b/tests/qtest/meson.build
96
-#define DO_3S_FP(INSN,FUNC,READS_VD) \
166
index XXXXXXX..XXXXXXX 100644
97
- static bool trans_##INSN##_fp_3s(DisasContext *s, arg_3same *a) \
167
--- a/tests/qtest/meson.build
98
- { \
168
+++ b/tests/qtest/meson.build
99
- if (a->size != 0) { \
169
@@ -XXX,XX +XXX,XX @@ qtests_npcm7xx = \
100
- /* TODO fp16 support */ \
170
'npcm7xx_timer-test',
101
- return false; \
171
'npcm7xx_watchdog_timer-test']
102
- } \
172
qtests_arm = \
103
- return do_3same_fp(s, a, FUNC, READS_VD); \
173
+ (config_all_devices.has_key('CONFIG_CMSDK_APB_DUALTIMER') ? ['cmsdk-apb-dualtimer-test'] : []) + \
104
- }
174
(config_all_devices.has_key('CONFIG_CMSDK_APB_TIMER') ? ['cmsdk-apb-timer-test'] : []) + \
105
-
175
(config_all_devices.has_key('CONFIG_CMSDK_APB_WATCHDOG') ? ['cmsdk-apb-watchdog-test'] : []) + \
106
-static void gen_VMLA_fp_3s(TCGv_i32 vd, TCGv_i32 vn, TCGv_i32 vm,
176
(config_all_devices.has_key('CONFIG_PFLASH_CFI02') ? ['pflash-cfi02-test'] : []) + \
107
- TCGv_ptr fpstatus)
108
-{
109
- gen_helper_vfp_muls(vn, vn, vm, fpstatus);
110
- gen_helper_vfp_adds(vd, vd, vn, fpstatus);
111
-}
112
-
113
-static void gen_VMLS_fp_3s(TCGv_i32 vd, TCGv_i32 vn, TCGv_i32 vm,
114
- TCGv_ptr fpstatus)
115
-{
116
- gen_helper_vfp_muls(vn, vn, vm, fpstatus);
117
- gen_helper_vfp_subs(vd, vd, vn, fpstatus);
118
-}
119
-
120
-DO_3S_FP(VMLA, gen_VMLA_fp_3s, true)
121
-DO_3S_FP(VMLS, gen_VMLS_fp_3s, true)
122
+DO_3S_FP_GVEC(VMLA, gen_helper_gvec_fmla_s, gen_helper_gvec_fmla_h)
123
+DO_3S_FP_GVEC(VMLS, gen_helper_gvec_fmls_s, gen_helper_gvec_fmls_h)
124
125
WRAP_FP_GVEC(gen_VMAXNM_fp32_3s, FPST_STD, gen_helper_gvec_fmaxnum_s)
126
WRAP_FP_GVEC(gen_VMAXNM_fp16_3s, FPST_STD_F16, gen_helper_gvec_fmaxnum_h)
127
--
177
--
128
2.20.1
178
2.20.1
129
179
130
180
diff view generated by jsdifflib
1
In several places the target/arm code defines local float constants
1
The state struct for the CMSDK APB timer device doesn't follow our
2
for 2, 3 and 1.5, which are also provided by include/fpu/softfloat.h.
2
usual naming convention of camelcase -- "CMSDK" and "APB" are both
3
Remove the unnecessary local duplicate versions.
3
acronyms, but "TIMER" is not so should not be all-uppercase.
4
Globally rename the struct to "CMSDKAPBTimer" (bringing it into line
5
with CMSDKAPBWatchdog and CMSDKAPBDualTimer; CMSDKAPBUART remains
6
as-is because "UART" is an acronym).
7
8
Commit created with:
9
perl -p -i -e 's/CMSDKAPBTIMER/CMSDKAPBTimer/g' hw/timer/cmsdk-apb-timer.c include/hw/arm/armsse.h include/hw/timer/cmsdk-apb-timer.h
4
10
5
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
11
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
6
Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
12
Reviewed-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
7
Message-id: 20200828183354.27913-2-peter.maydell@linaro.org
13
Reviewed-by: Luc Michel <luc@lmichel.fr>
14
Tested-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
15
Message-id: 20210128114145.20536-7-peter.maydell@linaro.org
16
Message-id: 20210121190622.22000-7-peter.maydell@linaro.org
8
---
17
---
9
target/arm/helper-a64.c | 11 -----------
18
include/hw/arm/armsse.h | 6 +++---
10
target/arm/translate-sve.c | 4 ----
19
include/hw/timer/cmsdk-apb-timer.h | 4 ++--
11
target/arm/vfp_helper.c | 4 ----
20
hw/timer/cmsdk-apb-timer.c | 28 ++++++++++++++--------------
12
3 files changed, 19 deletions(-)
21
3 files changed, 19 insertions(+), 19 deletions(-)
13
22
14
diff --git a/target/arm/helper-a64.c b/target/arm/helper-a64.c
23
diff --git a/include/hw/arm/armsse.h b/include/hw/arm/armsse.h
15
index XXXXXXX..XXXXXXX 100644
24
index XXXXXXX..XXXXXXX 100644
16
--- a/target/arm/helper-a64.c
25
--- a/include/hw/arm/armsse.h
17
+++ b/target/arm/helper-a64.c
26
+++ b/include/hw/arm/armsse.h
18
@@ -XXX,XX +XXX,XX @@ uint64_t HELPER(neon_cgt_f64)(float64 a, float64 b, void *fpstp)
27
@@ -XXX,XX +XXX,XX @@ struct ARMSSE {
19
* versions, these do a fully fused multiply-add or
28
TZPPC apb_ppc0;
20
* multiply-add-and-halve.
29
TZPPC apb_ppc1;
21
*/
30
TZMPC mpc[IOTS_NUM_MPC];
22
-#define float16_two make_float16(0x4000)
31
- CMSDKAPBTIMER timer0;
23
-#define float16_three make_float16(0x4200)
32
- CMSDKAPBTIMER timer1;
24
-#define float16_one_point_five make_float16(0x3e00)
33
- CMSDKAPBTIMER s32ktimer;
25
-
34
+ CMSDKAPBTimer timer0;
26
-#define float32_two make_float32(0x40000000)
35
+ CMSDKAPBTimer timer1;
27
-#define float32_three make_float32(0x40400000)
36
+ CMSDKAPBTimer s32ktimer;
28
-#define float32_one_point_five make_float32(0x3fc00000)
37
qemu_or_irq ppc_irq_orgate;
29
-
38
SplitIRQ sec_resp_splitter;
30
-#define float64_two make_float64(0x4000000000000000ULL)
39
SplitIRQ ppc_irq_splitter[NUM_PPCS];
31
-#define float64_three make_float64(0x4008000000000000ULL)
40
diff --git a/include/hw/timer/cmsdk-apb-timer.h b/include/hw/timer/cmsdk-apb-timer.h
32
-#define float64_one_point_five make_float64(0x3FF8000000000000ULL)
41
index XXXXXXX..XXXXXXX 100644
33
42
--- a/include/hw/timer/cmsdk-apb-timer.h
34
uint32_t HELPER(recpsf_f16)(uint32_t a, uint32_t b, void *fpstp)
43
+++ b/include/hw/timer/cmsdk-apb-timer.h
44
@@ -XXX,XX +XXX,XX @@
45
#include "qom/object.h"
46
47
#define TYPE_CMSDK_APB_TIMER "cmsdk-apb-timer"
48
-OBJECT_DECLARE_SIMPLE_TYPE(CMSDKAPBTIMER, CMSDK_APB_TIMER)
49
+OBJECT_DECLARE_SIMPLE_TYPE(CMSDKAPBTimer, CMSDK_APB_TIMER)
50
51
-struct CMSDKAPBTIMER {
52
+struct CMSDKAPBTimer {
53
/*< private >*/
54
SysBusDevice parent_obj;
55
56
diff --git a/hw/timer/cmsdk-apb-timer.c b/hw/timer/cmsdk-apb-timer.c
57
index XXXXXXX..XXXXXXX 100644
58
--- a/hw/timer/cmsdk-apb-timer.c
59
+++ b/hw/timer/cmsdk-apb-timer.c
60
@@ -XXX,XX +XXX,XX @@ static const int timer_id[] = {
61
0x0d, 0xf0, 0x05, 0xb1, /* CID0..CID3 */
62
};
63
64
-static void cmsdk_apb_timer_update(CMSDKAPBTIMER *s)
65
+static void cmsdk_apb_timer_update(CMSDKAPBTimer *s)
35
{
66
{
36
diff --git a/target/arm/translate-sve.c b/target/arm/translate-sve.c
67
qemu_set_irq(s->timerint, !!(s->intstatus & R_INTSTATUS_IRQ_MASK));
37
index XXXXXXX..XXXXXXX 100644
38
--- a/target/arm/translate-sve.c
39
+++ b/target/arm/translate-sve.c
40
@@ -XXX,XX +XXX,XX @@ static bool trans_##NAME##_zpzi(DisasContext *s, arg_rpri_esz *a) \
41
return true; \
42
}
68
}
43
69
44
-#define float16_two make_float16(0x4000)
70
static uint64_t cmsdk_apb_timer_read(void *opaque, hwaddr offset, unsigned size)
45
-#define float32_two make_float32(0x40000000)
46
-#define float64_two make_float64(0x4000000000000000ULL)
47
-
48
DO_FP_IMM(FADD, fadds, half, one)
49
DO_FP_IMM(FSUB, fsubs, half, one)
50
DO_FP_IMM(FMUL, fmuls, half, two)
51
diff --git a/target/arm/vfp_helper.c b/target/arm/vfp_helper.c
52
index XXXXXXX..XXXXXXX 100644
53
--- a/target/arm/vfp_helper.c
54
+++ b/target/arm/vfp_helper.c
55
@@ -XXX,XX +XXX,XX @@ uint32_t HELPER(vfp_fcvt_f64_to_f16)(float64 a, void *fpstp, uint32_t ahp_mode)
56
return r;
57
}
58
59
-#define float32_two make_float32(0x40000000)
60
-#define float32_three make_float32(0x40400000)
61
-#define float32_one_point_five make_float32(0x3fc00000)
62
-
63
float32 HELPER(recps_f32)(CPUARMState *env, float32 a, float32 b)
64
{
71
{
65
float_status *s = &env->vfp.standard_fp_status;
72
- CMSDKAPBTIMER *s = CMSDK_APB_TIMER(opaque);
73
+ CMSDKAPBTimer *s = CMSDK_APB_TIMER(opaque);
74
uint64_t r;
75
76
switch (offset) {
77
@@ -XXX,XX +XXX,XX @@ static uint64_t cmsdk_apb_timer_read(void *opaque, hwaddr offset, unsigned size)
78
static void cmsdk_apb_timer_write(void *opaque, hwaddr offset, uint64_t value,
79
unsigned size)
80
{
81
- CMSDKAPBTIMER *s = CMSDK_APB_TIMER(opaque);
82
+ CMSDKAPBTimer *s = CMSDK_APB_TIMER(opaque);
83
84
trace_cmsdk_apb_timer_write(offset, value, size);
85
86
@@ -XXX,XX +XXX,XX @@ static const MemoryRegionOps cmsdk_apb_timer_ops = {
87
88
static void cmsdk_apb_timer_tick(void *opaque)
89
{
90
- CMSDKAPBTIMER *s = CMSDK_APB_TIMER(opaque);
91
+ CMSDKAPBTimer *s = CMSDK_APB_TIMER(opaque);
92
93
if (s->ctrl & R_CTRL_IRQEN_MASK) {
94
s->intstatus |= R_INTSTATUS_IRQ_MASK;
95
@@ -XXX,XX +XXX,XX @@ static void cmsdk_apb_timer_tick(void *opaque)
96
97
static void cmsdk_apb_timer_reset(DeviceState *dev)
98
{
99
- CMSDKAPBTIMER *s = CMSDK_APB_TIMER(dev);
100
+ CMSDKAPBTimer *s = CMSDK_APB_TIMER(dev);
101
102
trace_cmsdk_apb_timer_reset();
103
s->ctrl = 0;
104
@@ -XXX,XX +XXX,XX @@ static void cmsdk_apb_timer_reset(DeviceState *dev)
105
static void cmsdk_apb_timer_init(Object *obj)
106
{
107
SysBusDevice *sbd = SYS_BUS_DEVICE(obj);
108
- CMSDKAPBTIMER *s = CMSDK_APB_TIMER(obj);
109
+ CMSDKAPBTimer *s = CMSDK_APB_TIMER(obj);
110
111
memory_region_init_io(&s->iomem, obj, &cmsdk_apb_timer_ops,
112
s, "cmsdk-apb-timer", 0x1000);
113
@@ -XXX,XX +XXX,XX @@ static void cmsdk_apb_timer_init(Object *obj)
114
115
static void cmsdk_apb_timer_realize(DeviceState *dev, Error **errp)
116
{
117
- CMSDKAPBTIMER *s = CMSDK_APB_TIMER(dev);
118
+ CMSDKAPBTimer *s = CMSDK_APB_TIMER(dev);
119
120
if (s->pclk_frq == 0) {
121
error_setg(errp, "CMSDK APB timer: pclk-frq property must be set");
122
@@ -XXX,XX +XXX,XX @@ static const VMStateDescription cmsdk_apb_timer_vmstate = {
123
.version_id = 1,
124
.minimum_version_id = 1,
125
.fields = (VMStateField[]) {
126
- VMSTATE_PTIMER(timer, CMSDKAPBTIMER),
127
- VMSTATE_UINT32(ctrl, CMSDKAPBTIMER),
128
- VMSTATE_UINT32(value, CMSDKAPBTIMER),
129
- VMSTATE_UINT32(reload, CMSDKAPBTIMER),
130
- VMSTATE_UINT32(intstatus, CMSDKAPBTIMER),
131
+ VMSTATE_PTIMER(timer, CMSDKAPBTimer),
132
+ VMSTATE_UINT32(ctrl, CMSDKAPBTimer),
133
+ VMSTATE_UINT32(value, CMSDKAPBTimer),
134
+ VMSTATE_UINT32(reload, CMSDKAPBTimer),
135
+ VMSTATE_UINT32(intstatus, CMSDKAPBTimer),
136
VMSTATE_END_OF_LIST()
137
}
138
};
139
140
static Property cmsdk_apb_timer_properties[] = {
141
- DEFINE_PROP_UINT32("pclk-frq", CMSDKAPBTIMER, pclk_frq, 0),
142
+ DEFINE_PROP_UINT32("pclk-frq", CMSDKAPBTimer, pclk_frq, 0),
143
DEFINE_PROP_END_OF_LIST(),
144
};
145
146
@@ -XXX,XX +XXX,XX @@ static void cmsdk_apb_timer_class_init(ObjectClass *klass, void *data)
147
static const TypeInfo cmsdk_apb_timer_info = {
148
.name = TYPE_CMSDK_APB_TIMER,
149
.parent = TYPE_SYS_BUS_DEVICE,
150
- .instance_size = sizeof(CMSDKAPBTIMER),
151
+ .instance_size = sizeof(CMSDKAPBTimer),
152
.instance_init = cmsdk_apb_timer_init,
153
.class_init = cmsdk_apb_timer_class_init,
154
};
66
--
155
--
67
2.20.1
156
2.20.1
68
157
69
158
diff view generated by jsdifflib
1
Currently the VFP_CONV_FIX macros take a single fsz argument for the
1
As the first step in converting the CMSDK_APB_TIMER device to the
2
size of the float type, which is used both to select the name of
2
Clock framework, add a Clock input. For the moment we do nothing
3
the functions to call (eg float32_is_any_nan()) and also for the
3
with this clock; we will change the behaviour from using the pclk-frq
4
type to use for the float inputs and outputs (eg float32).
4
property to using the Clock once all the users of this device have
5
been converted to wire up the Clock.
5
6
6
Separate these into fsz and ftype arguments, so that we can use them
7
Since the device doesn't already have a doc comment for its "QEMU
7
for fp16, which uses 'float16' in the function names but is still
8
interface", we add one including the new Clock.
8
passing inputs and outputs in a 32-bit sized type.
9
10
This is a migration compatibility break for machines mps2-an505,
11
mps2-an521, musca-a, musca-b1.
9
12
10
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
13
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
11
Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
14
Reviewed-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
12
Message-id: 20200828183354.27913-14-peter.maydell@linaro.org
15
Reviewed-by: Luc Michel <luc@lmichel.fr>
16
Tested-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
17
Message-id: 20210128114145.20536-8-peter.maydell@linaro.org
18
Message-id: 20210121190622.22000-8-peter.maydell@linaro.org
13
---
19
---
14
target/arm/vfp_helper.c | 46 ++++++++++++++++++++---------------------
20
include/hw/timer/cmsdk-apb-timer.h | 9 +++++++++
15
1 file changed, 23 insertions(+), 23 deletions(-)
21
hw/timer/cmsdk-apb-timer.c | 7 +++++--
22
2 files changed, 14 insertions(+), 2 deletions(-)
16
23
17
diff --git a/target/arm/vfp_helper.c b/target/arm/vfp_helper.c
24
diff --git a/include/hw/timer/cmsdk-apb-timer.h b/include/hw/timer/cmsdk-apb-timer.h
18
index XXXXXXX..XXXXXXX 100644
25
index XXXXXXX..XXXXXXX 100644
19
--- a/target/arm/vfp_helper.c
26
--- a/include/hw/timer/cmsdk-apb-timer.h
20
+++ b/target/arm/vfp_helper.c
27
+++ b/include/hw/timer/cmsdk-apb-timer.h
21
@@ -XXX,XX +XXX,XX @@ float32 VFP_HELPER(fcvts, d)(float64 x, CPUARMState *env)
28
@@ -XXX,XX +XXX,XX @@
29
#include "hw/qdev-properties.h"
30
#include "hw/sysbus.h"
31
#include "hw/ptimer.h"
32
+#include "hw/clock.h"
33
#include "qom/object.h"
34
35
#define TYPE_CMSDK_APB_TIMER "cmsdk-apb-timer"
36
OBJECT_DECLARE_SIMPLE_TYPE(CMSDKAPBTimer, CMSDK_APB_TIMER)
37
38
+/*
39
+ * QEMU interface:
40
+ * + QOM property "pclk-frq": frequency at which the timer is clocked
41
+ * + Clock input "pclk": clock for the timer
42
+ * + sysbus MMIO region 0: the register bank
43
+ * + sysbus IRQ 0: timer interrupt TIMERINT
44
+ */
45
struct CMSDKAPBTimer {
46
/*< private >*/
47
SysBusDevice parent_obj;
48
@@ -XXX,XX +XXX,XX @@ struct CMSDKAPBTimer {
49
qemu_irq timerint;
50
uint32_t pclk_frq;
51
struct ptimer_state *timer;
52
+ Clock *pclk;
53
54
uint32_t ctrl;
55
uint32_t value;
56
diff --git a/hw/timer/cmsdk-apb-timer.c b/hw/timer/cmsdk-apb-timer.c
57
index XXXXXXX..XXXXXXX 100644
58
--- a/hw/timer/cmsdk-apb-timer.c
59
+++ b/hw/timer/cmsdk-apb-timer.c
60
@@ -XXX,XX +XXX,XX @@
61
#include "hw/sysbus.h"
62
#include "hw/irq.h"
63
#include "hw/registerfields.h"
64
+#include "hw/qdev-clock.h"
65
#include "hw/timer/cmsdk-apb-timer.h"
66
#include "migration/vmstate.h"
67
68
@@ -XXX,XX +XXX,XX @@ static void cmsdk_apb_timer_init(Object *obj)
69
s, "cmsdk-apb-timer", 0x1000);
70
sysbus_init_mmio(sbd, &s->iomem);
71
sysbus_init_irq(sbd, &s->timerint);
72
+ s->pclk = qdev_init_clock_in(DEVICE(s), "pclk", NULL, NULL);
22
}
73
}
23
74
24
/* VFP3 fixed point conversion. */
75
static void cmsdk_apb_timer_realize(DeviceState *dev, Error **errp)
25
-#define VFP_CONV_FIX_FLOAT(name, p, fsz, isz, itype) \
76
@@ -XXX,XX +XXX,XX @@ static void cmsdk_apb_timer_realize(DeviceState *dev, Error **errp)
26
-float##fsz HELPER(vfp_##name##to##p)(uint##isz##_t x, uint32_t shift, \
77
27
+#define VFP_CONV_FIX_FLOAT(name, p, fsz, ftype, isz, itype) \
78
static const VMStateDescription cmsdk_apb_timer_vmstate = {
28
+ftype HELPER(vfp_##name##to##p)(uint##isz##_t x, uint32_t shift, \
79
.name = "cmsdk-apb-timer",
29
void *fpstp) \
80
- .version_id = 1,
30
{ return itype##_to_##float##fsz##_scalbn(x, -shift, fpstp); }
81
- .minimum_version_id = 1,
31
82
+ .version_id = 2,
32
-#define VFP_CONV_FLOAT_FIX_ROUND(name, p, fsz, isz, itype, ROUND, suff) \
83
+ .minimum_version_id = 2,
33
-uint##isz##_t HELPER(vfp_to##name##p##suff)(float##fsz x, uint32_t shift, \
84
.fields = (VMStateField[]) {
34
+#define VFP_CONV_FLOAT_FIX_ROUND(name, p, fsz, ftype, isz, itype, ROUND, suff) \
85
VMSTATE_PTIMER(timer, CMSDKAPBTimer),
35
+uint##isz##_t HELPER(vfp_to##name##p##suff)(ftype x, uint32_t shift, \
86
+ VMSTATE_CLOCK(pclk, CMSDKAPBTimer),
36
void *fpst) \
87
VMSTATE_UINT32(ctrl, CMSDKAPBTimer),
37
{ \
88
VMSTATE_UINT32(value, CMSDKAPBTimer),
38
if (unlikely(float##fsz##_is_any_nan(x))) { \
89
VMSTATE_UINT32(reload, CMSDKAPBTimer),
39
@@ -XXX,XX +XXX,XX @@ uint##isz##_t HELPER(vfp_to##name##p##suff)(float##fsz x, uint32_t shift, \
40
return float##fsz##_to_##itype##_scalbn(x, ROUND, shift, fpst); \
41
}
42
43
-#define VFP_CONV_FIX(name, p, fsz, isz, itype) \
44
-VFP_CONV_FIX_FLOAT(name, p, fsz, isz, itype) \
45
-VFP_CONV_FLOAT_FIX_ROUND(name, p, fsz, isz, itype, \
46
+#define VFP_CONV_FIX(name, p, fsz, ftype, isz, itype) \
47
+VFP_CONV_FIX_FLOAT(name, p, fsz, ftype, isz, itype) \
48
+VFP_CONV_FLOAT_FIX_ROUND(name, p, fsz, ftype, isz, itype, \
49
float_round_to_zero, _round_to_zero) \
50
-VFP_CONV_FLOAT_FIX_ROUND(name, p, fsz, isz, itype, \
51
+VFP_CONV_FLOAT_FIX_ROUND(name, p, fsz, ftype, isz, itype, \
52
get_float_rounding_mode(fpst), )
53
54
-#define VFP_CONV_FIX_A64(name, p, fsz, isz, itype) \
55
-VFP_CONV_FIX_FLOAT(name, p, fsz, isz, itype) \
56
-VFP_CONV_FLOAT_FIX_ROUND(name, p, fsz, isz, itype, \
57
+#define VFP_CONV_FIX_A64(name, p, fsz, ftype, isz, itype) \
58
+VFP_CONV_FIX_FLOAT(name, p, fsz, ftype, isz, itype) \
59
+VFP_CONV_FLOAT_FIX_ROUND(name, p, fsz, ftype, isz, itype, \
60
get_float_rounding_mode(fpst), )
61
62
-VFP_CONV_FIX(sh, d, 64, 64, int16)
63
-VFP_CONV_FIX(sl, d, 64, 64, int32)
64
-VFP_CONV_FIX_A64(sq, d, 64, 64, int64)
65
-VFP_CONV_FIX(uh, d, 64, 64, uint16)
66
-VFP_CONV_FIX(ul, d, 64, 64, uint32)
67
-VFP_CONV_FIX_A64(uq, d, 64, 64, uint64)
68
-VFP_CONV_FIX(sh, s, 32, 32, int16)
69
-VFP_CONV_FIX(sl, s, 32, 32, int32)
70
-VFP_CONV_FIX_A64(sq, s, 32, 64, int64)
71
-VFP_CONV_FIX(uh, s, 32, 32, uint16)
72
-VFP_CONV_FIX(ul, s, 32, 32, uint32)
73
-VFP_CONV_FIX_A64(uq, s, 32, 64, uint64)
74
+VFP_CONV_FIX(sh, d, 64, float64, 64, int16)
75
+VFP_CONV_FIX(sl, d, 64, float64, 64, int32)
76
+VFP_CONV_FIX_A64(sq, d, 64, float64, 64, int64)
77
+VFP_CONV_FIX(uh, d, 64, float64, 64, uint16)
78
+VFP_CONV_FIX(ul, d, 64, float64, 64, uint32)
79
+VFP_CONV_FIX_A64(uq, d, 64, float64, 64, uint64)
80
+VFP_CONV_FIX(sh, s, 32, float32, 32, int16)
81
+VFP_CONV_FIX(sl, s, 32, float32, 32, int32)
82
+VFP_CONV_FIX_A64(sq, s, 32, float32, 64, int64)
83
+VFP_CONV_FIX(uh, s, 32, float32, 32, uint16)
84
+VFP_CONV_FIX(ul, s, 32, float32, 32, uint32)
85
+VFP_CONV_FIX_A64(uq, s, 32, float32, 64, uint64)
86
87
#undef VFP_CONV_FIX
88
#undef VFP_CONV_FIX_FLOAT
89
--
90
--
90
2.20.1
91
2.20.1
91
92
92
93
diff view generated by jsdifflib
1
Implement fp16 version of VCMP.
1
As the first step in converting the CMSDK_APB_DUALTIMER device to the
2
Clock framework, add a Clock input. For the moment we do nothing
3
with this clock; we will change the behaviour from using the pclk-frq
4
property to using the Clock once all the users of this device have
5
been converted to wire up the Clock.
6
7
We take the opportunity to correct the name of the clock input to
8
match the hardware -- the dual timer names the clock which drives the
9
timers TIMCLK. (It does also have a 'pclk' input, which is used only
10
for the register and APB bus logic; on the SSE-200 these clocks are
11
both connected together.)
12
13
This is a migration compatibility break for machines mps2-an385,
14
mps2-an386, mps2-an500, mps2-an511, mps2-an505, mps2-an521, musca-a,
15
musca-b1.
2
16
3
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
17
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
4
Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
18
Reviewed-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
5
Message-id: 20200828183354.27913-11-peter.maydell@linaro.org
19
Reviewed-by: Luc Michel <luc@lmichel.fr>
20
Tested-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
21
Message-id: 20210128114145.20536-9-peter.maydell@linaro.org
22
Message-id: 20210121190622.22000-9-peter.maydell@linaro.org
6
---
23
---
7
target/arm/helper.h | 2 ++
24
include/hw/timer/cmsdk-apb-dualtimer.h | 3 +++
8
target/arm/vfp.decode | 2 ++
25
hw/timer/cmsdk-apb-dualtimer.c | 7 +++++--
9
target/arm/vfp_helper.c | 15 +++++++------
26
2 files changed, 8 insertions(+), 2 deletions(-)
10
target/arm/translate-vfp.c.inc | 39 ++++++++++++++++++++++++++++++++++
11
4 files changed, 51 insertions(+), 7 deletions(-)
12
27
13
diff --git a/target/arm/helper.h b/target/arm/helper.h
28
diff --git a/include/hw/timer/cmsdk-apb-dualtimer.h b/include/hw/timer/cmsdk-apb-dualtimer.h
14
index XXXXXXX..XXXXXXX 100644
29
index XXXXXXX..XXXXXXX 100644
15
--- a/target/arm/helper.h
30
--- a/include/hw/timer/cmsdk-apb-dualtimer.h
16
+++ b/target/arm/helper.h
31
+++ b/include/hw/timer/cmsdk-apb-dualtimer.h
17
@@ -XXX,XX +XXX,XX @@ DEF_HELPER_1(vfp_absd, f64, f64)
32
@@ -XXX,XX +XXX,XX @@
18
DEF_HELPER_2(vfp_sqrth, f16, f16, env)
33
*
19
DEF_HELPER_2(vfp_sqrts, f32, f32, env)
34
* QEMU interface:
20
DEF_HELPER_2(vfp_sqrtd, f64, f64, env)
35
* + QOM property "pclk-frq": frequency at which the timer is clocked
21
+DEF_HELPER_3(vfp_cmph, void, f16, f16, env)
36
+ * + Clock input "TIMCLK": clock (for both timers)
22
DEF_HELPER_3(vfp_cmps, void, f32, f32, env)
37
* + sysbus MMIO region 0: the register bank
23
DEF_HELPER_3(vfp_cmpd, void, f64, f64, env)
38
* + sysbus IRQ 0: combined timer interrupt TIMINTC
24
+DEF_HELPER_3(vfp_cmpeh, void, f16, f16, env)
39
* + sysbus IRO 1: timer block 1 interrupt TIMINT1
25
DEF_HELPER_3(vfp_cmpes, void, f32, f32, env)
40
@@ -XXX,XX +XXX,XX @@
26
DEF_HELPER_3(vfp_cmped, void, f64, f64, env)
41
27
42
#include "hw/sysbus.h"
28
diff --git a/target/arm/vfp.decode b/target/arm/vfp.decode
43
#include "hw/ptimer.h"
44
+#include "hw/clock.h"
45
#include "qom/object.h"
46
47
#define TYPE_CMSDK_APB_DUALTIMER "cmsdk-apb-dualtimer"
48
@@ -XXX,XX +XXX,XX @@ struct CMSDKAPBDualTimer {
49
MemoryRegion iomem;
50
qemu_irq timerintc;
51
uint32_t pclk_frq;
52
+ Clock *timclk;
53
54
CMSDKAPBDualTimerModule timermod[CMSDK_APB_DUALTIMER_NUM_MODULES];
55
uint32_t timeritcr;
56
diff --git a/hw/timer/cmsdk-apb-dualtimer.c b/hw/timer/cmsdk-apb-dualtimer.c
29
index XXXXXXX..XXXXXXX 100644
57
index XXXXXXX..XXXXXXX 100644
30
--- a/target/arm/vfp.decode
58
--- a/hw/timer/cmsdk-apb-dualtimer.c
31
+++ b/target/arm/vfp.decode
59
+++ b/hw/timer/cmsdk-apb-dualtimer.c
32
@@ -XXX,XX +XXX,XX @@ VSQRT_hp ---- 1110 1.11 0001 .... 1001 11.0 .... @vfp_dm_ss
60
@@ -XXX,XX +XXX,XX @@
33
VSQRT_sp ---- 1110 1.11 0001 .... 1010 11.0 .... @vfp_dm_ss
61
#include "hw/irq.h"
34
VSQRT_dp ---- 1110 1.11 0001 .... 1011 11.0 .... @vfp_dm_dd
62
#include "hw/qdev-properties.h"
35
63
#include "hw/registerfields.h"
36
+VCMP_hp ---- 1110 1.11 010 z:1 .... 1001 e:1 1.0 .... \
64
+#include "hw/qdev-clock.h"
37
+ vd=%vd_sp vm=%vm_sp
65
#include "hw/timer/cmsdk-apb-dualtimer.h"
38
VCMP_sp ---- 1110 1.11 010 z:1 .... 1010 e:1 1.0 .... \
66
#include "migration/vmstate.h"
39
vd=%vd_sp vm=%vm_sp
67
40
VCMP_dp ---- 1110 1.11 010 z:1 .... 1011 e:1 1.0 .... \
68
@@ -XXX,XX +XXX,XX @@ static void cmsdk_apb_dualtimer_init(Object *obj)
41
diff --git a/target/arm/vfp_helper.c b/target/arm/vfp_helper.c
69
for (i = 0; i < ARRAY_SIZE(s->timermod); i++) {
42
index XXXXXXX..XXXXXXX 100644
70
sysbus_init_irq(sbd, &s->timermod[i].timerint);
43
--- a/target/arm/vfp_helper.c
71
}
44
+++ b/target/arm/vfp_helper.c
72
+ s->timclk = qdev_init_clock_in(DEVICE(s), "TIMCLK", NULL, NULL);
45
@@ -XXX,XX +XXX,XX @@ static void softfloat_to_vfp_compare(CPUARMState *env, FloatRelation cmp)
46
}
73
}
47
74
48
/* XXX: check quiet/signaling case */
75
static void cmsdk_apb_dualtimer_realize(DeviceState *dev, Error **errp)
49
-#define DO_VFP_cmp(p, type) \
76
@@ -XXX,XX +XXX,XX @@ static const VMStateDescription cmsdk_dualtimermod_vmstate = {
50
-void VFP_HELPER(cmp, p)(type a, type b, CPUARMState *env) \
77
51
+#define DO_VFP_cmp(P, FLOATTYPE, ARGTYPE, FPST) \
78
static const VMStateDescription cmsdk_apb_dualtimer_vmstate = {
52
+void VFP_HELPER(cmp, P)(ARGTYPE a, ARGTYPE b, CPUARMState *env) \
79
.name = "cmsdk-apb-dualtimer",
53
{ \
80
- .version_id = 1,
54
softfloat_to_vfp_compare(env, \
81
- .minimum_version_id = 1,
55
- type ## _compare_quiet(a, b, &env->vfp.fp_status)); \
82
+ .version_id = 2,
56
+ FLOATTYPE ## _compare_quiet(a, b, &env->vfp.FPST)); \
83
+ .minimum_version_id = 2,
57
} \
84
.fields = (VMStateField[]) {
58
-void VFP_HELPER(cmpe, p)(type a, type b, CPUARMState *env) \
85
+ VMSTATE_CLOCK(timclk, CMSDKAPBDualTimer),
59
+void VFP_HELPER(cmpe, P)(ARGTYPE a, ARGTYPE b, CPUARMState *env) \
86
VMSTATE_STRUCT_ARRAY(timermod, CMSDKAPBDualTimer,
60
{ \
87
CMSDK_APB_DUALTIMER_NUM_MODULES,
61
softfloat_to_vfp_compare(env, \
88
1, cmsdk_dualtimermod_vmstate,
62
- type ## _compare(a, b, &env->vfp.fp_status)); \
63
+ FLOATTYPE ## _compare(a, b, &env->vfp.FPST)); \
64
}
65
-DO_VFP_cmp(s, float32)
66
-DO_VFP_cmp(d, float64)
67
+DO_VFP_cmp(h, float16, dh_ctype_f16, fp_status_f16)
68
+DO_VFP_cmp(s, float32, float32, fp_status)
69
+DO_VFP_cmp(d, float64, float64, fp_status)
70
#undef DO_VFP_cmp
71
72
/* Integer to float and float to integer conversions */
73
diff --git a/target/arm/translate-vfp.c.inc b/target/arm/translate-vfp.c.inc
74
index XXXXXXX..XXXXXXX 100644
75
--- a/target/arm/translate-vfp.c.inc
76
+++ b/target/arm/translate-vfp.c.inc
77
@@ -XXX,XX +XXX,XX @@ DO_VFP_2OP(VSQRT, hp, gen_VSQRT_hp)
78
DO_VFP_2OP(VSQRT, sp, gen_VSQRT_sp)
79
DO_VFP_2OP(VSQRT, dp, gen_VSQRT_dp)
80
81
+static bool trans_VCMP_hp(DisasContext *s, arg_VCMP_sp *a)
82
+{
83
+ TCGv_i32 vd, vm;
84
+
85
+ if (!dc_isar_feature(aa32_fp16_arith, s)) {
86
+ return false;
87
+ }
88
+
89
+ /* Vm/M bits must be zero for the Z variant */
90
+ if (a->z && a->vm != 0) {
91
+ return false;
92
+ }
93
+
94
+ if (!vfp_access_check(s)) {
95
+ return true;
96
+ }
97
+
98
+ vd = tcg_temp_new_i32();
99
+ vm = tcg_temp_new_i32();
100
+
101
+ neon_load_reg32(vd, a->vd);
102
+ if (a->z) {
103
+ tcg_gen_movi_i32(vm, 0);
104
+ } else {
105
+ neon_load_reg32(vm, a->vm);
106
+ }
107
+
108
+ if (a->e) {
109
+ gen_helper_vfp_cmpeh(vd, vm, cpu_env);
110
+ } else {
111
+ gen_helper_vfp_cmph(vd, vm, cpu_env);
112
+ }
113
+
114
+ tcg_temp_free_i32(vd);
115
+ tcg_temp_free_i32(vm);
116
+
117
+ return true;
118
+}
119
+
120
static bool trans_VCMP_sp(DisasContext *s, arg_VCMP_sp *a)
121
{
122
TCGv_i32 vd, vm;
123
--
89
--
124
2.20.1
90
2.20.1
125
91
126
92
diff view generated by jsdifflib
1
The fp16 extension includes a new instruction VMOVX, which copies the
1
As the first step in converting the CMSDK_APB_TIMER device to the
2
upper 16 bits of a 32-bit source VFP register into the lower 16
2
Clock framework, add a Clock input. For the moment we do nothing
3
bits of the destination and zeroes the high half of the destination.
3
with this clock; we will change the behaviour from using the
4
Implement it.
4
wdogclk-frq property to using the Clock once all the users of this
5
device have been converted to wire up the Clock.
6
7
This is a migration compatibility break for machines mps2-an385,
8
mps2-an386, mps2-an500, mps2-an511, mps2-an505, mps2-an521, musca-a,
9
musca-b1, lm3s811evb, lm3s6965evb.
5
10
6
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
11
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
7
Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
12
Reviewed-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
8
Message-id: 20200828183354.27913-21-peter.maydell@linaro.org
13
Reviewed-by: Luc Michel <luc@lmichel.fr>
14
Tested-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
15
Message-id: 20210128114145.20536-10-peter.maydell@linaro.org
16
Message-id: 20210121190622.22000-10-peter.maydell@linaro.org
9
---
17
---
10
target/arm/vfp-uncond.decode | 3 +++
18
include/hw/watchdog/cmsdk-apb-watchdog.h | 3 +++
11
target/arm/translate-vfp.c.inc | 25 +++++++++++++++++++++++++
19
hw/watchdog/cmsdk-apb-watchdog.c | 7 +++++--
12
2 files changed, 28 insertions(+)
20
2 files changed, 8 insertions(+), 2 deletions(-)
13
21
14
diff --git a/target/arm/vfp-uncond.decode b/target/arm/vfp-uncond.decode
22
diff --git a/include/hw/watchdog/cmsdk-apb-watchdog.h b/include/hw/watchdog/cmsdk-apb-watchdog.h
15
index XXXXXXX..XXXXXXX 100644
23
index XXXXXXX..XXXXXXX 100644
16
--- a/target/arm/vfp-uncond.decode
24
--- a/include/hw/watchdog/cmsdk-apb-watchdog.h
17
+++ b/target/arm/vfp-uncond.decode
25
+++ b/include/hw/watchdog/cmsdk-apb-watchdog.h
18
@@ -XXX,XX +XXX,XX @@ VCVT 1111 1110 1.11 11 rm:2 .... 1010 op:1 1.0 .... \
26
@@ -XXX,XX +XXX,XX @@
19
VCVT 1111 1110 1.11 11 rm:2 .... 1011 op:1 1.0 .... \
27
*
20
vm=%vm_dp vd=%vd_sp sz=3
28
* QEMU interface:
21
29
* + QOM property "wdogclk-frq": frequency at which the watchdog is clocked
22
+VMOVX 1111 1110 1.11 0000 .... 1010 01 . 0 .... \
30
+ * + Clock input "WDOGCLK": clock for the watchdog's timer
23
+ vd=%vd_sp vm=%vm_sp
31
* + sysbus MMIO region 0: the register bank
24
+
32
* + sysbus IRQ 0: watchdog interrupt
25
VINS 1111 1110 1.11 0000 .... 1010 11 . 0 .... \
33
*
26
vd=%vd_sp vm=%vm_sp
34
@@ -XXX,XX +XXX,XX @@
27
diff --git a/target/arm/translate-vfp.c.inc b/target/arm/translate-vfp.c.inc
35
36
#include "hw/sysbus.h"
37
#include "hw/ptimer.h"
38
+#include "hw/clock.h"
39
#include "qom/object.h"
40
41
#define TYPE_CMSDK_APB_WATCHDOG "cmsdk-apb-watchdog"
42
@@ -XXX,XX +XXX,XX @@ struct CMSDKAPBWatchdog {
43
uint32_t wdogclk_frq;
44
bool is_luminary;
45
struct ptimer_state *timer;
46
+ Clock *wdogclk;
47
48
uint32_t control;
49
uint32_t intstatus;
50
diff --git a/hw/watchdog/cmsdk-apb-watchdog.c b/hw/watchdog/cmsdk-apb-watchdog.c
28
index XXXXXXX..XXXXXXX 100644
51
index XXXXXXX..XXXXXXX 100644
29
--- a/target/arm/translate-vfp.c.inc
52
--- a/hw/watchdog/cmsdk-apb-watchdog.c
30
+++ b/target/arm/translate-vfp.c.inc
53
+++ b/hw/watchdog/cmsdk-apb-watchdog.c
31
@@ -XXX,XX +XXX,XX @@ static bool trans_VINS(DisasContext *s, arg_VINS *a)
54
@@ -XXX,XX +XXX,XX @@
32
tcg_temp_free_i32(rd);
55
#include "hw/irq.h"
33
return true;
56
#include "hw/qdev-properties.h"
34
}
57
#include "hw/registerfields.h"
35
+
58
+#include "hw/qdev-clock.h"
36
+static bool trans_VMOVX(DisasContext *s, arg_VINS *a)
59
#include "hw/watchdog/cmsdk-apb-watchdog.h"
37
+{
60
#include "migration/vmstate.h"
38
+ TCGv_i32 rm;
61
39
+
62
@@ -XXX,XX +XXX,XX @@ static void cmsdk_apb_watchdog_init(Object *obj)
40
+ if (!dc_isar_feature(aa32_fp16_arith, s)) {
63
s, "cmsdk-apb-watchdog", 0x1000);
41
+ return false;
64
sysbus_init_mmio(sbd, &s->iomem);
42
+ }
65
sysbus_init_irq(sbd, &s->wdogint);
43
+
66
+ s->wdogclk = qdev_init_clock_in(DEVICE(s), "WDOGCLK", NULL, NULL);
44
+ if (s->vec_len != 0 || s->vec_stride != 0) {
67
45
+ return false;
68
s->is_luminary = false;
46
+ }
69
s->id = cmsdk_apb_watchdog_id;
47
+
70
@@ -XXX,XX +XXX,XX @@ static void cmsdk_apb_watchdog_realize(DeviceState *dev, Error **errp)
48
+ if (!vfp_access_check(s)) {
71
49
+ return true;
72
static const VMStateDescription cmsdk_apb_watchdog_vmstate = {
50
+ }
73
.name = "cmsdk-apb-watchdog",
51
+
74
- .version_id = 1,
52
+ /* Set Vd to high half of Vm */
75
- .minimum_version_id = 1,
53
+ rm = tcg_temp_new_i32();
76
+ .version_id = 2,
54
+ neon_load_reg32(rm, a->vm);
77
+ .minimum_version_id = 2,
55
+ tcg_gen_shri_i32(rm, rm, 16);
78
.fields = (VMStateField[]) {
56
+ neon_store_reg32(rm, a->vd);
79
+ VMSTATE_CLOCK(wdogclk, CMSDKAPBWatchdog),
57
+ tcg_temp_free_i32(rm);
80
VMSTATE_PTIMER(timer, CMSDKAPBWatchdog),
58
+ return true;
81
VMSTATE_UINT32(control, CMSDKAPBWatchdog),
59
+}
82
VMSTATE_UINT32(intstatus, CMSDKAPBWatchdog),
60
--
83
--
61
2.20.1
84
2.20.1
62
85
63
86
diff view generated by jsdifflib
1
Implement the fp16 versions of the VFP VCVT instruction forms
1
While we transition the ARMSSE code from integer properties
2
which convert between floating point and integer with a specified
2
specifying clock frequencies to Clock objects, we want to have the
3
rounding mode.
3
device provide both at once. We want the final name of the main
4
input Clock to be "MAINCLK", following the hardware name.
5
Unfortunately creating an input Clock with a name X creates an
6
under-the-hood QOM property X; for "MAINCLK" this clashes with the
7
existing UINT32 property of that name.
8
9
Rename the UINT32 property to MAINCLK_FRQ so it can coexist with the
10
MAINCLK Clock; once the transition is complete MAINCLK_FRQ will be
11
deleted.
12
13
Commit created with:
14
perl -p -i -e 's/MAINCLK/MAINCLK_FRQ/g' hw/arm/{armsse,mps2-tz,musca}.c include/hw/arm/armsse.h
4
15
5
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
16
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
6
Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
17
Reviewed-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
7
Message-id: 20200828183354.27913-17-peter.maydell@linaro.org
18
Reviewed-by: Luc Michel <luc@lmichel.fr>
19
Tested-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
20
Message-id: 20210128114145.20536-11-peter.maydell@linaro.org
21
Message-id: 20210121190622.22000-11-peter.maydell@linaro.org
8
---
22
---
9
target/arm/vfp-uncond.decode | 6 ++++--
23
include/hw/arm/armsse.h | 2 +-
10
target/arm/translate-vfp.c.inc | 32 ++++++++++++++++++++++++--------
24
hw/arm/armsse.c | 6 +++---
11
2 files changed, 28 insertions(+), 10 deletions(-)
25
hw/arm/mps2-tz.c | 2 +-
26
hw/arm/musca.c | 2 +-
27
4 files changed, 6 insertions(+), 6 deletions(-)
12
28
13
diff --git a/target/arm/vfp-uncond.decode b/target/arm/vfp-uncond.decode
29
diff --git a/include/hw/arm/armsse.h b/include/hw/arm/armsse.h
14
index XXXXXXX..XXXXXXX 100644
30
index XXXXXXX..XXXXXXX 100644
15
--- a/target/arm/vfp-uncond.decode
31
--- a/include/hw/arm/armsse.h
16
+++ b/target/arm/vfp-uncond.decode
32
+++ b/include/hw/arm/armsse.h
17
@@ -XXX,XX +XXX,XX @@ VRINT 1111 1110 1.11 10 rm:2 .... 1011 01.0 .... \
33
@@ -XXX,XX +XXX,XX @@
18
vm=%vm_dp vd=%vd_dp dp=1
34
* QEMU interface:
19
35
* + QOM property "memory" is a MemoryRegion containing the devices provided
20
# VCVT float to int with specified rounding mode; Vd is always single-precision
36
* by the board model.
21
+VCVT 1111 1110 1.11 11 rm:2 .... 1001 op:1 1.0 .... \
37
- * + QOM property "MAINCLK" is the frequency of the main system clock
22
+ vm=%vm_sp vd=%vd_sp sz=1
38
+ * + QOM property "MAINCLK_FRQ" is the frequency of the main system clock
23
VCVT 1111 1110 1.11 11 rm:2 .... 1010 op:1 1.0 .... \
39
* + QOM property "EXP_NUMIRQ" sets the number of expansion interrupts.
24
- vm=%vm_sp vd=%vd_sp dp=0
40
* (In hardware, the SSE-200 permits the number of expansion interrupts
25
+ vm=%vm_sp vd=%vd_sp sz=2
41
* for the two CPUs to be configured separately, but we restrict it to
26
VCVT 1111 1110 1.11 11 rm:2 .... 1011 op:1 1.0 .... \
42
diff --git a/hw/arm/armsse.c b/hw/arm/armsse.c
27
- vm=%vm_dp vd=%vd_sp dp=1
28
+ vm=%vm_dp vd=%vd_sp sz=3
29
diff --git a/target/arm/translate-vfp.c.inc b/target/arm/translate-vfp.c.inc
30
index XXXXXXX..XXXXXXX 100644
43
index XXXXXXX..XXXXXXX 100644
31
--- a/target/arm/translate-vfp.c.inc
44
--- a/hw/arm/armsse.c
32
+++ b/target/arm/translate-vfp.c.inc
45
+++ b/hw/arm/armsse.c
33
@@ -XXX,XX +XXX,XX @@ static bool trans_VRINT(DisasContext *s, arg_VRINT *a)
46
@@ -XXX,XX +XXX,XX @@ static Property iotkit_properties[] = {
34
static bool trans_VCVT(DisasContext *s, arg_VCVT *a)
47
DEFINE_PROP_LINK("memory", ARMSSE, board_memory, TYPE_MEMORY_REGION,
35
{
48
MemoryRegion *),
36
uint32_t rd, rm;
49
DEFINE_PROP_UINT32("EXP_NUMIRQ", ARMSSE, exp_numirq, 64),
37
- bool dp = a->dp;
50
- DEFINE_PROP_UINT32("MAINCLK", ARMSSE, mainclk_frq, 0),
38
+ int sz = a->sz;
51
+ DEFINE_PROP_UINT32("MAINCLK_FRQ", ARMSSE, mainclk_frq, 0),
39
TCGv_ptr fpst;
52
DEFINE_PROP_UINT32("SRAM_ADDR_WIDTH", ARMSSE, sram_addr_width, 15),
40
TCGv_i32 tcg_rmode, tcg_shift;
53
DEFINE_PROP_UINT32("init-svtor", ARMSSE, init_svtor, 0x10000000),
41
int rounding = fp_decode_rm[a->rm];
54
DEFINE_PROP_BOOL("CPU0_FPU", ARMSSE, cpu_fpu[0], true),
42
@@ -XXX,XX +XXX,XX @@ static bool trans_VCVT(DisasContext *s, arg_VCVT *a)
55
@@ -XXX,XX +XXX,XX @@ static Property armsse_properties[] = {
43
return false;
56
DEFINE_PROP_LINK("memory", ARMSSE, board_memory, TYPE_MEMORY_REGION,
57
MemoryRegion *),
58
DEFINE_PROP_UINT32("EXP_NUMIRQ", ARMSSE, exp_numirq, 64),
59
- DEFINE_PROP_UINT32("MAINCLK", ARMSSE, mainclk_frq, 0),
60
+ DEFINE_PROP_UINT32("MAINCLK_FRQ", ARMSSE, mainclk_frq, 0),
61
DEFINE_PROP_UINT32("SRAM_ADDR_WIDTH", ARMSSE, sram_addr_width, 15),
62
DEFINE_PROP_UINT32("init-svtor", ARMSSE, init_svtor, 0x10000000),
63
DEFINE_PROP_BOOL("CPU0_FPU", ARMSSE, cpu_fpu[0], false),
64
@@ -XXX,XX +XXX,XX @@ static void armsse_realize(DeviceState *dev, Error **errp)
44
}
65
}
45
66
46
- if (dp && !dc_isar_feature(aa32_fpdp_v2, s)) {
67
if (!s->mainclk_frq) {
47
+ if (sz == 3 && !dc_isar_feature(aa32_fpdp_v2, s)) {
68
- error_setg(errp, "MAINCLK property was not set");
48
+ return false;
69
+ error_setg(errp, "MAINCLK_FRQ property was not set");
49
+ }
70
return;
50
+
51
+ if (sz == 1 && !dc_isar_feature(aa32_fp16_arith, s)) {
52
return false;
53
}
71
}
54
72
55
/* UNDEF accesses to D16-D31 if they don't exist */
73
diff --git a/hw/arm/mps2-tz.c b/hw/arm/mps2-tz.c
56
- if (dp && !dc_isar_feature(aa32_simd_r32, s) && (a->vm & 0x10)) {
74
index XXXXXXX..XXXXXXX 100644
57
+ if (sz == 3 && !dc_isar_feature(aa32_simd_r32, s) && (a->vm & 0x10)) {
75
--- a/hw/arm/mps2-tz.c
58
return false;
76
+++ b/hw/arm/mps2-tz.c
59
}
77
@@ -XXX,XX +XXX,XX @@ static void mps2tz_common_init(MachineState *machine)
60
78
object_property_set_link(OBJECT(&mms->iotkit), "memory",
61
@@ -XXX,XX +XXX,XX @@ static bool trans_VCVT(DisasContext *s, arg_VCVT *a)
79
OBJECT(system_memory), &error_abort);
62
return true;
80
qdev_prop_set_uint32(iotkitdev, "EXP_NUMIRQ", MPS2TZ_NUMIRQ);
63
}
81
- qdev_prop_set_uint32(iotkitdev, "MAINCLK", SYSCLK_FRQ);
64
82
+ qdev_prop_set_uint32(iotkitdev, "MAINCLK_FRQ", SYSCLK_FRQ);
65
- fpst = fpstatus_ptr(FPST_FPCR);
83
sysbus_realize(SYS_BUS_DEVICE(&mms->iotkit), &error_fatal);
66
+ if (sz == 1) {
84
67
+ fpst = fpstatus_ptr(FPST_FPCR_F16);
85
/*
68
+ } else {
86
diff --git a/hw/arm/musca.c b/hw/arm/musca.c
69
+ fpst = fpstatus_ptr(FPST_FPCR);
87
index XXXXXXX..XXXXXXX 100644
70
+ }
88
--- a/hw/arm/musca.c
71
89
+++ b/hw/arm/musca.c
72
tcg_shift = tcg_const_i32(0);
90
@@ -XXX,XX +XXX,XX @@ static void musca_init(MachineState *machine)
73
91
qdev_prop_set_uint32(ssedev, "EXP_NUMIRQ", mmc->num_irqs);
74
tcg_rmode = tcg_const_i32(arm_rmode_to_sf(rounding));
92
qdev_prop_set_uint32(ssedev, "init-svtor", mmc->init_svtor);
75
gen_helper_set_rmode(tcg_rmode, tcg_rmode, fpst);
93
qdev_prop_set_uint32(ssedev, "SRAM_ADDR_WIDTH", mmc->sram_addr_width);
76
94
- qdev_prop_set_uint32(ssedev, "MAINCLK", SYSCLK_FRQ);
77
- if (dp) {
95
+ qdev_prop_set_uint32(ssedev, "MAINCLK_FRQ", SYSCLK_FRQ);
78
+ if (sz == 3) {
96
/*
79
TCGv_i64 tcg_double, tcg_res;
97
* Musca-A takes the default SSE-200 FPU/DSP settings (ie no for
80
TCGv_i32 tcg_tmp;
98
* CPU0 and yes for CPU1); Musca-B1 explicitly enables them for CPU0.
81
tcg_double = tcg_temp_new_i64();
82
@@ -XXX,XX +XXX,XX @@ static bool trans_VCVT(DisasContext *s, arg_VCVT *a)
83
tcg_single = tcg_temp_new_i32();
84
tcg_res = tcg_temp_new_i32();
85
neon_load_reg32(tcg_single, rm);
86
- if (is_signed) {
87
- gen_helper_vfp_tosls(tcg_res, tcg_single, tcg_shift, fpst);
88
+ if (sz == 1) {
89
+ if (is_signed) {
90
+ gen_helper_vfp_toslh(tcg_res, tcg_single, tcg_shift, fpst);
91
+ } else {
92
+ gen_helper_vfp_toulh(tcg_res, tcg_single, tcg_shift, fpst);
93
+ }
94
} else {
95
- gen_helper_vfp_touls(tcg_res, tcg_single, tcg_shift, fpst);
96
+ if (is_signed) {
97
+ gen_helper_vfp_tosls(tcg_res, tcg_single, tcg_shift, fpst);
98
+ } else {
99
+ gen_helper_vfp_touls(tcg_res, tcg_single, tcg_shift, fpst);
100
+ }
101
}
102
neon_store_reg32(tcg_res, rd);
103
tcg_temp_free_i32(tcg_res);
104
--
99
--
105
2.20.1
100
2.20.1
106
101
107
102
diff view generated by jsdifflib
1
The fp16 extension includes a new instruction VINS, which copies the
1
Create two input clocks on the ARMSSE devices, one for the normal
2
lower 16 bits of a 32-bit source VFP register into the upper 16 bits
2
MAINCLK, and one for the 32KHz S32KCLK, and wire these up to the
3
of the destination. Implement it.
3
appropriate devices. The old property-based clock frequency setting
4
will remain in place until conversion is complete.
5
6
This is a migration compatibility break for machines mps2-an505,
7
mps2-an521, musca-a, musca-b1.
4
8
5
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
9
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
6
Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
10
Reviewed-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
7
Message-id: 20200828183354.27913-20-peter.maydell@linaro.org
11
Reviewed-by: Luc Michel <luc@lmichel.fr>
12
Tested-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
13
Message-id: 20210128114145.20536-12-peter.maydell@linaro.org
14
Message-id: 20210121190622.22000-12-peter.maydell@linaro.org
8
---
15
---
9
target/arm/vfp-uncond.decode | 3 +++
16
include/hw/arm/armsse.h | 6 ++++++
10
target/arm/translate-vfp.c.inc | 28 ++++++++++++++++++++++++++++
17
hw/arm/armsse.c | 17 +++++++++++++++--
11
2 files changed, 31 insertions(+)
18
2 files changed, 21 insertions(+), 2 deletions(-)
12
19
13
diff --git a/target/arm/vfp-uncond.decode b/target/arm/vfp-uncond.decode
20
diff --git a/include/hw/arm/armsse.h b/include/hw/arm/armsse.h
14
index XXXXXXX..XXXXXXX 100644
21
index XXXXXXX..XXXXXXX 100644
15
--- a/target/arm/vfp-uncond.decode
22
--- a/include/hw/arm/armsse.h
16
+++ b/target/arm/vfp-uncond.decode
23
+++ b/include/hw/arm/armsse.h
17
@@ -XXX,XX +XXX,XX @@ VCVT 1111 1110 1.11 11 rm:2 .... 1010 op:1 1.0 .... \
24
@@ -XXX,XX +XXX,XX @@
18
vm=%vm_sp vd=%vd_sp sz=2
25
* per-CPU identity and control register blocks
19
VCVT 1111 1110 1.11 11 rm:2 .... 1011 op:1 1.0 .... \
26
*
20
vm=%vm_dp vd=%vd_sp sz=3
27
* QEMU interface:
28
+ * + Clock input "MAINCLK": clock for CPUs and most peripherals
29
+ * + Clock input "S32KCLK": slow 32KHz clock used for a few peripherals
30
* + QOM property "memory" is a MemoryRegion containing the devices provided
31
* by the board model.
32
* + QOM property "MAINCLK_FRQ" is the frequency of the main system clock
33
@@ -XXX,XX +XXX,XX @@
34
#include "hw/misc/armsse-mhu.h"
35
#include "hw/misc/unimp.h"
36
#include "hw/or-irq.h"
37
+#include "hw/clock.h"
38
#include "hw/core/split-irq.h"
39
#include "hw/cpu/cluster.h"
40
#include "qom/object.h"
41
@@ -XXX,XX +XXX,XX @@ struct ARMSSE {
42
43
uint32_t nsccfg;
44
45
+ Clock *mainclk;
46
+ Clock *s32kclk;
21
+
47
+
22
+VINS 1111 1110 1.11 0000 .... 1010 11 . 0 .... \
48
/* Properties */
23
+ vd=%vd_sp vm=%vm_sp
49
MemoryRegion *board_memory;
24
diff --git a/target/arm/translate-vfp.c.inc b/target/arm/translate-vfp.c.inc
50
uint32_t exp_numirq;
51
diff --git a/hw/arm/armsse.c b/hw/arm/armsse.c
25
index XXXXXXX..XXXXXXX 100644
52
index XXXXXXX..XXXXXXX 100644
26
--- a/target/arm/translate-vfp.c.inc
53
--- a/hw/arm/armsse.c
27
+++ b/target/arm/translate-vfp.c.inc
54
+++ b/hw/arm/armsse.c
28
@@ -XXX,XX +XXX,XX @@ static bool trans_NOCP(DisasContext *s, arg_NOCP *a)
55
@@ -XXX,XX +XXX,XX @@
29
56
#include "hw/arm/armsse.h"
30
return false;
57
#include "hw/arm/boot.h"
31
}
58
#include "hw/irq.h"
59
+#include "hw/qdev-clock.h"
60
61
/* Format of the System Information block SYS_CONFIG register */
62
typedef enum SysConfigFormat {
63
@@ -XXX,XX +XXX,XX @@ static void armsse_init(Object *obj)
64
assert(info->sram_banks <= MAX_SRAM_BANKS);
65
assert(info->num_cpus <= SSE_MAX_CPUS);
66
67
+ s->mainclk = qdev_init_clock_in(DEVICE(s), "MAINCLK", NULL, NULL);
68
+ s->s32kclk = qdev_init_clock_in(DEVICE(s), "S32KCLK", NULL, NULL);
32
+
69
+
33
+static bool trans_VINS(DisasContext *s, arg_VINS *a)
70
memory_region_init(&s->container, obj, "armsse-container", UINT64_MAX);
34
+{
71
35
+ TCGv_i32 rd, rm;
72
for (i = 0; i < info->num_cpus; i++) {
36
+
73
@@ -XXX,XX +XXX,XX @@ static void armsse_realize(DeviceState *dev, Error **errp)
37
+ if (!dc_isar_feature(aa32_fp16_arith, s)) {
74
* map its upstream ends to the right place in the container.
38
+ return false;
75
*/
39
+ }
76
qdev_prop_set_uint32(DEVICE(&s->timer0), "pclk-frq", s->mainclk_frq);
40
+
77
+ qdev_connect_clock_in(DEVICE(&s->timer0), "pclk", s->mainclk);
41
+ if (s->vec_len != 0 || s->vec_stride != 0) {
78
if (!sysbus_realize(SYS_BUS_DEVICE(&s->timer0), errp)) {
42
+ return false;
79
return;
43
+ }
80
}
44
+
81
@@ -XXX,XX +XXX,XX @@ static void armsse_realize(DeviceState *dev, Error **errp)
45
+ if (!vfp_access_check(s)) {
82
&error_abort);
46
+ return true;
83
47
+ }
84
qdev_prop_set_uint32(DEVICE(&s->timer1), "pclk-frq", s->mainclk_frq);
48
+
85
+ qdev_connect_clock_in(DEVICE(&s->timer1), "pclk", s->mainclk);
49
+ /* Insert low half of Vm into high half of Vd */
86
if (!sysbus_realize(SYS_BUS_DEVICE(&s->timer1), errp)) {
50
+ rm = tcg_temp_new_i32();
87
return;
51
+ rd = tcg_temp_new_i32();
88
}
52
+ neon_load_reg32(rm, a->vm);
89
@@ -XXX,XX +XXX,XX @@ static void armsse_realize(DeviceState *dev, Error **errp)
53
+ neon_load_reg32(rd, a->vd);
90
&error_abort);
54
+ tcg_gen_deposit_i32(rd, rd, rm, 16, 16);
91
55
+ neon_store_reg32(rd, a->vd);
92
qdev_prop_set_uint32(DEVICE(&s->dualtimer), "pclk-frq", s->mainclk_frq);
56
+ tcg_temp_free_i32(rm);
93
+ qdev_connect_clock_in(DEVICE(&s->dualtimer), "TIMCLK", s->mainclk);
57
+ tcg_temp_free_i32(rd);
94
if (!sysbus_realize(SYS_BUS_DEVICE(&s->dualtimer), errp)) {
58
+ return true;
95
return;
59
+}
96
}
97
@@ -XXX,XX +XXX,XX @@ static void armsse_realize(DeviceState *dev, Error **errp)
98
* 0x4002f000: S32K timer
99
*/
100
qdev_prop_set_uint32(DEVICE(&s->s32ktimer), "pclk-frq", S32KCLK);
101
+ qdev_connect_clock_in(DEVICE(&s->s32ktimer), "pclk", s->s32kclk);
102
if (!sysbus_realize(SYS_BUS_DEVICE(&s->s32ktimer), errp)) {
103
return;
104
}
105
@@ -XXX,XX +XXX,XX @@ static void armsse_realize(DeviceState *dev, Error **errp)
106
qdev_get_gpio_in_named(DEVICE(&s->armv7m), "NMI", 0));
107
108
qdev_prop_set_uint32(DEVICE(&s->s32kwatchdog), "wdogclk-frq", S32KCLK);
109
+ qdev_connect_clock_in(DEVICE(&s->s32kwatchdog), "WDOGCLK", s->s32kclk);
110
if (!sysbus_realize(SYS_BUS_DEVICE(&s->s32kwatchdog), errp)) {
111
return;
112
}
113
@@ -XXX,XX +XXX,XX @@ static void armsse_realize(DeviceState *dev, Error **errp)
114
/* 0x40080000 .. 0x4008ffff : ARMSSE second Base peripheral region */
115
116
qdev_prop_set_uint32(DEVICE(&s->nswatchdog), "wdogclk-frq", s->mainclk_frq);
117
+ qdev_connect_clock_in(DEVICE(&s->nswatchdog), "WDOGCLK", s->mainclk);
118
if (!sysbus_realize(SYS_BUS_DEVICE(&s->nswatchdog), errp)) {
119
return;
120
}
121
@@ -XXX,XX +XXX,XX @@ static void armsse_realize(DeviceState *dev, Error **errp)
122
sysbus_mmio_map(SYS_BUS_DEVICE(&s->nswatchdog), 0, 0x40081000);
123
124
qdev_prop_set_uint32(DEVICE(&s->swatchdog), "wdogclk-frq", s->mainclk_frq);
125
+ qdev_connect_clock_in(DEVICE(&s->swatchdog), "WDOGCLK", s->mainclk);
126
if (!sysbus_realize(SYS_BUS_DEVICE(&s->swatchdog), errp)) {
127
return;
128
}
129
@@ -XXX,XX +XXX,XX @@ static void armsse_idau_check(IDAUInterface *ii, uint32_t address,
130
131
static const VMStateDescription armsse_vmstate = {
132
.name = "iotkit",
133
- .version_id = 1,
134
- .minimum_version_id = 1,
135
+ .version_id = 2,
136
+ .minimum_version_id = 2,
137
.fields = (VMStateField[]) {
138
+ VMSTATE_CLOCK(mainclk, ARMSSE),
139
+ VMSTATE_CLOCK(s32kclk, ARMSSE),
140
VMSTATE_UINT32(nsccfg, ARMSSE),
141
VMSTATE_END_OF_LIST()
142
}
60
--
143
--
61
2.20.1
144
2.20.1
62
145
63
146
diff view generated by jsdifflib
1
Implement the fp16 versions of the VFP VSEL instruction.
1
The old-style convenience function cmsdk_apb_timer_create() for
2
creating CMSDK_APB_TIMER objects is used in only two places in
3
mps2.c. Most of the rest of the code in that file uses the new
4
"initialize in place" coding style.
5
6
We want to connect up a Clock object which should be done between the
7
object creation and realization; rather than adding a Clock* argument
8
to the convenience function, convert the timer creation code in
9
mps2.c to the same style as is used already for the watchdog,
10
dualtimer and other devices, and delete the now-unused convenience
11
function.
2
12
3
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
13
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
4
Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
14
Reviewed-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
5
Message-id: 20200828183354.27913-18-peter.maydell@linaro.org
15
Reviewed-by: Luc Michel <luc@lmichel.fr>
16
Tested-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
17
Message-id: 20210128114145.20536-13-peter.maydell@linaro.org
18
Message-id: 20210121190622.22000-13-peter.maydell@linaro.org
6
---
19
---
7
target/arm/vfp-uncond.decode | 6 ++++--
20
include/hw/timer/cmsdk-apb-timer.h | 21 ---------------------
8
target/arm/translate-vfp.c.inc | 16 ++++++++++++----
21
hw/arm/mps2.c | 18 ++++++++++++++++--
9
2 files changed, 16 insertions(+), 6 deletions(-)
22
2 files changed, 16 insertions(+), 23 deletions(-)
10
23
11
diff --git a/target/arm/vfp-uncond.decode b/target/arm/vfp-uncond.decode
24
diff --git a/include/hw/timer/cmsdk-apb-timer.h b/include/hw/timer/cmsdk-apb-timer.h
12
index XXXXXXX..XXXXXXX 100644
25
index XXXXXXX..XXXXXXX 100644
13
--- a/target/arm/vfp-uncond.decode
26
--- a/include/hw/timer/cmsdk-apb-timer.h
14
+++ b/target/arm/vfp-uncond.decode
27
+++ b/include/hw/timer/cmsdk-apb-timer.h
15
@@ -XXX,XX +XXX,XX @@
28
@@ -XXX,XX +XXX,XX @@ struct CMSDKAPBTimer {
16
@vfp_dnm_s ................................ vm=%vm_sp vn=%vn_sp vd=%vd_sp
29
uint32_t intstatus;
17
@vfp_dnm_d ................................ vm=%vm_dp vn=%vn_dp vd=%vd_dp
30
};
18
31
19
+VSEL 1111 1110 0. cc:2 .... .... 1001 .0.0 .... \
32
-/**
20
+ vm=%vm_sp vn=%vn_sp vd=%vd_sp sz=1
33
- * cmsdk_apb_timer_create - convenience function to create TYPE_CMSDK_APB_TIMER
21
VSEL 1111 1110 0. cc:2 .... .... 1010 .0.0 .... \
34
- * @addr: location in system memory to map registers
22
- vm=%vm_sp vn=%vn_sp vd=%vd_sp dp=0
35
- * @pclk_frq: frequency in Hz of the PCLK clock (used for calculating baud rate)
23
+ vm=%vm_sp vn=%vn_sp vd=%vd_sp sz=2
36
- */
24
VSEL 1111 1110 0. cc:2 .... .... 1011 .0.0 .... \
37
-static inline DeviceState *cmsdk_apb_timer_create(hwaddr addr,
25
- vm=%vm_dp vn=%vn_dp vd=%vd_dp dp=1
38
- qemu_irq timerint,
26
+ vm=%vm_dp vn=%vn_dp vd=%vd_dp sz=3
39
- uint32_t pclk_frq)
27
40
-{
28
VMAXNM_hp 1111 1110 1.00 .... .... 1001 .0.0 .... @vfp_dnm_s
41
- DeviceState *dev;
29
VMINNM_hp 1111 1110 1.00 .... .... 1001 .1.0 .... @vfp_dnm_s
42
- SysBusDevice *s;
30
diff --git a/target/arm/translate-vfp.c.inc b/target/arm/translate-vfp.c.inc
43
-
44
- dev = qdev_new(TYPE_CMSDK_APB_TIMER);
45
- s = SYS_BUS_DEVICE(dev);
46
- qdev_prop_set_uint32(dev, "pclk-frq", pclk_frq);
47
- sysbus_realize_and_unref(s, &error_fatal);
48
- sysbus_mmio_map(s, 0, addr);
49
- sysbus_connect_irq(s, 0, timerint);
50
- return dev;
51
-}
52
-
53
#endif
54
diff --git a/hw/arm/mps2.c b/hw/arm/mps2.c
31
index XXXXXXX..XXXXXXX 100644
55
index XXXXXXX..XXXXXXX 100644
32
--- a/target/arm/translate-vfp.c.inc
56
--- a/hw/arm/mps2.c
33
+++ b/target/arm/translate-vfp.c.inc
57
+++ b/hw/arm/mps2.c
34
@@ -XXX,XX +XXX,XX @@ static bool vfp_access_check(DisasContext *s)
58
@@ -XXX,XX +XXX,XX @@ struct MPS2MachineState {
35
static bool trans_VSEL(DisasContext *s, arg_VSEL *a)
59
/* CMSDK APB subsystem */
36
{
60
CMSDKAPBDualTimer dualtimer;
37
uint32_t rd, rn, rm;
61
CMSDKAPBWatchdog watchdog;
38
- bool dp = a->dp;
62
+ CMSDKAPBTimer timer[2];
39
+ int sz = a->sz;
63
};
40
64
41
if (!dc_isar_feature(aa32_vsel, s)) {
65
#define TYPE_MPS2_MACHINE "mps2"
42
return false;
66
@@ -XXX,XX +XXX,XX @@ static void mps2_common_init(MachineState *machine)
43
}
67
}
44
68
45
- if (dp && !dc_isar_feature(aa32_fpdp_v2, s)) {
69
/* CMSDK APB subsystem */
46
+ if (sz == 3 && !dc_isar_feature(aa32_fpdp_v2, s)) {
70
- cmsdk_apb_timer_create(0x40000000, qdev_get_gpio_in(armv7m, 8), SYSCLK_FRQ);
47
+ return false;
71
- cmsdk_apb_timer_create(0x40001000, qdev_get_gpio_in(armv7m, 9), SYSCLK_FRQ);
72
+ for (i = 0; i < ARRAY_SIZE(mms->timer); i++) {
73
+ g_autofree char *name = g_strdup_printf("timer%d", i);
74
+ hwaddr base = 0x40000000 + i * 0x1000;
75
+ int irqno = 8 + i;
76
+ SysBusDevice *sbd;
77
+
78
+ object_initialize_child(OBJECT(mms), name, &mms->timer[i],
79
+ TYPE_CMSDK_APB_TIMER);
80
+ sbd = SYS_BUS_DEVICE(&mms->timer[i]);
81
+ qdev_prop_set_uint32(DEVICE(&mms->timer[i]), "pclk-frq", SYSCLK_FRQ);
82
+ sysbus_realize_and_unref(sbd, &error_fatal);
83
+ sysbus_mmio_map(sbd, 0, base);
84
+ sysbus_connect_irq(sbd, 0, qdev_get_gpio_in(armv7m, irqno));
48
+ }
85
+ }
49
+
86
+
50
+ if (sz == 1 && !dc_isar_feature(aa32_fp16_arith, s)) {
87
object_initialize_child(OBJECT(mms), "dualtimer", &mms->dualtimer,
51
return false;
88
TYPE_CMSDK_APB_DUALTIMER);
52
}
89
qdev_prop_set_uint32(DEVICE(&mms->dualtimer), "pclk-frq", SYSCLK_FRQ);
53
54
/* UNDEF accesses to D16-D31 if they don't exist */
55
- if (dp && !dc_isar_feature(aa32_simd_r32, s) &&
56
+ if (sz == 3 && !dc_isar_feature(aa32_simd_r32, s) &&
57
((a->vm | a->vn | a->vd) & 0x10)) {
58
return false;
59
}
60
@@ -XXX,XX +XXX,XX @@ static bool trans_VSEL(DisasContext *s, arg_VSEL *a)
61
return true;
62
}
63
64
- if (dp) {
65
+ if (sz == 3) {
66
TCGv_i64 frn, frm, dest;
67
TCGv_i64 tmp, zero, zf, nf, vf;
68
69
@@ -XXX,XX +XXX,XX @@ static bool trans_VSEL(DisasContext *s, arg_VSEL *a)
70
tcg_temp_free_i32(tmp);
71
break;
72
}
73
+ /* For fp16 the top half is always zeroes */
74
+ if (sz == 1) {
75
+ tcg_gen_andi_i32(dest, dest, 0xffff);
76
+ }
77
neon_store_reg32(dest, rd);
78
tcg_temp_free_i32(frn);
79
tcg_temp_free_i32(frm);
80
--
90
--
81
2.20.1
91
2.20.1
82
92
83
93
diff view generated by jsdifflib
1
We already have gvec helpers for floating point VRECPE and
1
Create a fixed-frequency Clock object to be the SYSCLK, and wire it
2
VRQSRTE, so convert the Neon decoder to use them and
2
up to the devices that require it.
3
add the fp16 support.
4
3
5
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
4
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
6
Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
5
Reviewed-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
7
Message-id: 20200828183354.27913-25-peter.maydell@linaro.org
6
Reviewed-by: Luc Michel <luc@lmichel.fr>
7
Tested-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
8
Message-id: 20210128114145.20536-14-peter.maydell@linaro.org
9
Message-id: 20210121190622.22000-14-peter.maydell@linaro.org
8
---
10
---
9
target/arm/translate-neon.c.inc | 31 +++++++++++++++++++++++++++++--
11
hw/arm/mps2.c | 9 +++++++++
10
1 file changed, 29 insertions(+), 2 deletions(-)
12
1 file changed, 9 insertions(+)
11
13
12
diff --git a/target/arm/translate-neon.c.inc b/target/arm/translate-neon.c.inc
14
diff --git a/hw/arm/mps2.c b/hw/arm/mps2.c
13
index XXXXXXX..XXXXXXX 100644
15
index XXXXXXX..XXXXXXX 100644
14
--- a/target/arm/translate-neon.c.inc
16
--- a/hw/arm/mps2.c
15
+++ b/target/arm/translate-neon.c.inc
17
+++ b/hw/arm/mps2.c
16
@@ -XXX,XX +XXX,XX @@ static bool do_2misc_fp(DisasContext *s, arg_2misc *a,
18
@@ -XXX,XX +XXX,XX @@
17
return do_2misc_fp(s, a, FUNC); \
19
#include "hw/net/lan9118.h"
20
#include "net/net.h"
21
#include "hw/watchdog/cmsdk-apb-watchdog.h"
22
+#include "hw/qdev-clock.h"
23
#include "qom/object.h"
24
25
typedef enum MPS2FPGAType {
26
@@ -XXX,XX +XXX,XX @@ struct MPS2MachineState {
27
CMSDKAPBDualTimer dualtimer;
28
CMSDKAPBWatchdog watchdog;
29
CMSDKAPBTimer timer[2];
30
+ Clock *sysclk;
31
};
32
33
#define TYPE_MPS2_MACHINE "mps2"
34
@@ -XXX,XX +XXX,XX @@ static void mps2_common_init(MachineState *machine)
35
exit(EXIT_FAILURE);
18
}
36
}
19
37
20
-DO_2MISC_FP(VRECPE_F, gen_helper_recpe_f32)
38
+ /* This clock doesn't need migration because it is fixed-frequency */
21
-DO_2MISC_FP(VRSQRTE_F, gen_helper_rsqrte_f32)
39
+ mms->sysclk = clock_new(OBJECT(machine), "SYSCLK");
22
DO_2MISC_FP(VCVT_FS, gen_helper_vfp_sitos)
40
+ clock_set_hz(mms->sysclk, SYSCLK_FRQ);
23
DO_2MISC_FP(VCVT_FU, gen_helper_vfp_uitos)
24
DO_2MISC_FP(VCVT_SF, gen_helper_vfp_tosizs)
25
DO_2MISC_FP(VCVT_UF, gen_helper_vfp_touizs)
26
27
+#define DO_2MISC_FP_VEC(INSN, HFUNC, SFUNC) \
28
+ static void gen_##INSN(unsigned vece, uint32_t rd_ofs, \
29
+ uint32_t rm_ofs, \
30
+ uint32_t oprsz, uint32_t maxsz) \
31
+ { \
32
+ static gen_helper_gvec_2_ptr * const fns[4] = { \
33
+ NULL, HFUNC, SFUNC, NULL, \
34
+ }; \
35
+ TCGv_ptr fpst; \
36
+ fpst = fpstatus_ptr(vece == MO_16 ? FPST_STD_F16 : FPST_STD); \
37
+ tcg_gen_gvec_2_ptr(rd_ofs, rm_ofs, fpst, oprsz, maxsz, 0, \
38
+ fns[vece]); \
39
+ tcg_temp_free_ptr(fpst); \
40
+ } \
41
+ static bool trans_##INSN(DisasContext *s, arg_2misc *a) \
42
+ { \
43
+ if (a->size == MO_16) { \
44
+ if (!dc_isar_feature(aa32_fp16_arith, s)) { \
45
+ return false; \
46
+ } \
47
+ } else if (a->size != MO_32) { \
48
+ return false; \
49
+ } \
50
+ return do_2misc_vec(s, a, gen_##INSN); \
51
+ }
52
+
41
+
53
+DO_2MISC_FP_VEC(VRECPE_F, gen_helper_gvec_frecpe_h, gen_helper_gvec_frecpe_s)
42
/* The FPGA images have an odd combination of different RAMs,
54
+DO_2MISC_FP_VEC(VRSQRTE_F, gen_helper_gvec_frsqrte_h, gen_helper_gvec_frsqrte_s)
43
* because in hardware they are different implementations and
55
+
44
* connected to different buses, giving varying performance/size
56
static bool trans_VRINTX(DisasContext *s, arg_2misc *a)
45
@@ -XXX,XX +XXX,XX @@ static void mps2_common_init(MachineState *machine)
57
{
46
TYPE_CMSDK_APB_TIMER);
58
if (!arm_dc_feature(s, ARM_FEATURE_V8)) {
47
sbd = SYS_BUS_DEVICE(&mms->timer[i]);
48
qdev_prop_set_uint32(DEVICE(&mms->timer[i]), "pclk-frq", SYSCLK_FRQ);
49
+ qdev_connect_clock_in(DEVICE(&mms->timer[i]), "pclk", mms->sysclk);
50
sysbus_realize_and_unref(sbd, &error_fatal);
51
sysbus_mmio_map(sbd, 0, base);
52
sysbus_connect_irq(sbd, 0, qdev_get_gpio_in(armv7m, irqno));
53
@@ -XXX,XX +XXX,XX @@ static void mps2_common_init(MachineState *machine)
54
object_initialize_child(OBJECT(mms), "dualtimer", &mms->dualtimer,
55
TYPE_CMSDK_APB_DUALTIMER);
56
qdev_prop_set_uint32(DEVICE(&mms->dualtimer), "pclk-frq", SYSCLK_FRQ);
57
+ qdev_connect_clock_in(DEVICE(&mms->dualtimer), "TIMCLK", mms->sysclk);
58
sysbus_realize(SYS_BUS_DEVICE(&mms->dualtimer), &error_fatal);
59
sysbus_connect_irq(SYS_BUS_DEVICE(&mms->dualtimer), 0,
60
qdev_get_gpio_in(armv7m, 10));
61
@@ -XXX,XX +XXX,XX @@ static void mps2_common_init(MachineState *machine)
62
object_initialize_child(OBJECT(mms), "watchdog", &mms->watchdog,
63
TYPE_CMSDK_APB_WATCHDOG);
64
qdev_prop_set_uint32(DEVICE(&mms->watchdog), "wdogclk-frq", SYSCLK_FRQ);
65
+ qdev_connect_clock_in(DEVICE(&mms->watchdog), "WDOGCLK", mms->sysclk);
66
sysbus_realize(SYS_BUS_DEVICE(&mms->watchdog), &error_fatal);
67
sysbus_connect_irq(SYS_BUS_DEVICE(&mms->watchdog), 0,
68
qdev_get_gpio_in_named(armv7m, "NMI", 0));
59
--
69
--
60
2.20.1
70
2.20.1
61
71
62
72
diff view generated by jsdifflib
1
Convert the neon floating-point vector compare-vs-0 insns VCEQ0,
1
Create and connect the two clocks needed by the ARMSSE.
2
VCGT0, VCLE0, VCGE0 and VCLT0 to use a gvec helper, and use this to
3
implement the fp16 case.
4
2
5
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
3
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
6
Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
4
Reviewed-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
7
Message-id: 20200828183354.27913-33-peter.maydell@linaro.org
5
Reviewed-by: Luc Michel <luc@lmichel.fr>
6
Tested-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
7
Message-id: 20210128114145.20536-15-peter.maydell@linaro.org
8
Message-id: 20210121190622.22000-15-peter.maydell@linaro.org
8
---
9
---
9
target/arm/helper.h | 15 +++++++++++++++
10
hw/arm/mps2-tz.c | 13 +++++++++++++
10
target/arm/vec_helper.c | 25 +++++++++++++++++++++++++
11
1 file changed, 13 insertions(+)
11
target/arm/translate-neon.c.inc | 33 +++++----------------------------
12
3 files changed, 45 insertions(+), 28 deletions(-)
13
12
14
diff --git a/target/arm/helper.h b/target/arm/helper.h
13
diff --git a/hw/arm/mps2-tz.c b/hw/arm/mps2-tz.c
15
index XXXXXXX..XXXXXXX 100644
14
index XXXXXXX..XXXXXXX 100644
16
--- a/target/arm/helper.h
15
--- a/hw/arm/mps2-tz.c
17
+++ b/target/arm/helper.h
16
+++ b/hw/arm/mps2-tz.c
18
@@ -XXX,XX +XXX,XX @@ DEF_HELPER_FLAGS_4(gvec_frsqrte_h, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, i32)
17
@@ -XXX,XX +XXX,XX @@
19
DEF_HELPER_FLAGS_4(gvec_frsqrte_s, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, i32)
18
#include "hw/net/lan9118.h"
20
DEF_HELPER_FLAGS_4(gvec_frsqrte_d, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, i32)
19
#include "net/net.h"
21
20
#include "hw/core/split-irq.h"
22
+DEF_HELPER_FLAGS_4(gvec_fcgt0_h, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, i32)
21
+#include "hw/qdev-clock.h"
23
+DEF_HELPER_FLAGS_4(gvec_fcgt0_s, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, i32)
22
#include "qom/object.h"
23
24
#define MPS2TZ_NUMIRQ 92
25
@@ -XXX,XX +XXX,XX @@ struct MPS2TZMachineState {
26
qemu_or_irq uart_irq_orgate;
27
DeviceState *lan9118;
28
SplitIRQ cpu_irq_splitter[MPS2TZ_NUMIRQ];
29
+ Clock *sysclk;
30
+ Clock *s32kclk;
31
};
32
33
#define TYPE_MPS2TZ_MACHINE "mps2tz"
34
@@ -XXX,XX +XXX,XX @@ OBJECT_DECLARE_TYPE(MPS2TZMachineState, MPS2TZMachineClass, MPS2TZ_MACHINE)
35
36
/* Main SYSCLK frequency in Hz */
37
#define SYSCLK_FRQ 20000000
38
+/* Slow 32Khz S32KCLK frequency in Hz */
39
+#define S32KCLK_FRQ (32 * 1000)
40
41
/* Create an alias of an entire original MemoryRegion @orig
42
* located at @base in the memory map.
43
@@ -XXX,XX +XXX,XX @@ static void mps2tz_common_init(MachineState *machine)
44
exit(EXIT_FAILURE);
45
}
46
47
+ /* These clocks don't need migration because they are fixed-frequency */
48
+ mms->sysclk = clock_new(OBJECT(machine), "SYSCLK");
49
+ clock_set_hz(mms->sysclk, SYSCLK_FRQ);
50
+ mms->s32kclk = clock_new(OBJECT(machine), "S32KCLK");
51
+ clock_set_hz(mms->s32kclk, S32KCLK_FRQ);
24
+
52
+
25
+DEF_HELPER_FLAGS_4(gvec_fcge0_h, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, i32)
53
object_initialize_child(OBJECT(machine), TYPE_IOTKIT, &mms->iotkit,
26
+DEF_HELPER_FLAGS_4(gvec_fcge0_s, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, i32)
54
mmc->armsse_type);
27
+
55
iotkitdev = DEVICE(&mms->iotkit);
28
+DEF_HELPER_FLAGS_4(gvec_fceq0_h, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, i32)
56
@@ -XXX,XX +XXX,XX @@ static void mps2tz_common_init(MachineState *machine)
29
+DEF_HELPER_FLAGS_4(gvec_fceq0_s, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, i32)
57
OBJECT(system_memory), &error_abort);
30
+
58
qdev_prop_set_uint32(iotkitdev, "EXP_NUMIRQ", MPS2TZ_NUMIRQ);
31
+DEF_HELPER_FLAGS_4(gvec_fcle0_h, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, i32)
59
qdev_prop_set_uint32(iotkitdev, "MAINCLK_FRQ", SYSCLK_FRQ);
32
+DEF_HELPER_FLAGS_4(gvec_fcle0_s, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, i32)
60
+ qdev_connect_clock_in(iotkitdev, "MAINCLK", mms->sysclk);
33
+
61
+ qdev_connect_clock_in(iotkitdev, "S32KCLK", mms->s32kclk);
34
+DEF_HELPER_FLAGS_4(gvec_fclt0_h, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, i32)
62
sysbus_realize(SYS_BUS_DEVICE(&mms->iotkit), &error_fatal);
35
+DEF_HELPER_FLAGS_4(gvec_fclt0_s, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, i32)
63
36
+
37
DEF_HELPER_FLAGS_5(gvec_fadd_h, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, ptr, i32)
38
DEF_HELPER_FLAGS_5(gvec_fadd_s, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, ptr, i32)
39
DEF_HELPER_FLAGS_5(gvec_fadd_d, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, ptr, i32)
40
diff --git a/target/arm/vec_helper.c b/target/arm/vec_helper.c
41
index XXXXXXX..XXXXXXX 100644
42
--- a/target/arm/vec_helper.c
43
+++ b/target/arm/vec_helper.c
44
@@ -XXX,XX +XXX,XX @@ DO_2OP(gvec_frsqrte_h, helper_rsqrte_f16, float16)
45
DO_2OP(gvec_frsqrte_s, helper_rsqrte_f32, float32)
46
DO_2OP(gvec_frsqrte_d, helper_rsqrte_f64, float64)
47
48
+#define WRAP_CMP0_FWD(FN, CMPOP, TYPE) \
49
+ static TYPE TYPE##_##FN##0(TYPE op, float_status *stat) \
50
+ { \
51
+ return TYPE##_##CMPOP(op, TYPE##_zero, stat); \
52
+ }
53
+
54
+#define WRAP_CMP0_REV(FN, CMPOP, TYPE) \
55
+ static TYPE TYPE##_##FN##0(TYPE op, float_status *stat) \
56
+ { \
57
+ return TYPE##_##CMPOP(TYPE##_zero, op, stat); \
58
+ }
59
+
60
+#define DO_2OP_CMP0(FN, CMPOP, DIRN) \
61
+ WRAP_CMP0_##DIRN(FN, CMPOP, float16) \
62
+ WRAP_CMP0_##DIRN(FN, CMPOP, float32) \
63
+ DO_2OP(gvec_f##FN##0_h, float16_##FN##0, float16) \
64
+ DO_2OP(gvec_f##FN##0_s, float32_##FN##0, float32)
65
+
66
+DO_2OP_CMP0(cgt, cgt, FWD)
67
+DO_2OP_CMP0(cge, cge, FWD)
68
+DO_2OP_CMP0(ceq, ceq, FWD)
69
+DO_2OP_CMP0(clt, cgt, REV)
70
+DO_2OP_CMP0(cle, cge, REV)
71
+
72
#undef DO_2OP
73
+#undef DO_2OP_CMP0
74
75
/* Floating-point trigonometric starting value.
76
* See the ARM ARM pseudocode function FPTrigSMul.
77
diff --git a/target/arm/translate-neon.c.inc b/target/arm/translate-neon.c.inc
78
index XXXXXXX..XXXXXXX 100644
79
--- a/target/arm/translate-neon.c.inc
80
+++ b/target/arm/translate-neon.c.inc
81
@@ -XXX,XX +XXX,XX @@ DO_2MISC_FP(VCVT_UF, gen_helper_vfp_touizs)
82
83
DO_2MISC_FP_VEC(VRECPE_F, gen_helper_gvec_frecpe_h, gen_helper_gvec_frecpe_s)
84
DO_2MISC_FP_VEC(VRSQRTE_F, gen_helper_gvec_frsqrte_h, gen_helper_gvec_frsqrte_s)
85
+DO_2MISC_FP_VEC(VCGT0_F, gen_helper_gvec_fcgt0_h, gen_helper_gvec_fcgt0_s)
86
+DO_2MISC_FP_VEC(VCGE0_F, gen_helper_gvec_fcge0_h, gen_helper_gvec_fcge0_s)
87
+DO_2MISC_FP_VEC(VCEQ0_F, gen_helper_gvec_fceq0_h, gen_helper_gvec_fceq0_s)
88
+DO_2MISC_FP_VEC(VCLT0_F, gen_helper_gvec_fclt0_h, gen_helper_gvec_fclt0_s)
89
+DO_2MISC_FP_VEC(VCLE0_F, gen_helper_gvec_fcle0_h, gen_helper_gvec_fcle0_s)
90
91
static bool trans_VRINTX(DisasContext *s, arg_2misc *a)
92
{
93
@@ -XXX,XX +XXX,XX @@ static bool trans_VRINTX(DisasContext *s, arg_2misc *a)
94
return do_2misc_fp(s, a, gen_helper_rints_exact);
95
}
96
97
-#define WRAP_FP_CMP0_FWD(WRAPNAME, FUNC) \
98
- static void WRAPNAME(TCGv_i32 d, TCGv_i32 m, TCGv_ptr fpst) \
99
- { \
100
- TCGv_i32 zero = tcg_const_i32(0); \
101
- FUNC(d, m, zero, fpst); \
102
- tcg_temp_free_i32(zero); \
103
- }
104
-#define WRAP_FP_CMP0_REV(WRAPNAME, FUNC) \
105
- static void WRAPNAME(TCGv_i32 d, TCGv_i32 m, TCGv_ptr fpst) \
106
- { \
107
- TCGv_i32 zero = tcg_const_i32(0); \
108
- FUNC(d, zero, m, fpst); \
109
- tcg_temp_free_i32(zero); \
110
- }
111
-
112
-#define DO_FP_CMP0(INSN, FUNC, REV) \
113
- WRAP_FP_CMP0_##REV(gen_##INSN, FUNC) \
114
- static bool trans_##INSN(DisasContext *s, arg_2misc *a) \
115
- { \
116
- return do_2misc_fp(s, a, gen_##INSN); \
117
- }
118
-
119
-DO_FP_CMP0(VCGT0_F, gen_helper_neon_cgt_f32, FWD)
120
-DO_FP_CMP0(VCGE0_F, gen_helper_neon_cge_f32, FWD)
121
-DO_FP_CMP0(VCEQ0_F, gen_helper_neon_ceq_f32, FWD)
122
-DO_FP_CMP0(VCLE0_F, gen_helper_neon_cge_f32, REV)
123
-DO_FP_CMP0(VCLT0_F, gen_helper_neon_cgt_f32, REV)
124
-
125
static bool do_vrint(DisasContext *s, arg_2misc *a, int rmode)
126
{
127
/*
64
/*
128
--
65
--
129
2.20.1
66
2.20.1
130
67
131
68
diff view generated by jsdifflib
1
Now the VFP_CONV_FIX macros can handle fp16's distinction between the
1
Create and connect the two clocks needed by the ARMSSE.
2
width of the operation and the width of the type used to pass operands,
3
use the macros rather than the open-coded functions.
4
5
This creates an extra six helper functions, all of which we are going
6
to need for the AArch32 VFP fp16 instructions.
7
2
8
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
3
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
9
Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
4
Reviewed-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
10
Message-id: 20200828183354.27913-15-peter.maydell@linaro.org
5
Reviewed-by: Luc Michel <luc@lmichel.fr>
6
Tested-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
7
Message-id: 20210128114145.20536-16-peter.maydell@linaro.org
8
Message-id: 20210121190622.22000-16-peter.maydell@linaro.org
11
---
9
---
12
target/arm/helper.h | 6 +++
10
hw/arm/musca.c | 12 ++++++++++++
13
target/arm/vfp_helper.c | 86 +++--------------------------------------
11
1 file changed, 12 insertions(+)
14
2 files changed, 12 insertions(+), 80 deletions(-)
15
12
16
diff --git a/target/arm/helper.h b/target/arm/helper.h
13
diff --git a/hw/arm/musca.c b/hw/arm/musca.c
17
index XXXXXXX..XXXXXXX 100644
14
index XXXXXXX..XXXXXXX 100644
18
--- a/target/arm/helper.h
15
--- a/hw/arm/musca.c
19
+++ b/target/arm/helper.h
16
+++ b/hw/arm/musca.c
20
@@ -XXX,XX +XXX,XX @@ DEF_HELPER_2(vfp_tosizh, s32, f16, ptr)
17
@@ -XXX,XX +XXX,XX @@
21
DEF_HELPER_2(vfp_tosizs, s32, f32, ptr)
18
#include "hw/misc/tz-ppc.h"
22
DEF_HELPER_2(vfp_tosizd, s32, f64, ptr)
19
#include "hw/misc/unimp.h"
23
20
#include "hw/rtc/pl031.h"
24
+DEF_HELPER_3(vfp_toshh_round_to_zero, i32, f16, i32, ptr)
21
+#include "hw/qdev-clock.h"
25
+DEF_HELPER_3(vfp_toslh_round_to_zero, i32, f16, i32, ptr)
22
#include "qom/object.h"
26
+DEF_HELPER_3(vfp_touhh_round_to_zero, i32, f16, i32, ptr)
23
27
+DEF_HELPER_3(vfp_toulh_round_to_zero, i32, f16, i32, ptr)
24
#define MUSCA_NUMIRQ_MAX 96
28
DEF_HELPER_3(vfp_toshs_round_to_zero, i32, f32, i32, ptr)
25
@@ -XXX,XX +XXX,XX @@ struct MuscaMachineState {
29
DEF_HELPER_3(vfp_tosls_round_to_zero, i32, f32, i32, ptr)
26
UnimplementedDeviceState sdio;
30
DEF_HELPER_3(vfp_touhs_round_to_zero, i32, f32, i32, ptr)
27
UnimplementedDeviceState gpio;
31
@@ -XXX,XX +XXX,XX @@ DEF_HELPER_3(vfp_sqtod, f64, i64, i32, ptr)
28
UnimplementedDeviceState cryptoisland;
32
DEF_HELPER_3(vfp_uhtod, f64, i64, i32, ptr)
29
+ Clock *sysclk;
33
DEF_HELPER_3(vfp_ultod, f64, i64, i32, ptr)
30
+ Clock *s32kclk;
34
DEF_HELPER_3(vfp_uqtod, f64, i64, i32, ptr)
31
};
35
+DEF_HELPER_3(vfp_shtoh, f16, i32, i32, ptr)
32
36
+DEF_HELPER_3(vfp_uhtoh, f16, i32, i32, ptr)
33
#define TYPE_MUSCA_MACHINE "musca"
37
DEF_HELPER_3(vfp_sltoh, f16, i32, i32, ptr)
34
@@ -XXX,XX +XXX,XX @@ OBJECT_DECLARE_TYPE(MuscaMachineState, MuscaMachineClass, MUSCA_MACHINE)
38
DEF_HELPER_3(vfp_ultoh, f16, i32, i32, ptr)
35
* don't model that in our SSE-200 model yet.
39
DEF_HELPER_3(vfp_sqtoh, f16, i64, i32, ptr)
40
diff --git a/target/arm/vfp_helper.c b/target/arm/vfp_helper.c
41
index XXXXXXX..XXXXXXX 100644
42
--- a/target/arm/vfp_helper.c
43
+++ b/target/arm/vfp_helper.c
44
@@ -XXX,XX +XXX,XX @@ VFP_CONV_FIX_A64(sq, s, 32, float32, 64, int64)
45
VFP_CONV_FIX(uh, s, 32, float32, 32, uint16)
46
VFP_CONV_FIX(ul, s, 32, float32, 32, uint32)
47
VFP_CONV_FIX_A64(uq, s, 32, float32, 64, uint64)
48
+VFP_CONV_FIX(sh, h, 16, dh_ctype_f16, 32, int16)
49
+VFP_CONV_FIX(sl, h, 16, dh_ctype_f16, 32, int32)
50
+VFP_CONV_FIX_A64(sq, h, 16, dh_ctype_f16, 64, int64)
51
+VFP_CONV_FIX(uh, h, 16, dh_ctype_f16, 32, uint16)
52
+VFP_CONV_FIX(ul, h, 16, dh_ctype_f16, 32, uint32)
53
+VFP_CONV_FIX_A64(uq, h, 16, dh_ctype_f16, 64, uint64)
54
55
#undef VFP_CONV_FIX
56
#undef VFP_CONV_FIX_FLOAT
57
#undef VFP_CONV_FLOAT_FIX_ROUND
58
#undef VFP_CONV_FIX_A64
59
60
-uint32_t HELPER(vfp_sltoh)(uint32_t x, uint32_t shift, void *fpst)
61
-{
62
- return int32_to_float16_scalbn(x, -shift, fpst);
63
-}
64
-
65
-uint32_t HELPER(vfp_ultoh)(uint32_t x, uint32_t shift, void *fpst)
66
-{
67
- return uint32_to_float16_scalbn(x, -shift, fpst);
68
-}
69
-
70
-uint32_t HELPER(vfp_sqtoh)(uint64_t x, uint32_t shift, void *fpst)
71
-{
72
- return int64_to_float16_scalbn(x, -shift, fpst);
73
-}
74
-
75
-uint32_t HELPER(vfp_uqtoh)(uint64_t x, uint32_t shift, void *fpst)
76
-{
77
- return uint64_to_float16_scalbn(x, -shift, fpst);
78
-}
79
-
80
-uint32_t HELPER(vfp_toshh)(uint32_t x, uint32_t shift, void *fpst)
81
-{
82
- if (unlikely(float16_is_any_nan(x))) {
83
- float_raise(float_flag_invalid, fpst);
84
- return 0;
85
- }
86
- return float16_to_int16_scalbn(x, get_float_rounding_mode(fpst),
87
- shift, fpst);
88
-}
89
-
90
-uint32_t HELPER(vfp_touhh)(uint32_t x, uint32_t shift, void *fpst)
91
-{
92
- if (unlikely(float16_is_any_nan(x))) {
93
- float_raise(float_flag_invalid, fpst);
94
- return 0;
95
- }
96
- return float16_to_uint16_scalbn(x, get_float_rounding_mode(fpst),
97
- shift, fpst);
98
-}
99
-
100
-uint32_t HELPER(vfp_toslh)(uint32_t x, uint32_t shift, void *fpst)
101
-{
102
- if (unlikely(float16_is_any_nan(x))) {
103
- float_raise(float_flag_invalid, fpst);
104
- return 0;
105
- }
106
- return float16_to_int32_scalbn(x, get_float_rounding_mode(fpst),
107
- shift, fpst);
108
-}
109
-
110
-uint32_t HELPER(vfp_toulh)(uint32_t x, uint32_t shift, void *fpst)
111
-{
112
- if (unlikely(float16_is_any_nan(x))) {
113
- float_raise(float_flag_invalid, fpst);
114
- return 0;
115
- }
116
- return float16_to_uint32_scalbn(x, get_float_rounding_mode(fpst),
117
- shift, fpst);
118
-}
119
-
120
-uint64_t HELPER(vfp_tosqh)(uint32_t x, uint32_t shift, void *fpst)
121
-{
122
- if (unlikely(float16_is_any_nan(x))) {
123
- float_raise(float_flag_invalid, fpst);
124
- return 0;
125
- }
126
- return float16_to_int64_scalbn(x, get_float_rounding_mode(fpst),
127
- shift, fpst);
128
-}
129
-
130
-uint64_t HELPER(vfp_touqh)(uint32_t x, uint32_t shift, void *fpst)
131
-{
132
- if (unlikely(float16_is_any_nan(x))) {
133
- float_raise(float_flag_invalid, fpst);
134
- return 0;
135
- }
136
- return float16_to_uint64_scalbn(x, get_float_rounding_mode(fpst),
137
- shift, fpst);
138
-}
139
-
140
/* Set the current fp rounding mode and return the old one.
141
* The argument is a softfloat float_round_ value.
142
*/
36
*/
37
#define SYSCLK_FRQ 40000000
38
+/* Slow 32Khz S32KCLK frequency in Hz */
39
+#define S32KCLK_FRQ (32 * 1000)
40
41
static qemu_irq get_sse_irq_in(MuscaMachineState *mms, int irqno)
42
{
43
@@ -XXX,XX +XXX,XX @@ static void musca_init(MachineState *machine)
44
exit(1);
45
}
46
47
+ mms->sysclk = clock_new(OBJECT(machine), "SYSCLK");
48
+ clock_set_hz(mms->sysclk, SYSCLK_FRQ);
49
+ mms->s32kclk = clock_new(OBJECT(machine), "S32KCLK");
50
+ clock_set_hz(mms->s32kclk, S32KCLK_FRQ);
51
+
52
object_initialize_child(OBJECT(machine), "sse-200", &mms->sse,
53
TYPE_SSE200);
54
ssedev = DEVICE(&mms->sse);
55
@@ -XXX,XX +XXX,XX @@ static void musca_init(MachineState *machine)
56
qdev_prop_set_uint32(ssedev, "init-svtor", mmc->init_svtor);
57
qdev_prop_set_uint32(ssedev, "SRAM_ADDR_WIDTH", mmc->sram_addr_width);
58
qdev_prop_set_uint32(ssedev, "MAINCLK_FRQ", SYSCLK_FRQ);
59
+ qdev_connect_clock_in(ssedev, "MAINCLK", mms->sysclk);
60
+ qdev_connect_clock_in(ssedev, "S32KCLK", mms->s32kclk);
61
/*
62
* Musca-A takes the default SSE-200 FPU/DSP settings (ie no for
63
* CPU0 and yes for CPU1); Musca-B1 explicitly enables them for CPU0.
143
--
64
--
144
2.20.1
65
2.20.1
145
66
146
67
diff view generated by jsdifflib
1
Convert the Neon VRSQRTS insn to using a gvec helper,
1
Convert the SSYS code in the Stellaris boards (which encapsulates the
2
and use this to implement the fp16 case.
2
system registers) to a proper QOM device. This will provide us with
3
3
somewhere to put the output Clock whose frequency depends on the
4
As with VRECPS, we adjust the phrasing of the new implementation
4
setting of the PLL configuration registers.
5
slightly so that the fp32 version parallels the fp16 one.
5
6
This is a migration compatibility break for lm3s811evb, lm3s6965evb.
7
8
We use 3-phase reset here because the Clock will need to propagate
9
its value in the hold phase.
10
11
For the moment we reset the device during the board creation so that
12
the system_clock_scale global gets set; this will be removed in a
13
subsequent commit.
6
14
7
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
15
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
8
Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
16
Reviewed-by: Luc Michel <luc@lmichel.fr>
9
Message-id: 20200828183354.27913-35-peter.maydell@linaro.org
17
Tested-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
18
Reviewed-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
19
Message-id: 20210128114145.20536-17-peter.maydell@linaro.org
20
Message-id: 20210121190622.22000-17-peter.maydell@linaro.org
21
Reviewed-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
10
---
22
---
11
target/arm/helper.h | 4 +++-
23
hw/arm/stellaris.c | 132 ++++++++++++++++++++++++++++++++++++---------
12
target/arm/vec_helper.c | 30 ++++++++++++++++++++++++++++++
24
1 file changed, 107 insertions(+), 25 deletions(-)
13
target/arm/vfp_helper.c | 15 ---------------
25
14
target/arm/translate-neon.c.inc | 21 +--------------------
26
diff --git a/hw/arm/stellaris.c b/hw/arm/stellaris.c
15
4 files changed, 34 insertions(+), 36 deletions(-)
16
17
diff --git a/target/arm/helper.h b/target/arm/helper.h
18
index XXXXXXX..XXXXXXX 100644
27
index XXXXXXX..XXXXXXX 100644
19
--- a/target/arm/helper.h
28
--- a/hw/arm/stellaris.c
20
+++ b/target/arm/helper.h
29
+++ b/hw/arm/stellaris.c
21
@@ -XXX,XX +XXX,XX @@ DEF_HELPER_4(vfp_muladdd, f64, f64, f64, f64, ptr)
30
@@ -XXX,XX +XXX,XX @@ static void stellaris_gptm_realize(DeviceState *dev, Error **errp)
22
DEF_HELPER_4(vfp_muladds, f32, f32, f32, f32, ptr)
31
23
DEF_HELPER_4(vfp_muladdh, f16, f16, f16, f16, ptr)
32
/* System controller. */
24
33
25
-DEF_HELPER_3(rsqrts_f32, f32, env, f32, f32)
34
-typedef struct {
26
DEF_HELPER_FLAGS_2(recpe_f16, TCG_CALL_NO_RWG, f16, f16, ptr)
35
+#define TYPE_STELLARIS_SYS "stellaris-sys"
27
DEF_HELPER_FLAGS_2(recpe_f32, TCG_CALL_NO_RWG, f32, f32, ptr)
36
+OBJECT_DECLARE_SIMPLE_TYPE(ssys_state, STELLARIS_SYS)
28
DEF_HELPER_FLAGS_2(recpe_f64, TCG_CALL_NO_RWG, f64, f64, ptr)
37
+
29
@@ -XXX,XX +XXX,XX @@ DEF_HELPER_FLAGS_5(gvec_fminnum_s, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, ptr, i3
38
+struct ssys_state {
30
DEF_HELPER_FLAGS_5(gvec_recps_nf_h, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, ptr, i32)
39
+ SysBusDevice parent_obj;
31
DEF_HELPER_FLAGS_5(gvec_recps_nf_s, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, ptr, i32)
40
+
32
41
MemoryRegion iomem;
33
+DEF_HELPER_FLAGS_5(gvec_rsqrts_nf_h, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, ptr, i32)
42
uint32_t pborctl;
34
+DEF_HELPER_FLAGS_5(gvec_rsqrts_nf_s, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, ptr, i32)
43
uint32_t ldopctl;
35
+
44
@@ -XXX,XX +XXX,XX @@ typedef struct {
36
DEF_HELPER_FLAGS_5(gvec_fmla_h, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, ptr, i32)
45
uint32_t dcgc[3];
37
DEF_HELPER_FLAGS_5(gvec_fmla_s, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, ptr, i32)
46
uint32_t clkvclr;
38
47
uint32_t ldoarst;
39
diff --git a/target/arm/vec_helper.c b/target/arm/vec_helper.c
48
+ qemu_irq irq;
40
index XXXXXXX..XXXXXXX 100644
49
+ /* Properties (all read-only registers) */
41
--- a/target/arm/vec_helper.c
50
uint32_t user0;
42
+++ b/target/arm/vec_helper.c
51
uint32_t user1;
43
@@ -XXX,XX +XXX,XX @@ static float32 float32_recps_nf(float32 op1, float32 op2, float_status *stat)
52
- qemu_irq irq;
44
return float32_sub(float32_two, float32_mul(op1, op2, stat), stat);
53
- stellaris_board_info *board;
54
-} ssys_state;
55
+ uint32_t did0;
56
+ uint32_t did1;
57
+ uint32_t dc0;
58
+ uint32_t dc1;
59
+ uint32_t dc2;
60
+ uint32_t dc3;
61
+ uint32_t dc4;
62
+};
63
64
static void ssys_update(ssys_state *s)
65
{
66
@@ -XXX,XX +XXX,XX @@ static uint32_t pllcfg_fury[16] = {
67
68
static int ssys_board_class(const ssys_state *s)
69
{
70
- uint32_t did0 = s->board->did0;
71
+ uint32_t did0 = s->did0;
72
switch (did0 & DID0_VER_MASK) {
73
case DID0_VER_0:
74
return DID0_CLASS_SANDSTORM;
75
@@ -XXX,XX +XXX,XX @@ static uint64_t ssys_read(void *opaque, hwaddr offset,
76
77
switch (offset) {
78
case 0x000: /* DID0 */
79
- return s->board->did0;
80
+ return s->did0;
81
case 0x004: /* DID1 */
82
- return s->board->did1;
83
+ return s->did1;
84
case 0x008: /* DC0 */
85
- return s->board->dc0;
86
+ return s->dc0;
87
case 0x010: /* DC1 */
88
- return s->board->dc1;
89
+ return s->dc1;
90
case 0x014: /* DC2 */
91
- return s->board->dc2;
92
+ return s->dc2;
93
case 0x018: /* DC3 */
94
- return s->board->dc3;
95
+ return s->dc3;
96
case 0x01c: /* DC4 */
97
- return s->board->dc4;
98
+ return s->dc4;
99
case 0x030: /* PBORCTL */
100
return s->pborctl;
101
case 0x034: /* LDOPCTL */
102
@@ -XXX,XX +XXX,XX @@ static const MemoryRegionOps ssys_ops = {
103
.endianness = DEVICE_NATIVE_ENDIAN,
104
};
105
106
-static void ssys_reset(void *opaque)
107
+static void stellaris_sys_reset_enter(Object *obj, ResetType type)
108
{
109
- ssys_state *s = (ssys_state *)opaque;
110
+ ssys_state *s = STELLARIS_SYS(obj);
111
112
s->pborctl = 0x7ffd;
113
s->rcc = 0x078e3ac0;
114
@@ -XXX,XX +XXX,XX @@ static void ssys_reset(void *opaque)
115
s->rcgc[0] = 1;
116
s->scgc[0] = 1;
117
s->dcgc[0] = 1;
118
+}
119
+
120
+static void stellaris_sys_reset_hold(Object *obj)
121
+{
122
+ ssys_state *s = STELLARIS_SYS(obj);
123
+
124
ssys_calculate_system_clock(s);
45
}
125
}
46
126
47
+/* Reciprocal square-root step. AArch32 non-fused semantics. */
127
+static void stellaris_sys_reset_exit(Object *obj)
48
+static float16 float16_rsqrts_nf(float16 op1, float16 op2, float_status *stat)
128
+{
49
+{
129
+}
50
+ op1 = float16_squash_input_denormal(op1, stat);
130
+
51
+ op2 = float16_squash_input_denormal(op2, stat);
131
static int stellaris_sys_post_load(void *opaque, int version_id)
52
+
132
{
53
+ if ((float16_is_infinity(op1) && float16_is_zero(op2)) ||
133
ssys_state *s = opaque;
54
+ (float16_is_infinity(op2) && float16_is_zero(op1))) {
134
@@ -XXX,XX +XXX,XX @@ static const VMStateDescription vmstate_stellaris_sys = {
55
+ return float16_one_point_five;
135
}
56
+ }
136
};
57
+ op1 = float16_sub(float16_three, float16_mul(op1, op2, stat), stat);
137
58
+ return float16_div(op1, float16_two, stat);
138
+static Property stellaris_sys_properties[] = {
59
+}
139
+ DEFINE_PROP_UINT32("user0", ssys_state, user0, 0),
60
+
140
+ DEFINE_PROP_UINT32("user1", ssys_state, user1, 0),
61
+static float32 float32_rsqrts_nf(float32 op1, float32 op2, float_status *stat)
141
+ DEFINE_PROP_UINT32("did0", ssys_state, did0, 0),
62
+{
142
+ DEFINE_PROP_UINT32("did1", ssys_state, did1, 0),
63
+ op1 = float32_squash_input_denormal(op1, stat);
143
+ DEFINE_PROP_UINT32("dc0", ssys_state, dc0, 0),
64
+ op2 = float32_squash_input_denormal(op2, stat);
144
+ DEFINE_PROP_UINT32("dc1", ssys_state, dc1, 0),
65
+
145
+ DEFINE_PROP_UINT32("dc2", ssys_state, dc2, 0),
66
+ if ((float32_is_infinity(op1) && float32_is_zero(op2)) ||
146
+ DEFINE_PROP_UINT32("dc3", ssys_state, dc3, 0),
67
+ (float32_is_infinity(op2) && float32_is_zero(op1))) {
147
+ DEFINE_PROP_UINT32("dc4", ssys_state, dc4, 0),
68
+ return float32_one_point_five;
148
+ DEFINE_PROP_END_OF_LIST()
69
+ }
149
+};
70
+ op1 = float32_sub(float32_three, float32_mul(op1, op2, stat), stat);
150
+
71
+ return float32_div(op1, float32_two, stat);
151
+static void stellaris_sys_instance_init(Object *obj)
72
+}
152
+{
73
+
153
+ ssys_state *s = STELLARIS_SYS(obj);
74
#define DO_3OP(NAME, FUNC, TYPE) \
154
+ SysBusDevice *sbd = SYS_BUS_DEVICE(s);
75
void HELPER(NAME)(void *vd, void *vn, void *vm, void *stat, uint32_t desc) \
155
+
76
{ \
156
+ memory_region_init_io(&s->iomem, obj, &ssys_ops, s, "ssys", 0x00001000);
77
@@ -XXX,XX +XXX,XX @@ DO_3OP(gvec_fminnum_s, float32_minnum, float32)
157
+ sysbus_init_mmio(sbd, &s->iomem);
78
DO_3OP(gvec_recps_nf_h, float16_recps_nf, float16)
158
+ sysbus_init_irq(sbd, &s->irq);
79
DO_3OP(gvec_recps_nf_s, float32_recps_nf, float32)
159
+}
80
160
+
81
+DO_3OP(gvec_rsqrts_nf_h, float16_rsqrts_nf, float16)
161
static int stellaris_sys_init(uint32_t base, qemu_irq irq,
82
+DO_3OP(gvec_rsqrts_nf_s, float32_rsqrts_nf, float32)
162
stellaris_board_info * board,
83
+
163
uint8_t *macaddr)
84
#ifdef TARGET_AARCH64
164
{
85
165
- ssys_state *s;
86
DO_3OP(gvec_recps_h, helper_recpsf_f16, float16)
166
+ DeviceState *dev = qdev_new(TYPE_STELLARIS_SYS);
87
diff --git a/target/arm/vfp_helper.c b/target/arm/vfp_helper.c
167
+ SysBusDevice *sbd = SYS_BUS_DEVICE(dev);
88
index XXXXXXX..XXXXXXX 100644
168
89
--- a/target/arm/vfp_helper.c
169
- s = g_new0(ssys_state, 1);
90
+++ b/target/arm/vfp_helper.c
170
- s->irq = irq;
91
@@ -XXX,XX +XXX,XX @@ uint32_t HELPER(vfp_fcvt_f64_to_f16)(float64 a, void *fpstp, uint32_t ahp_mode)
171
- s->board = board;
92
return r;
172
/* Most devices come preprogrammed with a MAC address in the user data. */
173
- s->user0 = macaddr[0] | (macaddr[1] << 8) | (macaddr[2] << 16);
174
- s->user1 = macaddr[3] | (macaddr[4] << 8) | (macaddr[5] << 16);
175
+ qdev_prop_set_uint32(dev, "user0",
176
+ macaddr[0] | (macaddr[1] << 8) | (macaddr[2] << 16));
177
+ qdev_prop_set_uint32(dev, "user1",
178
+ macaddr[3] | (macaddr[4] << 8) | (macaddr[5] << 16));
179
+ qdev_prop_set_uint32(dev, "did0", board->did0);
180
+ qdev_prop_set_uint32(dev, "did1", board->did1);
181
+ qdev_prop_set_uint32(dev, "dc0", board->dc0);
182
+ qdev_prop_set_uint32(dev, "dc1", board->dc1);
183
+ qdev_prop_set_uint32(dev, "dc2", board->dc2);
184
+ qdev_prop_set_uint32(dev, "dc3", board->dc3);
185
+ qdev_prop_set_uint32(dev, "dc4", board->dc4);
186
+
187
+ sysbus_realize_and_unref(sbd, &error_fatal);
188
+ sysbus_mmio_map(sbd, 0, base);
189
+ sysbus_connect_irq(sbd, 0, irq);
190
+
191
+ /*
192
+ * Normally we should not be resetting devices like this during
193
+ * board creation. For the moment we need to do so, because
194
+ * system_clock_scale will only get set when the STELLARIS_SYS
195
+ * device is reset, and we need its initial value to pass to
196
+ * the watchdog device. This hack can be removed once the
197
+ * watchdog has been converted to use a Clock input instead.
198
+ */
199
+ device_cold_reset(dev);
200
201
- memory_region_init_io(&s->iomem, NULL, &ssys_ops, s, "ssys", 0x00001000);
202
- memory_region_add_subregion(get_system_memory(), base, &s->iomem);
203
- ssys_reset(s);
204
- vmstate_register(NULL, VMSTATE_INSTANCE_ID_ANY, &vmstate_stellaris_sys, s);
205
return 0;
93
}
206
}
94
207
95
-float32 HELPER(rsqrts_f32)(CPUARMState *env, float32 a, float32 b)
96
-{
97
- float_status *s = &env->vfp.standard_fp_status;
98
- float32 product;
99
- if ((float32_is_infinity(a) && float32_is_zero_or_denormal(b)) ||
100
- (float32_is_infinity(b) && float32_is_zero_or_denormal(a))) {
101
- if (!(float32_is_zero(a) || float32_is_zero(b))) {
102
- float_raise(float_flag_input_denormal, s);
103
- }
104
- return float32_one_point_five;
105
- }
106
- product = float32_mul(a, b, s);
107
- return float32_div(float32_sub(float32_three, product, s), float32_two, s);
108
-}
109
-
208
-
110
/* NEON helpers. */
209
/* I2C controller. */
111
210
112
/* Constants 256 and 512 are used in some helpers; we avoid relying on
211
#define TYPE_STELLARIS_I2C "stellaris-i2c"
113
diff --git a/target/arm/translate-neon.c.inc b/target/arm/translate-neon.c.inc
212
@@ -XXX,XX +XXX,XX @@ static const TypeInfo stellaris_adc_info = {
114
index XXXXXXX..XXXXXXX 100644
213
.class_init = stellaris_adc_class_init,
115
--- a/target/arm/translate-neon.c.inc
214
};
116
+++ b/target/arm/translate-neon.c.inc
215
117
@@ -XXX,XX +XXX,XX @@ DO_3S_FP_GVEC(VMLS, gen_helper_gvec_fmls_s, gen_helper_gvec_fmls_h)
216
+static void stellaris_sys_class_init(ObjectClass *klass, void *data)
118
DO_3S_FP_GVEC(VFMA, gen_helper_gvec_vfma_s, gen_helper_gvec_vfma_h)
217
+{
119
DO_3S_FP_GVEC(VFMS, gen_helper_gvec_vfms_s, gen_helper_gvec_vfms_h)
218
+ DeviceClass *dc = DEVICE_CLASS(klass);
120
DO_3S_FP_GVEC(VRECPS, gen_helper_gvec_recps_nf_s, gen_helper_gvec_recps_nf_h)
219
+ ResettableClass *rc = RESETTABLE_CLASS(klass);
121
+DO_3S_FP_GVEC(VRSQRTS, gen_helper_gvec_rsqrts_nf_s, gen_helper_gvec_rsqrts_nf_h)
220
+
122
221
+ dc->vmsd = &vmstate_stellaris_sys;
123
WRAP_FP_GVEC(gen_VMAXNM_fp32_3s, FPST_STD, gen_helper_gvec_fmaxnum_s)
222
+ rc->phases.enter = stellaris_sys_reset_enter;
124
WRAP_FP_GVEC(gen_VMAXNM_fp16_3s, FPST_STD_F16, gen_helper_gvec_fmaxnum_h)
223
+ rc->phases.hold = stellaris_sys_reset_hold;
125
@@ -XXX,XX +XXX,XX @@ static bool trans_VMINNM_fp_3s(DisasContext *s, arg_3same *a)
224
+ rc->phases.exit = stellaris_sys_reset_exit;
126
return do_3same(s, a, gen_VMINNM_fp32_3s);
225
+ device_class_set_props(dc, stellaris_sys_properties);
226
+}
227
+
228
+static const TypeInfo stellaris_sys_info = {
229
+ .name = TYPE_STELLARIS_SYS,
230
+ .parent = TYPE_SYS_BUS_DEVICE,
231
+ .instance_size = sizeof(ssys_state),
232
+ .instance_init = stellaris_sys_instance_init,
233
+ .class_init = stellaris_sys_class_init,
234
+};
235
+
236
static void stellaris_register_types(void)
237
{
238
type_register_static(&stellaris_i2c_info);
239
type_register_static(&stellaris_gptm_info);
240
type_register_static(&stellaris_adc_info);
241
+ type_register_static(&stellaris_sys_info);
127
}
242
}
128
243
129
-WRAP_ENV_FN(gen_VRSQRTS_tramp, gen_helper_rsqrts_f32)
244
type_init(stellaris_register_types)
130
-
131
-static void gen_VRSQRTS_fp_3s(unsigned vece, uint32_t rd_ofs,
132
- uint32_t rn_ofs, uint32_t rm_ofs,
133
- uint32_t oprsz, uint32_t maxsz)
134
-{
135
- static const GVecGen3 ops = { .fni4 = gen_VRSQRTS_tramp };
136
- tcg_gen_gvec_3(rd_ofs, rn_ofs, rm_ofs, oprsz, maxsz, &ops);
137
-}
138
-
139
-static bool trans_VRSQRTS_fp_3s(DisasContext *s, arg_3same *a)
140
-{
141
- if (a->size != 0) {
142
- /* TODO fp16 support */
143
- return false;
144
- }
145
-
146
- return do_3same(s, a, gen_VRSQRTS_fp_3s);
147
-}
148
-
149
static bool do_3same_fp_pair(DisasContext *s, arg_3same *a, VFPGen3OpSPFn *fn)
150
{
151
/* FP operations handled pairwise 32 bits at a time */
152
--
245
--
153
2.20.1
246
2.20.1
154
247
155
248
diff view generated by jsdifflib
1
Rewrite Neon VABS/VNEG of floats to use gvec logical AND and XOR, so
1
Create and connect the Clock input for the watchdog device on the
2
that we can implement the fp16 version of the insns.
2
Stellaris boards. Because the Stellaris boards model the ability to
3
change the clock rate by programming PLL registers, we have to create
4
an output Clock on the ssys_state device and wire it up to the
5
watchdog.
6
7
Note that the old comment on ssys_calculate_system_clock() got the
8
units wrong -- system_clock_scale is in nanoseconds, not
9
milliseconds. Improve the commentary to clarify how we are
10
calculating the period.
3
11
4
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
12
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
5
Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
13
Reviewed-by: Luc Michel <luc@lmichel.fr>
6
Message-id: 20200828183354.27913-26-peter.maydell@linaro.org
14
Tested-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
15
Reviewed-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
16
Message-id: 20210128114145.20536-18-peter.maydell@linaro.org
17
Message-id: 20210121190622.22000-18-peter.maydell@linaro.org
18
Reviewed-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
7
---
19
---
8
target/arm/translate-neon.c.inc | 34 +++++++++++++++++++++++++++------
20
hw/arm/stellaris.c | 43 +++++++++++++++++++++++++++++++------------
9
1 file changed, 28 insertions(+), 6 deletions(-)
21
1 file changed, 31 insertions(+), 12 deletions(-)
10
22
11
diff --git a/target/arm/translate-neon.c.inc b/target/arm/translate-neon.c.inc
23
diff --git a/hw/arm/stellaris.c b/hw/arm/stellaris.c
12
index XXXXXXX..XXXXXXX 100644
24
index XXXXXXX..XXXXXXX 100644
13
--- a/target/arm/translate-neon.c.inc
25
--- a/hw/arm/stellaris.c
14
+++ b/target/arm/translate-neon.c.inc
26
+++ b/hw/arm/stellaris.c
15
@@ -XXX,XX +XXX,XX @@ static bool trans_VCNT(DisasContext *s, arg_2misc *a)
27
@@ -XXX,XX +XXX,XX @@
16
return do_2misc(s, a, gen_helper_neon_cnt_u8);
28
#include "hw/watchdog/cmsdk-apb-watchdog.h"
29
#include "migration/vmstate.h"
30
#include "hw/misc/unimp.h"
31
+#include "hw/qdev-clock.h"
32
#include "cpu.h"
33
#include "qom/object.h"
34
35
@@ -XXX,XX +XXX,XX @@ struct ssys_state {
36
uint32_t clkvclr;
37
uint32_t ldoarst;
38
qemu_irq irq;
39
+ Clock *sysclk;
40
/* Properties (all read-only registers) */
41
uint32_t user0;
42
uint32_t user1;
43
@@ -XXX,XX +XXX,XX @@ static bool ssys_use_rcc2(ssys_state *s)
17
}
44
}
18
45
19
+static void gen_VABS_F(unsigned vece, uint32_t rd_ofs, uint32_t rm_ofs,
46
/*
20
+ uint32_t oprsz, uint32_t maxsz)
47
- * Caculate the sys. clock period in ms.
21
+{
48
+ * Calculate the system clock period. We only want to propagate
22
+ tcg_gen_gvec_andi(vece, rd_ofs, rm_ofs,
49
+ * this change to the rest of the system if we're not being called
23
+ vece == MO_16 ? 0x7fff : 0x7fffffff,
50
+ * from migration post-load.
24
+ oprsz, maxsz);
51
*/
25
+}
52
-static void ssys_calculate_system_clock(ssys_state *s)
26
+
53
+static void ssys_calculate_system_clock(ssys_state *s, bool propagate_clock)
27
static bool trans_VABS_F(DisasContext *s, arg_2misc *a)
28
{
54
{
29
- if (a->size != 2) {
55
+ /*
30
+ if (a->size == MO_16) {
56
+ * SYSDIV field specifies divisor: 0 == /1, 1 == /2, etc. Input
31
+ if (!dc_isar_feature(aa32_fp16_arith, s)) {
57
+ * clock is 200MHz, which is a period of 5 ns. Dividing the clock
32
+ return false;
58
+ * frequency by X is the same as multiplying the period by X.
33
+ }
59
+ */
34
+ } else if (a->size != MO_32) {
60
if (ssys_use_rcc2(s)) {
35
return false;
61
system_clock_scale = 5 * (((s->rcc2 >> 23) & 0x3f) + 1);
62
} else {
63
system_clock_scale = 5 * (((s->rcc >> 23) & 0xf) + 1);
36
}
64
}
37
- /* TODO: FP16 : size == 1 */
65
+ clock_set_ns(s->sysclk, system_clock_scale);
38
- return do_2misc(s, a, gen_helper_vfp_abss);
66
+ if (propagate_clock) {
39
+ return do_2misc_vec(s, a, gen_VABS_F);
67
+ clock_propagate(s->sysclk);
40
+}
68
+ }
41
+
42
+static void gen_VNEG_F(unsigned vece, uint32_t rd_ofs, uint32_t rm_ofs,
43
+ uint32_t oprsz, uint32_t maxsz)
44
+{
45
+ tcg_gen_gvec_xori(vece, rd_ofs, rm_ofs,
46
+ vece == MO_16 ? 0x8000 : 0x80000000,
47
+ oprsz, maxsz);
48
}
69
}
49
70
50
static bool trans_VNEG_F(DisasContext *s, arg_2misc *a)
71
static void ssys_write(void *opaque, hwaddr offset,
72
@@ -XXX,XX +XXX,XX @@ static void ssys_write(void *opaque, hwaddr offset,
73
s->int_status |= (1 << 6);
74
}
75
s->rcc = value;
76
- ssys_calculate_system_clock(s);
77
+ ssys_calculate_system_clock(s, true);
78
break;
79
case 0x070: /* RCC2 */
80
if (ssys_board_class(s) == DID0_CLASS_SANDSTORM) {
81
@@ -XXX,XX +XXX,XX @@ static void ssys_write(void *opaque, hwaddr offset,
82
s->int_status |= (1 << 6);
83
}
84
s->rcc2 = value;
85
- ssys_calculate_system_clock(s);
86
+ ssys_calculate_system_clock(s, true);
87
break;
88
case 0x100: /* RCGC0 */
89
s->rcgc[0] = value;
90
@@ -XXX,XX +XXX,XX @@ static void stellaris_sys_reset_hold(Object *obj)
51
{
91
{
52
- if (a->size != 2) {
92
ssys_state *s = STELLARIS_SYS(obj);
53
+ if (a->size == MO_16) {
93
54
+ if (!dc_isar_feature(aa32_fp16_arith, s)) {
94
- ssys_calculate_system_clock(s);
55
+ return false;
95
+ /* OK to propagate clocks from the hold phase */
56
+ }
96
+ ssys_calculate_system_clock(s, true);
57
+ } else if (a->size != MO_32) {
97
}
58
return false;
98
99
static void stellaris_sys_reset_exit(Object *obj)
100
@@ -XXX,XX +XXX,XX @@ static int stellaris_sys_post_load(void *opaque, int version_id)
101
{
102
ssys_state *s = opaque;
103
104
- ssys_calculate_system_clock(s);
105
+ ssys_calculate_system_clock(s, false);
106
107
return 0;
108
}
109
@@ -XXX,XX +XXX,XX @@ static const VMStateDescription vmstate_stellaris_sys = {
110
VMSTATE_UINT32_ARRAY(dcgc, ssys_state, 3),
111
VMSTATE_UINT32(clkvclr, ssys_state),
112
VMSTATE_UINT32(ldoarst, ssys_state),
113
+ /* No field for sysclk -- handled in post-load instead */
114
VMSTATE_END_OF_LIST()
59
}
115
}
60
- /* TODO: FP16 : size == 1 */
116
};
61
- return do_2misc(s, a, gen_helper_vfp_negs);
117
@@ -XXX,XX +XXX,XX @@ static void stellaris_sys_instance_init(Object *obj)
62
+ return do_2misc_vec(s, a, gen_VNEG_F);
118
memory_region_init_io(&s->iomem, obj, &ssys_ops, s, "ssys", 0x00001000);
119
sysbus_init_mmio(sbd, &s->iomem);
120
sysbus_init_irq(sbd, &s->irq);
121
+ s->sysclk = qdev_init_clock_out(DEVICE(s), "SYSCLK");
63
}
122
}
64
123
65
static bool trans_VRECPE(DisasContext *s, arg_2misc *a)
124
-static int stellaris_sys_init(uint32_t base, qemu_irq irq,
125
- stellaris_board_info * board,
126
- uint8_t *macaddr)
127
+static DeviceState *stellaris_sys_init(uint32_t base, qemu_irq irq,
128
+ stellaris_board_info *board,
129
+ uint8_t *macaddr)
130
{
131
DeviceState *dev = qdev_new(TYPE_STELLARIS_SYS);
132
SysBusDevice *sbd = SYS_BUS_DEVICE(dev);
133
@@ -XXX,XX +XXX,XX @@ static int stellaris_sys_init(uint32_t base, qemu_irq irq,
134
*/
135
device_cold_reset(dev);
136
137
- return 0;
138
+ return dev;
139
}
140
141
/* I2C controller. */
142
@@ -XXX,XX +XXX,XX @@ static void stellaris_init(MachineState *ms, stellaris_board_info *board)
143
int flash_size;
144
I2CBus *i2c;
145
DeviceState *dev;
146
+ DeviceState *ssys_dev;
147
int i;
148
int j;
149
150
@@ -XXX,XX +XXX,XX @@ static void stellaris_init(MachineState *ms, stellaris_board_info *board)
151
}
152
}
153
154
- stellaris_sys_init(0x400fe000, qdev_get_gpio_in(nvic, 28),
155
- board, nd_table[0].macaddr.a);
156
+ ssys_dev = stellaris_sys_init(0x400fe000, qdev_get_gpio_in(nvic, 28),
157
+ board, nd_table[0].macaddr.a);
158
159
160
if (board->dc1 & (1 << 3)) { /* watchdog present */
161
@@ -XXX,XX +XXX,XX @@ static void stellaris_init(MachineState *ms, stellaris_board_info *board)
162
/* system_clock_scale is valid now */
163
uint32_t mainclk = NANOSECONDS_PER_SECOND / system_clock_scale;
164
qdev_prop_set_uint32(dev, "wdogclk-frq", mainclk);
165
+ qdev_connect_clock_in(dev, "WDOGCLK",
166
+ qdev_get_clock_out(ssys_dev, "SYSCLK"));
167
168
sysbus_realize_and_unref(SYS_BUS_DEVICE(dev), &error_fatal);
169
sysbus_mmio_map(SYS_BUS_DEVICE(dev),
66
--
170
--
67
2.20.1
171
2.20.1
68
172
69
173
diff view generated by jsdifflib
1
Implement the fp16 version of the VFP VRINT* insns.
1
Switch the CMSDK APB timer device over to using its Clock input; the
2
pclk-frq property is now ignored.
2
3
3
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
4
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
4
Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
5
Reviewed-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
5
Message-id: 20200828183354.27913-19-peter.maydell@linaro.org
6
Reviewed-by: Luc Michel <luc@lmichel.fr>
7
Tested-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
8
Message-id: 20210128114145.20536-19-peter.maydell@linaro.org
9
Message-id: 20210121190622.22000-19-peter.maydell@linaro.org
6
---
10
---
7
target/arm/helper.h | 2 +
11
hw/timer/cmsdk-apb-timer.c | 18 ++++++++++++++----
8
target/arm/vfp-uncond.decode | 6 ++-
12
1 file changed, 14 insertions(+), 4 deletions(-)
9
target/arm/vfp.decode | 3 ++
10
target/arm/vfp_helper.c | 21 ++++++++
11
target/arm/translate-vfp.c.inc | 98 +++++++++++++++++++++++++++++++---
12
5 files changed, 122 insertions(+), 8 deletions(-)
13
13
14
diff --git a/target/arm/helper.h b/target/arm/helper.h
14
diff --git a/hw/timer/cmsdk-apb-timer.c b/hw/timer/cmsdk-apb-timer.c
15
index XXXXXXX..XXXXXXX 100644
15
index XXXXXXX..XXXXXXX 100644
16
--- a/target/arm/helper.h
16
--- a/hw/timer/cmsdk-apb-timer.c
17
+++ b/target/arm/helper.h
17
+++ b/hw/timer/cmsdk-apb-timer.c
18
@@ -XXX,XX +XXX,XX @@ DEF_HELPER_3(shr_cc, i32, env, i32, i32)
18
@@ -XXX,XX +XXX,XX @@ static void cmsdk_apb_timer_reset(DeviceState *dev)
19
DEF_HELPER_3(sar_cc, i32, env, i32, i32)
19
ptimer_transaction_commit(s->timer);
20
DEF_HELPER_3(ror_cc, i32, env, i32, i32)
21
22
+DEF_HELPER_FLAGS_2(rinth_exact, TCG_CALL_NO_RWG, f16, f16, ptr)
23
DEF_HELPER_FLAGS_2(rints_exact, TCG_CALL_NO_RWG, f32, f32, ptr)
24
DEF_HELPER_FLAGS_2(rintd_exact, TCG_CALL_NO_RWG, f64, f64, ptr)
25
+DEF_HELPER_FLAGS_2(rinth, TCG_CALL_NO_RWG, f16, f16, ptr)
26
DEF_HELPER_FLAGS_2(rints, TCG_CALL_NO_RWG, f32, f32, ptr)
27
DEF_HELPER_FLAGS_2(rintd, TCG_CALL_NO_RWG, f64, f64, ptr)
28
29
diff --git a/target/arm/vfp-uncond.decode b/target/arm/vfp-uncond.decode
30
index XXXXXXX..XXXXXXX 100644
31
--- a/target/arm/vfp-uncond.decode
32
+++ b/target/arm/vfp-uncond.decode
33
@@ -XXX,XX +XXX,XX @@ VMINNM_sp 1111 1110 1.00 .... .... 1010 .1.0 .... @vfp_dnm_s
34
VMAXNM_dp 1111 1110 1.00 .... .... 1011 .0.0 .... @vfp_dnm_d
35
VMINNM_dp 1111 1110 1.00 .... .... 1011 .1.0 .... @vfp_dnm_d
36
37
+VRINT 1111 1110 1.11 10 rm:2 .... 1001 01.0 .... \
38
+ vm=%vm_sp vd=%vd_sp sz=1
39
VRINT 1111 1110 1.11 10 rm:2 .... 1010 01.0 .... \
40
- vm=%vm_sp vd=%vd_sp dp=0
41
+ vm=%vm_sp vd=%vd_sp sz=2
42
VRINT 1111 1110 1.11 10 rm:2 .... 1011 01.0 .... \
43
- vm=%vm_dp vd=%vd_dp dp=1
44
+ vm=%vm_dp vd=%vd_dp sz=3
45
46
# VCVT float to int with specified rounding mode; Vd is always single-precision
47
VCVT 1111 1110 1.11 11 rm:2 .... 1001 op:1 1.0 .... \
48
diff --git a/target/arm/vfp.decode b/target/arm/vfp.decode
49
index XXXXXXX..XXXXXXX 100644
50
--- a/target/arm/vfp.decode
51
+++ b/target/arm/vfp.decode
52
@@ -XXX,XX +XXX,XX @@ VCVT_f16_f32 ---- 1110 1.11 0011 .... 1010 t:1 1.0 .... \
53
VCVT_f16_f64 ---- 1110 1.11 0011 .... 1011 t:1 1.0 .... \
54
vd=%vd_sp vm=%vm_dp
55
56
+VRINTR_hp ---- 1110 1.11 0110 .... 1001 01.0 .... @vfp_dm_ss
57
VRINTR_sp ---- 1110 1.11 0110 .... 1010 01.0 .... @vfp_dm_ss
58
VRINTR_dp ---- 1110 1.11 0110 .... 1011 01.0 .... @vfp_dm_dd
59
60
+VRINTZ_hp ---- 1110 1.11 0110 .... 1001 11.0 .... @vfp_dm_ss
61
VRINTZ_sp ---- 1110 1.11 0110 .... 1010 11.0 .... @vfp_dm_ss
62
VRINTZ_dp ---- 1110 1.11 0110 .... 1011 11.0 .... @vfp_dm_dd
63
64
+VRINTX_hp ---- 1110 1.11 0111 .... 1001 01.0 .... @vfp_dm_ss
65
VRINTX_sp ---- 1110 1.11 0111 .... 1010 01.0 .... @vfp_dm_ss
66
VRINTX_dp ---- 1110 1.11 0111 .... 1011 01.0 .... @vfp_dm_dd
67
68
diff --git a/target/arm/vfp_helper.c b/target/arm/vfp_helper.c
69
index XXXXXXX..XXXXXXX 100644
70
--- a/target/arm/vfp_helper.c
71
+++ b/target/arm/vfp_helper.c
72
@@ -XXX,XX +XXX,XX @@ float64 VFP_HELPER(muladd, d)(float64 a, float64 b, float64 c, void *fpstp)
73
}
20
}
74
21
75
/* ARMv8 round to integral */
22
+static void cmsdk_apb_timer_clk_update(void *opaque)
76
+dh_ctype_f16 HELPER(rinth_exact)(dh_ctype_f16 x, void *fp_status)
77
+{
23
+{
78
+ return float16_round_to_int(x, fp_status);
24
+ CMSDKAPBTimer *s = CMSDK_APB_TIMER(opaque);
25
+
26
+ ptimer_transaction_begin(s->timer);
27
+ ptimer_set_period_from_clock(s->timer, s->pclk, 1);
28
+ ptimer_transaction_commit(s->timer);
79
+}
29
+}
80
+
30
+
81
float32 HELPER(rints_exact)(float32 x, void *fp_status)
31
static void cmsdk_apb_timer_init(Object *obj)
82
{
32
{
83
return float32_round_to_int(x, fp_status);
33
SysBusDevice *sbd = SYS_BUS_DEVICE(obj);
84
@@ -XXX,XX +XXX,XX @@ float64 HELPER(rintd_exact)(float64 x, void *fp_status)
34
@@ -XXX,XX +XXX,XX @@ static void cmsdk_apb_timer_init(Object *obj)
85
return float64_round_to_int(x, fp_status);
35
s, "cmsdk-apb-timer", 0x1000);
36
sysbus_init_mmio(sbd, &s->iomem);
37
sysbus_init_irq(sbd, &s->timerint);
38
- s->pclk = qdev_init_clock_in(DEVICE(s), "pclk", NULL, NULL);
39
+ s->pclk = qdev_init_clock_in(DEVICE(s), "pclk",
40
+ cmsdk_apb_timer_clk_update, s);
86
}
41
}
87
42
88
+dh_ctype_f16 HELPER(rinth)(dh_ctype_f16 x, void *fp_status)
43
static void cmsdk_apb_timer_realize(DeviceState *dev, Error **errp)
89
+{
90
+ int old_flags = get_float_exception_flags(fp_status), new_flags;
91
+ float16 ret;
92
+
93
+ ret = float16_round_to_int(x, fp_status);
94
+
95
+ /* Suppress any inexact exceptions the conversion produced */
96
+ if (!(old_flags & float_flag_inexact)) {
97
+ new_flags = get_float_exception_flags(fp_status);
98
+ set_float_exception_flags(new_flags & ~float_flag_inexact, fp_status);
99
+ }
100
+
101
+ return ret;
102
+}
103
+
104
float32 HELPER(rints)(float32 x, void *fp_status)
105
{
44
{
106
int old_flags = get_float_exception_flags(fp_status), new_flags;
45
CMSDKAPBTimer *s = CMSDK_APB_TIMER(dev);
107
diff --git a/target/arm/translate-vfp.c.inc b/target/arm/translate-vfp.c.inc
46
108
index XXXXXXX..XXXXXXX 100644
47
- if (s->pclk_frq == 0) {
109
--- a/target/arm/translate-vfp.c.inc
48
- error_setg(errp, "CMSDK APB timer: pclk-frq property must be set");
110
+++ b/target/arm/translate-vfp.c.inc
49
+ if (!clock_has_source(s->pclk)) {
111
@@ -XXX,XX +XXX,XX @@ static const uint8_t fp_decode_rm[] = {
50
+ error_setg(errp, "CMSDK APB timer: pclk clock must be connected");
112
static bool trans_VRINT(DisasContext *s, arg_VRINT *a)
51
return;
113
{
114
uint32_t rd, rm;
115
- bool dp = a->dp;
116
+ int sz = a->sz;
117
TCGv_ptr fpst;
118
TCGv_i32 tcg_rmode;
119
int rounding = fp_decode_rm[a->rm];
120
@@ -XXX,XX +XXX,XX @@ static bool trans_VRINT(DisasContext *s, arg_VRINT *a)
121
return false;
122
}
52
}
123
53
124
- if (dp && !dc_isar_feature(aa32_fpdp_v2, s)) {
54
@@ -XXX,XX +XXX,XX @@ static void cmsdk_apb_timer_realize(DeviceState *dev, Error **errp)
125
+ if (sz == 3 && !dc_isar_feature(aa32_fpdp_v2, s)) {
55
PTIMER_POLICY_NO_COUNTER_ROUND_DOWN);
126
+ return false;
56
127
+ }
57
ptimer_transaction_begin(s->timer);
128
+
58
- ptimer_set_freq(s->timer, s->pclk_frq);
129
+ if (sz == 1 && !dc_isar_feature(aa32_fp16_arith, s)) {
59
+ ptimer_set_period_from_clock(s->timer, s->pclk, 1);
130
return false;
60
ptimer_transaction_commit(s->timer);
131
}
132
133
/* UNDEF accesses to D16-D31 if they don't exist */
134
- if (dp && !dc_isar_feature(aa32_simd_r32, s) &&
135
+ if (sz == 3 && !dc_isar_feature(aa32_simd_r32, s) &&
136
((a->vm | a->vd) & 0x10)) {
137
return false;
138
}
139
@@ -XXX,XX +XXX,XX @@ static bool trans_VRINT(DisasContext *s, arg_VRINT *a)
140
return true;
141
}
142
143
- fpst = fpstatus_ptr(FPST_FPCR);
144
+ if (sz == 1) {
145
+ fpst = fpstatus_ptr(FPST_FPCR_F16);
146
+ } else {
147
+ fpst = fpstatus_ptr(FPST_FPCR);
148
+ }
149
150
tcg_rmode = tcg_const_i32(arm_rmode_to_sf(rounding));
151
gen_helper_set_rmode(tcg_rmode, tcg_rmode, fpst);
152
153
- if (dp) {
154
+ if (sz == 3) {
155
TCGv_i64 tcg_op;
156
TCGv_i64 tcg_res;
157
tcg_op = tcg_temp_new_i64();
158
@@ -XXX,XX +XXX,XX @@ static bool trans_VRINT(DisasContext *s, arg_VRINT *a)
159
tcg_op = tcg_temp_new_i32();
160
tcg_res = tcg_temp_new_i32();
161
neon_load_reg32(tcg_op, rm);
162
- gen_helper_rints(tcg_res, tcg_op, fpst);
163
+ if (sz == 1) {
164
+ gen_helper_rinth(tcg_res, tcg_op, fpst);
165
+ } else {
166
+ gen_helper_rints(tcg_res, tcg_op, fpst);
167
+ }
168
neon_store_reg32(tcg_res, rd);
169
tcg_temp_free_i32(tcg_op);
170
tcg_temp_free_i32(tcg_res);
171
@@ -XXX,XX +XXX,XX @@ static bool trans_VCVT_f16_f64(DisasContext *s, arg_VCVT_f16_f64 *a)
172
return true;
173
}
61
}
174
62
175
+static bool trans_VRINTR_hp(DisasContext *s, arg_VRINTR_sp *a)
176
+{
177
+ TCGv_ptr fpst;
178
+ TCGv_i32 tmp;
179
+
180
+ if (!dc_isar_feature(aa32_fp16_arith, s)) {
181
+ return false;
182
+ }
183
+
184
+ if (!vfp_access_check(s)) {
185
+ return true;
186
+ }
187
+
188
+ tmp = tcg_temp_new_i32();
189
+ neon_load_reg32(tmp, a->vm);
190
+ fpst = fpstatus_ptr(FPST_FPCR_F16);
191
+ gen_helper_rinth(tmp, tmp, fpst);
192
+ neon_store_reg32(tmp, a->vd);
193
+ tcg_temp_free_ptr(fpst);
194
+ tcg_temp_free_i32(tmp);
195
+ return true;
196
+}
197
+
198
static bool trans_VRINTR_sp(DisasContext *s, arg_VRINTR_sp *a)
199
{
200
TCGv_ptr fpst;
201
@@ -XXX,XX +XXX,XX @@ static bool trans_VRINTR_dp(DisasContext *s, arg_VRINTR_dp *a)
202
return true;
203
}
204
205
+static bool trans_VRINTZ_hp(DisasContext *s, arg_VRINTZ_sp *a)
206
+{
207
+ TCGv_ptr fpst;
208
+ TCGv_i32 tmp;
209
+ TCGv_i32 tcg_rmode;
210
+
211
+ if (!dc_isar_feature(aa32_fp16_arith, s)) {
212
+ return false;
213
+ }
214
+
215
+ if (!vfp_access_check(s)) {
216
+ return true;
217
+ }
218
+
219
+ tmp = tcg_temp_new_i32();
220
+ neon_load_reg32(tmp, a->vm);
221
+ fpst = fpstatus_ptr(FPST_FPCR_F16);
222
+ tcg_rmode = tcg_const_i32(float_round_to_zero);
223
+ gen_helper_set_rmode(tcg_rmode, tcg_rmode, fpst);
224
+ gen_helper_rinth(tmp, tmp, fpst);
225
+ gen_helper_set_rmode(tcg_rmode, tcg_rmode, fpst);
226
+ neon_store_reg32(tmp, a->vd);
227
+ tcg_temp_free_ptr(fpst);
228
+ tcg_temp_free_i32(tcg_rmode);
229
+ tcg_temp_free_i32(tmp);
230
+ return true;
231
+}
232
+
233
static bool trans_VRINTZ_sp(DisasContext *s, arg_VRINTZ_sp *a)
234
{
235
TCGv_ptr fpst;
236
@@ -XXX,XX +XXX,XX @@ static bool trans_VRINTZ_dp(DisasContext *s, arg_VRINTZ_dp *a)
237
return true;
238
}
239
240
+static bool trans_VRINTX_hp(DisasContext *s, arg_VRINTX_sp *a)
241
+{
242
+ TCGv_ptr fpst;
243
+ TCGv_i32 tmp;
244
+
245
+ if (!dc_isar_feature(aa32_fp16_arith, s)) {
246
+ return false;
247
+ }
248
+
249
+ if (!vfp_access_check(s)) {
250
+ return true;
251
+ }
252
+
253
+ tmp = tcg_temp_new_i32();
254
+ neon_load_reg32(tmp, a->vm);
255
+ fpst = fpstatus_ptr(FPST_FPCR_F16);
256
+ gen_helper_rinth_exact(tmp, tmp, fpst);
257
+ neon_store_reg32(tmp, a->vd);
258
+ tcg_temp_free_ptr(fpst);
259
+ tcg_temp_free_i32(tmp);
260
+ return true;
261
+}
262
+
263
static bool trans_VRINTX_sp(DisasContext *s, arg_VRINTX_sp *a)
264
{
265
TCGv_ptr fpst;
266
--
63
--
267
2.20.1
64
2.20.1
268
65
269
66
diff view generated by jsdifflib
1
Implement VFP fp16 for VABS, VNEG and VSQRT. This is all
1
Switch the CMSDK APB dualtimer device over to using its Clock input;
2
the fp16 insns that use the DO_VFP_2OP macro, because there
2
the pclk-frq property is now ignored.
3
is no fp16 version of VMOV_reg.
4
5
Notes:
6
* the gen_helper_vfp_negh already exists as we needed to create
7
it for the fp16 multiply-add insns
8
* as usual we need to use the f16 version of the fp_status;
9
this is only relevant for VSQRT
10
3
11
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
4
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
12
Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
5
Reviewed-by: Luc Michel <luc@lmichel.fr>
13
Message-id: 20200828183354.27913-9-peter.maydell@linaro.org
6
Tested-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
7
Reviewed-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
8
Message-id: 20210128114145.20536-20-peter.maydell@linaro.org
9
Message-id: 20210121190622.22000-20-peter.maydell@linaro.org
10
Reviewed-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
14
---
11
---
15
target/arm/helper.h | 2 ++
12
hw/timer/cmsdk-apb-dualtimer.c | 42 ++++++++++++++++++++++++++++++----
16
target/arm/vfp.decode | 3 +++
13
1 file changed, 37 insertions(+), 5 deletions(-)
17
target/arm/vfp_helper.c | 10 +++++++++
18
target/arm/translate-vfp.c.inc | 40 ++++++++++++++++++++++++++++++++++
19
4 files changed, 55 insertions(+)
20
14
21
diff --git a/target/arm/helper.h b/target/arm/helper.h
15
diff --git a/hw/timer/cmsdk-apb-dualtimer.c b/hw/timer/cmsdk-apb-dualtimer.c
22
index XXXXXXX..XXXXXXX 100644
16
index XXXXXXX..XXXXXXX 100644
23
--- a/target/arm/helper.h
17
--- a/hw/timer/cmsdk-apb-dualtimer.c
24
+++ b/target/arm/helper.h
18
+++ b/hw/timer/cmsdk-apb-dualtimer.c
25
@@ -XXX,XX +XXX,XX @@ DEF_HELPER_3(vfp_minnumd, f64, f64, f64, ptr)
19
@@ -XXX,XX +XXX,XX @@ static void cmsdk_apb_dualtimer_update(CMSDKAPBDualTimer *s)
26
DEF_HELPER_1(vfp_negh, f16, f16)
20
qemu_set_irq(s->timerintc, timintc);
27
DEF_HELPER_1(vfp_negs, f32, f32)
28
DEF_HELPER_1(vfp_negd, f64, f64)
29
+DEF_HELPER_1(vfp_absh, f16, f16)
30
DEF_HELPER_1(vfp_abss, f32, f32)
31
DEF_HELPER_1(vfp_absd, f64, f64)
32
+DEF_HELPER_2(vfp_sqrth, f16, f16, env)
33
DEF_HELPER_2(vfp_sqrts, f32, f32, env)
34
DEF_HELPER_2(vfp_sqrtd, f64, f64, env)
35
DEF_HELPER_3(vfp_cmps, void, f32, f32, env)
36
diff --git a/target/arm/vfp.decode b/target/arm/vfp.decode
37
index XXXXXXX..XXXXXXX 100644
38
--- a/target/arm/vfp.decode
39
+++ b/target/arm/vfp.decode
40
@@ -XXX,XX +XXX,XX @@ VMOV_imm_dp ---- 1110 1.11 .... .... 1011 0000 .... \
41
VMOV_reg_sp ---- 1110 1.11 0000 .... 1010 01.0 .... @vfp_dm_ss
42
VMOV_reg_dp ---- 1110 1.11 0000 .... 1011 01.0 .... @vfp_dm_dd
43
44
+VABS_hp ---- 1110 1.11 0000 .... 1001 11.0 .... @vfp_dm_ss
45
VABS_sp ---- 1110 1.11 0000 .... 1010 11.0 .... @vfp_dm_ss
46
VABS_dp ---- 1110 1.11 0000 .... 1011 11.0 .... @vfp_dm_dd
47
48
+VNEG_hp ---- 1110 1.11 0001 .... 1001 01.0 .... @vfp_dm_ss
49
VNEG_sp ---- 1110 1.11 0001 .... 1010 01.0 .... @vfp_dm_ss
50
VNEG_dp ---- 1110 1.11 0001 .... 1011 01.0 .... @vfp_dm_dd
51
52
+VSQRT_hp ---- 1110 1.11 0001 .... 1001 11.0 .... @vfp_dm_ss
53
VSQRT_sp ---- 1110 1.11 0001 .... 1010 11.0 .... @vfp_dm_ss
54
VSQRT_dp ---- 1110 1.11 0001 .... 1011 11.0 .... @vfp_dm_dd
55
56
diff --git a/target/arm/vfp_helper.c b/target/arm/vfp_helper.c
57
index XXXXXXX..XXXXXXX 100644
58
--- a/target/arm/vfp_helper.c
59
+++ b/target/arm/vfp_helper.c
60
@@ -XXX,XX +XXX,XX @@ float64 VFP_HELPER(neg, d)(float64 a)
61
return float64_chs(a);
62
}
21
}
63
22
64
+dh_ctype_f16 VFP_HELPER(abs, h)(dh_ctype_f16 a)
23
+static int cmsdk_dualtimermod_divisor(CMSDKAPBDualTimerModule *m)
65
+{
24
+{
66
+ return float16_abs(a);
25
+ /* Return the divisor set by the current CONTROL.PRESCALE value */
26
+ switch (FIELD_EX32(m->control, CONTROL, PRESCALE)) {
27
+ case 0:
28
+ return 1;
29
+ case 1:
30
+ return 16;
31
+ case 2:
32
+ case 3: /* UNDEFINED, we treat like 2 (and complained when it was set) */
33
+ return 256;
34
+ default:
35
+ g_assert_not_reached();
36
+ }
67
+}
37
+}
68
+
38
+
69
float32 VFP_HELPER(abs, s)(float32 a)
39
static void cmsdk_dualtimermod_write_control(CMSDKAPBDualTimerModule *m,
40
uint32_t newctrl)
70
{
41
{
71
return float32_abs(a);
42
@@ -XXX,XX +XXX,XX @@ static void cmsdk_dualtimermod_write_control(CMSDKAPBDualTimerModule *m,
72
@@ -XXX,XX +XXX,XX @@ float64 VFP_HELPER(abs, d)(float64 a)
43
default:
73
return float64_abs(a);
44
g_assert_not_reached();
45
}
46
- ptimer_set_freq(m->timer, m->parent->pclk_frq / divisor);
47
+ ptimer_set_period_from_clock(m->timer, m->parent->timclk, divisor);
48
}
49
50
if (changed & R_CONTROL_MODE_MASK) {
51
@@ -XXX,XX +XXX,XX @@ static void cmsdk_dualtimermod_reset(CMSDKAPBDualTimerModule *m)
52
* limit must both be set to 0xffff, so we wrap at 16 bits.
53
*/
54
ptimer_set_limit(m->timer, 0xffff, 1);
55
- ptimer_set_freq(m->timer, m->parent->pclk_frq);
56
+ ptimer_set_period_from_clock(m->timer, m->parent->timclk,
57
+ cmsdk_dualtimermod_divisor(m));
58
ptimer_transaction_commit(m->timer);
74
}
59
}
75
60
76
+dh_ctype_f16 VFP_HELPER(sqrt, h)(dh_ctype_f16 a, CPUARMState *env)
61
@@ -XXX,XX +XXX,XX @@ static void cmsdk_apb_dualtimer_reset(DeviceState *dev)
62
s->timeritop = 0;
63
}
64
65
+static void cmsdk_apb_dualtimer_clk_update(void *opaque)
77
+{
66
+{
78
+ return float16_sqrt(a, &env->vfp.fp_status_f16);
67
+ CMSDKAPBDualTimer *s = CMSDK_APB_DUALTIMER(opaque);
68
+ int i;
69
+
70
+ for (i = 0; i < ARRAY_SIZE(s->timermod); i++) {
71
+ CMSDKAPBDualTimerModule *m = &s->timermod[i];
72
+ ptimer_transaction_begin(m->timer);
73
+ ptimer_set_period_from_clock(m->timer, m->parent->timclk,
74
+ cmsdk_dualtimermod_divisor(m));
75
+ ptimer_transaction_commit(m->timer);
76
+ }
79
+}
77
+}
80
+
78
+
81
float32 VFP_HELPER(sqrt, s)(float32 a, CPUARMState *env)
79
static void cmsdk_apb_dualtimer_init(Object *obj)
82
{
80
{
83
return float32_sqrt(a, &env->vfp.fp_status);
81
SysBusDevice *sbd = SYS_BUS_DEVICE(obj);
84
diff --git a/target/arm/translate-vfp.c.inc b/target/arm/translate-vfp.c.inc
82
@@ -XXX,XX +XXX,XX @@ static void cmsdk_apb_dualtimer_init(Object *obj)
85
index XXXXXXX..XXXXXXX 100644
83
for (i = 0; i < ARRAY_SIZE(s->timermod); i++) {
86
--- a/target/arm/translate-vfp.c.inc
84
sysbus_init_irq(sbd, &s->timermod[i].timerint);
87
+++ b/target/arm/translate-vfp.c.inc
85
}
88
@@ -XXX,XX +XXX,XX @@ static bool do_vfp_2op_sp(DisasContext *s, VFPGen2OpSPFn *fn, int vd, int vm)
86
- s->timclk = qdev_init_clock_in(DEVICE(s), "TIMCLK", NULL, NULL);
89
return true;
87
+ s->timclk = qdev_init_clock_in(DEVICE(s), "TIMCLK",
88
+ cmsdk_apb_dualtimer_clk_update, s);
90
}
89
}
91
90
92
+static bool do_vfp_2op_hp(DisasContext *s, VFPGen2OpSPFn *fn, int vd, int vm)
91
static void cmsdk_apb_dualtimer_realize(DeviceState *dev, Error **errp)
93
+{
92
@@ -XXX,XX +XXX,XX @@ static void cmsdk_apb_dualtimer_realize(DeviceState *dev, Error **errp)
94
+ /*
93
CMSDKAPBDualTimer *s = CMSDK_APB_DUALTIMER(dev);
95
+ * Do a half-precision operation. Functionally this is
94
int i;
96
+ * the same as do_vfp_2op_sp(), except:
95
97
+ * - it doesn't need the VFP vector handling (fp16 is a
96
- if (s->pclk_frq == 0) {
98
+ * v8 feature, and in v8 VFP vectors don't exist)
97
- error_setg(errp, "CMSDK APB timer: pclk-frq property must be set");
99
+ * - it does the aa32_fp16_arith feature test
98
+ if (!clock_has_source(s->timclk)) {
100
+ */
99
+ error_setg(errp, "CMSDK APB dualtimer: TIMCLK clock must be connected");
101
+ TCGv_i32 f0;
100
return;
102
+
101
}
103
+ if (!dc_isar_feature(aa32_fp16_arith, s)) {
104
+ return false;
105
+ }
106
+
107
+ if (s->vec_len != 0 || s->vec_stride != 0) {
108
+ return false;
109
+ }
110
+
111
+ if (!vfp_access_check(s)) {
112
+ return true;
113
+ }
114
+
115
+ f0 = tcg_temp_new_i32();
116
+ neon_load_reg32(f0, vm);
117
+ fn(f0, f0);
118
+ neon_store_reg32(f0, vd);
119
+ tcg_temp_free_i32(f0);
120
+
121
+ return true;
122
+}
123
+
124
static bool do_vfp_2op_dp(DisasContext *s, VFPGen2OpDPFn *fn, int vd, int vm)
125
{
126
uint32_t delta_m = 0;
127
@@ -XXX,XX +XXX,XX @@ static bool trans_VMOV_imm_dp(DisasContext *s, arg_VMOV_imm_dp *a)
128
DO_VFP_2OP(VMOV_reg, sp, tcg_gen_mov_i32)
129
DO_VFP_2OP(VMOV_reg, dp, tcg_gen_mov_i64)
130
131
+DO_VFP_2OP(VABS, hp, gen_helper_vfp_absh)
132
DO_VFP_2OP(VABS, sp, gen_helper_vfp_abss)
133
DO_VFP_2OP(VABS, dp, gen_helper_vfp_absd)
134
135
+DO_VFP_2OP(VNEG, hp, gen_helper_vfp_negh)
136
DO_VFP_2OP(VNEG, sp, gen_helper_vfp_negs)
137
DO_VFP_2OP(VNEG, dp, gen_helper_vfp_negd)
138
139
+static void gen_VSQRT_hp(TCGv_i32 vd, TCGv_i32 vm)
140
+{
141
+ gen_helper_vfp_sqrth(vd, vm, cpu_env);
142
+}
143
+
144
static void gen_VSQRT_sp(TCGv_i32 vd, TCGv_i32 vm)
145
{
146
gen_helper_vfp_sqrts(vd, vm, cpu_env);
147
@@ -XXX,XX +XXX,XX @@ static void gen_VSQRT_dp(TCGv_i64 vd, TCGv_i64 vm)
148
gen_helper_vfp_sqrtd(vd, vm, cpu_env);
149
}
150
151
+DO_VFP_2OP(VSQRT, hp, gen_VSQRT_hp)
152
DO_VFP_2OP(VSQRT, sp, gen_VSQRT_sp)
153
DO_VFP_2OP(VSQRT, dp, gen_VSQRT_dp)
154
102
155
--
103
--
156
2.20.1
104
2.20.1
157
105
158
106
diff view generated by jsdifflib
1
Implement the fp16 versions of the VFP VCVT instruction forms which
1
Switch the CMSDK APB watchdog device over to using its Clock input;
2
convert between floating point and integer.
2
the wdogclk_frq property is now ignored.
3
3
4
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
4
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
5
Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
5
Reviewed-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
6
Message-id: 20200828183354.27913-13-peter.maydell@linaro.org
6
Reviewed-by: Luc Michel <luc@lmichel.fr>
7
Tested-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
8
Message-id: 20210128114145.20536-21-peter.maydell@linaro.org
9
Message-id: 20210121190622.22000-21-peter.maydell@linaro.org
7
---
10
---
8
target/arm/vfp.decode | 4 +++
11
hw/watchdog/cmsdk-apb-watchdog.c | 18 ++++++++++++++----
9
target/arm/translate-vfp.c.inc | 65 ++++++++++++++++++++++++++++++++++
12
1 file changed, 14 insertions(+), 4 deletions(-)
10
2 files changed, 69 insertions(+)
11
13
12
diff --git a/target/arm/vfp.decode b/target/arm/vfp.decode
14
diff --git a/hw/watchdog/cmsdk-apb-watchdog.c b/hw/watchdog/cmsdk-apb-watchdog.c
13
index XXXXXXX..XXXXXXX 100644
15
index XXXXXXX..XXXXXXX 100644
14
--- a/target/arm/vfp.decode
16
--- a/hw/watchdog/cmsdk-apb-watchdog.c
15
+++ b/target/arm/vfp.decode
17
+++ b/hw/watchdog/cmsdk-apb-watchdog.c
16
@@ -XXX,XX +XXX,XX @@ VCVT_sp ---- 1110 1.11 0111 .... 1010 11.0 .... @vfp_dm_ds
18
@@ -XXX,XX +XXX,XX @@ static void cmsdk_apb_watchdog_reset(DeviceState *dev)
17
VCVT_dp ---- 1110 1.11 0111 .... 1011 11.0 .... @vfp_dm_sd
19
ptimer_transaction_commit(s->timer);
18
19
# VCVT from integer to floating point: Vm always single; Vd depends on size
20
+VCVT_int_hp ---- 1110 1.11 1000 .... 1001 s:1 1.0 .... \
21
+ vd=%vd_sp vm=%vm_sp
22
VCVT_int_sp ---- 1110 1.11 1000 .... 1010 s:1 1.0 .... \
23
vd=%vd_sp vm=%vm_sp
24
VCVT_int_dp ---- 1110 1.11 1000 .... 1011 s:1 1.0 .... \
25
@@ -XXX,XX +XXX,XX @@ VCVT_fix_dp ---- 1110 1.11 1.1. .... 1011 .1.0 .... \
26
vd=%vd_dp imm=%vm_sp opc=%vcvt_fix_op
27
28
# VCVT float to integer (VCVT and VCVTR): Vd always single; Vd depends on size
29
+VCVT_hp_int ---- 1110 1.11 110 s:1 .... 1001 rz:1 1.0 .... \
30
+ vd=%vd_sp vm=%vm_sp
31
VCVT_sp_int ---- 1110 1.11 110 s:1 .... 1010 rz:1 1.0 .... \
32
vd=%vd_sp vm=%vm_sp
33
VCVT_dp_int ---- 1110 1.11 110 s:1 .... 1011 rz:1 1.0 .... \
34
diff --git a/target/arm/translate-vfp.c.inc b/target/arm/translate-vfp.c.inc
35
index XXXXXXX..XXXXXXX 100644
36
--- a/target/arm/translate-vfp.c.inc
37
+++ b/target/arm/translate-vfp.c.inc
38
@@ -XXX,XX +XXX,XX @@ static bool trans_VCVT_dp(DisasContext *s, arg_VCVT_dp *a)
39
return true;
40
}
20
}
41
21
42
+static bool trans_VCVT_int_hp(DisasContext *s, arg_VCVT_int_sp *a)
22
+static void cmsdk_apb_watchdog_clk_update(void *opaque)
43
+{
23
+{
44
+ TCGv_i32 vm;
24
+ CMSDKAPBWatchdog *s = CMSDK_APB_WATCHDOG(opaque);
45
+ TCGv_ptr fpst;
46
+
25
+
47
+ if (!dc_isar_feature(aa32_fp16_arith, s)) {
26
+ ptimer_transaction_begin(s->timer);
48
+ return false;
27
+ ptimer_set_period_from_clock(s->timer, s->wdogclk, 1);
49
+ }
28
+ ptimer_transaction_commit(s->timer);
50
+
51
+ if (!vfp_access_check(s)) {
52
+ return true;
53
+ }
54
+
55
+ vm = tcg_temp_new_i32();
56
+ neon_load_reg32(vm, a->vm);
57
+ fpst = fpstatus_ptr(FPST_FPCR_F16);
58
+ if (a->s) {
59
+ /* i32 -> f16 */
60
+ gen_helper_vfp_sitoh(vm, vm, fpst);
61
+ } else {
62
+ /* u32 -> f16 */
63
+ gen_helper_vfp_uitoh(vm, vm, fpst);
64
+ }
65
+ neon_store_reg32(vm, a->vd);
66
+ tcg_temp_free_i32(vm);
67
+ tcg_temp_free_ptr(fpst);
68
+ return true;
69
+}
29
+}
70
+
30
+
71
static bool trans_VCVT_int_sp(DisasContext *s, arg_VCVT_int_sp *a)
31
static void cmsdk_apb_watchdog_init(Object *obj)
72
{
32
{
73
TCGv_i32 vm;
33
SysBusDevice *sbd = SYS_BUS_DEVICE(obj);
74
@@ -XXX,XX +XXX,XX @@ static bool trans_VCVT_fix_dp(DisasContext *s, arg_VCVT_fix_dp *a)
34
@@ -XXX,XX +XXX,XX @@ static void cmsdk_apb_watchdog_init(Object *obj)
75
return true;
35
s, "cmsdk-apb-watchdog", 0x1000);
36
sysbus_init_mmio(sbd, &s->iomem);
37
sysbus_init_irq(sbd, &s->wdogint);
38
- s->wdogclk = qdev_init_clock_in(DEVICE(s), "WDOGCLK", NULL, NULL);
39
+ s->wdogclk = qdev_init_clock_in(DEVICE(s), "WDOGCLK",
40
+ cmsdk_apb_watchdog_clk_update, s);
41
42
s->is_luminary = false;
43
s->id = cmsdk_apb_watchdog_id;
44
@@ -XXX,XX +XXX,XX @@ static void cmsdk_apb_watchdog_realize(DeviceState *dev, Error **errp)
45
{
46
CMSDKAPBWatchdog *s = CMSDK_APB_WATCHDOG(dev);
47
48
- if (s->wdogclk_frq == 0) {
49
+ if (!clock_has_source(s->wdogclk)) {
50
error_setg(errp,
51
- "CMSDK APB watchdog: wdogclk-frq property must be set");
52
+ "CMSDK APB watchdog: WDOGCLK clock must be connected");
53
return;
54
}
55
56
@@ -XXX,XX +XXX,XX @@ static void cmsdk_apb_watchdog_realize(DeviceState *dev, Error **errp)
57
PTIMER_POLICY_NO_COUNTER_ROUND_DOWN);
58
59
ptimer_transaction_begin(s->timer);
60
- ptimer_set_freq(s->timer, s->wdogclk_frq);
61
+ ptimer_set_period_from_clock(s->timer, s->wdogclk, 1);
62
ptimer_transaction_commit(s->timer);
76
}
63
}
77
64
78
+static bool trans_VCVT_hp_int(DisasContext *s, arg_VCVT_sp_int *a)
79
+{
80
+ TCGv_i32 vm;
81
+ TCGv_ptr fpst;
82
+
83
+ if (!dc_isar_feature(aa32_fp16_arith, s)) {
84
+ return false;
85
+ }
86
+
87
+ if (!vfp_access_check(s)) {
88
+ return true;
89
+ }
90
+
91
+ fpst = fpstatus_ptr(FPST_FPCR_F16);
92
+ vm = tcg_temp_new_i32();
93
+ neon_load_reg32(vm, a->vm);
94
+
95
+ if (a->s) {
96
+ if (a->rz) {
97
+ gen_helper_vfp_tosizh(vm, vm, fpst);
98
+ } else {
99
+ gen_helper_vfp_tosih(vm, vm, fpst);
100
+ }
101
+ } else {
102
+ if (a->rz) {
103
+ gen_helper_vfp_touizh(vm, vm, fpst);
104
+ } else {
105
+ gen_helper_vfp_touih(vm, vm, fpst);
106
+ }
107
+ }
108
+ neon_store_reg32(vm, a->vd);
109
+ tcg_temp_free_i32(vm);
110
+ tcg_temp_free_ptr(fpst);
111
+ return true;
112
+}
113
+
114
static bool trans_VCVT_sp_int(DisasContext *s, arg_VCVT_sp_int *a)
115
{
116
TCGv_i32 vm;
117
--
65
--
118
2.20.1
66
2.20.1
119
67
120
68
diff view generated by jsdifflib
1
Implement VFP fp16 support for fused multiply-add insns
1
Now that the CMSDK APB watchdog uses its Clock input, it will
2
VFNMA, VFNMS, VFMA, VFMS.
2
correctly respond when the system clock frequency is changed using
3
the RCC register on in the Stellaris board system registers. Test
4
that when the RCC register is written it causes the watchdog timer to
5
change speed.
3
6
4
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
7
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
5
Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
8
Reviewed-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
6
Message-id: 20200828183354.27913-7-peter.maydell@linaro.org
9
Reviewed-by: Luc Michel <luc@lmichel.fr>
10
Tested-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
11
Message-id: 20210128114145.20536-22-peter.maydell@linaro.org
12
Message-id: 20210121190622.22000-22-peter.maydell@linaro.org
7
---
13
---
8
target/arm/helper.h | 1 +
14
tests/qtest/cmsdk-apb-watchdog-test.c | 52 +++++++++++++++++++++++++++
9
target/arm/vfp.decode | 5 +++
15
1 file changed, 52 insertions(+)
10
target/arm/vfp_helper.c | 7 ++++
11
target/arm/translate-vfp.c.inc | 64 ++++++++++++++++++++++++++++++++++
12
4 files changed, 77 insertions(+)
13
16
14
diff --git a/target/arm/helper.h b/target/arm/helper.h
17
diff --git a/tests/qtest/cmsdk-apb-watchdog-test.c b/tests/qtest/cmsdk-apb-watchdog-test.c
15
index XXXXXXX..XXXXXXX 100644
18
index XXXXXXX..XXXXXXX 100644
16
--- a/target/arm/helper.h
19
--- a/tests/qtest/cmsdk-apb-watchdog-test.c
17
+++ b/target/arm/helper.h
20
+++ b/tests/qtest/cmsdk-apb-watchdog-test.c
18
@@ -XXX,XX +XXX,XX @@ DEF_HELPER_FLAGS_3(vfp_fcvt_f64_to_f16, TCG_CALL_NO_RWG, f16, f64, ptr, i32)
21
@@ -XXX,XX +XXX,XX @@
19
22
*/
20
DEF_HELPER_4(vfp_muladdd, f64, f64, f64, f64, ptr)
23
21
DEF_HELPER_4(vfp_muladds, f32, f32, f32, f32, ptr)
24
#include "qemu/osdep.h"
22
+DEF_HELPER_4(vfp_muladdh, f16, f16, f16, f16, ptr)
25
+#include "qemu/bitops.h"
23
26
#include "libqtest-single.h"
24
DEF_HELPER_3(recps_f32, f32, env, f32, f32)
27
25
DEF_HELPER_3(rsqrts_f32, f32, env, f32, f32)
28
/*
26
diff --git a/target/arm/vfp.decode b/target/arm/vfp.decode
29
@@ -XXX,XX +XXX,XX @@
27
index XXXXXXX..XXXXXXX 100644
30
#define WDOGMIS 0x14
28
--- a/target/arm/vfp.decode
31
#define WDOGLOCK 0xc00
29
+++ b/target/arm/vfp.decode
32
30
@@ -XXX,XX +XXX,XX @@ VDIV_hp ---- 1110 1.00 .... .... 1001 .0.0 .... @vfp_dnm_s
33
+#define SSYS_BASE 0x400fe000
31
VDIV_sp ---- 1110 1.00 .... .... 1010 .0.0 .... @vfp_dnm_s
34
+#define RCC 0x60
32
VDIV_dp ---- 1110 1.00 .... .... 1011 .0.0 .... @vfp_dnm_d
35
+#define SYSDIV_SHIFT 23
33
36
+#define SYSDIV_LENGTH 4
34
+VFMA_hp ---- 1110 1.10 .... .... 1001 .0. 0 .... @vfp_dnm_s
35
+VFMS_hp ---- 1110 1.10 .... .... 1001 .1. 0 .... @vfp_dnm_s
36
+VFNMA_hp ---- 1110 1.01 .... .... 1001 .0. 0 .... @vfp_dnm_s
37
+VFNMS_hp ---- 1110 1.01 .... .... 1001 .1. 0 .... @vfp_dnm_s
38
+
37
+
39
VFMA_sp ---- 1110 1.10 .... .... 1010 .0. 0 .... @vfp_dnm_s
38
static void test_watchdog(void)
40
VFMS_sp ---- 1110 1.10 .... .... 1010 .1. 0 .... @vfp_dnm_s
39
{
41
VFNMA_sp ---- 1110 1.01 .... .... 1010 .0. 0 .... @vfp_dnm_s
40
g_assert_cmpuint(readl(WDOG_BASE + WDOGRIS), ==, 0);
42
diff --git a/target/arm/vfp_helper.c b/target/arm/vfp_helper.c
41
@@ -XXX,XX +XXX,XX @@ static void test_watchdog(void)
43
index XXXXXXX..XXXXXXX 100644
42
g_assert_cmpuint(readl(WDOG_BASE + WDOGRIS), ==, 0);
44
--- a/target/arm/vfp_helper.c
45
+++ b/target/arm/vfp_helper.c
46
@@ -XXX,XX +XXX,XX @@ uint32_t HELPER(rsqrte_u32)(uint32_t a)
47
}
43
}
48
44
49
/* VFPv4 fused multiply-accumulate */
45
+static void test_clock_change(void)
50
+dh_ctype_f16 VFP_HELPER(muladd, h)(dh_ctype_f16 a, dh_ctype_f16 b,
51
+ dh_ctype_f16 c, void *fpstp)
52
+{
46
+{
53
+ float_status *fpst = fpstp;
47
+ uint32_t rcc;
54
+ return float16_muladd(a, b, c, 0, fpst);
48
+
49
+ /*
50
+ * Test that writing to the stellaris board's RCC register to
51
+ * change the system clock frequency causes the watchdog
52
+ * to change the speed it counts at.
53
+ */
54
+ g_assert_cmpuint(readl(WDOG_BASE + WDOGRIS), ==, 0);
55
+
56
+ writel(WDOG_BASE + WDOGCONTROL, 1);
57
+ writel(WDOG_BASE + WDOGLOAD, 1000);
58
+
59
+ /* Step to just past the 500th tick */
60
+ clock_step(80 * 500 + 1);
61
+ g_assert_cmpuint(readl(WDOG_BASE + WDOGRIS), ==, 0);
62
+ g_assert_cmpuint(readl(WDOG_BASE + WDOGVALUE), ==, 500);
63
+
64
+ /* Rewrite RCC.SYSDIV from 16 to 8, so the clock is now 40ns per tick */
65
+ rcc = readl(SSYS_BASE + RCC);
66
+ g_assert_cmpuint(extract32(rcc, SYSDIV_SHIFT, SYSDIV_LENGTH), ==, 0xf);
67
+ rcc = deposit32(rcc, SYSDIV_SHIFT, SYSDIV_LENGTH, 7);
68
+ writel(SSYS_BASE + RCC, rcc);
69
+
70
+ /* Just past the 1000th tick: timer should have fired */
71
+ clock_step(40 * 500);
72
+ g_assert_cmpuint(readl(WDOG_BASE + WDOGRIS), ==, 1);
73
+
74
+ g_assert_cmpuint(readl(WDOG_BASE + WDOGVALUE), ==, 0);
75
+
76
+ /* VALUE reloads at following tick */
77
+ clock_step(41);
78
+ g_assert_cmpuint(readl(WDOG_BASE + WDOGVALUE), ==, 1000);
79
+
80
+ /* Writing any value to WDOGINTCLR clears the interrupt and reloads */
81
+ clock_step(40 * 500);
82
+ g_assert_cmpuint(readl(WDOG_BASE + WDOGVALUE), ==, 500);
83
+ g_assert_cmpuint(readl(WDOG_BASE + WDOGRIS), ==, 1);
84
+ writel(WDOG_BASE + WDOGINTCLR, 0);
85
+ g_assert_cmpuint(readl(WDOG_BASE + WDOGVALUE), ==, 1000);
86
+ g_assert_cmpuint(readl(WDOG_BASE + WDOGRIS), ==, 0);
55
+}
87
+}
56
+
88
+
57
float32 VFP_HELPER(muladd, s)(float32 a, float32 b, float32 c, void *fpstp)
89
int main(int argc, char **argv)
58
{
90
{
59
float_status *fpst = fpstp;
91
int r;
60
diff --git a/target/arm/translate-vfp.c.inc b/target/arm/translate-vfp.c.inc
92
@@ -XXX,XX +XXX,XX @@ int main(int argc, char **argv)
61
index XXXXXXX..XXXXXXX 100644
93
qtest_start("-machine lm3s811evb");
62
--- a/target/arm/translate-vfp.c.inc
94
63
+++ b/target/arm/translate-vfp.c.inc
95
qtest_add_func("/cmsdk-apb-watchdog/watchdog", test_watchdog);
64
@@ -XXX,XX +XXX,XX @@ static bool trans_VMAXNM_dp(DisasContext *s, arg_VMAXNM_dp *a)
96
+ qtest_add_func("/cmsdk-apb-watchdog/watchdog_clock_change",
65
a->vd, a->vn, a->vm, false);
97
+ test_clock_change);
66
}
98
67
99
r = g_test_run();
68
+static bool do_vfm_hp(DisasContext *s, arg_VFMA_sp *a, bool neg_n, bool neg_d)
69
+{
70
+ /*
71
+ * VFNMA : fd = muladd(-fd, fn, fm)
72
+ * VFNMS : fd = muladd(-fd, -fn, fm)
73
+ * VFMA : fd = muladd( fd, fn, fm)
74
+ * VFMS : fd = muladd( fd, -fn, fm)
75
+ *
76
+ * These are fused multiply-add, and must be done as one floating
77
+ * point operation with no rounding between the multiplication and
78
+ * addition steps. NB that doing the negations here as separate
79
+ * steps is correct : an input NaN should come out with its sign
80
+ * bit flipped if it is a negated-input.
81
+ */
82
+ TCGv_ptr fpst;
83
+ TCGv_i32 vn, vm, vd;
84
+
85
+ /*
86
+ * Present in VFPv4 only, and only with the FP16 extension.
87
+ * Note that we can't rely on the SIMDFMAC check alone, because
88
+ * in a Neon-no-VFP core that ID register field will be non-zero.
89
+ */
90
+ if (!dc_isar_feature(aa32_fp16_arith, s) ||
91
+ !dc_isar_feature(aa32_simdfmac, s) ||
92
+ !dc_isar_feature(aa32_fpsp_v2, s)) {
93
+ return false;
94
+ }
95
+
96
+ if (s->vec_len != 0 || s->vec_stride != 0) {
97
+ return false;
98
+ }
99
+
100
+ if (!vfp_access_check(s)) {
101
+ return true;
102
+ }
103
+
104
+ vn = tcg_temp_new_i32();
105
+ vm = tcg_temp_new_i32();
106
+ vd = tcg_temp_new_i32();
107
+
108
+ neon_load_reg32(vn, a->vn);
109
+ neon_load_reg32(vm, a->vm);
110
+ if (neg_n) {
111
+ /* VFNMS, VFMS */
112
+ gen_helper_vfp_negh(vn, vn);
113
+ }
114
+ neon_load_reg32(vd, a->vd);
115
+ if (neg_d) {
116
+ /* VFNMA, VFNMS */
117
+ gen_helper_vfp_negh(vd, vd);
118
+ }
119
+ fpst = fpstatus_ptr(FPST_FPCR_F16);
120
+ gen_helper_vfp_muladdh(vd, vn, vm, vd, fpst);
121
+ neon_store_reg32(vd, a->vd);
122
+
123
+ tcg_temp_free_ptr(fpst);
124
+ tcg_temp_free_i32(vn);
125
+ tcg_temp_free_i32(vm);
126
+ tcg_temp_free_i32(vd);
127
+
128
+ return true;
129
+}
130
+
131
static bool do_vfm_sp(DisasContext *s, arg_VFMA_sp *a, bool neg_n, bool neg_d)
132
{
133
/*
134
@@ -XXX,XX +XXX,XX @@ static bool do_vfm_dp(DisasContext *s, arg_VFMA_dp *a, bool neg_n, bool neg_d)
135
MAKE_ONE_VFM_TRANS_FN(VFNMA, PREC, false, true) \
136
MAKE_ONE_VFM_TRANS_FN(VFNMS, PREC, true, true)
137
138
+MAKE_VFM_TRANS_FNS(hp)
139
MAKE_VFM_TRANS_FNS(sp)
140
MAKE_VFM_TRANS_FNS(dp)
141
100
142
--
101
--
143
2.20.1
102
2.20.1
144
103
145
104
diff view generated by jsdifflib
1
Convert the Neon floating-point vector comparison ops VCEQ,
1
Use the MAINCLK Clock input to set the system_clock_scale variable
2
VCGE and VCGT over to using a gvec helper and use this to
2
rather than using the mainclk_frq property.
3
implement the fp16 case.
4
5
(We put the float16_ceq() etc functions above the DO_2OP()
6
macro definition because later when we convert the
7
compare-against-zero instructions we'll want their
8
definitions to be visible at that point in the source file.)
9
3
10
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
4
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
11
Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
5
Reviewed-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
12
Message-id: 20200828183354.27913-27-peter.maydell@linaro.org
6
Tested-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
7
Reviewed-by: Luc Michel <luc@lmichel.fr>
8
Message-id: 20210128114145.20536-23-peter.maydell@linaro.org
9
Message-id: 20210121190622.22000-23-peter.maydell@linaro.org
13
---
10
---
14
target/arm/helper.h | 9 +++++++
11
hw/arm/armsse.c | 24 +++++++++++++++++++-----
15
target/arm/vec_helper.c | 44 +++++++++++++++++++++++++++++++++
12
1 file changed, 19 insertions(+), 5 deletions(-)
16
target/arm/translate-neon.c.inc | 6 ++---
17
3 files changed, 56 insertions(+), 3 deletions(-)
18
13
19
diff --git a/target/arm/helper.h b/target/arm/helper.h
14
diff --git a/hw/arm/armsse.c b/hw/arm/armsse.c
20
index XXXXXXX..XXXXXXX 100644
15
index XXXXXXX..XXXXXXX 100644
21
--- a/target/arm/helper.h
16
--- a/hw/arm/armsse.c
22
+++ b/target/arm/helper.h
17
+++ b/hw/arm/armsse.c
23
@@ -XXX,XX +XXX,XX @@ DEF_HELPER_FLAGS_5(gvec_fmul_d, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, ptr, i32)
18
@@ -XXX,XX +XXX,XX @@ static void armsse_forward_sec_resp_cfg(ARMSSE *s)
24
DEF_HELPER_FLAGS_5(gvec_fabd_h, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, ptr, i32)
19
qdev_connect_gpio_out(dev_splitter, 2, s->sec_resp_cfg_in);
25
DEF_HELPER_FLAGS_5(gvec_fabd_s, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, ptr, i32)
26
27
+DEF_HELPER_FLAGS_5(gvec_fceq_h, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, ptr, i32)
28
+DEF_HELPER_FLAGS_5(gvec_fceq_s, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, ptr, i32)
29
+
30
+DEF_HELPER_FLAGS_5(gvec_fcge_h, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, ptr, i32)
31
+DEF_HELPER_FLAGS_5(gvec_fcge_s, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, ptr, i32)
32
+
33
+DEF_HELPER_FLAGS_5(gvec_fcgt_h, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, ptr, i32)
34
+DEF_HELPER_FLAGS_5(gvec_fcgt_s, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, ptr, i32)
35
+
36
DEF_HELPER_FLAGS_5(gvec_ftsmul_h, TCG_CALL_NO_RWG,
37
void, ptr, ptr, ptr, ptr, i32)
38
DEF_HELPER_FLAGS_5(gvec_ftsmul_s, TCG_CALL_NO_RWG,
39
diff --git a/target/arm/vec_helper.c b/target/arm/vec_helper.c
40
index XXXXXXX..XXXXXXX 100644
41
--- a/target/arm/vec_helper.c
42
+++ b/target/arm/vec_helper.c
43
@@ -XXX,XX +XXX,XX @@ void HELPER(gvec_fcmlad)(void *vd, void *vn, void *vm,
44
clear_tail(d, opr_sz, simd_maxsz(desc));
45
}
20
}
46
21
47
+/*
22
+static void armsse_mainclk_update(void *opaque)
48
+ * Floating point comparisons producing an integer result (all 1s or all 0s).
49
+ * Note that EQ doesn't signal InvalidOp for QNaNs but GE and GT do.
50
+ * Softfloat routines return 0/1, which we convert to the 0/-1 Neon requires.
51
+ */
52
+static uint16_t float16_ceq(float16 op1, float16 op2, float_status *stat)
53
+{
23
+{
54
+ return -float16_eq_quiet(op1, op2, stat);
24
+ ARMSSE *s = ARM_SSE(opaque);
25
+ /*
26
+ * Set system_clock_scale from our Clock input; this is what
27
+ * controls the tick rate of the CPU SysTick timer.
28
+ */
29
+ system_clock_scale = clock_ticks_to_ns(s->mainclk, 1);
55
+}
30
+}
56
+
31
+
57
+static uint32_t float32_ceq(float32 op1, float32 op2, float_status *stat)
32
static void armsse_init(Object *obj)
58
+{
33
{
59
+ return -float32_eq_quiet(op1, op2, stat);
34
ARMSSE *s = ARM_SSE(obj);
60
+}
35
@@ -XXX,XX +XXX,XX @@ static void armsse_init(Object *obj)
61
+
36
assert(info->sram_banks <= MAX_SRAM_BANKS);
62
+static uint16_t float16_cge(float16 op1, float16 op2, float_status *stat)
37
assert(info->num_cpus <= SSE_MAX_CPUS);
63
+{
38
64
+ return -float16_le(op2, op1, stat);
39
- s->mainclk = qdev_init_clock_in(DEVICE(s), "MAINCLK", NULL, NULL);
65
+}
40
+ s->mainclk = qdev_init_clock_in(DEVICE(s), "MAINCLK",
66
+
41
+ armsse_mainclk_update, s);
67
+static uint32_t float32_cge(float32 op1, float32 op2, float_status *stat)
42
s->s32kclk = qdev_init_clock_in(DEVICE(s), "S32KCLK", NULL, NULL);
68
+{
43
69
+ return -float32_le(op2, op1, stat);
44
memory_region_init(&s->container, obj, "armsse-container", UINT64_MAX);
70
+}
45
@@ -XXX,XX +XXX,XX @@ static void armsse_realize(DeviceState *dev, Error **errp)
71
+
46
return;
72
+static uint16_t float16_cgt(float16 op1, float16 op2, float_status *stat)
73
+{
74
+ return -float16_lt(op2, op1, stat);
75
+}
76
+
77
+static uint32_t float32_cgt(float32 op1, float32 op2, float_status *stat)
78
+{
79
+ return -float32_lt(op2, op1, stat);
80
+}
81
+
82
#define DO_2OP(NAME, FUNC, TYPE) \
83
void HELPER(NAME)(void *vd, void *vn, void *stat, uint32_t desc) \
84
{ \
85
@@ -XXX,XX +XXX,XX @@ DO_3OP(gvec_ftsmul_d, float64_ftsmul, float64)
86
DO_3OP(gvec_fabd_h, float16_abd, float16)
87
DO_3OP(gvec_fabd_s, float32_abd, float32)
88
89
+DO_3OP(gvec_fceq_h, float16_ceq, float16)
90
+DO_3OP(gvec_fceq_s, float32_ceq, float32)
91
+
92
+DO_3OP(gvec_fcge_h, float16_cge, float16)
93
+DO_3OP(gvec_fcge_s, float32_cge, float32)
94
+
95
+DO_3OP(gvec_fcgt_h, float16_cgt, float16)
96
+DO_3OP(gvec_fcgt_s, float32_cgt, float32)
97
+
98
#ifdef TARGET_AARCH64
99
100
DO_3OP(gvec_recps_h, helper_recpsf_f16, float16)
101
diff --git a/target/arm/translate-neon.c.inc b/target/arm/translate-neon.c.inc
102
index XXXXXXX..XXXXXXX 100644
103
--- a/target/arm/translate-neon.c.inc
104
+++ b/target/arm/translate-neon.c.inc
105
@@ -XXX,XX +XXX,XX @@ DO_3S_FP_GVEC(VADD, gen_helper_gvec_fadd_s, gen_helper_gvec_fadd_h)
106
DO_3S_FP_GVEC(VSUB, gen_helper_gvec_fsub_s, gen_helper_gvec_fsub_h)
107
DO_3S_FP_GVEC(VABD, gen_helper_gvec_fabd_s, gen_helper_gvec_fabd_h)
108
DO_3S_FP_GVEC(VMUL, gen_helper_gvec_fmul_s, gen_helper_gvec_fmul_h)
109
+DO_3S_FP_GVEC(VCEQ, gen_helper_gvec_fceq_s, gen_helper_gvec_fceq_h)
110
+DO_3S_FP_GVEC(VCGE, gen_helper_gvec_fcge_s, gen_helper_gvec_fcge_h)
111
+DO_3S_FP_GVEC(VCGT, gen_helper_gvec_fcgt_s, gen_helper_gvec_fcgt_h)
112
113
/*
114
* For all the functions using this macro, size == 1 means fp16,
115
@@ -XXX,XX +XXX,XX @@ DO_3S_FP_GVEC(VMUL, gen_helper_gvec_fmul_s, gen_helper_gvec_fmul_h)
116
return do_3same_fp(s, a, FUNC, READS_VD); \
117
}
47
}
118
48
119
-DO_3S_FP(VCEQ, gen_helper_neon_ceq_f32, false)
49
- if (!s->mainclk_frq) {
120
-DO_3S_FP(VCGE, gen_helper_neon_cge_f32, false)
50
- error_setg(errp, "MAINCLK_FRQ property was not set");
121
-DO_3S_FP(VCGT, gen_helper_neon_cgt_f32, false)
51
- return;
122
DO_3S_FP(VACGE, gen_helper_neon_acge_f32, false)
52
+ if (!clock_has_source(s->mainclk)) {
123
DO_3S_FP(VACGT, gen_helper_neon_acgt_f32, false)
53
+ error_setg(errp, "MAINCLK clock was not connected");
124
DO_3S_FP(VMAX, gen_helper_vfp_maxs, false)
54
+ }
55
+ if (!clock_has_source(s->s32kclk)) {
56
+ error_setg(errp, "S32KCLK clock was not connected");
57
}
58
59
assert(info->num_cpus <= SSE_MAX_CPUS);
60
@@ -XXX,XX +XXX,XX @@ static void armsse_realize(DeviceState *dev, Error **errp)
61
*/
62
sysbus_init_mmio(SYS_BUS_DEVICE(s), &s->container);
63
64
- system_clock_scale = NANOSECONDS_PER_SECOND / s->mainclk_frq;
65
+ /* Set initial system_clock_scale from MAINCLK */
66
+ armsse_mainclk_update(s);
67
}
68
69
static void armsse_idau_check(IDAUInterface *ii, uint32_t address,
125
--
70
--
126
2.20.1
71
2.20.1
127
72
128
73
diff view generated by jsdifflib
1
Convert the neon floating-point vector absolute comparison ops
1
Remove all the code that sets frequency properties on the CMSDK
2
VACGE and VACGT over to using a gvec hepler and use this to
2
timer, dualtimer and watchdog devices and on the ARMSSE SoC device:
3
implement the fp16 case.
3
these properties are unused now that the devices rely on their Clock
4
inputs instead.
4
5
5
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
6
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
6
Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
7
Reviewed-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
7
Message-id: 20200828183354.27913-28-peter.maydell@linaro.org
8
Reviewed-by: Luc Michel <luc@lmichel.fr>
9
Tested-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
10
Message-id: 20210128114145.20536-24-peter.maydell@linaro.org
11
Message-id: 20210121190622.22000-24-peter.maydell@linaro.org
8
---
12
---
9
target/arm/helper.h | 6 ++++++
13
hw/arm/armsse.c | 7 -------
10
target/arm/vec_helper.c | 26 ++++++++++++++++++++++++++
14
hw/arm/mps2-tz.c | 1 -
11
target/arm/translate-neon.c.inc | 4 ++--
15
hw/arm/mps2.c | 3 ---
12
3 files changed, 34 insertions(+), 2 deletions(-)
16
hw/arm/musca.c | 1 -
17
hw/arm/stellaris.c | 3 ---
18
5 files changed, 15 deletions(-)
13
19
14
diff --git a/target/arm/helper.h b/target/arm/helper.h
20
diff --git a/hw/arm/armsse.c b/hw/arm/armsse.c
15
index XXXXXXX..XXXXXXX 100644
21
index XXXXXXX..XXXXXXX 100644
16
--- a/target/arm/helper.h
22
--- a/hw/arm/armsse.c
17
+++ b/target/arm/helper.h
23
+++ b/hw/arm/armsse.c
18
@@ -XXX,XX +XXX,XX @@ DEF_HELPER_FLAGS_5(gvec_fcge_s, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, ptr, i32)
24
@@ -XXX,XX +XXX,XX @@ static void armsse_realize(DeviceState *dev, Error **errp)
19
DEF_HELPER_FLAGS_5(gvec_fcgt_h, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, ptr, i32)
25
* it to the appropriate PPC port; then we can realize the PPC and
20
DEF_HELPER_FLAGS_5(gvec_fcgt_s, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, ptr, i32)
26
* map its upstream ends to the right place in the container.
21
27
*/
22
+DEF_HELPER_FLAGS_5(gvec_facge_h, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, ptr, i32)
28
- qdev_prop_set_uint32(DEVICE(&s->timer0), "pclk-frq", s->mainclk_frq);
23
+DEF_HELPER_FLAGS_5(gvec_facge_s, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, ptr, i32)
29
qdev_connect_clock_in(DEVICE(&s->timer0), "pclk", s->mainclk);
24
+
30
if (!sysbus_realize(SYS_BUS_DEVICE(&s->timer0), errp)) {
25
+DEF_HELPER_FLAGS_5(gvec_facgt_h, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, ptr, i32)
31
return;
26
+DEF_HELPER_FLAGS_5(gvec_facgt_s, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, ptr, i32)
32
@@ -XXX,XX +XXX,XX @@ static void armsse_realize(DeviceState *dev, Error **errp)
27
+
33
object_property_set_link(OBJECT(&s->apb_ppc0), "port[0]", OBJECT(mr),
28
DEF_HELPER_FLAGS_5(gvec_ftsmul_h, TCG_CALL_NO_RWG,
34
&error_abort);
29
void, ptr, ptr, ptr, ptr, i32)
35
30
DEF_HELPER_FLAGS_5(gvec_ftsmul_s, TCG_CALL_NO_RWG,
36
- qdev_prop_set_uint32(DEVICE(&s->timer1), "pclk-frq", s->mainclk_frq);
31
diff --git a/target/arm/vec_helper.c b/target/arm/vec_helper.c
37
qdev_connect_clock_in(DEVICE(&s->timer1), "pclk", s->mainclk);
38
if (!sysbus_realize(SYS_BUS_DEVICE(&s->timer1), errp)) {
39
return;
40
@@ -XXX,XX +XXX,XX @@ static void armsse_realize(DeviceState *dev, Error **errp)
41
object_property_set_link(OBJECT(&s->apb_ppc0), "port[1]", OBJECT(mr),
42
&error_abort);
43
44
- qdev_prop_set_uint32(DEVICE(&s->dualtimer), "pclk-frq", s->mainclk_frq);
45
qdev_connect_clock_in(DEVICE(&s->dualtimer), "TIMCLK", s->mainclk);
46
if (!sysbus_realize(SYS_BUS_DEVICE(&s->dualtimer), errp)) {
47
return;
48
@@ -XXX,XX +XXX,XX @@ static void armsse_realize(DeviceState *dev, Error **errp)
49
/* Devices behind APB PPC1:
50
* 0x4002f000: S32K timer
51
*/
52
- qdev_prop_set_uint32(DEVICE(&s->s32ktimer), "pclk-frq", S32KCLK);
53
qdev_connect_clock_in(DEVICE(&s->s32ktimer), "pclk", s->s32kclk);
54
if (!sysbus_realize(SYS_BUS_DEVICE(&s->s32ktimer), errp)) {
55
return;
56
@@ -XXX,XX +XXX,XX @@ static void armsse_realize(DeviceState *dev, Error **errp)
57
qdev_connect_gpio_out(DEVICE(&s->nmi_orgate), 0,
58
qdev_get_gpio_in_named(DEVICE(&s->armv7m), "NMI", 0));
59
60
- qdev_prop_set_uint32(DEVICE(&s->s32kwatchdog), "wdogclk-frq", S32KCLK);
61
qdev_connect_clock_in(DEVICE(&s->s32kwatchdog), "WDOGCLK", s->s32kclk);
62
if (!sysbus_realize(SYS_BUS_DEVICE(&s->s32kwatchdog), errp)) {
63
return;
64
@@ -XXX,XX +XXX,XX @@ static void armsse_realize(DeviceState *dev, Error **errp)
65
66
/* 0x40080000 .. 0x4008ffff : ARMSSE second Base peripheral region */
67
68
- qdev_prop_set_uint32(DEVICE(&s->nswatchdog), "wdogclk-frq", s->mainclk_frq);
69
qdev_connect_clock_in(DEVICE(&s->nswatchdog), "WDOGCLK", s->mainclk);
70
if (!sysbus_realize(SYS_BUS_DEVICE(&s->nswatchdog), errp)) {
71
return;
72
@@ -XXX,XX +XXX,XX @@ static void armsse_realize(DeviceState *dev, Error **errp)
73
armsse_get_common_irq_in(s, 1));
74
sysbus_mmio_map(SYS_BUS_DEVICE(&s->nswatchdog), 0, 0x40081000);
75
76
- qdev_prop_set_uint32(DEVICE(&s->swatchdog), "wdogclk-frq", s->mainclk_frq);
77
qdev_connect_clock_in(DEVICE(&s->swatchdog), "WDOGCLK", s->mainclk);
78
if (!sysbus_realize(SYS_BUS_DEVICE(&s->swatchdog), errp)) {
79
return;
80
diff --git a/hw/arm/mps2-tz.c b/hw/arm/mps2-tz.c
32
index XXXXXXX..XXXXXXX 100644
81
index XXXXXXX..XXXXXXX 100644
33
--- a/target/arm/vec_helper.c
82
--- a/hw/arm/mps2-tz.c
34
+++ b/target/arm/vec_helper.c
83
+++ b/hw/arm/mps2-tz.c
35
@@ -XXX,XX +XXX,XX @@ static uint32_t float32_cgt(float32 op1, float32 op2, float_status *stat)
84
@@ -XXX,XX +XXX,XX @@ static void mps2tz_common_init(MachineState *machine)
36
return -float32_lt(op2, op1, stat);
85
object_property_set_link(OBJECT(&mms->iotkit), "memory",
37
}
86
OBJECT(system_memory), &error_abort);
38
87
qdev_prop_set_uint32(iotkitdev, "EXP_NUMIRQ", MPS2TZ_NUMIRQ);
39
+static uint16_t float16_acge(float16 op1, float16 op2, float_status *stat)
88
- qdev_prop_set_uint32(iotkitdev, "MAINCLK_FRQ", SYSCLK_FRQ);
40
+{
89
qdev_connect_clock_in(iotkitdev, "MAINCLK", mms->sysclk);
41
+ return -float16_le(float16_abs(op2), float16_abs(op1), stat);
90
qdev_connect_clock_in(iotkitdev, "S32KCLK", mms->s32kclk);
42
+}
91
sysbus_realize(SYS_BUS_DEVICE(&mms->iotkit), &error_fatal);
43
+
92
diff --git a/hw/arm/mps2.c b/hw/arm/mps2.c
44
+static uint32_t float32_acge(float32 op1, float32 op2, float_status *stat)
45
+{
46
+ return -float32_le(float32_abs(op2), float32_abs(op1), stat);
47
+}
48
+
49
+static uint16_t float16_acgt(float16 op1, float16 op2, float_status *stat)
50
+{
51
+ return -float16_lt(float16_abs(op2), float16_abs(op1), stat);
52
+}
53
+
54
+static uint32_t float32_acgt(float32 op1, float32 op2, float_status *stat)
55
+{
56
+ return -float32_lt(float32_abs(op2), float32_abs(op1), stat);
57
+}
58
+
59
#define DO_2OP(NAME, FUNC, TYPE) \
60
void HELPER(NAME)(void *vd, void *vn, void *stat, uint32_t desc) \
61
{ \
62
@@ -XXX,XX +XXX,XX @@ DO_3OP(gvec_fcge_s, float32_cge, float32)
63
DO_3OP(gvec_fcgt_h, float16_cgt, float16)
64
DO_3OP(gvec_fcgt_s, float32_cgt, float32)
65
66
+DO_3OP(gvec_facge_h, float16_acge, float16)
67
+DO_3OP(gvec_facge_s, float32_acge, float32)
68
+
69
+DO_3OP(gvec_facgt_h, float16_acgt, float16)
70
+DO_3OP(gvec_facgt_s, float32_acgt, float32)
71
+
72
#ifdef TARGET_AARCH64
73
74
DO_3OP(gvec_recps_h, helper_recpsf_f16, float16)
75
diff --git a/target/arm/translate-neon.c.inc b/target/arm/translate-neon.c.inc
76
index XXXXXXX..XXXXXXX 100644
93
index XXXXXXX..XXXXXXX 100644
77
--- a/target/arm/translate-neon.c.inc
94
--- a/hw/arm/mps2.c
78
+++ b/target/arm/translate-neon.c.inc
95
+++ b/hw/arm/mps2.c
79
@@ -XXX,XX +XXX,XX @@ DO_3S_FP_GVEC(VMUL, gen_helper_gvec_fmul_s, gen_helper_gvec_fmul_h)
96
@@ -XXX,XX +XXX,XX @@ static void mps2_common_init(MachineState *machine)
80
DO_3S_FP_GVEC(VCEQ, gen_helper_gvec_fceq_s, gen_helper_gvec_fceq_h)
97
object_initialize_child(OBJECT(mms), name, &mms->timer[i],
81
DO_3S_FP_GVEC(VCGE, gen_helper_gvec_fcge_s, gen_helper_gvec_fcge_h)
98
TYPE_CMSDK_APB_TIMER);
82
DO_3S_FP_GVEC(VCGT, gen_helper_gvec_fcgt_s, gen_helper_gvec_fcgt_h)
99
sbd = SYS_BUS_DEVICE(&mms->timer[i]);
83
+DO_3S_FP_GVEC(VACGE, gen_helper_gvec_facge_s, gen_helper_gvec_facge_h)
100
- qdev_prop_set_uint32(DEVICE(&mms->timer[i]), "pclk-frq", SYSCLK_FRQ);
84
+DO_3S_FP_GVEC(VACGT, gen_helper_gvec_facgt_s, gen_helper_gvec_facgt_h)
101
qdev_connect_clock_in(DEVICE(&mms->timer[i]), "pclk", mms->sysclk);
85
102
sysbus_realize_and_unref(sbd, &error_fatal);
86
/*
103
sysbus_mmio_map(sbd, 0, base);
87
* For all the functions using this macro, size == 1 means fp16,
104
@@ -XXX,XX +XXX,XX @@ static void mps2_common_init(MachineState *machine)
88
@@ -XXX,XX +XXX,XX @@ DO_3S_FP_GVEC(VCGT, gen_helper_gvec_fcgt_s, gen_helper_gvec_fcgt_h)
105
89
return do_3same_fp(s, a, FUNC, READS_VD); \
106
object_initialize_child(OBJECT(mms), "dualtimer", &mms->dualtimer,
90
}
107
TYPE_CMSDK_APB_DUALTIMER);
91
108
- qdev_prop_set_uint32(DEVICE(&mms->dualtimer), "pclk-frq", SYSCLK_FRQ);
92
-DO_3S_FP(VACGE, gen_helper_neon_acge_f32, false)
109
qdev_connect_clock_in(DEVICE(&mms->dualtimer), "TIMCLK", mms->sysclk);
93
-DO_3S_FP(VACGT, gen_helper_neon_acgt_f32, false)
110
sysbus_realize(SYS_BUS_DEVICE(&mms->dualtimer), &error_fatal);
94
DO_3S_FP(VMAX, gen_helper_vfp_maxs, false)
111
sysbus_connect_irq(SYS_BUS_DEVICE(&mms->dualtimer), 0,
95
DO_3S_FP(VMIN, gen_helper_vfp_mins, false)
112
@@ -XXX,XX +XXX,XX @@ static void mps2_common_init(MachineState *machine)
113
sysbus_mmio_map(SYS_BUS_DEVICE(&mms->dualtimer), 0, 0x40002000);
114
object_initialize_child(OBJECT(mms), "watchdog", &mms->watchdog,
115
TYPE_CMSDK_APB_WATCHDOG);
116
- qdev_prop_set_uint32(DEVICE(&mms->watchdog), "wdogclk-frq", SYSCLK_FRQ);
117
qdev_connect_clock_in(DEVICE(&mms->watchdog), "WDOGCLK", mms->sysclk);
118
sysbus_realize(SYS_BUS_DEVICE(&mms->watchdog), &error_fatal);
119
sysbus_connect_irq(SYS_BUS_DEVICE(&mms->watchdog), 0,
120
diff --git a/hw/arm/musca.c b/hw/arm/musca.c
121
index XXXXXXX..XXXXXXX 100644
122
--- a/hw/arm/musca.c
123
+++ b/hw/arm/musca.c
124
@@ -XXX,XX +XXX,XX @@ static void musca_init(MachineState *machine)
125
qdev_prop_set_uint32(ssedev, "EXP_NUMIRQ", mmc->num_irqs);
126
qdev_prop_set_uint32(ssedev, "init-svtor", mmc->init_svtor);
127
qdev_prop_set_uint32(ssedev, "SRAM_ADDR_WIDTH", mmc->sram_addr_width);
128
- qdev_prop_set_uint32(ssedev, "MAINCLK_FRQ", SYSCLK_FRQ);
129
qdev_connect_clock_in(ssedev, "MAINCLK", mms->sysclk);
130
qdev_connect_clock_in(ssedev, "S32KCLK", mms->s32kclk);
131
/*
132
diff --git a/hw/arm/stellaris.c b/hw/arm/stellaris.c
133
index XXXXXXX..XXXXXXX 100644
134
--- a/hw/arm/stellaris.c
135
+++ b/hw/arm/stellaris.c
136
@@ -XXX,XX +XXX,XX @@ static void stellaris_init(MachineState *ms, stellaris_board_info *board)
137
if (board->dc1 & (1 << 3)) { /* watchdog present */
138
dev = qdev_new(TYPE_LUMINARY_WATCHDOG);
139
140
- /* system_clock_scale is valid now */
141
- uint32_t mainclk = NANOSECONDS_PER_SECOND / system_clock_scale;
142
- qdev_prop_set_uint32(dev, "wdogclk-frq", mainclk);
143
qdev_connect_clock_in(dev, "WDOGCLK",
144
qdev_get_clock_out(ssys_dev, "SYSCLK"));
96
145
97
--
146
--
98
2.20.1
147
2.20.1
99
148
100
149
diff view generated by jsdifflib
1
Convert the Neon VRECPS insn to using a gvec helper, and
1
Now no users are setting the frq properties on the CMSDK timer,
2
use this to implement the fp16 case.
2
dualtimer, watchdog or ARMSSE SoC devices, we can remove the
3
3
properties and the struct fields that back them.
4
The phrasing of the new float32_recps_nf() is slightly different from
5
the old recps_f32() so that it parallels the f16 version; for f16 we
6
can't assume that flush-to-zero is always enabled.
7
4
8
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
5
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
9
Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
6
Reviewed-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
10
Message-id: 20200828183354.27913-34-peter.maydell@linaro.org
7
Reviewed-by: Luc Michel <luc@lmichel.fr>
8
Tested-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
9
Message-id: 20210128114145.20536-25-peter.maydell@linaro.org
10
Message-id: 20210121190622.22000-25-peter.maydell@linaro.org
11
---
11
---
12
target/arm/helper.h | 4 +++-
12
include/hw/arm/armsse.h | 2 --
13
target/arm/vec_helper.c | 31 +++++++++++++++++++++++++++++++
13
include/hw/timer/cmsdk-apb-dualtimer.h | 2 --
14
target/arm/vfp_helper.c | 13 -------------
14
include/hw/timer/cmsdk-apb-timer.h | 2 --
15
target/arm/translate-neon.c.inc | 21 +--------------------
15
include/hw/watchdog/cmsdk-apb-watchdog.h | 2 --
16
4 files changed, 35 insertions(+), 34 deletions(-)
16
hw/arm/armsse.c | 2 --
17
hw/timer/cmsdk-apb-dualtimer.c | 6 ------
18
hw/timer/cmsdk-apb-timer.c | 6 ------
19
hw/watchdog/cmsdk-apb-watchdog.c | 6 ------
20
8 files changed, 28 deletions(-)
17
21
18
diff --git a/target/arm/helper.h b/target/arm/helper.h
22
diff --git a/include/hw/arm/armsse.h b/include/hw/arm/armsse.h
19
index XXXXXXX..XXXXXXX 100644
23
index XXXXXXX..XXXXXXX 100644
20
--- a/target/arm/helper.h
24
--- a/include/hw/arm/armsse.h
21
+++ b/target/arm/helper.h
25
+++ b/include/hw/arm/armsse.h
22
@@ -XXX,XX +XXX,XX @@ DEF_HELPER_4(vfp_muladdd, f64, f64, f64, f64, ptr)
26
@@ -XXX,XX +XXX,XX @@
23
DEF_HELPER_4(vfp_muladds, f32, f32, f32, f32, ptr)
27
* + Clock input "S32KCLK": slow 32KHz clock used for a few peripherals
24
DEF_HELPER_4(vfp_muladdh, f16, f16, f16, f16, ptr)
28
* + QOM property "memory" is a MemoryRegion containing the devices provided
25
29
* by the board model.
26
-DEF_HELPER_3(recps_f32, f32, env, f32, f32)
30
- * + QOM property "MAINCLK_FRQ" is the frequency of the main system clock
27
DEF_HELPER_3(rsqrts_f32, f32, env, f32, f32)
31
* + QOM property "EXP_NUMIRQ" sets the number of expansion interrupts.
28
DEF_HELPER_FLAGS_2(recpe_f16, TCG_CALL_NO_RWG, f16, f16, ptr)
32
* (In hardware, the SSE-200 permits the number of expansion interrupts
29
DEF_HELPER_FLAGS_2(recpe_f32, TCG_CALL_NO_RWG, f32, f32, ptr)
33
* for the two CPUs to be configured separately, but we restrict it to
30
@@ -XXX,XX +XXX,XX @@ DEF_HELPER_FLAGS_5(gvec_fmaxnum_s, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, ptr, i3
34
@@ -XXX,XX +XXX,XX @@ struct ARMSSE {
31
DEF_HELPER_FLAGS_5(gvec_fminnum_h, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, ptr, i32)
35
/* Properties */
32
DEF_HELPER_FLAGS_5(gvec_fminnum_s, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, ptr, i32)
36
MemoryRegion *board_memory;
33
37
uint32_t exp_numirq;
34
+DEF_HELPER_FLAGS_5(gvec_recps_nf_h, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, ptr, i32)
38
- uint32_t mainclk_frq;
35
+DEF_HELPER_FLAGS_5(gvec_recps_nf_s, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, ptr, i32)
39
uint32_t sram_addr_width;
36
+
40
uint32_t init_svtor;
37
DEF_HELPER_FLAGS_5(gvec_fmla_h, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, ptr, i32)
41
bool cpu_fpu[SSE_MAX_CPUS];
38
DEF_HELPER_FLAGS_5(gvec_fmla_s, TCG_CALL_NO_RWG, void, ptr, ptr, ptr, ptr, i32)
42
diff --git a/include/hw/timer/cmsdk-apb-dualtimer.h b/include/hw/timer/cmsdk-apb-dualtimer.h
39
40
diff --git a/target/arm/vec_helper.c b/target/arm/vec_helper.c
41
index XXXXXXX..XXXXXXX 100644
43
index XXXXXXX..XXXXXXX 100644
42
--- a/target/arm/vec_helper.c
44
--- a/include/hw/timer/cmsdk-apb-dualtimer.h
43
+++ b/target/arm/vec_helper.c
45
+++ b/include/hw/timer/cmsdk-apb-dualtimer.h
44
@@ -XXX,XX +XXX,XX @@ static float32 float32_abd(float32 op1, float32 op2, float_status *stat)
46
@@ -XXX,XX +XXX,XX @@
45
return float32_abs(float32_sub(op1, op2, stat));
47
* https://developer.arm.com/products/system-design/system-design-kits/cortex-m-system-design-kit
48
*
49
* QEMU interface:
50
- * + QOM property "pclk-frq": frequency at which the timer is clocked
51
* + Clock input "TIMCLK": clock (for both timers)
52
* + sysbus MMIO region 0: the register bank
53
* + sysbus IRQ 0: combined timer interrupt TIMINTC
54
@@ -XXX,XX +XXX,XX @@ struct CMSDKAPBDualTimer {
55
/*< public >*/
56
MemoryRegion iomem;
57
qemu_irq timerintc;
58
- uint32_t pclk_frq;
59
Clock *timclk;
60
61
CMSDKAPBDualTimerModule timermod[CMSDK_APB_DUALTIMER_NUM_MODULES];
62
diff --git a/include/hw/timer/cmsdk-apb-timer.h b/include/hw/timer/cmsdk-apb-timer.h
63
index XXXXXXX..XXXXXXX 100644
64
--- a/include/hw/timer/cmsdk-apb-timer.h
65
+++ b/include/hw/timer/cmsdk-apb-timer.h
66
@@ -XXX,XX +XXX,XX @@ OBJECT_DECLARE_SIMPLE_TYPE(CMSDKAPBTimer, CMSDK_APB_TIMER)
67
68
/*
69
* QEMU interface:
70
- * + QOM property "pclk-frq": frequency at which the timer is clocked
71
* + Clock input "pclk": clock for the timer
72
* + sysbus MMIO region 0: the register bank
73
* + sysbus IRQ 0: timer interrupt TIMERINT
74
@@ -XXX,XX +XXX,XX @@ struct CMSDKAPBTimer {
75
/*< public >*/
76
MemoryRegion iomem;
77
qemu_irq timerint;
78
- uint32_t pclk_frq;
79
struct ptimer_state *timer;
80
Clock *pclk;
81
82
diff --git a/include/hw/watchdog/cmsdk-apb-watchdog.h b/include/hw/watchdog/cmsdk-apb-watchdog.h
83
index XXXXXXX..XXXXXXX 100644
84
--- a/include/hw/watchdog/cmsdk-apb-watchdog.h
85
+++ b/include/hw/watchdog/cmsdk-apb-watchdog.h
86
@@ -XXX,XX +XXX,XX @@
87
* https://developer.arm.com/products/system-design/system-design-kits/cortex-m-system-design-kit
88
*
89
* QEMU interface:
90
- * + QOM property "wdogclk-frq": frequency at which the watchdog is clocked
91
* + Clock input "WDOGCLK": clock for the watchdog's timer
92
* + sysbus MMIO region 0: the register bank
93
* + sysbus IRQ 0: watchdog interrupt
94
@@ -XXX,XX +XXX,XX @@ struct CMSDKAPBWatchdog {
95
/*< public >*/
96
MemoryRegion iomem;
97
qemu_irq wdogint;
98
- uint32_t wdogclk_frq;
99
bool is_luminary;
100
struct ptimer_state *timer;
101
Clock *wdogclk;
102
diff --git a/hw/arm/armsse.c b/hw/arm/armsse.c
103
index XXXXXXX..XXXXXXX 100644
104
--- a/hw/arm/armsse.c
105
+++ b/hw/arm/armsse.c
106
@@ -XXX,XX +XXX,XX @@ static Property iotkit_properties[] = {
107
DEFINE_PROP_LINK("memory", ARMSSE, board_memory, TYPE_MEMORY_REGION,
108
MemoryRegion *),
109
DEFINE_PROP_UINT32("EXP_NUMIRQ", ARMSSE, exp_numirq, 64),
110
- DEFINE_PROP_UINT32("MAINCLK_FRQ", ARMSSE, mainclk_frq, 0),
111
DEFINE_PROP_UINT32("SRAM_ADDR_WIDTH", ARMSSE, sram_addr_width, 15),
112
DEFINE_PROP_UINT32("init-svtor", ARMSSE, init_svtor, 0x10000000),
113
DEFINE_PROP_BOOL("CPU0_FPU", ARMSSE, cpu_fpu[0], true),
114
@@ -XXX,XX +XXX,XX @@ static Property armsse_properties[] = {
115
DEFINE_PROP_LINK("memory", ARMSSE, board_memory, TYPE_MEMORY_REGION,
116
MemoryRegion *),
117
DEFINE_PROP_UINT32("EXP_NUMIRQ", ARMSSE, exp_numirq, 64),
118
- DEFINE_PROP_UINT32("MAINCLK_FRQ", ARMSSE, mainclk_frq, 0),
119
DEFINE_PROP_UINT32("SRAM_ADDR_WIDTH", ARMSSE, sram_addr_width, 15),
120
DEFINE_PROP_UINT32("init-svtor", ARMSSE, init_svtor, 0x10000000),
121
DEFINE_PROP_BOOL("CPU0_FPU", ARMSSE, cpu_fpu[0], false),
122
diff --git a/hw/timer/cmsdk-apb-dualtimer.c b/hw/timer/cmsdk-apb-dualtimer.c
123
index XXXXXXX..XXXXXXX 100644
124
--- a/hw/timer/cmsdk-apb-dualtimer.c
125
+++ b/hw/timer/cmsdk-apb-dualtimer.c
126
@@ -XXX,XX +XXX,XX @@ static const VMStateDescription cmsdk_apb_dualtimer_vmstate = {
127
}
128
};
129
130
-static Property cmsdk_apb_dualtimer_properties[] = {
131
- DEFINE_PROP_UINT32("pclk-frq", CMSDKAPBDualTimer, pclk_frq, 0),
132
- DEFINE_PROP_END_OF_LIST(),
133
-};
134
-
135
static void cmsdk_apb_dualtimer_class_init(ObjectClass *klass, void *data)
136
{
137
DeviceClass *dc = DEVICE_CLASS(klass);
138
@@ -XXX,XX +XXX,XX @@ static void cmsdk_apb_dualtimer_class_init(ObjectClass *klass, void *data)
139
dc->realize = cmsdk_apb_dualtimer_realize;
140
dc->vmsd = &cmsdk_apb_dualtimer_vmstate;
141
dc->reset = cmsdk_apb_dualtimer_reset;
142
- device_class_set_props(dc, cmsdk_apb_dualtimer_properties);
46
}
143
}
47
144
48
+/*
145
static const TypeInfo cmsdk_apb_dualtimer_info = {
49
+ * Reciprocal step. These are the AArch32 version which uses a
146
diff --git a/hw/timer/cmsdk-apb-timer.c b/hw/timer/cmsdk-apb-timer.c
50
+ * non-fused multiply-and-subtract.
51
+ */
52
+static float16 float16_recps_nf(float16 op1, float16 op2, float_status *stat)
53
+{
54
+ op1 = float16_squash_input_denormal(op1, stat);
55
+ op2 = float16_squash_input_denormal(op2, stat);
56
+
57
+ if ((float16_is_infinity(op1) && float16_is_zero(op2)) ||
58
+ (float16_is_infinity(op2) && float16_is_zero(op1))) {
59
+ return float16_two;
60
+ }
61
+ return float16_sub(float16_two, float16_mul(op1, op2, stat), stat);
62
+}
63
+
64
+static float32 float32_recps_nf(float32 op1, float32 op2, float_status *stat)
65
+{
66
+ op1 = float32_squash_input_denormal(op1, stat);
67
+ op2 = float32_squash_input_denormal(op2, stat);
68
+
69
+ if ((float32_is_infinity(op1) && float32_is_zero(op2)) ||
70
+ (float32_is_infinity(op2) && float32_is_zero(op1))) {
71
+ return float32_two;
72
+ }
73
+ return float32_sub(float32_two, float32_mul(op1, op2, stat), stat);
74
+}
75
+
76
#define DO_3OP(NAME, FUNC, TYPE) \
77
void HELPER(NAME)(void *vd, void *vn, void *vm, void *stat, uint32_t desc) \
78
{ \
79
@@ -XXX,XX +XXX,XX @@ DO_3OP(gvec_fmaxnum_s, float32_maxnum, float32)
80
DO_3OP(gvec_fminnum_h, float16_minnum, float16)
81
DO_3OP(gvec_fminnum_s, float32_minnum, float32)
82
83
+DO_3OP(gvec_recps_nf_h, float16_recps_nf, float16)
84
+DO_3OP(gvec_recps_nf_s, float32_recps_nf, float32)
85
+
86
#ifdef TARGET_AARCH64
87
88
DO_3OP(gvec_recps_h, helper_recpsf_f16, float16)
89
diff --git a/target/arm/vfp_helper.c b/target/arm/vfp_helper.c
90
index XXXXXXX..XXXXXXX 100644
147
index XXXXXXX..XXXXXXX 100644
91
--- a/target/arm/vfp_helper.c
148
--- a/hw/timer/cmsdk-apb-timer.c
92
+++ b/target/arm/vfp_helper.c
149
+++ b/hw/timer/cmsdk-apb-timer.c
93
@@ -XXX,XX +XXX,XX @@ uint32_t HELPER(vfp_fcvt_f64_to_f16)(float64 a, void *fpstp, uint32_t ahp_mode)
150
@@ -XXX,XX +XXX,XX @@ static const VMStateDescription cmsdk_apb_timer_vmstate = {
94
return r;
151
}
152
};
153
154
-static Property cmsdk_apb_timer_properties[] = {
155
- DEFINE_PROP_UINT32("pclk-frq", CMSDKAPBTimer, pclk_frq, 0),
156
- DEFINE_PROP_END_OF_LIST(),
157
-};
158
-
159
static void cmsdk_apb_timer_class_init(ObjectClass *klass, void *data)
160
{
161
DeviceClass *dc = DEVICE_CLASS(klass);
162
@@ -XXX,XX +XXX,XX @@ static void cmsdk_apb_timer_class_init(ObjectClass *klass, void *data)
163
dc->realize = cmsdk_apb_timer_realize;
164
dc->vmsd = &cmsdk_apb_timer_vmstate;
165
dc->reset = cmsdk_apb_timer_reset;
166
- device_class_set_props(dc, cmsdk_apb_timer_properties);
95
}
167
}
96
168
97
-float32 HELPER(recps_f32)(CPUARMState *env, float32 a, float32 b)
169
static const TypeInfo cmsdk_apb_timer_info = {
98
-{
170
diff --git a/hw/watchdog/cmsdk-apb-watchdog.c b/hw/watchdog/cmsdk-apb-watchdog.c
99
- float_status *s = &env->vfp.standard_fp_status;
171
index XXXXXXX..XXXXXXX 100644
100
- if ((float32_is_infinity(a) && float32_is_zero_or_denormal(b)) ||
172
--- a/hw/watchdog/cmsdk-apb-watchdog.c
101
- (float32_is_infinity(b) && float32_is_zero_or_denormal(a))) {
173
+++ b/hw/watchdog/cmsdk-apb-watchdog.c
102
- if (!(float32_is_zero(a) || float32_is_zero(b))) {
174
@@ -XXX,XX +XXX,XX @@ static const VMStateDescription cmsdk_apb_watchdog_vmstate = {
103
- float_raise(float_flag_input_denormal, s);
175
}
104
- }
176
};
105
- return float32_two;
177
106
- }
178
-static Property cmsdk_apb_watchdog_properties[] = {
107
- return float32_sub(float32_two, float32_mul(a, b, s), s);
179
- DEFINE_PROP_UINT32("wdogclk-frq", CMSDKAPBWatchdog, wdogclk_frq, 0),
108
-}
180
- DEFINE_PROP_END_OF_LIST(),
181
-};
109
-
182
-
110
float32 HELPER(rsqrts_f32)(CPUARMState *env, float32 a, float32 b)
183
static void cmsdk_apb_watchdog_class_init(ObjectClass *klass, void *data)
111
{
184
{
112
float_status *s = &env->vfp.standard_fp_status;
185
DeviceClass *dc = DEVICE_CLASS(klass);
113
diff --git a/target/arm/translate-neon.c.inc b/target/arm/translate-neon.c.inc
186
@@ -XXX,XX +XXX,XX @@ static void cmsdk_apb_watchdog_class_init(ObjectClass *klass, void *data)
114
index XXXXXXX..XXXXXXX 100644
187
dc->realize = cmsdk_apb_watchdog_realize;
115
--- a/target/arm/translate-neon.c.inc
188
dc->vmsd = &cmsdk_apb_watchdog_vmstate;
116
+++ b/target/arm/translate-neon.c.inc
189
dc->reset = cmsdk_apb_watchdog_reset;
117
@@ -XXX,XX +XXX,XX @@ DO_3S_FP_GVEC(VMLA, gen_helper_gvec_fmla_s, gen_helper_gvec_fmla_h)
190
- device_class_set_props(dc, cmsdk_apb_watchdog_properties);
118
DO_3S_FP_GVEC(VMLS, gen_helper_gvec_fmls_s, gen_helper_gvec_fmls_h)
119
DO_3S_FP_GVEC(VFMA, gen_helper_gvec_vfma_s, gen_helper_gvec_vfma_h)
120
DO_3S_FP_GVEC(VFMS, gen_helper_gvec_vfms_s, gen_helper_gvec_vfms_h)
121
+DO_3S_FP_GVEC(VRECPS, gen_helper_gvec_recps_nf_s, gen_helper_gvec_recps_nf_h)
122
123
WRAP_FP_GVEC(gen_VMAXNM_fp32_3s, FPST_STD, gen_helper_gvec_fmaxnum_s)
124
WRAP_FP_GVEC(gen_VMAXNM_fp16_3s, FPST_STD_F16, gen_helper_gvec_fmaxnum_h)
125
@@ -XXX,XX +XXX,XX @@ static bool trans_VMINNM_fp_3s(DisasContext *s, arg_3same *a)
126
return do_3same(s, a, gen_VMINNM_fp32_3s);
127
}
191
}
128
192
129
-WRAP_ENV_FN(gen_VRECPS_tramp, gen_helper_recps_f32)
193
static const TypeInfo cmsdk_apb_watchdog_info = {
130
-
131
-static void gen_VRECPS_fp_3s(unsigned vece, uint32_t rd_ofs,
132
- uint32_t rn_ofs, uint32_t rm_ofs,
133
- uint32_t oprsz, uint32_t maxsz)
134
-{
135
- static const GVecGen3 ops = { .fni4 = gen_VRECPS_tramp };
136
- tcg_gen_gvec_3(rd_ofs, rn_ofs, rm_ofs, oprsz, maxsz, &ops);
137
-}
138
-
139
-static bool trans_VRECPS_fp_3s(DisasContext *s, arg_3same *a)
140
-{
141
- if (a->size != 0) {
142
- /* TODO fp16 support */
143
- return false;
144
- }
145
-
146
- return do_3same(s, a, gen_VRECPS_fp_3s);
147
-}
148
-
149
WRAP_ENV_FN(gen_VRSQRTS_tramp, gen_helper_rsqrts_f32)
150
151
static void gen_VRSQRTS_fp_3s(unsigned vece, uint32_t rd_ofs,
152
--
194
--
153
2.20.1
195
2.20.1
154
196
155
197
diff view generated by jsdifflib
1
Macroify creation of the trans functions for single and double
1
Now that the watchdog device uses its Clock input rather than being
2
precision VFMA, VFMS, VFNMA, VFNMS. The repetition was OK for
2
passed the value of system_clock_scale at creation time, we can
3
two sizes, but we're about to add halfprec and it will get a bit
3
remove the hack where we reset the STELLARIS_SYS at board creation
4
more than seems reasonable.
4
time to force it to set system_clock_scale. Instead it will be reset
5
at the usual point in startup and will inform the watchdog of the
6
clock frequency at that point.
5
7
6
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
8
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
7
Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
9
Reviewed-by: Luc Michel <luc@lmichel.fr>
8
Message-id: 20200828183354.27913-6-peter.maydell@linaro.org
10
Tested-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
11
Reviewed-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
12
Message-id: 20210128114145.20536-26-peter.maydell@linaro.org
13
Message-id: 20210121190622.22000-26-peter.maydell@linaro.org
14
Reviewed-by: Philippe Mathieu-Daudé <f4bug@amsat.org>
9
---
15
---
10
target/arm/translate-vfp.c.inc | 50 +++++++++-------------------------
16
hw/arm/stellaris.c | 10 ----------
11
1 file changed, 13 insertions(+), 37 deletions(-)
17
1 file changed, 10 deletions(-)
12
18
13
diff --git a/target/arm/translate-vfp.c.inc b/target/arm/translate-vfp.c.inc
19
diff --git a/hw/arm/stellaris.c b/hw/arm/stellaris.c
14
index XXXXXXX..XXXXXXX 100644
20
index XXXXXXX..XXXXXXX 100644
15
--- a/target/arm/translate-vfp.c.inc
21
--- a/hw/arm/stellaris.c
16
+++ b/target/arm/translate-vfp.c.inc
22
+++ b/hw/arm/stellaris.c
17
@@ -XXX,XX +XXX,XX @@ static bool do_vfm_sp(DisasContext *s, arg_VFMA_sp *a, bool neg_n, bool neg_d)
23
@@ -XXX,XX +XXX,XX @@ static DeviceState *stellaris_sys_init(uint32_t base, qemu_irq irq,
18
return true;
24
sysbus_mmio_map(sbd, 0, base);
25
sysbus_connect_irq(sbd, 0, irq);
26
27
- /*
28
- * Normally we should not be resetting devices like this during
29
- * board creation. For the moment we need to do so, because
30
- * system_clock_scale will only get set when the STELLARIS_SYS
31
- * device is reset, and we need its initial value to pass to
32
- * the watchdog device. This hack can be removed once the
33
- * watchdog has been converted to use a Clock input instead.
34
- */
35
- device_cold_reset(dev);
36
-
37
return dev;
19
}
38
}
20
39
21
-static bool trans_VFMA_sp(DisasContext *s, arg_VFMA_sp *a)
22
-{
23
- return do_vfm_sp(s, a, false, false);
24
-}
25
-
26
-static bool trans_VFMS_sp(DisasContext *s, arg_VFMS_sp *a)
27
-{
28
- return do_vfm_sp(s, a, true, false);
29
-}
30
-
31
-static bool trans_VFNMA_sp(DisasContext *s, arg_VFNMA_sp *a)
32
-{
33
- return do_vfm_sp(s, a, false, true);
34
-}
35
-
36
-static bool trans_VFNMS_sp(DisasContext *s, arg_VFNMS_sp *a)
37
-{
38
- return do_vfm_sp(s, a, true, true);
39
-}
40
-
41
static bool do_vfm_dp(DisasContext *s, arg_VFMA_dp *a, bool neg_n, bool neg_d)
42
{
43
/*
44
@@ -XXX,XX +XXX,XX @@ static bool do_vfm_dp(DisasContext *s, arg_VFMA_dp *a, bool neg_n, bool neg_d)
45
return true;
46
}
47
48
-static bool trans_VFMA_dp(DisasContext *s, arg_VFMA_dp *a)
49
-{
50
- return do_vfm_dp(s, a, false, false);
51
-}
52
+#define MAKE_ONE_VFM_TRANS_FN(INSN, PREC, NEGN, NEGD) \
53
+ static bool trans_##INSN##_##PREC(DisasContext *s, \
54
+ arg_##INSN##_##PREC *a) \
55
+ { \
56
+ return do_vfm_##PREC(s, a, NEGN, NEGD); \
57
+ }
58
59
-static bool trans_VFMS_dp(DisasContext *s, arg_VFMS_dp *a)
60
-{
61
- return do_vfm_dp(s, a, true, false);
62
-}
63
+#define MAKE_VFM_TRANS_FNS(PREC) \
64
+ MAKE_ONE_VFM_TRANS_FN(VFMA, PREC, false, false) \
65
+ MAKE_ONE_VFM_TRANS_FN(VFMS, PREC, true, false) \
66
+ MAKE_ONE_VFM_TRANS_FN(VFNMA, PREC, false, true) \
67
+ MAKE_ONE_VFM_TRANS_FN(VFNMS, PREC, true, true)
68
69
-static bool trans_VFNMA_dp(DisasContext *s, arg_VFNMA_dp *a)
70
-{
71
- return do_vfm_dp(s, a, false, true);
72
-}
73
-
74
-static bool trans_VFNMS_dp(DisasContext *s, arg_VFNMS_dp *a)
75
-{
76
- return do_vfm_dp(s, a, true, true);
77
-}
78
+MAKE_VFM_TRANS_FNS(sp)
79
+MAKE_VFM_TRANS_FNS(dp)
80
81
static bool trans_VMOV_imm_sp(DisasContext *s, arg_VMOV_imm_sp *a)
82
{
83
--
40
--
84
2.20.1
41
2.20.1
85
42
86
43
diff view generated by jsdifflib
Deleted patch
1
Implement VFP fp16 support for the VMOV immediate insn.
2
1
3
Signed-off-by: Peter Maydell <peter.maydell@linaro.org>
4
Reviewed-by: Richard Henderson <richard.henderson@linaro.org>
5
Message-id: 20200828183354.27913-10-peter.maydell@linaro.org
6
---
7
target/arm/vfp.decode | 2 ++
8
target/arm/translate-vfp.c.inc | 22 ++++++++++++++++++++++
9
2 files changed, 24 insertions(+)
10
11
diff --git a/target/arm/vfp.decode b/target/arm/vfp.decode
12
index XXXXXXX..XXXXXXX 100644
13
--- a/target/arm/vfp.decode
14
+++ b/target/arm/vfp.decode
15
@@ -XXX,XX +XXX,XX @@ VFMS_dp ---- 1110 1.10 .... .... 1011 .1.0 .... @vfp_dnm_d
16
VFNMA_dp ---- 1110 1.01 .... .... 1011 .0.0 .... @vfp_dnm_d
17
VFNMS_dp ---- 1110 1.01 .... .... 1011 .1.0 .... @vfp_dnm_d
18
19
+VMOV_imm_hp ---- 1110 1.11 .... .... 1001 0000 .... \
20
+ vd=%vd_sp imm=%vmov_imm
21
VMOV_imm_sp ---- 1110 1.11 .... .... 1010 0000 .... \
22
vd=%vd_sp imm=%vmov_imm
23
VMOV_imm_dp ---- 1110 1.11 .... .... 1011 0000 .... \
24
diff --git a/target/arm/translate-vfp.c.inc b/target/arm/translate-vfp.c.inc
25
index XXXXXXX..XXXXXXX 100644
26
--- a/target/arm/translate-vfp.c.inc
27
+++ b/target/arm/translate-vfp.c.inc
28
@@ -XXX,XX +XXX,XX @@ MAKE_VFM_TRANS_FNS(hp)
29
MAKE_VFM_TRANS_FNS(sp)
30
MAKE_VFM_TRANS_FNS(dp)
31
32
+static bool trans_VMOV_imm_hp(DisasContext *s, arg_VMOV_imm_sp *a)
33
+{
34
+ TCGv_i32 fd;
35
+
36
+ if (!dc_isar_feature(aa32_fp16_arith, s)) {
37
+ return false;
38
+ }
39
+
40
+ if (s->vec_len != 0 || s->vec_stride != 0) {
41
+ return false;
42
+ }
43
+
44
+ if (!vfp_access_check(s)) {
45
+ return true;
46
+ }
47
+
48
+ fd = tcg_const_i32(vfp_expand_imm(MO_16, a->imm));
49
+ neon_store_reg32(fd, a->vd);
50
+ tcg_temp_free_i32(fd);
51
+ return true;
52
+}
53
+
54
static bool trans_VMOV_imm_sp(DisasContext *s, arg_VMOV_imm_sp *a)
55
{
56
uint32_t delta_d = 0;
57
--
58
2.20.1
59
60
diff view generated by jsdifflib