[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Bug] Bring up Dom0 on Arm board

To: 蔡力刚 <cai_ligang@xxxxxxxxxxxxxxx>, xen-users <xen-users@xxxxxxxxxxxxxxxxxxxx>
From: Julien Grall <julien@xxxxxxx>
Date: Thu, 19 Jan 2023 13:55:12 +0000
Cc: "bertrand.marquis" <bertrand.marquis@xxxxxxx>, sstabellini <sstabellini@xxxxxxxxxx>
Delivery-date: Thu, 19 Jan 2023 13:56:03 +0000
List-id: Xen user discussion <xen-users.lists.xenproject.org>



On 18/01/2023 09:04, 蔡力刚 wrote:

About the mali and sdmmc drivers problem, I compare the log between boot with 
xen and boot without xen.
And found an error log as below:
[ 65.517345] arm-scmi firmware:scmi: SCMI Notifications - Core Enabled.
(XEN) d0v2 Unhandled SMC/HVC: 0x82000010
[ 66.559382] arm-scmi firmware:scmi: unable to communicate with SCMI
[ 66.559516] arm-scmi: probe of firmware:scmi failed with error -95
It seems SCMI driver probe failed.
So I did an experiment, disable SCMI driver and rebuild the Linux kernel,
boot up in normal way without xen, and reproduces the problem that mali and 
sdmmc did not bring up.
It looks like a high probability SCMI cause the problem.
I read the Linux code and targeting located the error -95,
It seems SCMI probe failed cause by SMCCC not supported, code as below:
static int smc_send_message(struct scmi_chan_info *cinfo,
struct scmi_xfer *xfer)
{
struct scmi_smc *scmi_info = cinfo->transport_info;
struct arm_smccc_res res;
mutex_lock(&scmi_info->shmem_lock);
shmem_tx_prepare(scmi_info->shmem, xfer);
if (scmi_info->irq)
reinit_completion(&scmi_info->tx_complete);
arm_smccc_1_1_invoke(scmi_info->func_id, 0, 0, 0, 0, 0, 0, 0, &res);
if (scmi_info->irq)
wait_for_completion(&scmi_info->tx_complete);
scmi_rx_callback(scmi_info->cinfo, shmem_read_header(scmi_info->shmem));
mutex_unlock(&scmi_info->shmem_lock);
/* Only SMCCC_RET_NOT_SUPPORTED is valid error code */
if (res.a0)
return -EOPNOTSUPP;
return 0;
}
#define EOPNOTSUPP 95 /* Operation not supported on transport endpoint */
I also check the code where Unhandled SMC/HVC print in xen,
and found the log cause by unhandled SMCCC call in function 
vsmccc_handle_call().
Could it be xen unhandle SMCCC call cause SCMI driver probe failed ?


Yes. The domain would need to talk to the host SCMI. This is not yet
supported because Xen doesn't provide a mediator (this is necessary to
ensure the safety of the call).

If you are *only* looking to use the Mali driver in dom0. So you could
add some code in Xen to forward simply forward the request to the host
and check if it helps you.


How can I pass the requset to the host? I'm not familiar with xen code,
is there any
reference code in xen?


There are a couple of solution:
1) You request the hypervisor to avoid trapping SVC. This would also
need some changes in Linux to force the Mali driver to use SVC rather
than HVC. There is a patch on xen-devel, to avoid trapping (see [1]).
2) Add an allow list of the SMCCC operations. There are some examples
how to "emulate" SMC call in Xen (see vsmccc_handle_call()).

Cheers,

[1]
https://lore.kernel.org/xen-devel/alpine.DEB.2.21.2106241749310.24906@sstabellini-ThinkPad-T480s/

I tried both solutions but it didn't work.
First way:
I add the code as the patch, add forward_smc=true to the Xen command line.
Then boot, but still print the log 'Unhandled SMC/HVC ...',


Can you confirm whether you ask the mali driver to use SMC call?

meanwhile xen
throw an exception, log as below:
(XEN) traps.c:1987:d0v3 HSR=0x92000007 pc=0xffffffc0106a3be8 
gva=0xffffffc0127ad0a0 gpa=0x000000001000a0
[ 5.966596] Unhandled fault at 0xffffffc0127ad0a0
[ 5.966619] Mem abort info:
[ 5.966633] ESR = 0x96000000
[ 5.966649] EC = 0x25: DABT (current EL), IL = 32 bits
[ 5.966666] SET = 0, FnV = 0
[ 5.966680] EA = 0, S1PTW = 0
[ 5.966694] Data abort info:
[ 5.966708] ISV = 0, ISS = 0x00000000
[ 5.966722] CM = 0, WnR = 0
[ 5.966738] swapper pgtable: 4k pages, 39-bit VAs, pgdp=00000000194b6000
It seems forward_smc=true did not work, but cause an exception.

This is indicating that the dom0 is trying to access a region that isnot mapped.


Can you check what the address 0x1000a0 is used for the host layout?

Second way:
I used zynqmp_eemi() in xilinx-zynqmp-eemi.c to handle smc call.
The smc call seems succeed according to xen log.
But after run smc call, kernel throw an exception, log as below:
[ 8.771446] rockchip-pm-domain fd8d8000.power-management:power-controller:
Looking up pcie-supply from device tree
[ 8.771485] rockchip-pm-domain fd8d8000.power-management:power-controller:
Looking up pcie-supply property in node 
/power-management@fd8d8000/power-controller failed
[ 66.037851] rcu: INFO: rcu_sched detected stalls on CPUs/tasks:
[ 66.037874] rcu: 3-...0: (0 ticks this GP) idle=196/1/0x4000000000000000 
softirq=30/30 fqs=6000
[ 66.037892] (detected by 5, t=18002 jiffies, g=-1147, q=81)
[ 66.037905] Task dump for CPU 3:
[ 66.037916] task:swapper/0 state:R running task stack: 0 pid: 1 ppid: 0 
flags:0x0000002a
[ 66.037939] Call trace:
[ 66.037952] __switch_to+0x130/0x1a0
[ 66.037968] 0xffffffc0112bc173
It seems I need to modify zynqmp_eemi() code to adapt the kerenl,
But based on what to modify zynqmp_eemi()?

I am not sure I understand what are the changes you made in Xen. Can youpost the diff?


Cheers,

--
Julien Grall

References:
- Re: [Bug] Bring up Dom0 on Arm board
  - From: 蔡力刚

Prev by Date: Re: [Bug] Bring up Dom0 on Arm board
Next by Date: Re: [Bug] Bring up Dom0 on Arm board
Previous by thread: Re: [Bug] Bring up Dom0 on Arm board
Next by thread: Re: [Bug] Bring up Dom0 on Arm board
Index(es):
- Date
- Thread

Lists.xenproject.org is hosted with RackSpace, monitoring our
servers 24x7x365 and backed by RackSpace's Fanatical Support®.