Skip to content

Commit 32c722a

Browse files
committed
iommu/s390: Make attach succeed when the device was surprise removed
JIRA: https://issues.redhat.com/browse/RHEL-113440 commit 9ffaf52 Author: Niklas Schnelle <schnelle@linux.ibm.com> Date: Thu Sep 4 10:59:49 2025 +0200 iommu/s390: Make attach succeed when the device was surprise removed When a PCI device is removed with surprise hotplug, there may still be attempts to attach the device to the default domain as part of tear down via (__iommu_release_dma_ownership()), or because the removal happens during probe (__iommu_probe_device()). In both cases zpci_register_ioat() fails with a cc value indicating that the device handle is invalid. This is because the device is no longer part of the instance as far as the hypervisor is concerned. Currently this leads to an error return and s390_iommu_attach_device() fails. This triggers the WARN_ON() in __iommu_group_set_domain_nofail() because attaching to the default domain must never fail. With the device fenced by the hypervisor no DMAs to or from memory are possible and the IOMMU translations have no effect. Proceed as if the registration was successful and let the hotplug event handling clean up the device. This is similar to how devices in the error state are handled since commit 59bbf59 ("iommu/s390: Make attach succeed even if the device is in error state") except that for removal the domain will not be registered later. This approach was also previously discussed at the link. Handle both cases, error state and removal, in a helper which checks if the error needs to be propagated or ignored. Avoid magic number condition codes by using the pre-existing, but never used, defines for PCI load/store condition codes and rename them to reflect that they apply to all PCI instructions. Cc: stable@vger.kernel.org # v6.2 Link: https://lore.kernel.org/linux-iommu/20240808194155.GD1985367@ziepe.ca/ Suggested-by: Jason Gunthorpe <jgg@ziepe.ca> Signed-off-by: Niklas Schnelle <schnelle@linux.ibm.com> Reviewed-by: Matthew Rosato <mjrosato@linux.ibm.com> Reviewed-by: Benjamin Block <bblock@linux.ibm.com> Link: https://lore.kernel.org/r/20250904-iommu_succeed_attach_removed-v1-1-e7f333d2f80f@linux.ibm.com Signed-off-by: Joerg Roedel <joerg.roedel@amd.com> Signed-off-by: Thomas Huth <thuth@redhat.com>
1 parent adb8164 commit 32c722a

File tree

2 files changed

+24
-12
lines changed

2 files changed

+24
-12
lines changed

arch/s390/include/asm/pci_insn.h

Lines changed: 5 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -16,11 +16,11 @@
1616
#define ZPCI_PCI_ST_FUNC_NOT_AVAIL 40
1717
#define ZPCI_PCI_ST_ALREADY_IN_RQ_STATE 44
1818

19-
/* Load/Store return codes */
20-
#define ZPCI_PCI_LS_OK 0
21-
#define ZPCI_PCI_LS_ERR 1
22-
#define ZPCI_PCI_LS_BUSY 2
23-
#define ZPCI_PCI_LS_INVAL_HANDLE 3
19+
/* PCI instruction condition codes */
20+
#define ZPCI_CC_OK 0
21+
#define ZPCI_CC_ERR 1
22+
#define ZPCI_CC_BUSY 2
23+
#define ZPCI_CC_INVAL_HANDLE 3
2424

2525
/* Load/Store address space identifiers */
2626
#define ZPCI_PCIAS_MEMIO_0 0

drivers/iommu/s390-iommu.c

Lines changed: 19 additions & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -611,6 +611,23 @@ static u64 get_iota_region_flag(struct s390_domain *domain)
611611
}
612612
}
613613

614+
static bool reg_ioat_propagate_error(int cc, u8 status)
615+
{
616+
/*
617+
* If the device is in the error state the reset routine
618+
* will register the IOAT of the newly set domain on re-enable
619+
*/
620+
if (cc == ZPCI_CC_ERR && status == ZPCI_PCI_ST_FUNC_NOT_AVAIL)
621+
return false;
622+
/*
623+
* If the device was removed treat registration as success
624+
* and let the subsequent error event trigger tear down.
625+
*/
626+
if (cc == ZPCI_CC_INVAL_HANDLE)
627+
return false;
628+
return cc != ZPCI_CC_OK;
629+
}
630+
614631
static int s390_iommu_domain_reg_ioat(struct zpci_dev *zdev,
615632
struct iommu_domain *domain, u8 *status)
616633
{
@@ -695,7 +712,7 @@ static int s390_iommu_attach_device(struct iommu_domain *domain,
695712

696713
/* If we fail now DMA remains blocked via blocking domain */
697714
cc = s390_iommu_domain_reg_ioat(zdev, domain, &status);
698-
if (cc && status != ZPCI_PCI_ST_FUNC_NOT_AVAIL)
715+
if (reg_ioat_propagate_error(cc, status))
699716
return -EIO;
700717
zdev->dma_table = s390_domain->dma_table;
701718
zdev_s390_domain_update(zdev, domain);
@@ -1123,12 +1140,7 @@ static int s390_attach_dev_identity(struct iommu_domain *domain,
11231140

11241141
/* If we fail now DMA remains blocked via blocking domain */
11251142
cc = s390_iommu_domain_reg_ioat(zdev, domain, &status);
1126-
1127-
/*
1128-
* If the device is undergoing error recovery the reset code
1129-
* will re-establish the new domain.
1130-
*/
1131-
if (cc && status != ZPCI_PCI_ST_FUNC_NOT_AVAIL)
1143+
if (reg_ioat_propagate_error(cc, status))
11321144
return -EIO;
11331145

11341146
zdev_s390_domain_update(zdev, domain);

0 commit comments

Comments
 (0)