@@ -522,7 +522,6 @@ Description: These files allow to each of ASICs by writing 1.
522522
523523 The files are write only.
524524
525-
526525What: /sys/devices/platform/mlxplat/mlxreg-io/hwmon/hwmon*/comm_chnl_ready
527526Date: July 2022
528527KernelVersion: 5.20
@@ -542,3 +541,124 @@ Description: The file indicates COME module hardware configuration.
542541 The purpose is to expose some minor BOM changes for the same system SKU.
543542
544543 The file is read only.
544+
545+ What: /sys/devices/platform/mlxplat/mlxreg-io/hwmon/hwmon*/reset_pwr_converter_fail
546+ Date: February 2023
547+ KernelVersion: 6.3
548+ Contact: Vadim Pasternak <vadimp@nvidia.com>
549+ Description: This file shows the system reset cause due to power converter
550+ devices failure.
551+ Value 1 in file means this is reset cause, 0 - otherwise.
552+
553+ The file is read only.
554+
555+ What: /sys/devices/platform/mlxplat/mlxreg-io/hwmon/hwmon*/erot1_ap_reset
556+ What: /sys/devices/platform/mlxplat/mlxreg-io/hwmon/hwmon*/erot2_ap_reset
557+ Date: February 2023
558+ KernelVersion: 6.3
559+ Contact: Vadim Pasternak <vadimp@nvidia.com>
560+ Description: These files aim to monitor the status of the External Root of Trust (EROT)
561+ processor's RESET output to the Application Processor (AP).
562+ By reading this file, could be determined if the EROT has invalidated or
563+ revoked AP Firmware, at which point it will hold the AP in RESET until a
564+ valid firmware is loaded. This protects the AP from running an
565+ unauthorized firmware. In the normal flow, the AP reset should be released
566+ after the EROT validates the integrity of the FW, and it should be done so
567+ as quickly as possible so that the AP boots before the CPU starts to
568+ communicate to each ASIC.
569+
570+ The files are read only.
571+
572+ What: /sys/devices/platform/mlxplat/mlxreg-io/hwmon/hwmon*/erot1_recovery
573+ What: /sys/devices/platform/mlxplat/mlxreg-io/hwmon/hwmon*/erot2_recovery
574+ What: /sys/devices/platform/mlxplat/mlxreg-io/hwmon/hwmon*/erot1_reset
575+ What: /sys/devices/platform/mlxplat/mlxreg-io/hwmon/hwmon*/erot2_reset
576+ Date: February 2023
577+ KernelVersion: 6.3
578+ Contact: Vadim Pasternak <vadimp@nvidia.com>
579+ Description: These files aim to perform External Root of Trust (EROT) recovery
580+ sequence after EROT device failure.
581+ These EROT devices protect ASICs from unauthorized access and in normal
582+ flow their reset should be released with system power – earliest power
583+ up stage, so that EROTs can begin boot and authentication process before
584+ CPU starts to communicate to ASICs.
585+ Issuing a reset to the EROT while asserting the recovery signal will cause
586+ the EROT Application Processor to enter recovery mode so that the EROT FW
587+ can be updated/recovered.
588+ For reset/recovery the related file should be toggled by 1/0.
589+
590+ The files are read/write.
591+
592+ What: /sys/devices/platform/mlxplat/mlxreg-io/hwmon/hwmon*/erot1_wp
593+ What: /sys/devices/platform/mlxplat/mlxreg-io/hwmon/hwmon*/erot2_wp
594+ Date: February 2023
595+ KernelVersion: 6.3
596+ Contact: Vadim Pasternak <vadimp@nvidia.com>
597+ Description: These files allow access to External Root of Trust (EROT) for reset
598+ and recovery sequence after EROT device failure.
599+ Default is 0 (programming disabled).
600+ If the system is in locked-down mode writing this file will not be allowed.
601+
602+ The files are read/write.
603+
604+ What: /sys/devices/platform/mlxplat/mlxreg-io/hwmon/hwmon*/spi_chnl_select
605+ Date: February 2023
606+ KernelVersion: 6.3
607+ Contact: Vadim Pasternak <vadimp@nvidia.com>
608+ Description: This file allows SPI chip selection for External Root of Trust (EROT)
609+ device Out-of-Band recovery.
610+ File can be written with 0 or with 1. It selects which EROT can be accessed
611+ through SPI device.
612+
613+ The file is read/write.
614+
615+ What: /sys/devices/platform/mlxplat/mlxreg-io/hwmon/hwmon*/asic_pg_fail
616+ Date: February 2023
617+ KernelVersion: 6.3
618+ Contact: Vadim Pasternak vadimp@nvidia.com
619+ Description: This file shows ASIC Power Good status.
620+ Value 1 in file means ASIC Power Good failed, 0 - otherwise.
621+
622+ The file is read only.
623+
624+ What: /sys/devices/platform/mlxplat/mlxreg-io/hwmon/hwmon*/clk_brd1_boot_fail
625+ What: /sys/devices/platform/mlxplat/mlxreg-io/hwmon/hwmon*/clk_brd2_boot_fail
626+ What: /sys/devices/platform/mlxplat/mlxreg-io/hwmon/hwmon*/clk_brd_fail
627+ Date: February 2023
628+ KernelVersion: 6.3
629+ Contact: Vadim Pasternak vadimp@nvidia.com
630+ Description: These files are related to clock boards status in system.
631+ - clk_brd1_boot_fail: warning about 1-st clock board failed to boot from CI.
632+ - clk_brd2_boot_fail: warning about 2-nd clock board failed to boot from CI.
633+ - clk_brd_fail: error about common clock board boot failure.
634+
635+ The files are read only.
636+
637+ What: /sys/devices/platform/mlxplat/mlxreg-io/hwmon/hwmon*/clk_brd_prog_en
638+ Date: February 2023
639+ KernelVersion: 6.3
640+ Contact: Vadim Pasternak <vadimp@nvidia.com>
641+ Description: This file enables programming of clock boards.
642+ Default is 0 (programming disabled).
643+ If the system is in locked-down mode writing this file will not be allowed.
644+
645+ The file is read/write.
646+
647+ What: /sys/devices/platform/mlxplat/mlxreg-io/hwmon/hwmon*/pwr_converter_prog_en
648+ Date: February 2023
649+ KernelVersion: 6.3
650+ Contact: Vadim Pasternak <vadimp@nvidia.com>
651+ Description: This file enables programming of power converters.
652+ Default is 0 (programming disabled).
653+ If the system is in locked-down mode writing this file will not be allowed.
654+
655+ The file is read/write.
656+
657+ What: /sys/devices/platform/mlxplat/mlxreg-io/hwmon/hwmon*/reset_ac_ok_fail
658+ Date: February 2023
659+ KernelVersion: 6.3
660+ Contact: Vadim Pasternak <vadimp@nvidia.com>
661+ Description: This file shows the system reset cause due to AC power failure.
662+ Value 1 in file means this is reset cause, 0 - otherwise.
663+
664+ The file is read only.
0 commit comments