19. AMD HSMP interface¶
Newer Fam19h EPYC server line of processors from AMD support system management functionality via HSMP (Host System Management Port).
The Host System Management Port (HSMP) is an interface to provide OS-level software with access to system management functions via a set of mailbox registers.
More details on the interface can be found in chapter "7 Host System Management Port (HSMP)" of the family/model PPR Eg: https://www.amd.com/system/files/TechDocs/55898_B1_pub_0.50.zip
HSMP interface is supported on EPYC server CPU models only.
19.1. HSMP device¶
amd_hsmp driver under the drivers/platforms/x86/ creates miscdevice /dev/hsmp to let user space programs run hsmp mailbox commands.
$ ls -al /dev/hsmp crw-r--r-- 1 root root 10, 123 Jan 21 21:41 /dev/hsmp
- Characteristics of the dev node:
Write mode is used for running set/configure commands
Read mode is used for running get/status monitor commands
- Access restrictions:
Only root user is allowed to open the file in write mode.
The file can be opened in read mode by all the users.
- In-kernel integration:
Other subsystems in the kernel can use the exported transport function hsmp_send_message().
Locking across callers is taken care by the driver.
19.2. HSMP sysfs interface¶
Metrics table binary sysfs
AMD MI300A MCM provides GET_METRICS_TABLE message to retrieve most of the system management information from SMU in one go.
The metrics table is made available as hexadecimal sysfs binary file under per socket sysfs directory created at /sys/devices/platform/amd_hsmp/socket%d/metrics_bin
Note: lseek() is not supported as entire metrics table is read.
Metrics table definitions will be documented as part of Public PPR. The same is defined in the amd_hsmp.h header.
19.3. An example¶
To access hsmp device from a C program. First, you need to include the headers:
#include <linux/amd_hsmp.h>
Which defines the supported messages/message IDs.
Next thing, open the device file, as follows:
int file;
file = open("/dev/hsmp", O_RDWR);
if (file < 0) {
/* ERROR HANDLING; you can check errno to see what went wrong */
exit(1);
}
The following IOCTL is defined:
ioctl(file, HSMP_IOCTL_CMD, struct hsmp_message *msg)
The argument is a pointer to a:
struct hsmp_message { __u32 msg_id; /* Message ID */ __u16 num_args; /* Number of input argument words in message */ __u16 response_sz; /* Number of expected output/response words */ __u32 args[HSMP_MAX_MSG_LEN]; /* argument/response buffer */ __u16 sock_ind; /* socket number */ };
The ioctl would return a non-zero on failure; you can read errno to see what happened. The transaction returns 0 on success.
More details on the interface and message definitions can be found in chapter "7 Host System Management Port (HSMP)" of the respective family/model PPR eg: https://www.amd.com/system/files/TechDocs/55898_B1_pub_0.50.zip
User space C-APIs are made available by linking against the esmi library, which is provided by the E-SMS project https://developer.amd.com/e-sms/. See: https://github.com/amd/esmi_ib_library