Zabbix
Why zabbix? Not AI related, gather hardware metrics like GPU usage, temperature, power consumption, used by grafana.
http ingress: http://zabbix.uaiso.lan
wget https://repo.zabbix.com/zabbix/7.4/release/ubuntu/pool/main/z/zabbix-release/zabbix-release_latest_7.4+ubuntu24.04_all.deb
dpkg -i zabbix-release_latest_7.4+ubuntu24.04_all.deb
apt update
apt install zabbix-agent2
apt install zabbix-agent2-plugin-nvidia-gpu
edit config /etc/zabbix/zabbix_agent2.conf
Server=0.0.0.0/0
start agent
systemctl restart zabbix-agent2
systemctl enable zabbix-agent2