Everyone's got their own munin plugins :)
Some have proof of concept / plaything status, partly because they aren't hugely portable.
Like the stock diskstats, but instead of per-device graphs of everything, uses per-subject graphs with all devices. And tries to use meaningful device names.
Yields four graphs:
- throughput of each devices, STACKed (read and write separately)
- IOPS of each device, STACKed
- wait time
- utilization
Three graphs:
- bad-sector details
- hours and power cycles
- temperatures
You will probably want to run visudo to add a sudoers line like:
munin ALL=(root) NOPASSWD: /usr/sbin/smartctl
TODO: more meaningfull things for SSDs. More robustness.
Attempts to record CPU use per user. Tries to categorize kernel/daemon stuff a bit, e.g. into database, web, appsupport, filesystem, services, kernel. (which can always use work, of course)
Meant to be linked as procmem_res and/or procmem_virt, to see which processes are using and/or mapping the most memory.
Just basic utilization, memory, fan, via nvidia-smi
user_cpu:
procmem_res:
smart:
diskstats: