Maintaining your vSphere environment is key to keeping your users happy. It is essential to keep the environment secure, stable, and performance at its best. The VMware Skyline Health Diagnostics Appliance is a tool that will assess and make recommendations for vCenter, ESXi, and vSAN.
Installing and maintaining the Skyline Health Diagnostics Appliance
The Skyline Health Diagnostics OVA Image is available for vSphere 6.5 and above. Once downloaded, you can deploy to a vCenter. You then use a browser to connect to https://vmware-shd_ip_address_or_fqdn. The tool’s version and compatibility database are both updated frequently. Before running a collect and analyze log bundles, I always update both, so I have the most current information.
- Settings > Tool Update > Check Tool Updates
- Settings > VCG Update > Update VCG Database (this process takes some time)
Collect and analyze log bundles
The tool can detect issues in both vSphere and vSAN environments. It will check for issues and provide KB articles for resolution to any issues detected. It also compares driver and firmware versions you have and compares them to the VMware Compatibility Guide database. First step is either to Collect Logs & Analyze, or you can upload existing log bundles to be analyzed.
- You can choose which plugins: Diagnostics, VMware Security Advisory or vSAN Health. I typically choose all 3.
- With the Collect Logs and Analyze you can choose to include vCenter and pick hosts for analysis.
- When the analysis is complete, you can view the report. A new tab will open with the report. You can also choose to Save the report to an html file. New to version 2.0.5 is an option to delete old reports.
Here is an example of a detected issue:
DIAGNOSTICS.Storage.KB67667: Memory allocation failure for “smartpqi” driver can result in host not responding state. KB Number: 67667. Resolution: This issue is fixed with the version 1.0.3 of driver “smartpqi”. Drivers prior to version 1.0.3 can work with memory allocated within 4GB Range. If there is not free allocatable memory available below 4GB range, driver related operations will fail. Please read KB: https://kb.vmware.com/s/article/67667 for more details/resolution. Fix Available In: smartpqi – 1.0.3
Here are a couple examples of driver version issues:
[WARNING] Current Driver i40en-188.8.131.52-2vmw.6184.108.40.20620388 is part of supported list. But not recent one. Recent as per VCG: 220.127.116.11
[WARNING] Driver Version smartpqi-18.104.22.1683-28vmw.622.214.171.12420388 is lower than recommended by VCG. Minimum: 126.96.36.1998
Full documentation is posted on the VMware website here; https://docs.vmware.com/en/VMware-Skyline-Health-Diagnostics/index.html
Need more information? Email firstname.lastname@example.org, we are happy to help.