Follow along with the video below to see how to install our site as a web app on your home screen.
Note: This feature may not be available in some browsers.
lol.... A single VM with 32 logical CPUs and 96 gigs of RAM?
There must be a story to that. There are some loads/configs that are just better off not virtualized.
Somebody has major CPU contention
That's some funny shit right there. 71% RDY is the more important stat on the maximum, and basically 20% ready the rest of the time. The Wait time and the Idle time are pretty close which if I understand correctly, means the VM is doing nothing. Wait% - Idle% = the time the VM is waiting on the VMKernel. It's still busting NUMA nodes and the CO-STOP has to be astronomical.
Maybe not. Wait time doesn't = cpu contention. Halo is correct the wait and idle time is nearly identical which would likely mean the Vm isn't doing anything.
Yeah i agree why you has a Vm with 32 cpu's?
Maybe not. Wait time doesn't = cpu contention. Halo is correct the wait and idle time is nearly identical which would likely mean the Vm isn't doing anything.
Where our you getting 71% ready times from?
40k ms indicates cpu contention, especially for a single VM. If you talking about an 8 proc host then that's different. But a single VM with 40k ms ready time is insane.
I would like to see what ops manager says the ready % is.
%WAIT
Percentage of time the resource pool, virtual machine, or world spent in the blocked or busy wait state. This percentage includes the percentage of time the resource pool, virtual machine, or world was idle.
%IDLE
Percentage of time the resource pool, virtual machine, or world was idle. Subtract this percentage from %WAIT to see the percentage of time the resource pool, virtual machine, or world was waiting for some event. The difference, %WAIT- %IDLE, of the VCPU worlds can be used to estimate guest I/O wait time. To find the VCPU worlds, use the single-key command e to expand a virtual machine and search for the world NAME starting with "vcpu". (The VCPU worlds might wait for other events in addition to I/O events, so this measurement is only an estimate.)
http://kb.vmware.com/selfservice/mi...nguage=en_US&cmd=displayKC&externalId=2002181
Real time = CPU Summation value / 200 = % Ready
In this case of maximum = 14207 / 200 = 71% Ready
In this case of average = 3977 / 200 = 19% Ready
CSTP% will show the SMP contention. That's what you need to look at. I believe the ESXTOP with the -m option will show the NUMA issue. %WAIT and %IDLE are nearly identical. The VM is doing nothing or very little. The high Ready times are a bi-product of overuse of vCPUs.
The VM is not using all that CPU is my bet. Show me Usage %, then drop the number of VCPU's accordingly.
Correct me if I'm wrong but that 71% then has to be divided by 32 since the graph I posted above is a summation total and the VM has 32 CPU's.
71 / 32 = 2.22%
You are correct. This is a SQL box and the CPU Idle and Wait are nearly identical, the CPU's are not being used much at all. I'm stuck trying to convince management and development team that more does not equal better. Unfortunately this is how a large part of their entire virtual environment is setup. Given the industry this is in they have massive amounts of money to spend and do so freely so when they think they need more horsepower.
I have many clusters here, but this one cluster where this particular SQL VM sits has 16 hosts (Dell M820's), 512 processors (excluding HT) and 8 TB of memory total. For the entire cluster CPU usage of all the totals total average is less then 20%. So CPU certainly doesn't seem to be a contention but performance could greatly be increased if the vCPU count was lowered. However when the developers see a spike in CPU usage on the Windows guest they freak out and want more CPU's. I guess it's only a matter of time before they set it to 64 cpu's.
At any rate of all the virtual environments I've worked with I've never see WAIT or ILDE time with such an extreme number!
You are correct. This is a SQL box and the CPU Idle and Wait are nearly identical, the CPU's are not being used much at all. I'm stuck trying to convince management and development team that more does not equal better. Unfortunately this is how a large part of their entire virtual environment is setup. Given the industry this is in they have massive amounts of money to spend and do so freely so when they think they need more horsepower.
I have many clusters here, but this one cluster where this particular SQL VM sits has 16 hosts (Dell M820's), 512 processors (excluding HT) and 8 TB of memory total. For the entire cluster CPU usage of all the totals total average is less then 20%. So CPU certainly doesn't seem to be a contention but performance could greatly be increased if the vCPU count was lowered. However when the developers see a spike in CPU usage on the Windows guest they freak out and want more CPU's. I guess it's only a matter of time before they set it to 64 cpu's.
At any rate of all the virtual environments I've worked with I've never see WAIT or ILDE time with such an extreme number!
However when the developers see a spike in CPU usage on the Windows guest they freak out and want more CPU's. I guess it's only a matter of time before they set it to 64 cpu's.
I serve both the developer and sys admin roles for the system I work on, as does each person on my tiny team. All I can say is wat.
I feel like in a perfect world, developers need software/hardware experience and sys admins need dev experience. That will of course, never happen though. I remember a previous job where I overhead a senior dev asking someone what RAID was.
Yeah that makes sense since it's all the processors. I suppose if it's not causing contention with any other VMs there's not much to complain about. I have VMs, not that big, that need everything you can give them but only for a few hours. Then they sit mostly idle the rest of the time. Seems like such a waste but I guess it's a means to end to make the workload more mobile.
Aren't you the "certified guy" on staff? Why would they even question you in the first place? Sounds backwards.
Be definitive and say it doesn't work that way. If they don't agree then ask what they pay you for.
And I don't know how the OP hasn't stroked out. I had a vendor tell me this week that their software required two cores, one for the OS and one for their software. I lost sight in my left eye for a couple minutes after reading that.