Академический Документы
Профессиональный Документы
Культура Документы
Agenda
Virtualization Candidate Consideration DRS and HA Overview Metrics: Performance/Capacity Capacity Planning and Performance Considerations
Create an opportunity to provide HA and D/R at reduced cost Create a single point of control
Monitor as you roll out Monitor resource usage for the HOSTS and the VMs Compare Actual vs Projected resource usage.
Projected Resource Usage will come from Analysis of List of Candidate Servers Projected CPU, Validate Memory, Validate I/O and Network
Develop Realistic Timeframes to provision additional hardware including cabling/network requirements. Incorporate Tolerances to Accommodate for Delays Ensure proper planning for Server (CPU, Memory, I/O, Network, Storage)
Ability to Scale and Load Balancing for better performance Ability to Re-size LUNs ReE.g., 22 GB VM storage and LUN size of 400 GB (find the sweet spot here) Number of VMs per LUN (12-15max) (12Accommodate for growth in VM storage requirements Accommodate for Snapshots, Swap Space (function of memory size) Minimize impact of Test on Production
Summarize the data and rank the candidates perhaps limiting servers with high I/O demand and minimizing the number of servers with large memory footprints.
Side 2- Windows requirements 2Measure the IOPs and kbytes/second during peak timeframes. For the candidate list, can the aggregate I/O throughput be met? Do you need to eliminate high I/O loads Example: If Capacity is 200 MB per second but you have a total requirement for 300 MB per second
then you would eliminate the highest I/O candidates which caused you to exceed your constraint for this phase.
Fixed/Preferred- Active/Active devices Fixed/PreferredAllows for manual balancing of LUNs between HBA s.
MRU Design
Note: No I/O balancing Between HBAsSingle Path I/O
Preferred Path
LUN0
Poor Mans I/O Load Balancing Manually Load Balance to Improve performance
LUN1
LUN2
LUN3
The VMKernel
VMkernel A high-performance highoperating system that occupies the virtualization layer and manages most of the physical resources on the hardware, including memory, physical processors, storage, and networking controllers.
Network and I/O goes through I/O queues within the virtualization layer
Support for 10GigE cards is now available with ESX Server 3.5.
Cluster
A group of hosts: In order to use HA and DRS feature, the hosts must be defined as part of a cluster to provide for load balancing and fail-over between hosts. fail-
Host
A container for virtual machines
Resource Pool
A collection of virtual machines on a host or within a cluster that possess the ability to have processor and memory resources controlled at an aggregated level.
Why?
Provides a container view: Isolate and Protect different workloads in a box . (Critical vs nonnonCritical) VMware DRS helps you balance resources across virtual machines.
http://pubs.vmware.com/vi3/wwhelp/wwhimpl/js/html/wwhelp.htm
VMware HA Feature that provides easy-to-use, cost-effective high easy-tocostavailability for applications running in virtual machines. In the event of server failure, affected virtual machines are automatically restarted on other production servers that have spare capacity. VMware VMotion Feature that enables the live migration of running virtual machines from one physical server to another with zero down time, continuous service availability, and complete transaction integrity
As a result, VMware Infrastructure administrators can now clone a virtual machine on one datacenter to another datacenter. You can also clone a virtual machine on one datacenter to a template on another datacenter. Templates can now be cloned between datacenters. You can also perform a cold migration of a virtual machine across datacenters. VMotion with local swap ESX 3.5 and VC 2.5 now provides for swap files to be stored on local storage and still provides VMotion. Users can configure a swap datastore policy at the host or cluster level, although the policy can be overwritten by the virtual machine configuration. During a VMotion migration or a failover for virtual machines with swap files on local storage, if local
storage on destination is selected, the virtual machine swap file is recreated. The creation time for the virtual machine swap file depends on local disk I/O (or if too many concurrent virtual machines are starting due to an ESX Server host failover with VMware HA).
DRS Settings are on a scale of 1-5: 1Level 5 is the most aggressive, while Level 1 makes the smallest number of migrations.
Examples of Migration Recommendations when in Manual or Partially Automatic Mode: Balance Average CPU Loads Balance Average Memory Loads Satisfy Affinity Rule Satisfy Anti-Affinity Rule
If CPU > 80%, overall VM performance will degrade due to CPU queuing. Increase in high % Ready times.
% CPU Ready time (indication of queuing) shouldn t exceed 5% consistently, or 1000ms over the default 20 second sampling interval. Anything more could indicate serious CPU contention issues.
Memory overhead
for each VM:
CPU Metrics
CPU Usage (%) This is cumulative % used across all CPUs in the host. CPU Usage in MHz This counter identifies the total CPU usage in MHz
used on
This counter is useful for normalizing % CPU for capacity planning when there are hosts with differing CPU speeds or core counts in the same cluster.
This identifies the amount of time a VM is ready to run, but cannot because VMkernel is unable to schedule the VM process on a physical CPU. High % Ready typically means there is CPU contention on the ESX host. CPU Ready (in Virtual Center per VM) This counter reflects the same information as % Ready in ESXTOP;
Virtual Center reflects this measurement in milliseconds. The VI Client can view virtual machines CPU ready time in real-time, but historical tracking requires the Virtual Center reallogging level to be set to 3.
CPU Ready time of 280ms over the default refresh interval (20000ms) %Ready=(280/20000) * 100 = 1.4%
Memory Management
3 mechanisms are utilized for memory management to expand/contract amount of memory used by VMs Transparent Page Sharing
Redundant virtual machine memory pages are "shared
The Vmkernel removes duplicate pages from physical RAM and the page table is adjusted to redirect the virtual machine's virtual page back to the page in RAM. This eliminates redundant pages in physical memory
Swapping
Used to forcibly reclaim memory (ESX decides not Guest)
Ballooning
VMmemctl module loaded into the guest operating system.
Guest OS determines what to pages/swap out Part of VMware tools, guest OS must be configured with sufficient swap space.
Memory
Memory Granted this is the amount of memory that the vmkernel has allocated to all virtual machines running on the server.
This represents a rough estimate of how much RAM will be needed for another host to spin up the VMs.
Memory Usage % - This metric identifies the memory used on the host as a percentage, based on Memory Consumed divided by the total memory in the ESX host. Memory Consumed How much memory is actually being used by the VMs.
This takes into account transparent page sharing, zero pages, vmkernel and service console memory usage, and virtualization overhead.
Memory Balloon This counter should be zero under normal circumstances. Otherwise, it indicates the system is under memory constraints and has begun borrowing memory from virtual machines to meet demands. Memory Swap Used This counter should always be zero. Memory swapping is used as a last resort, so if swapping currently exists on the host it means memory is severely overcommitted. Memory Zero The amount of memory that is not being used by the allocation
This metric can be helpful in identifying VMs that may have been configured for more RAM than necessary
CPU usage for host is exceeding levels-DRS will re-balance for you levelsreSo no need to balance VMs across other hosts Monitor resource usage over time to observe trends for the host and for each of the VMs.
Monitor CPU usage to profile workloads and try to consider a view of large , medium and small . Identify if the large consumers and determine if tuning is required. Did you plan for these large consumers in your ROI projection?
25 20 15 P c 10 5
14 VMs
6 VMs
1000 500 0
0 P P 1 10/13/2007 8:0010/19/2007 8:0010/25/2007 8:0010/31/2007 8:00 P1/6/2007 7:00 P P T CP Us (A /Rate) CPU Usage in MHz (A erage/Rate)
MHz
Measure CPU usage, and overhead as you load Measure the memory consumption Measure I/O throughput and I/O response time as the load increases. (Use data from SAN tools)
Remember don t have true MPIO
CPU Capacity
Monitor Physical CPU usage and %Ready for all VMs If %Ready >5% and starts to approach 10% delay the host is saturated . It doesn t have to be running at 100% physical utilization. Monitor usage of VMkernel Monitor System Services (only dispatchable on CPU 0)
Review VM memory metrics from within the VMs memory statistics e.g., Windows metrics.
Paging, Available Memory, Committed bytes, Total Working set
Windows Metrics
Use Information as an Indicator Measurements may not be repeatable
CPU: Processor Metrics System, User and Total Processor Utilization etc Memory: Available bytes, Page reads/second, Page Ins/second, Virtual/Real bytes Network
Disk Reads and Writes/second, Read and Write bytes/second, Seconds/Read, Seconds/Write, Disk utilization Process: NEED PROCESS DATA CPU Working set size TO KNOW WHO is using Read/Write bytes per second resources Database- SQL Database Reads/Writes per instance, Stored Procedure Timings Log Bytes flushed per database
Virtualization Planning
Virtual Infrastructure offers many benefits:
More efficient use of resources Reduction in server proliferation but can lead to virtual sprawl Reduction in capital expenditures for hardware (test/development)
Need to know that virtualization is NOT FREE! Can t just add more and more VMs and Storage is more expensive
Identify the candidates for virtualization and develop standards for their deployment.
1 VCPU per application/VM Identify target utilizations for the VMs (% proc, memory etc.) Utilize templates to clone VMs
E.g., one template with SQL server build, another template with only standard Windows build
However, you may want to look at reserving memory as a minimum level for some applications/VMs
Remember: Reservations will also impact HA capacity.
Monitor resource usage over time and perform workload balancing based on business information and resource usage Understand overhead as VMs are added to host and cluster Ensure Sufficient capacity for fail-over for HA failManage resources at the cluster level.