{"id":1302,"date":"2017-05-02T22:29:35","date_gmt":"2017-05-02T22:29:35","guid":{"rendered":"https:\/\/w2.cleardb.net\/?p=1302"},"modified":"2022-09-22T14:41:42","modified_gmt":"2022-09-22T14:41:42","slug":"best-practices-for-monitoring-and-measuring-data-center-performance","status":"publish","type":"post","link":"https:\/\/www.navisite.com\/blog\/best-practices-for-monitoring-and-measuring-data-center-performance\/","title":{"rendered":"Best Practices for Monitoring and Measuring Data Center Performance"},"content":{"rendered":"

IT professionals are acutely aware of just how closely their data center infrastructure performance is tied to their business performance in our digitally driven world. Technology consumers (*employees, suppliers, customers, and prospects*) expect highly available, fast, and responsive interactions from every system they touch. As a result, IT professionals play critical roles in enabling the strategic success and tactical effectiveness of many businesses today. Accordingly, it is vital for IT to know which hardware and software metrics to monitor, and to understand how those metrics relate to each other. This knowledge enables IT to continuously optimize the infrastructure that empowers the business to achieve its goals and objectives.
In addition to knowing which metrics to monitor, cloud administrators often conduct before/after and A/B tests on pre-optimized resources, comparing their metrics with metrics from the production infrastructure. These tests gauge the effectiveness of tuning strategies and performance solutions. In public clouds, provisioning such testing resources is simple and cost-effective.
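To make such a before/after comparison concrete, here is a minimal Python sketch; the latency samples and the p95 helper are purely illustrative assumptions, not output from any particular tool:

```python
# A minimal sketch of a before/after latency comparison, assuming two lists
# of response-time samples collected from a baseline and a tuned environment.
import statistics

def p95(samples):
    """Return an approximate 95th-percentile value of a list of samples."""
    ordered = sorted(samples)
    index = max(0, int(round(0.95 * len(ordered))) - 1)
    return ordered[index]

def compare(baseline_ms, tuned_ms):
    """Print mean and p95 latency for each environment, plus the change."""
    for label, samples in (("baseline", baseline_ms), ("tuned", tuned_ms)):
        print(f"{label}: mean={statistics.mean(samples):.1f} ms, "
              f"p95={p95(samples):.1f} ms")
    delta = statistics.mean(tuned_ms) - statistics.mean(baseline_ms)
    print(f"mean latency change: {delta:+.1f} ms")

# Hypothetical samples from a load test against each environment.
compare([120, 135, 128, 300, 142], [95, 101, 98, 180, 104])
```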
The tests and metrics used to monitor the productivity of IT infrastructure generally fall into three categories: quantity measures, quality measures, and responsiveness measures. These categories apply to every layer of the IT infrastructure stack, from operating systems, CPUs, storage tiers, and networks to the efficiency and effectiveness of application code, computing services, and databases. (A combined sketch of all three categories follows the list below.)

1. **Quantity measures** track the amount of work being done by some component of the infrastructure stack. These measures are referred to as "throughput" metrics, and they are usually represented as an absolute number per unit of time. For an application, throughput is generally measured by the number of concurrent processes managed per minute or second, whereas throughput for a database server is often represented by the number of queries executed per second. For a web server, the number of client requests successfully processed per second is a common measure of throughput.
2. **Quality measures** look at the success or failure of process and application (workload) operations. Success metrics represent the percentage of total work that is processed correctly. Error metrics, in comparison, capture the number of failed or erroneous results; they are commonly expressed as an error rate per unit of time, or they are normalized by the process's throughput to yield the number of errors per unit of work.
3. **Responsiveness measures** quantify how efficiently an infrastructure component completes its work: in essence, the speed of an end-to-end operation. Such measures are generally referred to as "latency" metrics, and they are usually expressed as an average or as a percentile of processing time. Latency might measure the time from when a client issues a transaction until it receives a response, or from when a database receives a request until it queues its response. As an example, latency is often shown as the percentage of operations completed within a unit of time, such as "*97% returned within 0.3 seconds*."
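As a concrete illustration of all three categories, the following minimal Python sketch derives quantity, quality, and responsiveness metrics from a single stream of operation records; the record format and the sample values are hypothetical:

```python
# A minimal sketch, assuming each completed operation is recorded as a
# (succeeded, duration_seconds) pair over a fixed measurement window.
def summarize(operations, window_seconds):
    """Compute throughput, quality, and responsiveness from raw records."""
    total = len(operations)
    successes = sum(1 for ok, _ in operations if ok)
    durations = sorted(d for _, d in operations)

    throughput = total / window_seconds                 # quantity: ops/sec
    success_rate = 100.0 * successes / total            # quality: % successful
    error_rate = (total - successes) / window_seconds   # quality: errors/sec
    p95 = durations[max(0, int(round(0.95 * total)) - 1)]  # responsiveness

    return {
        "throughput_ops_per_sec": throughput,
        "success_rate_pct": success_rate,
        "errors_per_sec": error_rate,
        "p95_latency_sec": p95,
    }

# Hypothetical records from a 60-second window: (succeeded, seconds taken).
sample = [(True, 0.12), (True, 0.30), (False, 1.20), (True, 0.18), (True, 0.25)]
print(summarize(sample, window_seconds=60))
```

In practice, a monitoring agent accumulates these records continuously and reports the summaries at regular intervals rather than over a single window.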

The challenge in monitoring these metrics is that the performance of multiple infrastructure components is interrelated. Network capacity and speed, the number and power of CPU cores, the efficiency of application code, the level of contention for shared computing resources, and the various configurations of hypervisors, databases, and other computing services can all affect performance. As a result, focusing on just one layer of the data center infrastructure stack, without considering the multi-dimensional impact it has on the others, can negate the effectiveness of performance solutions and tuning strategies. Accordingly, multiple metrics are monitored from each category.
It is therefore very helpful to use application and system monitoring tools to stay ahead of potential issues. These tools raise alerts about application and hardware problems, often before end users notice them. Lists of various monitoring tools can be found here and here.
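As a rough illustration of what such tools do under the hood, here is a minimal polling sketch using the third-party psutil library; the 90% threshold and 30-second interval are assumed example values, and real monitoring suites (e.g., Nagios or Zabbix) implement far richer alerting logic:

```python
# A minimal threshold-alert sketch; threshold and interval are assumptions.
import time
import psutil

CPU_ALERT_THRESHOLD = 90.0  # percent; an example value, not a recommendation

def watch(poll_interval_sec=30):
    """Poll CPU usage and flag problems before end users notice them."""
    while True:
        cpu = psutil.cpu_percent(interval=1)  # utilization sampled over 1 sec
        if cpu > CPU_ALERT_THRESHOLD:
            print(f"ALERT: CPU at {cpu:.0f}% exceeds "
                  f"{CPU_ALERT_THRESHOLD:.0f}% threshold")
        time.sleep(poll_interval_sec)
```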
So, what are these tools measuring and monitoring?
As you know, computer systems have several types of physical resources (*CPU, volatile memory, network, and persistent storage*) that collectively affect data center performance. Those resources also shape application performance, and it is the level of application performance that determines how the data center is judged against its strategic goals and objectives: *a data center with low operating costs and efficient power usage is still considered a failure if it cannot protect its data or meet its applications' quantity, quality, and responsiveness targets*.
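A minimal sketch of how those four resource types can be sampled, again assuming the third-party psutil library is available:

```python
# One point-in-time reading for each physical resource type named above.
import psutil

def resource_snapshot():
    """Return a single snapshot of CPU, memory, network, and disk activity."""
    return {
        "cpu_percent": psutil.cpu_percent(interval=1),
        "memory_percent": psutil.virtual_memory().percent,
        "net_bytes_sent": psutil.net_io_counters().bytes_sent,
        "net_bytes_recv": psutil.net_io_counters().bytes_recv,
        "disk_read_bytes": psutil.disk_io_counters().read_bytes,
        "disk_write_bytes": psutil.disk_io_counters().write_bytes,
    }

print(resource_snapshot())
```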
Consequently, monitoring tools continually measure the data center's: