The Myth of Bandwidth and Application Performance
The Myth of Bandwidth and Application Performance
The Myth of Bandwidth and Application Performance
Overview Moores Law states that data density doubles approximately every 18 months, and Metcalfe's Law says that the value of a network grows as the square of the number of users. Because these postulates have held true in practice, global enterprises have found it advantageous to embed information technology into every aspect of their operations, and the worldwide data communications services industry now generates revenue in excess of $19 billion annually, with an increasing portion derived from IP VPN services. Despite a growing worldwide thirst for bandwidth, supply has outpaced demand by a wide margin. During the rapid expansion of the Internet in the 1990s, the data communications industry created an infrastructure that was capable of delivering cheap bandwidth in high volumes. In fact, bandwidth has become so plentiful that even the effects of Metcalfes Law are insufficient to consume available capacity for many years to come. The result of this imbalance has been the commoditization of bandwidth, rapidly declining bandwidth prices, and a vendor environment that has actively promoted the myth that high bandwidth can address almost any performance problem. But as enterprise application deployments have expanded to the wide area, an environment where bandwidth is sometimes as plentiful as on the LAN, IT managers have witnessed a dramatic decrease in application performance. They wonder, Why would two networks, the LAN and the WAN, with identical bandwidth capacities, deliver such different performance results? The answer is that application performance is affected by many factors, associated with both network and application logic, that must be addressed in order to achieve satisfactory application performance results. At the network level, application performance is limited by high latency (the effect of physical distance), jitter, packet loss, and congestion. At the application level, performance is further limited by natural behavior of application protocols (especially when faced with latency, jitter, packet loss, and congestion at the network level), application protocols that engage in excessive handshaking across the network links, and the serialization of the applications themselves. This white paper is intended to shed light on the issues affecting application performance in the wide area, and to give IT managers the knowledge required to design strategic enterprise application acceleration and deployment solutions. Common Application Performance Myths Myth #1: Application Performance Depends Only On Bandwidth Application performance and throughput are influenced by many factors. Latency and packet loss have a profound effect on application performance. Littles Law, the seminal description of queuing theory and an equation that models the effects of physical distance (latency) and packet loss, illustrates the impacts of these two factors on application performance. This law states: Lambda (Throughput) = n (number of outstanding requests)/t (response time) In terms of IP-based protocols, this translates to: TCP throughput = congestion window size divided by round trip time Therefore, as the round trip time (RTT) of each request increases, the congestion window must increase or TCP throughput will decrease. Unfortunately, TCP does not effectively manage large windows. As a result, even small amounts of latency and packet loss can quickly drop network performance for a given application to less than 1 megabit per second. Even if bandwidth capacity were to be increased to 100 Mbps, the application would never consume more than 1% of the total F5 Networks, Inc. -1 Nov-06
capacity. Under these conditions, managers who add network capacity waste money on a resource that cannot be consumed. The Macroscopic Behavior of the TCP Congestion Avoidance Algorithm by Mathis, Semke, Mahdavi & Ott in Computer Communication Review, 27(3), July 1997, provides a short and useful formula for the upper bound on the transfer rate: Rate = (MSS/RTT)*(1 / sqrt{p}) Where: Rate: the TCP transfer rate or throughput MSS: the maximum segment size (fixed for each Internet path, typically 1460 bytes) RTT: the round trip time (as measured by TCP) p: the packet loss rate The figure below illustrates this point:
Figure 1: TCP performance in the face of physical distance In wide area networks, sources of high round trip times (i.e., latency) include physical distance, inefficient network routing patterns, and network congestionelements that are all present in abundance on the WAN. Today, many TCP protocol stacks are highly inefficient when it comes to managing retransmissions. In fact, some stacks may have to retransmit the whole congestion window if a single packet is lost. They also tend to back off exponentially (i.e., reduce congestion windows and increase retransmission timers) in the face of network congestiona behavior that is detected by TCP as packet loss. And while loss is often insignificant in frame relay networks (less than .01% on average), it is very significant in IP VPN networks that go into and out of certain markets like China, where loss rates commonly exceed 5%. Under this latter scenario, high loss rates can have a catastrophic effect on performance. When packet loss and latency effects are combined, the performance drop-off is even more severe. The figure below illustrates this point:
F5 Networks, Inc.
-2-
Nov-06
Figure 2: TCP performance when packet loss is present Myth #2: TCP Requires Aggressive Back-Off To Ensure Fairness Many network engineers believe that aggressive back-off in the face of congestion is necessary to keep network access fair. While in some cases that statement is true, in others it is not. Where congestion control is the responsibility of each host on a network, an environment where each host has no knowledge of the other hosts bandwidth needs, aggressive back-off is necessary to ensure fairness. However, if congestion is managed within the fabric of the network, by a system that sees all traffic on a given WAN connection, then much greater and more efficient throughput is possibleand aggressive back off is not required. Standard protocol behavior specifies that when hosts consume bandwidth, they must do so independent of: The requirements of the application The amount of bandwidth is available The amount of competition that exists for that bandwidth The result is a situation where applications are often starved for bandwidth resources at the same time that the network is largely unused. This situation is obviously highly inefficient. A much better solution to the TCP fairness problem is allow individual hosts to consume as much bandwidth as they need, so long as all other hosts receive adequate service when they need it. This can be accomplished by implementing a single congestion window, shared by all hosts, that is managed within the network itself. The result is a system where hosts get the bandwidth they need in periods of light competition, and all hosts get sufficient bandwidth when competition is more intense. This single window method delivers consistently higher utilization and greater overall throughput. Hosts each see a clean, fast network that never loses packets (and therefore doesnt diminish TCP performancesee Myth #1), and cumulative traffic demands are matched to the overall buffering capability of the network. As a result, IT managers experience optimally utilized networks, under the broadest range of network latency and loss conditions.
F5 Networks, Inc.
-3-
Nov-06
Single window solutions can be constructed that are completely transparent to client systems. Components of such solutions may include TCP technologies such as selective acknowledgement, local congestion window management, improved retransmission algorithms, and packet dispersion. These capabilities are then combined with other technologies that match the throughput requirements of applications to the availability of network resources, and that track the bandwidth requirements of all hosts utilizing the network. By aggregating the throughput of multiple, parallel WAN links, this technology can achieve even greater throughput and reliability. Myth #3: Packet Compression Improves Application Performance While common packet compression techniques can reduce the amount of traffic on the WAN, they often impede application performance since they tend to add latency to application transactions. These techniques require that packets be queued up, compressed, transmitted, decompressed on the receiver, and then retransmittedall of which can take substantial resources and add substantial latency, actually slowing down the very applications that need acceleration. Next-generation application performance solutions combine protocol streamlining with transparent data reduction techniques. Compared to packet based solutions, next-generation solutions dramatically reduce the amount of data that needs to be transmitted, eliminate latency that is introduced by protocol behavior in the face of physical distance, and can drive wide area network performance at gigabit speeds. Transparent data reduction techniques often include multiple dictionaries where the level 1 dictionary is small and highly effective at reducing smaller patterns in data, and the level 2 dictionary is a multi-gigabyte space that can be used to reduce much larger patterns. Myth #4: Quality of Service Technology Accelerates Applications Quality of Service (QoS), if used properly, is a highly beneficial technology that can be helpful for managing application performance. However, the only thing that QoS can do is divide existing bandwidth into multiple virtual channels. QoS does nothing to move more data or streamline protocol behavior. QoS simply decides, in an intelligent way, which packets to drop. And while it is better to drop packets in a controlled way than to leave it to chance, dropping packets does not accelerate applications. Many QoS implementations rely on port numbers to track applications. Since applications often negotiate port assignments dynamically, these mechanisms have to be configured to reserve a large port range to ensure coverage of the ports actually used by the application. For QoS to be most effective, it should be dynamic. First-generation QoS implementations reduce large links into multiple smaller links, statically reserving bandwidth whether it is needed or not. Channelizing a network this way can ensure bandwidth availability for critical applications like voice, but actually wastes bandwidthbecause bandwidth is reserved for the specific application, even when the application is not in use. Dynamic QoS solutions, on the other hand, ensure that bandwidth is reserved only when applications can use it. One common use of this technology is to extend enterprise backup windows by enabling continuous data backup when bandwidth becomes available. Solution F5 Brings It All Together F5s application acceleration solutions deliver dramatic application performance and greatly reduced WAN costs. F5 delivers these benefits by monitoring the limiting effects of network conditions, adjusting protocol behavior, and by managing all levels of the protocol stack, from the network layer through to the application layer.
F5 Networks, Inc.
-4-
Nov-06
More specifically, F5 integrates advanced transport acceleration technologies such as adaptive TCP acceleration, Transparent Data Reduction, and session-aware QoS, with best-of-class application acceleration technologies including dynamic object caching, application smart proxies, and application smart encryption. The system is supported by a statistics generation and monitoring engine that enables real-time management of application network behavior.
Figure 3: Architecture F5 delivers LAN-like application performance over the WAN. F5 solutions accelerate applications such as ERP, CRM, email, file transfer, data replication, and other applications, resulting in predictable, fast performance for all WAN users.
Figure 4: Dual-end deployments accelerate all application traffic on the WAN F5 WAN optimization and application acceleration solutions are deployed on F5 appliance hardware. F5s data center model, the WANJet 500, features fault tolerance, massive scalability, and performance up to 622 Mbps. For branch office deployments, the WANJet 200 features fault tolerant features, silent operation and performance up to 2 Mbps.
F5 Networks, Inc.
-5-
Nov-06
Figure 5: Single ended deployments accelerate performance of enterprise web applications such as SAP, Siebel, Oracle, and corporate portals
Typical Performance Results TCP Performance without Acceleration In this example, a Windows XP client is using active FTP to get a 10 MB file from a Linux Redhat 7.3 FTP server. The link is a 2 Mbps E1 with 400 milliseconds of round trip delay, and approximates the conditions of a link from California to Asia. FTP Get Transfer Rate 0% Packet Loss 1% Packet Loss 50 KB/sec 20 KB/sec Transfer Time 250 Seconds 625 Seconds Link Utilization 20% 8%
In this scenario, it is clear that a single FTP transfer cannot drive more than 20% utilization. With 1% packet loss, performance is cut by more than half. Adding bandwidth to this link will not increase throughput in any way. TCP Performance with WANJet, TDR Disabled Transferring the same file as above through the WANJet appliance, over a network with 0% packet loss, results in a performance increase of 5x over native performance. On a network with 1% loss, performance is 12x better than native. FTP Get Transfer Rate 0% Packet Loss 1% Packet Loss 230 KB/sec 200 KB/sec Transfer Time 45 Seconds 50 Seconds Link Utilization 100% 100%
TCP Performance with WANJet, TDR Enabled Transferring the same file through the WANJet appliance, this time with F5s patented Transparent Data Reduction and congestion management algorithms enabled, results in performance improvements of 40x to 625x. The first example shows a file transfer of a 10 MB file that is 3x compressible, yielding a 40x performance improvement. The second example shows the transfer F5 Networks, Inc. -6 Nov-06
of a similar 10 MB file, this time with half the bytes modified, yielding a 78x performance increase. The third example shows a subsequent read of either the first or second file, yielding a 625x performance increase.
Transfer Rate 10 MB 3x Compressible file active FTP get 10 MB 3x Compressible file w/ perturbed data 10 MB 3x Compressible file w/ perturbed data, second read 700 KB/sec 1,500 KB/sec 10,300 KB/sec
Of particular note, these transfers show effective data rates that range from 3x to 44x of the effective rate of a native E1.
Conclusion Application performance on the WAN is affected by a large number of factors in addition to bandwidth. The notion that bandwidth solves all, or even most, application performance problems is a complete myth. At the network level, application performance is limited by high latency, jitter, packet loss, and congestion. At the application level, performance is likewise limited by factors such as: the natural behavior of application protocols that were not designed for WAN conditions; application protocols that engage in excessive handshaking; and the serialization of the applications themselves. F5s application acceleration solution recognizes the critical interdependence between applicationlevel and transport-level behavior. F5 solutions deliver predictable application performance, increased throughput ranging from 3x to over 500x, and improved application performance on networks as diverse as premium-quality, class-of-service managed networks to commodity, best efforts-based IP VPNs. F5s architectural advantages results in application performance solutions that deliver best-of-class performance, massive scalability, and a return on investment that is measured in months. About F5 F5 Networks is the global leader in Application Delivery Networking. F5 provides solutions that make applications secure, fast and available for everyone, helping organizations get the most out of their investment. By adding intelligence and manageability into the network to offload applications, F5 optimizes applications and allows them to work faster and consume fewer resources. F5s extensible architecture intelligently integrates application optimization, protects the application and the network, and delivers application reliability. Over 10,000 organizations and service providers worldwide trust F5 to keep their applications running. The company is headquartered in Seattle, Washington with offices worldwide. For more information, go to www.f5.com.
F5 Networks, Inc.
-7-
Nov-06