WEB SERVER LATENCY REDUCTION STUDY
This paper investigates the characteristics of web server response delay in order to understand and evaluate optimisation techniques for reducing latency. The latency behaviour of a multi-process Apache HTTP server was analysed for different thread counts and various workloads. The results indicate that an insufficient number of threads available to the server for handling concurrent client requests is responsible for increased latency under various loads. The problem can be addressed by a modified web server configuration that reduces the response time.
web server; latency; thread
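The relationship between thread count and latency stated in the abstract can be illustrated with a small queueing sketch. It is a toy model written for illustration only, not the measurement setup or the server configuration used in the study: requests are served first-come, first-served by a fixed pool of workers, every request takes the same service time, and the names `simulate_latencies`, `num_workers`, `service_time` and `arrival_gap` are hypothetical.

```python
import heapq


def simulate_latencies(num_requests, num_workers, service_time, arrival_gap=0.0):
    """Return the latency of each request in a FIFO queue served by a fixed pool.

    Assumptions (illustrative only): identical service time for every request,
    evenly spaced arrivals, and no overhead for scheduling or context switches.
    """
    # Min-heap of the times at which each worker next becomes free.
    free_at = [0.0] * num_workers
    heapq.heapify(free_at)

    latencies = []
    for i in range(num_requests):
        arrival = i * arrival_gap
        # The request is taken by whichever worker becomes free first.
        worker_free = heapq.heappop(free_at)
        start = max(arrival, worker_free)
        finish = start + service_time
        heapq.heappush(free_at, finish)
        # Latency = time spent waiting for a worker + service time.
        latencies.append(finish - arrival)
    return latencies


def mean(values):
    return sum(values) / len(values)


if __name__ == "__main__":
    # Eight simultaneous requests, one time unit of work each.
    for workers in (2, 4, 8):
        result = simulate_latencies(8, workers, service_time=1.0)
        print(workers, "workers: mean latency", mean(result))
```

In this model, eight simultaneous requests of one time unit each have a mean latency of 2.5 units with two workers, 1.5 with four, and 1.0 with eight. The extra delay with fewer workers is pure queueing time, which is the effect the abstract attributes to an insufficient thread count. A real server adds costs this sketch ignores, such as memory use and context switching per thread.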
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.