Improving Performance of QUIC in WiFi
Summary (3 min read)
Introduction
- With the exponential growth in adoption of mobile phones and other smart connected devices, the usage of wireless networks continues to grow.
- On the other hand, efforts from content providers in developing customized mobile versions of websites and new low-latency transport protocols such as QUIC have contributed to improve user experience.
- The authors start from the observation that Chromium’s implementation of QUIC (version 39) has suboptimal performance in WiFi.
- As mentioned above, frame aggregation is a key feature to achieve high throughput in recent 802.11 standards and burstiness increases aggregation opportunities.
B. QUIC protocol
- QUIC provides several cross-layer enhancements, covering the weaknesses of TCP for transporting web content.
- QUIC also provides an improved congestion controller, better RTT estimation, and a better loss recovery mechanism than TCP.
- This was the default mode of QUIC in Chromium at the time of writing.
- In either mode, if an out-of-order packet or a previously missing packet is received, the ACK is sent without any delay to inform the sender immediately about it.
- Reducing traffic burstiness is known to prevent congestion and, as a consequence, reduce the undesirable effects of packet loss.
C. Transport protocols enhancements for WiFi
- To the best of their knowledge, no previous work has evaluated the performance of QUIC in WiFi by exploring its interactions with the wireless medium and 802.11 enhancements such as frame aggregation.
- Many works propose to reduce TCP acknowledgement frequency in wireless networks.
- Oliveira el. al [12] propose Dynamic Adaptive Acknowledgement where the delay window is adjusted according to the channel condition.
- Reducing the number of ACKs saves wireless resources and reduces interferences with other packets.
- The authors show, for example, that since QUIC runs on user-space it incurs performance penalties particularly for mobile devices that are usually constrained by processing power.
III. METHODOLOGY
- The authors now describe their methodology, covering their customization to Chromium’s QUIC implementation (Sec. III-A), their test environment (Sec. III-B and Sec. III-C) and the performance metrics used in the experiments (Sec. III-D).
- The results presented in the evaluation have been obtained parsing traces captured with tcpdump.
C. Wireless community network
- The authors second testbed is a production wireless community network deployed in a neighborhood of the city of Barcelona called Sants [16].
- The nodes use the linux/openwrt [18] based distribution provided by the Quick Mesh Project (QMP) [19], which runs the BMX6 mesh routing protocol [20].
- QMPSU is part of a larger community network started in 2004, which has more than 30.000 operative nodes called Guifi.net [21].
- Fig. 1 shows the geographic location of active nodes and links, using distinct colors to represent wireless links configured with different channels.
- There are also a number of point-to-point links using Ubiquiti parabolic antennas running the original manufacturer firmware.
D. Measuring performance
- The authors have selected 10 websites from Alexa’s top 100 list, downloaded their landing pages and other publicly available pages and hosted them on their servers.
- The selected websites are a mix of social networks, online shopping, news and search engines.
- The main characteristics of the cloned pages are summarized in Tab.
- The authors load these pages from the clients using the default Chromium QUIC implementation and BQUIC.
- The authors parse the HAR file and the captured traffic to calculate various metrics such as the page load time (PLT), throughput, and packet inter-arrival time over 30 runs.
A. Bulk transfer throughput
- III shows the mean values of the measured throughput and end-to-end % loss obtained by downloading their 10 MB 1https://github.com/cyrus-and/chrome-har-capturer synthetic web page during the 100 runs.
- The losses have been computed by comparing the identification field of the IP header of transmitted and received datagrams.
- This is mainly because the antenna gain of the RPi is higher than in the smartphone, and thus, the network card can use MCS with higher bitrates during the transfer.
- The throughput gain of BQUIC over QUIC is similar for both devices (26% and 23% in the smartphone and the RPi, respectively).
- Despite these differences and the large variations of measured throughput between mesh nodes (1.97 Mbps for RP2 and 20.9 Mbps for RP4 with QUIC) the authors observe significant performance improvements (between 20% and 31%) in all cases.
B. Web page load time
- In order to see the impact of BQUIC upon different types of web browsing the authors perform experiments using the cloned websites.
- Fig. 2 shows the mean PLT for various cloned web pages which are summarized in Tab.
- Using BQUIC the authors observe a decrease in PLT for all websites ranging from 5% for small web pages such as Google and Live up to 25% for large web pages such as Amazon and Facebook.
- The authors can see that the larger the page is, the larger is the reduction of the PLT.
- This is an expected result, since the connection establishment, which includes the exchange of certificates, has a larger relative overhead for small web pages.
C. Detailed analysis
- To better understand the difference in behavior of QUIC and BQUIC, the authors now perform a detailed analysis for one of the experimental runs from Section IV-A with RP5 node.
- The authors observe similar trends for the other nodes, albeit to different extent.
- In QUIC the slow start phase exits when increasing delay is detected.
- Fig. 4 shows the ACK reception (blue vertical bar) and packet transmission (yellow circle) at the sender side during an interval of 20 ms, also known as 2) Sequence-Acknowledgement analysis.
- Note that, despite BQUIC being more bursty than QUIC at packet level, as shown before, Fig. 6 depicts similar variations of throughput at larger time scale.
V. CONCLUSIONS
- The authors analyzed the performance of QUIC in WiFi, investigating the interactions of the protocol with 802.11 frame aggregation.
- The authors first highlighted that Chromium’s QUIC (v.39) delivers sub-optimal throughput in typical WiFi scenarios.
- The root-cause is the way QUIC paces packets in the network and its acknowledgment mechanisms.
- The authors carried out experiments using both a controlled testbed and a production WMN.
Did you find this useful? Give us your feedback
Citations
9 citations
2 citations
Cites background from "Improving Performance of QUIC in Wi..."
...[392] showed that QUIC achieves sub-optimal throughput in WiFi networks....
[...]
1 citations
References
610 citations
"Improving Performance of QUIC in Wi..." refers background in this paper
...In a recent measurement study [3], it was estimated that around 7% of the Internet traffic is QUIC....
[...]
422 citations
"Improving Performance of QUIC in Wi..." refers background in this paper
...11 standards, have undergone an enormous evolution in the recent years [2]....
[...]
380 citations
"Improving Performance of QUIC in Wi..." refers background in this paper
...Their impact on throughput have also been extensively evaluated [7], [6], [8]....
[...]
...A detailed description of these concepts can be found in [6]....
[...]
239 citations
"Improving Performance of QUIC in Wi..." refers background in this paper
...Their impact on throughput have also been extensively evaluated [7], [6], [8]....
[...]
144 citations
"Improving Performance of QUIC in Wi..." refers background in this paper
...al [10] investigated the impact of increasing the TCP delayed acknowledgement mechanism to more than two segments as recommended by RFC 1122....
[...]
Related Papers (5)
Frequently Asked Questions (15)
Q2. What are the contributions in "Improving performance of quic in wifi" ?
In this paper the authors experimentally analyze the performance of QUIC in WiFi networks. The authors perform experiments using both a controlled WiFi testbed and a production WiFi mesh network. In particular, the authors study how QUIC interplays with MAC layer features such as IEEE 802. 11 frame aggregation. The authors show that the current implementation of QUIC in Chromium achieves sub-optimal throughput in wireless networks. Indeed, burstiness in modern WiFi standards may improve network performance, and the authors show that a Bursty QUIC ( BQUIC ), i. e., a customized version of QUIC that is targeted to increase its burstiness, can achieve better performance in WiFi.
Q3. What is the effect of a burst of packets on the server?
since a large window is acknowledged by each ACK and pacing is disabled, a burst of packets is released shortly after an ACK is received.
Q4. What is the effect of reducing the ACK frequency on traffic?
lowering the ACK frequency increases burstiness of traffic as the sender releases a micro burst of packets after receiving the cumulative ACK.
Q5. What is the role of TCP acknowledgment in wireless networks?
Considering the role of TCP acknowledgments for the protocol reliability, reducing the acknowledgement frequency and performing delayed cumulative acknowledgements may provide benefits in wireless networks.
Q6. What distribution is used to run the BMX6 mesh protocol?
The nodes use the linux/openwrt [18] based distribution provided by the Quick Mesh Project (QMP) [19], which runs the BMX6 mesh routing protocol [20].
Q7. What are the main concerns about bursty traffic?
There are concerns about the impact of high burstiness on packet drops and queuing delays in wired networks, particularly in long Internet paths.
Q8. How do the authors load the pages from the client?
To automate the page loading the authors use Chrome-HAR-capturer1 to connect to remote clients in the lab or WMN and repeatedly load the pages multiple times while capturing traffic at both client and server sides.
Q9. What is the proposed TCP with adaptive delayed acknowledgement?
Singh et. al [11] propose TCP with adaptive delayed acknowledgement, which aims to reduce the number of ACKs to one per congestion window.
Q10. What are the two acknowledgment modes of QUIC?
Two aspects of QUIC implementation particularly influence how the protocol interacts with 802.11 frame aggregation: (i) acknowledgment modes, and (ii) packet pacing.1) QUIC acknowledgment modes: Chromium’s implementation of QUIC includes two acknowledgment modes:• TCP ACKING:
Q11. What is the main reason why QUIC is not a good choice for mobile devices?
The authors show, for example, that since QUIC runs on user-space it incurs performance penalties particularly for mobile devices that are usually constrained by processing power.
Q12. What is the proposed method of asynchronous acknowledgment?
Oliveira el. al [12] propose Dynamic Adaptive Acknowledgement where the delay window is adjusted according to the channel condition.
Q13. Why is the RPi higher throughput than the smartphone?
This is mainly because the antenna gain of the RPi is higher than in the smartphone, and thus, the network card can use MCS with higher bitrates during the transfer.
Q14. How much does the standard deviation of the throughput increase in QUIC?
the standard deviation of the throughput measured at 50 ms intervals increases only from 8.5 in QUIC to 9.1 in BQUIC (7%).
Q15. What is the pacing rate of QUIC?
The pacing rate is decided by QUIC on the fly depending on the link conditions such as bandwidth, RTT etc., and varies between different nodes in the mesh and even different runs using the same node.