Topic

Latency (engineering)

About: Latency (engineering) is a research topic. Over the lifetime, 7278 publications have been published within this topic receiving 115409 citations. The topic is also known as: lag.

...read moreread less

Papers published on a yearly basis

1 / 2

Papers

PDF

Open Access

More filters

Book•

Limits on interconnection network performance

[...]

Anant Agarwal

01 Jun 1994

TL;DR: A simple closed-form expression for contention in buffered, direct networks is derived and found to agree closely with simulations, and it is shown that a much larger fraction of the resulting performance improvement arises from the reduction in bandwidth requirements than from the decrease in latency.

...read moreread less

Abstract: The latency of direct networks is modeled, taking into account both switch and wire delays. A simple closed-form expression for contention in buffered, direct networks is derived and found to agree closely with simulations. The model includes the effects of packet size and communication locality. Network analysis under various constraints and under different workload parameters reveals that performance is highly sensitive to these constraints and workloads. A two-dimensional network is shown to have the lowest latency only when switch delays and network contention are ignored; three- or four-dimensional networks are favored otherwise. If communication locality exists, two-dimensional networks regain their advantage. Communication locality decreases both the base network latency and the network bandwidth requirements of applications. It is shown that a much larger fraction of the resulting performance improvement arises from the reduction in bandwidth requirements than from the decrease in latency. >

...read moreread less

494 citations

Journal Article•DOI•

Energy-Latency Tradeoff for Energy-Aware Offloading in Mobile Edge Computing Networks

[...]

Jiao Zhang¹, Xiping Hu², Zhaolong Ning³, Edith C.-H. Ngai⁴, Li Zhou¹, Jibo Wei¹, Jun Cheng², Bin Hu⁵ - Show less +4 more•Institutions (5)

National University of Defense Technology¹, Chinese Academy of Sciences², Dalian University of Technology³, Uppsala University⁴, Lanzhou University⁵

01 Aug 2018-IEEE Internet of Things Journal

TL;DR: An energy-aware offloading scheme, which jointly optimizes communication and computation resource allocation under the limited energy and sensitive latency, and an iterative search algorithm combining interior penalty function with D.C. (the difference of two convex functions/sets) programming to find the optimal solution.

...read moreread less

Abstract: Mobile edge computing (MEC) brings computation capacity to the edge of mobile networks in close proximity to smart mobile devices (SMDs) and contributes to energy saving compared with local computing, but resulting in increased network load and transmission latency. To investigate the tradeoff between energy consumption and latency, we present an energy-aware offloading scheme, which jointly optimizes communication and computation resource allocation under the limited energy and sensitive latency. In this paper, single and multicell MEC network scenarios are considered at the same time. The residual energy of smart devices’ battery is introduced into the definition of the weighting factor of energy consumption and latency. In terms of the mixed integer nonlinear problem for computation offloading and resource allocation, we propose an iterative search algorithm combining interior penalty function with D.C. (the difference of two convex functions/sets) programming to find the optimal solution. Numerical results show that the proposed algorithm can obtain lower total cost (i.e., the weighted sum of energy consumption and execution latency) comparing with the baseline algorithms, and the energy-aware weighting factor is of great significance to maintain the lifetime of SMDs.

...read moreread less

467 citations

Proceedings Article•

Less is more: trading a little bandwidth for ultra-low latency in the data center

[...]

Mohammad Alizadeh¹, Abdul Kabbani², Tom Edsall³, Balaji Prabhakar¹, Amin Vahdat², Masato Yasuda⁴ - Show less +2 more•Institutions (4)

Stanford University¹, Google², Cisco Systems, Inc.³, NEC⁴

25 Apr 2012

TL;DR: The HULL (High-bandwidth Ultra-Low Latency) architecture is presented to balance two seemingly contradictory goals: near baseline fabric latency and high bandwidth utilization and results show that by sacrificing a small amount of bandwidth, HULL can dramatically reduce average and tail latencies in the data center.

...read moreread less

Abstract: Traditional measures of network goodness--goodput, quality of service, fairness--are expressed in terms of bandwidth. Network latency has rarely been a primary concern because delivering the highest level of bandwidth essentially entails driving up latency--at the mean and, especially, at the tail. Recently, however, there has been renewed interest in latency as a primary metric for mainstream applications. In this paper, we present the HULL (High-bandwidth Ultra-Low Latency) architecture to balance two seemingly contradictory goals: near baseline fabric latency and high bandwidth utilization. HULL leaves 'bandwidth headroom' using Phantom Queues that deliver congestion signals before network links are fully utilized and queues form at switches. By capping utilization at less than link capacity, we leave room for latency sensitive traffic to avoid buffering and the associated large delays. At the same time, we use DCTCP, a recently proposed congestion control algorithm, to adaptively respond to congestion and to mitigate the bandwidth penalties which arise from operating in a bufferless fashion. HULL further employs packet pacing to counter burstiness caused by Interrupt Coalescing and Large Send Offloading. Our implementation and simulation results show that by sacrificing a small amount (e.g., 10%) of bandwidth, HULL can dramatically reduce average and tail latencies in the data center.

...read moreread less

463 citations

Book•

Selective Attention in Vision

[...]

A. H. C. van der Heijden

24 Oct 1991

TL;DR: This paper presents a meta-analysis of single-item and multiple-item detection and recognition tasks with Latency as the Dependent Variable and its applications in information processing and reinforcement learning.

...read moreread less

Abstract: 1. The Approach 2. The Topic 3. Single-item Detection and Recognition Tasks with Latency as the Dependent Variable 4. Single-item Detection and Recognition Tasks with Accuracy as the Dependent Variable 5. Multiple-item Detection and Recognition Tasks with Accuracy as the Dependent Variable 6. Multiple-item Detection and Recognition Tasks With Latency as the Dependent Variable 7. Information Processing 8. Attention, Expectation and Intention

...read moreread less

454 citations

Proceedings Article•DOI•

TIMELY: RTT-based Congestion Control for the Datacenter

[...]

Radhika Mittal¹, Nandita Dukkipati², Emily Blem², Hassan M. G. Wassel², Monia Ghobadi³, Amin Vahdat², Yaogong Wang², David Wetherall², David Zats² - Show less +5 more•Institutions (3)

University of California, Berkeley¹, Google², Microsoft³

17 Aug 2015

TL;DR: TIMELY is the first delay-based congestion control protocol for use in the datacenter, and it achieves its results despite having an order of magnitude fewer RTT signals than earlier delay- based schemes such as Vegas.

...read moreread less

Abstract: Datacenter transports aim to deliver low latency messaging together with high throughput. We show that simple packet delay, measured as round-trip times at hosts, is an effective congestion signal without the need for switch feedback. First, we show that advances in NIC hardware have made RTT measurement possible with microsecond accuracy, and that these RTTs are sufficient to estimate switch queueing. Then we describe how TIMELY can adjust transmission rates using RTT gradients to keep packet latency low while delivering high bandwidth. We implement our design in host software running over NICs with OS-bypass capabilities. We show using experiments with up to hundreds of machines on a Clos network topology that it provides excellent performance: turning on TIMELY for OS-bypass messaging over a fabric with PFC lowers 99 percentile tail latency by 9X while maintaining near line-rate throughput. Our system also outperforms DCTCP running in an optimized kernel, reducing tail latency by $13$X. To the best of our knowledge, TIMELY is the first delay-based congestion control protocol for use in the datacenter, and it achieves its results despite having an order of magnitude fewer RTT signals (due to NIC offload) than earlier delay-based schemes such as Vegas.

...read moreread less

442 citations

Collapse

Network Information

Performance

Metrics

7,278

Papers

130,965

Citations

No. of papers in the topic in previous years
Year	Papers
2022	2
2021	485
2020	529
2019	533
2018	500
2017	405

Latency (engineering)

Papers published on a yearly basis

Papers

Trending Questions (10)

Network Information

Related Topics (5)

Performance

Metrics