Topic

Latency (engineering)

About: Latency (engineering) is a research topic. Over its lifetime, 3,729 publications have been published within this topic, receiving 39,210 citations. The topic is also known as: lag.

Papers

Journal Article

High-performance multi-queue buffers for VLSI communications switches

Yuval Tamir¹, G. L. Frazier¹

University of California, Los Angeles¹

17 May 1988

TL;DR: This work presents a new design of buffers that provide non-FIFO message handling and efficient storage allocation for variable size packets through the use of linked lists managed by a simple on-chip controller and shows that the new buffer outperforms its "competition" and can be used to improve the performance of a wide variety of systems currently using less efficient buffers.

Abstract: Small n×n switches are key components of multistage interconnection networks used in multiprocessors as well as in the communication coprocessors used in multicomputers. The design of the internal buffers in these switches is of critical importance for achieving high-throughput, low-latency communication. We discuss several buffer structures and compare them in terms of implementation complexity and their ability to deal with variations in traffic patterns and message lengths. We present a new design of buffers that provide non-FIFO message handling and efficient storage allocation for variable size packets through the use of linked lists managed by a simple on-chip controller. We evaluate the new buffer design by comparing it to several alternative designs in the context of a multi-stage interconnection network. Our modeling and simulations show that the new buffer outperforms its “competition” and can thus be used to improve the performance of a wide variety of systems currently using less efficient buffers.

338 citations

Journal Article

Dynamic Task Offloading and Resource Allocation for Ultra-Reliable Low-Latency Edge Computing

Chen-Feng Liu¹, Mehdi Bennis¹, Merouane Debbah², H. Vincent Poor³

University of Oulu¹, Université Paris-Saclay², Princeton University³

11 Feb 2019-IEEE Transactions on Communications

TL;DR: In this article, the authors propose a new system design in which probabilistic and statistical constraints are imposed on task queue lengths by applying extreme value theory. The aim is to minimize users' power consumption while trading off the allocated resources for local computation and task offloading.

Abstract: To overcome devices’ limitations in performing computation-intense applications, mobile edge computing (MEC) enables users to offload tasks to proximal MEC servers for faster task computation. However, the current MEC system design is based on average-based metrics, which fails to account for the ultra-reliable low-latency requirements in mission-critical applications. To tackle this, this paper proposes a new system design, where probabilistic and statistical constraints are imposed on task queue lengths, by applying extreme value theory. The aim is to minimize users’ power consumption while trading off the allocated resources for local computation and task offloading. Due to wireless channel dynamics, users are reassociated to MEC servers in order to offload tasks using higher rates or accessing proximal servers. In this regard, a user–server association policy is proposed, taking into account the channel quality as well as the servers’ computation capabilities and workloads. By marrying tools from Lyapunov optimization and matching theory, a two-timescale mechanism is proposed, where a user–server association is solved in the long timescale, while a dynamic task offloading and resource allocation policy are executed in the short timescale. The simulation results corroborate the effectiveness of the proposed approach by guaranteeing highly reliable task computation and lower delay performance, compared to several baselines.

297 citations

Journal Article

Short Block-Length Codes for Ultra-Reliable Low Latency Communications

Mahyar Shirvanimoghaddam¹, Mohammad Mohammadi¹, Rana Abbas¹, Aleksandar Minja², Chentao Yue¹, Balazs Matuz³, Guojun Han⁴, Zihuai Lin¹, Wanchun Liu¹, Yonghui Li¹, Sarah J. Johnson⁵, Branka Vucetic¹

University of Sydney¹, University of Novi Sad², German Aerospace Center³, Guangdong University of Technology⁴, University of Newcastle⁵

01 Feb 2019-IEEE Communications Magazine

TL;DR: In this paper, the authors provide an overview of channel coding techniques for URLLC, compare them in terms of performance and complexity, and identify several important research directions, which are discussed in more detail.

Abstract: This article reviews state-of-the-art channel coding techniques for URLLC. The stringent requirements of URLLC services, such as ultrahigh reliability and low latency, have made it the most challenging feature of 5G mobile networks. The problem is even more challenging for services beyond the 5G promise, such as tele-surgery and factory automation, which require latencies less than 1 ms and packet error rates as low as 10⁻⁹. This article provides an overview of channel coding techniques for URLLC and compares them in terms of performance and complexity. Several important research directions are identified and discussed in more detail.

293 citations

Proceedings Article

Mencius: building efficient replicated state machines for WANs

Yanhua Mao¹, Flavio Junqueira², Keith Marzullo¹

University of California, San Diego¹, Yahoo!²

08 Dec 2008

TL;DR: This work presents Mencius, a protocol for general state machine replication (a method that provides strong consistency) that performs well in a wide-area network: it has high throughput under high client load and low latency under low client load, even under changing wide-area network environment and client load.

Abstract: We present a protocol for general state machine replication - a method that provides strong consistency - that has high performance in a wide-area network. In particular, our protocol Mencius has high throughput under high client load and low latency under low client load even under changing wide-area network environment and client load. We develop our protocol as a derivation from the well-known protocol Paxos. Such a development can be changed or further refined to take advantage of specific network or application requirements.

292 citations

Proceedings Article

Low Latency Geo-distributed Data Analytics

Qifan Pu¹, Ganesh Ananthanarayanan², Peter Bodik², Srikanth Kandula², Aditya Akella³, Paramvir Bahl², Ion Stoica¹

University of California, Berkeley¹, Microsoft², University of Wisconsin-Madison³

17 Aug 2015

TL;DR: Iridium is presented, a system for low latency geo-distributed analytics that achieves low query response times by optimizing placement of both data and tasks of the queries and contains a knob to budget WAN usage.

Abstract: Low latency analytics on geographically distributed datasets (across datacenters, edge clusters) is an upcoming and increasingly important challenge. The dominant approach of aggregating all the data to a single datacenter significantly inflates the timeliness of analytics. At the same time, running queries over geo-distributed inputs using the current intra-DC analytics frameworks also leads to high query response times because these frameworks cannot cope with the relatively low and variable capacity of WAN links. We present Iridium, a system for low latency geo-distributed analytics. Iridium achieves low query response times by optimizing placement of both data and tasks of the queries. The joint data and task placement optimization, however, is intractable. Therefore, Iridium uses an online heuristic to redistribute datasets among the sites prior to queries' arrivals, and places the tasks to reduce network bottlenecks during the query's execution. Finally, it also contains a knob to budget WAN usage. Evaluation across eight worldwide EC2 regions using production queries shows that Iridium speeds up queries by 3× to 19× and lowers WAN usage by 15% to 64% compared to existing baselines.

286 citations

Metrics

Papers: 3,729

Citations: 51,651

No. of papers in the topic in previous years
Year	Papers
2022	10
2021	692
2020	481
2019	389
2018	366
2017	227
