scispace - formally typeset
Topic

Latency (engineering)

About: Latency (engineering) is a research topic. Over its lifetime, 3,729 publications have been published within this topic, receiving 39,210 citations. The topic is also known as: lag.


Papers
Journal ArticleDOI
TL;DR: A general design for receiver-driven, real-time streaming data (RTSD) applications over the current NDN implementation that aims to take advantage of the architecture’s unique affordances.
Abstract: Named Data Networking (NDN) is a proposed future Internet architecture that shifts the fundamental abstraction of the network from host-to-host communication to request-response for named, signed data, an information-dissemination-focused approach. This paper describes a general design for receiver-driven, real-time streaming data (RTSD) applications over the current NDN implementation that aims to take advantage of the architecture's unique affordances. It is based on experimental development and testing of running code for real-time video conferencing, a positional tracking system for interactive multimedia, and a distributed control system for live performance. The design includes initial approaches to minimizing latency, managing buffer size and Interest retransmission, and adapting retrieval to maximize bandwidth and control congestion. Initial implementations of these approaches are evaluated for functionality and performance, and the potential for future research in this area, and for improved performance as new features of the architecture become available, is discussed. Key words: named data networking, information centric networking, videoconferencing, real-time, low-latency, congestion control

20 citations
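The adaptive-retrieval ideas above (pipelined Interests, retransmission on timeout, congestion control) can be illustrated with a minimal receiver-driven fetch model. This is a sketch under assumptions, not the paper's implementation: `Pipeline` and its method names are hypothetical, and a real consumer would express Interests through an NDN client library rather than this standalone object.

```python
class Pipeline:
    """Toy model of a receiver-driven fetch pipeline for sequentially
    named real-time samples, with AIMD window adaptation."""

    def __init__(self, window=4):
        self.window = window      # max outstanding Interests
        self.next_seq = 0         # next sample number to request
        self.outstanding = set()  # sequence numbers awaiting Data

    def fill(self):
        """Express Interests up to the window limit; return the new seqs."""
        sent = []
        while len(self.outstanding) < self.window:
            self.outstanding.add(self.next_seq)
            sent.append(self.next_seq)
            self.next_seq += 1
        return sent

    def on_data(self, seq):
        """Data arrived: additive increase grows the window."""
        self.outstanding.discard(seq)
        self.window += 1

    def on_timeout(self, seq):
        """Timeout: multiplicative decrease; the caller retransmits seq."""
        self.window = max(1, self.window // 2)
        return seq
```

A real RTSD consumer would additionally bound playout latency, for example by skipping samples that can no longer arrive before their playback deadline.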

Posted Content
TL;DR: Results are presented for a system that can achieve super-human performance (at a WER of 5.0%, over the Switchboard conversational benchmark) at a word based latency of only 1 second behind a speaker's speech.
Abstract: Achieving super-human performance in recognizing human speech has been a goal for several decades, as researchers have worked on increasingly challenging tasks. In the 1990s it was discovered that conversational speech between two humans is considerably more difficult than read speech, as hesitations, disfluencies, false starts, and sloppy articulation complicate acoustic processing and require robust joint handling of acoustic, lexical, and language context. Early attempts with statistical models could only reach error rates over 50%, far from human performance (a WER of around 5.5%). Neural hybrid models and recent attention-based encoder-decoder models have considerably improved performance, as such contexts can now be learned in an integral fashion. However, processing such contexts requires the presentation of an entire utterance and thus introduces unwanted delays before a recognition result can be output. In this paper, we address performance as well as latency. We present results for a system that can achieve super-human performance (a WER of 5.0% on the Switchboard conversational benchmark) at a word-based latency of only 1 second behind a speaker's speech. The system uses multiple attention-based encoder-decoder networks integrated within a novel low-latency incremental inference approach.

20 citations
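The latency question above, when a streaming recognizer may commit words before the utterance ends, can be illustrated with a generic prefix-stabilization rule: emit only the hypothesis prefix that agrees across successive partial decoding passes. This is a hedged sketch of the general idea, not the paper's incremental inference algorithm; `IncrementalCommitter` is a hypothetical name.

```python
def common_prefix(a, b):
    """Longest common prefix of two word lists."""
    out = []
    for x, y in zip(a, b):
        if x != y:
            break
        out.append(x)
    return out

class IncrementalCommitter:
    """Commit words once they are stable across consecutive partial
    hypotheses, bounding how far the output lags behind the audio."""

    def __init__(self):
        self.committed = []  # words already emitted
        self.prev_hyp = []   # previous partial hypothesis

    def update(self, hyp):
        """Feed the latest partial hypothesis; return newly stable words."""
        stable = common_prefix(self.prev_hyp, hyp)
        new = stable[len(self.committed):]
        if len(stable) > len(self.committed):
            self.committed = stable
        self.prev_hyp = list(hyp)
        return new
```

The stability criterion trades latency against the risk of committing a word the decoder would later revise.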

Proceedings ArticleDOI
08 Jul 2019
TL;DR: The proposed TrIMS is a generic memory sharing technique that enables constant data to be shared across processes or containers while still maintaining isolation between users. It consists of a persistent model store across the GPU, CPU, local storage, and cloud storage hierarchy, and an efficient resource management layer that provides isolation.
Abstract: Deep neural networks (DNNs) have become core computation components within low-latency Function as a Service (FaaS) prediction pipelines. Cloud computing, as the de facto backbone of modern computing infrastructure, has to be able to handle user-defined FaaS pipelines containing diverse DNN inference workloads while maintaining isolation and latency guarantees with minimal resource waste. The current solution for guaranteeing isolation and latency within FaaS is inefficient. A major cause of the inefficiency is the need to move large amounts of data within and across servers. We propose TrIMS as a novel solution to address this issue. TrIMS is a generic memory sharing technique that enables constant data to be shared across processes or containers while still maintaining isolation between users. TrIMS consists of a persistent model store across the GPU, CPU, local storage, and cloud storage hierarchy; an efficient resource management layer that provides isolation; and a succinct set of abstractions, application APIs, and container technologies for easy and transparent integration with FaaS, Deep Learning (DL) frameworks, and user code. We demonstrate our solution by interfacing TrIMS with the Apache MXNet framework, and demonstrate up to 24× speedup in latency for image classification models, up to 210× speedup for large models, and up to 8× system throughput improvement.

20 citations
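The caching contract at the heart of TrIMS, load constant model data once and share it by reference with refcounted eviction, can be sketched in a single process. This is an illustrative analogue, not the TrIMS API: TrIMS shares GPU/CPU memory across processes and containers, while `ModelStore` below (a hypothetical name) shows only the acquire/release idea.

```python
class ModelStore:
    """Single-process analogue of a persistent model store: constant
    model weights are loaded once, shared by reference, and evicted
    when the last user releases them."""

    def __init__(self, loader):
        self.loader = loader  # cold-path load function: name -> weights
        self.cache = {}       # name -> shared weights object
        self.refs = {}        # name -> reference count

    def acquire(self, name):
        """Return the shared object, loading it only on a cold miss."""
        if name not in self.cache:
            self.cache[name] = self.loader(name)
            self.refs[name] = 0
        self.refs[name] += 1
        return self.cache[name]

    def release(self, name):
        """Drop one reference; evict the entry when none remain."""
        self.refs[name] -= 1
        if self.refs[name] == 0:
            del self.cache[name]
            del self.refs[name]

# Demo: two acquires cause only one real load.
load_count = {"n": 0}

def fake_loader(name):
    load_count["n"] += 1
    return {"name": name}

store = ModelStore(fake_loader)
a = store.acquire("resnet50")
b = store.acquire("resnet50")
```

Avoiding the cold-path load (and the data movement it implies) is what produces the latency speedups reported in the abstract.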

Journal ArticleDOI
TL;DR: An asynchronous circuit for an arbiter cell that can be used to construct cascaded multiway arbitration circuits and has a short response delay at the input request-grant handshake link.
Abstract: We present an asynchronous circuit for an arbiter cell that can be used to construct cascaded multiway arbitration circuits. The circuit is completely speed-independent. It has a short response delay at the input request-grant handshake link due to both a) the propagation of requests in parallel with starting arbitration and b) the concurrent resetting of request-grant handshakes in different cascades of a request-grant propagation chain.

20 citations
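The cascaded structure described above can be modeled behaviorally: two-way arbiter cells compose into a tree, and each level halves the number of competing requests. The sketch below models only the grant contract; a real speed-independent cell resolves simultaneous requests with a hardware mutual-exclusion element, for which the fixed priority here merely stands in. Names are illustrative.

```python
class Arbiter2:
    """Behavioral model of a two-way arbiter cell: at most one of the
    two request inputs is granted. The fixed lower-index priority is a
    software stand-in for the hardware mutual-exclusion element."""

    def grant(self, req0, req1):
        if req0:
            return (True, False)
        if req1:
            return (False, True)
        return (False, False)

def tree_arbitrate(requests):
    """Cascade Arbiter2 cells into a tree over N requesters.
    Returns the index of the granted requester, or None."""
    cell = Arbiter2()
    reqs = list(requests)
    indices = list(range(len(reqs)))
    while len(reqs) > 1:
        next_reqs, next_idx = [], []
        for i in range(0, len(reqs), 2):
            if i + 1 < len(reqs):
                g0, g1 = cell.grant(reqs[i], reqs[i + 1])
                next_reqs.append(g0 or g1)
                next_idx.append(indices[i] if g0 else indices[i + 1])
            else:
                # Odd requester passes through to the next level.
                next_reqs.append(reqs[i])
                next_idx.append(indices[i])
        reqs, indices = next_reqs, next_idx
    return indices[0] if reqs and reqs[0] else None
```

In the paper's circuit, requests propagate up the cascade in parallel with arbitration starting, which is what keeps the request-grant response delay short.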

Journal ArticleDOI
TL;DR: This paper proposes a versatile Shack–Hartmann WFS based on an industrial smart camera for high-performance measurements of wavefront deformations, using a low-cost field-programmable gate array as the parallel processing platform.
Abstract: Wavefront sensing is important in various optical measurement systems, particularly in the field of adaptive optics (AO). For AO systems, the sampling rate, as well as the latency time, of the wavefront sensors (WFSs) imposes a restriction on the overall achievable temporal resolution. In this paper, we propose a versatile Shack–Hartmann WFS based on an industrial smart camera for high-performance measurements of wavefront deformations, using a low-cost field-programmable gate array as the parallel processing platform. The proposed wavefront reconstruction adds a processing latency of only 740 ns for calculating wavefront characteristics from the pixel stream of the image sensor, providing great potential for demanding AO system designs.

20 citations
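The per-subaperture computation that a Shack–Hartmann sensor pipelines in hardware is essentially a center-of-mass over each lenslet's spot image, followed by a slope estimate from the spot's displacement. A minimal software reference version (hypothetical names; the paper performs this on the FPGA directly from the image sensor's pixel stream, which is how it reaches a 740 ns processing latency):

```python
def centroid(img):
    """Center of mass (x, y) of a 2-D intensity array (list of rows)."""
    total = sx = sy = 0.0
    for y, row in enumerate(img):
        for x, v in enumerate(row):
            total += v
            sx += v * x
            sy += v * y
    if total == 0:
        return None  # no light in this subaperture
    return (sx / total, sy / total)

def slope(img, ref, focal_px):
    """Local wavefront slope ~ spot displacement from the reference
    position, divided by the lenslet focal length (in pixels)."""
    cx, cy = centroid(img)
    return ((cx - ref[0]) / focal_px, (cy - ref[1]) / focal_px)
```

An AO control loop would assemble the per-lenslet slopes into a gradient field and reconstruct the wavefront from it; the streaming formulation matters because centroid sums can be accumulated as pixels arrive, without buffering a full frame.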


Network Information
Related Topics (5)
Network packet: 159.7K papers, 2.2M citations (92% related)
Server: 79.5K papers, 1.4M citations (91% related)
Wireless: 133.4K papers, 1.9M citations (90% related)
Wireless sensor network: 142K papers, 2.4M citations (90% related)
Wireless network: 122.5K papers, 2.1M citations (90% related)
Performance
Metrics
No. of papers in the topic in previous years
Year    Papers
2022    10
2021    692
2020    481
2019    389
2018    366
2017    227