Showing papers by "Xilinx published in 2013"

PDF

Open Access

Journal Article•DOI•

On the Impact of Phase Noise on Active Cancelation in Wireless Full-Duplex

[...]

Achaleshwar Sahai¹, Gaurav Patel¹, Christopher H. Dick², Ashutosh Sabharwal¹•Institutions (2)

05 Jun 2013-IEEE Transactions on Vehicular Technology

TL;DR: The root cause of performance bottlenecks in current full-duplex systems is investigated and signal models for wideband and multiple-input-multiple-output (MIMO) full- DUplex systems are proposed, capturing all the salient design parameters, thus allowing future analytical development of advanced coding and signal design for full- duplex systems.

...read moreread less

Abstract: Recent experimental results have shown that full-duplex communication is possible for short-range communications. However, extending full-duplex to long-range communication remains a challenge, primarily due to residual self-interference, even with a combination of passive suppression and active cancelation methods. In this paper, we investigate the root cause of performance bottlenecks in current full-duplex systems. We first classify all known full-duplex architectures based on how they compute their canceling signal and where the canceling signal is injected to cancel self-interference. Based on the classification, we analytically explain several published experimental results. The key bottleneck in current systems turns out to be the phase noise in the local oscillators in the transmit-and-receive chain of the full-duplex node. As a key by-product of our analysis, we propose signal models for wideband and multiple-input-multiple-output (MIMO) full-duplex systems, capturing all the salient design parameters, thus allowing future analytical development of advanced coding and signal design for full-duplex systems.

...read moreread less

251 citations

Proceedings Article•DOI•

Approximate matrix inversion for high-throughput data detection in the large-scale MIMO uplink

[...]

Michael Wu¹, Bei Yin¹, Aida Vosoughi¹, Christoph Studer¹, Joseph R. Cavallaro¹, Christopher H. Dick² - Show less +2 more•Institutions (2)

Rice University¹, Xilinx²

19 May 2013

TL;DR: This paper proposes a novel VLSI architecture to efficiently compute the approximate inverse using a systolic array and shows reference FPGA implementation results for various system configurations.

...read moreread less

Abstract: The high processing complexity of data detection in the large-scale multiple-input multiple-output (MIMO) uplink necessitates high-throughput VLSI implementations In this paper, we propose - to the best of our knowledge - first matrix inversion implementation suitable for data detection in systems having hundreds of antennas at the base station (BS) The underlying idea is to carry out an approximate matrix inversion using a small number of Neumann-series terms, which allows one to achieve near-optimal performance at low complexity We propose a novel VLSI architecture to efficiently compute the approximate inverse using a systolic array and show reference FPGA implementation results for various system configurations For a system where 128 BS antennas receive data from 8 single-antenna users, a single instance of our design processes 19M matrices/s on a Xilinx Virtex-7 FPGA, while using only 39% of the available slices and 36% of the available DSP48 units

...read moreread less

158 citations

Proceedings Article•

Achieving 10Gbps Line-rate Key-value Stores with FPGAs

[...]

Michaela Blott¹, Kimon Karras¹, Ling Liu¹, Kees Vissers¹, Jeremia Bar², Zsolt István² - Show less +2 more•Institutions (2)

Xilinx¹, ETH Zurich²

01 Jan 2013

TL;DR: The design of a novel memcached architecture implemented on Field Programmable Gate Arrays (FPGAs) which is the first in literature to achieve 10Gbps line rate processing for all packet sizes and can not only provide significant speed-up but also operate at a lower power consumption than any x86.

...read moreread less

Abstract: Distributed in-memory key-value stores such as memcached have become a critical middleware application within current web infrastructure However, typical x86based systems yield limited performance scalability and high power consumption as their architecture with its optimization for single thread performance is not wellmatched towards the memory-intensive and parallel nature of this application In this paper we present the design of a novel memcached architecture implemented on Field Programmable Gate Arrays (FPGAs) which is the first in literature to achieve 10Gbps line rate processing for all packet sizes By transformation of the functionality into a dataflow architecture, the implementation can not only provide significant speed-up but also operate at a lower power consumption than any x86 More specifically, with our prototype we have measured an increase of up to a factor of 36x in requests per second per Watt that can be serviced in comparison to the best published numbers for regular servers with optimized software Additionally, we show that through the tight integration of network interface, memory and compute, round trip latency can be reduced down to below 45 microseconds

...read moreread less

97 citations

Proceedings Article•DOI•

Iterative hard-decision decoding of braided BCH codes for high-speed optical communication

[...]

Yung-Yih Jian¹, Henry D. Pfister¹, Krishna R. Narayanan¹, Raghu Rao², Raied N. Mazahreh² - Show less +1 more•Institutions (2)

Texas A&M University¹, Xilinx²

01 Dec 2013

TL;DR: Analysis and simulation of the iterative HDD of tightly-braided block codes with BCH component codes for high-speed optical communication shows that these codes are competitive with the best schemes based on HDD.

...read moreread less

Abstract: Designing error-correcting codes for optical communication is challenging mainly because of the high data rates (e.g., 100 Gbps) required and the expectation of low latency, low overhead (e.g., 7% redundancy), and large coding gain (e.g., >9dB). Although soft-decision decoding (SDD) of low-density parity-check (LDPC) codes is an active area of research, the mainstay of optical transport systems is still the iterative hard-decision decoding (HDD) of generalized product codes with algebraic syndrome decoding of the component codes. This is because iterative HDD allows many simplifications and SDD of LDPC codes results in much higher implementation complexity. In this paper, we use analysis and simulation to evaluate tightly-braided block codes with BCH component codes for high-speed optical communication. Simulation of the iterative HDD shows that these codes are competitive with the best schemes based on HDD. Finally, we suggest a specific design that is compatible with the G.709 framing structure and exhibits a coding gain of >9.35 dB at 7% redundancy under iterative HDD with a latency of approximately 1 million bits.

...read moreread less

86 citations

Proceedings Article•DOI•

Scalable ternary content addressable memory implementation using FPGAs

[...]

Weirong Jiang¹•Institutions (1)