Top 6 papers published by Kai Li from Princeton University in 2015

Journal Article•DOI•

Targeted exploration and analysis of large cross-platform human transcriptomic compendia.

[...]

Qian Zhu¹, Aaron K. Wong¹, Arjun Krishnan¹, Miriam Ragle Aure², Alicja Tadych¹, Ran Zhang¹, David C Corney¹, Casey S. Greene³, Lars Ailo Bongo⁴, Vessela N. Kristensen⁵, Moses Charikar¹, Kai Li¹, Olga G. Troyanskaya¹ - Show less +9 more•Institutions (5)

Princeton University¹, Oslo University Hospital², Dartmouth College³, University of Tromsø⁴, Akershus University Hospital⁵

01 Mar 2015-Nature Methods

TL;DR: This work presents SEEK (search-based exploration of expression compendia), a query-based search engine for very large transcriptomic data collections, including thousands of human data sets from many different microarray and high-throughput sequencing platforms.

...read moreread less

Abstract: The search engine SEEK allows multigene query across a large number of human expression data sets from array and sequencing platforms.

...read moreread less

132 citations

Proceedings Article•DOI•

RIPQ: advanced photo caching on flash for facebook

[...]

Linpeng Tang¹, Qi Huang², Wyatt Lloyd³, Sanjeev Kumar⁴, Kai Li¹ - Show less +1 more•Institutions (4)

Princeton University¹, Cornell University², University of Southern California³, Facebook⁴

16 Feb 2015

TL;DR: This paper shows that two families of advanced caching algorithms, Segmented-LRU and Greedy-Dual-Size-Frequency, can be easily implemented with RIPQ and shows that these algorithms running on RIPQ increase hit ratios up to ∼20% over the current FIFO system, incur low overhead, and achieve high throughput.

...read moreread less

Abstract: Facebook uses flash devices extensively in its photo-caching stack The key design challenge for an efficient photo cache on flash at Facebook is its workload: many small random writes are generated by inserting cache-missed content, or updating cache-hit content for advanced caching algorithms The Flash Translation Layer on flash devices performs poorly with such a workload, lowering throughput and decreasing device lifespan Existing coping strategies under-utilize the space on flash devices, sacrificing cache capacity, or are limited to simple caching algorithms like FIFO, sacrificing hit ratiosWe overcome these limitations with the novel Restricted Insertion Priority Queue (RIPQ) framework that supports advanced caching algorithms with large cache sizes, high throughput, and long device lifespan RIPQ aggregates small random writes, co-locates similarly prioritized content, and lazily moves updated content to further reduce device overhead We show that two families of advanced caching algorithms, Segmented-LRU and Greedy-Dual-Size-Frequency, can be easily implemented with RIPQ Our evaluation on Facebook's photo trace shows that these algorithms running on RIPQ increase hit ratios up to ∼20% over the current FIFO system, incur low overhead, and achieve high throughput

...read moreread less

97 citations

Patent•

Stream locality delta compression

[...]

Mark Huang¹, Philip Shilane¹, Grant Wallace², Nitin Garg², Edward K. Lee, Ming Benjamin Zhu, Kai Li - Show less +3 more•Institutions (2)

Data Domain¹, EMC Corporation²

27 May 2015

TL;DR: In this article, a first data segment is determined to be similar to a data segment in the stream indicated locale, and then a first segment is decomposed in a stream locality delta compression.

...read moreread less

Abstract: Stream locality delta compression is disclosed. A previous stream indicated locale of data segments is selected. A first data segment is then determined to be similar to a data segment in the stream indicated locale.

...read moreread less

62 citations

Journal Article•DOI•

Full correlation matrix analysis (FCMA): An unbiased method for task-related functional connectivity.

[...]

Yida Wang¹, Jonathan D. Cohen¹, Kai Li¹, Nicholas B. Turk-Browne¹•Institutions (1)

Princeton University¹

15 Aug 2015-Journal of Neuroscience Methods

TL;DR: Full correlation matrix analysis demonstrates how advances in computer science can alleviate computational bottlenecks in neuroscience by accelerating a naive, serial approach and revealing a region of medial prefrontal cortex whose selectivity derived from differential patterns of functional connectivity across categories.

...read moreread less

39 citations

Proceedings Article•DOI•

Full correlation matrix analysis of fMRI data on Intel® Xeon Phi™ coprocessors

[...]

Yida Wang¹, Michael J. Anderson², Jonathan D. Cohen¹, Alexander Heinecke², Kai Li¹, Nadathur Satish², Narayanan Sundaram², Nicholas B. Turk-Browne¹, Theodore L. Willke² - Show less +5 more•Institutions (2)

Princeton University¹, Intel²

15 Nov 2015

TL;DR: A closed-loop analysis system with FCMA on a cluster of nodes with Intel® Xeon Phi™ coprocessors and shows that the optimized single-node code runs 5x-16x faster than the baseline implementation using the well-known Intel® MKL and LibSVM libraries.

...read moreread less

Abstract: Full correlation matrix analysis (FCMA) is an unbiased approach for exhaustively studying interactions among brain regions in functional magnetic resonance imaging (fMRI) data from human participants. In order to answer neuroscientific questions efficiently, we are developing a closed-loop analysis system with FCMA on a cluster of nodes with Intel® Xeon Phi™ coprocessors. Here we propose several ideas for data-driven algorithmic modification to improve the performance on the coprocessor. Our experiments with real datasets show that the optimized single-node code runs 5x-16x faster than the baseline implementation using the well-known Intel® MKL and LibSVM libraries, and that the cluster implementation achieves near linear speedup on 5760 cores.

...read moreread less

18 citations

Optimizing Full Correlation Matrix Analysis of fMRI Data on Intel Xeon Phi Coprocessors

[...]

Yida Wang, Michael J. Anderson, Jonathan D. Cohen, Alexander Heinecke, Kai Li, Nadathur Satish, Narayanan Sundaram, Nicholas B. Turk-Browne, Ted Willke - Show less +5 more

01 Jan 2015

Abstract: Full correlation matrix analysis (FCMA) is an unbiased approach for exhaustively studying interactions among brain regions in functional magnetic resonance imaging (fMRI) data from human participants. In order to answer neuroscientific questions efficiently, we are developing a closed-loop analysis system with FCMA on a cluster of nodes with Intel® Xeon Phi™ coprocessors. Here we propose several ideas for data-driven algorithmic modification to improve the performance on the coprocessor. Our experiments with real datasets show that the optimized single-node code runs 5x-16x faster than the baseline implementation using the well-known Intel® MKL and LibSVM libraries, and that the cluster implementation achieves near linear speedup on 5760 cores.

...read moreread less

2 citations

Showing papers by "Kai Li published in 2015"