Zhong Wang

Researcher at Lawrence Berkeley National Laboratory

Publications - 68

Citations - 24509

Zhong Wang is an academic researcher from Lawrence Berkeley National Laboratory. The author has contributed to research in topics: Gene & Genome. The author has an hindex of 29, co-authored 61 publications receiving 21060 citations. Previous affiliations of Zhong Wang include Joint Genome Institute & Yale University.

Papers

PDF

Open Access

More filters

Posted ContentDOI

Widespread polycistronic transcripts in mushroom-forming fungi revealed by single-molecule long-read mRNA sequencing

Sean P. Gordon, +12 more

- 11 Dec 2014 -

bioRxiv

TL;DR: This study revealed, for the first time, the genome prevalence of polycistronic transcription in a subset of fungi and systematically demonstrated that short-read assembly is insufficient for mRNA isoform discovery, especially for isoform-rich loci.

...read moreread less

Posted ContentDOI

A new method for rapid genome classification, clustering, visualization, and novel taxa discovery from metagenome

Zhong Wang, +17 more

- 21 Oct 2019 -

bioRxiv

TL;DR: An efficient software suite that estimates similarities between genomes based on their k-mer matches, and subsequently uses these similarities for classification, clustering, and visualization, and demonstrates that Genome Constellation can tackle the computational and algorithmic challenges in large-scale taxonomy analyses in metagenomics.

...read moreread less

Journal ArticleDOI

Deconvolute individual genomes from metagenome sequences through short read clustering.

Kexue Li, +6 more

- 08 Apr 2020 -

PeerJ

TL;DR: This work extended their previous read clustering software, SpaRC, by exploiting statistics derived from multiple samples in a dataset to reduce the under-clustering problem and demonstrate that this method has the potential to cluster almost all of the short reads from genomes with sufficient sequencing coverage.

...read moreread less

Posted ContentDOI

SpaRC: Scalable Sequence Clustering using Apache Spark

Lizhen Shi, +4 more

- 11 Jan 2018 -

bioRxiv

TL;DR: A Apache Spark-based scalable sequence clustering application, SparkReadClust (SpaRC), that partitions the reads based on their molecule of origin to enable downstream assembly optimization and suggests SpaRC provides a scalable solution for clustering billions of reads from the next-generation sequencing experiments.

...read moreread less

Journal ArticleDOI

Combining Hadoop with MPI to Solve Metagenomics Problems that are both Data- and Compute-intensive

Han Lin, +8 more

- 01 Aug 2018 -

International Journal of Parallel Progra...

TL;DR: The results suggest integrating heterogeneous technologies such as Hadoop and MPI is quite efficient to solve large genomics problems that are both data-intensive and compute-intensive.

...read moreread less

Collapse