scispace - formally typeset
Z

Zhong Wang

Researcher at Lawrence Berkeley National Laboratory

Publications -  68
Citations -  24509

Zhong Wang is an academic researcher from Lawrence Berkeley National Laboratory. The author has contributed to research in topics: Gene & Genome. The author has an hindex of 29, co-authored 61 publications receiving 21060 citations. Previous affiliations of Zhong Wang include Joint Genome Institute & Yale University.

Papers
More filters
Posted ContentDOI

Widespread polycistronic transcripts in mushroom-forming fungi revealed by single-molecule long-read mRNA sequencing

TL;DR: This study revealed, for the first time, the genome prevalence of polycistronic transcription in a subset of fungi and systematically demonstrated that short-read assembly is insufficient for mRNA isoform discovery, especially for isoform-rich loci.
Posted ContentDOI

A new method for rapid genome classification, clustering, visualization, and novel taxa discovery from metagenome

TL;DR: An efficient software suite that estimates similarities between genomes based on their k-mer matches, and subsequently uses these similarities for classification, clustering, and visualization, and demonstrates that Genome Constellation can tackle the computational and algorithmic challenges in large-scale taxonomy analyses in metagenomics.
Journal ArticleDOI

Deconvolute individual genomes from metagenome sequences through short read clustering.

TL;DR: This work extended their previous read clustering software, SpaRC, by exploiting statistics derived from multiple samples in a dataset to reduce the under-clustering problem and demonstrate that this method has the potential to cluster almost all of the short reads from genomes with sufficient sequencing coverage.
Posted ContentDOI

SpaRC: Scalable Sequence Clustering using Apache Spark

TL;DR: A Apache Spark-based scalable sequence clustering application, SparkReadClust (SpaRC), that partitions the reads based on their molecule of origin to enable downstream assembly optimization and suggests SpaRC provides a scalable solution for clustering billions of reads from the next-generation sequencing experiments.
Journal ArticleDOI

Combining Hadoop with MPI to Solve Metagenomics Problems that are both Data- and Compute-intensive

TL;DR: The results suggest integrating heterogeneous technologies such as Hadoop and MPI is quite efficient to solve large genomics problems that are both data-intensive and compute-intensive.