Z
Zhong Wang
Researcher at Lawrence Berkeley National Laboratory
Publications - 68
Citations - 24509
Zhong Wang is an academic researcher from Lawrence Berkeley National Laboratory. The author has contributed to research in topics: Gene & Genome. The author has an hindex of 29, co-authored 61 publications receiving 21060 citations. Previous affiliations of Zhong Wang include Joint Genome Institute & Yale University.
Papers
More filters
Posted ContentDOI
Widespread polycistronic transcripts in mushroom-forming fungi revealed by single-molecule long-read mRNA sequencing
Sean P. Gordon,Elizabeth Tseng,Asaf Salamov,Jiwei Zhang,Xiandong Meng,Zhiying Zhao,Dongwan D. Kang,Jason G. Underwood,Grigoriev,Melania Figueroa,Jonathan S. Schilling,Feng Chen,Zhong Wang +12 more
TL;DR: This study revealed, for the first time, the genome prevalence of polycistronic transcription in a subset of fungi and systematically demonstrated that short-read assembly is insufficient for mRNA isoform discovery, especially for isoform-rich loci.
Posted ContentDOI
A new method for rapid genome classification, clustering, visualization, and novel taxa discovery from metagenome
Zhong Wang,Harrison Ho,Rob Egan,Shijie Yao,Dongwan D. Kang,Jeff Froula,Volkan Sevim,Frederik Schulz,Jackie E. Shay,Derek N. Macklin,Kayla McCue,Rachel Orsini,Daniel Barich,Christopher J. Sedlacek,Wei Li,Rachael M. Morgan-Kiss,Tanja Woyke,Joan L. Slonczewski +17 more
TL;DR: An efficient software suite that estimates similarities between genomes based on their k-mer matches, and subsequently uses these similarities for classification, clustering, and visualization, and demonstrates that Genome Constellation can tackle the computational and algorithmic challenges in large-scale taxonomy analyses in metagenomics.
Journal ArticleDOI
Deconvolute individual genomes from metagenome sequences through short read clustering.
TL;DR: This work extended their previous read clustering software, SpaRC, by exploiting statistics derived from multiple samples in a dataset to reduce the under-clustering problem and demonstrate that this method has the potential to cluster almost all of the short reads from genomes with sufficient sequencing coverage.
Posted ContentDOI
SpaRC: Scalable Sequence Clustering using Apache Spark
TL;DR: A Apache Spark-based scalable sequence clustering application, SparkReadClust (SpaRC), that partitions the reads based on their molecule of origin to enable downstream assembly optimization and suggests SpaRC provides a scalable solution for clustering billions of reads from the next-generation sequencing experiments.
Journal ArticleDOI
Combining Hadoop with MPI to Solve Metagenomics Problems that are both Data- and Compute-intensive
Han Lin,Zhichao Su,Xiandong Meng,Xu Jin,Zhong Wang,Wenting Han,Hong An,Mengxian Chi,Zheng Wu +8 more
TL;DR: The results suggest integrating heterogeneous technologies such as Hadoop and MPI is quite efficient to solve large genomics problems that are both data-intensive and compute-intensive.