scispace - formally typeset
Search or ask a question
Author

Yifang Hu

Other affiliations: University of Melbourne
Bio: Yifang Hu is an academic researcher from Walter and Eliza Hall Institute of Medical Research. The author has contributed to research in topics: Haematopoiesis & Progenitor cell. The author has an hindex of 23, co-authored 34 publications receiving 16561 citations. Previous affiliations of Yifang Hu include University of Melbourne.

Papers
More filters
Journal ArticleDOI
TL;DR: The philosophy and design of the limma package is reviewed, summarizing both new and historical features, with an emphasis on recent enhancements and features that have not been previously described.
Abstract: limma is an R/Bioconductor software package that provides an integrated solution for analysing data from gene expression experiments. It contains rich features for handling complex experimental designs and for information borrowing to overcome the problem of small sample sizes. Over the past decade, limma has been a popular choice for gene discovery through differential expression analyses of microarray and high-throughput PCR data. The package contains particularly strong facilities for reading, normalizing and exploring such data. Recently, the capabilities of limma have been significantly expanded in two important directions. First, the package can now perform both differential expression and differential splicing analyses of RNA sequencing (RNA-seq) data. All the downstream analysis tools previously restricted to microarray data are now available for RNA-seq as well. These capabilities allow users to analyse both RNA-seq and microarray data with very similar pipelines. Second, the package is now able to go past the traditional gene-wise expression analyses in a variety of ways, analysing expression profiles in terms of co-regulated sets of genes or in terms of higher-order expression signatures. This provides enhanced possibilities for biological interpretation of gene expression differences. This article reviews the philosophy and design of the limma package, summarizing both new and historical features, with an emphasis on recent enhancements and features that have not been previously described.

22,147 citations

Journal ArticleDOI
TL;DR: ELDA is a software application for limiting dilution analysis (LDA), with particular attention to the needs of stem cell assays, which is the first limiting dilutions analysis software to provide meaningful confidence intervals for all LDA data sets, including those with 0% or 100% responses.

1,645 citations

Journal ArticleDOI
TL;DR: Comparing the gene expression profiles of Trm cells isolated from the brain with those of circulating memory T cellsisolated from the spleen after an acute virus infection indicates that Trm Cells are a distinct memory T cell population disconnected from the circulatingMemory T cell pool and display a unique molecular signature that likely results in optimal survival and function within their local environment.
Abstract: Tissue resident memory (Trm) CD8 T cells represent a newly described memory T cell population. We have previously characterized a population of Trm cells that persists within the brain after acute virus infection. Although capable of providing marked protection against a subsequent local challenge, brain Trm cells do not undergo recall expansion after dissociation from the tissue. Furthermore, these Trm cells do not depend on the same survival factors as the circulating memory T cell pool as assessed either in vivo or in vitro. To gain greater insight into this population of cells, we compared the gene expression profiles of Trm cells isolated from the brain with those of circulating memory T cells isolated from the spleen after an acute virus infection. Trm cells displayed altered expression of genes involved in chemotaxis, expressed a distinct set of transcription factors, and overexpressed several inhibitory receptors. Cumulatively, these data indicate that Trm cells are a distinct memory T cell population disconnected from the circulating memory T cell pool and display a unique molecular signature that likely results in optimal survival and function within their local environment.

312 citations

Journal ArticleDOI
TL;DR: It is shown that Id2 is broadly expressed in all cDC subsets with the highest expression in CD103+ and CD8α+ lineages and that Irf‐8 and Batf3 regulate distinct stages in DC differentiation during the development of cDCs.
Abstract: Dendritic cells (DCs) have critical roles in the induction of the adaptive immune response. The transcription factors Id2, Batf3 and Irf-8 are required for many aspects of murine DC differentiation including development of CD8α+ and CD103+ DCs. How they regulate DC subset specification is not completely understood. Using an Id2-GFP reporter system, we show that Id2 is broadly expressed in all cDC subsets with the highest expression in CD103+ and CD8α+ lineages. Notably, CD103+ DCs were the only DC able to constitutively cross-present cell-associated antigens in vitro. Irf-8 deficiency affected loss of development of virtually all conventional DCs (cDCs) while Batf3 deficiency resulted in the development of Sirp-α− DCs that had impaired survival. Exposure to GM-CSF during differentiation induced expression of CD103 in Id2-GFP+ DCs. It did not restore cross-presenting capacity to Batf3−/− or CD103−Sirp-α−DCs in vitro. Thus, Irf-8 and Batf3 regulate distinct stages in DC differentiation during the development of cDCs. Genetic mapping DC subset differentiation using Id2-GFP may have broad implications in understanding the interplay of DC subsets during protective and pathological immune responses.

134 citations

Journal ArticleDOI
05 Aug 2010-Blood
TL;DR: It is shown that mice with loss of function mutations in PRC2 components display enhanced HSC/progenitor population activity, whereas mutations that disrupt PRC1 or pleiohomeotic repressive complex are associated with HSC or progenitor cell defects.

131 citations


Cited by
More filters
Journal ArticleDOI
TL;DR: The philosophy and design of the limma package is reviewed, summarizing both new and historical features, with an emphasis on recent enhancements and features that have not been previously described.
Abstract: limma is an R/Bioconductor software package that provides an integrated solution for analysing data from gene expression experiments. It contains rich features for handling complex experimental designs and for information borrowing to overcome the problem of small sample sizes. Over the past decade, limma has been a popular choice for gene discovery through differential expression analyses of microarray and high-throughput PCR data. The package contains particularly strong facilities for reading, normalizing and exploring such data. Recently, the capabilities of limma have been significantly expanded in two important directions. First, the package can now perform both differential expression and differential splicing analyses of RNA sequencing (RNA-seq) data. All the downstream analysis tools previously restricted to microarray data are now available for RNA-seq as well. These capabilities allow users to analyse both RNA-seq and microarray data with very similar pipelines. Second, the package is now able to go past the traditional gene-wise expression analyses in a variety of ways, analysing expression profiles in terms of co-regulated sets of genes or in terms of higher-order expression signatures. This provides enhanced possibilities for biological interpretation of gene expression differences. This article reviews the philosophy and design of the limma package, summarizing both new and historical features, with an emphasis on recent enhancements and features that have not been previously described.

22,147 citations

Journal ArticleDOI
TL;DR: An analytical strategy for integrating scRNA-seq data sets based on common sources of variation is introduced, enabling the identification of shared populations across data sets and downstream comparative analysis.
Abstract: Computational single-cell RNA-seq (scRNA-seq) methods have been successfully applied to experiments representing a single condition, technology, or species to discover and define cellular phenotypes. However, identifying subpopulations of cells that are present across multiple data sets remains challenging. Here, we introduce an analytical strategy for integrating scRNA-seq data sets based on common sources of variation, enabling the identification of shared populations across data sets and downstream comparative analysis. We apply this approach, implemented in our R toolkit Seurat (http://satijalab.org/seurat/), to align scRNA-seq data sets of peripheral blood mononuclear cells under resting and stimulated conditions, hematopoietic progenitors sequenced using two profiling technologies, and pancreatic cell 'atlases' generated from human and mouse islets. In each case, we learn distinct or transitional cell states jointly across data sets, while boosting statistical power through integrated analysis. Our approach facilitates general comparisons of scRNA-seq data sets, potentially deepening our understanding of how distinct cell states respond to perturbation, disease, and evolution.

7,741 citations

Journal ArticleDOI
TL;DR: This protocol describes all the steps necessary to process a large set of raw sequencing reads and create lists of gene transcripts, expression levels, and differentially expressed genes and transcripts.
Abstract: High-throughput sequencing of mRNA (RNA-seq) has become the standard method for measuring and comparing the levels of gene expression in a wide variety of species and conditions. RNA-seq experiments generate very large, complex data sets that demand fast, accurate and flexible software to reduce the raw read data to comprehensible results. HISAT (hierarchical indexing for spliced alignment of transcripts), StringTie and Ballgown are free, open-source software tools for comprehensive analysis of RNA-seq experiments. Together, they allow scientists to align reads to a genome, assemble transcripts including novel splice variants, compute the abundance of these transcripts in each sample and compare experiments to identify differentially expressed genes and transcripts. This protocol describes all the steps necessary to process a large set of raw sequencing reads and create lists of gene transcripts, expression levels, and differentially expressed genes and transcripts. The protocol's execution time depends on the computing resources, but it typically takes under 45 min of computer time. HISAT, StringTie and Ballgown are available from http://ccb.jhu.edu/software.shtml.

3,755 citations

Journal ArticleDOI
TL;DR: Comparing the performance of UMAP with five other tools, it is found that UMAP provides the fastest run times, highest reproducibility and the most meaningful organization of cell clusters.
Abstract: Advances in single-cell technologies have enabled high-resolution dissection of tissue composition. Several tools for dimensionality reduction are available to analyze the large number of parameters generated in single-cell studies. Recently, a nonlinear dimensionality-reduction technique, uniform manifold approximation and projection (UMAP), was developed for the analysis of any type of high-dimensional data. Here we apply it to biological data, using three well-characterized mass cytometry and single-cell RNA sequencing datasets. Comparing the performance of UMAP with five other tools, we find that UMAP provides the fastest run times, highest reproducibility and the most meaningful organization of cell clusters. The work highlights the use of UMAP for improved visualization and interpretation of single-cell data.

3,016 citations

Journal ArticleDOI
TL;DR: Treatment with atezolizumab resulted in a significantly improved RECIST v1.1 response rate, compared with a historical control overall response rate of 10%, and Exploratory analyses showed The Cancer Genome Atlas (TCGA) subtypes and mutation load to be independently predictive for response to atezolediazepine.

2,934 citations