scispace - formally typeset
Search or ask a question
Browse all papers

Posted Content
TL;DR: In this paper, a simple warm restart technique for stochastic gradient descent was proposed to improve its anytime performance when training deep neural networks, which achieved state-of-the-art results on both the CIFAR-10 and CifAR-100 datasets.
Abstract: Restart techniques are common in gradient-free optimization to deal with multimodal functions. Partial warm restarts are also gaining popularity in gradient-based optimization to improve the rate of convergence in accelerated gradient schemes to deal with ill-conditioned functions. In this paper, we propose a simple warm restart technique for stochastic gradient descent to improve its anytime performance when training deep neural networks. We empirically study its performance on the CIFAR-10 and CIFAR-100 datasets, where we demonstrate new state-of-the-art results at 3.14% and 16.21%, respectively. We also demonstrate its advantages on a dataset of EEG recordings and on a downsampled version of the ImageNet dataset. Our source code is available at this https URL

3,497 citations


Proceedings ArticleDOI
18 May 2015
TL;DR: A novel network embedding method called the ``LINE,'' which is suitable for arbitrary types of information networks: undirected, directed, and/or weighted, and optimizes a carefully designed objective function that preserves both the local and global network structures.
Abstract: This paper studies the problem of embedding very large information networks into low-dimensional vector spaces, which is useful in many tasks such as visualization, node classification, and link prediction. Most existing graph embedding methods do not scale for real world information networks which usually contain millions of nodes. In this paper, we propose a novel network embedding method called the ``LINE,'' which is suitable for arbitrary types of information networks: undirected, directed, and/or weighted. The method optimizes a carefully designed objective function that preserves both the local and global network structures. An edge-sampling algorithm is proposed that addresses the limitation of the classical stochastic gradient descent and improves both the effectiveness and the efficiency of the inference. Empirical experiments prove the effectiveness of the LINE on a variety of real-world information networks, including language networks, social networks, and citation networks. The algorithm is very efficient, which is able to learn the embedding of a network with millions of vertices and billions of edges in a few hours on a typical single machine. The source code of the LINE is available online\footnote{\url{https://github.com/tangjianpku/LINE}}.

3,492 citations


Journal ArticleDOI
TL;DR: Investigators in Germany detected the spread of the novel coronavirus (2019-nCoV) from a person who had recently traveled from China and found it to be a novel virus.
Abstract: 2019-nCoV Transmission from Asymptomatic Patient In this report, investigators in Germany detected the spread of the novel coronavirus (2019-nCoV) from a person who had recently traveled from China...

3,492 citations


Proceedings Article
06 Jul 2015
TL;DR: A method for optimizing control policies, with guaranteed monotonic improvement, by making several approximations to the theoretically-justified scheme, called Trust Region Policy Optimization (TRPO).
Abstract: In this article, we describe a method for optimizing control policies, with guaranteed monotonic improvement. By making several approximations to the theoretically-justified scheme, we develop a practical algorithm, called Trust Region Policy Optimization (TRPO). This algorithm is effective for optimizing large nonlinear policies such as neural networks. Our experiments demonstrate its robust performance on a wide variety of tasks: learning simulated robotic swimming, hopping, and walking gaits; and playing Atari games using images of the screen as input. Despite its approximations that deviate from the theory, TRPO tends to give monotonic improvement, with little tuning of hyperparameters.

3,479 citations


Proceedings Article
19 Jun 2016
TL;DR: A new theoretical framework is developed casting dropout training in deep neural networks (NNs) as approximate Bayesian inference in deep Gaussian processes, which mitigates the problem of representing uncertainty in deep learning without sacrificing either computational complexity or test accuracy.
Abstract: Deep learning tools have gained tremendous attention in applied machine learning. However such tools for regression and classification do not capture model uncertainty. In comparison, Bayesian models offer a mathematically grounded framework to reason about model uncertainty, but usually come with a prohibitive computational cost. In this paper we develop a new theoretical framework casting dropout training in deep neural networks (NNs) as approximate Bayesian inference in deep Gaussian processes. A direct result of this theory gives us tools to model uncertainty with dropout NNs - extracting information from existing models that has been thrown away so far. This mitigates the problem of representing uncertainty in deep learning without sacrificing either computational complexity or test accuracy. We perform an extensive study of the properties of dropout's uncertainty. Various network architectures and nonlinearities are assessed on tasks of regression and classification, using MNIST as an example. We show a considerable improvement in predictive log-likelihood and RMSE compared to existing state-of-the-art methods, and finish by using dropout's uncertainty in deep reinforcement learning.

3,472 citations


Journal ArticleDOI
12 Feb 2015-Nature
TL;DR: A genome-wide association study and Metabochip meta-analysis of body mass index (BMI), a measure commonly used to define obesity and assess adiposity, in up to 339,224 individuals provide strong support for a role of the central nervous system in obesity susceptibility.
Abstract: Obesity is heritable and predisposes to many diseases To understand the genetic basis of obesity better, here we conduct a genome-wide association study and Metabochip meta-analysis of body mass index (BMI), a measure commonly used to define obesity and assess adiposity, in up to 339,224 individuals This analysis identifies 97 BMI-associated loci (P 20% of BMI variation Pathway analyses provide strong support for a role of the central nervous system in obesity susceptibility and implicate new genes and pathways, including those related to synaptic function, glutamate signalling, insulin secretion/action, energy metabolism, lipid biology and adipogenesis

3,472 citations


Journal ArticleDOI
TL;DR: In this paper, the triple cation perovskite photovoltaics with inorganic cesium were shown to be thermally more stable, contain less phase impurities and are less sensitive to processing conditions.
Abstract: Today's best perovskite solar cells use a mixture of formamidinium and methylammonium as the monovalent cations. With the addition of inorganic cesium, the resulting triple cation perovskite compositions are thermally more stable, contain less phase impurities and are less sensitive to processing conditions. This enables more reproducible device performances to reach a stabilized power output of 21.1% and ∼18% after 250 hours under operational conditions. These properties are key for the industrialization of perovskite photovoltaics.

3,470 citations


Journal ArticleDOI
TL;DR: Seurat is a computational strategy to infer cellular localization by integrating single-cell RNA-seq data with in situ RNA patterns, and correctly localizes rare subpopulations, accurately mapping both spatially restricted and scattered groups.
Abstract: Spatial localization is a key determinant of cellular fate and behavior, but methods for spatially resolved, transcriptome-wide gene expression profiling across complex tissues are lacking. RNA staining methods assay only a small number of transcripts, whereas single-cell RNA-seq, which measures global gene expression, separates cells from their native spatial context. Here we present Seurat, a computational strategy to infer cellular localization by integrating single-cell RNA-seq data with in situ RNA patterns. We applied Seurat to spatially map 851 single cells from dissociated zebrafish (Danio rerio) embryos and generated a transcriptome-wide map of spatial patterning. We confirmed Seurat's accuracy using several experimental approaches, then used the strategy to identify a set of archetypal expression patterns and spatial markers. Seurat correctly localizes rare subpopulations, accurately mapping both spatially restricted and scattered groups. Seurat will be applicable to mapping cellular localization within complex patterned tissues in diverse systems.

3,465 citations


Book ChapterDOI
08 Oct 2016
TL;DR: This paper proposes a new supervision signal, called center loss, for face recognition task, which simultaneously learns a center for deep features of each class and penalizes the distances between the deep features and their corresponding class centers.
Abstract: Convolutional neural networks (CNNs) have been widely used in computer vision community, significantly improving the state-of-the-art. In most of the available CNNs, the softmax loss function is used as the supervision signal to train the deep model. In order to enhance the discriminative power of the deeply learned features, this paper proposes a new supervision signal, called center loss, for face recognition task. Specifically, the center loss simultaneously learns a center for deep features of each class and penalizes the distances between the deep features and their corresponding class centers. More importantly, we prove that the proposed center loss function is trainable and easy to optimize in the CNNs. With the joint supervision of softmax loss and center loss, we can train a robust CNNs to obtain the deep features with the two key learning objectives, inter-class dispension and intra-class compactness as much as possible, which are very essential to face recognition. It is encouraging to see that our CNNs (with such joint supervision) achieve the state-of-the-art accuracy on several important face recognition benchmarks, Labeled Faces in the Wild (LFW), YouTube Faces (YTF), and MegaFace Challenge. Especially, our new approach achieves the best results on MegaFace (the largest public domain face benchmark) under the protocol of small training set (contains under 500000 images and under 20000 persons), significantly improving the previous results and setting new state-of-the-art for both face recognition and face verification tasks.

3,464 citations


Posted Content
TL;DR: The \textit{Transformers} library is an open-source library that consists of carefully engineered state-of-the art Transformer architectures under a unified API and a curated collection of pretrained models made by and available for the community.
Abstract: Recent progress in natural language processing has been driven by advances in both model architecture and model pretraining. Transformer architectures have facilitated building higher-capacity models and pretraining has made it possible to effectively utilize this capacity for a wide variety of tasks. \textit{Transformers} is an open-source library with the goal of opening up these advances to the wider machine learning community. The library consists of carefully engineered state-of-the art Transformer architectures under a unified API. Backing this library is a curated collection of pretrained models made by and available for the community. \textit{Transformers} is designed to be extensible by researchers, simple for practitioners, and fast and robust in industrial deployments. The library is available at \url{this https URL}.

3,463 citations


Proceedings ArticleDOI
18 Jun 2018
TL;DR: In this paper, a new method for synthesizing high-resolution photo-realistic images from semantic label maps using conditional generative adversarial networks (conditional GANs) is presented.
Abstract: We present a new method for synthesizing high-resolution photo-realistic images from semantic label maps using conditional generative adversarial networks (conditional GANs). Conditional GANs have enabled a variety of applications, but the results are often limited to low-resolution and still far from realistic. In this work, we generate 2048 A— 1024 visually appealing results with a novel adversarial loss, as well as new multi-scale generator and discriminator architectures. Furthermore, we extend our framework to interactive visual manipulation with two additional features. First, we incorporate object instance segmentation information, which enables object manipulations such as removing/adding objects and changing the object category. Second, we propose a method to generate diverse results given the same input, allowing users to edit the object appearance interactively. Human opinion studies demonstrate that our method significantly outperforms existing methods, advancing both the quality and the resolution of deep image synthesis and editing.

Journal ArticleDOI
B. P. Abbott1, Richard J. Abbott1, T. D. Abbott2, M. R. Abernathy3  +970 moreInstitutions (114)
TL;DR: This second gravitational-wave observation provides improved constraints on stellar populations and on deviations from general relativity.
Abstract: We report the observation of a gravitational-wave signal produced by the coalescence of two stellar-mass black holes. The signal, GW151226, was observed by the twin detectors of the Laser Interferometer Gravitational-Wave Observatory (LIGO) on December 26, 2015 at 03:38:53 UTC. The signal was initially identified within 70 s by an online matched-filter search targeting binary coalescences. Subsequent off-line analyses recovered GW151226 with a network signal-to-noise ratio of 13 and a significance greater than 5 σ. The signal persisted in the LIGO frequency band for approximately 1 s, increasing in frequency and amplitude over about 55 cycles from 35 to 450 Hz, and reached a peak gravitational strain of 3.4+0.7−0.9×10−22. The inferred source-frame initial black hole masses are 14.2+8.3−3.7M⊙ and 7.5+2.3−2.3M⊙ and the final black hole mass is 20.8+6.1−1.7M⊙. We find that at least one of the component black holes has spin greater than 0.2. This source is located at a luminosity distance of 440+180−190 Mpc corresponding to a redshift 0.09+0.03−0.04. All uncertainties define a 90 % credible interval. This second gravitational-wave observation provides improved constraints on stellar populations and on deviations from general relativity.

Proceedings ArticleDOI
TL;DR: LINE as discussed by the authors proposes a network embedding method called LINE, which is suitable for arbitrary types of information networks: undirected, directed, and/or weighted, and optimizes a carefully designed objective function that preserves both the local and global network structures.
Abstract: This paper studies the problem of embedding very large information networks into low-dimensional vector spaces, which is useful in many tasks such as visualization, node classification, and link prediction. Most existing graph embedding methods do not scale for real world information networks which usually contain millions of nodes. In this paper, we propose a novel network embedding method called the "LINE," which is suitable for arbitrary types of information networks: undirected, directed, and/or weighted. The method optimizes a carefully designed objective function that preserves both the local and global network structures. An edge-sampling algorithm is proposed that addresses the limitation of the classical stochastic gradient descent and improves both the effectiveness and the efficiency of the inference. Empirical experiments prove the effectiveness of the LINE on a variety of real-world information networks, including language networks, social networks, and citation networks. The algorithm is very efficient, which is able to learn the embedding of a network with millions of vertices and billions of edges in a few hours on a typical single machine. The source code of the LINE is available online.

Proceedings Article
Mingxing Tan1, Quoc V. Le1
24 May 2019
TL;DR: EfficientNet-B7 as discussed by the authors proposes a new scaling method that uniformly scales all dimensions of depth/width/resolution using a simple yet highly effective compound coefficient, which achieves state-of-the-art accuracy on ImageNet, while being 8.4x smaller and 6.1x faster on inference.
Abstract: Convolutional Neural Networks (ConvNets) are commonly developed at a fixed resource budget, and then scaled up for better accuracy if more resources are available. In this paper, we systematically study model scaling and identify that carefully balancing network depth, width, and resolution can lead to better performance. Based on this observation, we propose a new scaling method that uniformly scales all dimensions of depth/width/resolution using a simple yet highly effective compound coefficient. We demonstrate the effectiveness of this method on scaling up MobileNets and ResNet. To go even further, we use neural architecture search to design a new baseline network and scale it up to obtain a family of models, called EfficientNets, which achieve much better accuracy and efficiency than previous ConvNets. In particular, our EfficientNet-B7 achieves state-of-the-art 84.3% top-1 accuracy on ImageNet, while being 8.4x smaller and 6.1x faster on inference than the best existing ConvNet. Our EfficientNets also transfer well and achieve state-of-the-art accuracy on CIFAR-100 (91.7%), Flowers (98.8%), and 3 other transfer learning datasets, with an order of magnitude fewer parameters. Source code is at this https URL.

Journal ArticleDOI
TL;DR: PartitionFinder 2 is a program for automatically selecting best-fit partitioning schemes and models of evolution for phylogenetic analyses that includes the ability to analyze morphological datasets, new methods to analyze genome-scale datasets, and new output formats to facilitate interoperability with downstream software.
Abstract: PartitionFinder 2 is a program for automatically selecting best-fit partitioning schemes and models of evolution for phylogenetic analyses. PartitionFinder 2 is substantially faster and more efficient than version 1, and incorporates many new methods and features. These include the ability to analyze morphological datasets, new methods to analyze genome-scale datasets, new output formats to facilitate interoperability with downstream software, and many new models of molecular evolution. PartitionFinder 2 is freely available under an open source license and works on Windows, OSX, and Linux operating systems. It can be downloaded from www.robertlanfear.com/partitionfinder. The source code is available at https://github.com/brettc/partitionfinder.

Journal ArticleDOI
Peter A. R. Ade, Nabila Aghanim, Monique Arnaud, Frederico Arroja, M. Ashdown, J. Aumont, Carlo Baccigalupi, Mario Ballardini, A. J. Banday, R. B. Barreiro, Nicola Bartolo, E. Battaner, K. Benabed, Alain Benoit, A. Benoit-Lévy, J.-P. Bernard, Marco Bersanelli, P. Bielewicz, J. J. Bock, Anna Bonaldi, Laura Bonavera, J. R. Bond, Julian Borrill, François R. Bouchet, F. Boulanger, M. Bucher, Carlo Burigana, R. C. Butler, Erminia Calabrese, Jean-François Cardoso, A. Catalano, Anthony Challinor, A. Chamballu, R.-R. Chary, H. C. Chiang, P. R. Christensen, Sarah E. Church, David L. Clements, S. Colombi, L. P. L. Colombo, C. Combet, D. Contreras, F. Couchot, A. Coulais, B. P. Crill, A. Curto, F. Cuttaia, Luigi Danese, R. D. Davies, R. J. Davis, P. de Bernardis, A. de Rosa, G. de Zotti, Jacques Delabrouille, F.-X. Désert, Jose M. Diego, H. Dole, S. Donzelli, Olivier Doré, Marian Douspis, A. Ducout, X. Dupac, George Efstathiou, F. Elsner, Torsten A. Ensslin, H. K. Eriksen, James R. Fergusson, Fabio Finelli, Olivier Forni, M. Frailis, Aurelien A. Fraisse, E. Franceschi, A. Frejsel, Andrei V. Frolov, S. Galeotta, Silvia Galli, K. Ganga, C. Gauthier, M. Giard, Y. Giraud-Héraud, E. Gjerløw, J. González-Nuevo, Krzysztof M. Gorski, Serge Gratton, A. Gregorio, Alessandro Gruppuso, Jon E. Gudmundsson, Jan Hamann, Will Handley, F. K. Hansen, Duncan Hanson, D. L. Harrison, Sophie Henrot-Versille, C. Hernández-Monteagudo, D. Herranz, S. R. Hildebrandt, E. Hivon, Michael P. Hobson, W. A. Holmes 
TL;DR: In this article, the authors report on the implications for cosmic inflation of the 2018 Release of the Planck CMB anisotropy measurements, which are fully consistent with the two previous Planck cosmological releases, but have smaller uncertainties thanks to improvements in the characterization of polarization at low and high multipoles.
Abstract: We report on the implications for cosmic inflation of the 2018 Release of the Planck CMB anisotropy measurements. The results are fully consistent with the two previous Planck cosmological releases, but have smaller uncertainties thanks to improvements in the characterization of polarization at low and high multipoles. Planck temperature, polarization, and lensing data determine the spectral index of scalar perturbations to be $n_\mathrm{s}=0.9649\pm 0.0042$ at 68% CL and show no evidence for a scale dependence of $n_\mathrm{s}.$ Spatial flatness is confirmed at a precision of 0.4% at 95% CL with the combination with BAO data. The Planck 95% CL upper limit on the tensor-to-scalar ratio, $r_{0.002}<0.10$, is further tightened by combining with the BICEP2/Keck Array BK15 data to obtain $r_{0.002}<0.056$. In the framework of single-field inflationary models with Einstein gravity, these results imply that: (a) slow-roll models with a concave potential, $V" (\phi) < 0,$ are increasingly favoured by the data; and (b) two different methods for reconstructing the inflaton potential find no evidence for dynamics beyond slow roll. Non-parametric reconstructions of the primordial power spectrum consistently confirm a pure power law. A complementary analysis also finds no evidence for theoretically motivated parameterized features in the Planck power spectrum, a result further strengthened for certain oscillatory models by a new combined analysis that includes Planck bispectrum data. The new Planck polarization data provide a stringent test of the adiabaticity of the initial conditions. The polarization data also provide improved constraints on inflationary models that predict a small statistically anisotropic quadrupolar modulation of the primordial fluctuations. However, the polarization data do not confirm physical models for a scale-dependent dipolar modulation.

Journal ArticleDOI
23 Mar 2020-JAMA
TL;DR: Since then, the number of cases identified in Italy has rapidly increased, mainly in northern Italy, but all regions of the country have reported having patients with COVID-19, and Italy now has the second largest number of CO VID-19 cases and also has a very high case-fatality rate.
Abstract: Only 3 cases of coronavirus disease 2019 (COVID-19) were identified in Italy in the first half of February 2020 and all involved people who had recently traveled to China. On February 20, 2020, a severe case of pneumonia due to SARS-CoV-2 (severe acute respiratory syndrome coronavirus 2) was diagnosed in northern Italy’s Lombardy region in a man in his 30s who had no history of possible exposure abroad. Within 14 days, many other cases of COVID-19 in the surrounding area were diagnosed, including a substantial number of critically ill patients.1 On the basis of the number of cases and of the advanced stage of the disease it was hypothesized that the virus had been circulating within the population since January. Another cluster of patients with COVID-19 was simultaneously identified in Veneto, which borders Lombardy. Since then, the number of cases identified in Italy has rapidly increased, mainly in northern Italy, but all regions of the country have reported having patients with COVID-19. After China, Italy now has the second largest number of COVID-19 cases2 and also has a very high case-fatality rate.3 This Viewpoint reviews the Italian experience with COVID-19 with an emphasis on fatalities.

Journal ArticleDOI
22 Oct 2015-Cell
TL;DR: In this paper, the authors characterized Cpf1, a putative class 2 CRISPR effector, which is a single RNA-guided endonuclease lacking tracrRNA and utilizes a T-rich protospacer-adjacent motif.

Journal ArticleDOI
TL;DR: The cardiovascular safety profile of dapagliflozin, a selective inhibitor of sodium–glucose cotransporter 2 that promotes glucosuria in patients with type 2 diabetes, is undefined.
Abstract: Background The cardiovascular safety profile of dapagliflozin, a selective inhibitor of sodium–glucose cotransporter 2 that promotes glucosuria in patients with type 2 diabetes, is undefined. Methods We randomly assigned patients with type 2 diabetes who had or were at risk for atherosclerotic cardiovascular disease to receive either dapagliflozin or placebo. The primary safety outcome was a composite of major adverse cardiovascular events (MACE), defined as cardiovascular death, myocardial infarction, or ischemic stroke. The primary efficacy outcomes were MACE and a composite of cardiovascular death or hospitalization for heart failure. Secondary efficacy outcomes were a renal composite (≥40% decrease in estimated glomerular filtration rate to <60 ml per minute per 1.73 m2 of body-surface area, new end-stage renal disease, or death from renal or cardiovascular causes) and death from any cause. Results We evaluated 17,160 patients, including 10,186 without atherosclerotic cardiovascular disease, ...

Proceedings ArticleDOI
Mingxing Tan1, Ruoming Pang1, Quoc V. Le1
14 Jun 2020
TL;DR: EfficientDetD7 as discussed by the authors proposes a weighted bi-directional feature pyramid network (BiFPN), which allows easy and fast multi-scale feature fusion, and a compound scaling method that uniformly scales the resolution, depth, and width for all backbone, feature network, and box/class prediction networks at the same time.
Abstract: Model efficiency has become increasingly important in computer vision. In this paper, we systematically study neural network architecture design choices for object detection and propose several key optimizations to improve efficiency. First, we propose a weighted bi-directional feature pyramid network (BiFPN), which allows easy and fast multi-scale feature fusion; Second, we propose a compound scaling method that uniformly scales the resolution, depth, and width for all backbone, feature network, and box/class prediction networks at the same time. Based on these optimizations and EfficientNet backbones, we have developed a new family of object detectors, called EfficientDet, which consistently achieve much better efficiency than prior art across a wide spectrum of resource constraints. In particular, with single-model and single-scale, our EfficientDetD7 achieves state-of-the-art 52.2 AP on COCO test-dev with 52M parameters and 325B FLOPs, being 4x – 9x smaller and using 13x – 42x fewer FLOPs than previous detector.

Journal ArticleDOI
TL;DR: The Movement Disorder Society PD Criteria retain motor parkinsonism as the core feature of the disease, defined as bradykinesia plus rest tremor or rigidity, and two levels of certainty are delineated: clinically established PD and probable PD.
Abstract: This document presents the Movement Disorder Society Clinical Diagnostic Criteria for Parkinson's disease (PD). The Movement Disorder Society PD Criteria are intended for use in clinical research but also may be used to guide clinical diagnosis. The benchmark for these criteria is expert clinical diagnosis; the criteria aim to systematize the diagnostic process, to make it reproducible across centers and applicable by clinicians with less expertise in PD diagnosis. Although motor abnormalities remain central, increasing recognition has been given to nonmotor manifestations; these are incorporated into both the current criteria and particularly into separate criteria for prodromal PD. Similar to previous criteria, the Movement Disorder Society PD Criteria retain motor parkinsonism as the core feature of the disease, defined as bradykinesia plus rest tremor or rigidity. Explicit instructions for defining these cardinal features are included. After documentation of parkinsonism, determination of PD as the cause of parkinsonism relies on three categories of diagnostic features: absolute exclusion criteria (which rule out PD), red flags (which must be counterbalanced by additional supportive criteria to allow diagnosis of PD), and supportive criteria (positive features that increase confidence of the PD diagnosis). Two levels of certainty are delineated: clinically established PD (maximizing specificity at the expense of reduced sensitivity) and probable PD (which balances sensitivity and specificity). The Movement Disorder Society criteria retain elements proven valuable in previous criteria and omit aspects that are no longer justified, thereby encapsulating diagnosis according to current knowledge. As understanding of PD expands, the Movement Disorder Society criteria will need continuous revision to accommodate these advances.

Journal ArticleDOI
TL;DR: Weyl and Dirac semimetals as discussed by the authors are three-dimensional phases of matter with gapless electronic excitations that are protected by topology and symmetry, and they have generated much recent interest.
Abstract: Weyl and Dirac semimetals are three-dimensional phases of matter with gapless electronic excitations that are protected by topology and symmetry. As three-dimensional analogs of graphene, they have generated much recent interest. Deep connections exist with particle physics models of relativistic chiral fermions, and, despite their gaplessness, to solid-state topological and Chern insulators. Their characteristic electronic properties lead to protected surface states and novel responses to applied electric and magnetic fields. The theoretical foundations of these phases, their proposed realizations in solid-state systems, and recent experiments on candidate materials as well as their relation to other states of matter are reviewed.

Journal ArticleDOI
19 Nov 2015-Nature
TL;DR: It is demonstrated that exosomes from mouse and human lung-, liver- and brain-tropic tumour cells fuse preferentially with resident cells at their predicted destination, namely lung fibroblasts and epithelial cells, liver Kupffer cells and brain endothelial cells.
Abstract: Ever since Stephen Paget's 1889 hypothesis, metastatic organotropism has remained one of cancer's greatest mysteries. Here we demonstrate that exosomes from mouse and human lung-, liver- and brain-tropic tumour cells fuse preferentially with resident cells at their predicted destination, namely lung fibroblasts and epithelial cells, liver Kupffer cells and brain endothelial cells. We show that tumour-derived exosomes uptaken by organ-specific cells prepare the pre-metastatic niche. Treatment with exosomes from lung-tropic models redirected the metastasis of bone-tropic tumour cells. Exosome proteomics revealed distinct integrin expression patterns, in which the exosomal integrins α6β4 and α6β1 were associated with lung metastasis, while exosomal integrin αvβ5 was linked to liver metastasis. Targeting the integrins α6β4 and αvβ5 decreased exosome uptake, as well as lung and liver metastasis, respectively. We demonstrate that exosome integrin uptake by resident cells activates Src phosphorylation and pro-inflammatory S100 gene expression. Finally, our clinical data indicate that exosomal integrins could be used to predict organ-specific metastasis.

Book ChapterDOI
08 Sep 2018
TL;DR: ShuffleNet V2 as discussed by the authors proposes to evaluate the direct metric on the target platform, beyond only considering FLOPs, based on a series of controlled experiments, and derives several practical guidelines for efficient network design.
Abstract: Currently, the neural network architecture design is mostly guided by the indirect metric of computation complexity, i.e., FLOPs. However, the direct metric, e.g., speed, also depends on the other factors such as memory access cost and platform characterics. Thus, this work proposes to evaluate the direct metric on the target platform, beyond only considering FLOPs. Based on a series of controlled experiments, this work derives several practical guidelines for efficient network design. Accordingly, a new architecture is presented, called ShuffleNet V2. Comprehensive ablation experiments verify that our model is the state-of-the-art in terms of speed and accuracy tradeoff.

Proceedings Article
03 Nov 2016
TL;DR: Gumbel-Softmax as mentioned in this paper replaces the non-differentiable samples from a categorical distribution with a differentiable sample from a novel Gumbel softmax distribution, which has the essential property that it can be smoothly annealed into the categorical distributions.
Abstract: Categorical variables are a natural choice for representing discrete structure in the world. However, stochastic neural networks rarely use categorical latent variables due to the inability to backpropagate through samples. In this work, we present an efficient gradient estimator that replaces the non-differentiable sample from a categorical distribution with a differentiable sample from a novel Gumbel-Softmax distribution. This distribution has the essential property that it can be smoothly annealed into a categorical distribution. We show that our Gumbel-Softmax estimator outperforms state-of-the-art gradient estimators on structured output prediction and unsupervised generative modeling tasks with categorical latent variables, and enables large speedups on semi-supervised classification.

Journal ArticleDOI
19 May 2016-Nature
TL;DR: E engineered fusions of CRISPR/Cas9 and a cytidine deaminase enzyme that retain the ability to be programmed with a guide RNA, do not induce dsDNA breaks, and mediate the direct conversion of cytidine to uridine, thereby effecting a C→T (or G→A) substitution.
Abstract: Current genome-editing technologies introduce double-stranded (ds) DNA breaks at a target locus as the first step to gene correction. Although most genetic diseases arise from point mutations, current approaches to point mutation correction are inefficient and typically induce an abundance of random insertions and deletions (indels) at the target locus resulting from the cellular response to dsDNA breaks. Here we report the development of 'base editing', a new approach to genome editing that enables the direct, irreversible conversion of one target DNA base into another in a programmable manner, without requiring dsDNA backbone cleavage or a donor template. We engineered fusions of CRISPR/Cas9 and a cytidine deaminase enzyme that retain the ability to be programmed with a guide RNA, do not induce dsDNA breaks, and mediate the direct conversion of cytidine to uridine, thereby effecting a C→T (or G→A) substitution. The resulting 'base editors' convert cytidines within a window of approximately five nucleotides, and can efficiently correct a variety of point mutations relevant to human disease. In four transformed human and murine cell lines, second- and third-generation base editors that fuse uracil glycosylase inhibitor, and that use a Cas9 nickase targeting the non-edited strand, manipulate the cellular DNA repair response to favour desired base-editing outcomes, resulting in permanent correction of ~15-75% of total cellular DNA with minimal (typically ≤1%) indel formation. Base editing expands the scope and efficiency of genome editing of point mutations.

Posted Content
TL;DR: A convolutional neural network is trained to map raw pixels from a single front-facing camera directly to steering commands and it is argued that this will eventually lead to better performance and smaller systems.
Abstract: We trained a convolutional neural network (CNN) to map raw pixels from a single front-facing camera directly to steering commands. This end-to-end approach proved surprisingly powerful. With minimum training data from humans the system learns to drive in traffic on local roads with or without lane markings and on highways. It also operates in areas with unclear visual guidance such as in parking lots and on unpaved roads. The system automatically learns internal representations of the necessary processing steps such as detecting useful road features with only the human steering angle as the training signal. We never explicitly trained it to detect, for example, the outline of roads. Compared to explicit decomposition of the problem, such as lane marking detection, path planning, and control, our end-to-end system optimizes all processing steps simultaneously. We argue that this will eventually lead to better performance and smaller systems. Better performance will result because the internal components self-optimize to maximize overall system performance, instead of optimizing human-selected intermediate criteria, e.g., lane detection. Such criteria understandably are selected for ease of human interpretation which doesn't automatically guarantee maximum system performance. Smaller networks are possible because the system learns to solve the problem with the minimal number of processing steps. We used an NVIDIA DevBox and Torch 7 for training and an NVIDIA DRIVE(TM) PX self-driving car computer also running Torch 7 for determining where to drive. The system operates at 30 frames per second (FPS).

Journal ArticleDOI
25 Aug 2020-JAMA
TL;DR: This review discusses current evidence regarding the pathophysiology, transmission, diagnosis, and management of COVID-19, the novel severe acute respiratory syndrome coronavirus 2 pandemic that has caused a worldwide sudden and substantial increase in hospitalizations for pneumonia with multiorgan disease.
Abstract: Importance The coronavirus disease 2019 (COVID-19) pandemic, due to the novel severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), has caused a worldwide sudden and substantial increase in hospitalizations for pneumonia with multiorgan disease. This review discusses current evidence regarding the pathophysiology, transmission, diagnosis, and management of COVID-19. Observations SARS-CoV-2 is spread primarily via respiratory droplets during close face-to-face contact. Infection can be spread by asymptomatic, presymptomatic, and symptomatic carriers. The average time from exposure to symptom onset is 5 days, and 97.5% of people who develop symptoms do so within 11.5 days. The most common symptoms are fever, dry cough, and shortness of breath. Radiographic and laboratory abnormalities, such as lymphopenia and elevated lactate dehydrogenase, are common, but nonspecific. Diagnosis is made by detection of SARS-CoV-2 via reverse transcription polymerase chain reaction testing, although false-negative test results may occur in up to 20% to 67% of patients; however, this is dependent on the quality and timing of testing. Manifestations of COVID-19 include asymptomatic carriers and fulminant disease characterized by sepsis and acute respiratory failure. Approximately 5% of patients with COVID-19, and 20% of those hospitalized, experience severe symptoms necessitating intensive care. More than 75% of patients hospitalized with COVID-19 require supplemental oxygen. Treatment for individuals with COVID-19 includes best practices for supportive management of acute hypoxic respiratory failure. Emerging data indicate that dexamethasone therapy reduces 28-day mortality in patients requiring supplemental oxygen compared with usual care (21.6% vs 24.6%; age-adjusted rate ratio, 0.83 [95% CI, 0.74-0.92]) and that remdesivir improves time to recovery (hospital discharge or no supplemental oxygen requirement) from 15 to 11 days. In a randomized trial of 103 patients with COVID-19, convalescent plasma did not shorten time to recovery. Ongoing trials are testing antiviral therapies, immune modulators, and anticoagulants. The case-fatality rate for COVID-19 varies markedly by age, ranging from 0.3 deaths per 1000 cases among patients aged 5 to 17 years to 304.9 deaths per 1000 cases among patients aged 85 years or older in the US. Among patients hospitalized in the intensive care unit, the case fatality is up to 40%. At least 120 SARS-CoV-2 vaccines are under development. Until an effective vaccine is available, the primary methods to reduce spread are face masks, social distancing, and contact tracing. Monoclonal antibodies and hyperimmune globulin may provide additional preventive strategies. Conclusions and Relevance As of July 1, 2020, more than 10 million people worldwide had been infected with SARS-CoV-2. Many aspects of transmission, infection, and treatment remain unclear. Advances in prevention and effective management of COVID-19 will require basic and clinical investigation and public health and clinical interventions.

Journal ArticleDOI
TL;DR: A unified 5-level architecture is proposed as a guideline for implementation of Cyber-Physical Systems (CPS), within which information from all related perspectives is closely monitored and synchronized between the physical factory floor and the cyber computational space.
Abstract: Recent advances in manufacturing industry has paved way for a systematical deployment of Cyber-Physical Systems (CPS), within which information from all related perspectives is closely monitored and synchronized between the physical factory floor and the cyber computational space. Moreover, by utilizing advanced information analytics, networked machines will be able to perform more efficiently, collaboratively and resiliently. Such trend is transforming manufacturing industry to the next generation, namely Industry 4.0. At this early development phase, there is an urgent need for a clear definition of CPS. In this paper, a unified 5-level architecture is proposed as a guideline for implementation of CPS.

Journal ArticleDOI
TL;DR: Patients with refractory large B‐cell lymphoma who received CAR T‐cell therapy with axi‐cel had high levels of durable response, with a safety profile that included myelosuppression, the cytokine release syndrome, and neurologic events.
Abstract: BackgroundIn a phase 1 trial, axicabtagene ciloleucel (axi-cel), an autologous anti-CD19 chimeric antigen receptor (CAR) T-cell therapy, showed efficacy in patients with refractory large B-cell lymphoma after the failure of conventional therapy. MethodsIn this multicenter, phase 2 trial, we enrolled 111 patients with diffuse large B-cell lymphoma, primary mediastinal B-cell lymphoma, or transformed follicular lymphoma who had refractory disease despite undergoing recommended prior therapy. Patients received a target dose of 2×106 anti-CD19 CAR T cells per kilogram of body weight after receiving a conditioning regimen of low-dose cyclophosphamide and fludarabine. The primary end point was the rate of objective response (calculated as the combined rates of complete response and partial response). Secondary end points included overall survival, safety, and biomarker assessments. ResultsAmong the 111 patients who were enrolled, axi-cel was successfully manufactured for 110 (99%) and administered to 101 (91%)....