
Proceedings ArticleDOI
23 Apr 2018
TL;DR: This paper describes Fabric, its architecture, the rationale behind various design decisions, its most prominent implementation aspects, as well as its distributed application programming model, and shows that Fabric achieves end-to-end throughput of more than 3500 transactions per second in certain popular deployment configurations.
Abstract: Fabric is a modular and extensible open-source system for deploying and operating permissioned blockchains and one of the Hyperledger projects hosted by the Linux Foundation (www.hyperledger.org). Fabric is the first truly extensible blockchain system for running distributed applications. It supports modular consensus protocols, which allows the system to be tailored to particular use cases and trust models. Fabric is also the first blockchain system that runs distributed applications written in standard, general-purpose programming languages, without systemic dependency on a native cryptocurrency. This stands in sharp contrast to existing blockchain platforms that require "smart contracts" to be written in domain-specific languages or rely on a cryptocurrency. Fabric realizes the permissioned model using a portable notion of membership, which may be integrated with industry-standard identity management. To support such flexibility, Fabric introduces an entirely novel blockchain design and revamps the way blockchains cope with non-determinism, resource exhaustion, and performance attacks. This paper describes Fabric, its architecture, the rationale behind various design decisions, its most prominent implementation aspects, as well as its distributed application programming model. We further evaluate Fabric by implementing and benchmarking a Bitcoin-inspired digital currency. We show that Fabric achieves end-to-end throughput of more than 3500 transactions per second in certain popular deployment configurations, with sub-second latency, scaling well to over 100 peers.

2,813 citations


Journal ArticleDOI
22 Feb 2018-Nature
TL;DR: Tumours from a large cohort of patients with metastatic urothelial cancer who were treated with an anti-PD-L1 agent were examined and major determinants of clinical outcome were identified and suggested that TGFβ shapes the tumour microenvironment to restrain anti-tumour immunity by restricting T-cell infiltration.
Abstract: Therapeutic antibodies that block the programmed death-1 (PD-1)-programmed death-ligand 1 (PD-L1) pathway can induce robust and durable responses in patients with various cancers, including metastatic urothelial cancer. However, these responses only occur in a subset of patients. Elucidating the determinants of response and resistance is key to improving outcomes and developing new treatment strategies. Here we examined tumours from a large cohort of patients with metastatic urothelial cancer who were treated with an anti-PD-L1 agent (atezolizumab) and identified major determinants of clinical outcome. Response to treatment was associated with CD8+ T-effector cell phenotype and, to an even greater extent, high neoantigen or tumour mutation burden. Lack of response was associated with a signature of transforming growth factor β (TGFβ) signalling in fibroblasts. This occurred particularly in patients whose tumours showed exclusion of CD8+ T cells from the tumour parenchyma; these cells were instead found in the fibroblast- and collagen-rich peritumoural stroma, a common phenotype among patients with metastatic urothelial cancer. Using a mouse model that recapitulates this immune-excluded phenotype, we found that therapeutic co-administration of TGFβ-blocking and anti-PD-L1 antibodies reduced TGFβ signalling in stromal cells, facilitated T-cell penetration into the centre of tumours, and provoked vigorous anti-tumour immunity and tumour regression. Integration of these three independent biological features provides the best basis for understanding patient outcome in this setting and suggests that TGFβ shapes the tumour microenvironment to restrain anti-tumour immunity by restricting T-cell infiltration.

2,808 citations


Journal ArticleDOI
TL;DR: In this paper, the authors provide a classification of the main problems addressed in the literature with respect to the notion of explanation and the type of black box decision support system. Given a problem definition, a black box type, and a desired explanation, this survey should help researchers find the proposals most useful for their own work.
Abstract: In recent years, many accurate decision support systems have been constructed as black boxes, that is as systems that hide their internal logic to the user. This lack of explanation constitutes both a practical and an ethical issue. The literature reports many approaches aimed at overcoming this crucial weakness, sometimes at the cost of sacrificing accuracy for interpretability. The applications in which black box decision systems can be used are various, and each approach is typically developed to provide a solution for a specific problem and, as a consequence, it explicitly or implicitly delineates its own definition of interpretability and explanation. The aim of this article is to provide a classification of the main problems addressed in the literature with respect to the notion of explanation and the type of black box system. Given a problem definition, a black box type, and a desired explanation, this survey should help the researcher to find the proposals more useful for his own work. The proposed classification of approaches to open black box models should also be useful for putting the many research open questions in perspective.

2,805 citations


Journal ArticleDOI
TL;DR: In this article, the international 14C calibration curves for both the Northern and Southern Hemispheres, as well as for the ocean surface layer, have been updated to include a wealth of new data and extended to 55,000 cal BP.
Abstract: Radiocarbon (14C) ages cannot provide absolutely dated chronologies for archaeological or paleoenvironmental studies directly but must be converted to calendar age equivalents using a calibration curve compensating for fluctuations in atmospheric 14C concentration. Although calibration curves are constructed from independently dated archives, they invariably require revision as new data become available and our understanding of the Earth system improves. In this volume the international 14C calibration curves for both the Northern and Southern Hemispheres, as well as for the ocean surface layer, have been updated to include a wealth of new data and extended to 55,000 cal BP. Based on tree rings, IntCal20 now extends as a fully atmospheric record to ca. 13,900 cal BP. For the older part of the timescale, IntCal20 comprises statistically integrated evidence from floating tree-ring chronologies, lacustrine and marine sediments, speleothems, and corals. We utilized improved evaluation of the timescales and location variable 14C offsets from the atmosphere (reservoir age, dead carbon fraction) for each dataset. New statistical methods have refined the structure of the calibration curves while maintaining a robust treatment of uncertainties in the 14C ages, the calendar ages and other corrections. The inclusion of modeled marine reservoir ages derived from a three-dimensional ocean circulation model has allowed us to apply more appropriate reservoir corrections to the marine 14C data rather than the previous use of constant regional offsets from the atmosphere. Here we provide an overview of the new and revised datasets and the associated methods used for the construction of the IntCal20 curve and explore potential regional offsets for tree-ring data. We discuss the main differences with respect to the previous calibration curve, IntCal13, and some of the implications for archaeology and geosciences ranging from the recent past to the time of the extinction of the Neanderthals.

2,800 citations


Journal ArticleDOI
05 Jan 2018-Science
TL;DR: Examination of the oral and gut microbiome of melanoma patients undergoing anti-programmed cell death 1 protein (PD-1) immunotherapy suggested enhanced systemic and antitumor immunity in responding patients with a favorable gut microbiome as well as in germ-free mice receiving fecal transplants from responding patients.
Abstract: Preclinical mouse models suggest that the gut microbiome modulates tumor response to checkpoint blockade immunotherapy; however, this has not been well-characterized in human cancer patients. Here we examined the oral and gut microbiome of melanoma patients undergoing anti-programmed cell death 1 protein (PD-1) immunotherapy (n = 112). Significant differences were observed in the diversity and composition of the patient gut microbiome of responders versus nonresponders. Analysis of patient fecal microbiome samples (n = 43, 30 responders, 13 nonresponders) showed significantly higher alpha diversity (P < 0.01) and relative abundance of bacteria of the Ruminococcaceae family (P < 0.01) in responding patients. Metagenomic studies revealed functional differences in gut bacteria in responders, including enrichment of anabolic pathways. Immune profiling suggested enhanced systemic and antitumor immunity in responding patients with a favorable gut microbiome as well as in germ-free mice receiving fecal transplants from responding patients. Together, these data have important implications for the treatment of melanoma patients with immune checkpoint inhibitors.

2,791 citations


Journal ArticleDOI
TL;DR: This paper reports the experimental discovery of a Weyl semimetal, tantalum arsenide (TaAs), using photoemission spectroscopy, finding that Fermi arcs terminate on the Weyl fermion nodes, consistent with their topological character.

2,789 citations


Proceedings Article
17 Feb 2017
TL;DR: The recently proposed Temporal Ensembling has achieved state-of-the-art results in several semi-supervised learning benchmarks, but it becomes unwieldy when learning large datasets, so Mean Teacher, a method that averages model weights instead of label predictions, is proposed.
Abstract: The recently proposed Temporal Ensembling has achieved state-of-the-art results in several semi-supervised learning benchmarks. It maintains an exponential moving average of label predictions on each training example, and penalizes predictions that are inconsistent with this target. However, because the targets change only once per epoch, Temporal Ensembling becomes unwieldy when learning large datasets. To overcome this problem, we propose Mean Teacher, a method that averages model weights instead of label predictions. As an additional benefit, Mean Teacher improves test accuracy and enables training with fewer labels than Temporal Ensembling. Without changing the network architecture, Mean Teacher achieves an error rate of 4.35% on SVHN with 250 labels, outperforming Temporal Ensembling trained with 1000 labels. We also show that a good network architecture is crucial to performance. Combining Mean Teacher and Residual Networks, we improve the state of the art on CIFAR-10 with 4000 labels from 10.55% to 6.28%, and on ImageNet 2012 with 10% of the labels from 35.24% to 9.11%.
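As an illustration of the weight-averaging idea described above, here is a minimal numpy sketch of an exponential-moving-average (EMA) teacher update; the parameter lists, the alpha value, and the function name are illustrative assumptions, not the paper's released code.

import numpy as np

def ema_update(teacher_params, student_params, alpha=0.99):
    # Teacher weights become an exponential moving average of student weights,
    # in place of the label-prediction averaging used by Temporal Ensembling.
    for t, s in zip(teacher_params, student_params):
        t *= alpha
        t += (1.0 - alpha) * s   # t = alpha * t + (1 - alpha) * s, updated in place

# Toy usage: two "layers" of weights; call after every student optimizer step.
student = [np.random.randn(4, 4), np.random.randn(4)]
teacher = [p.copy() for p in student]
ema_update(teacher, student, alpha=0.99)

During training, the consistency cost would then compare student predictions with the more stable teacher predictions on the same, differently perturbed inputs.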

2,784 citations


Journal ArticleDOI
TL;DR: This work develops a novel framework to discover governing equations underlying a dynamical system simply from data measurements, leveraging advances in sparsity techniques and machine learning and using sparse regression to determine the fewest terms in the dynamic governing equations required to accurately represent the data.
Abstract: Extracting governing equations from data is a central challenge in many diverse areas of science and engineering. Data are abundant whereas models often remain elusive, as in climate science, neuroscience, ecology, finance, and epidemiology, to name only a few examples. In this work, we combine sparsity-promoting techniques and machine learning with nonlinear dynamical systems to discover governing equations from noisy measurement data. The only assumption about the structure of the model is that there are only a few important terms that govern the dynamics, so that the equations are sparse in the space of possible functions; this assumption holds for many physical systems in an appropriate basis. In particular, we use sparse regression to determine the fewest terms in the dynamic governing equations required to accurately represent the data. This results in parsimonious models that balance accuracy with model complexity to avoid overfitting. We demonstrate the algorithm on a wide range of problems, from simple canonical systems, including linear and nonlinear oscillators and the chaotic Lorenz system, to the fluid vortex shedding behind an obstacle. The fluid example illustrates the ability of this method to discover the underlying dynamics of a system that took experts in the community nearly 30 years to resolve. We also show that this method generalizes to parameterized systems and systems that are time-varying or have external forcing.
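The sparse-regression step described above can be summarized in a short sketch; the sequentially thresholded least-squares loop below is one common way to realize it, with illustrative array names and a toy threshold.

import numpy as np

def sparsify_dynamics(Theta, dXdt, lam=0.1, n_iter=10):
    # Theta: (m, p) library of candidate functions evaluated on the data.
    # dXdt:  (m, n) measured or estimated time derivatives.
    # Returns a sparse coefficient matrix Xi with dXdt ≈ Theta @ Xi.
    Xi = np.linalg.lstsq(Theta, dXdt, rcond=None)[0]        # initial dense least-squares fit
    for _ in range(n_iter):
        small = np.abs(Xi) < lam                            # coefficients to prune
        Xi[small] = 0.0
        for k in range(dXdt.shape[1]):                      # refit each equation using only
            keep = ~small[:, k]                             # the surviving library terms
            if keep.any():
                Xi[keep, k] = np.linalg.lstsq(Theta[:, keep], dXdt[:, k], rcond=None)[0]
    return Xi

The few nonzero rows of Xi then name the library terms that enter each governing equation.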

2,784 citations


Journal ArticleDOI
Bin Zhou, Yuan Lu, Kaveh Hajifathalian, James Bentham, and 494 others (170 institutions)
TL;DR: In this article, the authors used a Bayesian hierarchical model to estimate trends in diabetes prevalence, defined as fasting plasma glucose of 7.0 mmol/L or higher, or history of diagnosis with diabetes, or use of insulin or oral hypoglycaemic drugs in 200 countries and territories in 21 regions, by sex and from 1980 to 2014.

2,782 citations


Book ChapterDOI
08 Oct 2016
TL;DR: Temporal Segment Networks (TSN) as discussed by the authors combine a sparse temporal sampling strategy and video-level supervision to enable efficient and effective learning using the whole action video, which obtains the state-of-the-art performance on the datasets of HMDB51 and UCF101.
Abstract: Deep convolutional networks have achieved great success for visual recognition in still images. However, for action recognition in videos, the advantage over traditional methods is not so evident. This paper aims to discover the principles to design effective ConvNet architectures for action recognition in videos and learn these models given limited training samples. Our first contribution is temporal segment network (TSN), a novel framework for video-based action recognition, which is based on the idea of long-range temporal structure modeling. It combines a sparse temporal sampling strategy and video-level supervision to enable efficient and effective learning using the whole action video. The other contribution is our study on a series of good practices in learning ConvNets on video data with the help of temporal segment network. Our approach obtains the state-of-the-art performance on the datasets of HMDB51 (69.4%) and UCF101 (94.2%). We also visualize the learned ConvNet models, which qualitatively demonstrates the effectiveness of temporal segment network and the proposed good practices (Models and code at https://github.com/yjxiong/temporal-segment-networks).
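A minimal sketch of the sparse sampling and segmental consensus described above; the scoring function, segment count, and array shapes are placeholders rather than the released implementation.

import numpy as np

def tsn_video_score(frames, snippet_score_fn, num_segments=3, rng=None):
    # Split the video into equal-duration segments, sample one snippet per segment,
    # score each snippet with a ConvNet stand-in, and average the scores
    # (the segmental consensus) to obtain a video-level prediction.
    rng = rng or np.random.default_rng(0)
    bounds = np.linspace(0, len(frames), num_segments + 1, dtype=int)
    scores = []
    for k in range(num_segments):
        hi = max(bounds[k] + 1, bounds[k + 1])
        idx = rng.integers(bounds[k], hi)          # random snippet index inside segment k
        scores.append(snippet_score_fn(frames[idx]))
    return np.mean(scores, axis=0)

# Toy usage: 90 dummy frames and a dummy scorer over 10 action classes.
dummy_scorer = lambda frame: np.ones(10) * float(np.mean(frame))
video = [np.full((8, 8), i, dtype=float) for i in range(90)]
print(tsn_video_score(video, dummy_scorer, num_segments=3))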

2,778 citations


Posted Content
TL;DR: A systematic evaluation of generic convolutional and recurrent architectures for sequence modeling concludes that the common association between sequence modeling and recurrent networks should be reconsidered, and that convolutional networks should be regarded as a natural starting point for sequence modeling tasks.
Abstract: For most deep learning practitioners, sequence modeling is synonymous with recurrent networks. Yet recent results indicate that convolutional architectures can outperform recurrent networks on tasks such as audio synthesis and machine translation. Given a new sequence modeling task or dataset, which architecture should one use? We conduct a systematic evaluation of generic convolutional and recurrent architectures for sequence modeling. The models are evaluated across a broad range of standard tasks that are commonly used to benchmark recurrent networks. Our results indicate that a simple convolutional architecture outperforms canonical recurrent networks such as LSTMs across a diverse range of tasks and datasets, while demonstrating longer effective memory. We conclude that the common association between sequence modeling and recurrent networks should be reconsidered, and convolutional networks should be regarded as a natural starting point for sequence modeling tasks. To assist related work, we have made code available at this http URL .
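To make the convolutional alternative concrete, here is a small numpy sketch of the causal, dilated 1-D convolution that such architectures stack (the real models add residual connections and increasing dilation); names and shapes are illustrative.

import numpy as np

def causal_dilated_conv1d(x, w, dilation=1):
    # Output at time t depends only on x[t], x[t-d], x[t-2d], ...,
    # so no future information leaks into the prediction.
    T, k = len(x), len(w)
    pad = (k - 1) * dilation
    xp = np.concatenate([np.zeros(pad), x])        # left padding keeps the output causal
    y = np.zeros(T)
    for t in range(T):
        taps = xp[t : t + pad + 1 : dilation]      # the k causally visible samples
        y[t] = np.dot(w[::-1], taps)
    return y

# Stacking layers with dilation 1, 2, 4, ... grows the receptive field exponentially,
# which is what gives these models their long effective memory.
signal = np.arange(8, dtype=float)
print(causal_dilated_conv1d(signal, w=np.array([0.5, 0.5]), dilation=2))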

Posted Content
TL;DR: On the ImageNet-1K dataset, it is empirically show that even under the restricted condition of maintaining complexity, increasing cardinality is able to improve classification accuracy and is more effective than going deeper or wider when the authors increase the capacity.
Abstract: We present a simple, highly modularized network architecture for image classification. Our network is constructed by repeating a building block that aggregates a set of transformations with the same topology. Our simple design results in a homogeneous, multi-branch architecture that has only a few hyper-parameters to set. This strategy exposes a new dimension, which we call "cardinality" (the size of the set of transformations), as an essential factor in addition to the dimensions of depth and width. On the ImageNet-1K dataset, we empirically show that even under the restricted condition of maintaining complexity, increasing cardinality is able to improve classification accuracy. Moreover, increasing cardinality is more effective than going deeper or wider when we increase the capacity. Our models, named ResNeXt, are the foundations of our entry to the ILSVRC 2016 classification task in which we secured 2nd place. We further investigate ResNeXt on an ImageNet-5K set and the COCO detection set, also showing better results than its ResNet counterpart. The code and models are publicly available online.
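A toy numpy sketch of the aggregated-transformations idea: the block output is the input plus a sum over C parallel low-dimensional branches with identical topology, where C is the cardinality. Dense matrices stand in for the real 1x1/3x3/1x1 convolutions, and all names and sizes are illustrative.

import numpy as np

def resnext_block(x, branches):
    # y = x + sum_i T_i(x); each branch T_i projects to a narrow embedding,
    # applies a nonlinearity, and expands back. len(branches) is the cardinality.
    out = x.copy()
    for W_reduce, W_expand in branches:
        h = np.maximum(W_reduce @ x, 0.0)
        out += W_expand @ h
    return out

# Cardinality 32 with bottleneck width 4, loosely mirroring the "32x4d" template.
rng = np.random.default_rng(0)
branches = [(0.05 * rng.standard_normal((4, 256)), 0.05 * rng.standard_normal((256, 4)))
            for _ in range(32)]
y = resnext_block(rng.standard_normal(256), branches)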

Proceedings ArticleDOI
18 Apr 2019
TL;DR: This work presents SpecAugment, a simple data augmentation method for speech recognition that is applied directly to the feature inputs of a neural network (i.e., filter bank coefficients) and achieves state-of-the-art performance on the LibriSpeech 960h and Switchboard 300h tasks, outperforming all prior work.
Abstract: We present SpecAugment, a simple data augmentation method for speech recognition. SpecAugment is applied directly to the feature inputs of a neural network (i.e., filter bank coefficients). The augmentation policy consists of warping the features, masking blocks of frequency channels, and masking blocks of time steps. We apply SpecAugment on Listen, Attend and Spell networks for end-to-end speech recognition tasks. We achieve state-of-the-art performance on the LibriSpeech 960h and Switchboard 300h tasks, outperforming all prior work. On LibriSpeech, we achieve 6.8% WER on test-other without the use of a language model, and 5.8% WER with shallow fusion with a language model. This compares to the previous state-of-the-art hybrid system of 7.5% WER. For Switchboard, we achieve 7.2%/14.6% on the Switchboard/CallHome portion of the Hub5'00 test set without the use of a language model, and 6.8%/14.1% with shallow fusion, which compares to the previous state-of-the-art hybrid system at 8.3%/17.3% WER.
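The masking half of the policy described above is easy to state directly; the numpy sketch below applies frequency and time masks to a spectrogram (time warping is omitted), with illustrative parameter names and mask widths.

import numpy as np

def spec_augment(spec, num_freq_masks=2, F=8, num_time_masks=2, T=20, rng=None):
    # spec: (n_mels, n_frames) filter-bank features; returns a masked copy.
    # F and T bound the width of each frequency / time mask.
    rng = rng or np.random.default_rng()
    out = spec.copy()
    n_mels, n_frames = out.shape
    for _ in range(num_freq_masks):
        f = rng.integers(0, F + 1)                 # mask width
        f0 = rng.integers(0, max(1, n_mels - f))   # first masked channel
        out[f0 : f0 + f, :] = 0.0
    for _ in range(num_time_masks):
        t = rng.integers(0, T + 1)
        t0 = rng.integers(0, max(1, n_frames - t))
        out[:, t0 : t0 + t] = 0.0
    return out

# Toy usage on a random 80-mel, 300-frame "utterance".
masked = spec_augment(np.random.randn(80, 300), rng=np.random.default_rng(1))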

Journal ArticleDOI
TL;DR: For most cancers, 5-year net survival remains among the highest in the world in the USA and Canada, in Australia and New Zealand, and in Finland, Iceland, Norway, and Sweden, while for many cancers, Denmark is closing the survival gap with the other Nordic countries.

Journal ArticleDOI
TL;DR: A binary neutron star coalescence candidate (later designated GW170817) with merger time 12:41:04 UTC was observed through gravitational waves by the Advanced LIGO and Advanced Virgo detectors.
Abstract: On 2017 August 17 a binary neutron star coalescence candidate (later designated GW170817) with merger time 12:41:04 UTC was observed through gravitational waves by the Advanced LIGO and Advanced Virgo detectors. The Fermi Gamma-ray Burst Monitor independently detected a gamma-ray burst (GRB 170817A) with a time delay of $\sim 1.7\,{\rm{s}}$ with respect to the merger time. From the gravitational-wave signal, the source was initially localized to a sky region of 31 deg2 at a luminosity distance of ${40}_{-8}^{+8}$ Mpc and with component masses consistent with neutron stars. The component masses were later measured to be in the range 0.86 to 2.26 $\,{M}_{\odot }$. An extensive observing campaign was launched across the electromagnetic spectrum leading to the discovery of a bright optical transient (SSS17a, now with the IAU identification of AT 2017gfo) in NGC 4993 (at $\sim 40\,{\rm{Mpc}}$) less than 11 hours after the merger by the One-Meter, Two Hemisphere (1M2H) team using the 1 m Swope Telescope. The optical transient was independently detected by multiple teams within an hour. Subsequent observations targeted the object and its environment. Early ultraviolet observations revealed a blue transient that faded within 48 hours. Optical and infrared observations showed a redward evolution over ~10 days. Following early non-detections, X-ray and radio emission were discovered at the transient's position $\sim 9$ and $\sim 16$ days, respectively, after the merger. Both the X-ray and radio emission likely arise from a physical process that is distinct from the one that generates the UV/optical/near-infrared emission. No ultra-high-energy gamma-rays and no neutrino candidates consistent with the source were found in follow-up searches. These observations support the hypothesis that GW170817 was produced by the merger of two neutron stars in NGC 4993 followed by a short gamma-ray burst (GRB 170817A) and a kilonova/macronova powered by the radioactive decay of r-process nuclei synthesized in the ejecta.

Book ChapterDOI
TL;DR: In this article, the authors discuss the reasons for the persistence of corruption that have to do with frequency-dependent equilibria or intertemporal externalities, and suggest that corruption may actually improve efficiency and help growth.
Abstract: Corruption has its adverse effects not just on static efficiency but also on investment and growth. This chapter discusses the reasons for the persistence of corruption that have to do with frequency-dependent equilibria or intertemporal externalities. There are many cases where corruption is mutually beneficial between the official and his client, so neither the briber nor the bribee has an incentive to report or protest, for example, when a customs officer lets contraband through, or a tax auditor purposely overlooks a case of tax evasion, and so on. The idea of multiple equilibria in the incidence of corruption is salient in some of the recent economic theorists' explanations. There is a strand in the corruption literature, contributed both by economists and non-economists, suggesting that, in the context of pervasive and cumbersome regulations in developing countries, corruption may actually improve efficiency and help growth.

Journal ArticleDOI
TL;DR: High-income countries (HICs) continue to have the highest incidence rates for all sites, as well as for lung, colorectal, breast, and prostate cancer, although some low- and middle-income countries (LMICs) now count among those with the highest rates; applied cancer control measures are needed to reduce rates in HICs and arrest the growing burden in LMICs.
Abstract: There are limited published data on recent cancer incidence and mortality trends worldwide. We used the International Agency for Research on Cancer's CANCERMondial clearinghouse to present age-standardized cancer incidence and death rates for 2003-2007. We also present trends in incidence through 2007 and mortality through 2012 for select countries from five continents. High-income countries (HIC) continue to have the highest incidence rates for all sites, as well as for lung, colorectal, breast, and prostate cancer, although some low- and middle-income countries (LMIC) now count among those with the highest rates. Mortality rates from these cancers are declining in many HICs while they are increasing in LMICs. LMICs have the highest rates of stomach, liver, esophageal, and cervical cancer. Although rates remain high in HICs, they are plateauing or decreasing for the most common cancers due to decreases in known risk factors, screening and early detection, and improved treatment (mortality only). In contrast, rates in several LMICs are increasing for these cancers due to increases in smoking, excess body weight, and physical inactivity. LMICs also have a disproportionate burden of infection-related cancers. Applied cancer control measures are needed to reduce rates in HICs and arrest the growing burden in LMICs.

Journal ArticleDOI
TL;DR: This paper aims to demonstrate efforts towards the in-situ applicability of EMMARM, so as to provide real-time information about concrete mechanical properties such as E-modulus and compressive strength.

Journal ArticleDOI
TL;DR: A deep neural network-based approach that improves SP prediction across all domains of life and distinguishes between three types of prokaryotic SPs is presented.
Abstract: Signal peptides (SPs) are short amino acid sequences in the amino terminus of many newly synthesized proteins that target proteins into, or across, membranes. Bioinformatic tools can predict SPs from amino acid sequences, but most cannot distinguish between various types of signal peptides. We present a deep neural network-based approach that improves SP prediction across all domains of life and distinguishes between three types of prokaryotic SPs.

Journal ArticleDOI
TL;DR: Radiomics, the high-throughput mining of quantitative image features from standard-of-care medical imaging that enables data to be extracted and applied within clinical-decision support systems to improve diagnostic, prognostic, and predictive accuracy, is gaining importance in cancer research as mentioned in this paper.
Abstract: Radiomics, the high-throughput mining of quantitative image features from standard-of-care medical imaging that enables data to be extracted and applied within clinical-decision support systems to improve diagnostic, prognostic, and predictive accuracy, is gaining importance in cancer research. Radiomic analysis exploits sophisticated image analysis tools and the rapid development and validation of medical imaging data that uses image-based signatures for precision diagnosis and treatment, providing a powerful tool in modern medicine. Herein, we describe the process of radiomics, its pitfalls, challenges, opportunities, and its capacity to improve clinical decision making, emphasizing the utility for patients with cancer. Currently, the field of radiomics lacks standardized evaluation of both the scientific integrity and the clinical relevance of the numerous published radiomics investigations resulting from the rapid growth of this area. Rigorous evaluation criteria and reporting guidelines need to be established in order for radiomics to mature as a discipline. Herein, we provide guidance for investigations to meet this urgent need in the field of radiomics.

Journal ArticleDOI
TL;DR: The mRNA-1273 vaccine as discussed by the authors is a lipid nanoparticle-encapsulated mRNA-based vaccine that encodes the prefusion stabilized full-length spike protein of the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), the virus that causes Covid-19.
Abstract: Background Vaccines are needed to prevent coronavirus disease 2019 (Covid-19) and to protect persons who are at high risk for complications. The mRNA-1273 vaccine is a lipid nanoparticle-encapsulated mRNA-based vaccine that encodes the prefusion stabilized full-length spike protein of the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), the virus that causes Covid-19. Methods This phase 3 randomized, observer-blinded, placebo-controlled trial was conducted at 99 centers across the United States. Persons at high risk for SARS-CoV-2 infection or its complications were randomly assigned in a 1:1 ratio to receive two intramuscular injections of mRNA-1273 (100 μg) or placebo 28 days apart. The primary end point was prevention of Covid-19 illness with onset at least 14 days after the second injection in participants who had not previously been infected with SARS-CoV-2. Results The trial enrolled 30,420 volunteers who were randomly assigned in a 1:1 ratio to receive either vaccine or placebo (15,210 participants in each group). More than 96% of participants received both injections, and 2.2% had evidence (serologic, virologic, or both) of SARS-CoV-2 infection at baseline. Symptomatic Covid-19 illness was confirmed in 185 participants in the placebo group (56.5 per 1000 person-years; 95% confidence interval [CI], 48.7 to 65.3) and in 11 participants in the mRNA-1273 group (3.3 per 1000 person-years; 95% CI, 1.7 to 6.0); vaccine efficacy was 94.1% (95% CI, 89.3 to 96.8%; P<0.001). Conclusions The mRNA-1273 vaccine showed 94.1% efficacy at preventing Covid-19 illness, including severe disease. Aside from transient local and systemic reactions, no safety concerns were identified. (Funded by the Biomedical Advanced Research and Development Authority and the National Institute of Allergy and Infectious Diseases; COVE ClinicalTrials.gov number, NCT04470427.).
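As a small worked example of how the headline figure follows from the reported incidence rates (a sketch using the rounded numbers quoted above):

# Vaccine efficacy = 1 - (incidence rate, vaccine arm) / (incidence rate, placebo arm),
# using the rates reported above in cases per 1000 person-years.
rate_placebo = 56.5   # 185 symptomatic cases in the placebo group
rate_vaccine = 3.3    # 11 symptomatic cases in the mRNA-1273 group
ve = 1.0 - rate_vaccine / rate_placebo
print(f"Vaccine efficacy point estimate: {ve:.1%}")   # about 94%, matching the reported 94.1% up to rounding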

Proceedings ArticleDOI
07 Dec 2015
TL;DR: A novel semantic segmentation algorithm by learning a deep deconvolution network on top of the convolutional layers adopted from VGG 16-layer net, which demonstrates outstanding performance in PASCAL VOC 2012 dataset.
Abstract: We propose a novel semantic segmentation algorithm by learning a deep deconvolution network. We learn the network on top of the convolutional layers adopted from the VGG 16-layer net. The deconvolution network is composed of deconvolution and unpooling layers, which identify pixelwise class labels and predict segmentation masks. We apply the trained network to each proposal in an input image, and construct the final semantic segmentation map by combining the results from all proposals in a simple manner. The proposed algorithm mitigates the limitations of the existing methods based on fully convolutional networks by integrating deep deconvolution network and proposal-wise prediction; as a result, our segmentation method typically identifies detailed structures and handles objects in multiple scales naturally. Our network demonstrates outstanding performance on the PASCAL VOC 2012 dataset, and we achieve the best accuracy (72.5%) among the methods trained without using the Microsoft COCO dataset through ensemble with the fully convolutional network.
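A small numpy sketch of the unpooling operation central to the deconvolution network: max pooling records which location won each window (the "switches"), and unpooling scatters the pooled values back to those locations, restoring spatial detail that subsequent deconvolution layers then densify. Function names and the 2x2 window are illustrative.

import numpy as np

def max_pool_with_switches(x, k=2):
    # k x k max pooling that also records the argmax positions ("switches").
    H, W = x.shape
    pooled = np.zeros((H // k, W // k))
    switches = np.zeros((H // k, W // k, 2), dtype=int)
    for i in range(H // k):
        for j in range(W // k):
            patch = x[i*k:(i+1)*k, j*k:(j+1)*k]
            r, c = np.unravel_index(np.argmax(patch), patch.shape)
            pooled[i, j] = patch[r, c]
            switches[i, j] = (i*k + r, j*k + c)
    return pooled, switches

def max_unpool(pooled, switches, out_shape):
    # Place each pooled value back at the location that produced it; all else stays zero.
    out = np.zeros(out_shape)
    for i in range(pooled.shape[0]):
        for j in range(pooled.shape[1]):
            r, c = switches[i, j]
            out[r, c] = pooled[i, j]
    return out

x = np.arange(16, dtype=float).reshape(4, 4)
p, sw = max_pool_with_switches(x)
print(max_unpool(p, sw, x.shape))   # sparse map with each maximum restored in place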

Proceedings Article
05 Dec 2016
TL;DR: This work presents a formulation of CNNs in the context of spectral graph theory, which provides the necessary mathematical background and efficient numerical schemes to design fast localized convolutional filters on graphs.
Abstract: In this work, we are interested in generalizing convolutional neural networks (CNNs) from low-dimensional regular grids, where image, video and speech are represented, to high-dimensional irregular domains, such as social networks, brain connectomes or words' embedding, represented by graphs. We present a formulation of CNNs in the context of spectral graph theory, which provides the necessary mathematical background and efficient numerical schemes to design fast localized convolutional filters on graphs. Importantly, the proposed technique offers the same linear computational complexity and constant learning complexity as classical CNNs, while being universal to any graph structure. Experiments on MNIST and 20NEWS demonstrate the ability of this novel deep learning system to learn local, stationary, and compositional features on graphs.
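To make the fast localized filters concrete, here is a numpy sketch of K-localized Chebyshev filtering on a toy graph; the exact eigenvalue computation and dense matrices are simplifications (the appeal of the approach is that only sparse matrix-vector products are needed), and all names are illustrative.

import numpy as np

def chebyshev_graph_filter(L, x, theta):
    # y = sum_k theta[k] * T_k(L_scaled) @ x, with T_k the Chebyshev polynomials
    # and L_scaled = 2 L / lambda_max - I, so the spectrum lies in [-1, 1].
    n = L.shape[0]
    lam_max = np.linalg.eigvalsh(L).max()            # often only approximated in practice
    L_scaled = 2.0 * L / lam_max - np.eye(n)
    T_prev, T_cur = x, L_scaled @ x                  # T_0(L)x = x, T_1(L)x = L_scaled @ x
    y = theta[0] * T_prev
    if len(theta) > 1:
        y = y + theta[1] * T_cur
    for k in range(2, len(theta)):
        T_next = 2.0 * L_scaled @ T_cur - T_prev     # Chebyshev recurrence
        y = y + theta[k] * T_next
        T_prev, T_cur = T_cur, T_next
    return y

# Toy graph: a path on 4 nodes, combinatorial Laplacian L = D - A.
A = np.array([[0,1,0,0],[1,0,1,0],[0,1,0,1],[0,0,1,0]], dtype=float)
L = np.diag(A.sum(axis=1)) - A
print(chebyshev_graph_filter(L, x=np.array([1.0, 0.0, 0.0, 0.0]), theta=np.array([0.5, 0.3, 0.2])))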

Journal ArticleDOI
Dan R. Robinson, Eliezer M. Van Allen, Yi-Mi Wu, Nikolaus Schultz, Robert J. Lonigro, Juan Miguel Mosquera, Bruce Montgomery, Mary-Ellen Taplin, Colin C. Pritchard, Gerhardt Attard, Himisha Beltran, Wassim Abida, Robert K. Bradley, Jake Vinson, Xuhong Cao, Pankaj Vats, Lakshmi P. Kunju, Maha Hussain, Felix Y. Feng, Scott A. Tomlins, Kathleen A. Cooney, David Smith, Christine Brennan, Javed Siddiqui, Rohit Mehra, Yu Chen, Dana E. Rathkopf, Michael J. Morris, Stephen B. Solomon, Jeremy C. Durack, Victor E. Reuter, Anuradha Gopalan, Jianjiong Gao, Massimo Loda, Rosina T. Lis, Michaela Bowden, Stephen P. Balk, Glenn C. Gaviola, Carrie Sougnez, Manaswi Gupta, Evan Y. Yu, Elahe A. Mostaghel, Heather H. Cheng, Hyojeong Mulcahy, Lawrence D. True, Stephen R. Plymate, Heidi Dvinge, Roberta Ferraldeschi, Penny Flohr, Susana Miranda, Zafeiris Zafeiriou, Nina Tunariu, Joaquin Mateo, Raquel Perez-Lopez, Francesca Demichelis, Brian D. Robinson, Marc H. Schiffman, David M. Nanus, Scott T. Tagawa, Alexandros Sigaras, Kenneth Eng, Olivier Elemento, Andrea Sboner, Elisabeth I. Heath, Howard I. Scher, Kenneth J. Pienta, Philip W. Kantoff, Johann S. de Bono, Mark A. Rubin, Peter S. Nelson, Levi A. Garraway, Charles L. Sawyers, Arul M. Chinnaiyan
21 May 2015-Cell
TL;DR: This cohort study provides clinically actionable information that could impact treatment decisions for affected individuals and identifies new genomic alterations in PIK3CA/B, R-spondin, BRAF/RAF1, APC, β-catenin, and ZBTB16/PLZF.

Proceedings Article
06 Aug 2017
TL;DR: In this article, the authors identify two fundamental axioms (sensitivity and implementation invariance) that attribution methods ought to satisfy and use them to guide the design of a new attribution method called Integrated Gradients.
Abstract: We study the problem of attributing the prediction of a deep network to its input features, a problem previously studied by several other works. We identify two fundamental axioms, Sensitivity and Implementation Invariance, that attribution methods ought to satisfy. We show that they are not satisfied by most known attribution methods, which we consider to be a fundamental weakness of those methods. We use the axioms to guide the design of a new attribution method called Integrated Gradients. Our method requires no modification to the original network and is extremely simple to implement; it just needs a few calls to the standard gradient operator. We apply this method to a couple of image models, a couple of text models and a chemistry model, demonstrating its ability to debug networks, to extract rules from a network, and to enable users to engage with models better.
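A minimal numpy sketch of the resulting attribution rule, approximating the path integral with a Riemann sum; the toy quadratic "model" and its analytic gradient stand in for a deep network and its gradient operator.

import numpy as np

def integrated_gradients(x, baseline, grad_fn, steps=50):
    # IG_i = (x_i - x'_i) * integral over a in [0, 1] of dF/dx_i( x' + a (x - x') ) da,
    # approximated by averaging gradients at `steps` midpoints along the straight-line path.
    alphas = (np.arange(steps) + 0.5) / steps
    grads = np.array([grad_fn(baseline + a * (x - baseline)) for a in alphas])
    return (x - baseline) * grads.mean(axis=0)

# Toy model F(x) = sum(x^2): attributions should sum to F(x) - F(baseline) (completeness).
f = lambda z: float(np.sum(z ** 2))
grad_f = lambda z: 2.0 * z
x, baseline = np.array([1.0, 2.0, 3.0]), np.zeros(3)
attr = integrated_gradients(x, baseline, grad_f)
print(attr, attr.sum(), f(x) - f(baseline))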

Journal ArticleDOI
TL;DR: The SQUEEZE method is documented as an alternative means of addressing the solvent disorder issue; it conveniently interfaces with the 2014 version of the least-squares refinement program SHELXL, and many twinned structures containing disordered solvents are now also treatable by SQUEEZE.
Abstract: The completion of a crystal structure determination is often hampered by the presence of embedded solvent molecules or ions that are seriously disordered. Their contribution to the calculated structure factors in the least-squares refinement of a crystal structure has to be included in some way. Traditionally, an atomistic solvent disorder model is attempted. Such an approach is generally to be preferred, but it does not always lead to a satisfactory result and may even be impossible in cases where channels in the structure are filled with continuous electron density. This paper documents the SQUEEZE method as an alternative means of addressing the solvent disorder issue. It conveniently interfaces with the 2014 version of the least-squares refinement program SHELXL [Sheldrick (2015). Acta Cryst. C71. In the press] and other refinement programs that accept externally provided fixed contributions to the calculated structure factors. The PLATON SQUEEZE tool calculates the solvent contribution to the structure factors by back-Fourier transformation of the electron density found in the solvent-accessible region of a phase-optimized difference electron-density map. The actual least-squares structure refinement is delegated to, for example, SHELXL. The current versions of PLATON SQUEEZE and SHELXL now address several of the unnecessary complications with the earlier implementation of the SQUEEZE procedure that were a necessity because least-squares refinement with the now superseded SHELXL97 program did not allow for the input of fixed externally provided contributions to the structure-factor calculation. It is no longer necessary to subtract the solvent contribution temporarily from the observed intensities to be able to use SHELXL for the least-squares refinement, since that program now accepts the solvent contribution from an external file (.fab file) if the ABIN instruction is used. In addition, many twinned structures containing disordered solvents are now also treatable by SQUEEZE. The details of a SQUEEZE calculation are now automatically included in the CIF archive file, along with the unmerged reflection data. The current implementation of the SQUEEZE procedure is described, and discussed and illustrated with three examples. Two of them are based on the reflection data of published structures and one on synthetic reflection data generated for a published structure.

Proceedings ArticleDOI
02 Apr 2017
TL;DR: This work introduces the first practical demonstration of an attacker controlling a remotely hosted DNN with no such knowledge, and finds that this black-box attack strategy is capable of evading defense strategies previously found to make adversarial example crafting harder.
Abstract: Machine learning (ML) models, e.g., deep neural networks (DNNs), are vulnerable to adversarial examples: malicious inputs modified to yield erroneous model outputs, while appearing unmodified to human observers. Potential attacks include having malicious content like malware identified as legitimate or controlling vehicle behavior. Yet, all existing adversarial example attacks require knowledge of either the model internals or its training data. We introduce the first practical demonstration of an attacker controlling a remotely hosted DNN with no such knowledge. Indeed, the only capability of our black-box adversary is to observe labels given by the DNN to chosen inputs. Our attack strategy consists in training a local model to substitute for the target DNN, using inputs synthetically generated by an adversary and labeled by the target DNN. We use the local substitute to craft adversarial examples, and find that they are misclassified by the targeted DNN. To perform a real-world and properly-blinded evaluation, we attack a DNN hosted by MetaMind, an online deep learning API. We find that their DNN misclassifies 84.24% of the adversarial examples crafted with our substitute. We demonstrate the general applicability of our strategy to many ML techniques by conducting the same attack against models hosted by Amazon and Google, using logistic regression substitutes. They yield adversarial examples misclassified by Amazon and Google at rates of 96.19% and 88.94%. We also find that this black-box attack strategy is capable of evading defense strategies previously found to make adversarial example crafting harder.
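The attack loop described above can be sketched end to end on a toy problem: the adversary only queries the target for labels, trains a local substitute on those labels, and transfers gradient-based perturbations crafted on the substitute. Everything below (the linear "target", the logistic-regression substitute, the FGSM-style step) is an illustrative stand-in, not the paper's setup.

import numpy as np

rng = np.random.default_rng(0)

# Hidden "target model" the adversary can only query for labels.
w_target = np.array([2.0, -1.0])
target_label = lambda X: (X @ w_target > 0).astype(float)

# 1) Query the target on synthetic inputs to build a substitute training set.
X = rng.normal(size=(200, 2))
y = target_label(X)

# 2) Train a logistic-regression substitute by gradient descent on those labels.
w_sub = np.zeros(2)
for _ in range(500):
    p = 1.0 / (1.0 + np.exp(-X @ w_sub))
    w_sub -= 0.1 * X.T @ (p - y) / len(X)

# 3) Craft fast-gradient-sign perturbations against the substitute and check transfer.
eps = 0.5
p = 1.0 / (1.0 + np.exp(-X @ w_sub))
grad_x = (p - y)[:, None] * w_sub          # gradient of the substitute's loss w.r.t. the input
X_adv = X + eps * np.sign(grad_x)
print("fraction of queried points whose target label flips:", np.mean(target_label(X_adv) != y))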

Journal ArticleDOI
TL;DR: In this paper, the authors present the most comprehensive estimates of AMR burden to date, built from five broad components: the number of deaths in which infection played a role, the proportion of infectious deaths attributable to a given infectious syndrome, the proportion of infectious syndrome deaths attributable to a given pathogen, the percentage of a given pathogen resistant to an antibiotic of interest, and the excess risk of death or duration of infection associated with this resistance.

Posted Content
TL;DR: This paper proposes the Least Squares Generative Adversarial Networks (LSGANs), which adopt the least squares loss function for the discriminator, and shows that minimizing the objective function of LSGAN yields minimizing the Pearson χ² divergence.
Abstract: Unsupervised learning with generative adversarial networks (GANs) has proven hugely successful. Regular GANs hypothesize the discriminator as a classifier with the sigmoid cross entropy loss function. However, we found that this loss function may lead to the vanishing gradients problem during the learning process. To overcome such a problem, we propose in this paper the Least Squares Generative Adversarial Networks (LSGANs) which adopt the least squares loss function for the discriminator. We show that minimizing the objective function of LSGAN yields minimizing the Pearson $\chi^2$ divergence. There are two benefits of LSGANs over regular GANs. First, LSGANs are able to generate higher quality images than regular GANs. Second, LSGANs perform more stable during the learning process. We evaluate LSGANs on five scene datasets and the experimental results show that the images generated by LSGANs are of better quality than the ones generated by regular GANs. We also conduct two comparison experiments between LSGANs and regular GANs to illustrate the stability of LSGANs.
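Written out explicitly (with one common choice of target labels: 1 for real data and for the generator's target, 0 for generated data), the least-squares objectives sketched above are:

\min_D V(D) = \tfrac{1}{2}\,\mathbb{E}_{x \sim p_{\mathrm{data}}}\!\left[(D(x) - 1)^2\right] + \tfrac{1}{2}\,\mathbb{E}_{z \sim p_z}\!\left[D(G(z))^2\right],
\qquad
\min_G V(G) = \tfrac{1}{2}\,\mathbb{E}_{z \sim p_z}\!\left[(D(G(z)) - 1)^2\right]

Penalizing the squared distance to the target label pushes generated samples toward the decision boundary even when they are already classified correctly, which is where the stronger, non-vanishing gradients come from relative to the sigmoid cross-entropy loss.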

Posted Content
TL;DR: High quality image synthesis results are presented using diffusion probabilistic models, a class of latent variable models inspired by considerations from nonequilibrium thermodynamics, which naturally admit a progressive lossy decompression scheme that can be interpreted as a generalization of autoregressive decoding.
Abstract: We present high quality image synthesis results using diffusion probabilistic models, a class of latent variable models inspired by considerations from nonequilibrium thermodynamics. Our best results are obtained by training on a weighted variational bound designed according to a novel connection between diffusion probabilistic models and denoising score matching with Langevin dynamics, and our models naturally admit a progressive lossy decompression scheme that can be interpreted as a generalization of autoregressive decoding. On the unconditional CIFAR10 dataset, we obtain an Inception score of 9.46 and a state-of-the-art FID score of 3.17. On 256x256 LSUN, we obtain sample quality similar to ProgressiveGAN. Our implementation is available at this https URL
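For reference, the simplified denoising objective associated with this approach, in which a network ε_θ predicts the noise mixed into the data by the forward process, takes the form:

L_{\mathrm{simple}}(\theta) = \mathbb{E}_{t,\,x_0,\,\epsilon}\!\left[\left\lVert \epsilon - \epsilon_\theta\!\left(\sqrt{\bar\alpha_t}\,x_0 + \sqrt{1-\bar\alpha_t}\,\epsilon,\; t\right)\right\rVert^2\right],
\qquad \bar\alpha_t = \prod_{s=1}^{t}(1 - \beta_s),

where β_s is the forward-process noise schedule, t is a uniformly sampled timestep, x_0 is a training image, and ε is standard Gaussian noise.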