
Journal ArticleDOI
TL;DR: The early outbreak data largely follow exponential growth, indicating the potential of 2019-nCoV to cause outbreaks; the impact of variations in the disease reporting rate is also modelled through the exponential growth.

1,561 citations


Journal ArticleDOI
TL;DR: An efficient evaluation tool for 3D medical image segmentation is proposed, implementing 20 evaluation metrics selected from a comprehensive literature review, and guidelines are provided for choosing a subset of these metrics suited to the data and the segmentation task.
Abstract: Medical image segmentation is an important image processing step. Comparing images to evaluate the quality of segmentation is an essential part of measuring progress in this research area. Some of the challenges in evaluating medical segmentation include metric selection, the use in the literature of multiple definitions for certain metrics, inefficiency of the metric calculation implementations leading to difficulties with large volumes, and lack of support for fuzzy segmentation by existing metrics. First, we present an overview of 20 evaluation metrics selected based on a comprehensive literature review. For fuzzy segmentation, which shows the level of membership of each voxel to multiple classes, fuzzy definitions of all metrics are provided. We present a discussion about metric properties to provide a guide for selecting evaluation metrics. Finally, we propose an efficient evaluation tool implementing the 20 selected metrics. The tool is optimized to perform efficiently in terms of speed and required memory, even when the image size is extremely large, as in the case of whole-body MRI or CT volume segmentation. An implementation of this tool is available as an open-source project. We propose an efficient evaluation tool for 3D medical image segmentation using 20 evaluation metrics and provide guidelines for selecting a subset of these metrics that is suitable for the data and the segmentation task.
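
As a small illustration of the kind of overlap metric the tool implements, the NumPy sketch below computes a binary Dice coefficient and a soft (fuzzy) variant for membership maps. It is an illustrative sketch only, not the authors' tool or their exact fuzzy definitions.

```python
import numpy as np

def dice_binary(seg: np.ndarray, gt: np.ndarray) -> float:
    """Dice overlap for binary (0/1) segmentations."""
    seg = seg.astype(bool)
    gt = gt.astype(bool)
    intersection = np.logical_and(seg, gt).sum()
    denom = seg.sum() + gt.sum()
    return 2.0 * intersection / denom if denom > 0 else 1.0

def dice_soft(seg: np.ndarray, gt: np.ndarray, eps: float = 1e-8) -> float:
    """A soft Dice for fuzzy memberships in [0, 1] (illustrative definition)."""
    intersection = (seg * gt).sum()
    return (2.0 * intersection + eps) / (seg.sum() + gt.sum() + eps)

# Toy 3D volumes, e.g. a small patch of a CT segmentation.
pred = np.zeros((4, 4, 4)); pred[:2] = 1
truth = np.zeros((4, 4, 4)); truth[:3] = 1
print(dice_binary(pred, truth))      # 0.8
print(dice_soft(pred * 0.9, truth))  # slightly below 0.8
```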

1,561 citations


Journal ArticleDOI
TL;DR: This work proposes a novel hybrid densely connected UNet (H-DenseUNet), which consists of a 2-D Dense UNet for efficiently extracting intra-slice features and a 3-D counterpart for hierarchically aggregating volumetric contexts under the spirit of the auto-context algorithm for liver and tumor segmentation.
Abstract: Liver cancer is one of the leading causes of cancer death. To assist doctors in hepatocellular carcinoma diagnosis and treatment planning, an accurate and automatic liver and tumor segmentation method is in high demand in clinical practice. Recently, fully convolutional neural networks (FCNs), including 2-D and 3-D FCNs, serve as the backbone in many volumetric image segmentation methods. However, 2-D convolutions cannot fully leverage the spatial information along the third dimension, while 3-D convolutions suffer from high computational cost and GPU memory consumption. To address these issues, we propose a novel hybrid densely connected UNet (H-DenseUNet), which consists of a 2-D DenseUNet for efficiently extracting intra-slice features and a 3-D counterpart for hierarchically aggregating volumetric contexts under the spirit of the auto-context algorithm for liver and tumor segmentation. We formulate the learning process of the H-DenseUNet in an end-to-end manner, where the intra-slice representations and inter-slice features can be jointly optimized through a hybrid feature fusion layer. We extensively evaluated our method on the data set of the MICCAI 2017 Liver Tumor Segmentation Challenge and the 3DIRCADb data set. Our method outperformed other state-of-the-art methods on tumor segmentation and achieved very competitive performance for liver segmentation even with a single model.

1,561 citations


Journal ArticleDOI
25 Feb 2020-JAMA
TL;DR: Yet another pathogenic HCoV, 2019 novel coronavirus (2019-nCoV), was recognized in Wuhan, China, and has caused serious illness and death, and the ultimate scope and effect of this outbreak is unclear at present.
Abstract: Human coronaviruses (HCoVs) have long been considered inconsequential pathogens, causing the “common cold” in otherwise healthy people. However, in the 21st century, 2 highly pathogenic HCoVs—severe acute respiratory syndrome coronavirus (SARS-CoV) and Middle East respiratory syndrome coronavirus (MERS-CoV)—emerged from animal reservoirs to cause global epidemics with alarming morbidity and mortality. In December 2019, yet another pathogenic HCoV, 2019 novel coronavirus (2019-nCoV), was recognized in Wuhan, China, and has caused serious illness and death. The ultimate scope and effect of this outbreak is unclear at present as the situation is rapidly evolving. Coronaviruses are large, enveloped, positive-strand RNA viruses that can be divided into 4 genera: alpha, beta, delta, and gamma, of which alpha and beta CoVs are known to infect humans.1 Four HCoVs (HCoV 229E, NL63, OC43, and HKU1) are endemic globally and account for 10% to 30% of upper respiratory tract infections in adults. Coronaviruses are ecologically diverse with the greatest variety seen in bats, suggesting that they are the reservoirs for many of these viruses.2 Peridomestic mammals may serve as intermediate hosts, facilitating recombination and mutation events with expansion of genetic diversity.

1,561 citations


Proceedings ArticleDOI
01 Jul 2017
TL;DR: This work proposes a multi-scale convolutional neural network that restores sharp images in an end-to-end manner where blur is caused by various sources and presents a new large-scale dataset that provides pairs of realistic blurry image and the corresponding ground truth sharp image that are obtained by a high-speed camera.
Abstract: Non-uniform blind deblurring for general dynamic scenes is a challenging computer vision problem, as blurs arise not only from multiple object motions but also from camera shake and scene depth variation. To remove these complicated motion blurs, conventional energy optimization based methods rely on simple assumptions, such as the blur kernel being partially uniform or locally linear. Moreover, recent machine learning based methods also depend on synthetic blur datasets generated under these assumptions. This makes conventional deblurring methods fail to remove blurs where the blur kernel is difficult to approximate or parameterize (e.g. object motion boundaries). In this work, we propose a multi-scale convolutional neural network that restores sharp images in an end-to-end manner where blur is caused by various sources. In addition, we present a multi-scale loss function that mimics conventional coarse-to-fine approaches. Furthermore, we propose a new large-scale dataset that provides pairs of realistic blurry images and the corresponding ground-truth sharp images obtained with a high-speed camera. With the proposed model trained on this dataset, we demonstrate empirically that our method achieves state-of-the-art performance in dynamic scene deblurring not only qualitatively, but also quantitatively.
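
The multi-scale loss described above can be read as an MSE between the prediction and the ground truth at each level of an image pyramid. Below is a hedged PyTorch sketch of that reading; the pyramid depth, per-level weights, and bilinear downsampling are illustrative assumptions rather than the paper's exact settings.

```python
import torch
import torch.nn.functional as F

def multiscale_mse_loss(preds, sharp, weights=None):
    """preds: list of predictions, coarsest to finest, one per pyramid level.
    sharp: the full-resolution ground-truth sharp image (N, C, H, W)."""
    if weights is None:
        weights = [1.0] * len(preds)
    loss = 0.0
    for pred, w in zip(preds, weights):
        # Downsample the ground truth to the prediction's resolution.
        target = F.interpolate(sharp, size=pred.shape[-2:],
                               mode='bilinear', align_corners=False)
        loss = loss + w * F.mse_loss(pred, target)
    return loss

# Toy usage: three pyramid levels for a 64x64 image.
sharp = torch.rand(2, 3, 64, 64)
preds = [torch.rand(2, 3, s, s) for s in (16, 32, 64)]
print(multiscale_mse_loss(preds, sharp))
```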

1,560 citations


Journal ArticleDOI
TL;DR: Findings indicate that the opioid overdose epidemic is worsening and there is a need for continued action to prevent opioid abuse, dependence, and death, improve treatment capacity for opioid use disorders, and reduce the supply of illicit opioids, particularly heroin and illicit fentanyl.
Abstract: The United States is experiencing an epidemic of drug overdose (poisoning) deaths. Since 2000, the rate of deaths from drug overdoses has increased 137%, including a 200% increase in the rate of overdose deaths involving opioids (opioid pain relievers and heroin). CDC analyzed recent multiple cause-of-death mortality data to examine current trends and characteristics of drug overdose deaths, including the types of opioids associated with drug overdose deaths. During 2014, a total of 47,055 drug overdose deaths occurred in the United States, representing a 1-year increase of 6.5%, from 13.8 per 100,000 persons in 2013 to 14.7 per 100,000 persons in 2014. The rate of drug overdose deaths increased significantly for both sexes, persons aged 25-44 years and ≥55 years, non-Hispanic whites and non-Hispanic blacks, and in the Northeastern, Midwestern, and Southern regions of the United States. Rates of opioid overdose deaths also increased significantly, from 7.9 per 100,000 in 2013 to 9.0 per 100,000 in 2014, a 14% increase. Historically, CDC has programmatically characterized all opioid pain reliever deaths (natural and semisynthetic opioids, methadone, and other synthetic opioids) as "prescription" opioid overdoses (1). Between 2013 and 2014, the age-adjusted rate of death involving methadone remained unchanged; however, the age-adjusted rate of death involving natural and semisynthetic opioid pain relievers, heroin, and synthetic opioids, other than methadone (e.g., fentanyl) increased 9%, 26%, and 80%, respectively. The sharp increase in deaths involving synthetic opioids, other than methadone, in 2014 coincided with law enforcement reports of increased availability of illicitly manufactured fentanyl, a synthetic opioid; however, illicitly manufactured fentanyl cannot be distinguished from prescription fentanyl in death certificate data. These findings indicate that the opioid overdose epidemic is worsening. There is a need for continued action to prevent opioid abuse, dependence, and death, improve treatment capacity for opioid use disorders, and reduce the supply of illicit opioids, particularly heroin and illicit fentanyl.
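
As a quick check, the percentage changes quoted above follow directly from the reported rates; the short calculation below only reuses the figures given in the abstract.

```python
# Quick arithmetic check of the percentage changes quoted above (2013 -> 2014 rates).
rates = {
    "all drug overdose deaths (per 100,000)": (13.8, 14.7),
    "opioid overdose deaths (per 100,000)":   (7.9, 9.0),
}

for label, (r2013, r2014) in rates.items():
    pct_change = (r2014 - r2013) / r2013 * 100
    print(f"{label}: {pct_change:.1f}% increase")
# -> roughly 6.5% and 13.9% (reported as ~14%)
```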

1,559 citations


Posted Content
TL;DR: This work introduces a new, efficient, principled and backpropagation-compatible algorithm for learning a probability distribution on the weights of a neural network, called Bayes by Backprop, and shows how the learnt uncertainty in the weights can be used to improve generalisation in non-linear regression problems.
Abstract: We introduce a new, efficient, principled and backpropagation-compatible algorithm for learning a probability distribution on the weights of a neural network, called Bayes by Backprop. It regularises the weights by minimising a compression cost, known as the variational free energy or the expected lower bound on the marginal likelihood. We show that this principled kind of regularisation yields comparable performance to dropout on MNIST classification. We then demonstrate how the learnt uncertainty in the weights can be used to improve generalisation in non-linear regression problems, and how this weight uncertainty can be used to drive the exploration-exploitation trade-off in reinforcement learning.
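
The mechanics of Bayes by Backprop (a factorized Gaussian posterior over the weights, sampled with the reparameterization trick and trained by minimizing a KL term plus the data loss) can be sketched compactly. The PyTorch snippet below is a minimal illustration using a single linear layer and a unit Gaussian prior, not the authors' implementation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class BayesianLinear(nn.Module):
    """A linear layer with a factorized Gaussian posterior over its weights."""
    def __init__(self, n_in, n_out):
        super().__init__()
        self.w_mu = nn.Parameter(torch.zeros(n_out, n_in))
        self.w_rho = nn.Parameter(torch.full((n_out, n_in), -3.0))  # sigma = softplus(rho)
        self.b = nn.Parameter(torch.zeros(n_out))

    def forward(self, x):
        sigma = F.softplus(self.w_rho)
        eps = torch.randn_like(sigma)
        w = self.w_mu + sigma * eps          # reparameterized weight sample
        # KL(q(w) || N(0, 1)) for a factorized Gaussian posterior and unit Gaussian prior.
        self.kl = (torch.log(1.0 / sigma) + (sigma ** 2 + self.w_mu ** 2) / 2 - 0.5).sum()
        return F.linear(x, w, self.b)

# Toy regression step: loss = data term + KL (the variational free energy).
layer = BayesianLinear(4, 1)
x, y = torch.randn(32, 4), torch.randn(32, 1)
loss = F.mse_loss(layer(x), y, reduction='sum') + layer.kl
loss.backward()
print(float(loss))
```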

1,558 citations


Proceedings Article
05 Dec 2016
TL;DR: The authors apply this variational inference based dropout technique in LSTM and GRU models, assessing it on language modelling and sentiment analysis tasks, and to the best of their knowledge improve on the single model state-of-the-art in language modelling with the Penn Treebank (73.4 test perplexity).
Abstract: Recurrent neural networks (RNNs) stand at the forefront of many recent developments in deep learning. Yet a major difficulty with these models is their tendency to overfit, with dropout shown to fail when applied to recurrent layers. Recent results at the intersection of Bayesian modelling and deep learning offer a Bayesian interpretation of common deep learning techniques such as dropout. This grounding of dropout in approximate Bayesian inference suggests an extension of the theoretical results, offering insights into the use of dropout with RNN models. We apply this new variational inference based dropout technique in LSTM and GRU models, assessing it on language modelling and sentiment analysis tasks. The new approach outperforms existing techniques, and to the best of our knowledge improves on the single model state-of-the-art in language modelling with the Penn Treebank (73.4 test perplexity). This extends our arsenal of variational tools in deep learning.
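
The key implementation detail of this variational dropout is that the dropout masks are sampled once per sequence and reused at every time step, for both the inputs and the recurrent state, rather than resampled at each step. A minimal PyTorch sketch of that idea using an LSTMCell is shown below; it is illustrative only and omits the embedding and output dropout used in the paper.

```python
import torch
import torch.nn as nn

def variational_dropout_lstm(x, cell, p=0.25):
    """x: (seq_len, batch, input_size). The same dropout masks are reused at every step."""
    batch, hidden = x.size(1), cell.hidden_size
    keep = 1.0 - p
    # Sample the masks ONCE per sequence (the 'variational' part).
    in_mask = torch.bernoulli(torch.full((batch, cell.input_size), keep)) / keep
    h_mask = torch.bernoulli(torch.full((batch, hidden), keep)) / keep
    h = torch.zeros(batch, hidden)
    c = torch.zeros(batch, hidden)
    outputs = []
    for t in range(x.size(0)):
        h, c = cell(x[t] * in_mask, (h * h_mask, c))
        outputs.append(h)
    return torch.stack(outputs)

cell = nn.LSTMCell(input_size=8, hidden_size=16)
out = variational_dropout_lstm(torch.randn(5, 3, 8), cell)
print(out.shape)  # torch.Size([5, 3, 16])
```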

1,557 citations


Journal ArticleDOI
TL;DR: This work organizes and analyzes what has been learned from the past 35 years of work on emotion and decision making and proposes the emotion-imbued choice model, which accounts for inputs from traditional rational choice theory and from newer emotion research, synthesizing scientific models.
Abstract: A revolution in the science of emotion has emerged in recent decades, with the potential to create a paradigm shift in decision theories. The research reveals that emotions constitute potent, pervasive, predictable, sometimes harmful and sometimes beneficial drivers of decision making. Across different domains, important regularities appear in the mechanisms through which emotions influence judgments and choices. We organize and analyze what has been learned from the past 35 years of work on emotion and decision making. In so doing, we propose the emotion-imbued choice model, which accounts for inputs from traditional rational choice theory and from newer emotion research, synthesizing scientific models.

1,556 citations


Journal ArticleDOI
TL;DR: Several selected pharmaceutical and biomedical applications are presented, in which chitin and chitosan are recognized as new biomaterials taking advantage of their biocompatibility and biodegradability.
Abstract: This review describes the most common methods for recovery of chitin from marine organisms. In depth, both enzymatic and chemical treatments for the step of deproteinization are compared, as well as different conditions for demineralization. The conditions of chitosan preparation are also discussed, since they significantly impact the synthesis of chitosan with varying degree of acetylation (DA) and molecular weight (MW). In addition, the main characterization techniques applied for chitin and chitosan are recalled, pointing out the role of their solubility in relation with the chemical structure (mainly the acetyl group distribution along the backbone). Biological activities are also presented, such as: antibacterial, antifungal, antitumor and antioxidant. Interestingly, the relationship between chemical structure and biological activity is demonstrated for chitosan molecules with different DA and MW and homogeneous distribution of acetyl groups for the first time. In the end, several selected pharmaceutical and biomedical applications are presented, in which chitin and chitosan are recognized as new biomaterials taking advantage of their biocompatibility and biodegradability.

1,554 citations


Journal ArticleDOI
TL;DR: ColabFold as discussed by the authors combines the fast homology search of MMseqs2 with AlphaFold2 or RoseTTAFold for protein folding and achieves 40-60-fold faster search and optimized model utilization.
Abstract: ColabFold offers accelerated prediction of protein structures and complexes by combining the fast homology search of MMseqs2 with AlphaFold2 or RoseTTAFold. ColabFold's 40-60-fold faster search and optimized model utilization enables prediction of close to 1,000 structures per day on a server with one graphics processing unit. Coupled with Google Colaboratory, ColabFold becomes a free and accessible platform for protein folding. ColabFold is open-source software available at https://github.com/sokrypton/ColabFold and its novel environmental databases are available at https://colabfold.mmseqs.com .

Journal ArticleDOI
01 Apr 2016
TL;DR: ASD prevalence estimates for children aged 8 years living in catchment areas of the ADDM Network sites in 2012 are provided, overall and stratified by sex, race/ethnicity, and the type of source records (education and health records versus health records only).
Abstract: PROBLEM/CONDITION Autism spectrum disorder (ASD). PERIOD COVERED 2012. DESCRIPTION OF SYSTEM The Autism and Developmental Disabilities Monitoring (ADDM) Network is an active surveillance system that provides estimates of the prevalence and characteristics of ASD among children aged 8 years whose parents or guardians reside in 11 ADDM Network sites in the United States (Arkansas, Arizona, Colorado, Georgia, Maryland, Missouri, New Jersey, North Carolina, South Carolina, Utah, and Wisconsin). Surveillance to determine ASD case status is conducted in two phases. The first phase consists of screening and abstracting comprehensive evaluations performed by professional service providers in the community. Data sources identified for record review are categorized as either 1) education source type, including developmental evaluations to determine eligibility for special education services or 2) health care source type, including diagnostic and developmental evaluations. The second phase involves the review of all abstracted evaluations by trained clinicians to determine ASD surveillance case status. A child meets the surveillance case definition for ASD if one or more comprehensive evaluations of that child completed by a qualified professional describes behaviors that are consistent with the Diagnostic and Statistical Manual of Mental Disorders, Fourth Edition, Text Revision diagnostic criteria for any of the following conditions: autistic disorder, pervasive developmental disorder-not otherwise specified (including atypical autism), or Asperger disorder. This report provides ASD prevalence estimates for children aged 8 years living in catchment areas of the ADDM Network sites in 2012, overall and stratified by sex, race/ethnicity, and the type of source records (education and health records versus health records only). In addition, this report describes the proportion of children with ASD with a score consistent with intellectual disability on a standardized intellectual ability test, the age at which the earliest known comprehensive evaluation was performed, the proportion of children with a previous ASD diagnosis, the specific type of ASD diagnosis, and any special education eligibility classification. RESULTS For 2012, the combined estimated prevalence of ASD among the 11 ADDM Network sites was 14.5 per 1,000 (one in 69) children aged 8 years. Estimated prevalence was significantly higher among boys aged 8 years (23.4 per 1,000) than among girls aged 8 years (5.2 per 1,000). Estimated ASD prevalence was significantly higher among non-Hispanic white children aged 8 years (15.3 per 1,000) compared with non-Hispanic black children (13.1 per 1,000), and Hispanic (10.2 per 1,000) children aged 8 years. Estimated prevalence varied widely among the 11 ADDM Network sites, ranging from 8.2 per 1,000 children aged 8 years (in the area of the Maryland site where only health care records were reviewed) to 24.6 per 1,000 children aged 8 years (in New Jersey, where both education and health care records were reviewed). Estimated prevalence was higher in surveillance sites where education records and health records were reviewed compared with sites where health records only were reviewed (17.1 per 1,000 and 10.4 per 1,000 children aged 8 years, respectively; p<0.05). Among children identified with ASD by the ADDM Network, 82% had a previous ASD diagnosis or educational classification; this did not vary by sex or between non-Hispanic white and non-Hispanic black children. 
A lower percentage of Hispanic children (78%) had a previous ASD diagnosis or classification compared with non-Hispanic white children (82%) and with non-Hispanic black children (84%). The median age at earliest known comprehensive evaluation was 40 months, and 43% of children had received an earliest known comprehensive evaluation by age 36 months. The percentage of children with an earliest known comprehensive evaluation by age 36 months was similar for boys and girls, but was higher for non-Hispanic white children (45%) compared with non-Hispanic black children (40%) and Hispanic children (39%). INTERPRETATION Overall estimated ASD prevalence was 14.5 per 1,000 children aged 8 years in the ADDM Network sites in 2012. The higher estimated prevalence among sites that reviewed both education and health records suggests the role of special education systems in providing comprehensive evaluations and services to children with developmental disabilities. Disparities by race/ethnicity in estimated ASD prevalence, particularly for Hispanic children, as well as disparities in the age of earliest comprehensive evaluation and presence of a previous ASD diagnosis or classification, suggest that access to treatment and services might be lacking or delayed for some children. PUBLIC HEALTH ACTION The ADDM Network will continue to monitor the prevalence and characteristics of ASD among children aged 8 years living in selected sites across the United States. Recommendations from the ADDM Network include enhancing strategies to 1) lower the age of first evaluation of ASD by community providers in accordance with the Healthy People 2020 goal that children with ASD are evaluated by age 36 months and begin receiving community-based support and services by age 48 months; 2) reduce disparities by race/ethnicity in identified ASD prevalence, the age of first comprehensive evaluation, and presence of a previous ASD diagnosis or classification; and 3) assess the effect on ASD prevalence of the revised ASD diagnostic criteria published in the Diagnostic and Statistical Manual of Mental Disorders, Fifth Edition.

Journal ArticleDOI
03 Apr 2020
TL;DR: A Distance-IoU (DIoU) loss is proposed that incorporates the normalized distance between the predicted box and the target box and converges much faster in training than the IoU and GIoU losses; a Complete IoU (CIoU) loss that further accounts for overlap area and aspect ratio leads to even faster convergence and better performance.
Abstract: Bounding box regression is the crucial step in object detection. In existing methods, while ln-norm loss is widely adopted for bounding box regression, it is not tailored to the evaluation metric, i.e., Intersection over Union (IoU). Recently, IoU loss and generalized IoU (GIoU) loss have been proposed to benefit the IoU metric, but still suffer from the problems of slow convergence and inaccurate regression. In this paper, we propose a Distance-IoU (DIoU) loss by incorporating the normalized distance between the predicted box and the target box, which converges much faster in training than IoU and GIoU losses. Furthermore, this paper summarizes three geometric factors in bounding box regression, i.e., overlap area, central point distance and aspect ratio, based on which a Complete IoU (CIoU) loss is proposed, thereby leading to faster convergence and better performance. By incorporating DIoU and CIoU losses into state-of-the-art object detection algorithms, e.g., YOLO v3, SSD and Faster R-CNN, we achieve notable performance gains in terms of not only IoU metric but also GIoU metric. Moreover, DIoU can be easily adopted into non-maximum suppression (NMS) to act as the criterion, further boosting performance improvement. The source code and trained models are available at https://github.com/Zzh-tju/DIoU.
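
The DIoU loss is the IoU loss plus a penalty on the squared distance between the box centers, normalized by the squared diagonal of the smallest enclosing box. The sketch below follows that formula for boxes given as (x1, y1, x2, y2); it is written from the formula, not taken from the authors' released code.

```python
import torch

def diou_loss(pred, target, eps=1e-7):
    """pred, target: (N, 4) boxes as (x1, y1, x2, y2). Returns per-box DIoU loss."""
    # Intersection and union -> IoU.
    x1 = torch.max(pred[:, 0], target[:, 0]); y1 = torch.max(pred[:, 1], target[:, 1])
    x2 = torch.min(pred[:, 2], target[:, 2]); y2 = torch.min(pred[:, 3], target[:, 3])
    inter = (x2 - x1).clamp(min=0) * (y2 - y1).clamp(min=0)
    area_p = (pred[:, 2] - pred[:, 0]) * (pred[:, 3] - pred[:, 1])
    area_t = (target[:, 2] - target[:, 0]) * (target[:, 3] - target[:, 1])
    iou = inter / (area_p + area_t - inter + eps)

    # Squared distance between box centers.
    cpx, cpy = (pred[:, 0] + pred[:, 2]) / 2, (pred[:, 1] + pred[:, 3]) / 2
    ctx, cty = (target[:, 0] + target[:, 2]) / 2, (target[:, 1] + target[:, 3]) / 2
    center_dist2 = (cpx - ctx) ** 2 + (cpy - cty) ** 2

    # Squared diagonal of the smallest enclosing box.
    ex1 = torch.min(pred[:, 0], target[:, 0]); ey1 = torch.min(pred[:, 1], target[:, 1])
    ex2 = torch.max(pred[:, 2], target[:, 2]); ey2 = torch.max(pred[:, 3], target[:, 3])
    diag2 = (ex2 - ex1) ** 2 + (ey2 - ey1) ** 2 + eps

    return 1.0 - iou + center_dist2 / diag2

print(diou_loss(torch.tensor([[0., 0., 2., 2.]]), torch.tensor([[1., 1., 3., 3.]])))
```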

Journal ArticleDOI
TL;DR: Res2Net as mentioned in this paper constructs hierarchical residual-like connections within one single residual block to represent multi-scale features at a granular level and increases the range of receptive fields for each network layer.
Abstract: Representing features at multiple scales is of great importance for numerous vision tasks. Recent advances in backbone convolutional neural networks (CNNs) continually demonstrate stronger multi-scale representation ability, leading to consistent performance gains on a wide range of applications. However, most existing methods represent the multi-scale features in a layer-wise manner. In this paper, we propose a novel building block for CNNs, namely Res2Net, by constructing hierarchical residual-like connections within one single residual block. The Res2Net represents multi-scale features at a granular level and increases the range of receptive fields for each network layer. The proposed Res2Net block can be plugged into the state-of-the-art backbone CNN models, e.g., ResNet, ResNeXt, and DLA. We evaluate the Res2Net block on all these models and demonstrate consistent performance gains over baseline models on widely-used datasets, e.g., CIFAR-100 and ImageNet. Further ablation studies and experimental results on representative computer vision tasks, i.e., object detection, class activation mapping, and salient object detection, further verify the superiority of the Res2Net over the state-of-the-art baseline methods. The source code and trained models are available on https://mmcheng.net/res2net/ .
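
The hierarchical residual-like connections inside one block can be made concrete in a few lines: the input channels are split into s groups, and each group after the first passes through a 3x3 convolution whose input also includes the previous group's output. The PyTorch sketch below follows that scheme in simplified form, omitting the 1x1 bottleneck convolutions, batch normalization, and optional SE module of the full block.

```python
import torch
import torch.nn as nn

class Res2NetSplit(nn.Module):
    """Simplified Res2Net-style multi-scale split (scale s groups inside one block)."""
    def __init__(self, channels, scale=4):
        super().__init__()
        assert channels % scale == 0
        self.scale = scale
        self.width = channels // scale
        # One 3x3 conv per group except the first, which is passed through untouched.
        self.convs = nn.ModuleList(
            nn.Conv2d(self.width, self.width, kernel_size=3, padding=1)
            for _ in range(scale - 1)
        )

    def forward(self, x):
        xs = torch.split(x, self.width, dim=1)
        ys = [xs[0]]
        prev = None
        for i, conv in enumerate(self.convs):
            inp = xs[i + 1] if prev is None else xs[i + 1] + prev
            prev = conv(inp)
            ys.append(prev)
        return torch.cat(ys, dim=1)

block = Res2NetSplit(channels=64, scale=4)
print(block(torch.randn(1, 64, 32, 32)).shape)  # torch.Size([1, 64, 32, 32])
```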

Journal ArticleDOI
TL;DR: A comprehensive and structured analysis of various graph embedding techniques proposed in the literature, and the open-source Python library, named GEM (Graph Embedding Methods, available at https://github.com/palash1992/GEM ), which provides all presented algorithms within a unified interface to foster and facilitate research on the topic.
Abstract: Graphs, such as social networks, word co-occurrence networks, and communication networks, occur naturally in various real-world applications. Analyzing them yields insight into the structure of society, language, and different patterns of communication. Many approaches have been proposed to perform the analysis. Recently, methods which use the representation of graph nodes in vector space have gained traction from the research community. In this survey, we provide a comprehensive and structured analysis of various graph embedding techniques proposed in the literature. We first introduce the embedding task and its challenges such as scalability, choice of dimensionality, and features to be preserved, and their possible solutions. We then present three categories of approaches based on factorization methods, random walks, and deep learning, with examples of representative algorithms in each category and analysis of their performance on various tasks. We evaluate these state-of-the-art methods on a few common datasets and compare their performance against one another. Our analysis concludes by suggesting some potential applications and future directions. We finally present the open-source Python library we developed, named GEM (Graph Embedding Methods, available at https://github.com/palash1992/GEM ), which provides all presented algorithms within a unified interface to foster and facilitate research on the topic.
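
Of the three families the survey covers, the factorization-based approaches are the quickest to illustrate: embed nodes by factorizing a matrix derived from the graph, here a truncated SVD of the adjacency matrix. The NumPy sketch below is a generic illustration of that idea and is not the GEM library's API.

```python
import numpy as np

def factorization_embedding(adj: np.ndarray, dim: int) -> np.ndarray:
    """Embed nodes via truncated SVD of the (symmetric) adjacency matrix."""
    u, s, _ = np.linalg.svd(adj)
    # Scale the top-`dim` singular vectors by the square root of the singular values.
    return u[:, :dim] * np.sqrt(s[:dim])

# Toy graph: a 4-cycle (0-1-2-3-0).
adj = np.array([[0, 1, 0, 1],
                [1, 0, 1, 0],
                [0, 1, 0, 1],
                [1, 0, 1, 0]], dtype=float)
emb = factorization_embedding(adj, dim=2)
print(emb.shape)  # (4, 2)
# Nodes 0 and 2 have identical neighbourhoods and get identical embeddings.
print(np.allclose(emb[0], emb[2], atol=1e-6))
```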

Posted ContentDOI
30 Oct 2015-bioRxiv
TL;DR: The aggregation and analysis of high-quality exome (protein-coding region) sequence data for 60,706 individuals of diverse ethnicities generated as part of the Exome Aggregation Consortium (ExAC) provides direct evidence for the presence of widespread mutational recurrence.
Abstract: Large-scale reference data sets of human genetic variation are critical for the medical and functional interpretation of DNA sequence changes. Here we describe the aggregation and analysis of high-quality exome (protein-coding region) sequence data for 60,706 individuals of diverse ethnicities. The resulting catalogue of human genetic diversity has unprecedented resolution, with an average of one variant every eight bases of coding sequence and the presence of widespread mutational recurrence. The deep catalogue of variation provided by the Exome Aggregation Consortium (ExAC) can be used to calculate objective metrics of pathogenicity for sequence variants, and to identify genes subject to strong selection against various classes of mutation; we identify 3,230 genes with near-complete depletion of truncating variants, 79% of which have no currently established human disease phenotype. Finally, we show that these data can be used for the efficient filtering of candidate disease-causing variants, and for the discovery of human knockout variants in protein-coding genes.

Journal ArticleDOI
TL;DR: The use of maintenance therapy with olaparib provided a substantial benefit with regard to progression‐free survival among women with newly diagnosed advanced ovarian cancer and a BRCA1/2 mutation, with a 70% lower risk of disease progression or death with olaparib than with placebo.
Abstract: Background Most women with newly diagnosed advanced ovarian cancer have a relapse within 3 years after standard treatment with surgery and platinum-based chemotherapy. The benefit of the o...

Journal ArticleDOI
TL;DR: In this paper, the authors provide an overview of the latest NOMA research and innovations as well as their applications in 5G wireless networks, and discuss future research challenges.
Abstract: Non-orthogonal multiple access (NOMA) is an essential enabling technology for the fifth-generation (5G) wireless networks to meet the heterogeneous demands on low latency, high reliability, massive connectivity, improved fairness, and high throughput. The key idea behind NOMA is to serve multiple users in the same resource block, such as a time slot, subcarrier, or spreading code. The NOMA principle is a general framework, and several recently proposed 5G multiple access schemes can be viewed as special cases. This survey provides an overview of the latest NOMA research and innovations as well as their applications. Thereby, the papers published in this special issue are put into the context of the existing literature. Future research challenges regarding NOMA in 5G and beyond are also discussed.
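
The "serve multiple users in the same resource block" idea is easiest to see in power-domain NOMA: two users' symbols are superposed with different power levels, and the near user applies successive interference cancellation, decoding and subtracting the far user's high-power signal before decoding its own. The NumPy sketch below is a toy illustration with BPSK symbols and no fading, not a standards-compliant implementation.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 10_000
# BPSK symbols for a weak (far) user and a strong (near) user.
s_weak = rng.choice([-1.0, 1.0], n)
s_strong = rng.choice([-1.0, 1.0], n)

# Superposition coding: most of the power goes to the weak user.
p_weak, p_strong = 0.8, 0.2
tx = np.sqrt(p_weak) * s_weak + np.sqrt(p_strong) * s_strong
rx = tx + 0.05 * rng.standard_normal(n)          # near user's received signal (low noise)

# Successive interference cancellation at the near (strong) user:
weak_hat = np.sign(rx)                            # 1) decode the high-power signal first
residual = rx - np.sqrt(p_weak) * weak_hat        # 2) subtract it
strong_hat = np.sign(residual)                    # 3) decode own signal

print("weak-user BER  :", np.mean(weak_hat != s_weak))
print("strong-user BER:", np.mean(strong_hat != s_strong))
```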

Journal ArticleDOI
TL;DR: Considering that the treatment goal for burnout is usually to enable people to return to their job, and to be successful in their work, psychiatry could make an important contribution by identifying the treatment strategies that would be most effective in achieving that goal.

Journal Article
TL;DR: MLlib as mentioned in this paper is an open-source distributed machine learning library for Apache Spark that provides efficient functionality for a wide range of learning settings and includes several underlying statistical, optimization, and linear algebra primitives.
Abstract: Apache Spark is a popular open-source platform for large-scale data processing that is well-suited for iterative machine learning tasks. In this paper we present MLlib, Spark's open-source distributed machine learning library. MLlib provides efficient functionality for a wide range of learning settings and includes several underlying statistical, optimization, and linear algebra primitives. Shipped with Spark, MLlib supports several languages and provides a high-level API that leverages Spark's rich ecosystem to simplify the development of end-to-end machine learning pipelines. MLlib has experienced rapid growth due to its vibrant open-source community of over 140 contributors, and includes extensive documentation to support further growth and to let users quickly get up to speed.
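
For a flavour of the pipeline API the abstract mentions, here is a short PySpark example using the DataFrame-based pyspark.ml pipeline interface that ships alongside MLlib; the toy data and feature choices are ours, not from the paper.

```python
from pyspark.sql import SparkSession
from pyspark.ml import Pipeline
from pyspark.ml.feature import Tokenizer, HashingTF
from pyspark.ml.classification import LogisticRegression

spark = SparkSession.builder.appName("mllib-pipeline-demo").getOrCreate()

# Tiny toy training set: (id, text, label).
train = spark.createDataFrame([
    (0, "spark is fast and scalable", 1.0),
    (1, "slow single machine script", 0.0),
    (2, "distributed machine learning with spark", 1.0),
    (3, "manual spreadsheet analysis", 0.0),
], ["id", "text", "label"])

# Feature extraction and model, chained into one pipeline.
tokenizer = Tokenizer(inputCol="text", outputCol="words")
hashing_tf = HashingTF(inputCol="words", outputCol="features", numFeatures=1 << 10)
lr = LogisticRegression(maxIter=10, regParam=0.01)
model = Pipeline(stages=[tokenizer, hashing_tf, lr]).fit(train)

test = spark.createDataFrame([(4, "spark pipeline example")], ["id", "text"])
model.transform(test).select("id", "prediction").show()
spark.stop()
```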

Proceedings Article
06 Jul 2015
TL;DR: It is found that adding a bias of 1 to the LSTM's forget gate closes the gap between the LSTM and the recently-introduced Gated Recurrent Unit (GRU) on some but not all tasks.
Abstract: The Recurrent Neural Network (RNN) is an extremely powerful sequence model that is often difficult to train. The Long Short-Term Memory (LSTM) is a specific RNN architecture whose design makes it much easier to train. While wildly successful in practice, the LSTM's architecture appears to be ad-hoc so it is not clear if it is optimal, and the significance of its individual components is unclear. In this work, we aim to determine whether the LSTM architecture is optimal or whether much better architectures exist. We conducted a thorough architecture search where we evaluated over ten thousand different RNN architectures, and identified an architecture that outperforms both the LSTM and the recently-introduced Gated Recurrent Unit (GRU) on some but not all tasks. We found that adding a bias of 1 to the LSTM's forget gate closes the gap between the LSTM and the GRU.
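
The forget-gate finding is a one-line initialization change. A PyTorch illustration follows (the paper predates PyTorch): nn.LSTM packs each bias vector as the concatenation of input, forget, cell, and output gate slices, so the forget slice is the second quarter.

```python
import torch.nn as nn

def set_forget_gate_bias(lstm: nn.LSTM, value: float = 1.0) -> None:
    """Initialize the (summed) forget-gate bias of every layer to `value`."""
    hidden = lstm.hidden_size
    for name, param in lstm.named_parameters():
        if not name.startswith("bias_"):
            continue
        # Gate order in PyTorch is [input | forget | cell | output]; bias_ih and
        # bias_hh are summed, so put `value` in one slice and 0 in the other.
        fill = value if name.startswith("bias_ih") else 0.0
        param.data[hidden:2 * hidden].fill_(fill)

lstm = nn.LSTM(input_size=32, hidden_size=64, num_layers=2)
set_forget_gate_bias(lstm, 1.0)
print(lstm.bias_ih_l0[64:128])  # all ones
```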

Journal ArticleDOI
TL;DR: Among patients undergoing resection of stage IIIB, IIIC, or IV melanoma, adjuvant therapy with nivolumab resulted in significantly longer recurrence‐free survival and a lower rate of grade 3 or 4 adverse events than adjuvant therapy with ipilimumab.
Abstract: BackgroundNivolumab and ipilimumab are immune checkpoint inhibitors that have been approved for the treatment of advanced melanoma. In the United States, ipilimumab has also been approved as adjuvant therapy for melanoma on the basis of recurrence-free and overall survival rates that were higher than those with placebo in a phase 3 trial. We wanted to determine the efficacy of nivolumab versus ipilimumab for adjuvant therapy in patients with resected advanced melanoma. MethodsIn this randomized, double-blind, phase 3 trial, we randomly assigned 906 patients (≥15 years of age) who were undergoing complete resection of stage IIIB, IIIC, or IV melanoma to receive an intravenous infusion of either nivolumab at a dose of 3 mg per kilogram of body weight every 2 weeks (453 patients) or ipilimumab at a dose of 10 mg per kilogram every 3 weeks for four doses and then every 12 weeks (453 patients). The patients were treated for a period of up to 1 year or until disease recurrence, a report of unacceptable toxic ef...

Journal ArticleDOI
TL;DR: A reversible photo-induced instability has been found in mixed-halide photovoltaic perovskites that limits the open circuit voltage in solar cells.
Abstract: We report on reversible, light-induced transformations in (CH3NH3)Pb(BrxI1−x)3. Photoluminescence (PL) spectra of these perovskites develop a new, red-shifted peak at 1.68 eV that grows in intensity under constant, 1-sun illumination in less than a minute. This is accompanied by an increase in sub-bandgap absorption at ∼1.7 eV, indicating the formation of luminescent trap states. Light soaking causes a splitting of X-ray diffraction (XRD) peaks, suggesting segregation into two crystalline phases. Surprisingly, these photo-induced changes are fully reversible; the XRD patterns and the PL and absorption spectra revert to their initial states after the materials are left for a few minutes in the dark. We speculate that photoexcitation may cause halide segregation into iodide-rich minority and bromide-enriched majority domains, the former acting as a recombination center trap. This instability may limit achievable voltages from some mixed-halide perovskite solar cells and could have implications for the photostability of halide perovskites used in optoelectronics.

Journal ArticleDOI
TL;DR: Among patients with persistent atrial fibrillation, there was no reduction in the rate of recurrent atrialfibrillation when either linear ablation or ablation of complex fractionated electrograms was performed in addition to pulmonary-vein isolation.
Abstract: BackgroundCatheter ablation is less successful for persistent atrial fibrillation than for paroxysmal atrial fibrillation. Guidelines suggest that adjuvant substrate modification in addition to pulmonary-vein isolation is required in persistent atrial fibrillation. MethodsWe randomly assigned 589 patients with persistent atrial fibrillation in a 1:4:4 ratio to ablation with pulmonary-vein isolation alone (67 patients), pulmonary-vein isolation plus ablation of electrograms showing complex fractionated activity (263 patients), or pulmonary-vein isolation plus additional linear ablation across the left atrial roof and mitral valve isthmus (259 patients). The duration of follow-up was 18 months. The primary end point was freedom from any documented recurrence of atrial fibrillation lasting longer than 30 seconds after a single ablation procedure. ResultsProcedure time was significantly shorter for pulmonary-vein isolation alone than for the other two procedures (P<0.001). After 18 months, 59% of patients ass...

Proceedings ArticleDOI
Konstantinos Bousmalis, Nathan Silberman, David Dohan, Dumitru Erhan, Dilip Krishnan
01 Jul 2017
TL;DR: In this paper, a generative adversarial network (GAN)-based method adapts source-domain images to appear as if drawn from the target domain by learning in an unsupervised manner a transformation in the pixel space from one domain to another.
Abstract: Collecting well-annotated image datasets to train modern machine learning algorithms is prohibitively expensive for many tasks. One appealing alternative is rendering synthetic data where ground-truth annotations are generated automatically. Unfortunately, models trained purely on rendered images fail to generalize to real images. To address this shortcoming, prior work introduced unsupervised domain adaptation algorithms that have tried to either map representations between the two domains, or learn to extract features that are domain-invariant. In this work, we approach the problem in a new light by learning in an unsupervised manner a transformation in the pixel space from one domain to the other. Our generative adversarial network (GAN)-based method adapts source-domain images to appear as if drawn from the target domain. Our approach not only produces plausible samples, but also outperforms the state-of-the-art on a number of unsupervised domain adaptation scenarios by large margins. Finally, we demonstrate that the adaptation process generalizes to object classes unseen during training.

Proceedings Article
Jonas Gehring, Michael Auli, David Grangier, Denis Yarats, Yann N. Dauphin
08 May 2017
TL;DR: The authors introduced an architecture based entirely on convolutional neural networks, where computations over all elements can be fully parallelized during training and optimization is easier since the number of nonlinearities is fixed and independent of the input length.
Abstract: The prevalent approach to sequence to sequence learning maps an input sequence to a variable length output sequence via recurrent neural networks. We introduce an architecture based entirely on convolutional neural networks. Compared to recurrent models, computations over all elements can be fully parallelized during training and optimization is easier since the number of non-linearities is fixed and independent of the input length. Our use of gated linear units eases gradient propagation and we equip each decoder layer with a separate attention module. We outperform the accuracy of the deep LSTM setup of Wu et al. (2016) on both WMT'14 English-German and WMT'14 English-French translation at an order of magnitude faster speed, both on GPU and CPU.
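
The gated linear unit used here splits each convolution's output channels in two and uses one half, passed through a sigmoid, to gate the other: GLU(A, B) = A * sigmoid(B). A minimal PyTorch sketch follows; PyTorch also ships this operation as torch.nn.functional.glu.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class GLUConv1d(nn.Module):
    """1-D convolution followed by a gated linear unit over the channel dimension."""
    def __init__(self, in_ch, out_ch, kernel_size=3):
        super().__init__()
        # Produce 2 * out_ch channels: half are values, half are gates.
        self.conv = nn.Conv1d(in_ch, 2 * out_ch, kernel_size, padding=kernel_size // 2)

    def forward(self, x):                  # x: (batch, in_ch, length)
        a, b = self.conv(x).chunk(2, dim=1)
        return a * torch.sigmoid(b)        # equivalent to F.glu(self.conv(x), dim=1)

layer = GLUConv1d(in_ch=16, out_ch=32)
x = torch.randn(4, 16, 50)
out = layer(x)
print(out.shape)                                          # torch.Size([4, 32, 50])
print(torch.allclose(out, F.glu(layer.conv(x), dim=1)))   # True
```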

Proceedings Article
05 Dec 2016
TL;DR: This work proposes coupled generative adversarial network (CoGAN), which can learn a joint distribution without any tuple of corresponding images, and applies it to several joint distribution learning tasks, and demonstrates its applications to domain adaptation and image transformation.
Abstract: We propose the coupled generative adversarial nets (CoGAN) framework for generating pairs of corresponding images in two different domains. The framework consists of a pair of generative adversarial nets, each responsible for generating images in one domain. We show that by enforcing a simple weight-sharing constraint, the CoGAN learns to generate pairs of corresponding images without existence of any pairs of corresponding images in the two domains in the training set. In other words, the CoGAN learns a joint distribution of images in the two domains from images drawn separately from the marginal distributions of the individual domains. This is in contrast to the existing multi-modal generative models, which require corresponding images for training. We apply the CoGAN to several pair image generation tasks. For each task, the CoGAN learns to generate convincing pairs of corresponding images. We further demonstrate the applications of the CoGAN framework for the domain adaptation and cross-domain image generation tasks.

Journal ArticleDOI
TL;DR: Venous thromboembolism is a complex disease, involving interactions between acquired or inherited predispositions to thrombosis and VTE risk factors, including increasing patient age and obesity, hospitalization for surgery or acute illness, nursing-home confinement, active cancer, trauma or fracture, immobility or leg paresis, superficial vein thromBosis, and, in women, pregnancy and puerperium.
Abstract: Venous thromboembolism (VTE) is categorized by the U.S. Surgeon General as a major public health problem. VTE is relatively common and associated with reduced survival and substantial health-care costs, and recurs frequently. VTE is a complex (multifactorial) disease, involving interactions between acquired or inherited predispositions to thrombosis and VTE risk factors, including increasing patient age and obesity, hospitalization for surgery or acute illness, nursing-home confinement, active cancer, trauma or fracture, immobility or leg paresis, superficial vein thrombosis, and, in women, pregnancy and puerperium, oral contraception, and hormone therapy. Although independent VTE risk factors and predictors of VTE recurrence have been identified, and effective primary and secondary prophylaxis is available, the occurrence of VTE seems to be relatively constant, or even increasing.

Book ChapterDOI
08 Sep 2018
TL;DR: BiSeNet as discussed by the authors designs a spatial path with a small stride to preserve the spatial information and generate high-resolution features, while a context path with fast downsampling strategy is employed to obtain sufficient receptive field.
Abstract: Semantic segmentation requires both rich spatial information and a sizeable receptive field. However, modern approaches usually compromise spatial resolution to achieve real-time inference speed, which leads to poor performance. In this paper, we address this dilemma with a novel Bilateral Segmentation Network (BiSeNet). We first design a Spatial Path with a small stride to preserve the spatial information and generate high-resolution features. Meanwhile, a Context Path with a fast downsampling strategy is employed to obtain a sufficient receptive field. On top of the two paths, we introduce a new Feature Fusion Module to combine features efficiently. The proposed architecture strikes the right balance between speed and segmentation performance on the Cityscapes, CamVid, and COCO-Stuff datasets. Specifically, for a 2048 × 1024 input, we achieve 68.4% mean IoU on the Cityscapes test dataset at 105 FPS on a single NVIDIA Titan XP card, which is significantly faster than existing methods with comparable performance.
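
The Feature Fusion Module that combines the two paths amounts to concatenation, a convolution, and a channel-attention reweighting. The PyTorch sketch below is a simplified illustration with assumed layer sizes; batch normalization and other details from the paper are omitted.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class FeatureFusionModule(nn.Module):
    """Simplified BiSeNet-style fusion of spatial-path and context-path features."""
    def __init__(self, ch_spatial, ch_context, ch_out):
        super().__init__()
        self.fuse = nn.Conv2d(ch_spatial + ch_context, ch_out, kernel_size=1)
        # Channel attention over the fused features.
        self.att = nn.Sequential(
            nn.Conv2d(ch_out, ch_out // 4, kernel_size=1), nn.ReLU(inplace=True),
            nn.Conv2d(ch_out // 4, ch_out, kernel_size=1), nn.Sigmoid(),
        )

    def forward(self, spatial, context):
        feat = self.fuse(torch.cat([spatial, context], dim=1))
        weights = self.att(F.adaptive_avg_pool2d(feat, 1))
        return feat + feat * weights       # reweighted residual fusion

ffm = FeatureFusionModule(ch_spatial=128, ch_context=128, ch_out=256)
out = ffm(torch.randn(1, 128, 64, 64), torch.randn(1, 128, 64, 64))
print(out.shape)  # torch.Size([1, 256, 64, 64])
```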

Proceedings ArticleDOI
10 Aug 2015
TL;DR: Wang et al. as discussed by the authors proposed a hierarchical Bayesian model called collaborative deep learning (CDL), which jointly performs deep representation learning for the content information and collaborative filtering for the ratings (feedback) matrix.
Abstract: Collaborative filtering (CF) is a successful approach commonly used by many recommender systems. Conventional CF-based methods use the ratings given to items by users as the sole source of information for learning to make recommendations. However, the ratings are often very sparse in many applications, causing CF-based methods to degrade significantly in their recommendation performance. To address this sparsity problem, auxiliary information such as item content information may be utilized. Collaborative topic regression (CTR) is an appealing recent method taking this approach, which tightly couples the two components that learn from two different sources of information. Nevertheless, the latent representation learned by CTR may not be very effective when the auxiliary information is very sparse. To address this problem, we generalize recent advances in deep learning from i.i.d. input to non-i.i.d. (CF-based) input and propose in this paper a hierarchical Bayesian model called collaborative deep learning (CDL), which jointly performs deep representation learning for the content information and collaborative filtering for the ratings (feedback) matrix. Extensive experiments on three real-world datasets from different domains show that CDL can significantly advance the state of the art.