scispace - formally typeset
Search or ask a question
Book ChapterDOI

I and J

01 Jan 2012-pp 62-66
About: The article was published on 2012-01-01. It has received 139059 citations till now.
Citations
More filters
Proceedings Article
01 Jan 2015
TL;DR: In this paper, the authors investigated the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting and showed that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 layers.
Abstract: In this work we investigate the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting. Our main contribution is a thorough evaluation of networks of increasing depth using an architecture with very small (3x3) convolution filters, which shows that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 weight layers. These findings were the basis of our ImageNet Challenge 2014 submission, where our team secured the first and the second places in the localisation and classification tracks respectively. We also show that our representations generalise well to other datasets, where they achieve state-of-the-art results. We have made our two best-performing ConvNet models publicly available to facilitate further research on the use of deep visual representations in computer vision.

49,914 citations

Posted Content
TL;DR: This paper presents a general end-to-end approach to sequence learning that makes minimal assumptions on the sequence structure, and finds that reversing the order of the words in all source sentences improved the LSTM's performance markedly, because doing so introduced many short term dependencies between the source and the target sentence which made the optimization problem easier.
Abstract: Deep Neural Networks (DNNs) are powerful models that have achieved excellent performance on difficult learning tasks. Although DNNs work well whenever large labeled training sets are available, they cannot be used to map sequences to sequences. In this paper, we present a general end-to-end approach to sequence learning that makes minimal assumptions on the sequence structure. Our method uses a multilayered Long Short-Term Memory (LSTM) to map the input sequence to a vector of a fixed dimensionality, and then another deep LSTM to decode the target sequence from the vector. Our main result is that on an English to French translation task from the WMT'14 dataset, the translations produced by the LSTM achieve a BLEU score of 34.8 on the entire test set, where the LSTM's BLEU score was penalized on out-of-vocabulary words. Additionally, the LSTM did not have difficulty on long sentences. For comparison, a phrase-based SMT system achieves a BLEU score of 33.3 on the same dataset. When we used the LSTM to rerank the 1000 hypotheses produced by the aforementioned SMT system, its BLEU score increases to 36.5, which is close to the previous best result on this task. The LSTM also learned sensible phrase and sentence representations that are sensitive to word order and are relatively invariant to the active and the passive voice. Finally, we found that reversing the order of the words in all source sentences (but not target sentences) improved the LSTM's performance markedly, because doing so introduced many short term dependencies between the source and the target sentence which made the optimization problem easier.

11,936 citations


Additional excerpts

  • ...While it could work in pr i ciple since the RNN is provided with all the relevant information, it would be difficult to tr ain the RNNs due to the resulting long term dependencies [14, 4] (figure 1) [16, 15]....

    [...]

Journal Article
TL;DR: The first direct detection of gravitational waves and the first observation of a binary black hole merger were reported in this paper, with a false alarm rate estimated to be less than 1 event per 203,000 years, equivalent to a significance greater than 5.1σ.
Abstract: On September 14, 2015 at 09:50:45 UTC the two detectors of the Laser Interferometer Gravitational-Wave Observatory simultaneously observed a transient gravitational-wave signal. The signal sweeps upwards in frequency from 35 to 250 Hz with a peak gravitational-wave strain of 1.0×10(-21). It matches the waveform predicted by general relativity for the inspiral and merger of a pair of black holes and the ringdown of the resulting single black hole. The signal was observed with a matched-filter signal-to-noise ratio of 24 and a false alarm rate estimated to be less than 1 event per 203,000 years, equivalent to a significance greater than 5.1σ. The source lies at a luminosity distance of 410(-180)(+160) Mpc corresponding to a redshift z=0.09(-0.04)(+0.03). In the source frame, the initial black hole masses are 36(-4)(+5)M⊙ and 29(-4)(+4)M⊙, and the final black hole mass is 62(-4)(+4)M⊙, with 3.0(-0.5)(+0.5)M⊙c(2) radiated in gravitational waves. All uncertainties define 90% credible intervals. These observations demonstrate the existence of binary stellar-mass black hole systems. This is the first direct detection of gravitational waves and the first observation of a binary black hole merger.

4,375 citations

Journal ArticleDOI
TL;DR: A comprehensive survey of recent work on modified theories of gravity and their cosmological consequences can be found in this article, where the authors provide a reference tool for researchers and students in cosmology and gravitational physics, as well as a selfcontained, comprehensive and up-to-date introduction to the subject as a whole.

3,674 citations


Cites background from "I and J"

  • ...The simplest and most well studied scenario is to consider co-dimension-2 branes in six dimensional EGB gravity without any additional sources in the bulk (see, for example, [174, 277, 676, 675, 281, 280, 348, 349])....

    [...]

  • ...One of the ways to do this is to add Gauss-Bonnet corrections in the bulk [174, 283, 282]....

    [...]

  • ...Alternatively, one could consider a genuine delta function source and avoid issues with the singularity by including higher-order operators in the bulk gravity theory [449, 174, 283, 282]....

    [...]

  • ...The boundary conditions derived in [174] lead to the condition Wμν |brane = 0, suggesting that Einstein gravity should be recovered on the brane at all scales, even for an infinitely large bulk....

    [...]

  • ...and it turns out that the Einstein tensor on the brane is given by [174, 277]...

    [...]

Journal ArticleDOI
TL;DR: A critical review of the synthesis methods for graphene and its derivatives as well as their properties and the advantages of graphene-based composites in applications such as the Li-ion batteries, supercapacitors, fuel cells, photovoltaic devices, photocatalysis, and Raman enhancement are described.
Abstract: Graphene has attracted tremendous research interest in recent years, owing to its exceptional properties. The scaled-up and reliable production of graphene derivatives, such as graphene oxide (GO) and reduced graphene oxide (rGO), offers a wide range of possibilities to synthesize graphene-based functional materials for various applications. This critical review presents and discusses the current development of graphene-based composites. After introduction of the synthesis methods for graphene and its derivatives as well as their properties, we focus on the description of various methods to synthesize graphene-based composites, especially those with functional polymers and inorganic nanostructures. Particular emphasis is placed on strategies for the optimization of composite properties. Lastly, the advantages of graphene-based composites in applications such as the Li-ion batteries, supercapacitors, fuel cells, photovoltaic devices, photocatalysis, as well as Raman enhancement are described (279 references).

3,340 citations


Cites result from "I and J"

  • ...closer to the graphene surface.(72) However, the CM-based enhancement is orders weaker compared to the EM-based one....

    [...]