Institution

Massachusetts Institute of Technology

Education•Cambridge, Massachusetts, United States•

About: Massachusetts Institute of Technology is a education organization based out in Cambridge, Massachusetts, United States. It is known for research contribution in the topics: Population & Laser. The organization has 116795 authors who have published 268000 publications receiving 18272025 citations. The organization is also known as: MIT & M.I.T..

...read moreread less

Topics: Population, Laser, Context (language use), Computer science, Gene ...read more

Papers published on a yearly basis

1 / 3

Papers

PDF

Open Access

More filters

Posted Content•

Language Models are Few-Shot Learners

[...]

Tom B. Brown¹, Benjamin Mann, Nick Ryder², Melanie Subbiah, Jared Kaplan³, Prafulla Dhariwal¹, Arvind Neelakantan⁴, Pranav Shyam, Girish Sastry¹, Amanda Askell¹, Sandhini Agarwal¹, Ariel Herbert-Voss¹, Gretchen Krueger¹, Thomas Henighan¹, Rewon Child¹, Aditya Ramesh¹, Daniel M. Ziegler⁵, Jeffrey Wu¹, Clemens Winter, Christopher Hesse¹, Mark Chen¹, Eric Sigler, Mateusz Litwin, Scott Gray¹, Benjamin Chess¹, Jack Clark¹, Christopher Berner, Samuel McCandlish¹, Alec Radford¹, Ilya Sutskever¹, Dario Amodei¹ - Show less +27 more•Institutions (5)

OpenAI¹, University of California, Berkeley², Johns Hopkins University³, Google⁴, Massachusetts Institute of Technology⁵

28 May 2020-arXiv: Computation and Language

TL;DR: This article showed that scaling up language models greatly improves task-agnostic, few-shot performance, sometimes even reaching competitiveness with prior state-of-the-art fine-tuning approaches.

...read moreread less

Abstract: Recent work has demonstrated substantial gains on many NLP tasks and benchmarks by pre-training on a large corpus of text followed by fine-tuning on a specific task. While typically task-agnostic in architecture, this method still requires task-specific fine-tuning datasets of thousands or tens of thousands of examples. By contrast, humans can generally perform a new language task from only a few examples or from simple instructions - something which current NLP systems still largely struggle to do. Here we show that scaling up language models greatly improves task-agnostic, few-shot performance, sometimes even reaching competitiveness with prior state-of-the-art fine-tuning approaches. Specifically, we train GPT-3, an autoregressive language model with 175 billion parameters, 10x more than any previous non-sparse language model, and test its performance in the few-shot setting. For all tasks, GPT-3 is applied without any gradient updates or fine-tuning, with tasks and few-shot demonstrations specified purely via text interaction with the model. GPT-3 achieves strong performance on many NLP datasets, including translation, question-answering, and cloze tasks, as well as several tasks that require on-the-fly reasoning or domain adaptation, such as unscrambling words, using a novel word in a sentence, or performing 3-digit arithmetic. At the same time, we also identify some datasets where GPT-3's few-shot learning still struggles, as well as some datasets where GPT-3 faces methodological issues related to training on large web corpora. Finally, we find that GPT-3 can generate samples of news articles which human evaluators have difficulty distinguishing from articles written by humans. We discuss broader societal impacts of this finding and of GPT-3 in general.

...read moreread less

1,886 citations

Journal Article•DOI•

Diameter-Selective Raman Scattering from Vibrational Modes in Carbon Nanotubes

[...]

Apparao M. Rao¹, Ernst Richter¹, Shunji Bandow, Bruce Chase², Peter C. Eklund¹, Keith A. Williams¹, S. Fang¹, K. R. Subbaswamy¹, Madhu Menon¹, Andreas Thess³, Richard E. Smalley³, Gene Dresselhaus⁴, Mildred S. Dresselhaus⁴ - Show less +9 more•Institutions (4)

University of Kentucky¹, Wilmington University², Rice University³, Massachusetts Institute of Technology⁴

10 Jan 1997-Science

TL;DR: In this paper, the Raman spectra of single wall carbon nanotubes (SWNTs) were studied using laser excitation wavelengths in the range from 514.5 to 1320 nanometers.

...read moreread less

Abstract: Single wall carbon nanotubes (SWNTs) that are found as close-packed arrays in crystalline ropes have been studied by using Raman scattering techniques with laser excitation wavelengths in the range from 514.5 to 1320 nanometers. Numerous Raman peaks were observed and identified with vibrational modes of armchair symmetry (n, n) SWNTs. The Raman spectra are in good agreement with lattice dynamics calculations based on C-C force constants used to fit the two-dimensional, experimental phonon dispersion of a single graphene sheet. Calculated intensities from a nonresonant, bond polarizability model optimized for sp2 carbon are also in qualitative agreement with the Raman data, although a resonant Raman scattering process is also taking place. This resonance results from the one-dimensional quantum confinement of the electrons in the nanotube.

...read moreread less

1,882 citations

Journal Article•DOI•

Ancient Admixture in Human History

[...]

Nick Patterson¹, Priya Moorjani², Yontao Luo³, Swapan Mallick², Nadin Rohland², Yiping Zhan³, Teri Genschoreck³, Teresa Webster³, David Reich¹, David Reich² - Show less +6 more•Institutions (3)

Massachusetts Institute of Technology¹, Harvard University², Affymetrix³

01 Nov 2012-Genetics

TL;DR: A suite of methods for learning about population mixtures are presented, implemented in a software package called ADMIXTOOLS, that support formal tests for whether mixture occurred and make it possible to infer proportions and dates of mixture.

...read moreread less

Abstract: Population mixture is an important process in biology. We present a suite of methods for learning about population mixtures, implemented in a software package called ADMIXTOOLS, that support formal tests for whether mixture occurred and make it possible to infer proportions and dates of mixture. We also describe the development of a new single nucleotide polymorphism (SNP) array consisting of 629,433 sites with clearly documented ascertainment that was specifically designed for population genetic analyses and that we genotyped in 934 individuals from 53 diverse populations. To illustrate the methods, we give a number of examples that provide new insights about the history of human admixture. The most striking finding is a clear signal of admixture into northern Europe, with one ancestral population related to present-day Basques and Sardinians and the other related to present-day populations of northeast Asia and the Americas. This likely reflects a history of admixture between Neolithic migrants and the indigenous Mesolithic population of Europe, consistent with recent analyses of ancient bones from Sweden and the sequencing of the genome of the Tyrolean "Iceman."

...read moreread less

1,877 citations

Journal Article•DOI•

Distributed Event-Triggered Control for Multi-Agent Systems

[...]

Dimos V. Dimarogonas¹, Emilio Frazzoli², Karl Henrik Johansson¹•Institutions (2)

Royal Institute of Technology¹, Massachusetts Institute of Technology²

01 May 2012-IEEE Transactions on Automatic Control

TL;DR: The controller updates considered here are event-driven, depending on the ratio of a certain measurement error with respect to the norm of a function of the state, and are applied to a first order agreement problem.

...read moreread less

Abstract: Event-driven strategies for multi-agent systems are motivated by the future use of embedded microprocessors with limited resources that will gather information and actuate the individual agent controller updates. The controller updates considered here are event-driven, depending on the ratio of a certain measurement error with respect to the norm of a function of the state, and are applied to a first order agreement problem. A centralized formulation is considered first and then its distributed counterpart, in which agents require knowledge only of their neighbors' states for the controller implementation. The results are then extended to a self-triggered setup, where each agent computes its next update time at the previous one, without having to keep track of the state error that triggers the actuation between two consecutive update instants. The results are illustrated through simulation examples.

...read moreread less

1,876 citations

Journal Article•DOI•

Zipf's Law for Cities: An Explanation

[...]

Xavier Gabaix¹•Institutions (1)

Massachusetts Institute of Technology¹

01 Aug 1999-Quarterly Journal of Economics

TL;DR: In this paper, it was shown that, at least in the upper tail, all cities follow some proportional growth process (this appears to be verified empirically), which automatically leads their distribution to converge to Zipf's law.

...read moreread less

Abstract: Zipf ’s law is a very tight constraint on the class of admissible models of local growth. It says that for most countries the size distribution of cities strikingly fits a power law: the number of cities with populations greater than S is proportional to 1/S. Suppose that, at least in the upper tail, all cities follow some proportional growth process (this appears to be verified empirically). This automatically leads their distribution to converge to Zipf ’s law.

...read moreread less

1,875 citations

Collapse

Authors

Showing all 117442 results

Name	H-index	Papers	Citations
Eric S. Lander	301	826	525976
Robert Langer	281	2324	326306
George M. Whitesides	240	1739	269833
Trevor W. Robbins	231	1137	164437
George Davey Smith	224	2540	248373
Yi Cui	220	1015	199725
Robert J. Lefkowitz	214	860	147995
David J. Hunter	213	1836	207050
Daniel Levy	212	933	194778
Rudolf Jaenisch	206	606	178436
Mark J. Daly	204	763	304452
David Miller	203	2573	204840
David Baltimore	203	876	162955
Rakesh K. Jain	200	1467	177727
Ronald M. Evans	199	708	166722

Network Information

Related Institutions (5)

University of California, Berkeley

265.6K papers, 16.8M citations

96% related

Stanford University

320.3K papers, 21.8M citations

225.1K papers, 10.1M citations

95% related

University of California, San Diego

204.5K papers, 12.3M citations

95% related

Columbia University

224K papers, 12.8M citations

94% related

Performance

Metrics

269,256

Papers

20,472,857

Citations

No. of papers from the Institution in previous years
Year	Papers
2023	240
2022	1,124
2021	10,595
2020	11,922
2019	11,207
2018	10,883