Institution

Carnegie Mellon University

Education•Pittsburgh, Pennsylvania, United States•

About: Carnegie Mellon University is a education organization based out in Pittsburgh, Pennsylvania, United States. It is known for research contribution in the topics: Computer science & Robot. The organization has 36317 authors who have published 104359 publications receiving 5975734 citations. The organization is also known as: CMU & Carnegie Mellon.

...read moreread less

Topics: Computer science, Robot, Context (language use), Population, Mobile robot ...read more

Papers published on a yearly basis

1 / 2

Papers

PDF

Open Access

More filters

Journal Article•DOI•

STEM: a tool for the analysis of short time series gene expression data

[...]

Jason Ernst¹, Ziv Bar-Joseph¹•Institutions (1)

Carnegie Mellon University¹

05 Apr 2006-BMC Bioinformatics

TL;DR: The unique algorithms STEM implements to cluster and compare short time series gene expression data combined with its visualization capabilities and integration with the Gene Ontology should make STEM useful in the analysis of data from a significant portion of all microarray studies.

...read moreread less

Abstract: Time series microarray experiments are widely used to study dynamical biological processes. Due to the cost of microarray experiments, and also in some cases the limited availability of biological material, about 80% of microarray time series experiments are short (3–8 time points). Previously short time series gene expression data has been mainly analyzed using more general gene expression analysis tools not designed for the unique challenges and opportunities inherent in short time series gene expression data. We introduce the Short Time-series Expression Miner (STEM) the first software program specifically designed for the analysis of short time series microarray gene expression data. STEM implements unique methods to cluster, compare, and visualize such data. STEM also supports efficient and statistically rigorous biological interpretations of short time series data through its integration with the Gene Ontology. The unique algorithms STEM implements to cluster and compare short time series gene expression data combined with its visualization capabilities and integration with the Gene Ontology should make STEM useful in the analysis of data from a significant portion of all microarray studies. STEM is available for download for free to academic and non-profit users at http://www.cs.cmu.edu/~jernst/stem .

...read moreread less

1,201 citations

Journal Issue•DOI•

Autonomous driving in urban environments: Boss and the Urban Challenge

[...]

Chris Urmson¹, Joshua Anhalt¹, Drew Bagnell¹, Christopher R. Baker¹, Robert Bittner¹, Michael Clark¹, John M. Dolan¹, D Duggins¹, Tugrul Galatali¹, Christopher Geyer¹, Michele Gittleman¹, Sam Harbaugh¹, Martial Hebert¹, Thomas M. Howard¹, Sascha Kolski¹, Alonzo Kelly¹, Maxim Likhachev¹, Matthew McNaughton¹, Nick Miller¹, Kevin Peterson¹, Brian Pilnick¹, Ragunathan Rajkumar¹, Paul E. Rybski¹, Bryan Salesky¹, Young-Woo Seo¹, Sanjiv Singh¹, Jarrod M. Snider¹, Anthony Stentz¹, William Whittaker¹, Ziv Wolkowicki¹, Jason Ziglar¹, Hong Bae², Thomas G. Brown², Daniel Demitrish², Bakhtiar Brian Litkouhi², Jim Nickolaou², Varsha Sadekar², Wende Zhang², Joshua Struble³, Michael Taylor³, Michael Darms⁴, Dave Ferguson⁵ - Show less +38 more•Institutions (5)

Carnegie Mellon University¹, General Motors², Caterpillar Inc.³, Continental AG⁴, Intel⁵

01 Aug 2008-Journal of Field Robotics

TL;DR: Boss is an autonomous vehicle that uses on-board sensors to track other vehicles, detect static obstacles, and localize itself relative to a road model using a spiral system development process with a heavy emphasis on regular, regressive system testing.

...read moreread less

Abstract: Boss is an autonomous vehicle that uses on-board sensors (global positioning system, lasers, radars, and cameras) to track other vehicles, detect static obstacles, and localize itself relative to a road model. A three-layer planning system combines mission, behavioral, and motion planning to drive in urban environments. The mission planning layer considers which street to take to achieve a mission goal. The behavioral layer determines when to change lanes and precedence at intersections and performs error recovery maneuvers. The motion planning layer selects actions to avoid obstacles while making progress toward local goals. The system was developed from the ground up to address the requirements of the DARPA Urban Challenge using a spiral system development process with a heavy emphasis on regular, regressive system testing. During the National Qualification Event and the 85-km Urban Challenge Final Event, Boss demonstrated some of its capabilities, qualifying first and winning the challenge. © 2008 Wiley Periodicals, Inc.

...read moreread less

1,201 citations

Journal Article•DOI•

Sloan Digital Sky Survey IV: Mapping the Milky Way, Nearby Galaxies and the Distant Universe

[...]

Michael R. Blanton¹, Matthew A. Bershady², Bela Abolfathi³, Franco D. Albareti⁴ +412 more•Institutions (91)

29 Jun 2017-The Astronomical Journal

TL;DR: SDSS-IV as mentioned in this paper is a project encompassing three major spectroscopic programs: the Mapping Nearby Galaxies at Apache Point Observatory (MaNGA), the Extended Baryon Oscillation Spectroscopic Survey (eBOSS), and the Time Domain Spectroscopy Survey (TDSS).

...read moreread less

Abstract: We describe the Sloan Digital Sky Survey IV (SDSS-IV), a project encompassing three major spectroscopic programs. The Apache Point Observatory Galactic Evolution Experiment 2 (APOGEE-2) is observing hundreds of thousands of Milky Way stars at high resolution and high signal-to-noise ratios in the near-infrared. The Mapping Nearby Galaxies at Apache Point Observatory (MaNGA) survey is obtaining spatially resolved spectroscopy for thousands of nearby galaxies (median $z\sim 0.03$). The extended Baryon Oscillation Spectroscopic Survey (eBOSS) is mapping the galaxy, quasar, and neutral gas distributions between $z\sim 0.6$ and 3.5 to constrain cosmology using baryon acoustic oscillations, redshift space distortions, and the shape of the power spectrum. Within eBOSS, we are conducting two major subprograms: the SPectroscopic IDentification of eROSITA Sources (SPIDERS), investigating X-ray AGNs and galaxies in X-ray clusters, and the Time Domain Spectroscopic Survey (TDSS), obtaining spectra of variable sources. All programs use the 2.5 m Sloan Foundation Telescope at the Apache Point Observatory; observations there began in Summer 2014. APOGEE-2 also operates a second near-infrared spectrograph at the 2.5 m du Pont Telescope at Las Campanas Observatory, with observations beginning in early 2017. Observations at both facilities are scheduled to continue through 2020. In keeping with previous SDSS policy, SDSS-IV provides regularly scheduled public data releases; the first one, Data Release 13, was made available in 2016 July.

...read moreread less

1,200 citations

Journal Article•DOI•

Multiple Recurrent De Novo CNVs, Including Duplications of the 7q11.23 Williams Syndrome Region, Are Strongly Associated with Autism

[...]

Stephen Sanders¹, A. Gulhan Ercan-Sencicek, Vanessa Hus², Rui Luo³, Michael T. Murtha, Daniel Moreno-De-Luca⁴, Su H. Chu⁵, Michael P. Moreau⁶, Abha R. Gupta¹, Susanne Thomson⁷, Christopher E. Mason⁸, Kaya Bilguvar¹, Patrícia B. S. Celestino-Soper⁹, Murim Choi¹, Emily L. Crawford⁷, Lea K. Davis¹⁰, Nicole R. Davis Wright¹, Rahul M. Dhodapkar¹, Michael DiCola⁶, Nicholas M. DiLullo¹, Thomas V. Fernandez¹, Vikram Fielding-Singh¹¹, Daniel O. Fishman⁷, Stephanie Frahm⁶, Rouben Garagaloyan, Gerald Goh¹, Sindhuja Kammela¹, Lambertus Klei¹², Jennifer K. Lowe³, Sabata C. Lund², Anna D. McGrew⁷, Kyle A. Meyer¹, William J. Moffat¹, John D. Murdoch¹, Brian J. O'Roak¹³, Gordon T. Ober¹, Rebecca S. Pottenger¹⁴, Melanie J. Raubeson¹, Youeun Song¹, Qi Wang⁶, Brian L. Yaspan⁷, Timothy W. Yu¹⁵, Ilana R. Yurkiewicz¹, Arthur L. Beaudet⁹, Rita M. Cantor³, Martin Curland, Dorothy E. Grice¹⁶, Murat Gunel¹, Richard P. Lifton¹, Shrikant Mane¹, Donna M. Martin², Chad A. Shaw⁹, Michael Sheldon⁶, Jay A. Tischfield⁶, Christopher A. Walsh¹⁷, Eric M. Morrow¹⁸, David H. Ledbetter¹⁹, Eric Fombonne²⁰, Catherine Lord², Christa Lese Martin⁴, Andrew Brooks⁶, James S. Sutcliffe⁷, Edwin H. Cook¹⁰, Daniel H. Geschwind³, Kathryn Roeder⁵, Bernie Devlin¹², Matthew W. State - Show less +63 more•Institutions (20)

09 Jun 2011-Neuron

TL;DR: A genome-wide analysis of rare copy-number variation in 1124 autism spectrum disorder families, each comprised of a single proband, unaffected parents, and, in most kindreds, an unaffected sibling, finds significant association of ASD with de novo duplications of 7q11.23, where the reciprocal deletion causes Williams-Beuren syndrome.

...read moreread less

1,198 citations

Proceedings Article•DOI•

Efficient clustering of high-dimensional data sets with application to reference matching

[...]

Andrew McCallum¹, Kamal Nigam¹, Lyle H. Ungar²•Institutions (2)

Carnegie Mellon University¹, University of Pennsylvania²

01 Aug 2000

TL;DR: This work presents a new technique for clustering large datasets, using a cheap, approximate distance measure to eciently divide the data into overlapping subsets the authors call canopies, and presents ex- perimental results on grouping bibliographic citations from the reference sections of research papers.

...read moreread less

Abstract: important problems involve clustering large datasets. Although naive implementations of clustering are computa- tionally expensive, there are established ecient techniques for clustering when the dataset has either (1) a limited num- ber of clusters, (2) a low feature dimensionality, or (3) a small number of data points. However, there has been much less work on methods of eciently clustering datasets that are large in all three ways at once|for example, having millions of data points that exist in many thousands of di- mensions representing many thousands of clusters. We present a new technique for clustering these large, high- dimensional datasets. The key idea involves using a cheap, approximate distance measure to eciently divide the data into overlapping subsets we call canopies .T hen cluster- ing is performed by measuring exact distances only between points that occur in a common canopy. Using canopies, large clustering problems that were formerly impossible become practical. Under reasonable assumptions about the cheap distance metric, this reduction in computational cost comes without any loss in clustering accuracy. Canopies can be applied to many domains and used with a variety of cluster- ing approaches, including Greedy Agglomerative Clustering, K-means and Expectation-Maximization. We present ex- perimental results on grouping bibliographic citations from the reference sections of research papers. Here the canopy approach reduces computation time over a traditional clus- tering approach by more than an order of magnitude and decreases error in comparison to a previously used algorithm by 25%.

...read moreread less

1,197 citations

Collapse

Authors

Showing all 36645 results

Name	H-index	Papers	Citations
Yi Chen	217	4342	293080
Rakesh K. Jain	200	1467	177727
Robert C. Nichol	187	851	162994
Michael I. Jordan	176	1016	216204
Jasvinder A. Singh	176	2382	223370
J. N. Butler	172	2525	175561
P. Chang	170	2154	151783
Krzysztof Matyjaszewski	169	1431	128585
Yang Yang	164	2704	144071
Geoffrey E. Hinton	157	414	409047
Herbert A. Simon	157	745	194597
Yongsun Kim	156	2588	145619
Terrence J. Sejnowski	155	845	117382
John B. Goodenough	151	1064	113741
Scott Shenker	150	454	118017

Network Information

Related Institutions (5)

Massachusetts Institute of Technology

268K papers, 18.2M citations

95% related

University of Maryland, College Park

155.9K papers, 7.2M citations

225.1K papers, 10.1M citations

93% related

IBM

253.9K papers, 7.4M citations

93% related

Princeton University

146.7K papers, 9.1M citations

92% related

Performance

Metrics

104,917

Papers

6,710,469

Citations

No. of papers from the Institution in previous years
Year	Papers
2023	120
2022	499
2021	4,981
2020	5,375
2019	5,420
2018	4,972