Institution

Carnegie Mellon University

Education•Pittsburgh, Pennsylvania, United States•

About: Carnegie Mellon University is a education organization based out in Pittsburgh, Pennsylvania, United States. It is known for research contribution in the topics: Computer science & Robot. The organization has 36317 authors who have published 104359 publications receiving 5975734 citations. The organization is also known as: CMU & Carnegie Mellon.

...read moreread less

Topics: Computer science, Robot, Context (language use), Population, Mobile robot ...read more

Papers published on a yearly basis

1 / 2

Papers

PDF

Open Access

More filters

Proceedings Article•DOI•

Statistical properties of community structure in large social and information networks

[...]

Jure Leskovec¹, Kevin J. Lang², Anirban Dasgupta², Michael W. Mahoney²•Institutions (2)

Carnegie Mellon University¹, Yahoo!²

21 Apr 2008

TL;DR: It is found that a generative model, in which new edges are added via an iterative "forest fire" burning process, is able to produce graphs exhibiting a network community structure similar to that observed in nearly every network dataset examined.

...read moreread less

Abstract: A large body of work has been devoted to identifying community structure in networks. A community is often though of as a set of nodes that has more connections between its members than to the remainder of the network. In this paper, we characterize as a function of size the statistical and structural properties of such sets of nodes. We define the network community profile plot, which characterizes the "best" possible community - according to the conductance measure - over a wide range of size scales, and we study over 70 large sparse real-world networks taken from a wide range of application domains. Our results suggest a significantly more refined picture of community structure in large real-world networks than has been appreciated previously.Our most striking finding is that in nearly every network dataset we examined, we observe tight but almost trivial communities at very small scales, and at larger size scales, the best possible communities gradually "blend in" with the rest of the network and thus become less "community-like." This behavior is not explained, even at a qualitative level, by any of the commonly-used network generation models. Moreover, this behavior is exactly the opposite of what one would expect based on experience with and intuition from expander graphs, from graphs that are well-embeddable in a low-dimensional structure, and from small social networks that have served as testbeds of community detection algorithms. We have found, however, that a generative model, in which new edges are added via an iterative "forest fire" burning process, is able to produce graphs exhibiting a network community structure similar to our observations.

...read moreread less

999 citations

Proceedings Article•DOI•

Ensemble of exemplar-SVMs for object detection and beyond

[...]

Tomasz Malisiewicz¹, Abhinav Gupta¹, Alexei A. Efros¹•Institutions (1)

Carnegie Mellon University¹

06 Nov 2011

TL;DR: This paper proposes a conceptually simple but surprisingly powerful method which combines the effectiveness of a discriminative object detector with the explicit correspondence offered by a nearest-neighbor approach.

...read moreread less

Abstract: This paper proposes a conceptually simple but surprisingly powerful method which combines the effectiveness of a discriminative object detector with the explicit correspondence offered by a nearest-neighbor approach. The method is based on training a separate linear SVM classifier for every exemplar in the training set. Each of these Exemplar-SVMs is thus defined by a single positive instance and millions of negatives. While each detector is quite specific to its exemplar, we empirically observe that an ensemble of such Exemplar-SVMs offers surprisingly good generalization. Our performance on the PASCAL VOC detection task is on par with the much more complex latent part-based model of Felzenszwalb et al., at only a modest computational cost increase. But the central benefit of our approach is that it creates an explicit association between each detection and a single training exemplar. Because most detections show good alignment to their associated exemplar, it is possible to transfer any available exemplar meta-data (segmentation, geometric structure, 3D model, etc.) directly onto the detections, which can then be used as part of overall scene understanding.

...read moreread less

999 citations

Journal Article•DOI•

Inducing features of random fields

[...]

S. Della Pietra¹, V. Della Pietra, John Lafferty²•Institutions (2)

Stony Brook University¹, Carnegie Mellon University²

01 Apr 1997-IEEE Transactions on Pattern Analysis and Machine Intelligence

TL;DR: In this article, the authors present a technique for constructing random fields from a set of training samples, where each feature has a weight that is trained by minimizing the Kullback-Leibler divergence between the model and the empirical distribution of the training data.

...read moreread less

Abstract: We present a technique for constructing random fields from a set of training samples. The learning paradigm builds increasingly complex fields by allowing potential functions, or features, that are supported by increasingly large subgraphs. Each feature has a weight that is trained by minimizing the Kullback-Leibler divergence between the model and the empirical distribution of the training data. A greedy algorithm determines how features are incrementally added to the field and an iterative scaling algorithm is used to estimate the optimal values of the weights. The random field models and techniques introduced in this paper differ from those common to much of the computer vision literature in that the underlying random fields are non-Markovian and have a large number of parameters that must be estimated. Relations to other learning approaches, including decision trees, are given. As a demonstration of the method, we describe its application to the problem of automatic word classification in natural language processing.

...read moreread less

998 citations

Journal Article•DOI•

Human pharmaceuticals in the aquatic environment: a challenge to Green Chemistry.

[...]

Sushil K. Khetan¹, Terrence J. Collins¹•Institutions (1)

Carnegie Mellon University¹

27 May 2007-Chemical Reviews

998 citations

Journal Article•DOI•

Graph based anomaly detection and description: a survey

[...]

Leman Akoglu¹, Hanghang Tong², Danai Koutra³•Institutions (3)

Stony Brook University¹, City University of New York², Carnegie Mellon University³

01 May 2015-Data Mining and Knowledge Discovery

TL;DR: This survey aims to provide a general, comprehensive, and structured overview of the state-of-the-art methods for anomaly detection in data represented as graphs, and gives a general framework for the algorithms categorized under various settings.

...read moreread less

Abstract: Detecting anomalies in data is a vital task, with numerous high-impact applications in areas such as security, finance, health care, and law enforcement. While numerous techniques have been developed in past years for spotting outliers and anomalies in unstructured collections of multi-dimensional points, with graph data becoming ubiquitous, techniques for structured graph data have been of focus recently. As objects in graphs have long-range correlations, a suite of novel technology has been developed for anomaly detection in graph data. This survey aims to provide a general, comprehensive, and structured overview of the state-of-the-art methods for anomaly detection in data represented as graphs. As a key contribution, we give a general framework for the algorithms categorized under various settings: unsupervised versus (semi-)supervised approaches, for static versus dynamic graphs, for attributed versus plain graphs. We highlight the effectiveness, scalability, generality, and robustness aspects of the methods. What is more, we stress the importance of anomaly attribution and highlight the major techniques that facilitate digging out the root cause, or the `why', of the detected anomalies for further analysis and sense-making. Finally, we present several real-world applications of graph-based anomaly detection in diverse domains, including financial, auction, computer traffic, and social networks. We conclude our survey with a discussion on open theoretical and practical challenges in the field.

...read moreread less

998 citations

Collapse

Authors

Showing all 36645 results

Name	H-index	Papers	Citations
Yi Chen	217	4342	293080
Rakesh K. Jain	200	1467	177727
Robert C. Nichol	187	851	162994
Michael I. Jordan	176	1016	216204
Jasvinder A. Singh	176	2382	223370
J. N. Butler	172	2525	175561
P. Chang	170	2154	151783
Krzysztof Matyjaszewski	169	1431	128585
Yang Yang	164	2704	144071
Geoffrey E. Hinton	157	414	409047
Herbert A. Simon	157	745	194597
Yongsun Kim	156	2588	145619
Terrence J. Sejnowski	155	845	117382
John B. Goodenough	151	1064	113741
Scott Shenker	150	454	118017

Network Information

Related Institutions (5)

Massachusetts Institute of Technology

268K papers, 18.2M citations

95% related

University of Maryland, College Park

155.9K papers, 7.2M citations

225.1K papers, 10.1M citations

93% related

IBM

253.9K papers, 7.4M citations

93% related

Princeton University

146.7K papers, 9.1M citations

92% related

Performance

Metrics

104,917

Papers

6,710,469

Citations

No. of papers from the Institution in previous years
Year	Papers
2023	120
2022	499
2021	4,981
2020	5,375
2019	5,420
2018	4,972