scispace - formally typeset
Search or ask a question
Author

Neal E. Young

Bio: Neal E. Young is an academic researcher from University of California, Riverside. The author has contributed to research in topics: Approximation algorithm & Time complexity. The author has an hindex of 40, co-authored 161 publications receiving 6211 citations. Previous affiliations of Neal E. Young include University of Cologne & University of Maryland, College Park.


Papers
More filters
Journal ArticleDOI
TL;DR: In this paper, a method of allocating fibers to desired targets given a set of tile centers that includes the effects of collisions is presented, which is nearly optimally efficient and uniform.
Abstract: Large surveys using multiobject spectrographs require automated methods for deciding how to efficiently point observations and how to assign targets to each pointing. The Sloan Digital Sky Survey (SDSS) will observe around 10 6 spectra from targets distributed over an area of about 10,000 deg 2 , using a multiobject fiber spectrograph that can simultaneously observe 640 objects in a circular field of view (referred to as a ‘‘ tile ’’) 1=49 in radius. No two fibers can be placed closer than 55 00 during the same observation; multiple targets closer than this distance are said to ‘‘ collide.’’ We present here a method of allocating fibers to desired targets given a set of tile centers that includes the effects of collisions and that is nearly optimally efficient and uniform. Because of large-scale structure in the galaxy distribution (which form the bulk of the SDSS targets), a naive covering of the sky with equally spaced tiles does not yield uniform sampling. Thus, we present a heuristic for perturbing the centers of the tiles from the equally spaced distribution that provides more uniform completeness. For the SDSS sample, we can attain a sampling rate of greater than 92% for all targets, and greater than 99% for the set of targets that do not collide with each other, with an efficiency greater than 90% (defined as the fraction of available fibers assigned to targets). The methods used here may prove useful to those planning other large surveys.

618 citations

Journal ArticleDOI
TL;DR: The marking algorithm is developed, a randomized on-line algorithm for the paging problem, which it is proved that its expected cost on any sequence of requests is within a factor of 2Hk of optimum.

489 citations

Proceedings ArticleDOI
21 Aug 2011
TL;DR: This work introduces a novel algorithm that finds shapelets in less time than current methods by an order of magnitude, and shows for the first time an augmented shapelet representation that distinguishes the data based on conjunctions or disjunctions of shapelets.
Abstract: Time series shapelets are small, local patterns in a time series that are highly predictive of a class and are thus very useful features for building classifiers and for certain visualization and summarization tasks. While shapelets were introduced only recently, they have already seen significant adoption and extension in the community. Despite their immense potential as a data mining primitive, there are two important limitations of shapelets. First, their expressiveness is limited to simple binary presence/absence questions. Second, even though shapelets are computed offline, the time taken to compute them is significant. In this work, we address the latter problem by introducing a novel algorithm that finds shapelets in less time than current methods by an order of magnitude. Our algorithm is based on intelligent caching and reuse of computations, and the admissible pruning of the search space. Because our algorithm is so fast, it creates an opportunity to consider more expressive shapelet queries. In particular, we show for the first time an augmented shapelet representation that distinguishes the data based on conjunctions or disjunctions of shapelets. We call our novel representation Logical-Shapelets. We demonstrate the efficiency of our approach on the classic benchmark datasets used for these problems, and show several case studies where logical shapelets significantly outperform the original shapelet representation and other time series classification techniques. We demonstrate the utility of our ideas in domains as diverse as gesture recognition, robotics, and biometrics.

298 citations

Journal ArticleDOI
TL;DR: Weighted caching is a generalization of paging in which the cost to evict an item depends on the item as discussed by the authors, and it is studied as a restriction of the well-known k-server problem.
Abstract: Weighted caching is a generalization ofpaging in which the cost to evict an item depends on the item. We study both of these problems as restrictions of the well-knownk-server problem, which involves moving servers in a graph in response to requests so as to minimize the distance traveled.

250 citations

Journal ArticleDOI
TL;DR: A simple algorithm to find a spanning tree that simultaneously approximates a shortest-path tree and a minimum spanning tree is given and obtains the best-possible tradeoff.
Abstract: We give a simple algorithm to find a spanning tree that simultaneously approximates a shortest-path tree and a minimum spanning tree. The algorithm provides a continuous tradeoff: given the two trees and aź>0, the algorithm returns a spanning tree in which the distance between any vertex and the root of the shortest-path tree is at most 1+ź2ź times the shortest-path distance, and yet the total weight of the tree is at most 1+ź2/ź times the weight of a minimum spanning tree. Our algorithm runs in linear time and obtains the best-possible tradeoff. It can be implemented on a CREW PRAM to run a logarithmic time using one processor per vertex.

210 citations


Cited by
More filters
Journal ArticleDOI
TL;DR: A series of improvements to the spectroscopic reductions are described, including better flat fielding and improved wavelength calibration at the blue end, better processing of objects with extremely strong narrow emission lines, and an improved determination of stellar metallicities.
Abstract: This paper describes the Seventh Data Release of the Sloan Digital Sky Survey (SDSS), marking the completion of the original goals of the SDSS and the end of the phase known as SDSS-II. It includes 11,663 deg^2 of imaging data, with most of the ~2000 deg^2 increment over the previous data release lying in regions of low Galactic latitude. The catalog contains five-band photometry for 357 million distinct objects. The survey also includes repeat photometry on a 120° long, 2°.5 wide stripe along the celestial equator in the Southern Galactic Cap, with some regions covered by as many as 90 individual imaging runs. We include a co-addition of the best of these data, going roughly 2 mag fainter than the main survey over 250 deg^2. The survey has completed spectroscopy over 9380 deg^2; the spectroscopy is now complete over a large contiguous area of the Northern Galactic Cap, closing the gap that was present in previous data releases. There are over 1.6 million spectra in total, including 930,000 galaxies, 120,000 quasars, and 460,000 stars. The data release includes improved stellar photometry at low Galactic latitude. The astrometry has all been recalibrated with the second version of the USNO CCD Astrograph Catalog, reducing the rms statistical errors at the bright end to 45 milliarcseconds per coordinate. We further quantify a systematic error in bright galaxy photometry due to poor sky determination; this problem is less severe than previously reported for the majority of galaxies. Finally, we describe a series of improvements to the spectroscopic reductions, including better flat fielding and improved wavelength calibration at the blue end, better processing of objects with extremely strong narrow emission lines, and an improved determination of stellar metallicities.

5,665 citations

Journal ArticleDOI
TL;DR: In this paper, a large-scale correlation function measured from a spectroscopic sample of 46,748 luminous red galaxies from the Sloan Digital Sky Survey is presented, which demonstrates the linear growth of structure by gravitational instability between z ≈ 1000 and the present and confirms a firm prediction of the standard cosmological theory.
Abstract: We present the large-scale correlation function measured from a spectroscopic sample of 46,748 luminous red galaxies from the Sloan Digital Sky Survey. The survey region covers 0.72h −3 Gpc 3 over 3816 square degrees and 0.16 < z < 0.47, making it the best sample yet for the study of large-scale structure. We find a well-detected peak in the correlation function at 100h −1 Mpc separation that is an excellent match to the predicted shape and location of the imprint of the recombination-epoch acoustic oscillations on the low-redshift clustering of matter. This detection demonstrates the linear growth of structure by gravitational instability between z ≈ 1000 and the present and confirms a firm prediction of the standard cosmological theory. The acoustic peak provides a standard ruler by which we can measure the ratio of the distances to z = 0.35 and z = 1089 to 4% fractional accuracy and the absolute distance to z = 0.35 to 5% accuracy. From the overall shape of the correlation function, we measure the matter density mh 2 to 8% and find agreement with the value from cosmic microwave background (CMB) anisotropies. Independent of the constraints provided by the CMB acoustic scale, we find m = 0.273 ±0.025+0.123(1+ w0)+0.137K. Including the CMB acoustic scale, we find that the spatial curvature is K = −0.010 ± 0.009 if the dark energy is a cosmological constant. More generally, our results provide a measurement of cosmological distance, and hence an argument for dark energy, based on a geometric method with the same simple physics as the microwave background anisotropies. The standard cosmological model convincingly passes these new and robust tests of its fundamental properties. Subject headings: cosmology: observations — large-scale structure of the universe — distance scale — cosmological parameters — cosmic microwave background — galaxies: elliptical and lenticular, cD

4,428 citations

Journal ArticleDOI
TL;DR: In this paper, the authors examined the properties of the host galaxies of 22 623 narrow-line active galactic nuclei (AGN) with 0.02 < z < 0.3 selected from a complete sample of 122 808 galaxies from the Sloan Digital Sky Survey.
Abstract: We examine the properties of the host galaxies of 22 623 narrow-line active galactic nuclei (AGN) with 0.02 < z < 0.3 selected from a complete sample of 122 808 galaxies from the Sloan Digital Sky Survey. We focus on the luminosity of the [O III] λ5007 emission line as a tracer of the strength of activity in the nucleus. We study how AGN host properties compare with those of normal galaxies and how they depend on L[O III]. We find that AGN of all luminosities reside almost exclusively in massive galaxies and have distributions of sizes, stellar surface mass densities and concentrations that are similar to those of ordinary early-type galaxies in our sample. The host galaxies of low-luminosity AGN have stellar populations similar to normal early types. The hosts of high-luminosity AGN have much younger mean stellar ages. The young stars are not preferentially located near the nucleus of the galaxy, but are spread out over scales of at least several kiloparsecs. A significant fraction of high-luminosity AGN have strong Hδ absorption-line equivalent widths, indicating that they experienced a burst of star formation in the recent past. We have also examined the stellar populations of the host galaxies of a sample of broad-line AGN. We conclude that there is no significant difference in stellar content between type 2 Seyfert hosts and quasars (QSOs) with the same [O III] luminosity and redshift. This establishes that a young stellar population is a general property of AGN with high [O III] luminosities.

3,781 citations

Journal ArticleDOI
TL;DR: In this paper, the relation between stellar mass and gas-phase metallicity was studied using the Sloan Digital Sky Survey imaging and spectroscopy of ~53,000 star-forming galaxies at z = 0.1.
Abstract: We utilize Sloan Digital Sky Survey imaging and spectroscopy of ~53,000 star-forming galaxies at z ~ 0.1 to study the relation between stellar mass and gas-phase metallicity. We derive gas-phase oxygen abundances and stellar masses using new techniques that make use of the latest stellar evolutionary synthesis and photoionization models. We find a tight (?0.1 dex) correlation between stellar mass and metallicity spanning over 3 orders of magnitude in stellar mass and a factor of 10 in metallicity. The relation is relatively steep from 108.5 to 1010.5 M? h, in good accord with known trends between luminosity and metallicity, but flattens above 1010.5 M?. We use indirect estimates of the gas mass based on the H? luminosity to compare our data to predictions from simple closed box chemical evolution models. We show that metal loss is strongly anticorrelated with baryonic mass, with low-mass dwarf galaxies being 5 times more metal depleted than L* galaxies at z ~ 0.1. Evidence for metal depletion is not confined to dwarf galaxies but is found in galaxies with masses as high as 1010 M?. We interpret this as strong evidence of both the ubiquity of galactic winds and their effectiveness in removing metals from galaxy potential wells.

3,621 citations

Journal ArticleDOI
TL;DR: In this article, the authors present a comprehensive study of the physical properties of ∼ 10 5 galaxies with measurable star formation in the Sloan Digital Sky Survey (SDSS) by comparing physical information extracted from the emission lines with continuum properties, and build up a picture of the nature of star-forming galaxies at z < 0.2.
Abstract: We present a comprehensive study of the physical properties of ∼ 10 5 galaxies with measurable star formation in the Sloan Digital Sky Survey (SDSS). By comparing physical information extracted from the emission lines with continuum properties, we build up a picture of the nature of star-forming galaxies at z < 0.2. We develop a method for aperture correction using resolved imaging and show that our method takes out essentially all aperture bias in the star formation rate (SFR) estimates, allowing an accurate estimate of the total SFRs in galaxies. We determine the SFR density to be 1.915 +0.02 −0.01 (random) +0.14

3,262 citations