Institution

IBM

Company · Armonk, New York, United States
About: IBM is a company based in Armonk, New York, United States. It is known for research contributions in the topics Layer (electronics) and Cache. The organization has 134,567 authors who have published 253,905 publications receiving 7,458,795 citations. The organization is also known as International Business Machines Corporation and Big Blue.


Papers
Proceedings ArticleDOI
23 Jul 2002
TL;DR: A class of randomization operators is proposed that is much more effective than uniform randomization in limiting privacy breaches, and formulae are derived for an unbiased support estimator and its variance.
Abstract: We present a framework for mining association rules from transactions consisting of categorical items where the data has been randomized to preserve privacy of individual transactions. While it is feasible to recover association rules and preserve privacy using a straightforward "uniform" randomization, the discovered rules can unfortunately be exploited to find privacy breaches. We analyze the nature of privacy breaches and propose a class of randomization operators that are much more effective than uniform randomization in limiting the breaches. We derive formulae for an unbiased support estimator and its variance, which allow us to recover itemset supports from randomized datasets, and show how to incorporate these formulae into mining algorithms. Finally, we present experimental results that validate the algorithm by applying it to real datasets.
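To illustrate the basic mechanism, here is a minimal sketch in the simplest single-item, randomized-response setting rather than the paper's actual select-a-size operators: each item's presence bit is kept with probability p and flipped otherwise, and the observed support is inverted to an unbiased estimate of the true support. The parameter values and function names are illustrative assumptions, not taken from the paper.

```python
import random

def randomize(transactions, items, p=0.8):
    # Warner-style randomized response per item: keep the presence bit with
    # probability p, flip it with probability 1 - p (illustrative only; the
    # paper uses more refined "select-a-size" operators).
    out = []
    for t in transactions:
        r = set()
        for item in items:
            present = item in t
            if random.random() > p:
                present = not present
            if present:
                r.add(item)
        out.append(r)
    return out

def estimate_support(randomized, item, p=0.8):
    # Observed support s_obs = s*p + (1 - s)*(1 - p), so an unbiased estimate
    # of the true support s is (s_obs - (1 - p)) / (2p - 1), valid for p != 0.5.
    s_obs = sum(item in t for t in randomized) / len(randomized)
    return (s_obs - (1 - p)) / (2 * p - 1)
```

On a large dataset, estimate_support(randomize(data, items), "milk") recovers the true support of "milk" in expectation, even though individual randomized transactions reveal little about the originals.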

911 citations

Proceedings ArticleDOI
25 Jun 2007
TL;DR: A dynamic server migration and consolidation algorithm is introduced and is shown to provide substantial improvement over static server consolidation in reducing the amount of required capacity and the rate of service level agreement violations.
Abstract: A dynamic server migration and consolidation algorithm is introduced. The algorithm is shown to provide substantial improvement over static server consolidation in reducing the amount of required capacity and the rate of service level agreement violations. Benefits accrue for workloads that are variable and can be forecast over intervals shorter than the time scale of demand variability. The management algorithm reduces the amount of physical capacity required to support a specified rate of SLA violations for a given workload by as much as 50% compared to a static consolidation approach. Another result is that the rate of SLA violations at fixed capacity may be reduced by up to 20%. The results are based on hundreds of production workload traces across a variety of operating systems, applications, and industries.
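The abstract does not spell out the placement step, but the measure-forecast-remap idea it describes can be sketched as: forecast each workload's demand for the next interval, then re-pack the forecast demands onto as few servers as possible. The first-fit-decreasing heuristic, homogeneous server capacity, and all names below are assumptions made for illustration, not the authors' actual algorithm.

```python
def consolidate(forecast_demand, server_capacity=1.0):
    # First-fit-decreasing packing of forecast VM demands onto identical servers.
    # forecast_demand: {vm_name: forecast utilization for the next interval}
    # Returns a list of servers, each a list of VM names placed on it.
    servers = []  # each entry: [remaining_capacity, [vm_names]]
    for vm, demand in sorted(forecast_demand.items(), key=lambda kv: -kv[1]):
        for s in servers:
            if s[0] >= demand:
                s[0] -= demand
                s[1].append(vm)
                break
        else:
            servers.append([server_capacity - demand, [vm]])
    return [s[1] for s in servers]

# Re-running this every interval with fresh forecasts (e.g. a high percentile of
# recent demand) gives a dynamic placement; a single run over peak demands
# corresponds to static consolidation.
placement = consolidate({"vm1": 0.6, "vm2": 0.5, "vm3": 0.3, "vm4": 0.2})
```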

910 citations

Proceedings ArticleDOI
01 May 2001
TL;DR: An elegant and remarkably simple algorithm, the threshold algorithm (TA), is analyzed and shown to be optimal in a much stronger sense than Fagin's Algorithm (FA): essentially optimal not just for some monotone aggregation functions but for all of them, and not just in a high-probability sense but over every database.
Abstract: Assume that each object in a database has m grades, or scores, one for each of m attributes. For example, an object can have a color grade, that tells how red it is, and a shape grade, that tells how round it is. For each attribute, there is a sorted list, which lists each object and its grade under that attribute, sorted by grade (highest grade first). There is some monotone aggregation function, or combining rule, such as min or average, that combines the individual grades to obtain an overall grade. To determine objects that have the best overall grades, the naive algorithm must access every object in the database, to find its grade under each attribute. Fagin has given an algorithm (“Fagin's Algorithm”, or FA) that is much more efficient. For some distributions on grades, and for some monotone aggregation functions, FA is optimal in a high-probability sense. We analyze an elegant and remarkably simple algorithm (“the threshold algorithm”, or TA) that is optimal in a much stronger sense than FA. We show that TA is essentially optimal, not just for some monotone aggregation functions, but for all of them, and not just in a high-probability sense, but over every database. Unlike FA, which requires large buffers (whose size may grow unboundedly as the database size grows), TA requires only a small, constant-size buffer. We distinguish two types of access: sorted access (where the middleware system obtains the grade of an object in some sorted list by proceeding through the list sequentially from the top), and random access (where the middleware system requests the grade of an object in a list, and obtains it in one step). We consider the scenarios where random access is either impossible, or expensive relative to sorted access, and provide algorithms that are essentially optimal for these cases as well.
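The threshold algorithm is compact enough to sketch directly. The version below assumes each sorted list is materialized as (object, grade) pairs in descending grade order and that random access goes through per-attribute dictionaries; the data layout and names are assumptions for illustration, but the sorted-access, random-access, and threshold logic follows the description above.

```python
import heapq

def threshold_algorithm(sorted_lists, grades, aggregate=min, k=1):
    # sorted_lists: one list per attribute, each [(object, grade), ...] sorted
    #   by grade descending (sorted access).
    # grades: one dict per attribute, grades[attr][obj] -> grade (random access).
    # aggregate: monotone combining rule over one grade per attribute, e.g. min.
    top = []      # min-heap of (overall_grade, object), kept at size <= k
    seen = set()
    for depth in range(max(len(lst) for lst in sorted_lists)):
        last_grades = []
        for attr, lst in enumerate(sorted_lists):
            if depth >= len(lst):
                continue
            obj, grade = lst[depth]                          # sorted access
            last_grades.append(grade)
            if obj not in seen:
                seen.add(obj)
                overall = aggregate(g[obj] for g in grades)  # random accesses
                if len(top) < k:
                    heapq.heappush(top, (overall, obj))
                elif overall > top[0][0]:
                    heapq.heapreplace(top, (overall, obj))
        # No unseen object can score above the aggregate of the grades just
        # seen under sorted access, so stop once the current top-k clear it.
        threshold = aggregate(last_grades)
        if len(top) == k and top[0][0] >= threshold:
            break
    return sorted(top, reverse=True)
```

Only the current top-k objects and the latest grade from each list are buffered, which is the small, constant-size buffer property contrasted with FA above.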

908 citations

Journal ArticleDOI
James R. Lewis, Jeff Sauro
TL;DR: A comparison of the fit of three confirmatory factor analyses showed that a model in which the SUS's positive-tone (odd-numbered) and negative-tone (even-numbered) items were aligned with two factors had a better fit than a unidimensional model (all items on one factor) or the Usability/Learnability model.
Abstract: In 2009, we published a paper in which we showed how three independent sources of data indicated that, rather than being a unidimensional measure of perceived usability, the System Usability Scale apparently had two factors: Usability (all items except 4 and 10) and Learnability (Items 4 and 10). In that paper, we called for other researchers to report attempts to replicate that finding. The published research since 2009 has consistently failed to replicate that factor structure. In this paper, we report an analysis of over 9,000 completed SUS questionnaires that shows that the SUS is indeed bidimensional, but not in any interesting or useful way. A comparison of the fit of three confirmatory factor analyses showed that a model in which the SUS's positive-tone (odd-numbered) and negative-tone (even-numbered) items were aligned with two factors had a better fit than a unidimensional model (all items on one factor) or the Usability/Learnability model we published in 2009. Because a distinction based on item tone is of little practical or theoretical interest, we recommend that user experience practitioners and researchers treat the SUS as a unidimensional measure of perceived usability, and no longer routinely compute Usability and Learnability subscales.
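For readers unfamiliar with the instrument, the conventional SUS scoring and the tone-based split discussed above are simple to compute. The sketch below uses the standard 0-100 scoring rule; the unscaled tone subscale sums are shown purely for illustration and are an assumption, not a scoring recommendation from the paper.

```python
def sus_scores(responses):
    # responses: ratings (1-5) for SUS items 1..10 in order.
    # Odd-numbered items are positively worded: item score = rating - 1.
    # Even-numbered items are negatively worded: item score = 5 - rating.
    assert len(responses) == 10 and all(1 <= r <= 5 for r in responses)
    item_scores = [(r - 1) if i % 2 == 0 else (5 - r)   # i is 0-based
                   for i, r in enumerate(responses)]
    overall = sum(item_scores) * 2.5                    # conventional 0-100 SUS score
    positive_tone = sum(item_scores[0::2])              # odd-numbered items
    negative_tone = sum(item_scores[1::2])              # even-numbered items
    return overall, positive_tone, negative_tone
```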

906 citations

Journal ArticleDOI
Jorma Rissanen
TL;DR: By taking into account the Fisher information and removing an inherent redundancy in earlier two-part codes, a sharper code length, taken as the stochastic complexity, and the associated universal process are derived for a class of parametric processes.
Abstract: By taking into account the Fisher information and removing an inherent redundancy in earlier two-part codes, a sharper code length as the stochastic complexity and the associated universal process are derived for a class of parametric processes. The main condition required is that the maximum-likelihood estimates satisfy the central limit theorem. The same code length is also obtained from the so-called maximum-likelihood code.
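For context, the sharper code length in this line of work is usually stated, for a smooth k-parameter model class with Fisher information matrix I(θ) and maximum-likelihood estimate θ̂, in the asymptotic form below; this is the standard textbook statement, reproduced as background rather than quoted from the paper.

```latex
% Stochastic complexity of the data x^n relative to a k-parameter model class,
% assuming the maximum-likelihood estimates satisfy a central limit theorem:
\[
  \mathrm{SC}(x^n) \;=\;
  -\log P\bigl(x^n \mid \hat\theta(x^n)\bigr)
  \;+\; \frac{k}{2}\,\log\frac{n}{2\pi}
  \;+\; \log \int_{\Theta} \sqrt{\det I(\theta)}\, d\theta
  \;+\; o(1)
\]
```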

906 citations


Authors

Showing all 134658 results

Name | H-index | Papers | Citations
Zhong Lin Wang | 245 | 2,529 | 259,003
Anil K. Jain | 183 | 1,016 | 192,151
Hyun-Chul Kim | 176 | 4,076 | 183,227
Rodney S. Ruoff | 164 | 666 | 194,902
Tobin J. Marks | 159 | 1,621 | 111,604
Jean M. J. Fréchet | 154 | 726 | 90,295
Albert-László Barabási | 152 | 438 | 200,119
György Buzsáki | 150 | 446 | 96,433
Stanislas Dehaene | 149 | 456 | 86,539
Philip S. Yu | 148 | 1,914 | 107,374
James M. Tour | 143 | 859 | 91,364
Thomas P. Russell | 141 | 1,012 | 80,055
Naomi J. Halas | 140 | 435 | 82,040
Steven G. Louie | 137 | 777 | 88,794
Daphne Koller | 135 | 367 | 71,073
Network Information
Related Institutions (5)
Carnegie Mellon University: 104.3K papers, 5.9M citations, 93% related
Georgia Institute of Technology: 119K papers, 4.6M citations, 92% related
Bell Labs: 59.8K papers, 3.1M citations, 90% related
Microsoft: 86.9K papers, 4.1M citations, 89% related
Massachusetts Institute of Technology: 268K papers, 18.2M citations, 88% related
Performance Metrics
No. of papers from the institution in previous years:

Year | Papers
2023 | 30
2022 | 137
2021 | 3,163
2020 | 6,336
2019 | 6,427
2018 | 6,278