Author

Srinivas Devadas

Other affiliations: University of California, Berkeley, Cornell University, Bar-Ilan University ...

Bio: Srinivas Devadas is an academic researcher from Massachusetts Institute of Technology. The author has contributed to research in topics: Sequential logic & Combinational logic. The author has an h-index of 88 and has co-authored 480 publications receiving 31,897 citations. Previous affiliations of Srinivas Devadas include University of California, Berkeley & Cornell University.

[Chart: papers published on a yearly basis, 1986 to 2023]

Papers

Dissertation•

Dpool: a distributed data structure for factored operating systems

Anant Agarwal¹, Srinivas Devadas¹, David Wentzlaff¹•Institutions (1)

Massachusetts Institute of Technology¹

01 Jan 2012

TL;DR: This thesis uses the dPool data structure within the parallel fos Physical Memory Allocation fleet and demonstrates that it is possible to use a dPool to manage shared state in a factored operating system's physical page allocator.

Abstract: Future computer architectures will likely exhibit increased parallelism through the addition of more processor cores. Architectural trends such as exponentially increasing parallelism and the possible lack of scalable shared memory motivate the reevaluation of operating system design. This thesis work takes place in the context of Factored Operating Systems which leverage distributed system ideas to increase the scalability of multicore processor operating systems. fos, a Factored Operating System, explores a new design point for operating systems where traditional low-level operating system services are fine-grain parallelized while internally only using explicit message passing for communication. fos factors an operating system first by system service and then further parallelizes inside of the system service by splitting the service into a fleet of server processes which communicate via messaging. Constructing parallel low-level operating system services which only internally use messaging is challenging because shared resources must be partitioned across servers and the services must provide scalable performance when met with uneven demand. To ease the construction of parallel fos system services, this thesis develops the dPool distributed data structure. The dPool data structure provides concurrent access to an unordered collection of elements by server processes within a fos fleet. Internal to a single dPool instance, all communication between different portions of a dPool is done via messaging. This thesis uses the dPool data structure within the parallel fos Physical Memory Allocation fleet and demonstrates that it is possible to use a dPool to manage shared state in a factored operating system's physical page allocator. This thesis begins by presenting the design of the prototype fos operating system. In the context of fos system service fleets, this thesis describes the dPool data structure, its design, different implementations, and interfaces. 
The dPool data structure is shown to achieve scalability across even and uneven micro-benchmark workloads. This thesis shows that common parallel and distributed programming techniques apply to the creation of dPool and that background threads within a dPool can increase performance. Finally, this thesis evaluates different dPool implementations and demonstrates that intelligently pushing elements between dPool parts can increase scalability. (Copies available exclusively from MIT Libraries, Rm. 14-0551, Cambridge, MA 02139-4307. Ph. 617-253-5668; Fax 617-253-1690.)

2 citations

Proceedings Article•DOI•

A pattern for efficient parallel computation on multicore processors with scalar operand networks

Henry Hoffmann¹, Srinivas Devadas¹, Anant Agarwal¹•Institutions (1)

Massachusetts Institute of Technology¹

30 Mar 2010

TL;DR: This paper presents a pattern for developing parallel programs using systolic designs to execute efficiently (without resorting to simulation) on modern multicore processors featuring scalar operand networks and illustrates the application of this pattern to produce parallel implementations of matrix multiplication and convolution.

Abstract: Systolic arrays have long been used to develop custom hardware because they result in designs that are efficient and scalable. Many researchers have explored ways to exploit systolic designs in programmable processors; however, such efforts often result in the simulation of large systolic arrays on general-purpose platforms. While simulation can add flexibility and problem size independence, it comes at a cost of greatly reducing the efficiency of the original systolic approach. This paper presents a pattern for developing parallel programs using systolic designs to execute efficiently (without resorting to simulation) on modern multicore processors featuring scalar operand networks. This pattern provides a compromise solution that can achieve high efficiency and flexibility given appropriate hardware support. Several examples illustrate the application of this pattern to produce parallel implementations of matrix multiplication and convolution.

2 citations

A Case for Fine-Grain Adaptive Cache Coherence

George Kurian, Omer Khan, Srinivas Devadas

22 May 2012

TL;DR: This paper develops a scalable, efficient shared memory architecture that enables seamless adaptation between private and logically shared caching at the fine granularity of cache lines and allows private caching for data blocks with high spatio-temporal locality.

Abstract: As transistor density continues to grow geometrically, processor manufacturers are already able to place a hundred cores on a chip (e.g., Tilera TILE-Gx 100), with massive multicore chips on the horizon. Programmers now need to invest more effort in designing software capable of exploiting multicore parallelism. The shared memory paradigm provides a convenient layer of abstraction to the programmer, but will current memory architectures scale to hundreds of cores? This paper directly addresses the question of how to enable scalable memory systems for future multicores. We develop a scalable, efficient shared memory architecture that enables seamless adaptation between private and logically shared caching at the fine granularity of cache lines. Our datacentric approach relies on in-hardware runtime profiling of the locality of each cache line and only allows private caching for data blocks with high spatio-temporal locality. This allows us to better exploit on-chip cache capacity and enable low-latency memory access in large-scale multicores.

2 citations

Dissertation•

Software-assisted cache mechanisms for embedded systems

Srinivas Devadas¹, Prabhat Jain¹•Institutions (1)

Massachusetts Institute of Technology¹

01 Jan 2008

TL;DR: The results show that the proposed cache mechanisms show promise in improving cache performance and predictability with a modest increase in silicon area.

Abstract: Embedded systems are increasingly using on-chip caches as part of their on-chip memory system. This thesis presents cache mechanisms to improve cache performance and provide opportunities to improve data availability that can lead to more predictable cache performance. The first cache mechanism presented is an intelligent cache replacement policy that utilizes information about dead data and data that is very frequently used. This mechanism is analyzed theoretically to show that the number of misses using intelligent cache replacement is guaranteed to be no more than the number of misses using traditional LRU replacement. Hardware and software-assisted mechanisms to implement intelligent cache replacement are presented and evaluated. The second cache mechanism presented is that of cache partitioning, which exploits disjoint access sequences that do not overlap in the memory space. A theoretical result is proven that shows that modifying an access sequence into a concatenation of disjoint access sequences is guaranteed to improve the cache hit rate. Partitioning mechanisms inspired by the concept of disjoint sequences are designed and evaluated. A profile-based analysis, annotation, and simulation framework has been implemented to evaluate the cache mechanisms. This framework takes a compiled benchmark program and a set of program inputs and evaluates various cache mechanisms to provide a range of possible performance improvement scenarios. The proposed cache mechanisms have been evaluated using this framework by measuring cache miss rates and Instructions Per Clock (IPC) information. The results show that the proposed cache mechanisms show promise in improving cache performance and predictability with a modest increase in silicon area. (Copies available exclusively from MIT Libraries, Rm. 14-0551, Cambridge, MA 02139-4307. Ph. 617-253-5668; Fax 617-253-1690.)

2 citations

Proceedings Article•DOI•

Multiple fault testable sequential circuits

Pranav Ashar¹, Srinivas Devadas¹, A.R. Newton¹•Institutions (1)

University of California, Berkeley¹

01 May 1990

TL;DR: Methods for the synthesis of sequential circuits for high multiple fault testability are proposed, and it is shown that the effects of multiple stuck-at faults on the state graph of a sequential circuit can be much more dramatic than the effects of single stuck-at faults.

Abstract: The effects of multiple stuck-at faults on sequential circuits are analyzed. It is shown that the effects of multiple stuck-at faults on the state graph of a sequential circuit can be much more dramatic than the effects of single stuck-at faults. Methods for the synthesis of sequential circuits for high multiple fault testability are proposed.

2 citations

Cited by

Journal Article•DOI•

TaintDroid: An Information-Flow Tracking System for Realtime Privacy Monitoring on Smartphones

William Enck¹, Peter Gilbert², Seungyeop Han³, Vasant Tendulkar¹, Byung-Gon Chun⁴, Landon P. Cox², Jaeyeon Jung⁵, Patrick McDaniel⁶, Anmol Sheth•Institutions (6)

North Carolina State University¹, Duke University², University of Washington³, Seoul National University⁴, Microsoft⁵, Pennsylvania State University⁶

01 Jun 2014-ACM Transactions on Computer Systems

TL;DR: TaintDroid is an efficient, system-wide dynamic taint tracking and analysis system capable of simultaneously tracking multiple sources of sensitive data; it enables realtime analysis by leveraging Android's virtualized execution environment.

Abstract: Today’s smartphone operating systems frequently fail to provide users with visibility into how third-party applications collect and share their private data. We address these shortcomings with TaintDroid, an efficient, system-wide dynamic taint tracking and analysis system capable of simultaneously tracking multiple sources of sensitive data. TaintDroid enables realtime analysis by leveraging Android’s virtualized execution environment. TaintDroid incurs only 32% performance overhead on a CPU-bound microbenchmark and imposes negligible overhead on interactive third-party applications. Using TaintDroid to monitor the behavior of 30 popular third-party Android applications, in our 2010 study we found that 20 applications potentially misused users’ private information; so did a similar fraction of the tested applications in our 2012 study. Monitoring the flow of privacy-sensitive data with TaintDroid provides valuable input for smartphone users and security service firms seeking to identify misbehaving applications.

2,983 citations

Proceedings Article•DOI•

TaintDroid: an information-flow tracking system for realtime privacy monitoring on smartphones

William Enck¹, Peter Gilbert², Byung-Gon Chun³, Landon P. Cox², Jaeyeon Jung³, Patrick McDaniel¹, Anmol Sheth³•Institutions (3)

Pennsylvania State University¹, Duke University², Intel³

04 Oct 2010

TL;DR: Using TaintDroid to monitor the behavior of 30 popular third-party Android applications, this work found 68 instances of misappropriation of users' location and device identification information across 20 applications.

Abstract: Today's smartphone operating systems frequently fail to provide users with adequate control over and visibility into how third-party applications use their private data. We address these shortcomings with TaintDroid, an efficient, system-wide dynamic taint tracking and analysis system capable of simultaneously tracking multiple sources of sensitive data. TaintDroid provides realtime analysis by leveraging Android's virtualized execution environment. TaintDroid incurs only 14% performance overhead on a CPU-bound micro-benchmark and imposes negligible overhead on interactive third-party applications. Using TaintDroid to monitor the behavior of 30 popular third-party Android applications, we found 68 instances of potential misuse of users' private information across 20 applications. Monitoring sensitive data with TaintDroid provides informed use of third-party applications for phone users and valuable input for smartphone security service firms seeking to identify misbehaving applications.

2,379 citations

Journal Article•DOI•

Symbolic Boolean manipulation with ordered binary-decision diagrams

Randal E. Bryant¹•Institutions (1)

Carnegie Mellon University¹

01 Sep 1992-ACM Computing Surveys

TL;DR: The OBDD data structure is described and a number of applications that have been solved by OBDD-based symbolic analysis are surveyed.

Abstract: Ordered Binary-Decision Diagrams (OBDDs) represent Boolean functions as directed acyclic graphs. They form a canonical representation, making testing of functional properties such as satisfiability and equivalence straightforward. A number of operations on Boolean functions can be implemented as graph algorithms on OBDD data structures. Using OBDDs, a wide variety of problems can be solved through symbolic analysis. First, the possible variations in system parameters and operating conditions are encoded with Boolean variables. Then the system is evaluated for all variations by a sequence of OBDD operations. Researchers have thus solved a number of problems in digital-system design, finite-state system analysis, artificial intelligence, and mathematical logic. This paper describes the OBDD data structure and surveys a number of applications that have been solved by OBDD-based symbolic analysis.

2,196 citations

Proceedings Article•DOI•

Physical unclonable functions for device authentication and secret key generation

G. Edward Suh¹, Srinivas Devadas²•Institutions (2)

Cornell University¹, Massachusetts Institute of Technology²

04 Jun 2007

TL;DR: This work presents PUF designs that exploit inherent delay characteristics of wires and transistors that differ from chip to chip, and describes how PUFs can enable low-cost authentication of individual ICs and generate volatile secret keys for cryptographic operations.

Abstract: Physical Unclonable Functions (PUFs) are innovative circuit primitives that extract secrets from physical characteristics of integrated circuits (ICs). We present PUF designs that exploit inherent delay characteristics of wires and transistors that differ from chip to chip, and describe how PUFs can enable low-cost authentication of individual ICs and generate volatile secret keys for cryptographic operations.

2,014 citations

Proceedings Article•

Physical Unclonable Functions for Device Authentication and Secret Key Generation

Suh, Devadas

01 Jan 2007

1,944 citations
