Author

Carey Williamson

Bio: Carey Williamson is an academic researcher from the University of Calgary. The author has contributed to research in topics including the Internet and network packets. The author has an h-index of 39 and has co-authored 225 publications receiving 7,277 citations. Previous affiliations of Carey Williamson include Stanford University and the University of Saskatchewan.


Papers
Journal ArticleDOI
15 May 1996
TL;DR: This paper concludes with a discussion of caching and performance issues, using the invariants to suggest performance enhancements that seem most promising for Internet Web servers.
Abstract: The phenomenal growth in popularity of the World Wide Web (WWW, or the Web) has made WWW traffic the largest contributor to packet and byte traffic on the NSFNET backbone. This growth has triggered recent research aimed at reducing the volume of network traffic produced by Web clients and servers, by using caching, and reducing the latency for WWW users, by using improved protocols for Web interaction. Fundamental to the goal of improving WWW performance is an understanding of WWW workloads. This paper presents a workload characterization study for Internet Web servers. Six different data sets are used in this study: three from academic (i.e., university) environments, two from scientific research organizations, and one from a commercial Internet provider. These data sets represent three different orders of magnitude in server activity, and two different orders of magnitude in time duration, ranging from one week of activity to one year of activity. Throughout the study, emphasis is placed on finding workload invariants: observations that apply across all the data sets studied. Ten invariants are identified. These invariants are deemed important since they (potentially) represent universal truths for all Internet Web servers. The paper concludes with a discussion of caching and performance issues, using the invariants to suggest performance enhancements that seem most promising for Internet Web servers.

858 citations

Journal ArticleDOI
TL;DR: The paper concludes with a discussion of caching and performance issues, using the observed workload characteristics to suggest performance enhancements that seem promising for Internet Web servers.
Abstract: This paper presents a workload characterization study for Internet Web servers. Six different data sets are used in the study: three from academic environments, two from scientific research organizations, and one from a commercial Internet provider. These data sets represent three different orders of magnitude in server activity, and two different orders of magnitude in time duration, ranging from one week of activity to one year. The workload characterization focuses on the document type distribution, the document size distribution, the document referencing behavior, and the geographic distribution of server requests. Throughout the study, emphasis is placed on finding workload characteristics that are common to all the data sets studied. Ten such characteristics are identified. The paper concludes with a discussion of caching and performance issues, using the observed workload characteristics to suggest performance enhancements that seem promising for Internet Web servers.

771 citations
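As an illustration of the kind of analysis the two workload characterization papers above describe, the sketch below parses a Common Log Format access log and summarizes the document size distribution and how concentrated references are across distinct documents. It is a minimal, hypothetical example (the log path, field positions, and 10% popularity cut-off are assumptions), not the authors' analysis code.

```python
# Minimal workload-characterization sketch over an assumed Common Log Format log.
# Summarizes transfer sizes and reference concentration, two of the
# characteristics studied in the papers above.
import re
from collections import Counter

LOG_PATH = "access.log"  # hypothetical server log

# CLF: host ident user [date] "request" status bytes
CLF = re.compile(r'\S+ \S+ \S+ \[[^\]]+\] "(?:GET|HEAD|POST) (\S+)[^"]*" (\d{3}) (\d+|-)')

sizes = []
refs = Counter()
with open(LOG_PATH) as log:
    for line in log:
        m = CLF.match(line)
        if not m:
            continue
        url, status, nbytes = m.groups()
        if status == "200" and nbytes != "-":
            sizes.append(int(nbytes))
            refs[url] += 1

if not sizes:
    raise SystemExit("no successful requests found in the log")

sizes.sort()
total_refs = sum(refs.values())
top10 = sum(c for _, c in refs.most_common(max(1, len(refs) // 10)))

print(f"distinct documents: {len(refs)}, successful requests: {total_refs}")
print(f"median transfer size: {sizes[len(sizes) // 2]} bytes")
print(f"mean transfer size:   {sum(sizes) / len(sizes):.0f} bytes")
print(f"share of requests to the most popular 10% of documents: {top10 / total_refs:.1%}")
```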

Journal ArticleDOI
TL;DR: This is the first work to use semi-supervised learning techniques for the traffic classification problem; the approach allows classifiers to be built from training data consisting of only a few labeled flows and many unlabeled flows.

288 citations
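To make the "few labeled, many unlabeled flows" idea concrete, the sketch below runs a generic semi-supervised learner (scikit-learn's LabelPropagation) on synthetic per-flow features. The paper's own method maps clusters of flows to applications using a handful of labeled examples, so this code is an assumption-laden stand-in for illustration, not the published algorithm; the feature choices and class shapes are made up.

```python
# Illustrative semi-supervised flow classification: many unlabeled flows
# (label -1) plus a handful of labeled ones. LabelPropagation is used as a
# generic stand-in, not the paper's cluster-mapping method.
import numpy as np
from sklearn.semi_supervised import LabelPropagation
from sklearn.preprocessing import StandardScaler

rng = np.random.default_rng(0)

# Synthetic per-flow features: [packets, mean packet size (bytes), duration (s)]
web = rng.normal([12, 700, 2.0], [4, 150, 0.8], size=(200, 3))
p2p = rng.normal([400, 1100, 120.0], [120, 200, 40.0], size=(200, 3))
X = StandardScaler().fit_transform(np.vstack([web, p2p]))

y = np.full(400, -1)          # -1 marks unlabeled flows
y[:5] = 0                     # five labeled "web" flows
y[200:205] = 1                # five labeled "p2p" flows

model = LabelPropagation(kernel="knn", n_neighbors=7).fit(X, y)
truth = np.array([0] * 200 + [1] * 200)
print("accuracy with 10 labels:", (model.transduction_ == truth).mean())
```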

Proceedings ArticleDOI
11 Sep 2006
TL;DR: The results show that port-based analysis is ineffective, being unable to identify 30%-70% of today's Internet traffic, and the transport-layer method seems promising, providing a robust means to assess aggregate P2P traffic.
Abstract: This paper focuses on network traffic measurement of Peer-to-Peer (P2P) applications on the Internet. P2P applications supposedly constitute a substantial proportion of today's Internet traffic. However, current P2P applications use several obfuscation techniques, including dynamic port numbers, port hopping, HTTP masquerading, chunked file transfers, and encrypted payloads. As P2P applications continue to evolve, robust and effective methods are needed for P2P traffic identification. The paper compares three methods to classify P2P applications: port-based classification, application-layer signatures, and transport-layer analysis. The study uses empirical network traces collected from the University of Calgary Internet connection for the past 2 years. The results show that port-based analysis is ineffective, being unable to identify 30%-70% of today's Internet traffic. Application signatures are accurate, but may not be possible for legal or technical reasons. The transport-layer method seems promising, providing a robust means to assess aggregate P2P traffic. The latter method suggests that 30%-70% of the campus Internet traffic for the past year was P2P.

233 citations
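To make the comparison in the abstract above concrete, here is the simplest of the three methods, port-based classification, applied to hypothetical flow records. The port list and example flows are illustrative assumptions, and, as the paper reports, this approach misses much of today's P2P traffic because applications hop ports or masquerade as HTTP.

```python
# Naive port-based P2P classification over hypothetical flow records.
# The paper above finds this approach unable to identify 30%-70% of traffic.
from typing import NamedTuple

class Flow(NamedTuple):
    src_port: int
    dst_port: int
    byte_count: int

# Illustrative well-known P2P ports (BitTorrent, eDonkey, Gnutella, etc.);
# not an authoritative or complete list.
P2P_PORTS = {1214, 4662, 6346, 6347, 6699} | set(range(6881, 6890))

def classify(flow: Flow) -> str:
    if flow.src_port in P2P_PORTS or flow.dst_port in P2P_PORTS:
        return "p2p"
    if 80 in (flow.src_port, flow.dst_port) or 443 in (flow.src_port, flow.dst_port):
        return "web"        # may actually be P2P traffic masquerading as HTTP
    return "unknown"

flows = [Flow(51234, 6881, 4_200_000),   # BitTorrent on a default port
         Flow(40000, 80, 3_900_000),     # P2P masquerading as HTTP: misclassified
         Flow(52000, 443, 120_000)]
for f in flows:
    print(f, "->", classify(f))
```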

Journal ArticleDOI
TL;DR: The Visual Backchannel design provides an evolving, interactive, and multi-faceted visual overview of large-scale ongoing conversations on Twitter, and includes visual saliency for what is happening now and what has just happened in the context of the evolving conversation.
Abstract: We introduce the concept of a Visual Backchannel as a novel way of following and exploring online conversations about large-scale events. Microblogging communities, such as Twitter, are increasingly used as digital backchannels for timely exchange of brief comments and impressions during political speeches, sport competitions, natural disasters, and other large events. Currently, shared updates are typically displayed in the form of a simple list, making it difficult to get an overview of the fast-paced discussions as they happen in the moment and of how they evolve over time. In contrast, our Visual Backchannel design provides an evolving, interactive, and multi-faceted visual overview of large-scale ongoing conversations on Twitter. To visualize a continuously updating information stream, we include visual saliency for what is happening now and what has just happened, set in the context of the evolving conversation. As part of a fully web-based coordinated-view system we introduce Topic Streams, a temporally adjustable stacked graph visualizing topics over time, a People Spiral representing participants and their activity, and an Image Cloud encoding the popularity of event photos by size. Together with a post listing, these mutually linked views support cross-filtering along topics, participants, and time ranges. We discuss our design considerations, in particular with respect to evolving visualizations of dynamically changing data. Initial feedback indicates significant interest and suggests several unanticipated uses.

229 citations
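The Topic Streams view described above is, at its core, a stacked graph of topic frequencies over time. The matplotlib sketch below is a rough, static analogue on made-up post counts; the real Visual Backchannel is an interactive, web-based coordinated-view system, and the topic names and counts here are assumptions for illustration only.

```python
# Rough static analogue of the "Topic Streams" view: a stacked graph of
# topic frequencies over time, drawn with matplotlib on made-up data.
import numpy as np
import matplotlib.pyplot as plt

minutes = np.arange(60)                      # one hour of a hypothetical event
topics = {
    "keynote": np.clip(30 - np.abs(minutes - 10), 0, None),
    "outage":  np.clip(25 - np.abs(minutes - 35), 0, None),
    "closing": np.clip(20 - np.abs(minutes - 55), 0, None),
}

fig, ax = plt.subplots(figsize=(8, 3))
ax.stackplot(minutes, list(topics.values()), labels=list(topics.keys()))
ax.set_xlabel("minutes into event")
ax.set_ylabel("posts per minute")
ax.legend(loc="upper left")
plt.tight_layout()
plt.show()
```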


Cited by
Proceedings ArticleDOI
22 Jan 2006
TL;DR: Some of the major results in random graphs and some of the more challenging open problems are reviewed, including those related to the WWW.
Abstract: We will review some of the major results in random graphs and some of the more challenging open problems. We will cover algorithmic and structural questions. We will touch on newer models, including those related to the WWW.

7,116 citations

Book ChapterDOI
30 May 2018
TL;DR: Tata Africa Services (Nigeria) Limited is the nodal point for Tata businesses in West Africa and operates as the hub of Tata operations in Nigeria and the rest of West Africa.
Abstract: Established in 2006, TATA Africa Services (Nigeria) Limited operates as the nodal point for Tata businesses in West Africa. TATA Africa Services (Nigeria) Limited has a strong presence in Nigeria with investments exceeding USD 10 million. The company was established in Lagos, Nigeria as a subsidiary of TATA Africa Holdings (SA) (Pty) Limited, South Africa and serves as the hub of Tata’s operations in Nigeria and the rest of West Africa.

3,658 citations

Proceedings Article
01 Jan 1991
TL;DR: It is concluded that properly augmented and power-controlled multiple-cell CDMA (code division multiple access) promises a quantum increase in current cellular capacity.
Abstract: It is shown that, particularly for terrestrial cellular telephony, the interference-suppression feature of CDMA (code division multiple access) can result in a many-fold increase in capacity over analog and even over competing digital techniques. A single-cell system, such as a hubbed satellite network, is addressed, and the basic expression for capacity is developed. The corresponding expressions for a multiple-cell system are derived, and the distribution on the number of users supportable per cell is determined. It is concluded that properly augmented and power-controlled multiple-cell CDMA promises a quantum increase in current cellular capacity.

2,951 citations
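For context, the capacity expression the abstract refers to is conventionally written in the textbook form below. This is a standard approximation consistent with the abstract rather than a quotation of the paper, so the symbols (voice-activity factor, other-cell interference fraction) should be read as assumptions about the paper's notation.

```latex
% Standard textbook form of the CDMA capacity relation (not quoted from the paper).
% N: users per cell, W: spread bandwidth, R: information rate,
% E_b/N_0: required bit energy to interference density,
% \alpha: voice-activity factor, f: other-cell interference fraction.
\[
  N \;\approx\; 1 + \frac{W/R}{E_b/N_0}\cdot\frac{1}{\alpha}\cdot\frac{1}{1+f}
\]
```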

Journal ArticleDOI
15 May 1996
TL;DR: It is shown that the self-similarity in WWW traffic can be explained based on the underlying distributions of WWW document sizes, the effects of caching and user preference in file transfer, the effect of user "think time", and the superimposition of many such transfers in a local area network.
Abstract: Recently the notion of self-similarity has been shown to apply to wide-area and local-area network traffic. In this paper we examine the mechanisms that give rise to the self-similarity of network traffic. We present a hypothesized explanation for the possible self-similarity of traffic by using a particular subset of wide area traffic: traffic due to the World Wide Web (WWW). Using an extensive set of traces of actual user executions of NCSA Mosaic, reflecting over half a million requests for WWW documents, we examine the dependence structure of WWW traffic. While our measurements are not conclusive, we show evidence that WWW traffic exhibits behavior that is consistent with self-similar traffic models. Then we show that the self-similarity in such traffic can be explained based on the underlying distributions of WWW document sizes, the effects of caching and user preference in file transfer, the effect of user "think time", and the superimposition of many such transfers in a local area network. To do this we rely on empirically measured distributions both from our traces and from data independently collected at over thirty WWW sites.

2,332 citations
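The mechanism the abstract above points to, heavy-tailed transfer sizes superimposed across many sources, has a compact quantitative form worth recording here. The relation below is the standard heavy-tailed ON/OFF result from the self-similarity literature, stated as context rather than quoted from the paper.

```latex
% Standard heavy-tailed ON/OFF result (context, not a quotation from the paper):
% if transfer (ON-period) durations X are heavy-tailed with tail index \alpha,
% the superposition of many such sources is self-similar with Hurst parameter H.
\[
  P[X > x] \sim x^{-\alpha}, \quad 1 < \alpha < 2
  \qquad\Longrightarrow\qquad
  H = \frac{3 - \alpha}{2}, \quad \tfrac{1}{2} < H < 1 .
\]
```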

Journal ArticleDOI
TL;DR: Web usage mining is the application of data mining techniques to discover usage patterns from Web data, in order to understand and better serve the needs of Web-based applications; the paper describes its three phases, preprocessing, pattern discovery, and pattern analysis, in detail.
Abstract: Web usage mining is the application of data mining techniques to discover usage patterns from Web data, in order to understand and better serve the needs of Web-based applications. Web usage mining consists of three phases, namely preprocessing, pattern discovery, and pattern analysis. This paper describes each of these phases in detail. Given its application potential, Web usage mining has seen a rapid increase in interest, from both the research and practice communities. This paper provides a detailed taxonomy of the work in this area, including research efforts as well as commercial offerings. An up-to-date survey of the existing work is also provided. Finally, a brief overview of the WebSIFT system as an example of a prototypical Web usage mining system is given.

2,227 citations
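The preprocessing phase mentioned above typically begins by grouping raw log requests into user sessions. The sketch below shows time-gap sessionization on hypothetical (user, timestamp, URL) records; the 30-minute inactivity timeout is a common convention assumed here, not something prescribed by the survey.

```python
# Sketch of the preprocessing phase of Web usage mining: sessionization of
# request records per user with a 30-minute inactivity timeout (assumed).
from collections import defaultdict

TIMEOUT = 30 * 60  # seconds of inactivity that ends a session

# Hypothetical (user, unix_timestamp, url) records, already cleaned and
# sorted in time order for each user.
records = [
    ("alice", 1_000, "/index.html"),
    ("alice", 1_120, "/papers.html"),
    ("alice", 4_000, "/index.html"),   # gap > 30 min: starts a new session
    ("bob",   1_050, "/index.html"),
]

sessions = defaultdict(list)           # user -> list of sessions (lists of URLs)
last_seen = {}
for user, ts, url in records:
    if user not in last_seen or ts - last_seen[user] > TIMEOUT:
        sessions[user].append([])      # open a new session for this user
    sessions[user][-1].append(url)
    last_seen[user] = ts

for user, user_sessions in sessions.items():
    print(user, user_sessions)
```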