NStreamAware: real-time visual analytics for data streams to enhance situational awareness
read more
Citations
Temporal MDS Plots for Analysis of Multivariate Data
BubbleNet: A Cyber Security Dashboard for Visualizing Patterns
Unlocking user-centered design methods for building cyber security visualizations
Commercial Visual Analytics Systems–Advances in the Big Data Analytics Field
Human Factors in Streaming Data Analysis: Challenges and Opportunities for Information Visualization
References
Toward a Theory of Situation Awareness in Dynamic Systems
A framework for clustering evolving data streams
Discretized streams: an efficient and fault-tolerant model for stream processing on large clusters
A Survey of Visualization Systems for Network Security
Cyber situational awareness – A systematic review of the literature
Related Papers (5)
Frequently Asked Questions (14)
Q2. What future works have the authors mentioned in the paper "Nstreamaware: real-time visual analytics for data streams to enhance situational awareness" ?
However, the system still needs to be applied to a larger computer network, which is part of the future work. Automatically defining good sizes for the sliding windows is also planed for the future. The merging model based on the feature selection process, could be applied to the realtime stream in the future, to actually merge sliding slices in real-time, which is not fully implemented yet. Tracking individual events over time was not the focus of this work, however, more work seems to be promising to extend the approach in that respect as well.
Q3. What is the purpose of Apache Spark?
Apache Spark introduces a programming model, called Resilient Distributed Datasets (RDDs), which provide an interface to coarse-grained transformations (e.g., map, group-by, filter, join).
Q4. What is the ultimate goal of visual analytics systems for data streams?
The ultimate goal of visual analytics systems for data streams is to enhance situational awareness to facilitate decision making.
Q5. Why do the authors need to decouple the flow-rate of a data stream from screen?
Because of the unpredictable characteristics of data streams with respect to volume, velocity, variety, and veracity, the authors additionally need visualizations able to decouple the flow-rate of a data stream from screen updates and keep the latter constant and predictable to not overwhelm the user.
Q6. What is the architecture of the NVisAware service?
Their architecture consists of their REST Service, Spark Service and a web application with various visualizations, called NVisAware.
Q7. How long did it take to create a new sliding slice?
To provide a new sliding slice every 30 seconds, the authors initialized the system with a batch and slide interval of 30s and a window length of 60s.
Q8. How could the service be deployed to a cluster of hardware machines?
To provide further scalability the service could also be deployed to a cluster of hardware machines running Apache Spark or to a cloud-based deployment.
Q9. How many servers are connected to a central syslog server?
13 servers are connected to a central syslog server, producing 30 000 to 80 000 messages per day with individual peaks of up to 5 000 messages per minute.
Q10. How many threads were used to run the Spark Service?
The Spark Service was operated in local mode on a normal workstation Dell OptiPlex 980, Core i7-860, 8GB RAM 4x 2.80GHz with 10 separate working threads.
Q11. What is the main limitation with respect to performance and scalability?
When displaying hundreds of sliding slices at the same time the performance decreased, because of browser and memory restrictions of the workstation.
Q12. How many times did the participants have to do the analysis?
A first analysis had to be sent to the organizers within three hours after first connecting to the final data stream from 20:00 to 21:30, which could only be streamed once, to force the participants to do real-time processing and provide immediate situational assessment under time pressure.
Q13. Why is the POK suspected in the disappearance?
Because of an ongoing conflict between an organization known as the Protectors of Kronos (POK), they are suspected in the disappearance.
Q14. What is the purpose of the slice?
The slice will also be added to the Event Timeline & Insights view, where all starred objects are presented in a traditional interactive timeline to explore the events flagged and labeled by the analysts.