An architecture for interspatial communication

doi:10.1109/INFCOMW.2018.8406931

Proceedings Article•DOI•

An architecture for interspatial communication

Anil Madhavapeddy¹, KC Sivaramakrishnan¹, Gemma Gordon¹, Thomas Gazagnaire•Institutions (1)

15 Apr 2018-pp 716-723

TL;DR: A software architecture that inverts the current model by building an operating system designed to securely connect physical spaces with extremely low latency, high bandwidth local-area computation capabilities and service discovery is laid out.

read less

Abstract: Digital infrastructure in modern urban environments is currently very Internet-centric, and involves transmitting data to physically remote environments The cost for this is data insecurity, high response latency and unpredictable reliability of services In this paper, we lay out a software architecture that inverts the current model by building an operating system designed to securely connect physical spaces with extremely low latency, high bandwidth local-area computation capabilities and service discovery We describe our early prototype design OSMOSE, which is based on unikernels and a distributed store

...read moreread less

Summary (4 min read)

Jump to: [I. INTRODUCTION] – [II. THE PROBLEM WITH EXISTING MOBILE SOFTWARE] – [A. Reliability] – [B. Security] – [C. Latency] – [III. TOWARDS INTERSPATIAL APPLICATIONS] – [A. Incremental Networking] – [B. Spatial Interfaces] – [C. Native Hardware] – [D. A Cross-Layer Unikernel Software Architecture] – [IV. THE INTERSPATIAL OPERATING SYSTEM] – [Actuators] – [A. An Example Deployment] – [B. Security and Privacy] and [VI. CONCLUSION]

I. INTRODUCTION

It begins to record and encrypt the conversation with a group secret keyed to the participants.
This paper proposes an operating software architecture aimed at fully supporting these new applications on modern hardware platforms.

II. THE PROBLEM WITH EXISTING MOBILE SOFTWARE

The answer lies in the traditional operating systems and network architecture that power the current generation of smart devices.
Consider what happens when the authors speak into an Apple Watch in order to retrieve some information.
For this to work, the watch must be connected to a mobile phone, which in turn needs to establish a wireless connection to the Internet, where Apple voice recognition services will dispatch a query to the Google search engine.
If any one of the services in this chain breaks (for example, the common case of the phone signal being "trapped" by a wifi authentication page), then the user experience is broken.

A. Reliability

Services deployed in physical environments need to work all the time, and be locally debuggable when they do fail.
They also need to work independently of Internet connectivity, so that every building can be an island of digital services even when offline.

B. Security

The amount of sensitive data being captured in these environments is tremendous, and much of it should not leave the confines of the physical space without explicit permission from all parties involved.
This is extremely difficult to police given the amount of Internet-wide coordination used in existing devices, but can be fixed if the local environment provides a structured mechanism for handling such storage securely with respect to local environmental policies.

C. Latency

Interactive services require response times beneath the uncanny valley of human perception.
For the scenario above to really feel seamless, the authors need a new "latency first" application architecture that makes data and computation capacity available physically near the human users, with scheduling tolerances for responses in the milliseconds rather than seconds.
Solving these problems is difficult to do piecemeal across individual parts of the software stack, since they are currently general-purpose and loosely coupled.
A typical IoT device might run its own copy of the Linux kernel, with an embedded userspace, a VPN into a centralised management server operated by the vendor for updates, and a mobile application for the user to manage it.
There is little synergy or dependence on shared infrastructure to assist with the process.

III. TOWARDS INTERSPATIAL APPLICATIONS

Before proposing a solution, the authors consider some design principles to underpin how applications in physical spaces operate.
When dealing with physical devices, the latency of response to external events is paramount.
Safety critical systems often require "hard realtime" operation, or more often attempt to provide soft guarantees about when they respond.
This implies that there is simply no time for doing conventional operations such as network scanning or service discovery in serial -by the time the user has asked a question, the physical environment should already have the appropriate resources established.
The authors thus need to rearrange the programming models around building interspatial applications to make them suitable for such an environment.

A. Incremental Networking

A device that is starting "cold" will need to establish a network connection to the local node, most commonly via TCP and subsequently a secure transport via TLS.
Once the connection has been established, the authors still need to authenticate the user(s), negotiate security keys, perhaps perform version negotiation between devices, and usually interrupt the user at an inconvenient time for a software security update.
The reason that every device has to currently do all this work is partly due to the end-to-end principle that guides the design of Internet protocols.
Every IP node is a "dumb" router that knows how to forward protocol packets, but lacks higherlevel application information.
Incremental connectivity requires fundamental programming interface changes in order to combine information across the traditional operating system and application stacks, and for us to move away from the venerable sockets API [6] .

B. Spatial Interfaces

In a physical environment it is also necessary to mesh wired sensors (such as motion detectors or cameras) with mobile ones (humans carrying location beacons).
The wired systems need to be establishing connectivity with services required by applications, and the wireless ones need to be searching for nearby network nodes via Zigbee [7] or Bluetooth [8] .
Internet protocols such as TCP have not traditionally not handled connection handoff well -extensions such as multipath TCP are effective [9] but have struggled with extending existing interfaces such as the sockets API with the new routing semantics available to applications [10] .
For their interspatial applications, a building needs to be able to manage all the local resources (include bandwidth or storage) just as it does so with other utilities such as electricity.
In return, individual applications do not need to manage their own networking, storage and authentication needs as they are provided by the surrounding environment.

C. Native Hardware

Applications are traditionally built for a specific device profile in mind -for example, a mobile phone or a desktop application.
The proliferation of multiple display form factors has resulted in a new mode of "responsive" design [11] that can adapt an interface to multiple resolutions and sizes.
With interspatial applications, the authors wish to eliminate the need for importing wearable and mobile devices into a building as the sole mechanism of interaction with a user.
Hardware present in the environment such as parametric speakers for 3D positional audio [12] or wallpaper projections [13] should be able to be used by local users, rather than being constrained to the single form factor mobile devices (e.g. a watch or a mobile phone) that the authors carry around.
For this to work, interspatial applications needs to be designed to have responsive user interfaces, but also a suitable trust model to let users establish secure connections to environmental hardware that is not directly owned by them [14] .

D. A Cross-Layer Unikernel Software Architecture

The three design principles for interspatial applications programming interfaces (incremental networking, spatial interfaces and use of native hardware) require cross-cutting changes across traditional operating system and application programming interfaces.
The first step to making this a practical prospect is to adopt the discipline of deploying unikernels [15] on the hardware, and replacing the traditional layered OS stack.
The authors now find that the same approach is a perfect fit to driving the world of resourceconstrained embedded hardware found in physical environments.
In a library operating system, the functionality that is conventionally found in a monolithic operating system kernel is broken out into software libraries that are available for use by the application in exactly the same way as other higherlevel functionality currently is.
The same application source code could be compiled to a tiny embedded processor without an MMU, or also be recompiled into a conventional Unix process.

IV. THE INTERSPATIAL OPERATING SYSTEM

The authors now discuss the design of a prototype interspatial operating system based on a unikernel architecture, dubbed OSMOSE.
Details are out of scope for this position paper -there are a number of indoor localisation technologies available [19] .
Since OSMOSE needs to run across a variety of hardware with differing resource constraints, the traditional model of booting a single image no longer holds.
This is then combined with physical topology information and security policies to compile a set of unikernels that are booted on the building's hardware.
All communications in the system are dispatched through a distributed storage system that immutably stores interactions for audit and debugging purposes.

Actuators

Sensors CPUs Network Flows Scheduler Fig. 1 .
Instead of a processbased system, a deployment can be viewed as a graph of distributed hardware nodes, through which event stream data from the environment flows.
Every coordination operation between nodes is piped through the Irmin datastore, allowing the complete state of the system to be tracked with strong provenance.
Interspatial applications require a lot of immediate interaction with the physical environment.
Every time a new participant enters, their collection of identities are aggregated to generate temporary encryption keys for any shared communication.

A. An Example Deployment

Instead of the usual crowded array of laptops, mobile phones and headphones, the users benefit from use of the hardware present in the situated environment.
Meanwhile, the lone worker on the bottom right can listen to an encrypted podcast via parametric audio speakers that project the sound directly from the ceiling, without a need for the user to hook up headphones and a mobile device.
Behind the scenes, OSMOSE is providing important operating system services to this scenario: each of the users has been authenticated upon entry to the building and could install their interspatial applications using the local wireless network.
Each of the users need to be tracked by the building with low latency to ensure that they can immediately interact with the devices in front of them, while also enforcing isolation of their data across the shared hardware in the building such as the microphones and parametric audio speakers.
As they purchased their caffeinated beverages and took their seats, other services such as payments could also be potentially performed securely using this infrastructure.

B. Security and Privacy

At the lower levels, the unikernel approach is designed to eliminate unnecessary code and promote the use of safer programming languages throughout the software stack.
This is mainly a concern due to data leakage that arises from third parties (e.g. cloud providers) gaining access to this private data.
The data capture systems in OSMOSE can replace the role currently taken by mobile devices [33] , and also provide a corresponding increase in accuracy due to not requiring the data to be transmitted remotely before being processed.
OSMOSE is designed to ensure that deletion is an expensive but reliable operation -since every bit of data is tracked, a distributed garbage collector can traverse everything and rewrite history on all nodes to remove any individual data that should be excised.

VI. CONCLUSION

The authors have presented an early design for a distributed operating system designed for deploying a new generation of low-latency, interactive applications in urban environments.
The authors design inverts the existing model of funnelling data to the cloud, and instead provides the infrastructure for rapidly processing data locally.
The authors OSMOSE prototype design brings the traditional benefits of an operating system to the distributed array of hardware that comprises a physical building -resource scheduling, hardware isolation and a userlevel programming interface.
Any operating system is only worth the applications that run on it.
All of the source code of the constituent components discussed here such as MirageOS and Irmin are available under a BSD-style license from https://github.com/ mirage.

Did you find this useful? Give us your feedback

Figures (2)

Fig. 2. A shared co-working space where OSMOSE is deployed. There are two users collaborating using a shared display on the table, while another user is listening to a personalised audio stream. The barista is receiving music and audio notifications while working.

Fig. 1. Architecture of the interspatial operating system. The system is configured declaratively via a set of application source code, policies and network topology, and is then incrementally built and deployed onto hardware. All communications happens via the Merkle-tree based branch storage.

Content maybe subject to copyright Report

An Architecture for Interspatial Communication

Anil Madhavapeddy

Computer Laboratory

University of Cambridge

avsm2@cl.cam.ac.uk

KC Sivaramakrishnan

Computer Laboratory

University of Cambridge

sk826@cl.cam.ac.uk

Gemma Gordon

Computer Laboratory

University of Cambridge

gg417@cl.cam.ac.uk

Thomas Gazagnaire

Tarides

Paris, France

thomas@tarides.com

Abstract—Digital infrastructure in modern urban environ-

ments is currently very Internet-centric, and involves transmit-

ting data to physically remote environments. The cost for this is

data insecurity, high response latency and unpredictable reliabil-

ity of services. In this paper, we lay out a software architecture

that inverts the current model by building an operating system

designed to securely connect physical spaces with extremely

low latency, high bandwidth local-area computation capabilities

and service discovery. We describe our early prototype design

OSMOSE, which is based on unikernels and a distributed store.

I. INTRODUCTION

In his classic essay on “Computing for the 21st Cen-

tury” [1], Mark Weiser observed that:

The most profound technologies are those that

disappear. They weave themselves into the fabric of

everyday life until they are indistinguishable from it.

Since then, there have been tremendous advances in mobile

and sensing technologies, and there is an ongoing rapid

deployment of “smart” digital infrastructure that augments the

physical environment. Consider the following scenario for a

next-generation building:

“A group of people are meeting in an coworking space to

discuss a project they are working on together. They are the

ﬁrst to arrive at the building on a cold winter morning, and are

directed to a meeting room automatically by audio directions

individually projected to them. As they walk into the room

the lights are already on and the heating has been preset to

a comfortable level. They wave to a wall where a display

appears, and the building recognises the gesture and projects

a shared folder of the project. It begins to record and encrypt

the conversation with a group secret keyed to the participants.

One of the participants receives a tight-beam audio notiﬁcation

informing them that they are needed elsewhere brieﬂy. When

they leave the room the shared recording is immediately

paused within milliseconds to allow other participants to

casually chat among themselves. Subsequently, the encrypted

recording is transcribed, and made available in the datastores

of each participant under the terms of their shared disclosure

agreement for their project.”

This scenario illustrates the distance between Weiser’s 1999

vision of ubiquitous computing and where we currently are

almost two decades on. The above narrative features no

wearables or mobiles among the human participants – instead

of an array of individual smart phones, watches and laptops,

the situated environment of the building is providing secure,

shared, multi-tenant infrastructure for audio and visual com-

munications among the human participants. This is analogous

to the shift from individual computers to multi-tenant cloud

infrastructure in the past decade [2], but applied to the sharing

of physical places and devices.

Another aspect to consider in the scenario is the seamless,

low-latency nature of the interactions that were described

between humans and electronic devices. When a human takes

an action the environment is able to react within milliseconds

to take action immediately, rather than delaying for seconds

and causing the awkward delays that we have become used to

with modern Internet-connected mobile devices. By providing

immediacy, the technology can complement social interactions

rather than interrupting them [3].

Encouragingly, much of the hardware required to physically

assemble the described building is available off the shelf. E-ink

display wallpaper, parametric audio speakers for directional

broadcast, gesture recognition and smart lighting and heating

are all available or relatively easy to construct. The missing

link is the foundational software that manages, coordinates

and secures the distributed hardware – the operating system

for digitally connecting physical spaces together.

In this position paper, we begin to bridge this gap between

available hardware and missing software. We construct an

architectural model for interspatial networking – an operating

system for the dense, interconnected and shared urban spaces

that most humans live in. It aims to shift us away from the

wide-area, mobile-oriented devices that are permeating early

deployments of smart infrastructure, and towards a sustainable

digital model that is far more similar to a conventional utility

such as electricity or gas. Our system aims to make it far

simpler and more secure to introduce, manage and rely on

digital devices during day-to-day life in urban environments.

There is also an exciting new generation of upcoming

applications that demand vast amounts of bandwidth and low-

latency responses. Augmented and virtual reality, environmen-

tal e-ink displays, parametric directional sound, and robotic ap-

pliances simply do not work well if the latencies of interactions

with them rise above a certain point. This paper proposes an

operating software architecture aimed at fully supporting these

new applications on modern hardware platforms. We will next

describe some of the challenges in more detail (§II), discuss

the design principles behind interspatial applications (§III),

then present the design of our OSMOSE prototype OS (§IV),

and ﬁnally discuss examples and some implications (§V).

II. THE PROBLEM WITH EXISTING MOBILE SOFTWARE

Given that the hardware is all available, why is the above

scenario difﬁcult to implement? The answer lies in the

traditional operating systems and network architecture that

power the current generation of smart devices. Because of

the rise of cloud computing, they are typically built around

communication to centralised Internet services. For example,

consider what happens when we speak into an Apple Watch

in order to retrieve some information. For this to work, the

watch must be connected to a mobile phone, which in turn

needs to establish a wireless connection to the Internet, where

Apple voice recognition services will dispatch a query to the

Google search engine. If any one of the services in this chain

breaks (for example, the common case of the phone signal

being “trapped” by a wiﬁ authentication page), then the user

experience is broken. Even when it does work, it can take

seconds to respond to the voice query, and with very variable

latency. Once the response comes through, handoff to other

devices is also difﬁcult unless they are owned by the user.

A. Reliability

Services deployed in physical environments need to work

all the time, and be locally debuggable when they do fail.

How do we shift from a light switch that “almost works”

after being pressed several times to seamless voice or gesture-

driven services that are always tuned and available in a

given environment? They also need to work independently of

Internet connectivity, so that every building can be an island

of digital services even when ofﬂine.

B. Security

The amount of sensitive data being captured in these envi-

ronments is tremendous, and much of it should not leave the

conﬁnes of the physical space without explicit permission from

all parties involved. This is extremely difﬁcult to police given

the amount of Internet-wide coordination used in existing

devices, but can be ﬁxed if the local environment provides a

structured mechanism for handling such storage securely with

respect to local environmental policies.

C. Latency

Interactive services require response times beneath the un-

canny valley of human perception. For the scenario above to

really feel seamless, we need a new “latency ﬁrst” application

architecture that makes data and computation capacity avail-

able physically near the human users, with scheduling toler-

ances for responses in the milliseconds rather than seconds.

Solving these problems is difﬁcult to do piecemeal across

individual parts of the software stack, since they are currently

general-purpose and loosely coupled. A typical IoT device

might run its own copy of the Linux kernel, with an embedded

userspace, a VPN into a centralised management server oper-

ated by the vendor for updates, and a mobile application for

the user to manage it. There is little synergy or dependence

on shared infrastructure to assist with the process.

III. TOWARDS INTERSPATIAL APPLICATIONS

Before proposing a solution, we consider some design prin-

ciples to underpin how applications in physical spaces operate.

When dealing with physical devices, the latency of response

to external events is paramount. Safety critical systems often

require “hard realtime” operation, or more often attempt to

provide soft guarantees about when they respond. As we move

further up the stack to modern mobile applications, there are

no such guarantees. Pressing a button on an iOS or Android

device goes through many layers of scheduling and network

connectivity before a response is generated.

Our interspatial architecture needs to allow us to deploy

“latency-ﬁrst apps” into physical environments – ones that

are designed to run on local devices with response budgets

in the milliseconds, and with minimal external connectivity

needs. This implies that there is simply no time for doing

conventional operations such as network scanning or service

discovery in serial – by the time the user has asked a question,

the physical environment should already have the appropriate

resources established. We thus need to rearrange the program-

ming models around building interspatial applications to make

them suitable for such an environment.

A. Incremental Networking

While it is currently possible to spin up services on demand

within milliseconds [4] it is difﬁcult to transmit a response for

complex clustered services within a time budget of a few mil-

liseconds. A device that is starting “cold” will need to establish

a network connection to the local node, most commonly via

TCP and subsequently a secure transport via TLS. The number

of hops required to establish a secure connection have been

recognised as a problem on the wider Internet, and new low-

latency protocols such as QUIC [5] are being deployed in

browsers. However, even these newer protocols only solve part

of the problem, since they do not integrate closely with the

application layer. Once the connection has been established,

we still need to authenticate the user(s), negotiate security

keys, perhaps perform version negotiation between devices,

and usually interrupt the user at an inconvenient time for a

software security update.

The reason that every device has to currently do all this

work is partly due to the end-to-end principle that guides the

design of Internet protocols. Every IP node is a “dumb” router

that knows how to forward protocol packets, but lacks higher-

level application information. Existing system interfaces (such

as the BSD sockets API, or DNS resolution) implement this

network stack, which is in turn baked into existing application

logic. Each device needs to repetitively perform the same

actions to establish connectivity, with the surrounding network

environment being unable to assist.

An alternative interface in a physical environment is to

move away from just-in-time connectivity towards a model

of establishing connectivity incrementally as the opportunity

arises. When sufﬁcient information from the environment

arises to establish a partial connection (for example, upon

entering a building but before making any service requests)

the application needs to perform the operations that it can

do so then, such as negotiating security keys or ensuring

that software versions match. When a service request does

ﬁnally happen, the previously derived connection information

can be used to perform a single hop. Incremental connectivity

requires fundamental programming interface changes in order

to combine information across the traditional operating system

and application stacks, and for us to move away from the

venerable sockets API [6].

B. Spatial Interfaces

Incremental networking lets an individual network connec-

tion to a node be established quickly. In a physical environ-

ment it is also necessary to mesh wired sensors (such as motion

detectors or cameras) with mobile ones (humans carrying

location beacons). The wired systems need to be establishing

connectivity with services required by applications, and the

wireless ones need to be searching for nearby network nodes

via Zigbee [7] or Bluetooth [8]. This connectivity mesh is

necessary to ensure that all of the distributed components that

comprise a physical environment can all communicate with

predictable and very low latency.

Internet protocols such as TCP have not traditionally not

handled connection handoff well – extensions such as multi-

path TCP are effective [9] but have struggled with extending

existing interfaces such as the sockets API with the new

routing semantics available to applications [10].

For our interspatial applications, a building needs to be

able to manage all the local resources (include bandwidth or

storage) just as it does so with other utilities such as electricity.

In return, individual applications do not need to manage their

own networking, storage and authentication needs as they

are provided by the surrounding environment. This requires

a change in the traditional programming model to allow

applications to multiplex their connectivity needs around a

networking environment that is multipath and dynamic as

humans move around the physical space.

C. Native Hardware

Applications are traditionally built for a speciﬁc device

proﬁle in mind – for example, a mobile phone or a desktop

application. The proliferation of multiple display form factors

has resulted in a new mode of “responsive” design [11] that

can adapt an interface to multiple resolutions and sizes.

With interspatial applications, we wish to eliminate the need

for importing wearable and mobile devices into a building

as the sole mechanism of interaction with a user. Hardware

present in the environment such as parametric speakers for 3D

positional audio [12] or wallpaper projections [13] should be

able to be used by local users, rather than being constrained to

the single form factor mobile devices (e.g. a watch or a mobile

phone) that we carry around. For this to work, interspatial

applications needs to be designed to have responsive user

interfaces, but also a suitable trust model to let users establish

secure connections to environmental hardware that is not

directly owned by them [14].

D. A Cross-Layer Unikernel Software Architecture

The three design principles for interspatial applications

programming interfaces (incremental networking, spatial in-

terfaces and use of native hardware) require cross-cutting

changes across traditional operating system and application

programming interfaces. What would a software solution to

operating embedded physical infrastructure might look like, if

designed with a clean slate in mind? The ﬁrst step to making

this a practical prospect is to adopt the discipline of deploying

unikernels [15] on the hardware, and replacing the traditional

layered OS stack.

Unikernels are specialised operating systems that are com-

piled together with application source code and conﬁguration,

resulting in a specialised binary that can boot in millisec-

onds [16] and eliminates traditional runtime layers in favour

of optimised build-time assembly. Unikernels were originally

developed from the concept of library operating systems [17]

and applied to cloud computing. We now ﬁnd that the same

approach is a perfect ﬁt to driving the world of resource-

constrained embedded hardware found in physical environ-

ments.

In a library operating system, the functionality that is

conventionally found in a monolithic operating system kernel

is broken out into software libraries that are available for use

by the application in exactly the same way as other higher-

level functionality currently is. All of the software libraries

are linked together with a small boot layer – this includes

traditionally kernel-based interfaces such as hardware device

drivers, which can now run in the same privilege level as the

application driving them. This model is ideal for embedded

devices, since it can result in direct low-latency and energy-

efﬁcient access to the hardware.

Crucially, the approach also allows the application to be

tailored to the operational model of the embedded hardware.

For example, the same application source code could be

compiled to a tiny embedded processor without an MMU,

or also be recompiled into a conventional Unix process. One

downside of this specialisation approach is that traditional

multi-user (e.g. UNIX processes) operation is more difﬁcult

in a single device once the unikernel has been deployed (we

address this in §IV-B). An upside is that the baseline operating

system can be highly stripped down and easily substituted –

for example, a conventional Linux kernel can be replaced by

a small, formally veriﬁed microkernel such as seL4 without

having to rewrite the rest of the application code [18].

The unikernel approach is thus important to the long term

sustainability of designing software for physical environments.

It lets us close the gap between the requirements of application

programming interfaces and the diverse requirements of the

embedded hardware. Instead of forcing vendors to squeeze

an ever-growing operating system stack for trivial tasks (such

as driving a lightswitch), adopting unikernel-based interfaces

means that the same uniﬁed software codebase can be tailored

across the variety of hardware that we can expect to see in a

typical situated environment in the next few decades.

IV. THE INTERSPATIAL OPERATING SYSTEM

We now discuss the design of a prototype interspatial

operating system based on a unikernel architecture, dubbed

OSMOSE. Its scope is to drive all the hardware in a single

physical space – such as a building – including the thousands

of sensors and actuators that may be present. As background

requirements, we assume that:

• we have a reliable physical topology available of the

situated environment. Details are out of scope for this

position paper – there are a number of indoor localisation

technologies available [19].

• users and devices have an out-of-band method to authen-

ticate to the operating system – this might be biometric

or face recognition or a password or hardware token [20],

but this has to happen as they enter the physical space.

• there is ample hardware available locally for applications

– for example an array of embedded devices in each

room – and the hardware is all connected via room-local

reliable wired networks, with wireless at the last hop only.

Since OSMOSE needs to run across a variety of hardware

with differing resource constraints, the traditional model of

booting a single image no longer holds. Instead of application

binaries, the input to the system is a collection of declarative

application fragments that represent processing logic and how

it ties together. This is then combined with physical topology

information and security policies to compile a set of unikernels

that are booted on the building’s hardware. This cycle of

compilation happens continuously and incrementally – as

applications are introduced or policies change, the unikernels

are recompiled in real-time to all the devices.

Applications do not communicate freely between each other

over the network with conventional TCP/IP – instead, the OS

sets up explicit channels that provide a high-level streaming

interface suited to the application needs (for example, a

low-bandwidth control channel vs a high-bandwidth video

processing ring buffer). The resulting set of communication

circuits is tracked as a dynamic dataﬂow graph, with the

building hardware representing nodes and the application logic

as edges. All communications in the system are dispatched

through a distributed storage system that immutably stores

interactions for audit and debugging purposes.

Figure 1 illustrates the overall architecture of OSMOSE. It

implements these components using the MirageOS

unikernel

framework, written in the OCaml programming language.

A. Declarative Layer

Applications are partitioned up into declarative source code

fragments that represent units of computation, and then sched-

uled together by via a structured event stream using the Capnp

RPC framework. This is becoming a popular architecture on

container-native computing infrastructures via the Serverless

movement [21], since it allows applications to run on dis-

tributed virtual machines that can be rescheduled depending on

See https://mirage.io

See http://capnproto.org

load and hardware availability. In an interspatial deployment,

the application logic is compiled to run directly on the phys-

ical hardware as a unikernel [22], with appropriate network

channels established depending on the available circuits (e.g.

a local point-to-point wireless link).

The services are composed together via the domain-speciﬁc

language of combinators provided by MirageOS. For example,

a voice recognition application can request an audio stream,

which can then be mapped onto a feature extraction module.

The application does not know the exact source of the audio

stream at this stage. In the case of our prototype design, we

use the MirageOS device driver model to deﬁne many of these

sources and sinks. The resulting programming model is event-

driven and reactive, and encourages the programmer to provide

ongoing incremental updates so that user interfaces can be

updated rapidly.

B. Builder and Artefact Layer

Once we have all the source code (in the form of application

logic, topology and conﬁguration code), we need to compile

it into executable unikernels that can run on the available

hardware. Unlike a conventional operating system that runs

on a single hardware host, the code is compiled into a

heterogenous set of bootable kernels that are specialised to

the various devices available in the cluster. For example, a

portion of the application logic may be compiled to run on

a GPU or FPGA, and another piece to run on an embedded

ARM CPU without an MMU. This is accomplished via an

incremental build service that can rapidly link and specialise

the source code into unikernels.

The build service tracks all of the source code that via a

branching datastore that is similar to Git. Every single line

of application, operating system and conﬁguration logic is

stored in the same place, and so the builder can easily track

precisely what code is running on the array of hardware. We

use the Irmin [23] datastore in our prototype design. Irmin is

a unikernel implementation of the Git model for MirageOS,

and provides the facility to handle complex distributed datas-

tructures just like source code.

Irmin is structured as a series of OCaml libraries that

expose increasingly complex storage functionality. Applica-

tions request the minimal functionality they need for their

purposes, and the appropriate storage layer is crafted from the

combination of requirements. For example, one application (a

button) might just need a key-value store, whereas another

(a projector) might want a read-only ﬁlesystem. Importantly,

all of the storage is backed by the same Merkle-tree-based

storage, granting us the ability to precisely track the behaviour

of every hardware node.

C. Scheduler Layer

The scheduler is responsible for triggering the builder, de-

ploying the resulting unikernels to hardware, and establishing

network circuits between devices. It receives an event stream

from the hardware (such as network topology changes, or

node failures). This layer is the most radical departure from a

Applications

Declarative

Runtime

Topology

Policies

Incremental Builder Event Stream

Branch Storage

Unikernels

Secure Enclave

Audit Log

Actuators

Sensors

CPUs

Network Flows

Scheduler

Fig. 1. Architecture of the interspatial operating system. The system is conﬁgured declaratively via a set of application source code, policies and network

topology, and is then incrementally built and deployed onto hardware. All communications happens via the Merkle-tree based branch storage.

conventional operating system model. Instead of a process-

based system, a deployment can be viewed as a graph of

distributed hardware nodes, through which event stream data

from the environment ﬂows.

Every coordination operation between nodes is piped

through the Irmin datastore, allowing the complete state of the

system to be tracked with strong provenance. We assume that

there is a large amount of persistent local storage available in

the deployment, but this can run in a separate processing node

from the low-power embedded hardware. The basic model for

the Irmin-based communication has been validated over the

past decade via a series of shared-memory implementations in

the Xen hypervisor [24]. In order to keep latency down despite

the use of Merkle-tree-based coordination, Irmin can expose

a shared-memory endpoint to each node, and local operations

between two nodes are asynchronously reconciled to a remote

store via a set of short-lived branches.

When applications need access to the outside world, they

do so via RPC calls to the scheduler. We use the Capnp

RPC framework for this purpose, which effectively acts as

the system call layer. Capnp features a very efﬁcient low-

level serialisation mechanism (important when communicating

between embedded devices), but also a compiler for protocol

schemas that integrates secure capabilities [25]. These capa-

bilities can be passed around nodes as opaque references and

used to authenticate access to different parts of the system.

Since a capability can only be generated and communicated

through the Irmin store, this means that tracing distributed

function calls through a deployment are available “for free”

by inspecting and reconstructing the storage.

Interspatial applications require a lot of immediate interac-

tion with the physical environment. This could involve reading

sensor input (e.g. gesture recognition, audio inputs, pressure

sensors) and actuating outputs to sensors (e.g. heating, light-

ing, speakers, displays). The scheduler layer thus maintains

multipath network connections that track a physical network

topology that describes the containment relationships present

in a physical environment – a chair is on a ﬂoor beside a

wall within a room that is on the ﬁrst ﬂoor of a building in

Cambridge in the United Kingdom.

As users interact with their environment, network connec-

tions are set up as they move in parallel with their actions. In

our earlier scenario, when the user ﬁrst enters the building they

are authenticated with the building systems and given a session

key that is used to subsequently track them. As they walk

towards the meeting room, their preferences are reconciled

with other participants. Once they enter the room, they are

allocated a network slot inside that room’s network, and their

voiceprint details uploaded to local embedded processors. Ev-

ery time a new participant enters, their collection of identities

are aggregated to generate temporary encryption keys for any

shared communication. Video and voice are encrypted in real-

time using these keys and saved to secure storage enclaves

within the walls, available for immediate processing by locally

connected computation. Once a participant leaves a space, their

session keys are deleted and the hardware resources such as

displays are freed, ready for re-use by other humans.

This design inverts the conventional model of establishing

on-demand connections to remote services with a model that

incrementally establishes nearby circuits for applications, with

local data processing and resource allocation done as the

humans move around the situated environment in real time.

Since connections are established incrementally as humans

move around, when an action actually has to be taken it can be

done so with a single-hop packet, thus minimising round-trip

times and error-prone connection establishment protocols that

would normally add latency variance to an interaction.

D. Hardware Layer

The unikernels stored in the artefact layer are regularly

booted on the various pieces of embedded hardware through-

out the deployment. There is no need for dynamic network

discovery since static topology information is baked into every

booting kernel. If the network environment changes, then the

scheduler will trigger a recompile via the branch store and

HTML Viewer

Citations

PDF

Open Access

More filters

Journal Article•DOI•

Spatial Name System

[...]

Ryan Thomas Gibb

10 Oct 2022-arXiv.org

TL;DR: The Spatial Name System is proposed, an alternative network architecture that relies on the innate physicality of this paradigm, which allows us to identify devices locally based on their physical presence.

...read moreread less

Abstract: The development of emerging classes of hardware such as Internet of Thing devices and Augmented Reality headsets has outpaced the development of Internet infrastructure. We identify problems with latency, security and privacy in the global hierarchical distributed Domain Name System. To remedy this, we propose the Spatial Name System, an alternative network architecture that relies on the innate physicality of this paradigm. Utilizing a device’s pre-existing unique identifier, its location, allows us to identify devices locally based on their physical presence. A naming system tailored to the physical world for ubiquitous computing can enable reliable, low latency, secure and private communication.

...read moreread less

An architecture for interspatial communication

Summary (4 min read)

I. INTRODUCTION

II. THE PROBLEM WITH EXISTING MOBILE SOFTWARE

A. Reliability

B. Security

C. Latency

III. TOWARDS INTERSPATIAL APPLICATIONS

A. Incremental Networking

B. Spatial Interfaces

C. Native Hardware

D. A Cross-Layer Unikernel Software Architecture

IV. THE INTERSPATIAL OPERATING SYSTEM

Actuators

A. An Example Deployment

B. Security and Privacy

VI. CONCLUSION

Figures (2)

Citations

References

"An architecture for interspatial co..." refers background in this paper

Additional excerpts

"An architecture for interspatial co..." refers background in this paper

Related Papers (5)