Dependable Network Data Plane for the Cloud
Dependable Network Data Plane for the Cloud
Disciplines
Electrical Engineering, Electronics, Information Engineering (80%); Computer Sciences (20%)
Keywords
-
Communication networks,
Network Virtualization,
Software-Defined Networks,
Dependability,
Reliability,
Algorithms
With the advent of new data-centric services related to entertainment, business, health, etc., the number of applications, the size of the user base, and the amount of applications deployed in the modern Cloud is growing explosively. While users of Cloud services benefit from high flexibility and low infrastructure cost, they inevitably accept the sharing of resources, facilities, and infrastructure with other users. The consequences of this, often undesired fate-sharing, are contradictory: on the one hand, it is a key to the apparent business success of the multi-tenant cloud, on the other hand it raises significant reliability and security concerns. What if a malevolent user steals, spoofs, or tampers with the data of another user with whom they share, e.g., the same server in the Cloud data center? What if, instead of directly attacking the other user, bad actors merely run an adverse workload on their resource slice that will interfere with the service provision of innocent Cloud users, e.g., increases the time to respond to web requests to a time frame that will be intolerably slow for an enjoyable customer experience? Unfortunately, this type of inadvertent or intended breaking of the data and performance isolation between users is clearly possible in the Cloud, as demonstrated by a set of recent high-profile incidents. What is worse, it is enough to have only a single malevolent user of a data center to cause massive collateral damage to an entire fleet of victims. The reason is that it is not just physical resources, like servers, CPUs, memory, storage, and network devices that are shared between users, but also a set of much less tangible assets as well, namely, algorithms and data-structures and the state embodied by these "virtual" artifacts. Unfortunately, in order to facilitate the unprecedented programmability and sharing provided by the Cloud, the algorithms and data-structures applied there are often fairly complex and not always designed with availability, reliability, security, and performability (i.e., dependability), in mind. The objective of this project is to chart a comprehensive algorithmic landscape of the "trustworthiness" (dependability) and the isolation properties of the typical algorithms and data structures applied in the Cloud, and to design new algorithms and data-structures with dependability as a critical feature rather than an afterthought.
While users of cloud services benefit from high flexibility and low infrastructure costs, they inevitably accept the sharing of resources, facilities, and infrastructure with other users. The consequences of this often undesirable "shared fate" are contradictory: on the one hand, it is a key to the obvious business success of the multi-tenant cloud, but on the other hand, it raises significant questions about reliability and security. The DELTA project focuses particularly on the cloud network, which has recently become reconfigurable and programmable in a completely new way. This enables the development of new algorithmic tools and brings great flexibility to network operations. However, the complex algorithmic infrastructure is often shared among multiple users. An unintentional error, a misconfigured network element, or a malicious attack can easily cause widespread damage. Indeed, network algorithms can become targets of novel attacks, such as algorithmic complexity attacks that exploit specific computationally intensive operations to impair network performance and availability. For example, it has been shown that the performance of network algorithms can suffer from attacks like the "Tuple Space Explosion," and that attacks on the packet parser of a virtual switch can even allow an attacker to take over an entire data center. DELTA designs a comprehensive picture of the algorithmic possibilities and reliability aspects of the cloud network data plane in general, as well as the underlying algorithms, data structures, and isolation properties in particular. It examines how and to what extent more adaptable and "self-adjusting" algorithms in the data plane can improve network performance and availability, as well as support parallelization and scaling. Furthermore, it demonstrates how performance isolation can be achieved, enabling more predictable and reliable operation despite shared resource usage. For example, DELTA contributes self-adjusting data structures for firewalls in the network data plane, improving their performance and availability. Another example is self-adjusting Bloom filters, which can help prevent denial-of-service attacks on specific applications. Overall, we are very pleased with the success of this project. We had an excellent team of postdocs, doctoral students, and students who were able to benefit from the project and publish outstanding research results. We sincerely thank the FWF for its support.
Research Output
- 63 Citations
- 21 Publications
- 1 Datasets & models
-
2021
Title The Programmable Data Plane: Abstractions, Architectures, Algorithms, and Applications DOI 10.48550/arxiv.2110.00631 Type Other Author Bifulco R Link Publication -
2023
Title Supercharge WebRTC: Accelerate TURN Services with eBPF/XDP DOI 10.1145/3609021.3609296 Type Conference Proceeding Abstract Author Kreith B Pages 70-76 -
2023
Title Dynamic Maintenance of Monotone Dynamic Programs and Applications DOI 10.48550/arxiv.2301.01744 Type Preprint Author Henzinger M Link Publication -
2022
Title Compiling packet programs to dRMT switches DOI 10.1145/3565475.3569080 Type Conference Proceeding Abstract Author Fraknói Á Pages 26-32 -
2022
Title Domain specific run time optimization for software data planes DOI 10.1145/3503222.3507769 Type Conference Proceeding Abstract Author Miano S Pages 1148-1164 -
2022
Title Data Plane Cooperative Caching With Dependencies DOI 10.1109/tnsm.2021.3132275 Type Journal Article Author Kulik A Journal IEEE Transactions on Network and Service Management -
2022
Title Programmable Packet Scheduling With SP-PIFO: Theory, Algorithms and Evaluation DOI 10.1109/infocomwkshps54753.2022.9798055 Type Conference Proceeding Abstract Author Sarkadi C Pages 1-6 -
2022
Title Embedding and batch-scheduling data flow graphs in software switches Type PhD Thesis Author Tamás Lévai -
2022
Title Modeling and enumerating geographically correlated failure events in communication networks Type PhD Thesis Author Vass Balázs -
2024
Title Learning Minimum Linear Arrangement of Cliques and Lines DOI 10.1109/icdcs60910.2024.00025 Type Conference Proceeding Abstract Author Dallot J Pages 175-185 -
2024
Title Dependency-Aware Online Caching DOI 10.1109/infocom52122.2024.10621422 Type Conference Proceeding Abstract Author Dallot J Pages 871-880 -
2024
Title Adaptive Protocols and Reconfigurable Topologies for High Performance Datacenter Networks Type PhD Thesis Author Vamsi Addanki -
2024
Title Morpheus: A Run Time Compiler and Optimizer for Software Data Planes DOI 10.1109/tnet.2023.3346286 Type Journal Article Author Miano S Journal IEEE/ACM Transactions on Networking -
2023
Title RIFO: Pushing the Efficiency of Programmable Packet Schedulers DOI 10.48550/arxiv.2308.07442 Type Other Author Mostafaei H Link Publication -
2023
Title Dynamic Maintenance of Monotone Dynamic Programs and Applications DOI 10.4230/lipics.stacs.2023.36 Type Conference Proceeding Abstract Author Henzinger M Conference LIPIcs, Volume 254, STACS 2023 Pages 36:1 - 36:16 Link Publication -
2020
Title Compiling Packet Programs to Reconfigurable Switches DOI 10.1145/3426744.3431332 Type Conference Proceeding Abstract Author Bérczi-Kovács E Pages 28-35 -
2024
Title Dependency-Aware Online Caching DOI 10.48550/arxiv.2401.17146 Type Preprint Author Dallot J Link Publication -
2021
Title A Survey of Fast-Recovery Mechanisms in Packet-Switched Networks DOI 10.1109/comst.2021.3063980 Type Journal Article Author Chiesa M Journal IEEE Communications Surveys & Tutorials Pages 1253-1301 Link Publication -
2023
Title Self-Adjusting Partially Ordered Lists DOI 10.1109/infocom53939.2023.10228937 Type Conference Proceeding Abstract Author Addanki V Pages 1-10 -
2023
Title Sinkless Orientation Made Simple; In: Symposium on Simplicity in Algorithms (SOSA) DOI 10.1137/1.9781611977585.ch17 Type Book Chapter Publisher Society for Industrial and Applied Mathematics -
2025
Title Everything matters in programmable packet scheduling Type Conference Proceeding Abstract Author Albert Gran Alcoz Conference USENIX NSDI Link Publication