High performance cloud computing

Placeholder Show Content

Abstract/Contents

Abstract
Running fast computations at a large scale increasingly stresses task scheduling performance. As CPU performance becomes the performance bottleneck, it is common to optimize CPU performance, further parallelize computations by using more nodes, and split computations into more finely-grained tasks. As a result, a cloud framework running an application on thousands of cores must schedule hundreds of thousands of tasks per second. A framework must be also able to dynamically reschedule tasks in order to achieve high CPU utilization in the presence of an unbalanced load, stragglers, and node failures. Today, systems must choose either high task throughput or dynamic, centralized scheduling: no existing system can provide both. This dissertation proposes a data-centric asynchronous control plane, which schedules hundreds of thousands of tasks per second while retaining a central interface to dynamically reschedule tasks. In this design, a central controller decides how to redistribute work, but workers locally decide between themselves when to redistribute. The controller avoids issuing per-task scheduling decisions by using data placement as the control signal for load distribution. So this design provides a declarative, central, and asynchronous load control mechanism without limiting task throughput. This control plane depends on a new program abstraction, called task recipes, that enables workers to locally schedule tasks and asynchronously migrate data and computation. This dissertation implements the design in a from-scratch in-memory cloud framework called Canary and evaluates its performance on data analytics and machine learning benchmarks. Furthermore, this dissertation examines how applications benefit from the design by using fluid simulation as a case study, and it proposes a new load-balancing algorithm in this context.

Description

Type of resource text
Form electronic resource; remote; computer; online resource
Extent 1 online resource.
Place California
Place [Stanford, California]
Publisher [Stanford University]
Copyright date 2018; ©2018
Publication date 2018; 2018
Issuance monographic
Language English

Creators/Contributors

Author Qu, Hang
Degree supervisor Levis, Philip
Thesis advisor Levis, Philip
Thesis advisor Aiken, Alexander
Thesis advisor Rosenblum, Mendel
Degree committee member Aiken, Alexander
Degree committee member Rosenblum, Mendel
Associated with Stanford University, Department of Electrical Engineering.

Subjects

Genre Theses
Genre Text

Bibliographic information

Statement of responsibility Hang Qu.
Note Submitted to the Department of Electrical Engineering.
Thesis Thesis Ph.D. Stanford University 2018.
Location electronic resource

Access conditions

Copyright
© 2018 by Hang Qu
License
This work is licensed under a Creative Commons Attribution Non Commercial 3.0 Unported license (CC BY-NC).

Also listed in

Loading usage metrics...