Advanced Distributed Systems

Fall 2019

General Course Information:

Instructor: Lakshmish Ramaswamy (laks[AT]cs[dot]uga[dot]edu, 706-542-2737)
 
Time and Venue: Wednesdays - 11:15 AM to 12:05 PM (Barrow 205); Tuesdays & Thursdays - 11:00 AM to 12:15 PM (Boyd 326)
 
Office Hours: Tuesdays - 12:15 PM to 1:00 PM and Wednesdays - 10:30 AM to 11:15 AM (Tentative) or by appointment

Course Description:

Distributed systems have become pervasive and are having a tremendous impact on many domains of human activity. Today's distributed systems range from IoT devices to massive cloud infrastructures comprising highly powerful servers. Research in distributed systems has focused on achieving better performance, reliability, security, and privacy for various kinds of distributed data processing applications.

In this course we will discuss advanced topics in distributed system principles, including Distributed Hash Tables, the Paxos Protocol, and the CAP Theorem. Furthermore, we will study the design, implementation and evaluation of a wide class of distributed systems and applications, including distributed file systems, cloud and edge computing, Internet of Things, pervasive applications and blockchains, with the objective of gaining an in-depth understanding of the requirements and the design options.

Grading Policy (Tentative)

Course Materials (Tentative -- Will be modified during the course of the semester)

A. Advanced Topics on Distributed System Principles (Lakshmish)
  1. Material from the book "Distributed Systems: Principles and Paradigms" by Tanenbaum and Van Steen, 2nd Ed. Pearson Prentice Hall.**
  2. I. Stoica, R. Morris, D. Liben-Nowell, D. Karger, M. Kaashoek, F. Dabek and H. Balakrishnan, "Chord: A Scalable Peer-to-peer Lookup Protocol for Internet Applications", IEEE/ACM Transactions on Networking, 11(1), 2003.**
  3. A. Broder and M. Mitzenmacher, "Network Applications of Bloom Filters: A Survey", Internet Mathematics, 1(4), 2004.**
  4. L. Lamport, "Paxos Made Simple", 2001.**
  5. D. Mazieres, "Paxos Made Practical".**
B. Distributed File Systems and Lock Services
  1. Material from the book "Distributed Systems: Principles and Paradigms" by Tanenbaum and Van Steen, 2nd Ed. Pearson Prentice Hall.**
  2. S. Ghemawat, H. Gobioff and S-T Leung, "The Google File System", SOSP 2003.
  3. M. Burrows, "The Chubby Lock Service for Loosely-Coupled Distributed Systems", OSDI 2006.
  4. S. A. Weil et al., "Ceph: A Scalable, High-Performance Distributed File System", OSDI 2006.
  5. K. Shvachko et al., "The Hadoop Distributed File System", MSST 2010.
  6. D. Ford et al., "Availability in Globally Distributed Storage Systems", OSDI 2010.
  7. B. Calder et al., "Windows Azure Storage: A Highly Available Cloud Storage Service with Strong Consistency", SOSP 2011.
  8. A. Thomson and D. J. Abadi, "CalvinFS: Consistent WAN Replication and Scalable Metadata Management for Distributed File Systems", USENIX FAST 2015.
C. Cloud Computing, Cluster Computing, Containers
  1. Materials from the book "Distributed and Cloud Computing: From Parallel Processing to the Internet of Things", by K. Hwang, G. C. Fox and J. J. Dongarra, Elsevier & Morgan Kaufmann (Lakshmish)
  2. M. Armbrust et al., "Above the Clouds: A Berkeley View of Cloud Computing", Technical Report, University of California, Berkeley, 2009.
  3. B. Burns et al., "Borg, Omega, and Kubernetes: Lessons Learned from Three Container-Management Systems Over a Decade".
  4. B. Hindman et al., "Mesos: A Platform for Fine-Grained Resource Sharing in the Data Center", NSDI 2011.
  5. M. Schwarzkopf et al., "Omega: Flexible, Scalable Schedulers for Large Compute Clusters", EuroSys 2013.
  6. E. Boutin et al. "Apollo: Scalable and Coordinated Scheduling for Cloud-Scale Computing", OSDI 2014.
  7. M. Li et al. "Scaling Distributed Machine Learning with the Parameter Server", OSDI 2014.
  8. A. Verma et al., "Large Scale Cluster Management at Google with Borg", EuroSys 2015.
  9. E. Jonas et al., "Occupy the Cloud: Distributed Computing for the 99%", SoCC 2017.
  10. A. Martin et al., "Docker Ecosystem -- Vulnerability Analysis", Computer Communications, June 2018 (Not for presentation).
D. Edge/Fog Computing
  1. M. Satyanarayanan et al., "The Emergence of Edge Computing", IEEE Computer Volume 50, Issue 1, Jan 2017 (Lakshmish)
  2. L. Ma et al., "Efficient Service Handoff Across Edge Servers via Docker Container Migration", ACM/IEEE SEC 2017.
  3. L. Chaufournier et al., "Fast Transparent Virtual Machine Migration in Distributed Edge Clouds", ACM/IEEE SEC 2017.
  4. C. Streiffer et al., "ePrivateEye: To the Edge and Beyond", ACM/IEEE SEC 2017.
  5. Z. Feng et al., "Edge-based Discovery of Training Data for Machine Learning", ACM/IEEE SEC 2018.
  6. J. Wang et al., "Bandwidth-efficient Live Video Analytics for Drones via Edge Computing", ACM/IEEE SEC 2018.
  7. J. Gedeon et al., "From Cell Towers to Smart Street Lamps: Placing Cloudlets on Existing Urban Infrastructures", ACM/IEEE SEC 2018.
  8. D. Zhang et al., "Cooperative-Competitive Task Allocation in Edge Computing for Delay-Sensitive Social Sensing", ACM/IEEE SEC 2018.
E. IoT, Smart Sensing and Applications
  1. J. Gubbi et al., "Internet of Things (IoT): A Vision, Architectural Elements and Future Directions", Future Generation Computer Systems, 2013.
  2. J. A. Stankovic, "Research Directions for the Internet of Things", IEEE Internet of Things Journal, February 2014 (Not for presentation).
  3. S. Sicari, et al., "Security, Privacy and Trust in Internet of Things: The Road Ahead", Elsevier Computer Networks, 2015.
  4. S. M. R. Islam et al., "The Internet of Things for Healthcare: A Comprehensive Survey", IEEE Access, June 2015.
  5. C. Perera et al., "Sensing as a Service Model for Smart Cities Supported by Internet of Things", Transactions on Emerging Telecommunications Technologies, 2013.
  6. J. Adkins et al., "The Signpost Platform for City-Scale Sensing", IPSN 2018.
  7. J. Liu et al., "A Survey of Mobile Crowdsourcing Techniques: A Critical Component for the Internet of Things", ACM Transactions on Cyber-Physical Systems, July 2018.
  8. S. Yao et al., "FastDeepIoT: Towards Understanding and Optimizing Neural Network Execution Time on Mobile and Embedded Devices", SenSys 2018.
  9. Z. Qin et al., "EXIMUS: A Measurement Framework for Explicit and Implicit Urban Traffic Sensing", SenSys 2018.
  10. A. Srivastava et al., "CamForensics: Understanding Visual Privacy Leaks in the Wild", SenSys 2017.
F. Blockchains and their Applications

Presentation Slides

Available on ELC.

Programming Projects



Miscellaneous Materials