Hosted at

33^rd International Conference
on Massive Storage Systems
and Technology (MSST 2017)
May 15 — 19, 2017

Technically Co-
Sponsored by

Hosted at
Santa Clara University
Santa Clara, CA

StorageNewsletter:	2017 Recap
insideHPC:	Article, Interview, Article
HPCWire:	Article

2017 Conference

MSST (2017), as is our custom, dedicated five days to computer-storage technology, including a day of tutorials, two days of invited papers, two days of peer-reviewed research papers, and a vendor exposition. The conference was held, once again, on the beautiful campus of Santa Clara University, in the heart of Silicon Valley.

Many Thanks to Our Sponsors!

2017 Program

9:00am — 5:00pm
(Breakfast 7:30 — 9:00, Lunch 12:30 — 1:30)

Instructors: Sean Roberts (bio) and Stefano Maffulli (bio)

With over 2000 developers from 130 different companies worldwide, OpenStack is one of the largest collaborative software-development projects. Because of its size, it is characterized by a huge diversity in social norms and technical conventions. These can significantly slow down the speed at which changes by newcomers are integrated in the open source project.

We've designed a training program to accelerate the speed at which new developers are successful at integrating their own roadmap into that of the open source project. We have taken a slice of an existing successful two-day training program and broken out the session dealing with development interaction. This seven-hour live class teaches students to navigate the intricacies of a project's technical teams and social interactions using Legos. It is a lot of fun and very informative to the way upstream development teams, companies, and individual technical contributors behave and react to milestones. For more background, read:

https://sarob.com/2017/02/open-source-first-project-product

7:30 — 8:30 Registration / Breakfast

Emerging Open Source Storage System Design for Hyperscale Computing

An Update on MarFS in Production

(Slides, Video)

David Bonnie, Los Alamos National Laboratory (bio)

With MarFS in production at LANL since fall 2016, we have gained new insights, learned lessons, and expanded our future plans. We'll discuss the various hurdles required to deploy such an ambitious system with minimal manpower. Further, we'll delve into the challenges, triumphs, and defeats on the road to a new tier of inexpensive scalable storage.

Bridging Big - Small, Fast - Slow with Campaign Storage

(Slides, Video)

Peter Braam, Campaign Storage, LLC (bio)

Economic considerations and technology developments are necessitating widely usable tiered storage. Untroubled by the worries of transparency and performance, Campaign Storage—invented at Los Alamos National Laboratory—offers radical revisions of old workflows and adapts to new technologies. But it also leverages widely available technologies and interfaces to offer stability from the ground up and blend in with the past. We'll discuss how a simple combination of components can support scalability, data analytics and efficient integration with memory based storage.

Limits of Hard Drives for Long-Term Storage

Behind the Curtain of Backblaze Hard Drive Stats

(Slides, Video)

Andrew Klein, , Backblaze (bio)

For the last four years, Backblaze has collected and reported on the failure rates and SMART stats of the hard drives in use in our data centers. Currently we have over 80,000 drives ranging from 3 to 8TB. Let's take a look at what we learned over the years about hard drives, including failure rates, by model, and the ability to predict drive failure before it happens.

12:30 — 1:30 Lunch

(Linda Kempster video)

Leveraging Compression, Encryption, and Erasure Coding Chip
Hardware Support to Construct Extreme-Scale Storage Systems

SPARC Chip Support for Compression, Encryption, and SQL

(Slides, Video)

Stephen Phillips, Oracle SPARC (bio)

On August 12, 2014, Oracle announced the SPARC M7 processor, highlighting an architecture advancement the company refers to as "Software in Silicon". Putting software onto the computer chip (Software in Silicon) is a revolutionary new technology which fundamentally changes the way computer systems are built in the future. Oracle processor engineers worked closely with Oracle's software developers, in particular its database experts, in hard-wiring capabilities specific to application processes and performance onto the processor—hence the "Software in Silicon" moniker.

Storage acceleration with ISA-L

(Slides, Video)

Greg Tucker, Intel

As distributed storage adds advanced features such as erasure coding, dedup, compression and encryption, the computational requirements can limit performance. ISA-L is an optimized software library for storage algorithms intended to maximize efficiency by targeting the highest cycle-per-byte functions in modern storage systems.

Building High Speed Erasure Coding Libraries for ARM and x86 Processors

(Slides, Video)

Per Simonsen, Memoscale (bio)

Library optimizations as well as development of new erasure coding algorithms have been keys to unlocking higher levels of erasure coding performance. Learn about performance improvements achieved on ARM and x86 processors.

Panel — The Limits of Open Source in Extreme-Scale Storage Systems Design

(Video)

Peter Braam, Campaign Storage, LLC (bio)

Matthew O'Keefe, Oracle (bio)

Sean Roberts, OpenStack Consortium (bio)

Michael Declerck, Oracle

Building Extreme-Scale SQL and NoSQL Processing Environments

The Benefits of Hardware-Software Co-Design/Convergence for Large-Scale Enterprise Workloads

(Slides, Video)

Michael Palmeter, Oracle

Exadata: Design of an Extreme-Scale SQL Processing System

(Slides, Video)

Matthew O'Keefe, Oracle (bio)

SQL is the language for data in business and many science applications today. In this talk, we will outline Oracle’s Exadata design and architecture and how it achieves very high performance, capacity, and resilience in production.

Design Decisions and Trade-offs in Apache Accumulo

(Slides, Video)

Aaron Cordova, Koverse (bio)

NoSQL databases make some strong assertions about reasons for deviating from the conventional relational model. In this talk Aaron Cordova will present the design decisions and trade-offs made in building Apache Accumulo, a highly scalable open source implementation of Google's BigTable and will discuss how these decisions have enabled Accumulo to achieve extreme scalability in several dimensions.

Short Talks

Attendees and vendors can sign up in advance, or at the conference, to give 5-15 minute
works-in-progress or summary updates on work of interest to conference attendees.

Fragmentation at Low Utilization in the Lustre File System

(Slides, Video)

John Kaitschuck, Seagate

Designing and Managing Large, Long-Lived Archives Efficiently

(Slides, Video)

Donna Harland, Oracle Optimized Solutions
Donna Shawhan, Oracle StorageTek Archive Solutions

Discover how to easily design and manage your large archives for the long term. This short talk will highlight the key components and features that are essential for a secure, future-proof long-lived archive. Understand how to leverage and scale the optimal mix of storage, ranging from flash to disk to tape to cloud. Get tips on how to architect an environment that provides you easy access to and infinite retention for your data, no matter where it is stored or how old.

A Virtual Object Store and Yet Another HSM

(Slides, Video)

Igor Sfiligoi, General Atomics

7:30 — 8:30 Breakfast

(Day Two Intro Video)

Keynote

Memory Driven Computing

(Slides, Video)

Dr. Kimberly Keeton, Hewlett Packard Enterprise (bio)

Data growth and data analytics requirements are outpacing the compute and storage technologies that have provided the foundation of processor-driven architectures for the last five decades. This divergence requires a deep rethinking of how we build systems, and points towards a memory-driven architecture, where memory is the key resource and everything else, including processing, revolves around it.

Memory-driven computing (MDC) brings together byte-addressable persistent memory, a fast memory fabric, task-specific processing, and a new software stack to address these data growth and analysis challenges. At Hewlett Packard Labs, we are exploring MDC hardware and software design through The Machine. This talk will review the trends that motivate MDC, illustrate how MDC benefits applications, provide highlights from our Machine-related work in data management and programming models, and outline challenges that MDC presents for the storage community.

Storage Innovation in Extreme-Scale HPC Data Centers

Parallel I/O at NERSC: Today and Tomorrow

(Slides, Video)

Glenn K. Lockwood, NERSC

Evolving Storage and Cyber Infrastructure at the NASA Center for Climate Simulation

(Slides, Video)

Ellen Salmon, NASA (bio)

This talk will describe recent developments at the NASA Center for Climate Simulation, which is funded by NASA’s Science Mission Directorate, and supports the specialized data storage and computational needs of weather, ocean, and climate researchers, as well as astrophysicists, heliophysicists, and planetary scientists. To meet requirements for higher-resolution, higher-fidelity simulations, the NCCS augments its High Performance Computing (HPC) and storage/retrieval environment. As the petabytes of model and observational data grow, the NCCS is broadening data services offerings and deploying and expanding virtualization resources for high performance analytics.

Storage Development at CERN

(Slides, Video)

Dr. Michal Simon, CERN (bio)

The storage group in CERN’s IT department provides coherent storage services for the physics community at CERN, including the the experiments at the Large Hadron Collider. In this talk we give a status update on the storage technologies, workloads, and storage analytics at CERN. Moreover, we will discuss the strategic developments like a new archive backend for our EOS disk pools, a RAFT based implementation of EOS namespace and latest security enhancements to our data access protocol (xroot).

Panel — How Extreme-Scale HPC Data Centers Can Leverage Public Cloud for
Computing and Storage

(Video)

Ellen Salmon, NASA

Dr. Michal Simon, CERN

Supporting Extreme-Scale Name Spaces with NAS Technology

ZFS for Extreme-Scale NAS

(Slides, Video)

Jason Schaffer, Oracle

Improving Network Attached Storage

(Slides)

Jim Finlayson, Department of Defense

Scaling NAS filer technology to extreme-scale requires a mix of hardware and software as well as intelligent system design. In this talk, we outline one approach to scaling NAS name spaces.

Lightning-fast File Operations for Extreme Scale Name Spaces:
Techniques for Applying Structure to Unstructured Data

(Slides, Video)

Mitch Crane, Cloudtenna

Mitch will illustrate the power of separating content and metadata. When metadata is extracted into a database, the index powers web-scale file access, search, and audit in ways not before possible.

12:30 — 1:30 Lunch

Storage System Designs Leveraging Hardware Support

Tiered Erasure - When Flat Doesn't Fit

(Slides, Video)

David Bonnie, Los Alamos National Laboratory (bio)

Storage systems continue to demand the seemingly impossible triad: faster, cheaper, and more reliable. As systems scale up, all three become increasingly hard to balance, with reliability generally taking the back seat. While flat protection schemas work well for small systems, they all compromise too much of the triad at the tens to hundreds of petabyte scale. This discussion will focus on the genesis of the tiered erasure system used in MarFS and how it leverages hardware accelerated erasure to implement a fast, safe, and efficient storage paradigm.

Accelerating Ceph data services with Intel QuickAssist Technology and ISA-L

(Slides, Video)

Greg Tucker, Intel

Ceph is an open-source, unified, distributed storage system designed with scale in mind. Ceph's performance doesn't suffer as your data storage grows, which makes it a good fit for Big Data projects. Ceph community has recently been focused on adding key enterprise features such as erasure coding, compression and encryption. In this presentation, we’ll talk about how Intel QuickAssist and ISA-L based offloads can help accelerate these functions in Ceph.

How Can Extreme-Scale Storage Systems Support Containerization?

Unsolved Storage Issues in Linux Container Interfaces

(Video)

Dr. James Bottomley, IBM (bio)

With the addition of the superblock namespace (essentially a user namespace for the kernel to filesystem boundary) much of the stage is now set for fixing one of the biggest underlying container problems: that of translating unprivileged container writes into real filesystem uid/gids. This talk will examine how this system works, why it is necessary and what solutions have been proposed so far, how the upstream discussions are going and what still need to be added for orchestration systems to make use of it.

Learning from ZFS to Scale Storage on and under Containers

(Slides, Video)

Evan Powell, Entrepreneur and Hacker (bio)

What is so new about the container environment that a new class of storage software is emerging to address these use cases? And can container orchestration systems themselves be part of the solution? As is often the case in storage, metadata matters here. We are implementing in the open source OpenEBS.io some approaches that are in some regards inspired by ZFS to enable much more efficient scale out block storage for containers that itself is containerized. The goal is to enable storage to be treated in many regards as just another application while, of course, also providing storage services to stateful applications in the environment.

Big Software-RAID Storage in Zoned Virtual Environments

(Slides, Video)

Scott Sinno, NASA (bio)

This presentation will describe the evolution and growth of the NCCS's flagship virtualization project known as "ADAPT". The ADAPT system is a KVM/QEMU-based virtualized environment hosting 11 PB of highly cost-effective disk-storage in the form of JBODS, heavily leveraging Linux's "mdadm" software-based RAID for data-integrity and reliability. The ADAPT environment enforces a true zoned architecture, such that nodes within a zone have no visibility whatsoever to network or storage resources in other zones. This is accomplished by providing each zone it's own distinct set of virtualized fileservers which access their media as logical block devices presented by their hypervisors.

Trends in Non-Volatile Media

Basic Principles and Challenges of STT-MRAM for Embedded Memory Applications

(Slides, Video)

Luc Thomas, Headway (bio)

Spin-Transfer-Torque Magnetic Random Access Memory (STT-MRAM) is emerging as a leading candidate for a variety of embedded memory applications ranging from embedded NVM to working memory and last level cache. In this talk, we will discuss the basic principles of STT-MRAM, as well as recent advances that bring perpendicular STT-MRAM closer to mass production. We will also address the specific challenges facing STT-MRAM for standalone and embedded applications, and its place in the emerging NVM landscape.

Persistent Memory Programming: The Current State of the Ecosystem

(Slides, Video)

Andy Rudoff, Intel (bio)

In this presentation, Andy will report on the latest developments around persistent memory programming. He’ll describing current discussions in the SNIA NVM Programming Technical Work Group, the current state of operating system support, recent tool and library development, and finally he’ll describe some of the upcoming challenges for high performance persistent memory use.

NOVA: A High-Performance, Hardened File System for Non-Volatile Main Memories

(Slides)

Dr. Steven Swanson, University of California, San Diego (bio)

Fast non-volatile memories (NVMs) will soon usher in a new era of high-performance storage. Managing, protecting, accessing, and maintaining consistency for data stored in NVM file systems raises a host of challenges that neither conventional block-oriented file systems nor existing NVM file systems address. I will describe NOVA, an open-source file system that provides the higher performance, better reliability, and stronger consistency than any available NVMM file system. NOVA adapts conventional log-structured file system techniques to exploit the fast random access that NVM provides. It uses per-file logs to provide fast, atomic file operations, uses checksums and ECC to protect data and metadata from media errors and software bugs, and leverages DRAM to to maximize performance. Experimental results show that in write-intensive workloads, NOVA provides 22% to 216x throughput improvement compared to state-of-the-art file systems, and 3.1x to 13.5x improvement compared to file systems that provide equally strong data consistency guarantees.

Short Talks

Attendees and vendors can sign up in advance, or at the conference, to give 5-15 minute
works-in-progress or summary updates on work of interest to conference attendees.

A Sideband Database for HPC and Archival Storage Systems Supporting Billions of Files

(Slides, Video)

Jacob Farmer, Starfish

6:00 — 8:00 Reception

(* Indicates Presenter)

7:30 — 8:30 Registration / Breakfast

8:30 — 10:30 Deduplication

(session chair: Yu Hua, Yu Hua, Huazhong University of Science & Technology)

Experiences with a Distributed Deduplication API

2017 Organizers
Conference Chair	Dr. Sam Coleman
Tutorial Chair	Sean Roberts
Program Chair	Dr. Matthew O'Keefe
Research General Chair	Dr. Ahmed Amer
Research Program Chairs	Dr. Thomas Schwarz Dr. Aleatha Parker-Wood
Research Track Program Committee
SCU Arrangements	Dr. Ahmed Amer
Industry Chair	Dr. James Reaney
Communications Chair	Meghan Wingate McClelland
Registration Chairs	Dr. JoAnne Holliday, Yi Fang

Hosted at

Sponsored by Santa Clara University, School of Engineering

Technically Co- Sponsored by

2017 Conference

Many Thanks to Our Sponsors!

2017 Program

Sponsored by Santa Clara University,
School of Engineering

Technically Co-
Sponsored by