2016 Conference
    
  
  MSST (2016), as is our custom, dedicated five days to computer-storage technology, including a day of tutorials, two days of invited papers, two days of peer-reviewed research papers, and a vendor exposition. The conference was held, once again, on the beautiful campus of Santa Clara University, in the heart of Silicon Valley.
  
  
  
  
  
  
  
  
  
   
    Many Thanks to Our Sponsors!
    
    
   
  
  
  
    
2016 Program
  
  
  7:30 — 8:30 Registration / Breakfast
  8:30 — 5:00 Swiftstack Swift Tutorial / Lunch
  Albert Chen, Systems Engineer, Swiftstack
  Chris Nelson, Director of Systems Engineering, Swiftstack
  Abstract: In this full-day workshop, you'll learn why the largest public clouds and internet properties, including Wikipedia, IBM Softlayer, Rackspace, and HP are using OpenStack Swift to deliver massively scalable, multi-geographically distributed storage for their customers. You'll also learn how to get up and running with a Swift Cluster. 
    
    In this hands-on lab, we'll cover:
    
      - An overview on Swift Fundamentals
- How easy it is to consume storage (for mega-scale!) while improving on throughput Architecture Overview
- Use Cases and Integration
- Selecting and Optimizing Hardware
- Hands-on deploying and interacting with a Swift cluster
    More specifically, you'll leave understanding exactly how to deploy and manage a Swiftcluster, what use cases are the best fit for OpenStack Swift's object storage and how easy it is to build apps using Swift and consume their assets (videos, images, docs, pdfs, etc...). 
    
    Prerequisites:
    Attendees will be given access to their own remote linux instances for the hands-on lab. Please make sure you have the following for the tutorial session:
    
      - Laptop with SSH capabilities
- Chrome, Safari, or Firefox installed
 
  
  
  
  7:30 — 8:30 Registration / Breakfast
  8:30 — Keynote
  Scalable High Performance FLASH Systems (
Slides, 
Video)
Jeff Bonwick, EMC DSSD
  
    
    MSST 2016 features keynote speaker Jeff Bonwick, co-founder and CTO of DSSD, where he co-invented both the system hardware architecture and the Flood software stack. His talk will focus on extracting maximum performance from flash at scale. Jeff has a long history of developing at-scale storage starting with leading the team that developed the ZFS filesystem, which powers Oracle’s ZFS storage line as well as numerous startups including Nexenta, Delphix, Joyent, and Datto.  Jeff is also the inventor of the slab allocator, which is used in every major operating system and has become part of the computer science curriculum at universities worldwide. His previous roles include Sun Fellow, Sun storage CTO, and Oracle VP.  Jeff holds 62 issued patents in storage and operating system technology.
 
  Application- and Workload-Specific Workflows
  Workflow Specification for Large-Scale Computational Systems (
Slides, 
Video)
David Montoya, Los Alamos National Laboratory
  Panel: Workflows for Specifying System Acquisitions 
  Moderator: Gary Grider, 
Los Alamos National Laboratory (
Slides, 
Video)
 
  David Montoya, Los Alamos National Laboratory
  
  
  Lance Evans, 
Cray, Inc. (
Slides)
12:30 — 1:30 Lunch
  1:30 — Data-Intensive Workflows
  A Holistic Framework for Data-Intensive Workflows
  
  Panel: Data-Intensive Workflows
  
  Jake Carroll, 
University of Queensland Brain Institute (
Slides)
Kirk Jordan, Hartree Centre - Science and Technology Facilities Council
  Massimo Noro, 
Unilever (
Slides)
Don Preuss, Starfish Storage
  Leveraging Disk for Large-Scale, Long-Term Storage Applications
  A Perspective on Power Management for Hard Disks
  
  
    
    
    The speaker will talk about the impact of power management on reliability of hard disk devices based on a field experience with SGI’s MAID product (formerly COPAN). The presentation will explore statistical analysis of support data covering a decade of MAID deployment (2005-2015) as well as briefly cover the history and basic principles of the technology and provide an update on future direction of the product line.
    
    
    
  
 
  Hard Disks for Large Archives
  Dave Anderson, Seagate
  Leveraging Large-Scale Disk Systems in a Web-based Photo Archive: Design, Operations, and Future Plans
  Mike Kugler, 
Shutterfly (
Slides)
Panel: Leveraging Disks for Large-Scale, Long-Term Storage
  
  
  Roark Hilomen, 
Sandisk (
Video)
Mike Kugler, 
Shutterfly (
Video)
Kirill Malkin, 
SGI (
Video)
Lightning Talks
  Superfacility: How new workflows in the DOE Office of Science are changing storage system requirements
  Katie Antypas, 
Lawrence Berkeley Laboratory (
Slides, 
Video)
Everspan—an optical archive solution with LTO-7 performance and cost, but without the tape issues
  
  
  7:30 — 8:30 Breakfast
  8:30 — Keynote
  Learnings from Operating 200 PB of Disk-Based Storage
  
  Trends in FLASH Technology Relevant to Large-Scale Systems
  Leveraging Flash in Scalable Environments:
    A Systems Perspective on How FLASH Storage is Displacing Disk Storage
  
  Storage Media Overview: Historic Perspectives
  
  Innovations in Non-Volatile Memory: 3D NAND and its Implications
  
  Panel: Media Trends
  Moderator: Matthew O’Keefe, Oracle
  
  Dave Anderson, 
Seagate (
Slides)
Bob Fontana, IBM Almaden Research
  Roark Hilomen, Sandisk
  Rob Peglar, Micron
  12:30 — 1:30 Lunch
  1:30 — An Adventure in Semantics: Compromising POSIX Semantics to Adapt
    Applications to Scalable Object Technology
  MarsFS: A Near-POSIX Namespace Leveraging Scalable Object Storage (
Slides, 
Video)
David Bonnie, Los Alamos National Laboratory
  
    
    It has been said that objects are for applications and POSIX is for people. The HPC community as well as many other large scale IT organizations have legacy applications and users that know, use, and depend on a near POSIX environment with real folders, ease of renaming and reshaping trees, and other powerful concepts in POSIX. 
    
    There are several POSIX name spaces that sit on top of cloud style erasure based objects but few if any really provide an extremely scalable solution. MarFS is designed to address this problem by providing a scalable near-POSIX name space over standard object systems, with target scaling out to trillions of POSIX files, hundreds of gigabytes/sec of data bandwidth, and millions of POSIX metadata operations/sec.
 
  Panel: Evolving Semantics for Object Storage
  
  David Bonnie, 
Los Alamos National Laboratory (
Slides)
Ian Corner, CSIRO
  
  
  Justin Stottlemyer, Intuit
  Potpourri
  Eternal 5D Data Storage in Glass
  Peter Kazansky, 
University of Southampton (UK) (
Slides, 
Video)
Recording of polarization multiplexed digital data was demonstrated by femtosecond laser nanostructuring of fused quartz. The storage allows unprecedented parameters including hundreds of terabytes per disc data capacity, thermal stability up to 1000°C and virtually unlimited lifetime at room temperature, which is a vital step towards an eternal archive.
  Everspan: 8-channel optical drive to achieve high transfer rates
  
  Building Scalable, High Performance Block Storage via an RDMA-based Hyperconverged Platform
  Josh Goldenhar, Excelero
  Excelero NVMesh allows the seemingly impossible by enabling compute hosts to have shared access to 100s of TBs of flash storage at LOCAL speeds and latencies. By incorporating classic RDMA benefits with NVMe technology and a few new “tricks”, it’s become possible to build a hugely scalable distributed block platform that performs faster than any all-flash-array. This capability allows data scientists to slice and dice data for analysis in ways that before were too costly, too slow or too proprietary.
  5:30 — 8:00 Reception
  
  
  
   (* Indicates Presenter)
  8:00 — 9:00 Registration / Breakfast
  9:00 — It’s Never too Fast: Storage Performance Enhancements
  Session chair: Dimitris Skourtis, VMware
  Pfimbi: Accelerating Big Data Jobs Through Flow-Controlled Data Replication (
Paper, 
Slides)
Simbarashe Dzinamarira* and T. S. Eugene Ng, Rice University
  Florin Dinu, École Polytechnique Fédérale de Lausanne
  Manylogs: Improved CMR/SMR Disk Bandwidth and Faster Durability with Scattered Logs (
Paper, 
Slides)
Tiratat Patana-anake*, Nora Sandler, Cheng Wu and Haryadi S. Gunawi, University of Chicago
  Vincentius Martin, Surya University
  Tombolo: Performance Enhancements for Cloud Storage Gateways (
Paper, 
Slides)
Suli Yang*, Remzi H. Arpaci-Dusseau, Andrea C. Arpaci-Dusseau, University of Wisconsin
  Kiran Srinivasan,  Kishore Udayashankar, Shweta Krishnan, Jingxin Feng, NetApp
  Yupu Zhang, HP
  File Systems for Non-Volatile Memory
  Session chair: Jay Lofstead, Sandia National Laboratories
  Fine-grained Metadata Journaling on NVM (
Paper, 
Slides)
Cheng Chen, Jun Yang*, Qingsong Wei, Chundong Wang, and Mingdi Xue, A-STAR
  Fast and Failure-Consistent Updates of Application Data in Non-Volatile Main Memory File System (
Paper, 
Slides)
Jiaxin Ou and Jiwu Shu, Tsinghua University, presented by Linpeng Huang*
  HMVFS: A Hybrid Memory Versioning File System (
Paper, 
Slides)
Shengan Zheng*, Linpeng Huang, Hao Liu, Linzhu Wu, and Jin Zha, Shanghai Jiao Tong University 
  12:10 — 1:10 Lunch
  1:10 — Store More, Longer, and for Less: Deduplication and Archival Systems
  Session chair: John May, Lawrence Livermore National Laboratory
  A Long Term User-Centric Analysis of Deduplication Patterns (
Paper, 
Slides)
Zhen Sun* and Nong Xiao, National University of Defense Technology
  Geoff Kuenning, Harvey Mudd College
  Sonam Mandal and Erez Zadok, Stony Brook University
  Philip Shilane, EMC
   Vasily Tarasov, IBM Research
  
  Jingwei Ma, Rebecca J. Stones*, Yuxiang Ma, Jingui Wang, Junjie Ren, Gang Wang, Xiaoguang Liu, Nankai University
  Sorted Deduplication: How to Process Thousands of Backup Streams (
Paper, 
Slides)
Jürgen Kaiser*, Tim Süß, Lars Nagel, André Brinkmann, Johannes Gutenberg University
  Effects of Prolonged Media Usage and Long-term Planning on Archival Systems (
Paper, 
Slides)
Preeti Gupta, Darrell D.E. Long, and Ethan L. Miller, University of California, Santa Cruz
  Avani Wildani*, Emory University
  David S.H. Rosenthal, Stanford University
  Spotlight on Flash memory and Solid-State Drives
  Session chair: Irfan Ahmad, Cloud Physics
  Adaptive policies for balancing performance and lifetime of mixed SSD arrays through workload sampling
    (
Paper, 
Slides)
Sangwhan Moon* and A. L. Narasimha Reddy, Texas A & M University
  REAL: A Retention Error Aware LDPC Decoding Scheme to Improve NAND Flash Read Performance (
Paper, 
Slides)
Meng Zhang*, Fei Wu, Shunzhuo Wang, and Changsheng Xie, Huazhong University of Science and Technology
  Xubin He and Ping Huang, Virginia Commonwealth University
  Analytic models for flash-based SSD performance when subject to trimming (
Paper, 
Slides)
Robin Verschoren* and Benny Van Houdt, University of Antwerp
  Reducing Write Amplification of Flash Storage through Cooperative Data Management with NVM (
Paper, 
Slides)
Eunji Lee*, Chungbuk National University
  Julie Kim, Line Corporation
  Hyokyung Bahn, Ewha Womans University
  Sam H. Noh, Ulsan National Institute of Science and Technology
  Exploiting Latency Variation for Access Conflict Reduction of NAND Flash Memory (
Paper, 
Slides)
Jinhua Cui, Weiguo Wu, Xingjun Zhang, and Jianhang Huang*, Xi'an Jiaotong University
  Yinfeng Wang, ShenZhen Institute of Information Technology
  
   (* Indicates Presenter)
  8:00 — 9:00 Breakfast
  9:00 — Understanding Storage Systems through Measurements and Analysis
  Session chair: Xing Lin, NetApp
  Understanding I/O Performance Behaviors of Cloud Storage from a Client’s Perspective (
Paper, 
Slides)
Binbing Hou* and Feng Chen, Louisiana State University
  Zhonghong Ou, Beijing University of Posts and Telecommunications
  Ren Wang and Michael Mesnier, Intel Labs
  File System Trace Replay Methods Through the Lens of Metrology (
Paper, 
Slides)
Thiago Emmanuel Pereira*, Francisco Brasileiro, and Livia Sampaio , Universidade Federal de Campina Grande
  The Impact of Data Placement on Resilience in Large-Scale Object Storage Systems (
Paper, 
Slides)
Philip Carns*, Kevin Harms, John Jenkins, Misbah Mubarak, Robert Ross, Argonne National Laboratory
  Christopher Carothers, Rensselaer Polytechnic Institute
  On-the-Go Storage
  Session chair: Jishen Zhao, University of California, Santa Cruz
  Understanding Storage I/O Behaviors of Mobile Applications (
Paper, 
Slides)
Jace Courville* and Feng Chen, Louisiana State University
  An Overlay File System for Cloud-Assisted Mobile Applications (
Paper, 
Slides)
Jianchen Shan*, Nafize R. Paiker, Xiaoning Ding, Narain Gehani, Reza Curtmola and Cristian Borcea, New Jersey Institute of Technology
  Fast Transaction Logging for Smartphones (
Paper, 
Slides)
Hao Luo*, Yaodong Yang, University of Nebraska
  Hong Jiang, Zhichao Yan, University of Texas
  12:00 — 1:00 Lunch
  
  
  
    
      |  2016 Organizers Advisory Board | 
    
      | Conference Chair | Dr. Sam Coleman | 
    
      | Tutorial Chair | Sean Roberts | 
    
      | Program Chair | Dr. Matthew O'Keefe | 
    
      | Research General Chair | Dr. Ahmed Amer | 
    
      | Research Program Chairs | Dr. Carlos Maltzahn, Dr. Vasily Tarasov | 
    
      | Research Track Program Committee | 
    
      | SCU Arrangements | Dr. Ahmed Amer | 
    
      | Vendor Chair | Dr. James Reaney | 
    
      | Communications Chair | Meghan Wingate McClelland | 
    
      | Registration Chairs | Dr. JoAnne Holliday, Yi Fang |