MSST 2024, 38th International Conference on Massive Storage Systems and Technology, will be returning on its 50th anniversary to its new home at Santa Clara University, where it will be hosted by the School of Engineering, and will be held from June 3rd through June 7th, 2024

Registration (the lowest early-bird registration rates will close soon), is available here.

Directions to the venue and hotel information, is now available here.

The Research CFP (now closed for paper submissions) is archived here, but the poster submissions are still being accepted here.

Since the conference was founded, in 1974, by the leading national laboratories, MSST has been a venue for massive-scale storage system designers and implementers, storage architects, researchers, and vendors to share best practices and discuss building and securing the world’s largest storage systems for high-performance computing, web-scale systems, and enterprises.

MSST 2024 is possible thanks to sponsorship and support from SCU’s School of Engineering, and Hammerspace.

 

CONFERENCE SCHEDULE

 

The 38th International Conference on Massive Storage Systems and Technology will feature a return to the MSST full-week program.

More details will be provided as talks and presentations are finalized.

June 3rd

Tutorials

8:30 – 9:30

Breakfast & Badge Pickup

A Continental breakfast will be provided from 8:30 to 9:30am.

Badge pick-up will continue from 8:30 to 4:30pm.

9:15 – 9:30

Welcome to Analytics for the Storage Practitioner

Gary Grider (LANL)

9:30 – noon

Columnar Analytics

 

Overview (what, how, ecosystem, implications on storage) Fernanda Foertter (Voltron Data)
Object Computational Storage analytics offload Qing Zheng (LANL)
Erasured Object Computational Storage analytics offload Donpaul Stephens (AirMettle)
Hardware Accelerated Computational Storage analytics offload Krishna Maheshwari (Neuroblade)

noon – 1:00

Lunch

1:00 – 3:00

Row Analytics

 

Overview (what, how, ecosystem, implications on storage) Rob Johnson (Broadcom)
Hardware Accelerated Row analytics offload Edward Bortnikov (Pliops)
Hardware Accelerated Row analytics offload Phil Chan (Eideticom)

3:30 – 4:30

GraphDB / Analytics

 

Overview (what/how/market/ecosystem/storage implications) Amine Mhedhbi (Polytechnique Montréal)

June 4th

Invited Track

8:00 – 9:00

Breakfast & Badge Pickup

A Continental breakfast will be provided from 8:00 to 9:00am.

Badge pick-up will continue from 8:30 to 5:00pm.

8:30 – 8:45

Welcome & Opening Remarks

8:45 – 10:15

Computational Storage 1

 

Open Object Computational Storage Pushdown Jongryool.Kim (Skhynix)
Open PNFS Pushdown Dominic Manno (LANL)
Hardware Accelerated Computational Storage analytics offload Krishna Maheshwari (Neuroblade)

10:30 – 11:30

Computational Storage 2

 

Erasured Object Computational Storage analytics Donpaul Stephens (AirMettle)
Ceph Computational Storage Stephen Bates (Huawei)

11:30 – 12:30

Keynote: Michael Cornwell (Pure) – “Storage Landscapes”

12:30 – 1:30

Lunch

Badge pick-up will continue from 8:30 to 5:00pm.

1:30 – 3:00

Media Trends

 

Tape (Spectra) Nathan Thompson (Spectra)
Disk (WDC/Seagate) Paul Peck (Western Digital)
Flash (Samsung) Young Paik (Samsung)

3:15 – 4:45

Data Lakes

 

Apache Data Lake Ecosystem Fernanda Foertter (Voltron Data)
Iceberg (Apple/LinkedIn/Stripe) Alex Merced (Dremio)
Tackling I/O Challenges in Modern Data Lakes Hope Wang (Alluxio)

4:45 – 5:00

Invited Track Day 1 Wrap-Up

Adam Manzanares (Samsung)

June 5th

Invited Track

8:00 – 9:00

Breakfast & Badge Pickup

A Continental breakfast will be provided from 8:00 to 9:00am.

Badge pick-up will continue from 8:00 to 5:00pm.

8:30 – 8:45

Welcome & Introduction for Day 2

8:45 – 9:45

AI Storage Case Studies 1

 

AI storage case studies Hari Kennan (Pure)
AI storage case studies David Flynn (Hammerspace)

10:00 – 11:30

AI Storage Case Studies 2

 

All Flash Storage Systems Devasena Inupakutika (Samsung)
AI storage case studies Caden Bradbury
AI storage case studies Andy Pernsteiner (VAST)

11:30 – 12:30

Keynote: Garth Gibson – “Storage for A.I.”

12:30 – 1:30

Lunch

1:30 – 3:00

Heterogenous Workloads

 

Efficient Erasured NVMEoF Targets Sergey Platonov (Xinnor)
PNFS/NFS4.2 Diverse Workloads Trond Myklebust (Hammerspace)
Scalable Composable NVME Storage Matthew Williams (Cerio)

3:15 – 4:45

Distributed Storage Management

 

Open Community Distributed Storage Managment Dominivc Manno (LANL)
Beyond BeeOND: A Proposal for Composable Storage Joe McCormick (ThinkparQ)

4:45 – 5:00

Invited Track Day 2 Wrap-Up

Rohan Puri (AirMettle)

June 6th & 7th

Research Track

8:00 – 9:00 both days

Breakfast & Badge Pickup

A Continental breakfast will be provided from 8:00 to 9:00am.

Badge pick-up will continue from 8:00 to 5:00pm.

TBA

Session 1: Archival Data

Revisiting HDD Rules of Thumb: 1/3 Is Not (Quite) the Average Seek Distance

BURST: A Chunk-Based Data Deduplication System with Burst-Encoded Fingerprint Matching

A Generic and Efficient Framework for Estimating Lossy Compressibility of Scientific Data

TBA

Session 2: Long Live the Data

Repair I/O Optimization for Clay Codes via Gray-Code Based Sub-Chunk Reorganization in Distributed Storage Systems

Cauchy-Merge: An Efficient Cauchy Matrix based Stripe Merging Method for Reed-Solomon Codes

Minimizing Performance Degradation of RAID Recovery Through Pre-Failure Prediction

TBA

Session 3: Log Structured Session

Prophet: Optimizing LSM-Based Key-Value Store on ZNS SSDs with File Lifetime Prediction and Compaction Compensation

SAS-Cache: A Semantic-Aware Secondary Cache for LSM-based Key-Value Stores

A GPU-accelerated Compaction Strategy for LSM-based Key-Value Store System

TBA

Session 4: Getting Real

A GPU-accelerated Compaction Strategy for LSM-based Key-Value Store System

Answering the Call to ARMs with PACER: Power-Efficiency in Storage Servers

Mitigating Write-ahead Log Contention on Shared Storage Devices

TBA

Session 5: Cloudy With A Chance of Serverless

FastStore: Optimization of Distributed Block Storage Services for Cloud Computing

FuncStore: Resource Efficient Ephemeral Storage for Serverless Data Sharing

Balancing Costs and Durability for Serverless Data

TBA

Session 6: Heterogeneity

Learning to Coordinate Read-Write Cache Policies in SSD

TieredMMS: A Portable Tiered Memory Management System

Minding the Semantic Gap for Effective Storage-Based Ransomware Defense

TBA

Session 7: Flashy Session

Adaptive Selection of Parity Chunk Update Methods in RAID-enabled SSDs

AUGEFS: A Scalable Userspace Log-Structured File System for Modern SSDs

Fully Harnessing the Performance Potential of DRAM-less Mobile Flash Storage

TBA

Session 8: All Alone with the Memory

LodgeTree: A Last-Level Distributed and Surrogate Buffer Tree for Non-Volatile Memories

Dolphin: A Resource-efficient Hybrid Index on Disaggregated Memory

H2KV: A Hotspot Awareness based Hybrid Fault-tolerant In-memory Key-Value Store

TBA

Session 9: Even More Flashy

PhasedRR: Read Reclaim Scheduling without Page-level Access Counting

Ensuring Compaction and Zone Cleaning Efficiency through Same-Zone Compaction in ZNS Key-Value Store

PhatKV: Towards an Efficient Metadata Engine for KV-based File Systems on Modern SSD