Skip to main content

LABIOS: Label-Based I/O System

GRC-ledFundedOpen Source

Overviewโ€‹

Revolutionizing data management through intelligent label-based I/O, bridging the gap between High-Performance Computing and AI workloads with unprecedented performance and flexibility.


๐Ÿ† Patented Innovationโ€‹

US Patent 11,630,834 B2 - Label-Based Data Representation I/O Process and Systemโ€‹

Status: Granted 2023

LABIOS represents a fundamental breakthrough in data management, protected by US patent law. Our patented label-based approach transforms how data is handled in large-scale computing environments.

The Shipping Label Analogyโ€‹

Just as a shipping label contains all information needed to deliver a packageโ€”address, priority, handling instructionsโ€”LABIOS labels contain everything needed to process data: operations, destinations, and metadata. This simple yet powerful concept enables unprecedented flexibility in how data moves through modern computing systems, from edge devices to supercomputers.

Key Features:

  • Protected Innovation: Unique label-based data paradigm
  • Industry Ready: Available for licensing
  • Proven Technology: Validated in production

Resources: View Patent | Licensing Information

๐ŸŒ Why LABIOS Mattersโ€‹

Bridging the Computing Convergenceโ€‹

Where HPC meets Big Data meets AI

Modern computing faces an unprecedented challenge: the convergence of High-Performance Computing (HPC), Big Data analytics, and Artificial Intelligence workloads. Each domain has different I/O patterns, performance requirements, and storage needs. LABIOS provides the unified foundation that enables seamless operation across all three.

HPC Workloadsโ€‹

  • Checkpoint/restart optimization
  • Burst buffer capabilities
  • Collective I/O patterns
  • 40% performance gains in VPIC

Big Data Analyticsโ€‹

  • MapReduce acceleration
  • In-situ data processing
  • Streaming analytics support
  • 2x throughput improvement

AI/ML Trainingโ€‹

  • GPU memory extension
  • Model checkpoint acceleration
  • KV cache optimization
  • 3x memory reduction

The Convergence Challenge: By 2025, over 80% of enterprise data will require processing across multiple paradigms. LABIOS is the only system that natively supports this convergence through its unified label abstraction.

๐Ÿš€ Key Achievementsโ€‹

MetricValueDescription
3xGPU Memory ReductionMegaMmap enables larger workloads
10xLower p99 LatencyPriority-based scheduling
805xBottleneck CoverageWisIO detection capabilities
40%Performance BoostCustom I/O stacks for VPIC

๐ŸŽฏ Project Overviewโ€‹

LABIOS introduces a revolutionary label-based I/O paradigm that transforms how data is managed in modern computing environments. By converting all I/O requests into intelligent "labels"โ€”configurable data units containing operations and data pointersโ€”LABIOS enables unprecedented flexibility and performance.

Core Innovation: The Label Abstractionโ€‹

Label = {
operation: Function pointer
data: Input data pointer
metadata: {
type: Label category
uniqueID: Identifier
source: Origin location
destination: Target location
state: Current status flags
}
}

This simple yet powerful abstraction enables:

  • Asynchronous I/O with intelligent scheduling
  • Storage elasticity through dynamic resource provisioning
  • Computational storage by embedding operations in labels
  • Seamless integration across HPC, Big Data, and AI workloads

๐Ÿ”ฌ Research Componentsโ€‹

MegaMmap: Memory-Storage Convergenceโ€‹

MegaMmap provides a tiered, non-volatile Distributed Shared Memory system that:

  • Reduces GPU memory usage by 3x for out-of-core workloads
  • Enables applications to work with datasets 2x larger than physical memory
  • Maintains performance parity with in-memory execution
  • Transparently manages data movement across DRAM, NVMe, and storage tiers

Publications: SC'24 | Features: GPU Support, Tiered Memory

WisIO: Intelligent Bottleneck Detectionโ€‹

WisIO revolutionizes I/O performance analysis through:

  • 805x increased bottleneck coverage vs. single-perspective tools
  • Classification of 340,000 bottlenecks/second
  • 11x faster than traditional profiling tools like Darshan
  • Multi-terabyte workflow analysis capabilities

Publications: ICS'25 | Features: Performance Analysis, HPC Workflows

HStream: Hierarchical Streaming Engineโ€‹

HStream provides intelligent streaming capabilities:

  • 2x throughput improvement for streaming workloads
  • Dynamic parallelism adjustment based on data ingestion rates
  • 75% latency reduction under high data volumes
  • Hierarchical buffering for varying data arrival patterns

Publications: ICPP'24 | Features: Streaming, Adaptive

Viper: DNN Model Transfer Frameworkโ€‹

Viper optimizes deep learning workflows through:

  • 9x reduction in model update latency
  • GPU-to-GPU memory transfers for maximum performance
  • Intelligent checkpoint scheduling for training pipelines
  • Transparent model storage and transfer capabilities

Publications: ICPP'24 | Features: Deep Learning, Model Serving

๐Ÿ—๏ธ Architecture & Integrationโ€‹

Core Componentsโ€‹

  • Label Manager: Constructs and optimizes labels based on I/O patterns
  • Content Manager: Distributed key-value store for temporary data
  • Label Dispatcher: Intelligent scheduling with priority support
  • Worker Pool: Elastic execution environment with GPU support

Integration Ecosystemโ€‹

  • Storage Systems: HDF5 VOL, ADIOS2, Parquet, POSIX
  • Frameworks: IOWarp Runtime, DTIO, ChronoLog
  • AI/ML Integration: RAG pipelines, KV cache optimization, Tokenization
  • Deployment: Jarvis automation, Spack packages, Container support

๐Ÿ“… Project Timeline & Maturityโ€‹

Two Years of Innovation (2023-2025)โ€‹

From concept validation to production-ready technology

Year 1 - Foundation Established core architecture, developed MegaMmap (2x memory capacity), Viper (9x latency reduction), HStream (2x throughput). Published at SC'24 and ICPP'24.

Year 2 - Expansion GPU integration (3x memory reduction), WisIO (805x coverage), priority scheduling (10x latency improvement), AI/ML integration. Patent granted. IPDPS'25 and ICS'25 publications.

Year 3 - Coming Soon Production deployment, advanced AI features, cross-format interoperability, enterprise partnerships.

๐Ÿ”ฌ Research Impact & Domain Applicationsโ€‹

Transforming Scientific Discoveryโ€‹

Real-world applications across critical research domains

๐ŸŒ Climate Scienceโ€‹

Processing massive climate simulations requires handling petabytes of data across thousands of time steps.

LABIOS Impact:

  • 65% reduction in I/O time for E3SM workflows
  • Enable real-time climate event detection
  • Support for multi-resolution data analysis

๐Ÿงฌ Genomics & Bioinformaticsโ€‹

Genomic sequencing generates massive datasets requiring complex processing pipelines.

LABIOS Impact:

  • 2x faster variant calling pipelines
  • In-situ quality control processing
  • Seamless integration with GATK workflows

โš›๏ธ Particle Physicsโ€‹

Large Hadron Collider experiments generate data at unprecedented rates requiring real-time analysis.

LABIOS Impact:

  • 40% improvement in VPIC performance
  • Real-time event filtering capabilities
  • Distributed analysis across sites

๐Ÿค– AI Model Trainingโ€‹

Training large language models requires efficient handling of massive datasets and checkpoints.

LABIOS Impact:

  • 3x GPU memory extension
  • 9x faster model checkpointing
  • Optimized KV cache management

๐Ÿ“Š Use Cases & Deploymentโ€‹

I/O Accelerationโ€‹

Fast distributed cache for temporary I/O

LABIOS as I/O Accelerator Ideal for Hadoop workloads with node-local I/O requirements

Asynchronous Forwardingโ€‹

Decoupled I/O for improved application performance

LABIOS for I/O Forwarding Applications pass data to LABIOS for asynchronous persistence

Intelligent Bufferingโ€‹

In-situ analysis and data sharing

LABIOS for I/O Buffering Perfect for deep learning pipelines and visualization workflows

Elastic Storageโ€‹

Dynamic resource provisioning

LABIOS as Storage Transparent storage hierarchies with live reconfiguration

๐Ÿ“Š Impact Metrics Dashboardโ€‹

Cumulative Project Impact (2023-2025)โ€‹

Measurable outcomes from NSF investment

MetricValueDescription
12Peer-Reviewed PublicationsPublished research papers
5Graduate Students TrainedPhD and MS students
3DOE Lab DeploymentsProduction installations
4Open Source Tools ReleasedCommunity software

Progress Indicatorsโ€‹

  • Technology Readiness Level: TRL 6/9 (67% complete)
  • Community Adoption: 287 GitHub Stars (75% growth)
  • Industry Engagement: 5 Active Discussions (45% progress)

Return on Investmentโ€‹

For every $1 of NSF funding, LABIOS has generated an estimated $3.50 in computational efficiency savings across partner institutions through reduced I/O wait times and improved resource utilization.

๐Ÿ“ˆ Performance Resultsโ€‹

Demonstrated Performance Gainsโ€‹

Storage Bridging Performance

Storage bridging: Up to 6x boost in I/O performance

Resource Heterogeneity Performance

Resource heterogeneity: 65% reduction in execution time

๐Ÿ“š Publications & Resourcesโ€‹

Recent Publicationsโ€‹

Authors
Title
Venue
Type
Date
Links
J. Ye,
J. Cernuda,
A. Maurya,
X.-H. Sun,
A. Kougkas,
B. Nicolae
Characterizing the Behavior and Impact of KV Caching on Transformer Inferences under ConcurrencyThe 39th IEEE International Parallel & Distributed Processing Symposium (IPDPS 2025)ConferenceJune, 2025
I. Yildirim,
H. Devarajan,
A. Kougkas,
X.-H. Sun,
K. Mohror
WisIO: Automated I/O Bottleneck Detection with Multi-Perspective Views for HPC WorkflowsThe 39th ACM International Conference on Supercomputing (ICS 2025)ConferenceJune, 2025
L. Logan,
X.-H. Sun,
A. Kougkas
MegaMmap: Blurring the Boundary Between Memory and Storage for Data-Intensive WorkloadsThe International Conference for High Performance Computing, Networking, Storage, and Analysis (SC'24)ConferenceNovember, 2024
J. Cernuda,
J. Ye,
A. Kougkas,
X.-H. Sun
HStream: A hierarchical data streaming engine for high-throughput scientific applicationsThe 53th International Conference on Parallel Processing (ICPP'24)ConferenceAugust, 2024
J. Ye,
J. Cernuda,
N. Rajesh,
K. Bateman,
O. Yildiz,
T. Peterka,
A. Nigmetov,
D. Morozov,
A. Kougkas,
X.-H. Sun,
B. Nicolae
Viper: A High-Performance I/O Framework for Transparently Updating, Storing, and Transferring Deep Neural Network ModelsThe 53th International Conference on Parallel Processing (ICPP'24)ConferenceAugust, 2024

Resources & Toolsโ€‹

๐Ÿ”ฎ Future Vision: Year 3 and Beyondโ€‹

The Road Aheadโ€‹

Building on two years of success

Year 3 Objectives (2025-2026)โ€‹

  • โœ“ Complete cross-format data interoperability (HDF5 โ†” Parquet โ†” Arrow)
  • โœ“ Deploy production-ready AI model serving infrastructure
  • โœ“ Achieve 5x performance improvement for mixed HPC/AI workloads
  • โœ“ Release LABIOS 2.0 with full enterprise features
  • โœ“ Establish industry consortium for continued development

Long-term Vision (2026+)โ€‹

LABIOS will evolve into the de facto standard for intelligent I/O management:

  • Edge-to-Exascale: Seamless data movement from IoT devices to supercomputers
  • Autonomous I/O: AI-driven optimization of data placement and movement
  • Quantum Integration: Support for emerging quantum computing I/O patterns
  • Global Federation: Cross-continent data sharing with intelligent caching

Sustainability Planโ€‹

Post-NSF funding, LABIOS will be sustained through:

  • Commercial licensing revenue from industry partners
  • DOE and DOD project integrations
  • Open source community contributions
  • Professional support and consulting services

๐Ÿค Collaboration & Supportโ€‹

Project Teamโ€‹

Principal Investigators: [View team members on project page]

Partners & Fundingโ€‹

  • NSF Award #2331480 - National Science Foundation
  • DOE National Laboratories - Argonne, Lawrence Livermore, Sandia
  • Testing Infrastructure - NSF Delta, Chameleon, CloudLab

๐ŸŒŸ Technology Transfer & Licensingโ€‹

Ready for Commercial Adoptionโ€‹

LABIOS technology is mature and available for licensing

After two years of intensive development and validation, LABIOS has proven its capabilities across diverse workloads and environments. Our patented technology is ready for integration into commercial products and services.

Ideal for:

  • Storage system manufacturers
  • Cloud service providers
  • HPC solution vendors
  • Big data analytics platforms
  • AI/ML infrastructure companies

Benefits:

  • Proven performance improvements
  • Patent-protected innovation
  • Extensive documentation
  • Active development community
  • Technical support available

Explore Licensing Opportunities

๐Ÿค Industry Partnership Opportunitiesโ€‹

Technology Licensingโ€‹

Integrate LABIOS into your products with our flexible licensing options.

  • โœ“ Exclusive or non-exclusive licenses
  • โœ“ Royalty or fixed-fee structures
  • โœ“ Technical support packages

Research Collaborationโ€‹

Partner with our team to advance the state of the art in I/O systems.

  • โœ“ Joint R&D projects
  • โœ“ Student internship programs
  • โœ“ Co-authored publications

Custom Solutionsโ€‹

Work with us to tailor LABIOS for your specific use cases.

  • โœ“ Architecture consultation
  • โœ“ Performance optimization
  • โœ“ Integration assistance

Start the Conversationโ€‹

Join leading organizations leveraging LABIOS to transform their data infrastructure. Let's discuss how LABIOS can accelerate your innovation.

[Contact Partnership Team](mailto:akougkas@illinoistech.edu?subject=LABIOS Partnership Inquiry) | Schedule a Demo

Join the LABIOS Communityโ€‹

Connect with researchers and developers working on next-generation I/O systems

Join Zulip Chat | Contact Team

โ“ Frequently Asked Questionsโ€‹

What makes LABIOS different from existing I/O systems?โ€‹

LABIOS introduces a unified label abstraction that works across HPC, Big Data, and AI workloads. Unlike traditional systems optimized for specific use cases, LABIOS adapts dynamically to different I/O patterns while maintaining high performance.

How does licensing work for commercial use?โ€‹

LABIOS is available under a dual licensing model. The core technology is open source for research use, while commercial deployments require a license through IIT's technology transfer office. Contact us for specific terms and pricing.

Can LABIOS integrate with my existing storage infrastructure?โ€‹

Yes! LABIOS is designed as a middleware layer that works with existing storage systems including parallel file systems (Lustre, GPFS), object stores (S3, Swift), and local storage (NVMe, SSD). No infrastructure changes required.

What performance improvements can I expect?โ€‹

Performance gains vary by workload, but typical improvements include: 2-6x I/O throughput, 10x reduction in tail latency, 3x memory savings for GPU workloads, and 40% overall application speedup for I/O-intensive applications.

Is LABIOS production-ready?โ€‹

Yes! LABIOS has been validated in production environments at DOE national laboratories and is currently at Technology Readiness Level 6. Version 2.0 (coming in 2026) will include additional enterprise features.

How do I get started with LABIOS?โ€‹

Start with our GitHub repository and documentation. For production deployments, we recommend contacting our team for architecture review and optimization guidance. Training and support packages are available.

Have more questions?

Email Technical Team | Join Community Chat


๐Ÿ™ Acknowledgmentsโ€‹

โค๏ธ Funding Supportโ€‹

National Science Foundation
Award #2331480
Primary funding enabling LABIOS research and development

๐Ÿ‘ฅ Research Partnersโ€‹

DOE National Laboratories
Argonne โ€ข Lawrence Livermore โ€ข Sandia

NSF Cyberinfrastructure
Delta โ€ข Chameleon โ€ข CloudLab

Collaborative partnerships advancing storage innovation


LABIOS is developed at the Gnosis Research Center, Illinois Institute of Technology

NSF Award #2331480

This material is based upon work supported by the National Science Foundation under Grant No. 2331480. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the National Science Foundation.

LABIOS is protected under U.S. Patent 11,630,834 B2. Commercial use requires licensing through Illinois Institute of Technology's Office of Technology Development.