Skip to main content

Publications

Authors
Title
Venue
Type
Date
Tags
Links
J. Cernuda,
L. Logan,
A. Gainaru,
J. Lofstead,
A. Kougkas,
X.-H. Sun
Hades: A Context-Aware Active Storage Framework for Accelerating Large-Scale Data AnalysisThe 24th IEEE/ACM international Symposium on Cluster, Cloud and Internet Computing (CCGRID 2024)ConferenceMay, 2024
Active StorageHierarchical StorageContext AwarenessMetadata ManagementData OperatorIn-Transit ComputingCoeus
TBA
N. Rajesh,
K. Bateman,
S. Byna,
J. L. Bez,
A. Kougkas,
X.-H. Sun
TunIO: An AI-powered Framework for Optimizing HPC I/O38th IEEE International Parallel & Distributed Processing Symposium (IPDPS 2024)ConferenceMay, 2024
AI for I/OI/O Stack Tuning
TBA
X. Lu,
B. Long,
X. Chen,
Y. Han,
X.-H. Sun
ACES: Accelerating Sparse Matrix Multiplication with Adaptive Execution Flow and Concurrency-Aware Cache OptimizationsThe 2024 Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS 2024)ConferenceApril, 2024
SpMMAcceleratorParallelismConcurrencySynchronizationScalability
TBA
X. Lu,
H. Najafi,
J. Liu,
X.-H. Sun
CHROME: Concurrency-Aware Holistic Cache Management Framework with Online Reinforcement LearningThe 30th IEEE International Symposium on High-Performance Computer Architecture (HPCA 2024)ConferenceMarch, 2024
Memory ArchitectureCache Management
TBA
Z. Dang,
S. He,
X. Zhang,
P. Hong,
Z. Li,
X. Chen,
H. Song,
X.-H. Sun,
G. Chen
PMAlloc: A Holistic Approach to Improving Persistent Memory AllocationACM Transactions on Computer Systems (TOCS'24)JournalJanuary, 2024
Persistent MemoryMemory Management
TBA
I. Yildirim,
H. Devarajan,
A. Kougkas,
X.-H. Sun,
K. Mohror
IOMax: Maximizing Out-of-Core I/O Analysis Performance on HPC SystemsThe 8th International Parallel Data Systems Workshop (PDSW'23), November 12, 2023WorkshopNovember, 2023
I/O AnalysisOut-of-Core AnalysisData DrillingWisIO
I. Yildirim,
H. Devarajan,
A. Kougkas,
X.-H. Sun,
K. Mohror
Exploring the Impacts of Multiple I/O Metrics in Identifying I/O BottlenecksThe International Conference for High Performance Computing, Networking, Storage, and Analysis (SC'23), November 12-17, 2023PosterNovember, 2023
I/O AnalysisI/O MetricsI/O Bottleneck DetectionWisIO
M. Zou,
M. Zhang,
R. Wang,
X.-H. Sun,
X. Ye,
D. Fan,
Z. Tang
Skyway: Accelerate Graph Applications with a Dual-Path Architecture and Fine-Grained Data ManagementJournal of Computer Science and Technology (JCST'23)JournalTBA
Graph ApplicationsComputer ArchitectureMemory Hierarchy
TBA
H. Lee,
L. Guo,
M. Tang,
J. Firoz,
N. Tallent,
A. Kougkas,
X.-H. Sun
Data Lifecycles: Optimizing Workflow Task & Data CoordinationThe International Conference for High Performance Computing, Networking, Storage, and Analysis (SC'23), November 12-17, 2023ConferenceNovember, 2023
Data AnalyticsPerformance MeasurementModelingTools
TBA
L. Logan,
J. Lofstead,
A. Kougkas,
X.-H. Sun
An Evaluation of DAOS for Simulation and Deep Learning HPC WorkloadsThe 3rd Workshop on Challenges and Opportunities of Efficient and Performant Storage Systems (CHEOPS'23)WorkshopMay, 2023
Distributed ComputingDistributed StorageFlash MemoryMachine LearningParallel ComputingPhase Change Memory
X.-H. Sun,
X. Lu
The Memory-Bounded Speedup Model and Its Impacts in ComputingJournal of Computer Science and Technology (JCST'23), vol. 38, no. 1, February 2023JournalFebruary, 2023
Memory-Bounded SpeedupScalable ComputingMemory-WallData-Centric Design
W. Chen,
S. He,
Y. Xu,
X. Zhang,
S. Yang,
S. Hu,
X.-H. Sun,
G. Chen
iCACHE: An Importance-Sampling-Informed Cache for Accelerating I/O-Bound DNN Model TrainingThe 29th IEEE International Symposium on High-Performance Computer Architecture (HPCA-29), Montreal, QC, Canada, February 25 - March 01, 2023ConferenceFebruary, 2023
X. Lu,
R. Wang,
X.-H. Sun
CARE: A Concurrency-Aware Enhanced Lightweight Cache Management FrameworkThe 29th IEEE International Symposium on High-Performance Computer Architecture (HPCA-29), Montreal, QC, Canada, February 25 - March 01, 2023ConferenceFebruary, 2023
K. Bateman,
N. Rajesh,
J. Cernuda,
L. Logan,
J. Ye,
S. Herbein,
A. Kougkas,
X.-H. Sun
LuxIO: Intelligent Resource Provisioning and Auto-Configuration for Storage ServicesThe 29th edition of the IEEE International Conference on High Performance Computing, Data, and Analytics (HiPC'22)ConferenceDecember, 2022
Resource ProvisioningI/O BehaviorStorage Auto-TuningChronoLog
P. Chen,
S. He,
X. Zhang,
S. Chen,
P. Hong,
Y. Yin,
X.-H. Sun
Accelerating Tensor Swapping in GPUs with Self-Tuning CompressionTransactions on Parallel and Distributed Systems (TPDS'22)JournalDecember, 2022
DNNGPUTensorSwappingCompression
H. Najafi,
X. Lu,
J. Liu,
X.-H. Sun
A Generalized Model For Modern Hierarchical Memory SystemThe 2022 Winter Simulation Conference (WSC), Singapore, December 11-14, 2022ConferenceDecember, 2022
Hierarchical Memory SystemC-AMAT
L. Logan,
J. Cernuda,
J. Lofstead,
X.-H. Sun,
A. Kougkas
LabStor: A Modular and Extensible Platform for Developing High-Performance, Customized I/O Stacks in UserspaceThe International Conference for High Performance Computing, Networking, Storage, and Analysis (SC'22)ConferenceNovember, 2022
Clouds and Distributed ComputingProgramming FrameworksSystem SoftwareChronoLog
I. Yildirim,
H. Devarajan,
A. Kougkas,
X.-H. Sun,
K. Mohror
A Multifaceted Approach to Automated I/O Bottleneck Detection for HPC WorkloadsThe International Conference for High Performance Computing, Networking, Storage, and Analysis (SC'22)PosterNovember, 2022
I/O CharacterizationI/O Bottleneck DetectionWisIO
M. Zou,
M. Zhang,
R. Wang,
X.-H. Sun,
X. Ye,
D. Fan,
Z. Tang
Accelerating Graph Processing with Lightweight Learning-Based Data ReorderingThe IEEE Computer Architecture Letters (CAL'2022)JournalMay, 2022
H. Devarajan,
A. Kougkas,
H. Zheng,
V. Vishwanath,
X.-H. Sun
Stimulus: Accelerate Data Management for Scientific AI applications in HPCThe 22nd IEEE/ACM International Symposium on Cluster, Cloud and Internet Computing (CCGRID'22), May 16-19, 2022ConferenceMay, 2022
HDF5TensorFlowDecoupled I/OI/O AccelerationHermes
Z. Dang,
S. He,
P. Hong,
Z. Li,
X. Zhang,
X.-H. Sun,
G. Chen
NVAlloc: Rethinking Heap Metadata Management in Persistent Memory AllocatorsThe 2022 Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS'22), Feb 28 - Mar 4, 2022ConferenceFebruary, 2022
N. Rajesh,
Q. Koziol,
S. Byna,
H. Tang,
J. L. Bez,
A. Kougkas,
X.-H. Sun
Feature Reduction of Darshan Counters Using Evolutionary AlgorithmsThe 2021 International Conference for High Performance Computing, Networking, Storage, and Analysis (SC'21), November 14–19, 2021PosterNovember, 2021
Feature ReductionEvolutionary AlgorithmsDarshan
L. Logan,
J. Lofstead,
S. Levy,
P. Widener,
X.-H. Sun,
A. Kougkas
Utilizing Persistent Memory in Parallel I/O LibrariesThe 2021 International Conference for High Performance Computing, Networking, Storage, and Analysis (SC'21), November 14–19, 2021PosterNovember, 2021
Persistent MemoryLibrariesMemory ManagementMemory Mapped I/O
I. Yildirim,
M. Tang,
A. Kougkas,
X.-H. Sun
Performance Analysis of Containerized OrangeFS in HPC EnvironmentThe 2021 International Conference for High Performance Computing, Networking, Storage, and Analysis (SC'21), November 14–19, 2021PosterNovember, 2021
HPCSingularityContainersBenchmarking
J. Ye,
A. Kougkas,
X.-H. Sun
HDF5 VOL Connector to Apache ArrowThe 2021 International Conference for High Performance Computing, Networking, Storage, and Analysis (SC'21), November 14–19, 2021PosterNovember, 2021
HDF5Apache ArrowColumn store
X. Lu,
R. Wang,
X.-H. Sun
Premier: A Concurrency-Aware Pseudo-Partitioning Framework for Shared Last-Level CacheThe 2021 IEEE 39th International Conference on Computer Design (ICCD'21), October 24 - 27, 2021ConferenceOctober, 2021
L. Logan,
J. Lofstead,
S. Levy,
P. Widener,
X.-H. Sun,
A. Kougkas
pMEMCPY: a simple, lightweight, and portable I/O library for storing data in persistent memoryThe 1st Workshop on Re-envisioning Extreme-Scale I/O for Emerging Hybrid HPC Workloads (REX-IO'21), in conjunction with the 2021 IEEE International Conference on Cluster Computing (CLUSTER'21), September 7-10, 2021WorkshopSeptember, 2021
Persistent MemoryLibrariesMemory ManagementMemory Mapped I/O
P. Chen,
S. He,
X. Zhang,
S. Chen,
P. Hong,
Y. Yin,
X.-H. Sun,
G. Chen
CSWAP: A Self-Tuning Compression Framework for Accelerating Tensor Swapping in GPUsThe 2021 IEEE International Conference on Cluster Computing (CLUSTER'21), September 7-10, 2021ConferenceSeptember, 2021
J. Cernuda,
H. Devarajan,
L. Logan,
K. Bateman,
N. Rajesh,
J. Ye,
A. Kougkas,
X.-H. Sun
HFlow: A Dynamic and Elastic Multi-Layered Data ForwarderThe 2021 IEEE International Conference on Cluster Computing (CLUSTER'21), September 7-10, 2021ConferenceSeptember, 2021
Hermes
L. Yan,
M. Zhang,
R. Wang,
X. Chen,
X. Zou,
X. Lu,
Y. Han,
X.-H. Sun
CoPIM: A Concurrency-aware PIM Workload Offloading Architecture for Graph ApplicationsThe 2021 ACM/IEEE International Symposium on Low Power Electronics and Design (ISLPED'21), July 26, 2021ConferenceJuly, 2021
S. Yang,
W. Chen,
X. Zhang,
S. He,
Y. Yin,
X.-H. Sun
AUTO-PRUNE: Automated DNN Pruning and Mapping for ReRAM-Based AcceleratorThe ACM International Conference on Supercomputing (ICS'21), June 14-17, 2021ConferenceJune, 2021
ReRAM-based AcceleratorPruningReinforcement Learning
N. Rajesh,
H. Devarajan,
J. Cernuda,
K. Bateman,
L. Logan,
J. Ye,
A. Kougkas,
X.-H. Sun
Apollo: An ML-assisted Real-Time Storage Resource ObserverThe 30th ACM International Symposium on High-Performance Parallel and Distributed Computing (HPDC'21), June 21-25, 2021ConferenceJune, 2021
HPCMachine LearningResource MonitoringHermes
H. Devarajan,
H. Zheng,
A. Kougkas,
X.-H. Sun,
V. Vishwanath
DLIO: A Data-Centric Benchmark for Scientific Deep Learning ApplicationsThe 2021 IEEE/ACM International Symposium in Cluster, Cloud, and Internet Computing (CCGrid'21), May 17 - 20, 2021 Best paper awardConferenceMay, 2021
Deep LearningScientific ApplicationsRepresentativeBenchmarkData-IntensiveI/OCharacterizationTensorFlowData PipelineHermes
Y. Liu,
X.-H. Sun,
Y. Wang,
Y. Bao
HCDA: From Computational Thinking to a Generalized Thinking ParadigmCommunications of the ACM (CACM'21), Vol. 64, No. 5, pp. 66-75, May 2021JournalMay, 2021
HardwareEmerging TechnologiesEmerging ArchitecturesComputing EducationHistory of Computing
J. Liu,
P. Espina,
X.-H. Sun
A Study on Modeling and Optimization of Memory SystemsJournal of Computer Science and Technology (JCST'21), vol. 35, no. 1, January 2021JournalJanuary, 2021
Performance ModelingPerformance OptimizationMemory ArchitectureMemory HierarchyConcurrent Average Memory Access Time
Z. Ye,
Y. Wang,
S. He,
C.-Z. Xu,
X.-H. Sun
Sova: A Software-Defined Autonomic Framework for Virtual Network AllocationsIEEE Transactions on Parallel and Distributed Systems (TPDS'21) Vol: 32, Issue: 1, pp: 116-130, January 1, 2021JournalJanuary, 2021
Resource ManagementBandwidthData CentersServersVirtualizationTime FactorsQuality of Service
H. Devarajan,
A. Kougkas,
X.-H. Sun
HReplica: A Dynamic Data Replication Engine with Adaptive Compression for Multi-Tiered StorageThe 2020 IEEE International Conference on Big Data (Big Data'20), December 10-13, 2020ConferenceDecember, 2020
Data ReplicationDynamicSelection AlgorithmMulti-TieredData CompressionIntelligent SelectionDynamic ProgrammingCloud ApplicationScientific ApplicationsBig DataHermes
L. Logan,
A. Kougkas,
X.-H. Sun
Quantifying the Overheads of the Modern Linux I/O StackThe International Conference for High Performance Computing, Networking, Storage and Analysis (SC'20)PosterNovember, 2020
I/O BottleneckFilesystemsLinux
A. Kougkas,
H. Devarajan,
X.-H. Sun
Bridging Storage Semantics using Data Labels and Asynchronous I/OACM Transactions on Storage (TOS'20), Vol 16, No 4, Article 22, Nov. 2020JournalNovember, 2020
Label-Based I/OStorage BridgingHeterogeneous I/ODatalabelsTask-Based I/OExascale I/OEnergy-Aware I/OElastic Storage
N. Rajesh,
G. Heber,
A. Kougkas,
X.-H. Sun
Characterizing and Approximating I/O Behavior of HDF5 ApplicationsThe International Conference for High Performance Computing, Networking, Storage and Analysis (SC'20)PosterNovember, 2020
H. Devarajan,
H. Zheng,
X.-H. Sun,
V. Vishwanath
Understanding I/O behavior of Scientific Deep Learning Applications in HPC systemsThe International Conference for High Performance Computing, Networking, Storage and Analysis (SC'20)PosterNovember, 2020
H. Devarajan,
A. Kougkas,
X.-H. Sun
A Dynamic Multi-Tiered Storage System for Extreme Scale ComputingThe International Conference for High Performance Computing, Networking, Storage and Analysis (SC'20)PosterNovember, 2020
Hermes
X. Lu,
R. Wang,
X.-H. Sun
APAC: An Accurate and Adaptive Prefetch Framework with Concurrent Memory Access AnalysisThe 38th IEEE International Conference on Computer Design (ICCD'20), October 18 - 21, 2020ConferenceOctober, 2020
PrefetchMemory Performance ModelConcurrent Memory Access
N. Zhang,
B. Toonen,
X.-H. Sun,
B. Allcock
Performance Modeling and Evaluation of a Production Disaggregated Memory SystemInternational Symposium on Memory Systems (MEMSYS'20), Sept. 28 - Oct. 2, 2020ConferenceOctober, 2020
Performance ModelingDisaggregated MemoryC-AMATPerformance EvaluationUtilizationRAN
A. Kougkas,
H. Devarajan,
K. Bateman,
J. Cernuda,
N. Rajesh,
X.-H. Sun
ChronoLog: A Distributed Shared Tiered Log Store with Time-based Data OrderingThe 36th International Conference on Massive Storage Systems and Technology (MSST'20), Oct. 29-30, 2020ConferenceOctober, 2020
Distributed LogShared LogTiered StorageChronoLog
H. Devarajan,
A. Kougkas,
K. Bateman,
X.-H. Sun
HCL: Distributing Parallel Data Structures in Extreme ScalesIEEE International Conference on Cluster Computing (CLUSTER'20), Sept. 14-17, 2020ConferenceSeptember, 2020
Distributed Data StructureRPC over RDMAHybrid Data Access ModelHPC Data ContainersHermes
H. Devarajan,
A. Kougkas,
X.-H. Sun
HFetch: Hierarchical Data Prefetching for Scientific Workflows in Multi-Tiered Storage EnvironmentsIEEE International Parallel and Distributed Processing Symposium (IPDPS'20), May 18-22, 2020ConferenceMay, 2020
HierarchicalMulti-TieredData PrefetchingData-FetchingDynamic ChoiceData-CentricLibraryMiddlewareEngineData-AwareWorkflow-AwareServer-PushHermes
H. Devarajan,
A. Kougkas,
L. Logan,
X.-H. Sun
HCompress: Hierarchical Data Compression for Multi-Tiered Storage EnvironmentsIEEE International Parallel and Distributed Processing Symposium (IPDPS'20), May 18-22, 2020ConferenceMay, 2020
HierarchicalMulti-TieredData CompressionData-ReductionDynamic ChoiceWorkflow PrioritiesLibraryMiddlewareEngineHermes
S. He,
Z. Li,
J. Zhou,
Y. Yin,
X. Xu,
Y. Chen,
X.-H. Sun
A Holistic Heterogeneity-Aware Data Placement Scheme in Hybrid Parallel I/O SystemsIEEE Transactions on Parallel and Distributed Systems (TPDS'20), vol 31. no 4. pp 830-842JournalApril, 2020
Parallel I/OParallel File System (PFS)Hybrid Parallel File SystemData PlacementSolid State Drive
S. He,
Y. Yin,
X.-H. Sun,
X. Zhang,
Z. Li
Optimizing Parallel I/O Accesses through Pattern-Directed and Layout-Aware ReplicationIEEE Transactions on Computers (TC'20), vol 69. no 2. pp 212-225JournalFebruary, 2020
Parallel I/OI/O OptimizationData ReplicationData ReorganizationData Access Pattern
A. Kougkas,
H. Devarajan,
X.-H. Sun
I/O Acceleration via Multi-Tiered Data Buffering and PrefetchingJournal of Computer Science and Technology (JCST'20), vol 35. no 1. pp 92-120JournalJanuary, 2020
I/O BufferingHeterogeneous BufferingLayered BufferingDeep Memory HierarchyBurst BuffersHierarchical Data PrefetchingData-Centric ArchitectureHermes
C. Li,
M. Zhang,
Z. Xu,
X.-H. Sun
Self-adaptive Address Mapping Mechanism for Access Pattern Awareness on DRAM17th IEEE International Symposium on Parallel and Distributed Processing with Applications (ISPA), 2019. pp. 61-70ConferenceDecember, 2019
DRAMLocalityMLPAccess PatternData LayoutMatrix Multiplication
K. Feng,
H. Devarajan,
A. Kougkas,
X.-H. Sun
NIOBE: An Intelligent I/O Bridging Engine for Complex and Distributed WorkflowsThe 7th IEEE International Conference on Big Data, 2019. pp. 493-502ConferenceDecember, 2019
Data IntegrationIntegrated WorkflowData AggregationKVSParallel File System (PFS)
H. Devarajan,
A. Kougkas,
X.-H. Sun
HFetch: Hierarchical Data Prefetching in Multi-Tiered Storage EnvironmentsThe International Conference for High Performance Computing, Networking, Storage and Analysis (SC'19) Best Poster Nominee, Ph.D ForumPosterNovember, 2019
HierarchicalMulti-TieredData PrefetchingServer-CentricData-CentricFile ScoringData ScoringWorkflow-AwareLightweightServer-SideHermes
J. Cernuda,
H. Trivino,
H. Devarajan,
A. Kougkas,
X.-H. Sun
Efficient Data Eviction across Multiple Tiers of StorageThe International Conference for High Performance Computing, Networking, Storage and Analysis (SC'19)PosterNovember, 2019
Y. Liu,
X.-H. Sun
LPM: A Systematic Methodology for Concurrent Data Access Pattern Optimization from a Matching PerspectiveIEEE Transactions on Parallel and Distributed Systems (TPDS), 2019. vol 30. no 11. pp.2478-2493.JournalNovember, 2019
Memory-WallMemory Stall TimeEfficiencyPerformance OptimizationLayered Performance Matching (LPM)Memory Concurrency
A. Kougkas,
H. Devarajan,
J. Lofstead,
X.-H. Sun
LABIOS: A Distributed Label-Based I/O SystemThe 28th International Symposium on High-Performance Parallel and Distributed Computing (HPDC'19), Phoenix, USA 2019. pp. 13-24. Karsten Schwan Best Paper AwardConferenceJune, 2019
Label-Based I/OStorage BridgingHeterogeneous I/ODatalabelsTask-Based I/OExascale I/OEnergy-Aware I/OElastic StorageHermes
H. Devarajan,
A. Kougkas,
X.-H. Sun
An Intelligent, Adaptive, and Flexible Data Compression FrameworkIEEE/ACM International Symposium in Cluster, Cloud, and Grid Computing (CCGrid'19), Larnaca, Cyprus2019. pp. 82-91.ConferenceMay, 2019
Hermes
Y. Wang,
S. He,
X. Fan,
C. Xu,
X.-H. Sun
On Cost-Driven Collaborative Data Caching: A New Model ApproachIEEE Transactions on Parallel and Distributed Systems (TPDS), vol. 30, no. 3, pp. 662 - 676JournalMarch, 2019
H. Devarajan,
A. Kougkas,
P. Challa,
X.-H. Sun
Vidya: Performing Code-Block I/O Characterization for Data Access OptimizationThe IEEE International Conference on High Performance Computing, Data, and Analytics 2018 (HiPC'18), Bengaluru, India2018. pp. 255-264.ConferenceDecember, 2018
Hermes
S. He,
X.-H. Sun
A Cost-Effective Distribution-Aware Data Replication Scheme for Parallel I/O SystemsIEEE Transactions on Computers (TC), vol. 67, no. 10, pp. 1374-1387JournalOctober, 2018
A. Kougkas,
H. Devarajan,
X.-H. Sun,
J. Lofstead
Harmonia: An Interference-Aware Dynamic I/O Scheduler for Shared Non-Volatile Burst BuffersThe IEEE International Conference on Cluster Computing 2018 (Cluster'18), Belfast, UK2018. pp. 290-301.ConferenceSeptember, 2018
Hermes
K. Feng,
X.-H. Sun,
X. Yang,
S. Zhou
SciDP: Support HPC and Big Data Applications via Integrated Scientific Data ProcessingThe IEEE International Conference on Cluster Computing 2018 (Cluster'18), Belfast, UK2018. pp. 114-123.ConferenceSeptember, 2018
Y. Liu,
X.-H. Sun
CaL: Extending Data Locality to Consider Concurrency for Performance OptimizationIEEE Transactions on Big Data, vol. 5, no. 2, pp. 273-288JournalJune, 2018
A. Kougkas,
H. Devarajan,
X.-H. Sun
IRIS: I/O Redirection via Integrated StorageThe 32nd ACM International Conference on Supercomputing (ICS), Bejing, China2018. pp. 33-42.ConferenceJune, 2018
A. Kougkas,
H. Devarajan,
X.-H. Sun
Hermes: A Heterogeneous-Aware Multi-Tiered Distributed I/O Buffering SystemThe 27th ACM International Symposium on High-Performance Parallel and Distributed Computing (HPDC), Tempe, AZ, USA, 2018. pp. 219-230ConferenceJune, 2018
Hermes
S. He,
X.-H. Sun,
Y. Wang,
C. Xu
A Migratory Heterogeneity-Aware Data Layout Scheme for Parallel File SystemsThe 32nd IEEE International Parallel and Distributed Processing Symposium (IPDPS'18), Vancouver, Canada2018. pp. 1133-1142.ConferenceMay, 2018
X. Wang,
X. Yang,
M. Mubarak,
R. Ross,
Z. Lan
Trade-off Study of Localizing Communication and Balancing Network Traffic on Dragonfly SystemThe 32nd IEEE International Parallel and Distributed Processing Symposium (IPDPS'18), Vancouver, Canada2018. pp. 1113-1122.ConferenceMay, 2018
A. Haider,
F. Checconi,
X. Que,
L. Schneidenbach,
D. Buono,
X.-H. Sun
Horizon: A Multi-abstraction Framework for Graph AnalyticsThe ACM International Conference on Computing Frontiers 2018 (CF'18), Italy, 2018. pp. 252-255WorkshopMay, 2018
A. Kougkas,
H. Devarajan,
X.-H. Sun
Enosis: Bridging the Semantic Gap between File-based and Object-based Data ModelsThe ACM SIGHPC 8th International Workshop on Data-Intensive Computing in the Clouds (DataCloud 2017), in conjunction with SC'17, Denver, CO, USAWorkshopNovember, 2017
H. Devarajan,
A. Kougkas,
X.-H. Sun,
H. Che
Open Ethernet Drive: Evolution of Energy-Efficient Storage TechnologyThe ACM SIGHPC 8th International Workshop on Data-Intensive Computing in the Clouds (DataCloud 2017), in conjunction with SC'17, Denver, CO, USAvol. 17WorkshopNovember, 2017
N. Zhang,
C. Jiang,
X.-H. Sun,
S. Song
Evaluating GPGPU Memory Performance Through the C-AMAT ModelThe ACM SIGHPC 1st International Workshop on Memory Centric Programming for HPC (MCHPC 2017), in conjunction with SC'17, Denver, CO. USA2017. pp. 35-39WorkshopNovember, 2017
Y. Yan,
R. Brightwell,
X.-H. Sun
Principles of Memory-Centric Programming for High Performance ComputingThe ACM SIGHPC 1st International Workshop on Memory Centric Programming for HPC (MCHPC 2017), in conjunction with SC'17, Denver, CO. USA2017. pp. 2-6WorkshopNovember, 2017
A. Kougkas,
H. Devarajan,
X.-H. Sun
Syndesis: Mapping Objects to Files for a Unified Data Access SystemThe ACM SIGHPC 8th International Workshop on Many-Task Computing on Clouds, Grids, and Supercomputers (MTAGS 2017), in conjunction with SC'17, Denver, CO, USAWorkshopNovember, 2017
Y. Fan,
P. Rich,
W. Allcock,
M. Papka,
Z. Lan
Trade-off Between Prediction Accuracy and Underestimation Rate in Job Runtime EstimatesThe IEEE International Conference on Cluster Computing 2017 (Cluster'17), Hawaii, USA 2017, pp. 530-540.ConferenceSeptember, 2017
N. Liu,
A. Haider,
D. Jin,
X.-H. Sun
Modeling and Simulation of Extreme-Scale Fat-Tree Networks for HPC Systems and Data CentersACM Transactions on Modeling and Computer Simulation (TOMACS) - Special Issue on PADS 2015, vol. 27, no. 2, pp. 13:1-13:23JournalJuly, 2017
A. Kougkas,
H. Eslami,
R. Thakur,
W. D. Gropp,
X.-H. Sun
Rethinking Key Value Store for Parallel I/O OptimizationInternational Journal of High Performance Applications,2017. vol. 31, no. 4, pp. 335-356JournalJuly, 2017
S. He,
Y. Wang,
Z. Li,
X.-H. Sun,
C. Xu
Cost-Aware Region-Level Data Placement in Multi-Tiered Parallel I/O SystemsIEEE Transactions on Parallel and Distributed Systems (TPDS), vol. 28, no. 7, pp. 1853-1865JournalJune, 2017
S. He,
Y. Wang,
X.-H. Sun,
C. Huang,
C. Xu
Heterogeneity-Aware Collective I/O for Parallel I/O Systems with Hybrid HDD/SSD ServersIEEE Transactions on Computers (TC), vol. 66, no. 6, pp. 1091-1098JournalJune, 2017
S. He,
Y. Wang,
X.-H. Sun,
C. Xu
HARL: Optimizing Parallel File Systems with Heterogeneity-Aware Region-Level Data LayoutIEEE Transactions on Computers (TC), vol. 66, no. 6, pp. 1048-1060JournalJune, 2017
W. Allcock,
P. Rich,
Y. Fan,
Z. Lan
Experience and Practice of Batch Scheduling on Leadership Supercomputers at ArgonneThe 21st workshop on Job Scheduling Strategies for Parallel Processing (JSSPP), Vancouver, Canada2017, pp. 1-24WorkshopMay, 2017
Y.-H. Liu,
X.-H. Sun
Evaluating the Combined Effect of Memory Capacity and Concurrency for Many-core Chip DesignACM Transactions on Modeling and Performance Evaluation of Computing Systems (TOMPECS), 2017. vol. 2, no. 2, pp. 9:1-9:25JournalApril, 2017
S. He,
Y. Wang,
X.-H. Sun,
C. Xu
Using MinMax-Memory Claims to Improve In-Memory Workflow Computations in the CloudIEEE Transactions on Parallel and Distributed Systems (TPDS), 2017. vol. 28, no. 4, pp. 1202-1204JournalApril, 2017
S. Liu,
E.-S. Jun,
R. Kettimuthu,
X.-H. Sun,
M. Papka
Towards Optimizing Large-Scale Data Transfers with End-to-End Integrity Verification4th International Workshop on Distributed Storage Systems and Coding for Big Data, in conjunction with IEEE BigData 2016. Washington, D.C., USAWorkshopDecember, 2016
Z. Zhou,
X. Yang,
Z. Lan,
P. Rich,
W. Tang,
V. Morozov,
N. Desai
Improving Batch Scheduling on Blue Gene/Q by Relaxing 5D Torus Network Allocation ConstraintsIEEE Transactions on Parallel and Distributed Systems (TPDS), vol. 27, no. 11, pp. 3269-3282JournalNovember, 2016
S. He,
Y. Wang,
X.-H. Sun
Improving Performance of Parallel I/O Systems through Selective and Layout-Aware SSD CacheIEEE Transactions on Parallel and Distributed Systems (TPDS), vol. 27, no. 10, pp. 2940-2952JournalNovember, 2016
S. He,
Y. Liu,
Y. Wang,
X.-H. Sun
Enhancing Hybrid Parallel File System through Performance and Space-Aware Data LayoutInternational Journal of High Performance Computing Applications (IJHPCA), vol. 30, no 4, pp. 396-410JournalNovember, 2016
X. Yang,
J. Jenkins,
M. Mubarak,
R. Ross,
Z. Lan
Watch Out for the Bully! Job Interference Study on Dragonfly NetworkACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis 2016 (SC'16), Salt Lake City, Utah, USA2016, pp. 750-760ConferenceNovember, 2016
S. Wallace,
X. Yang,
V. Vishwanath,
W. Allcock,
S. Coghlan,
M. Papka,
Z. Lan
A Data Driven Scheduling Approach for Power Management on HPC SystemsACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis 2016 (SC'16), Salt Lake City, Utah, USA2016, pp. 656-666ConferenceNovember, 2016
D. Li,
S. Wang,
S. Yao,
Y.-H. Liu,
Y. Cheng,
X.-H. Sun
Efficient Design Space Exploration by Knowledge TransferEleventh IEEE/ACM/IFIP International Conference on Hardware/Software Codesign and System Synthesis (CODES+ISSS'16), Pittsburgh, PA, USApp. 1-10. 2016.ConferenceOctober, 2016
X. Yang,
S. Liu,
K. Feng,
S. Zhou,
X.-H. Sun
Visualization and Adaptive Subsetting of Earth Science Data in HDFS - A Novel Data Analysis Strategy with Hadoop and Spark6th IEEE International Conference on Big Data and Cloud Computing (BDCloud 2016), Atlanta, GA2016, pp. 89-96ConferenceOctober, 2016
Z. Zhou,
X. Yang,
D. Zhao,
P. Rich,
W. Tang,
J. Wang,
Z. Lan
I/O Aware Job Scheduling and Bandwidth Allocation for Petascale Computing SystemsJournal of Parallel Computing (ParCo), vol. 58, no. C, pp. 107-116JournalOctober, 2016TBA
A. Kougkas,
M. Dorier,
R. Latham,
R. Ross,
X.-H. Sun
Leveraging Burst Buffer Coordination to Prevent I/O InterferenceThe eScience'16, Baltimore, Maryland, USA2016, pp. 371-380ConferenceOctober, 2016
S. He,
Y. Wang,
X.-H. Sun
Boosting Parallel File System Performance with Heterogeneity-Aware Selective Data LayoutIEEE Transactions on Parallel and Distributed Systems (TPDS), 2016. vol. 27, no. 9, pp. 2492-2505JournalSeptember, 2016
B. Xu,
W. Zhang,
X.-H. Sun,
Y. Wang
A memory-driven scheduling scheme and optimization for concurrent execution in GPUJournal of Cluster Computing, 2016. vol. 19, no. 4, pp. 2241-2250JournalSeptember, 2016
X.-H. Sun,
Y.-H. Liu
Utilizing Concurrency: A New Theory for Memory Wall29th International Workshop on Languages and Compilers for Parallel Computing (LCPC2016) (a position paper), Sept, 2016, New York, USApp. 18-23 Springer, Cham.WorkshopSeptember, 2016
E. Berrocal,
L. Bautista-Gomez,
S. Di,
Z. Lan,
F. Cappello
Exploring Partial Replication to Improve Lightweight Silent Data Corruption Detection for HPC Applications22nd International European Conference on Parallel and Distributed Computing (Euro-Par 2016), Grenoble, FranceConferenceAugust, 2016
Y. Chen,
C. Chen,
Y. Yin,
X.-H. Sun,
R. Thakur,
W. D. Gropp
Rethinking High Performance Computing System Architecture for Scientific Big Data Applications14th IEEE International Symposium on Parallel and Distributed Processing with Applications (ISPA 2016), Tianjin, ChinaConferenceAugust, 2016
W. Yang,
C. Xu,
S. He,
X.-H. Sun
On MinMax-Memory Claims for Scientific Workflows in the In-Memory Cloud Computing36th International Conference on Distributed Computing Systems (ICDCS), Nara Hotel, Nara, JapanPosterJune, 2016
D. Li,
S. Yao,
Y.-H. Liu,
S. Wang,
X.-H. Sun
Efficient Design Space Exploration via Statistical Sampling and AdaBoost Learning53rd Design Automation Conference (DAC'16), Texas, Austin, USAConferenceJune, 2016
D. Zhao,
N. Liu,
D. Kimpe,
R. Ross,
X.-H. Sun,
I. Raicu
Towards Exploring Data-Intensive Scientific Applications at Extreme Scales through Systems and SimulationsIEEE Transactions on Parallel and Distributed Systems (TPDS), 2016. vol. 27, no. 6, pp. 1824-1837JournalJune, 2016
A. Kougkas,
A. Fleck,
X.-H. Sun
Towards Energy Efficient Data Management in HPC: The Open Ethernet Drive ApproachPDSW-DISCS'16, in conjunction with SC'16WorkshopJanuary, 2016
A. Haider,
X. Yang,
N. Liu,
S. He,
X.-H. Sun
IC-Data: Improving Compressed Data Processing in Hadoop22nd annual IEEE International Conference on High Performance Computing (HiPC 2015), Bengaluru, IndiaConferenceDecember, 2015
X. Yang,
C. Feng,
Z. Xu,
X.-H. Sun
Dominoes: Speculative Repair in Erasure Coded Hadoop System22nd annual IEEE International Conference on High Performance Computing (HiPC 2015), Bengaluru, IndiaConferenceDecember, 2015
Y.-H. Liu,
X.-H. Sun
C^2-bound: A Capacity and Concurrency driven Analytical Model for Manycore DesignACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis 2015 (SC'15). Texas, Austin, USAConferenceNovember, 2015
H. Eslami,
A. Kougkas,
M. Kotsifakou,
T. Kasampalis,
K. Feng,
Y. Lu,
W. D. Gropp,
X.-H. Sun,
Y. Chen,
R. Thakur
Efficient Disk-to-Disk Sorting: A Case Study in Decoupled Execution ParadigmData Intensive Scalable Computing Systems Workshop (DISCS), in conjunction with ACM/IEEE SuperComputing 2015, Austin, TX, USAWorkshopNovember, 2015
A. Haider,
S. Mickelson,
J. Dennis,
X.-H. Sun
Lessons from Post-processing Climate Data on Modern Flash-based HPC SystemsACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis 2015 (SC'15). Texas, Austin, USAPosterNovember, 2015TBA
X. Yang,
N. Liu,
B. Feng,
X.-H. Sun,
S. Zhou
PortHadoop: Support Direct HPC Data Processing in HadoopIEEE International Conference on Big Data (IEEE BigData 2015). Santa Clara, CA, USAConferenceOctober, 2015
S. Zhou,
X. Yang,
X. Li,
T. Matsui,
S. Liu,
X.-H. Sun,
W. Tao
A Hadoop-Based Visualization and Diagnosis Framework for Earth Science DataBig Data in the Geosciences Workshop, in conjunction with IEEE International Conference on Big Data (IEEE BigData 2015). Santa Clara, CA, USAWorkshopOctober, 2015
K. Wang,
N. Liu,
I. Sadooghi,
X. Yang,
X. Zhou,
M. Lang,
X.-H. Sun,
I. Raicu
Overcoming Hadoop Scaling Limitations through Distributed Task ExecutionIEEE International Conference on Cluster Computing 2015 (Cluster'15), Chicago, IL, USAConferenceSeptember, 2015
B. Feng,
X. Yang,
K. Feng,
Y. Yin,
X.-H. Sun
IOSIG+: on the Role of I/O Tracing and Analysis for Hadoop SystemsIEEE International Conference on Cluster Computing 2015 (Cluster'15), Chicago, IL, USAWorkshopSeptember, 2015
K. Feng,
M. G. Venkata,
D. Li,
X.-H. Sun
Fast Fault Injection and Sensitivity Analysis for Collective CommunicationsIEEE International Conference on Cluster Computing 2015 (Cluster'15), Chicago, IL, USAConferenceSeptember, 2015
Z. Zhou,
X. Yang,
D. Zhao,
P. Rich,
W. Tang,
J. Wang,
Z. Lan
I/O-Aware Batch Scheduling for Petascale Computing SystemsIEEE International Conference on Cluster Computing 2015 (Cluster'15), Chicago, IL, USAConferenceSeptember, 2015
Y.-H. Liu,
X.-H. Sun
LPM: Concurrency-driven Layered Performance Matching44th International Conference on Parallel Processing (ICPP'15), Beijing, ChinaConferenceSeptember, 2015
S. He,
X.-H. Sun,
Y. Wang,
A. Kougkas,
A. Haider
A Heterogeneity-Aware Region-Level Data Layout Scheme for Hybrid Parallel File Systems44th International Conference on Parallel Processing (ICPP'15), Beijing, ChinaConferenceSeptember, 2015
C. Feng,
X. Yang,
F. Liang,
X.-H. Sun,
Z. Xu
LCIndex, A Local and Clustering Index on Distributed Ordered Tables for Multi-Dimensional Range Queries44th International Conference on Parallel Processing (ICPP'15), Beijing, ChinaConferenceSeptember, 2015
N. Liu,
X.-H. Sun,
D. Jin
On Massively Parallel Simulation of Large-Scale Fat-Tree Networks for HPC Systems and Data Centers29th ACM SIGSIM Conference on Principles of Advanced Discrete Simulation (ACM SIGSIM PADS), London, UKPosterJune, 2015TBA
B. Wang,
W. Yu,
X.-H. Sun,
X. Wang
DaCache: Memory Divergence-Aware GPU Cache Management29th International Conference on Supercomputing (ICS'15), Newport Beach, CA. USAConferenceJune, 2015
R. Ranjan,
L. Wang,
A. Y. Zomaya,
D. Georgakopoulos,
X.-H. Sun,
G. Wang
Recent Advances in Autonomic Provisioning of Big Data Applications on CloudsIEEE Transaction on Cloud Computing, vol. 3, no. 2, pp. 101-104JournalJune, 2015
N. Liu,
A. Haider,
X.-H. Sun,
D. Jin
FatTreeSim: Modeling a Large-scale Fat-Tree Network for HPC Systems and Data Centers Using Parallel and Discrete Event Simulation29th ACM SIGSIM Conference on Principles of Advanced Discrete Simulation (ACM SIGSIM PADS), London, UKConferenceJune, 2015
S. He,
X.-H. Sun,
A. Haider
HAS: Heterogeneity-Aware Selective Data Layout Scheme for Parallel File Systems on Hybrid Servers29th IEEE International Parallel and Distributed Processing Symposium (IPDPS'15), Hyderabad, IndiaConferenceMay, 2015
Z. Zhou,
X. Yang,
Z. Lan,
P. Rich,
W. Tang,
V. Morozov,
N. Desai
Improving Batch Scheduling on Blue Gene/Q by Relaxing 5D Torus Network Allocation Constraints29th IEEE International Parallel and Distributed Processing Symposium (IPDPS'15), Hyderabad, IndiaConferenceMay, 2015
N. Liu,
X. Yang,
X.-H. Sun,
J. Jenkins,
R. Ross
YARNsim: Hadoop YARN Simulation System15th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid 2015), Shenzhen, Guangdong, ChinaConferenceMay, 2015
J. Wu,
X. Xiong,
Z. Lan
Hierarchical Task Mapping for Parallel Applications on SupercomputersThe Journal of Supercomputing, vol. 71, no. 5, pp. 1776-1802JournalMay, 2015
Z. Zheng,
L. Yu,
Z. Lan
Reliability-Aware Speedup Models for Parallel Applications with Coordinated Checkpointing/RestartIEEE Transactions on Computers, vol. 64, no. 5, pp. 1402-1415JournalMay, 2015
Y. Liu,
X.-H. Sun
Reevaluating Data Stall Time with the Consideration of Data Access ConcurrencyJournal Of Computer Science And Technology, vol. 30, no. 2, pp. 227-245JournalMarch, 2015
Y. Yin,
H. Jin,
X.-H. Sun
I/O and File Systems for Data Intensive ApplicationsThe Handbook on Data Centers, Springer, 2015, pp 561-582, Print ISBN: 978-1-4939-2091-4, Online ISBN: 978-1-4939-2092-1JournalJanuary, 2015
S. He,
Y. Liu,
X.-H. Sun
PSA: A Performance and Space-Aware Data Layout Scheme for Hybrid Parallel File SystemsData Intensive Scalable Computing Systems Workshop (DISCS), in conjunction with ACM/IEEE SuperComputing 2014, New Orleans, LA, USAWorkshopNovember, 2014
B. Feng,
N. Liu,
S. He,
X.-H. Sun
HPIS3: Towards a High-Performance Simulator for Hybrid Parallel I/O and Storage Systems9th Parallel Data Storage Workshop (PDSW'14), in conjunction with ACM/IEEE SuperComputing 2014, New Orleans, LA, USAWorkshopNovember, 2014
Y. Yin,
A. Kougkas,
K. Feng,
H. Eslami,
Y. Lu,
X.-H. Sun,
R. Thakur,
W. D. Gropp
Rethinking Key-Value Store for Parallel I/O OptimizationData Intensive Scalable Computing Systems Workshop (DISCS), in conjunction with ACM/IEEE SuperComputing 2014, New Orleans, LA, USAWorkshopNovember, 2014
X. Yang,
Y. Yin,
H. Jin,
X.-H. Sun
SCALER: Scalable Parallel File Write in HDFSInternational Conference on Cluster Computing 2014 (Cluster'14), Madrid, SpainConferenceSeptember, 2014
E. Berrocal,
L. Yu,
S. Wallace,
M. Papka,
Z. Lan
Exploring Void Search for Fault Detection on Extreme Scale SystemsIEEE International Conference on Cluster Computing 2014 (Cluster'14), Madrid, SpainConferenceSeptember, 2014
X. Yang,
X. Zheng,
Z. Zhou,
W. Tang,
J. Wang,
Z. Lan
Balancing Job Performance with System Performance via Locality-Aware Scheduling on Torus-Connected SystemsIEEE International Conference on Cluster Computing 2014 (Cluster'14), Madrid, SpainConferenceSeptember, 2014
C. Chen,
Y. Chen,
K. Feng,
Y. Yin,
H. Eslami,
R. Thakur,
X.-H. Sun,
W. D. Gropp
Decoupled I/O for Data-Intensive High Performance ComputingSeventh International Workshop on Parallel Programming Models and Systems Software for High-End Computing (P2S2), in conjunction with the International Conference on Parallel Processing (ICPP-2014), Minneapolis, MN, USAWorkshopSeptember, 2014
S. He,
X.-H. Sun,
B. Feng,
K. Feng
Performance-Aware Data Placement in Hybrid Parallel File Systems14th International Conference on Algorithms and Architectures for Parallel Processing (ICA3PP), Dalian, ChinaConferenceAugust, 2014
D. Wang,
X.-H. Sun
APC: A Novel Memory Metric and Measurement Methodology for Modern Memory SystemIEEE Transactions on Computers, vol. 63, no. 7, pp. 1626-1639JournalJuly, 2014
X.-H. SunC-AMAT: a data access model for the Big Data eraCommunication of CCF, vol. 10, no. 6, pp. 19-22JournalJune, 2014
S. He,
X.-H. Sun,
B. Feng
S4D-Cache: Smart Selective SSD Cache for Parallel I/O SystemsInternational Conference on Distributed Computing Systems (ICDCS), Madrid, SpainConferenceJune, 2014
X.-H. Sun,
D. Wang
Concurrent Average Memory Access TimeIEEE Computer, vol. 47, no. 5, pp. 74-80JournalMay, 2014
X.-H. SunConcurrent-AMAT: a mathematical model for Big Data accessHPC TodayJournalMay, 2014
X. Yang,
Z. Zhou,
S. Wallace,
Z. Lan,
W. Tang,
S. Coghlan,
M. E. Papka
Integrating dynamic pricing of electricity into energy aware scheduling for HPC systemsInternational Conference for High Performance Computing, Networking, Storage and Analysis, Denver, CO, USAConferenceNovember, 2013
J. He,
J. Kowalkowski,
M. Paterno,
D. J. Holmgren,
J. N. Simone,
X.-H. Sun
Layout-Aware Scientific Computing-A Case Study using the MILC CodeJournal of Computational Science, vol. 4, no. 6, pp. 496-506JournalNovember, 2013
K. Feng,
Y. Yin,
C. Chen,
H. Eslami,
X.-H. Sun,
Y. Chen,
R. Thakur,
W. D. Gropp
Runtime System Design of Decoupled Execution Paradigm for Data-Intensive High-End Computing (Poster Presentation)IEEE International Conference on Cluster Computing 2013 (Cluster'13), Indianapolis, IN, USAConferenceSeptember, 2013
J. He,
J. Bent,
A. Torres,
G. Grider,
G. Gibson,
C. Maltzahn,
X.-H. Sun
I/O Acceleration with Pattern Detection22th International ACM Symposium on High Performance Distributed Computing (HPDC'13), New York City, NY, USAConferenceSeptember, 2013
H. Jin,
X.-H. Sun
Performance Comparison under Failures of MPI and MapReduce: An Analytical ApproachFuture Generation Computer Systems (FGCS), vol. 29, no. 7, pp. 1808-1815JournalSeptember, 2013
S. He,
X.-H. Sun,
B. Feng,
X. Huang,
K. Feng
A Cost-Aware Region-Level Data Placement Scheme for Hybrid Parallel I/O SystemsIEEE International Conference on Cluster Computing 2013 (Cluster'13), Indianapolis, IN, USAConferenceSeptember, 2013
Y. Yin,
J. Li,
J. He,
X.-H. Sun,
R. Thakur
Pattern-Direct and Layout-Aware Replication Scheme for Parallel I/O SystemsIEEE International Parallel and Distributed Processing Symposium (IPDPS' 13), Phoenix, AZ, USAConferenceMay, 2013
S. He,
X.-H. Sun,
Y. Yin
BPS: A Performance Metric of I/O System2013 International Workshop on High Performance Data Intensive Computing (HPDIC 2013), in Conjunction With IEEE IPDPS 2013, Boston, Massachusetts, USAWorkshopMay, 2013
J. He,
J. Bent,
A. Torres,
G. Grider,
G. Gibson,
C. Maltzahn,
X.-H. Sun
Discovering Structure in Unstructured I/OThe 7th Parallel Data Storage Workshop (PDSW'12), in conjunction with ACM/IEEE SuperComputing 2012, Salt Lake City, UT, USAWorkshopNovember, 2012
J. Wu,
Z. Lan,
X. Xiong,
N. Gnedin,
A. Kravtsov
Hierarchical Task Mapping of Cell-based AMR Cosmology SimulationsACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis 2012 (SC'12). Salt Lake City, UT, USAConferenceNovember, 2012
Y. Chen,
H. Zhu,
H. Jin,
X.-H. Sun
Algorithm-level Feedback-controlled Adaptive Data Prefetcher: Accelerating Data Access for High-Performance ProcessorsParallel Computing (ParCo), vol. 38, no. 10-11, pp. 533-551JournalOctober, 2012
Z. Zheng,
L. Yu,
Z. Lan,
T. Jones
3-Dimensional Root Cause Diagnosis via Co-AnalysisInternational Conference on Autonomic Computing 2012 (ICAC'12), San Jose, CA, USAConferenceSeptember, 2012
W. Tang,
D. Ren,
Z. Lan,
N. Desai
Adaptive Metric-Aware Job Scheduling for Production Supercomputers5th International Workshop on Parallel Programming Models and Systems Software for High-End Computing (P2S2), 2012, in conjunction with ICPP 2012, Pittsburgh, PA, USAWorkshopSeptember, 2012
H. Jin,
J. Ji,
X.-H. Sun,
Y. Chen,
R. Thakur
CHAIO: Enabling HPC Applications on Data-Intensive File Systems41th International Conference on Parallel Processing (ICPP), Pittsburgh, PAConferenceSeptember, 2012
J. He,
X.-H. Sun,
R. Thakur
KNOWAC: I/O Prefetch via Accumulated KnowledgeIEEE International Conference on Cluster Computing (Cluster'12), Beijing, ChinaConferenceSeptember, 2012
Y. Chen,
C. Chen,
X.-H. Sun,
W. D. Gropp,
R. Thakur
A Decoupled Execution Paradigm for Data-Intensive High-End ComputingIEEE International Conference on Cluster Computing 2012 (Cluster'12), Beijing, China,ConferenceSeptember, 2012
X.-H. Sun,
D. Wang
APC: A Performance Metric of Memory SystemsACM SIGMETRICS Performance Evaluation Review, vol. 40, no. 2, pp. 125-130JournalSeptember, 2012
L. Yu,
Z. Zheng,
Z. Lan,
T. Jones,
J. Brandt,
A. Gentile
Filtering Log Data: Finding the needles in the HaystackInternational Conference on Dependable Systems and Networks 2012 (DSN'12), Boston, MA, USAConferenceJune, 2012
H. Jin,
X. Yang,
X.-H. Sun,
I. Raicu
ADAPT: Availability-aware MapReduce Data Placement in Non-Dedicated Distributed Computing Environment32nd International Conference on Distributed Computing Systems (ICDCS), Macau, ChinaConferenceJune, 2012
Y. Yin,
S. Byna,
H. Song,
X.-H. Sun,
R. Thakur
Boosting Application-Specific Parallel I/O Optimization Using IOSIGIEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid), Ottawa, CanadaConferenceMay, 2012
H. Jin,
X.-H. Sun
Performance Comparison under Failures of MPI and MapReduce: An Analytical Approach2nd International Workshop on Cloud Computing and Scientific Applications (CCSA), in conjunction with CCGrid 2012, Ottawa, CanadaWorkshopMay, 2012TBA
R. Ge,
X. Feng,
X.-H. Sun
SERA-IO: Integrating Energy Consciousness into Parallel I/O MiddlewareIEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid), Ottawa, CanadaConferenceMay, 2012
H. Zou,
X.-H. Sun,
S. Ma,
X. Duan
A Source-Aware Interrupt Scheduling for Modern Parallel I/O SystemsIEEE International Parallel and Distributed Processing Symposium (IPDPS' 12), Shanghai, ChinaConferenceMay, 2012
H. Jin,
T. Ke,
Y. Chen,
X.-H. Sun
Checkpointing Orchestration: Toward a Scalable HPC Fault-Tolerant EnvironmentIEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid), Ottawa, CanadaConferenceMay, 2012
H. Song,
H. Jin,
J. He,
X.-H. Sun,
R. Thakur
A Server-Level Adaptive Data Layout Strategy for Parallel File Systems2012 International Workshop on High Performance Data Intensive Computing(HPDIC 2012), in Conjunction With IEEE IPDPS 2012, Shanghai, ChinaWorkshopMay, 2012
Y. Yu,
D. Rudd,
Z. Lan,
N. Gnedin,
A. Kravtsov,
J. Wu
Improving Parallel IO Performance of Cell-based AMR Cosmology ApplicationsIEEE International Parallel & Distributed Processing Symposium 2012 (IPDPS'12), Shanghai, ChinaConferenceMay, 2012
H. Song,
Y. Yin,
Y. Chen,
X.-H. Sun
Cost-intelligent Application-specific Data layout Optimization for Parallel File SystemsCluster Computing, pp. 1-14JournalFebruary, 2012
Y. Chen,
H. Zhu,
P. C. Roth,
H. Jin,
X.-H. Sun
Global-aware and Multi-order Context-based Prefetching for High-Performance ProcessorsSpecial issue on Programming Models, Software and Tools for High-End Computing of the International Journal of High Performance Computing Applications (IJHPCA), vol. 25, no. 4, pp. 355-370JournalNovember, 2011
H. Song,
Y. Yin,
X.-H. Sun,
R. Thakur,
S. Lang
Server-Side I/O Coordination for Parallel File SystemsThe ACM/IEEE SuperComputing Conference (SC'11), Seattle, WA, USAConferenceNovember, 2011
X.-H. Sun,
D. Wang
Memory Access Cycle and the Measurement of Memory SystemsThe 2nd International Workshop on Performance Modeling, Benchmarking and Simulation of High Performance Computing Systems (PMBS'11), in conjunction with IEEE/ACM SuperComputing 2011, Seattle, WA, USAWorkshopNovember, 2011
J. He,
H. Song,
X.-H. Sun,
Y. Yin,
R. Thakur
Pattern-aware File Reorganization in MPI-IOThe 6th Parallel Data Storage Workshop (PDSW'11), in conjunction with ACM/IEEE SuperComputing 2011, Seattle, WA, USAWorkshopNovember, 2011
J. He,
J. Kowalkowski,
M. Paterno,
D. J. Holmgren,
J. N. Simone,
X.-H. Sun
Layout-aware Scientific Computing - A Case Study Using MILCWorkshop on Latest Advances in Scalable Algorithms for Large-Scale Systems (ScalA'11), in conjunction with ACM/IEEE SuperComputing 2011, Seattle, WA, USAWorkshopNovember, 2011
J. Wu,
R. Gonzalez,
Z. Lan,
N. Gnedin,
A. Kravtsov,
D. Rudd,
Y. Yu
Performance Emulation of Cell-based AMR Cosmology SimulationsThe IEEE International Conference on Cluster Computing (CLUSTER), Austin, TexasConferenceSeptember, 2011
W. Tang,
N. Desai,
V. Vishwanath,
D. Buettner,
Z. Lan
Job Coscheduling on Coupled High-End Computing SystemsThe International Conference on Parallel Processing Workshops (ICPPW'11), Taipei, TaiwanConferenceSeptember, 2011
D. Wang,
X.-H. Sun,
N. Hu,
N. Sun
EthSpeeder: A High-performance Scalable Fault-Tolerant Ethernet Network Architecture for Data CenterThe 6th IEEE International Conference on Networking, Architecture, and Storage (NAS2011), Dalian, ChinaConferenceJuly, 2011
L. Yu,
Z. Zheng,
Z. Lan,
S. Coghlan
Practical Online Failure Prediction for Blue Gene/P: Period-based vs Event-drivenThe Proactive Failure Avoidance, Recovery, and Maintenance workshop(in conjunction with DSN'11), Hong Kong, ChinaWorkshopJune, 2011
H. Song,
Y. Yin,
Y. Chen,
X.-H. Sun
A Cost-intelligent Application-specific Data layout Scheme for Parallel File SystemsThe 20th International ACM Symposium on High Performance Distributed Computing (HPDC'11), San Jose, CAConferenceJune, 2011
Y. Chen,
X.-H. Sun,
R. Thakur,
P. C. Roth,
W. D. Gropp
LACIO: A New Collective I/O Strategy for Parallel I/O SystemsThe IEEE International Parallel and Distributed Processing Symposium (IPDPS' 11), Anchorage, AK, USAConferenceMay, 2011
Z. Zheng,
L. Yu,
W. Tang,
Z. Lan,
R. Gupta,
N. Desai,
S. Coghlan,
D. Buettner
Co-Analysis of RAS Log and Job Log on Blue Gene/PThe IEEE International Parallel and Distributed Processing Symposium (IPDPS' 11), Anchorage, AK, USAConferenceMay, 2011
W. Tang,
Z. Lan,
N. Desai,
D. Buettner,
Y. Yu
Reducing Fragmentation on Torus-Connected SupercomputersThe IEEE International Parallel and Distributed Processing Symposium (IPDPS' 11), Anchorage, AK, USAConferenceMay, 2011
H. Song,
Y. Chen,
X.-H. Sun
A Hybrid Shared-nothing/Shared-data Storage Architecture for Large Scale Databases(Poster Presentation)The 11th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid'11), Newport Beach, CA, USAConferenceMay, 2011
H. Song,
Y. Yin,
X.-H. Sun,
R. Thakur,
S. Lang
A Segment-Level Adaptive Data Layout Scheme for Improved Load Balance in Parallel File SystemsThe 11th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid'11), Newport Beach, CA, USAConferenceMay, 2011
K. Zhang,
Z. Wang,
Y. Chen,
H. Zhu,
X.-H. Sun
PAC-PLRU: A Cache Replacement Policy to Salvage Discarded Predictions from Hardware PrefetchersThe 11th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid'11), Newport Beach, CA, USAConferenceMay, 2011
H. Song,
X.-H. Sun,
Y. Che
A Hybrid Shared-nothing/Shared-data Storage Scheme for Large-scale Data ProcessingThe 9th IEEE International Symposium on Parallel and Distributed Processing with Applications (ISPA'11), Busan, KoreaConferenceMay, 2011
H. Jin,
K. Qiao,
X.-H. Sun,
Y. Li
Performance under Failures of MapReduce Applications (Poster Presentation)The 11th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid'11), Newport Beach, CA, USAConferenceMay, 2011
H. Jin,
X.-H. Sun,
Y. Chen,
T. Ke
REMEM: REmote MEMory as Checkpointing StorageThe 2nd International Conference on Cloud Computing, Indianapolis, IN, USAConferenceNovember, 2010
H. Song,
X.-H. Sun,
H. Jin,
Y. Chen
Trace-based Adaptive Data Layout Optimization for Parallel File systems (Poster Presentation)The 5th Petascale Data Storage Workshop, in conjunction with SuperComputing 2010, New Orleans, LA, USAWorkshopNovember, 2010
R. Ge,
X. Feng,
J. Hu,
X.-H. Sun
Assessing Energy Efficiency of Parallel I/O SystemsThe ACM/IEEE SuperComputing Conference (SC'10), New Orleans, LA, USAConferenceNovember, 2010TBA
Y. Chen,
X.-H. Sun,
R. Thakur,
H. Song,
H. Jin
Improving Parallel I/O Performance with Data Layout AwarenessThe IEEE International Conference on Cluster Computing 2010 (Cluster10), Heraklion, GreeceConferenceSeptember, 2010
H. Jin,
Y. Chen,
H. Zhu,
X.-H. Sun
Optimizing HPC Fault-Tolerant Environment: An Analytical ApproachThe 39th International Conference on Parallel Processing (ICPP'2010), San Diego, CA, USAConferenceSeptember, 2010
Y. Chen,
H. Zhu,
H. Jin,
X.-H. Sun
Improving the Effectiveness of Context-based Prefetching with Multi-order AnalysisThe 3rd International Workshop on Parallel Programming Models and Systems Software for High-End Computing (P2S2), San Diego, CA, USAWorkshopSeptember, 2010
H. Zhu,
Y. Chen,
X.-H. Sun
Timing Local Streams: Improving Timeliness in Data PrefetchingThe 24th International Conference on Supercomputing (ICS'10), Tsukuba, JapanConferenceJune, 2010
Z. Lan,
J. Gu,
Z. Zheng,
R. Thakur,
S. Coghlan
A Study of Dynamic Meta-Learning for Failure Prediction in Large-Scale SystemsJournal of Parallel and Distributed Computing, vol. 70, pp. 630-643JournalJune, 2010
Z. Zheng,
Z. Lan,
R. Gupta,
S. Coghlan,
P. Beckman
A Practical Failure Prediction with Location and Lead Time for Blue Gene/PThe Fault-Tolerance at Extreme Scale workshop (in conjunction with DSN'10), Chicago, IL, USAWorkshopJune, 2010
Y. Chen,
H. Song,
R. Thakur,
X.-H. Sun
A Layout-aware Optimization Strategy for Collective I/OThe High Performance Distributed Computing (HPDC-2010), Chicago, IL, USAWorkshopJune, 2010
Y. Chen,
H. Zhu,
X.-H. Sun
An Adaptive Data Prefetcher for High-Performance ProcessorsThe 10th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid'10), Melbourne, AustraliaConferenceMay, 2010
R. Ge,
X. Feng,
S. Subramanya,
X.-H. Sun
Characterizing the Energy Efficiency of I/O Intensive Parallel Applications on Power-Aware ClustersThe 6th workshop on high performance power-aware computing in conjunction with the 24th IEEE International Parallel and Distributed Processing Symposium, Atlanta, GA, USAWorkshopApril, 2010
W. Tang,
N. Desai,
D. Buettner,
Z. Lan
Analyzing and Adjusting User Runtime Estimates to Improve Job Scheduling on Blue Gene/PThe IPDPS'10, Atlanta, GA, USAConferenceApril, 2010
X.-H. Sun,
Y. Chen
Reevaluating Amdahl's Law in the Multicore EraJournal of Parallel and Distributed Computing, vol. 70, no. 2, pp. 183-188JournalFebruary, 2010
Z. Lan,
Z. Zheng,
Y. Li
Toward Automated Anomaly Identification in Large-Scale SystemsIEEE Transactions on Parallel and Distributed Systems, vol. 21, no. 2, pp. 174 - 187JournalFebruary, 2010
X.-H. Sun,
S. Byna,
D. J. Holmgren
Modeling Data Access Contention in Multicore ArchitecturesThe Fifteenth International Conference on Parallel and Distributed Systems (ICPADS'09), Shenzhen, ChinaConferenceDecember, 2009TBA
B. Xie,
Y. Chen,
X.-H. Sun,
H. Jin
Performance under Failure of Multi-tier Web ServicesWorkshop on Internet-based Virtual Computing Environment (in conjunction with ICPADS'09), Shenzhen, ChinaWorkshopDecember, 2009TBA
X.-H. Sun,
Y. Chen,
Y. Yin
Data Layout Optimization for Petascale File SystemsThe 4th Petascale Data Storage Workshop (in conjunction with ACM/IEEE SC'09), Portland, OR, USAWorkshopNovember, 2009
H. Jin,
X.-H. Sun,
B. Xie,
Y. Chen
An Implementation and Evaluation of Memory-based Checkpointing (Poster Presentation)The ACM/IEEE SuperComputing Conference(SC'09), Portland, OR, USAConferenceNovember, 2009
X.-H. Sun,
C. Du,
H. Zou,
Y. Chen,
P. Shukla
V-MCS: A Configuration System for Virtual MachinesThe Workshop on Web 2.0 on e-Research Infrastructure, Services and Applications (in conjunction with Cluster'09), New Orleans, LA, USAWorkshopAugust, 2009TBA
Z. Zheng,
Z. Lan
Reliability-Aware Scalability Models for High Performance ComputingThe IEEE Cluster'09, New Orleans, LA, USAConferenceAugust, 2009TBA
W. Tang,
Z. Lan,
N. Desai,
D. Buettner
Fault-Aware, Utility-Based Job Scheduling on Blue Gene/P SystemsThe IEEE Cluster'09, New Orleans, LA, USAConferenceAugust, 2009TBA
Z. Zheng,
Z. Lan,
B.-H. Park,
A. Geist
System Log Pre-processing to Improve Failure PredictionThe IEEE/IFIP International Conference on Dependable Systems and Networks (DSN'09), Estoril, Lisbon, PortugalConferenceJune, 2009TBA
H. Jin,
X.-H. Sun,
Z. Zheng,
Z. Lan,
B. Xie
Performance under Failures of DAG-based Parallel ComputingThe IEEE/ACM International Symposium on Cluster Computing and the Grid (CCGrid'09), Shanghai, ChinaConferenceMay, 2009
Z. Fang,
X.-H. Sun,
Y. Chen,
S. Byna
Core-Aware Memory Access Scheduling SchemesThe IEEE International Parallel & Distributed Processing Symposium (IPDPS'09), Rome, ItalyConferenceMay, 2009
S. Byna,
Y. Chen,
X.-H. Sun
Taxonomy of data prefetching for multicore processorsJournal of Computer Science and Technology, vol. 24, no. 3, pp. 405-417JournalMay, 2009
Y. Li,
Z. Lan,
P. Gujrati,
X.-H. Sun
Fault-Aware Runtime Strategies for High Performance ComputingIEEE Transactions on Parallel and Distributed Systems, vol. 20, no. 4, pp. 460-473JournalApril, 2009
M. Wu,
X.-H. Sun
QoS of Grid ComputingGrid Technologies and Utility Computing: Concepts for Managing Large-Scale Applications (Encyclopedia of Grid Computing Technologies and Applications), Igi Global, 2009, pp 59-74, ISBN-10: 1605661848, ISBN-13: 978-1605661841JournalJanuary, 2009TBA
C. Du,
P. Shukla,
X.-H. Sun
Virtual Machines in Grid Environments: Dynamic Virtual MachinesGrid Computing: Infrastructure, Service, and Application (Hardcover), CRC, 2009, pp 405-431, ISBN-10: 1420067664, ISBN-13: 978-1420067668JournalJanuary, 2009TBA
L. Piccoli,
J. B. Kowalkowski,
J. N. Simone,
X.-H. Sun,
H. Jin,
D. J. Holmgren,
N. Seenu,
A. G. Singh,
S. Byna,
D. J. Holmgren
Lattice QCD Workflows: A Case Study3rd International Workshop on Scientific Workflows and Business Workflow Standards in e-Science (SWBES)WorkshopDecember, 2008
Y. Chen,
S. Byna,
X.-H. Sun,
R. Thakur,
W. D. Gropp
Hiding I/O Latency with Pre-execution Prefetching for Parallel ApplicationsThe ACM/IEEE SuperComputing Conference (SC'08) Best paper award finalistConferenceNovember, 2008
S. Byna,
Y. Chen,
X.-H. Sun,
R. Thakur,
W. D. Gropp
Parallel I/O Prefetching Using MPI File Caching and I/O SignaturesThe ACM/IEEE SuperComputing Conference (SC'08)ConferenceNovember, 2008
X.-H. Sun,
Y. Chen,
S. Byna
Scalable Computing in Multicore EraThe International Symposium on Parallel Algorithms, Architectures and Programming (PAAP'08)ConferenceSeptember, 2008TBA
Y. Chen,
S. Byna,
X.-H. Sun,
R. Thakur,
W. D. Gropp
Exploring Parallel I/O Concurrency with Speculative PrefetchingThe 37th International Conference on Parallel Processing (ICPP'08)ConferenceSeptember, 2008TBA
J. Gu,
Z. Zheng,
Z. Lan,
J. White,
E. Hocks,
B.-H. Park
Dynamic Meta-Learning for Failure Prediction in Large-scale Systems: A Case StudyThe 37th International Conference on Parallel Processing (ICPP'08)ConferenceSeptember, 2008TBA
L. Piccoli,
J. N. Simone,
J. Kowalkowski
Tracking LQCD WorkflowsLattice 2008PosterJuly, 2008
Y. Li,
Z. Lan
A Fast Recovery Mechanism for Checkpointing in Networked EnvironmentsThe DSN'08ConferenceJune, 2008
S. Byna,
Y. Chen,
X.-H. Sun
A Taxonomy of Data Prefetching MechanismsThe International Symposium on Parallel Architectures, Algorithms, and Networks (I-SPAN)ConferenceMay, 2008
Z. Lan,
Y. Li,
Z. Zheng,
P. Gujrati
Enhancing Application Robustness through Adaptive Fault ToleranceThe NSFNGS Workshop (in conjunction with IPDPS'08)WorkshopApril, 2008TBA
X.-H. Sun,
Z. Lan,
Y. Li,
H. Jin,
Z. Zheng
Towards a Fault-aware Computing EnvironmentThe High Availability and Performance Computing Workshop (HAPCW)WorkshopMarch, 2008
Y. Chen,
X.-H. Sun,
M. Wu
Algorithm-System Scalability of Heterogeneous ComputingJournal of Parallel and Distributed Computing, vol. 68, pp. 1403-1412JournalJanuary, 2008
Z. Lan,
Y. Li
Adaptive Fault Management of Parallel Applications for High Performance ComputingIEEE Transactions on Computers, vol. 57, no. 12, pp. 1647-1660JournalJanuary, 2008
L. Piccoli,
X.-H. Sun,
J. N. Simone
The LQCD Workflow Experience: What We Have Learned (Poster Presentation)The ACM/IEEE SuperComputing Conf. 2007 (SC'07)ConferenceNovember, 2007
M. Wu,
X.-H. Sun,
H. Jin
Performance under Failure of High-End ComputingThe ACM/IEEE SuperComputing Conf. 2007 (SC'07)ConferenceNovember, 2007
Y. Chen,
S. Byna,
X.-H. Sun
Data Access History Cache and Associated Data Prefetching MechanismsThe ACM/IEEE SuperComputing Conf. 2007 (SC'07)ConferenceNovember, 2007
Z. Zheng,
Y. Li,
Z. Lan
Anomaly Localization in Large-scale ClustersThe IEEE Cluster'07ConferenceSeptember, 2007TBA
P. Gujrati,
Y. Li,
Z. Lan,
R. Thakur,
J. White
Exploring Meta-learning to Improve Failure Prediction in Supercomputing ClustersThe 2007 International Conference on Parallel Processing (ICPP'07)ConferenceSeptember, 2007TBA
Y. Li,
P. Gujrati,
Z. Lan,
X.-H. Sun
Fault-Driven Re-Scheduling For Improving System-level Fault ResilienceThe 2007 International Conference on Parallel Processing (ICPP'07)ConferenceSeptember, 2007TBA
X.-H. Sun,
M. Wu
Quality of Service of Grid Computing: Resource SharingThe 6th International Conference on Grid and Cooperative Computing(GCC'07)ConferenceAugust, 2007
Y. Li,
Z. Lan
Using Adaptive Fault Tolerance to Improve Application Robustness on the TeraGridThe TeraGrid'07WorkshopJune, 2007TBA
Z. Lan,
Y. Li,
P. Gujrati,
Z. Zheng,
R. Thakur,
J. White
A Fault Diagnosis and Prognosis Service for TeraGrid ClustersThe TeraGrid'07WorkshopJune, 2007TBA
K. Xiao,
N. Chen,
S. Ren,
L. Shen,
X.-H. Sun,
K. Kwiat,
M. Macalik
A Workflow-based Non-intrusive Approach for Enhancing the Survivability of Critical Infrastructures in Cyber EnvironmentThe 3rd International Workshop on Software Engineering for Secure Systems (SESS'07)WorkshopMay, 2007TBA
C. Du,
X.-H. Sun,
M. Wu
Dynamic Scheduling with Process MigrationThe IEEE International Symposium on Cluster Computing and the Grid 2007, Rio de Janeiro, BrazilConferenceMay, 2007
X.-H. Sun,
S. Byna,
Y. Chen
Improving Data Access Performance with Server Push ArchitectureThe NSF Next Generation Software Program Workshop (in conjunction with IPDPS '07)WorkshopMarch, 2007
X.-H. Sun,
S. Byna,
Y. Chen
Server-based Data Push Architecture for Multi-processor EnvironmentsJournal of Computer Science and Technology (JCST), vol. 22, no. 5, pp. 641-652JournalJanuary, 2007
K. Cameron,
G. Ge,
X.-H. Sun
lognP and log3P: Accurate analytical models of point-to-point communication in distributed systemsIEEE Trans. on Computer, vol. 56, no. 3, pp. 314-327JournalJanuary, 2007
S. Byna,
X.-H. Sun,
R. Nakhoul
Memory Servers: A Scope of SOA for High-End ComputingThe IEEE Service Computing Conference (SCC) 2006, ChicagoConferenceSeptember, 2006
S. Byna,
X.-H. Sun,
R. Thakur,
W. D. Gropp
Automatic Memory Optimizations for Improving MPI Derived Datatype Performance13th The European PVM/MPI Conference, Bonn, Germany, Lecture Notes in Computer Science, SpringerConferenceSeptember, 2006
M. Wu,
X.-H. Sun,
Y. Chen
QoS Oriented Resource Reservation in Shared EnvironmentsThe 6th IEEE International Symposium on Cluster Computing and the Grid, SingaporeConferenceMay, 2006
C. Du,
X.-H. Sun
MPI-Mitten: Enabling Migration Technology in MPIThe 6th IEEE International Symposium on Cluster Computing and the Grid, SingaporeConferenceMay, 2006
A. Eswaradass,
X.-H. Sun,
M. Wu
Network Bandwidth Predictor (NBP): A System for Online Network Performance ForecastingThe 6th IEEE International Symposium on Cluster Computing and the Grid, SingaporeConferenceMay, 2006
M. Wu,
X.-H. Sun
Grid Harvest Service: A Performance System of Grid ComputingJournal of Parallel and Distributed Computing, vol. 66, no. 10, pp. 1322-1337, 2006. (ACM Computing Review)JournalJanuary, 2006
Y. Chen,
X.-H. Sun
STAS: A Scalability Testing and Analysis SystemThe IEEE International conference on Cluster Computing 2006(Cluster2006)ConferenceJanuary, 2006
Y. Li,
Z. Lan
Exploit Failure Prediction for Adaptive Fault-Tolerance in Cluster ComputingThe IEEE/ACM International Symposium on Cluster Computing and the Grid (CCGrid06) , SingaporeConferenceJanuary, 2006TBA
Z. Lan,
Y. Li
Failure-Aware Resource Selection for Grid ComputingThe IEEE Conference on Dependable Systems and networks (Fast Abstract)ConferenceJanuary, 2006TBA
M. Wu,
X.-H. Sun
The GHS Grid Scheduling System: Implementation and Performance ComparisonThe NSF Next Generation Software Program Workshop (in conjunction with IPDPS06), IEEE CS pressWorkshopJanuary, 2006
X.-H. Sun,
Y. Chen,
M. Wu
Scalability of Heterogeneous ComputingThe 34rd International Conference on Parallel Processing, Oslo, NorwayConferenceJune, 2005
V. K. Gurbani,
A. Brusilovsky,
X.-H. Sun
Ubiquitous Services in the Next Generation Network: Constraining and Facilitating ForcesThe Eurescom Summit 2005: Ubiquitous Services and Applications Exploiting the Potential, Heidelberg, GermanyConferenceApril, 2005
A. Eswaradass,
X.-H. Sun,
M. Wu
A Neural Network Based Predictive Mechanism for Available BandwidthThe 19th International Parallel and Distributed Processing Symposium (IPDPS05), Denver, ColoradoConferenceApril, 2005
Y. Zhuang,
X.-H. Sun
Highly Parallel Algorithms for the Numerical Simulation of Unsteady Diffusion ProcessesThe 19th International Parallel and Distributed Processing Symposium (IPDPS05), Denver, ColoradoConferenceApril, 2005TBA
X.-H. Sun,
M. Wu
GHS, A Performance System of Grid ComputingThe NSF Next Generation Software Program Workshop ( in conjunction with IPDPS05), Denver, ColoradoWorkshopApril, 2005
V. K. Gurbani,
X.-H. Sun
A Systematic Approach for Closer Integration of Cellular and Internet ServicesIEEE Network, pp: 26-32WorkshopFebruary, 2005
S. Byna,
K. Cameron,
X.-H. Sun
Isolating costs in shared memory communication bufferingParallel Processing Letters, vol. 15, no. 4, pp. 357-365JournalJanuary, 2005
A.-A. Chien,
X.-H. Sun,
Z.-W. Xu
Viewpoints on Grid StandardsJ. Comput. Sci. Technol. vol. 20, no. 1, pp. 141-143JournalJanuary, 2005
V. K. Gurbani,
X.-H. Sun,
A. Brusilovsky
Inhibitors for the Ubiquitous Deployment of Services in the Next Generation NetworkIEEE Communications, vol. 43, no. 9JournalJanuary, 2005
X. He,
X.-H. Sun
Incorporating Data Movement into Grid Task SchedulingLecture Notes in Computer Science, vol. 3795, pp. 394 - 405ConferenceJanuary, 2005
V. K. Gurbani,
X.-H. Sun
Extensions to an Internet signaling protocol to support telecommunication servicesThe IEEE Global Telecommunications Conference, Dallars, TX.ConferenceNovember, 2004
M. Wu,
X.-H. Sun
Memory Conscious Task Partition and Scheduling in Grid EnvironmentsThe 5th IEEE/ACM International Workshop on Grid Computing (in conjunction with SC 2004), pp. 138-145, PittsburghWorkshopNovember, 2004
S. Byna,
X.-H. Sun,
W. D. Gropp,
R. Thakur
Predicting the Memory-Access Cost Based on Data Access PatternsThe IEEE International Conference on Cluster Computing, San DiegoConferenceSeptember, 2004
C. Du,
S. Ghosh,
S. Shankar,
X.-H. Sun
A Runtime System for Automatic Rescheduling of MPI ProgramThe 33rd International Conference on Parallel Processing, Montreal, Quebec, Canada, Aug. 15-18ConferenceAugust, 2004
K. Chanchio,
X.-H. Sun
Communication State Transfer for the Mobility of Concurrent Heterogeneous ComputingIEEE Trans. on Computers, vol. 53, no. 10, pp. 1260-1273JournalJanuary, 2004
V. K. Gurbani,
X.-H. Sun
Terminating Telephony Services on the InternetACM/IEEE Trans. on Networking, vol. 12, no. 4, pp. 571-581JournalJanuary, 2004
X.-H. Sun,
A. R. Blatecky
Middleware: the key to next generation computingJ. Parallel Distrib. Comput. 64(6): 689-691JournalJanuary, 2004
X.-H. Sun,
W. Zhang
A Parallel Two-Level Hybrid Method for Tridiagonal Systems, and its Application to Fast Poisson SolversIEEE Trans. on Parallel and Distributed Systems, vol. 15, no. 2, pp. 97-106JournalJanuary, 2004
M. Wu,
X.-H. Sun
Self-adaptive Task Allocation and Scheduling of Meta-tasks in Non-dedicated Heterogeneous ComputingInternational Journal of High Performance Computing and Networking , vol. 2, no. 2/3/4, pp. 186-197JournalJanuary, 2004
S. Byna,
W. D. Gropp,
X.-H. Sun,
R. Thakur
Improving the Performance of MPI Derived Datatypes by Optimizing Memory-Access CostThe IEEE International Conference on Cluster Computing, 2003, Hong KongConferenceDecember, 2003
C. Du,
X.-H. Sun,
K. Chanchio
HPCM: A Pre-compiler Aided Middleware for the Mobility of Legacy CodeThe IEEE International Conference on Cluster Computing, 2003, Hong KongConferenceDecember, 2003
M. Wu,
X.-H. Sun
A General Self-adaptive Task Scheduling System for Non-dedicated Heterogeneous ComputingThe IEEE International Conference on Cluster Computing, 2003, Hong KongConferenceDecember, 2003
V. K. Gurbani,
X.-H. Sun
Accessing telephony services from the InternetThe IEEE International Conference on Computer Communications and Networks} (ICCCN03), Dallars, TX.ConferenceOctober, 2003
A. Daftari,
N. Mehta,
S. Bakre,
X.-H. Sun
On Design Framework of Context Aware Embedded SystemsThe 2003 DoD Monterey Workshop, Software Engineering for Embedded Systems: From Requirements to Implementation, ChicagoWorkshopSeptember, 2003TBA
S. Anand,
S. Yoginath,
G. Laszewski,
B. Alunkal,
X.-H. Sun
Flow-based Multistage Co-allocation ServiceThe 2003 International Conference on Communications in Computing, Las Vegas, NevadaConferenceJune, 2003
G. Laszewski,
B. Alunkal,
J. Gawor,
R. Madhuri,
P. Plaszezak,
X.-H. Sun
A File Transfer Component for GridsThe 2003 International Conference on Parallel and Distributed Processing Techniques and Applications, Las Vegas, NevadaConferenceJune, 2003
V. K. Gurbani,
X.-H. Sun
Services spanning heterogeneous networksThe 2003 IEEE International Conference on Communications (ICC 2003), Anchorage, AlaskaConferenceMay, 2003
K. Cameron,
X.-H. Sun
Quantifying Locality Effect in Data Access Delay: Memory logPThe 2003 IEEE International Parallel and Distributed Processing Symposium (IPDPS 2003), Nice, FranceConferenceApril, 2003
V. K. Gurbani,
X.-H. Sun
Internet Service Execution for Telephony EventsThe IEEE International Conference for Intelligence in Next Generation Networks (ICIN) 2003, Bordeaux, FranceConferenceApril, 2003
X.-H. Sun,
M. Wu
Grid Harvest Service: A System for Long-Term, Application-Level Task SchedulingThe 2003 IEEE International Parallel and Distributed Processing Symposium (IPDPS 2003), Nice, FranceConferenceApril, 2003
X. He,
X.-H. Sun,
G. Laszewski
QoS Guided Min-Min Heuristic for Grid Task SchedulingJournal of Computer Science and Technology, Special Issue on Grid Computing, 18(4)JournalJanuary, 2003
X. He,
X.-H. Sun,
G. Laszewski
A QoS Guided Scheduling Algorithm for the Computational GridThe International Workshop on Grid and Cooperative Computing (GCC02), Hainan, ChianWorkshopDecember, 2002
L. Gong,
X.-H. Sun,
E. Waston
Performance Modeling and Prediction of Non-Dedicated Network ComputingIEEE Trans. on Computers, Vol 51, No 9, pp. 1041-1055JournalSeptember, 2002
K. Chanchio,
X.-H. Sun
SNOW : Software Systems for Process Migration in High-Performance, Heterogeneous Distributed EnvironmentsThe 2002 workshops of International Conference on Parallel Processing Workshop on Compile and Runtime Techniques for Parallel Computing, IEEE CS Press, Vancouver, CanadaWorkshopAugust, 2002TBA
Y. Zhuang,
X.-H. Sun
Stabilized Explicit-implicit Domain Decomposition Methods for the Numerical Solution of Parabolic EquationsSIAM Journal on Scientific Computing , Vol. 24, No. 1, 335-358JournalJuly, 2002
K. Chanchio,
X.-H. Sun
Data collection and restoration for heterogeneous process migrationSOFTWARE--PRACTICE AND EXPERIENCE, 32:1-27JournalApril, 2002
X.-H. Sun,
W. Zhang
A Parallel Two-level Hybrid Method for Diagonal Dominant Tridiagonal SystemsThe 2002 International Parallel and Distributed Processing Symposium (IPDPS 2002), Fort Lauderdale, FLConferenceApril, 2002
X.-H. SunScalability Versus Execution Time in Scalable SystemsJournal of Parallel and Distributed Computing, Vol. 62, No. 2, pp. 173-192JournalFebruary, 2002
X.-H. Sun,
T. Fahringer,
M. Pantano
SCALA: A Performance System for Scalable ComputingInternational Journal of High Performance Computing Applications (IJHPCA), Vol. 16, No. 4, Autumn 2002JournalJanuary, 2002
X. Wu,
Q. Chen,
X.-H. Sun
Design and Development of a Scalable Distributed Debugger for Cluster ComputingCluster Computing, 5, 365-375, 2002ConferenceJanuary, 2002
Y. Zhuang,
X.-H. Sun
Stable, Globally Non-iterative, Non-overlapping Domain Decomposition Parallel Solvers for Parabolic ProblemsThe SuperComputing 2001 (SC2001), DenverConferenceNovember, 2001
K. Chanchio,
X.-H. Sun
Communication State Transfer for the Mobility of Concurrent Heterogenous ComputingThe 2001 the International Conference on Parallel Processing (ICPP 2001) Best Paper AwardConferenceSeptember, 2001
X.-H. Sun,
D. He,
K. Cameron,
Y. Luo
Adaptive Multivariate Regression for Advanced Memory System Evaluation: Application and ExperienceJournal of Performance Evaluation, Volume 45, Issue 1, May 2001, Pages 1-18JournalMay, 2001
K. Chanchio,
X.-H. Sun
A Protocol Design for Communication State Transfer for Distributed ComputingThe 21st International Conference on Distributed Computing Systems (ICDCS 2001)ConferenceApril, 2001
X.-H. SunA Scalable Parallel Algorithm for Periodic Symmetric Toeplitz Tridiagonal SystemsInternational Journal of Computer Research, Vol. 10, No. 1, 2001, pp. 89-98.JournalJanuary, 2001
Y. Zhuang,
X.-H. Sun
A High Order Fast Direct Solver for Singular Poisson EquationsJournal of Computational Physics, Vol. 171, pp. 79-94 (2001).JournalJanuary, 2001
K. Chanchio,
X.-H. Sun
Data Collection and Restoration for Heterogeneous Process MigrationThe 2001 International Parallel and Distributed Processing Symposium (IPDPS 2001).ConferenceJanuary, 2001
D. Khettry,
X.-H. Sun
A Windows-NT Virtual Collaboratory For Technical ComputingInternational Journal on Advances in Engineering Software, Vol. 31, pp. 717-722JournalSeptember, 2000
T. Fahringer,
B. Scholz,
X.-H. Sun
Execution-driven performance analysis for distributed and parallel systemsThe Second ACM International Workshop on Software and Performance (WOSP'2000)WorkshopSeptember, 2000TBA
X.-H. Sun,
K. Cameron
A Statistical-Empirical Hybrid Approach to Hierarchical Memory AnalysisThe Euro-Par 2000, Lecture Notes in Computer Science 1900, SpringerConferenceSeptember, 2000
Y. Zhuang,
X.-H. Sun
A High Order ADI Method For Separable Generalized Helmholtz EquationsInternational Journal on Advances in Engineering Software, Vol. 31, pp. 585-592JournalAugust, 2000
W. Cai,
X.-H. Sun
Adaptive Wavelet ADI Method: Application and parallelizationThe Workshop on High Performance Scientific and Engineering Computing with Applications (HPSECA-00), August, 2000, IEEE Computer PressWorkshopAugust, 2000TBA
X. Wu,
X.-H. Sun
Performance Modeling of Interconnection NetworkThe IEEE Fourth International Conference/Exhibition on High Performance Computing in Asia-Pacific Region (HPC-ASIA 2000)ConferenceMay, 2000TBA
X.-H. Sun,
X. Wu
PDRS: A Performance Data Representation SystemLecture Notes in Computer Science, SpringerConferenceApril, 2000
K. Chanchio,
X.-H. Sun
User-level Process Migration for Heterogeneous Distributed Parallel ComputingThe Newsletter of the IEEE Technical Committee on Distributed ProcessingJournalJanuary, 2000TBA
K. Li,
X.-H. Sun
Average-case Analysis of Isospeed Scalability of Parallel Computations on MultiprocessorsInternational Journal of High Speed Computing, Vol. 11, No. 1, pp. 15-36JournalJanuary, 2000TBA
X. Wu,
Q. Chen,
X.-H. Sun
Design and Implementation of a Java-based Distributed Debugger Supporting PVM and MPIThe The 11th ISASTED Interantional Conference on Parallel and Distributed Computing and Systems, Nov. 1999, Cambridge, MassachusettsConferenceNovember, 1999
D. Khettry,
X.-H. Sun
Virtual Collaboratory in Windows-NT EnvironmentThe 5th NASA National Symposium on Large-Scale Analysis, Design and Intelligent Synthesis EnvironmentsConferenceOctober, 1999TBA
Y. Zhuang,
X.-H. Sun
A High-Order Multilevel ADI Solver for Generalized Helmholtz EquationsThe 5th NASA National Symposium on Large-Scale Analysis, Design and Intelligent Synthesis EnvironmentsConferenceOctober, 1999TBA
X. Liao,
X.-H. Sun
Computer Simulation of PEC NetworkJournal of Simulation Practice and Theory, Vol.7, May, 1999, pp 251-278JournalMay, 1999
X.-H. Sun,
M. Pantano,
T. Fahringer
Integrated Range Comparison for Data-Parallel Compilation SystemsIEEE Trans. on Parallel and Distributed ProcessingJournalMay, 1999
K. Li,
X.-H. Sun
Average-Case Analysis of Isospeed Scalability of Parallel Computations on MultiprocessorsThe IEEE Int'l Symposium on Parallel and Distributed ProcessingConferenceApril, 1999TBA
X.-H. Sun,
M. Pantano,
T. Fahringer,
Z. Zhan
SCALA: A Framework for Performance Evaluation of Scalable ComputingThe 4-th Workshop on High-Level Parallel Programming Models & Supportive Environments in Lecture Notes in Computer Science , No. 1586, SpringerWorkshopApril, 1999
X.-H. Sun,
D. He,
K. Cameron,
Y. Luo
A Factorial Performance Evaluation for Hierarchical Memory SystemsThe IEEE Int'l Parallel Processing Symposium'99ConferenceApril, 1999TBA
Y. Zhuang,
X.-H. Sun
A Domain Decomposition Based Parallel Solver for Time Dependent Differential EquationsThe SIAM Conf. on Parallel Processing for Scientific ComputingWorkshopMarch, 1999TBA
X.-H. Sun,
V. K. Naik,
K. Chanchio
A Coordinated Approach for Process Migration in Heterogeneous EnvironmentsThe 1999 SIAM Parallel Processing ConferenceConferenceMarch, 1999
X. Wu,
Q. Chen,
X.-H. Sun
A Java-based distributed debugger supporting MPI and PVMJournal of Parallel and Distributed Computing and Practice, Vol. 2, No. 4JournalJanuary, 1999TBA
X.-H. Sun,
M. Pantano,
T. Fahringer
Performance Range Comparison for Restructuring CompilationThe International Conference on Parallel ProcessingConferenceAugust, 1998
M. Noelle,
M. Pantano,
X.-H. Sun
Communication Overhead: Prediction and Its Influence on ScalabilityThe International Conference on Parallel and Distributed Processing Techniques and ApplicationsConferenceJuly, 1998TBA
Q. Hou,
X.-H. Sun
A Three-Level Parallelization of a Spatial Direct Numerical SimulationInternational Journal on Advances in Engineering Software, pp. 325-330, Vol. 29, No. 3-6JournalJuly, 1998
X.-H. SunPerformance Range Comparison Via Crossing Point AnalysisLecture Notes in Computer Science, No. 1388, Springer-VerlagConferenceMarch, 1998
K. Chanchio,
X.-H. Sun
Memory Space Representation for Heterogeneous Network Process MigrationThe 12th International Parallel Processing SymposiumConferenceMarch, 1998
X. Liao,
X.-H. Sun
A Simulation Study of Packed Exponential Connection NetworkThe Int'l Conf. on Parallel and Distributed Computing SystemsConferenceOctober, 1997TBA
S. T. Leutenegger,
X.-H. Sun
Limitations of Cycle Stealing of Parallel Processing on a Network of Homogeneous WorkstationsJournal of Parallel and Distributed Computing, Vol.43, No. 3, pp.169-178JournalJanuary, 1997
X.-H. Sun,
S. Moitra
Performance Comparison of a Set of Periodic and Non-Periodic Tridiagonal Solvers on SP2 and Paragon Parallel ComputersConcurrency: Practice and Experience, pp.1-21, Vol. 8(10)JournalJanuary, 1997TBA
K. Chanchio,
X.-H. Sun
MpPVM: A Software System for Non-Dedicated Heterogeneous ComputingThe International Conference on Parallel ProcessingConferenceAugust, 1996TBA
X.-H. SunThe Relation of Scalability and Execution TimeThe IEEE International Parallel Processing Symposium'96ConferenceApril, 1996TBA
X.-H. Sun,
J. Zhu
Performance Considerations : A Case Study Using a Scalable Shared-Virtual-Memory MachinesIEEE Parallel and Distributed Technology, Vol 4, pp. 36-49, WinterConferenceJanuary, 1996
X.-H. Sun,
D. Joslin
A Simple Parallel Prefix Algorithm for Almost Toeplitz Tridiagonal SystemsInternational Journal of High Speed Computing, Vol.7, No.4, pp. 547-576JournalDecember, 1995TBA
X.-H. Sun,
J. Zhu
Performance Considerations of Shared Virtual Memory MachinesIEEE Trans. on Parallel and Distributed SystemsJournalNovember, 1995
X.-H. SunApplication and Accuracy of the Parallel Diagonal Dominant AlgorithmParallel ComputingJournalAugust, 1995
X.-H. Sun,
J. Zhu
Performance Prediction of Scalable Computing: A case studyThe 28th Hawaii International Conference on System SciencesConferenceJanuary, 1995TBA
X.-H. Sun,
D. Joslin
A Massively Parallel Algorithm for Compact Finite Difference SchemesThe 23rd International Conf. on Parallel Processing (ICPP'94)ConferenceAugust, 1994TBA
X.-H. Sun,
J. Rosendale
A Green's Function Approach to Distributed Solution of Tridiagonal SystemsThe 14th IMACS World Congress on Computational and Applied Mathematics, AtlantaConferenceJuly, 1994TBA
X.-H. Sun,
D. Rover
Scalability of Parallel Algorithm-Machine CombinationsIEEE Trans. on Parallel and Distributed SystemsJournalMay, 1994
X.-H. Sun,
J. Zhu
Shared Virtual Memory and Generalized SpeedupThe IEEE International Parallel Processing Symposium'94, pp. 637-643ConferenceApril, 1994TBA
X.-H. SunA Scalable Parallel Algorithm for Periodic Symmetric Toeplitz Tridiagonal SystemsThe Mardi Gras Conference'94: Toward Teraflop Computing and New Grand Challenge ApplicationsConferenceFebruary, 1994TBA
J. Wu,
X.-H. Sun
Optimal Cube-Connected Cube MulticomputersJournal of Network and Computer Applications, Vol. 17, pp. 135-146JournalJanuary, 1994TBA
X.-H. Sun,
L. Ni
Scalable Problems and Memory-Bounded SpeedupJournal of Parallel and Distributed Computing, Vol. 19, pp.27-37JournalSeptember, 1993
X.-H. Sun,
N. Kamel
Preprocessing Predicates and QueriesInformation Systems, Vol. 17, No.6, pp.465-475JournalNovember, 1992TBA
X.-H. Sun,
N. Kamel
Augmenting Multikey Searching Structures for General Database QueriesInternational Journal of Computer Systems Science and Engineering, Vol. 7, No. 4, pp.229-235JournalOctober, 1992TBA
X.-H. Sun,
H. Zhang,
L. Ni
Efficient Tridiagonal Solvers on MulticomputersIEEE Trans. on Computers, Vol. 41, No. 3, pp.286-296JournalMarch, 1992TBA
X.-H. Sun,
J. Gustafson
Toward A Better Parallel Performance MetricParallel Computing, Vol. 17, pp.1093-1109JournalDecember, 1991
T. Y. Li,
H. Zhang,
X.-H. Sun
Parallel Homotopy Algorithm for Symmetric Tridiagonal Eigenvalue ProblemSIAM Journal of Scientific and Statistical Computing, Vol. 5JournalMay, 1991TBA
Showing 345 of 345 publications