Systems for AI

LLM Training Stragglers
Research on identifying and mitigating straggler effects in large-scale language model training

BootSeer - LLM Training Startup Optimization
System-level optimization framework for reducing startup overhead in large-scale LLM training

ML for Systems

Self-Healing in Large-Scale Datacenters
AIHS - Automated intelligent healing system for cloud-scale data centers using machine learning

Network Traffic Classification with Deep Learning
EBSNN and BSNN - Novel neural network architectures for automated network traffic classification

Systems

HotSwap - Serverless Cold Start Optimization
Novel provider-side optimization for reducing cold-start latency in serverless computing