Abstract: The growing IoT landscape requires effective server deployment strategies to meet demands such as real-time processing and energy efficiency. Meeting these demands is complicated by the heterogeneous and dynamic nature of both applications and servers. To address these challenges, we propose ReinFog, a modular distributed software framework empowered by Deep Reinforcement Learning (DRL) for adaptive resource management across edge/fog and cloud environments. ReinFog enables the practical development and deployment of various centralized and distributed DRL techniques for resource management in these environments. It also supports integrating native and library-based DRL techniques for diverse IoT application scheduling objectives. In addition, ReinFog allows deployment configurations to be customized for different DRL techniques, including the number and placement of DRL Learners and DRL Workers in large-scale distributed systems. We also propose MADCP, a novel Memetic Algorithm for DRL Component Placement in ReinFog (covering components such as DRL Learners and DRL Workers), which combines the strengths of Genetic Algorithm, Firefly Algorithm, and Particle Swarm Optimization. Experiments show that the DRL mechanisms developed within ReinFog significantly improve the implementation of both centralized and distributed DRL techniques. These improvements translate into better IoT application performance, reducing response time by 45%, energy consumption by 39%, and weighted cost by 37%, while maintaining minimal scheduling overhead. ReinFog also scales well: increasing the number of DRL Workers from 1 to 30 adds only 0.3 seconds to startup time and around 2 MB of RAM per Worker. Finally, MADCP accelerates the convergence of DRL techniques by up to 38%.

Decision Transformer under Random Frame Dropping

D3D: Conditional Diffusion Model for Decision-Making under Random Frame Dropping

DROP: Conservative Model-based Optimization for Offline Reinforcement Learning

Robust Decision Transformer: Tackling Data Corruption in Offline RL via Sequence Modeling

An Offline-Transfer-Online Framework for Cloud-Edge Collaborative Distributed Reinforcement Learning

Solving time-delay issues in reinforcement learning via transformers

ReinFog: A DRL Empowered Framework for Resource Management in Edge and Cloud Computing Environments

Solving Continual Offline Reinforcement Learning with Decision Transformer

Towards Robust Decision-Making for Autonomous Driving on Highway

DEER: A Delay-Resilient Framework for Reinforcement Learning with Variable Delays

Retentive Decision Transformer with Adaptive Masking for Reinforcement Learning based Recommendation Systems

FLIRRAS: Fast Learning With Integrated Reward and Reduced Action Space for Online Multitask Offloading

DistRL: An Asynchronous Distributed Reinforcement Learning Framework for On-Device Control Agents

Solving Offline Reinforcement Learning with Decision Tree Regression

Q-value Regularized Decision ConvFormer for Offline Reinforcement Learning

Adaptability Preserving Domain Decomposition for Stabilizing Sim2Real Reinforcement Learning

Uncertainty-Aware Decision Transformer for Stochastic Driving Environments

Towards Robust Decision-Making for Autonomous Highway Driving Based on Safe Reinforcement Learning

Decision Transformers for Wireless Communications: A New Paradigm of Resource Management

Rethinking Decision Transformer via Hierarchical Reinforcement Learning