Proximal Policy Optimization Algorithm

Google's TurboQuant compression tech cuts LLM memory use by 6x with no accuracy loss

The biggest memory burden for LLMs is the key-value cache, which stores conversational context as users interact with AI ...

IEEE

Solving Human–Robot Collaborative Circular Disassembly Line Balancing Problem via Graph Neural Network-Enhanced Proximal Policy Optimization Algorithm

Abstract: Industry 5.0 promotes the transformation of manufacturing toward flexibility, personalization, and sustainability. As a critical component of closed-loop manufacturing systems, disassembly ...

IEEE

SIM-assisted Secure Mobile Communications via Enhanced Proximal Policy Optimization Algorithm

Abstract: With the development of sixth-generation (6G) wire-less communication networks, the security challenges are becoming increasingly prominent, especially for mobile users (MUs). As a promising ...

Interesting Engineering

AI-trained quadruped robot walks rough, low-friction terrain without human input

A quadruped robot has learned to walk across slippery, uneven terrain entirely through simulation, without any human-designed gaits or manual tuning. The system relies on deep reinforcement learning ...

InfoQ

AlphaEvolve Enters Google Cloud as an Agentic System for Algorithm Optimization

A monthly overview of things you need to know as an architect or aspiring architect. Unlock the full InfoQ experience by logging in! Stay updated with your favorite authors and topics, engage with ...

GitHub

AliceeWonderland/Improving-Proximal-Policy-Optimization-for-Goal-reaching-Simulation-in-Unity-with-ML-Agents

This project presents a comprehensive overview of building a simulation environment in Unity and applying the Proximal Policy Optimization (PPO) algorithm from Unity’s built-in ML-Agents toolkit. We ...

Scientific Research Publishing

Tran, T.T., Browne, T., Veitch, B., Musharraf, M. and Peters, D. (2023) Route Optimization for Vessels in Ice: Investigating Operational Implications of the Carbon Intensity ...

ABSTRACT: Maritime transportation is increasingly being subjected to pressure to balance economic efficiency with environmental sustainability under regulatory frameworks such as global trade demands ...

GitHub

AliceeUL/Improving-Proximal-Policy-Optimization-for-Goal-reaching-Simulation-in-Unity-with-ML-Agents

Goal-reaching simulation in Unity by combining to use ML-Agents toolkit and Anaconda involves training an agent to navigate and interact with environments to reach predefined goal target. This task ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results