Abstract: This article presents a prediction-correction proximal method (PCPM) for the general nonsmooth convex optimization problem with linear equality and inequality constraints. The proposed ...
Abstract: In this paper, we consider a class of constrained convex optimization problems, where the global cost function is defined as the sum of agents' individual cost functions. Both local and ...
CVXRO is a package for decision making under uncertainty. It is based on Python, and built on top of CVXPY. It allows model optimization problems affected by uncertainty using data. The following code ...
Efficient Transformers in Reinforcement Learning using Actor-Learner Distillation[2021] E. Parisotto and R. Salakhutdinov[PDF] Deep Transformer Q-Networks for Partially Observable Reinforcement ...