深度强化学习实验室
来源:ICLR2021
编辑:DeepRL
[1]. What Matters for On-Policy Deep Actor-Critic Methods? A Large-Scale Study
平均得分: 8
得分: ['7', '9', '9', '7']
论文链接: openreview.net/forum?id=nI…
[2]. Invariant Representations for Reinforcement Learning without Reconstruction
平均得分: 7.67
得分: ['9', '7', '7']
论文链接: openreview.net/forum?id=-2…
[3]. Winning the L2RPN Challenge: Power Grid Management via Semi-Markov Afterstate Actor-Critic
平均得分: 7.5
得分: ['7', '9', '7', '7']
论文链接: openreview.net/forum?id=Lm…
[4]. Deep symbolic regression: Recovering mathematical expressions from data via risk-seeking policy gradients
平均得分: 7.5
得分: ['9', '5', '8', '8']
论文链接: openreview.net/forum?id=m5…
[5]. Parrot: Data-Driven Behavioral Priors for Reinforcement Learning
平均得分: 7.5
得分: ['8', '7', '6', '9']
论文链接: openreview.net/forum?id=Ys…
[6]. Evolving Reinforcement Learning Algorithms
平均得分: 7.33
得分: ['9', '6', '7']
论文链接: openreview.net/forum?id=0X…
[7]. Global optimality of softmax policy gradient with single hidden layer neural networks in the mean-field regime
平均得分: 7
得分: ['7', '7', '7', '7']
论文链接: openreview.net/forum?id=bB…
[8]. Single-Timescale Actor-Critic Provably Finds Globally Optimal Policy
平均得分: 7
得分: ['8', '8', '7', '5']
论文链接: openreview.net/forum?id=pq…
[9]. UPDeT: Universal Multi-agent RL via Policy Decoupling with Transformers
平均得分: 7
得分: ['7', '9', '5']
论文链接: openreview.net/forum?id=v9…
[10]. Regularized Inverse Reinforcement Learning
平均得分: 6.8
得分: ['6', '6', '7', '8', '7']
论文链接: openreview.net/forum?id=Hg…
[11]. Randomized Ensembled Double Q-Learning: Learning Fast Without a Model
平均得分: 6.75
得分: ['6', '7', '7', '7']
论文链接: openreview.net/forum?id=AY…
[12]. Deployment-Efficient Reinforcement Learning via Model-Based Offline Optimization
平均得分: 6.75
得分: ['8', '7', '5', '7']
论文链接: openreview.net/forum?id=3h…
[13]. Image Augmentation Is All You Need: Regularizing Deep Reinforcement Learning from Pixels
平均得分: 6.75
得分: ['7', '6', '7', '7']
论文链接: openreview.net/forum?id=GY…
[14]. Support-set bottlenecks for video-text representation learning
平均得分: 6.75
得分: ['6', '9', '7', '5']
论文链接: openreview.net/forum?id=Eq…
[15]. A Sharp Analysis of Model-based Reinforcement Learning with Self-Play
平均得分: 6.75
得分: ['4', '7', '8', '8']
论文链接: openreview.net/forum?id=9Y…
[16]. RODE: Learning Roles to Decompose Multi-Agent Tasks
平均得分: 6.67
得分: ['8', '6', '6']
论文链接: openreview.net/forum?id=TT…
[17]. Text Generation by Learning from Off-Policy Demonstrations
平均得分: 6.6
得分: ['7', '7', '7', '5', '7']
论文链接: openreview.net/forum?id=Ro…
[18]. Robust Reinforcement Learning on State Observations with Learned Optimal Adversary
平均得分: 6.5
得分: ['5', '7', '7', '7']
论文链接: openreview.net/forum?id=sC…
[19]. Self-supervised Visual Reinforcement Learning with Object-centric Representations
平均得分: 6.5
得分: ['7', '6', '4', '9']
论文链接: openreview.net/forum?id=xp…
[20]. On Effective Parallelization of Monte Carlo Tree Search
平均得分: 6.5
得分: ['6', '6', '7', '7']
论文链接: openreview.net/forum?id=_F…
[21]. Non-asymptotic Confidence Intervals of Off-policy Evaluation: Primal and Dual Bounds
平均得分: 6.5
得分: ['6', '5', '8', '7']
论文链接: openreview.net/forum?id=dK…
[22]. Efficient Transformers in Reinforcement Learning using Actor-Learner Distillation
平均得分: 6.5
得分: ['5', '6', '7', '8']
论文链接: openreview.net/forum?id=uR…
[23]. Ask Your Humans: Using Human Instructions to Improve Generalization in Reinforcement Learning
平均得分: 6.5
得分: ['8', '7', '5', '6']
论文链接: openreview.net/forum?id=Y8…
[24]. SMiRL: Surprise Minimizing Reinforcement Learning in Unstable Environments
平均得分: 6.5
得分: ['5', '6', '8', '7']
论文链接: openreview.net/forum?id=cP…
[25]. Model-Based Visual Planning with Self-Supervised Functional Distances
平均得分: 6.5
得分: ['7', '6', '7', '6']
论文链接: openreview.net/forum?id=Uc…
[26]. Learning-based Support Estimation in Sublinear Time
平均得分: 6.5
得分: ['7', '4', '8', '7']
论文链接: openreview.net/forum?id=ti…
[27]. DOP: Off-Policy Multi-Agent Decomposed Policy Gradients
平均得分: 6.5
得分: ['7', '3', '9', '7']
论文链接: openreview.net/forum?id=6F…
[28]. Correcting experience replay for multi-agent communication
平均得分: 6.5
得分: ['4', '6', '8', '8']
论文链接: openreview.net/forum?id=xv…
[29]. Risk-Averse Offline Reinforcement Learning
平均得分: 6.4
得分: ['6', '8', '5', '6', '7']
论文链接: openreview.net/forum?id=TB…
[30]. Learning Value Functions in Deep Policy Gradients using Residual Variance
平均得分: 6.33
得分: ['8', '7', '4']
论文链接: openreview.net/forum?id=NX…
[31]. Contrastive Explanations for Reinforcement Learning via Embedded Self Predictions
平均得分: 6.33
得分: ['4', '8', '7']
论文链接: openreview.net/forum?id=Ud…
[32]. PODS: Policy Optimization via Differentiable Simulation
平均得分: 6.33
得分: ['9', '4', '6']
论文链接: openreview.net/forum?id=4f…
[33]. Transient Non-stationarity and Generalisation in Deep Reinforcement Learning
平均得分: 6.25
得分: ['7', '5', '5', '8']
论文链接: openreview.net/forum?id=Qu…
[34]. Improving Learning to Branch via Reinforcement Learning
平均得分: 6.25
得分: ['7', '7', '8', '3']
论文链接: openreview.net/forum?id=M_…
[35]. Mastering Atari with Discrete World Models
平均得分: 6.25
得分: ['4', '7', '10', '4']
论文链接: openreview.net/forum?id=0o…
[36]. Data-Efficient Reinforcement Learning with Self-Predictive Representations
平均得分: 6.25
得分: ['6', '5', '7', '7']
论文链接: openreview.net/forum?id=uC…
[37]. Local Information Opponent Modelling Using Variational Autoencoders
平均得分: 6.25
得分: ['8', '7', '4', '6']
论文链接: openreview.net/forum?id=xF…
[38]. Contrastive Behavioral Similarity Embeddings for Generalization in Reinforcement Learning
平均得分: 6.25
得分: ['6', '6', '6', '7']
论文链接: openreview.net/forum?id=qd…
[39]. Efficient Reinforcement Learning in Factored MDPs with Application to Constrained RL
平均得分: 6.25
得分: ['7', '5', '7', '6']
论文链接: openreview.net/forum?id=fm…
[40]. Batch Reinforcement Learning Through Continuation Method
平均得分: 6.25
得分: ['6', '9', '6', '4']
论文链接: openreview.net/forum?id=po…
[41]. Optimistic Exploration with Backward Bootstrapped Bonus for Deep Reinforcement Learning
平均得分: 6.2
得分: ['7', '6', '7', '6', '5']
论文链接: openreview.net/forum?id=Qx…
[42]. Optimism in Reinforcement Learning with Generalized Linear Function Approximation
平均得分: 6
得分: ['6', '7', '6', '5']
论文链接: openreview.net/forum?id=CB…
[43]. Adversarially Guided Actor-Critic
平均得分: 6
得分: ['5', '6', '7']
论文链接: openreview.net/forum?id=_m…
[44]. QTRAN++: Improved Value Transformation for Cooperative Multi-Agent Reinforcement Learning
平均得分: 6
得分: ['7', '6', '6', '5']
论文链接: openreview.net/forum?id=Tl…
[45]. Policy Optimization in Zero-Sum Markov Games: Fictitious Self-Play Provably Attains Nash Equilibria
平均得分: 6
得分: ['6', '5', '8', '5']
论文链接: openreview.net/forum?id=c3…
[46]. Optimistic Policy Optimization with General Function Approximations
平均得分: 6
得分: ['7', '7', '4']
论文链接: openreview.net/forum?id=Jy…
[47]. Multi-Agent Collaboration via Reward Attribution Decomposition
平均得分: 6
得分: ['5', '6', '7', '6']
论文链接: openreview.net/forum?id=GV…
[48]. Efficient Wasserstein Natural Gradients for Reinforcement Learning
平均得分: 6
得分: ['5', '8', '5']
论文链接: openreview.net/forum?id=OH…
[49]. Density Constrained Reinforcement Learning
平均得分: 6
得分: ['7', '6', '5', '6']
论文链接: openreview.net/forum?id=jM…
[50]. Representation Balancing Offline Model-based Reinforcement Learning
平均得分: 6
得分: ['5', '6', '7', '6']
论文链接: openreview.net/forum?id=Qp…
[51]. Decoupling Representation Learning from Reinforcement Learning
平均得分: 6
得分: ['7', '5', '4', '8']
论文链接: openreview.net/forum?id=_S…
[52]. Model-based micro-data reinforcement learning: what are the crucial model properties and which model to choose?
平均得分: 5.8
得分: ['7', '7', '6', '5', '4']
论文链接: openreview.net/forum?id=p5…
[53]. Model-based Asynchronous Hyperparameter and Neural Architecture Search
平均得分: 5.8
得分: ['7', '5', '6', '6', '5']
论文链接: openreview.net/forum?id=a2…
[54]. DeepAveragers: Offline Reinforcement Learning By Solving Derived Non-Parametric MDPs
平均得分: 5.8
得分: ['5', '7', '5', '7', '5']
论文链接: openreview.net/forum?id=eM…
[55]. Uncertainty Weighted Offline Reinforcement Learning
平均得分: 5.8
得分: ['8', '6', '5', '6', '4']
论文链接: openreview.net/forum?id=7h…
[56]. Optimizing Memory Placement using Evolutionary Graph Reinforcement Learning
平均得分: 5.75
得分: ['5', '7', '5', '6']
论文链接: openreview.net/forum?id=-6…
[57]. Parameter-based Value Functions
平均得分: 5.75
得分: ['3', '7', '7', '6']
论文链接: openreview.net/forum?id=tV…
[58]. Sample-Efficient Automated Deep Reinforcement Learning
平均得分: 5.75
得分: ['7', '5', '5', '6']
论文链接: openreview.net/forum?id=hS…
[59]. Causal Inference Q-Network: Toward Resilient Reinforcement Learning
平均得分: 5.75
得分: ['4', '6', '6', '7']
论文链接: openreview.net/forum?id=Pv…
[60]. SACoD: Sensor Algorithm Co-Design Towards Efficient CNN-powered Intelligent PhlatCam
平均得分: 5.75
得分: ['6', '6', '5', '6']
论文链接: openreview.net/forum?id=jQ…
[61]. Learn Goal-Conditioned Policy with Intrinsic Motivation for Deep Reinforcement Learning
平均得分: 5.75
得分: ['6', '7', '5', '5']
论文链接: openreview.net/forum?id=Mm…
[62]. Benchmarks for Deep Off-Policy Evaluation
平均得分: 5.75
得分: ['7', '6', '4', '6']
论文链接: openreview.net/forum?id=kW…
[63]. Shortest-Path Constrained Reinforcement Learning for Sparse Reward Tasks
平均得分: 5.75
得分: ['6', '5', '6', '6']
论文链接: openreview.net/forum?id=Y-…
[64]. Exploring Zero-Shot Emergent Communication in Embodied Multi-Agent Populations
平均得分: 5.75
得分: ['6', '4', '6', '7']
论文链接: openreview.net/forum?id=Fb…
[65]. Randomized Entity-wise Factorization for Multi-Agent Reinforcement Learning
平均得分: 5.75
得分: ['5', '5', '7', '6']
论文链接: openreview.net/forum?id=sz…
[66]. Learning Robust State Abstractions for Hidden-Parameter Block MDPs
平均得分: 5.75
得分: ['5', '6', '5', '7']
论文链接: openreview.net/forum?id=fm…
[67]. Adapting to Reward Progressivity via Spectral Reinforcement Learning
平均得分: 5.75
得分: ['5', '7', '5', '6']
论文链接: openreview.net/forum?id=dy…
[68]. Accelerating Safe Reinforcement Learning with Constraint-mismatched Policies
平均得分: 5.75
得分: ['5', '6', '5', '7']
论文链接: openreview.net/forum?id=M3…
[69]. Off-Dynamics Reinforcement Learning: Training for Transfer with Domain Classifiers
平均得分: 5.75
得分: ['5', '6', '5', '7']
论文链接: openreview.net/forum?id=eq…
[70]. Meta-Reinforcement Learning With Informed Policy Regularization
平均得分: 5.75
得分: ['6', '5', '6', '6']
论文链接: openreview.net/forum?id=pT…
[71]. Hierarchical Reinforcement Learning by Discovering Intrinsic Options
平均得分: 5.75
得分: ['4', '4', '7', '8']
论文链接: openreview.net/forum?id=r-…
[72]. Multi-Agent Trust Region Learning
平均得分: 5.75
得分: ['4', '8', '5', '6']
论文链接: openreview.net/forum?id=eH…
[73]. Unity of Opposites: SelfNorm and CrossNorm for Model Robustness
平均得分: 5.75
得分: ['5', '7', '6', '5']
论文链接: openreview.net/forum?id=Oj…
[74]. The Advantage Regret-Matching Actor-Critic
平均得分: 5.67
得分: ['5', '6', '6']
论文链接: openreview.net/forum?id=YM…
[75]. Differentiable Trust Region Layers for Deep Reinforcement Learning
平均得分: 5.67
得分: ['7', '4', '6']
论文链接: openreview.net/forum?id=qY…
[76]. Linear Representation Meta-Reinforcement Learning for Instant Adaptation
平均得分: 5.67
得分: ['5', '5', '7']
论文链接: openreview.net/forum?id=lN…
[77]. Symmetry-Aware Actor-Critic for 3D Molecular Design
平均得分: 5.67
得分: ['6', '4', '7']
论文链接: openreview.net/forum?id=jE…
[78]. The Importance of Pessimism in Fixed-Dataset Policy Optimization
平均得分: 5.67
得分: ['5', '5', '7']
论文链接: openreview.net/forum?id=E3…
[79]. Understanding and Leveraging Causal Relations in Deep Reinforcement Learning
平均得分: 5.67
得分: ['5', '6', '6']
论文链接: openreview.net/forum?id=30…
[80]. Efficient Fully-Offline Meta-Reinforcement Learning via Distance Metric Learning and Behavior Regularization
平均得分: 5.67
得分: ['7', '5', '5']
论文链接: openreview.net/forum?id=8c…
[81]. Grounding Language to Entities for Generalization in Reinforcement Learning
平均得分: 5.6
得分: ['6', '7', '6', '5', '4']
论文链接: openreview.net/forum?id=ud…
[82]. Large Batch Simulation for Deep Reinforcement Learning
平均得分: 5.6
得分: ['7', '6', '6', '5', '4']
论文链接: openreview.net/forum?id=cP…
[83]. Deep Reinforcement Learning For Wireless Scheduling with Multiclass Services
平均得分: 5.5
得分: ['3', '7', '7', '5']
论文链接: openreview.net/forum?id=Ui…
[84]. Monotonic Robust Policy Optimization with Model Discrepancy
平均得分: 5.5
得分: ['7', '6', '5', '4']
论文链接: openreview.net/forum?id=kd…
[85]. Truly Deterministic Policy Optimization
平均得分: 5.5
得分: ['5', '6', '6', '5']
论文链接: openreview.net/forum?id=Bn…
[86]. Distributional Reinforcement Learning for Risk-Sensitive Policies
平均得分: 5.5
得分: ['5', '7', '5', '5']
论文链接: openreview.net/forum?id=19…
[87]. Bounded Myopic Adversaries for Deep Reinforcement Learning Agents
平均得分: 5.5
得分: ['5', '6', '5', '6']
论文链接: openreview.net/forum?id=Ew…
[88]. Decoupling Exploration and Exploitation for Meta-Reinforcement Learning without Sacrifices
平均得分: 5.5
得分: ['7', '6', '4', '5']
论文链接: openreview.net/forum?id=rS…
[89]. Discovering Diverse Multi-Agent Strategic Behavior via Reward Randomization
平均得分: 5.5
得分: ['5', '7', '5', '5']
论文链接: openreview.net/forum?id=lv…
[90]. Blending MPC & Value Function Approximation for Efficient Reinforcement Learning
平均得分: 5.5
得分: ['5', '5', '5', '7']
论文链接: openreview.net/forum?id=Rq…
[91]. A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning
平均得分: 5.5
得分: ['6', '5', '5', '6']
论文链接: openreview.net/forum?id=zd…
[92]. The act of remembering: A study in partially observable reinforcement learning
平均得分: 5.5
得分: ['6', '7', '6', '3']
论文链接: openreview.net/forum?id=uF…
[93]. Random Coordinate Langevin Monte Carlo
平均得分: 5.5
得分: ['7', '7', '4', '4']
论文链接: openreview.net/forum?id=lb…
[94]. Provable Rich Observation Reinforcement Learning with Combinatorial Latent States
平均得分: 5.5
得分: ['4', '6', '5', '7']
论文链接: openreview.net/forum?id=hx…
[95]. Automatic Data Augmentation for Generalization in Reinforcement Learning
平均得分: 5.5
得分: ['6', '7', '3', '6']
论文链接: openreview.net/forum?id=9l…
[96]. Reinforcement Learning with Random Delays
平均得分: 5.5
得分: ['3', '6', '5', '8']
论文链接: openreview.net/forum?id=QF…
[97]. On Proximal Policy Optimization's Heavy-Tailed Gradients
平均得分: 5.5
得分: ['6', '5', '6', '5']
论文链接: openreview.net/forum?id=cY…
[98]. A Primal Approach to Constrained Policy Optimization: Global Optimality and Finite-Time Analysis
平均得分: 5.5
得分: ['7', '5', '5', '5']
论文链接: openreview.net/forum?id=rI…
[99]. Regularization Matters in Policy Optimization - An Empirical Study on Continuous Control
平均得分: 5.5
得分: ['4', '6', '5', '7']
论文链接: openreview.net/forum?id=yr…
[100]. Divide-and-Conquer Monte Carlo Tree Search
平均得分: 5.5
得分: ['8', '5', '4', '5']
论文链接: openreview.net/forum?id=Nj…
[101]. Status-Quo Policy Gradient in Multi-agent Reinforcement Learning
平均得分: 5.5
得分: ['4', '5', '6', '7']
论文链接: openreview.net/forum?id=76…
[102]. QPLEX: Duplex Dueling Multi-Agent Q-Learning
平均得分: 5.5
得分: ['4', '5', '6', '7']
论文链接: openreview.net/forum?id=Rc…
[103]. A Reduction Approach to Constrained Reinforcement Learning
平均得分: 5.5
得分: ['6', '7', '5', '4']
论文链接: openreview.net/forum?id=fV…
[104]. Compute- and Memory-Efficient Reinforcement Learning with Latent Experience Replay
平均得分: 5.5
得分: ['7', '4', '5', '6']
论文链接: openreview.net/forum?id=J7…
[105]. On Trade-offs of Image Prediction in Visual Model-Based Reinforcement Learning
平均得分: 5.5
得分: ['5', '3', '7', '7']
论文链接: openreview.net/forum?id=me…
[106]. Towards Understanding Linear Value Decomposition in Cooperative Multi-Agent Q-Learning
平均得分: 5.5
得分: ['5', '7', '5', '5']
论文链接: openreview.net/forum?id=VM…
[107]. Average Reward Reinforcement Learning with Monotonic Policy Improvement
平均得分: 5.5
得分: ['6', '4', '6', '6']
论文链接: openreview.net/forum?id=lo…
[108]. FactoredRL: Leveraging Factored Graphs for Deep Reinforcement Learning
平均得分: 5.5
得分: ['5', '6', '6', '5']
论文链接: openreview.net/forum?id=wE…
[109]. Implicit Under-Parameterization Inhibits Data-Efficient Deep Reinforcement Learning
平均得分: 5.5
得分: ['4', '7', '6', '5']
论文链接: openreview.net/forum?id=O9…
[110]. Scalable Bayesian Inverse Reinforcement Learning by Auto-Encoding Reward
平均得分: 5.5
得分: ['4', '5', '7', '6']
论文链接: openreview.net/forum?id=4q…
[111]. Model-Based Offline Planning
平均得分: 5.5
得分: ['6', '4', '8', '4']
论文链接: openreview.net/forum?id=OM…
[112]. BRAC+: Going Deeper with Behavior Regularized Offline Reinforcement Learning
平均得分: 5.5
得分: ['4', '6', '7', '5']
论文链接: openreview.net/forum?id=bM…
[113]. Learning to Share in Multi-Agent Reinforcement Learning
平均得分: 5.4
得分: ['4', '4', '8', '8', '3']
论文链接: openreview.net/forum?id=aw…
[114]. Explicit Pareto Front Optimization for Constrained Reinforcement Learning
平均得分: 5.33
得分: ['6', '6', '4']
论文链接: openreview.net/forum?id=pO…
[115]. Guided Exploration with Proximal Policy Optimization using a Single Demonstration
平均得分: 5.33
得分: ['6', '4', '6']
论文链接: openreview.net/forum?id=88…
[116]. Unsupervised Active Pre-Training for Reinforcement Learning
平均得分: 5.33
得分: ['5', '6', '5']
论文链接: openreview.net/forum?id=cv…
[117]. RECONNAISSANCE FOR REINFORCEMENT LEARNING WITH SAFETY CONSTRAINTS
平均得分: 5.33
得分: ['4', '5', '7']
论文链接: openreview.net/forum?id=Gc…
[118]. Daylight: Assessing Generalization Skills of Deep Reinforcement Learning Agents
平均得分: 5.33
得分: ['6', '5', '5']
论文链接: openreview.net/forum?id=Z3…
[119]. Diversity Actor-Critic: Sample-Aware Entropy Regularization for Sample-Efficient Exploration
平均得分: 5.33
得分: ['4', '5', '7']
论文链接: openreview.net/forum?id=7q…
[120]. OPAL: Offline Primitive Discovery for Accelerating Offline Reinforcement Learning
平均得分: 5.33
得分: ['7', '5', '4']
论文链接: openreview.net/forum?id=V6…
[121]. A REINFORCEMENT LEARNING FRAMEWORK FOR TIME DEPENDENT CAUSAL EFFECTS EVALUATION IN A/B TESTING
平均得分: 5.33
得分: ['6', '5', '5']
论文链接: openreview.net/forum?id=Dt…
[122]. PettingZoo: Gym for Multi-Agent Reinforcement Learning
平均得分: 5.25
得分: ['7', '5', '6', '3']
论文链接: openreview.net/forum?id=Wo…
[123]. Hippocampal representations emerge when training recurrent neural networks on a memory dependent maze navigation task
平均得分: 5.25
得分: ['4', '6', '4', '7']
论文链接: openreview.net/forum?id=Jr…
[124]. Data-efficient Hindsight Off-policy Option Learning
平均得分: 5.25
得分: ['5', '6', '5', '5']
论文链接: openreview.net/forum?id=QK…
[125]. Attacking Few-Shot Classifiers with Adversarial Support Sets
平均得分: 5.25
得分: ['6', '4', '6', '5']
论文链接: openreview.net/forum?id=0x…
[126]. Coverage as a Principle for Discovering Transferable Behavior in Reinforcement Learning
平均得分: 5.25
得分: ['8', '5', '4', '4']
论文链接: openreview.net/forum?id=IN…
[127]. Reinforcement Learning for Control with Probabilistic Stability Guarantee
平均得分: 5.25
得分: ['6', '5', '5', '5']
论文链接: openreview.net/forum?id=Qf…
[128]. Efficient Reinforcement Learning in Resource Allocation Problems Through Permutation Invariant Multi-task Learning
平均得分: 5.25
得分: ['7', '5', '5', '4']
论文链接: openreview.net/forum?id=Ti…
[129]. Meta-Reinforcement Learning Robust to Distributional Shift via Model Identification and Experience Relabeling
平均得分: 5.25
得分: ['6', '5', '5', '5']
论文链接: openreview.net/forum?id=AT…
[130]. Solving Compositional Reinforcement Learning Problems via Task Reduction
平均得分: 5.25
得分: ['3', '5', '6', '7']
论文链接: openreview.net/forum?id=9S…
[131]. Emergent Road Rules In Multi-Agent Driving Environments
平均得分: 5.25
得分: ['7', '4', '5', '5']
论文链接: openreview.net/forum?id=d8…
[132]. EMaQ: Expected-Max Q-Learning Operator for Simple Yet Effective Offline and Online RL
平均得分: 5.25
得分: ['4', '6', '6', '5']
论文链接: openreview.net/forum?id=B8…
[133]. Double Q-learning: New Analysis and Sharper Finite-time Bound
平均得分: 5.25
得分: ['6', '4', '6', '5']
论文链接: openreview.net/forum?id=Mw…
[134]. Safety Verification of Model Based Reinforcement Learning Controllers
平均得分: 5.25
得分: ['3', '7', '6', '5']
论文链接: openreview.net/forum?id=mf…
[135]. D3C: Reducing the Price of Anarchy in Multi-Agent Learning
平均得分: 5.25
得分: ['3', '4', '7', '7']
论文链接: openreview.net/forum?id=8w…
[136]. Near-Optimal Regret Bounds for Model-Free RL in Non-Stationary Episodic MDPs
平均得分: 5.25
得分: ['6', '4', '4', '7']
论文链接: openreview.net/forum?id=TJ…
[137]. Communication in Multi-Agent Reinforcement Learning: Intention Sharing
平均得分: 5.25
得分: ['6', '4', '6', '5']
论文链接: openreview.net/forum?id=qp…
[138]. On the role of planning in model-based deep reinforcement learning
平均得分: 5.25
得分: ['7', '3', '6', '5']
论文链接: openreview.net/forum?id=Ir…
[139]. Reinforcement Learning with Latent Flow
平均得分: 5.25
得分: ['7', '3', '6', '5']
论文链接: openreview.net/forum?id=lS…
[140]. Iterative Amortized Policy Optimization
平均得分: 5.25
得分: ['6', '5', '5', '5']
论文链接: openreview.net/forum?id=49…
[141]. Unsupervised Task Clustering for Multi-Task Reinforcement Learning
平均得分: 5.25
得分: ['6', '5', '5', '5']
论文链接: openreview.net/forum?id=4K…
[142]. Adaptive Multi-model Fusion Learning for Sparse-Reward Reinforcement Learning
平均得分: 5.25
得分: ['6', '5', '6', '4']
论文链接: openreview.net/forum?id=4e…
[143]. ERMAS: Learning Policies Robust to Reality Gaps in Multi-Agent Simulations
平均得分: 5.25
得分: ['6', '5', '6', '4']
论文链接: openreview.net/forum?id=uI…
[144]. A Distributional Perspective on Actor-Critic Framework
平均得分: 5.25
得分: ['5', '7', '3', '6']
论文链接: openreview.net/forum?id=jW…
[145]. Robust Reinforcement Learning using Adversarial Populations
平均得分: 5.25
得分: ['5', '7', '4', '5']
论文链接: openreview.net/forum?id=I6…
[146]. The Compact Support Neural Network
平均得分: 5.25
得分: ['5', '5', '6', '5']
论文链接: openreview.net/forum?id=xC…
[147]. RMIX: Risk-Sensitive Multi-Agent Reinforcement Learning
平均得分: 5.25
得分: ['6', '4', '7', '4']
论文链接: openreview.net/forum?id=1E…
[148]. Meta-Model-Based Meta-Policy Optimization
平均得分: 5.25
得分: ['5', '5', '5', '6']
论文链接: openreview.net/forum?id=KO…
[149]. Decentralized Deterministic Multi-Agent Reinforcement Learning
平均得分: 5.2
得分: ['5', '4', '7', '5', '5']
论文链接: openreview.net/forum?id=QM…
[150]. Transfer among Agents: An Efficient Multiagent Transfer Learning Framework
平均得分: 5.2
得分: ['5', '6', '4', '6', '5']
论文链接: openreview.net/forum?id=9w…
[151]. Gradient-based tuning of Hamiltonian Monte Carlo hyperparameters
平均得分: 5
得分: ['5', '4', '6', '5']
论文链接: openreview.net/forum?id=Lv…
[152]. Combining Imitation and Reinforcement Learning with Free Energy Principle
平均得分: 5
得分: ['4', '6', '5', '5']
论文链接: openreview.net/forum?id=JI…
[153]. Ordering-Based Causal Discovery with Reinforcement Learning
平均得分: 5
得分: ['5', '5', '5', '5']
论文链接: openreview.net/forum?id=bM…
[154]. Universal Value Density Estimation for Imitation Learning and Goal-Conditioned Reinforcement Learning
平均得分: 5
得分: ['5', '5', '4', '6']
论文链接: openreview.net/forum?id=S2…
[155]. The Emergence of Individuality in Multi-Agent Reinforcement Learning
平均得分: 5
得分: ['5', '5', '4', '6']
论文链接: openreview.net/forum?id=Eo…
[156]. Explore with Dynamic Map: Graph Structured Reinforcement Learning
平均得分: 5
得分: ['4', '5', '6', '5']
论文链接: openreview.net/forum?id=-u…
[157]. Offline Meta-Reinforcement Learning with Advantage Weighting
平均得分: 5
得分: ['5', '6', '5', '4']
论文链接: openreview.net/forum?id=S5…
[158]. Deep Q-Learning with Low Switching Cost
平均得分: 5
得分: ['6', '5', '5', '4']
论文链接: openreview.net/forum?id=7O…
[159]. AWAC: Accelerating Online Reinforcement Learning with Offline Datasets
平均得分: 5
得分: ['6', '6', '3', '6', '4']
论文链接: openreview.net/forum?id=OJ…
[160]. A Strong On-Policy Competitor To PPO
平均得分: 5
得分: ['5', '5', '5']
论文链接: openreview.net/forum?id=0m…
[161]. Control-Aware Representations for Model-based Reinforcement Learning
平均得分: 5
得分: ['6', '5', '4']
论文链接: openreview.net/forum?id=dg…
[162]. Formal Language Constrained Markov Decision Processes
平均得分: 5
得分: ['5', '6', '4', '5']
论文链接: openreview.net/forum?id=NT…
[163]. Multi-Agent Imitation Learning with Copulas
平均得分: 5
得分: ['4', '4', '7']
论文链接: openreview.net/forum?id=gR…
[164]. Projected Latent Markov Chain Monte Carlo: Conditional Sampling of Normalizing Flows
平均得分: 5
得分: ['6', '5', '4']
论文链接: openreview.net/forum?id=MB…
[165]. Efficient Competitive Self-Play Policy Optimization
平均得分: 5
得分: ['7', '5', '3', '5']
论文链接: openreview.net/forum?id=99…
[166]. Offline Model-Based Optimization via Normalized Maximum Likelihood Estimation
平均得分: 5
得分: ['5', '5', '5']
论文链接: openreview.net/forum?id=Fm…
[167]. Beyond Prioritized Replay: Sampling States in Model-Based RL via Simulated Priorities
平均得分: 5
得分: ['4', '6', '5']
论文链接: openreview.net/forum?id=B5…
[168]. Action Guidance: Getting the Best of Sparse Rewards and Shaped Rewards for Real-time Strategy Games
平均得分: 5
得分: ['6', '4', '6', '4']
论文链接: openreview.net/forum?id=1O…
[169]. What About Taking Policy as Input of Value Function: Policy-extended Value Function Approximator
平均得分: 5
得分: ['7', '5', '5', '3']
论文链接: openreview.net/forum?id=V4…
[170]. Optimizing Information Bottleneck in Reinforcement Learning: A Stein Variational Approach
平均得分: 5
得分: ['6', '4', '5', '5']
论文链接: openreview.net/forum?id=IK…
[171]. On the Estimation Bias in Double Q-Learning
平均得分: 5
得分: ['6', '5', '3', '6']
论文链接: openreview.net/forum?id=FK…
[172]. Entropic Risk-Sensitive Reinforcement Learning: A Meta Regret Framework with Function Approximation
平均得分: 5
得分: ['6', '5', '4', '5']
论文链接: openreview.net/forum?id=q_…
[173]. Goal-Auxiliary Actor-Critic for 6D Robotic Grasping with Point Clouds
平均得分: 5
得分: ['5', '7', '3']
论文链接: openreview.net/forum?id=H5…
[174]. Policy Gradient with Expected Quadratic Utility Maximization: A New Mean-Variance Approach in Reinforcement Learning
平均得分: 5
得分: ['4', '5', '6']
论文链接: openreview.net/forum?id=BE…
[175]. D2RL: Deep Dense Architectures in Reinforcement Learning
平均得分: 5
得分: ['4', '8', '4', '4']
论文链接: openreview.net/forum?id=mY…
[176]. Intention Propagation for Multi-agent Reinforcement Learning
平均得分: 5
得分: ['3', '6', '6', '5']
论文链接: openreview.net/forum?id=7a…
[177]. SIM-GAN: Adversarial Calibration of Multi-Agent Market Simulators.
平均得分: 5
得分: ['3', '7', '5']
论文链接: openreview.net/forum?id=1z…
[178]. Preventing Value Function Collapse in Ensemble Q-Learning by Maximizing Representation Diversity
平均得分: 5
得分: ['4', '5', '5', '6']
论文链接: openreview.net/forum?id=dN…
[179]. REPAINT: Knowledge Transfer in Deep Actor-Critic Reinforcement Learning
平均得分: 5
得分: ['4', '6', '4', '6']
论文链接: openreview.net/forum?id=P8…
[180]. Mixture of Step Returns in Bootstrapped DQN
平均得分: 5
得分: ['5', '4', '4', '7', '5']
论文链接: openreview.net/forum?id=X6…
[181]. PAC-Bayesian Randomized Value Function with Informative Prior
平均得分: 4.8
得分: ['7', '3', '5', '4', '5']
论文链接: openreview.net/forum?id=d2…
[182]. Learning Safe Multi-agent Control with Decentralized Neural Barrier Certificates
平均得分: 4.8
得分: ['4', '4', '6', '5', '5']
论文链接: openreview.net/forum?id=P6…
[183]. Maximum Reward Formulation In Reinforcement Learning
平均得分: 4.8
得分: ['5', '6', '3', '4', '6']
论文链接: openreview.net/forum?id=Bn…
[184]. Model-Free Counterfactual Credit Assignment
平均得分: 4.75
得分: ['5', '5', '6', '3']
论文链接: openreview.net/forum?id=F8…
[185]. Plan-Based Asymptotically Equivalent Reward Shaping
平均得分: 4.75
得分: ['3', '5', '7', '4']
论文链接: openreview.net/forum?id=w2…
[186]. Design-Bench: Benchmarks for Data-Driven Offline Model-Based Optimization
平均得分: 4.75
得分: ['4', '3', '7', '5']
论文链接: openreview.net/forum?id=cQ…
[187]. Regioned Episodic Reinforcement Learning
平均得分: 4.75
得分: ['6', '4', '5', '4']
论文链接: openreview.net/forum?id=am…
[188]. Reinforcement Learning with Bayesian Classifiers: Efficient Skill Learning from Outcome Examples
平均得分: 4.75
得分: ['5', '4', '5', '5']
论文链接: openreview.net/forum?id=OZ…
[189]. Provably More Efficient Q-Learning in the One-Sided-Feedback/Full-Feedback Settings
平均得分: 4.75
得分: ['4', '4', '6', '5']
论文链接: openreview.net/forum?id=vY…
[190]. Systematic Evaluation of Causal Discovery in Visual Model Based Reinforcement Learning
平均得分: 4.75
得分: ['4', '6', '4', '5']
论文链接: openreview.net/forum?id=gp…
[191]. Safe Reinforcement Learning with Natural Language Constraints
平均得分: 4.75
得分: ['5', '3', '5', '6']
论文链接: openreview.net/forum?id=Ua…
[192]. ReaPER: Improving Sample Efficiency in Model-Based Latent Imagination
平均得分: 4.75
得分: ['4', '5', '4', '6']
论文链接: openreview.net/forum?id=nl…
[193]. Coordinated Multi-Agent Exploration Using Shared Goals
平均得分: 4.75
得分: ['4', '5', '5', '5']
论文链接: openreview.net/forum?id=MP…
[194]. Measuring and mitigating interference in reinforcement learning
平均得分: 4.75
得分: ['5', '6', '4', '4']
论文链接: openreview.net/forum?id=26…
[195]. Hamiltonian Q-Learning: Leveraging Importance-sampling for Data Efficient RL
平均得分: 4.75
得分: ['5', '5', '5', '4']
论文链接: openreview.net/forum?id=10…
[196]. A Maximum Mutual Information Framework for Multi-Agent Reinforcement Learning
平均得分: 4.75
得分: ['3', '5', '6', '5']
论文链接: openreview.net/forum?id=_z…
[197]. Non-decreasing Quantile Function Network with Efficient Exploration for Distributional Reinforcement Learning
平均得分: 4.75
得分: ['4', '5', '4', '6']
论文链接: openreview.net/forum?id=f_…
[198]. Constrained Reinforcement Learning With Learned Constraints
平均得分: 4.75
得分: ['3', '3', '5', '8']
论文链接: openreview.net/forum?id=ak…
[199]. Efficient Exploration for Model-based Reinforcement Learning with Continuous States and Actions
平均得分: 4.75
得分: ['5', '5', '4', '5']
论文链接: openreview.net/forum?id=as…
[200]. Error Controlled Actor-Critic Method to Reinforcement Learning
平均得分: 4.75
得分: ['7', '3', '3', '6']
论文链接: openreview.net/forum?id=n5…
[201]. Cross-State Self-Constraint for Feature Generalization in Deep Reinforcement Learning
平均得分: 4.75
得分: ['5', '5', '4', '5']
论文链接: openreview.net/forum?id=Ji…
[202]. Safety Aware Reinforcement Learning (SARL)
平均得分: 4.75
得分: ['4', '6', '6', '3']
论文链接: openreview.net/forum?id=RD…
[203]. UneVEn: Universal Value Exploration for Multi-Agent Reinforcement Learning
平均得分: 4.75
得分: ['4', '4', '6', '5']
论文链接: openreview.net/forum?id=0z…
[204]. Interpretable Reinforcement Learning With Neural Symbolic Logic
平均得分: 4.67
得分: ['5', '4', '5']
论文链接: openreview.net/forum?id=M_…
[205]. Network Reusability Analysis for Multi-Joint Robot Reinforcement Learning
平均得分: 4.67
得分: ['5', '4', '5']
论文链接: openreview.net/forum?id=hy…
[206]. Factored Action Spaces in Deep Reinforcement Learning
平均得分: 4.67
得分: ['6', '3', '5']
论文链接: openreview.net/forum?id=na…
[207]. Genetic Soft Updates for Policy Evolution in Deep Reinforcement Learning
平均得分: 4.67
得分: ['4', '6', '4']
论文链接: openreview.net/forum?id=TG…
[208]. The Skill-Action Architecture: Learning Abstract Action Embeddings for Reinforcement Learning
平均得分: 4.67
得分: ['5', '4', '5']
论文链接: openreview.net/forum?id=PU…
[209]. Learning Intrinsic Symbolic Rewards in Reinforcement Learning
平均得分: 4.67
得分: ['5', '4', '5']
论文链接: openreview.net/forum?id=4C…
[210]. Robust Offline Reinforcement Learning from Low-Quality Data
平均得分: 4.6
得分: ['5', '4', '6', '6', '2']
论文链接: openreview.net/forum?id=uO…
[211]. Adaptive Learning Rates for Multi-Agent Reinforcement Learning
平均得分: 4.6
得分: ['5', '4', '4', '5', '5']
论文链接: openreview.net/forum?id=yN…
[212]. Revisiting Parameter Sharing in Multi-Agent Deep Reinforcement Learning
平均得分: 4.5
得分: ['3', '3', '5', '7']
论文链接: openreview.net/forum?id=MW…
[213]. Addressing Distribution Shift in Online Reinforcement Learning with Offline Datasets
平均得分: 4.5
得分: ['6', '5', '4', '3']
论文链接: openreview.net/forum?id=9h…
[214]. TOMA: Topological Map Abstraction for Reinforcement Learning
平均得分: 4.5
得分: ['4', '3', '5', '6']
论文链接: openreview.net/forum?id=yo…
[215]. Multi-agent Policy Optimization with Approximatively Synchronous Advantage Estimation
平均得分: 4.5
得分: ['5', '3', '6', '4']
论文链接: openreview.net/forum?id=Rw…
[216]. Why Convolutional Networks Learn Oriented Bandpass Filters: Theory and Empirical Support
平均得分: 4.5
得分: ['6', '4', '5', '3']
论文链接: openreview.net/forum?id=UJ…
[217]. Self-Activating Neural Ensembles for Continual Reinforcement Learning
平均得分: 4.5
得分: ['4', '4', '4', '6']
论文链接: openreview.net/forum?id=Jf…
[218]. Approximating Pareto Frontier through Bayesian-optimization-directed Robust Multi-objective Reinforcement Learning
平均得分: 4.5
得分: ['5', '5', '5', '3']
论文链接: openreview.net/forum?id=S9…
[219]. Model-Based Reinforcement Learning via Latent-Space Collocation
平均得分: 4.5
得分: ['3', '5', '6', '4']
论文链接: openreview.net/forum?id=ku…
[220]. CDT: Cascading Decision Trees for Explainable Reinforcement Learning
平均得分: 4.5
得分: ['4', '4', '5', '5']
论文链接: openreview.net/forum?id=Wd…
[221]. PGPS : Coupling Policy Gradient with Population-based Search
平均得分: 4.5
得分: ['5', '5', '3', '5']
论文链接: openreview.net/forum?id=Pe…
[222]. CAT-SAC: Soft Actor-Critic with Curiosity-Aware Entropy Temperature
平均得分: 4.5
得分: ['6', '4', '4', '4']
论文链接: openreview.net/forum?id=pa…
[223]. Learning to Observe with Reinforcement Learning
平均得分: 4.5
得分: ['3', '6', '5', '4']
论文链接: openreview.net/forum?id=65…
[224]. Probabilistic Mixture-of-Experts for Efficient Deep Reinforcement Learning
平均得分: 4.5
得分: ['3', '6', '3', '6']
论文链接: openreview.net/forum?id=Lt…
[225]. Visual Imitation with Reinforcement Learning using Recurrent Siamese Networks
平均得分: 4.5
得分: ['4', '4', '4', '6']
论文链接: openreview.net/forum?id=MB…
[226]. Lyapunov Barrier Policy Optimization
平均得分: 4.5
得分: ['4', '6', '4', '4']
论文链接: openreview.net/forum?id=qU…
[227]. A Deeper Look at Discounting Mismatch in Actor-Critic Algorithms
平均得分: 4.5
得分: ['6', '4', '3', '5']
论文链接: openreview.net/forum?id=yp…
[228]. Cross-Modal Domain Adaptation for Reinforcement Learning
平均得分: 4.5
得分: ['5', '4', '5', '4']
论文链接: openreview.net/forum?id=0o…
[229]. L2E: Learning to Exploit Your Opponent
平均得分: 4.5
得分: ['6', '4', '3', '5']
论文链接: openreview.net/forum?id=m4…
[230]. MQES: Max-Q Entropy Search for Efficient Exploration in Continuous Reinforcement Learning
平均得分: 4.4
得分: ['4', '3', '5', '6', '4']
论文链接: openreview.net/forum?id=98…
[231]. Robust Multi-Agent Reinforcement Learning Driven by Correlated Equilibrium
平均得分: 4.4
得分: ['5', '4', '3', '6', '4']
论文链接: openreview.net/forum?id=Jv…
[232]. R-LAtte: Attention Module for Visual Control via Reinforcement Learning
平均得分: 4.33
得分: ['4', '4', '5']
论文链接: openreview.net/forum?id=D4…
[233]. Multi-agent Deep FBSDE Representation For Large Scale Stochastic Differential Games
平均得分: 4.33
得分: ['5', '3', '5']
论文链接: openreview.net/forum?id=Uo…
[234]. Aspect-based Sentiment Classification via Reinforcement Learning
平均得分: 4.33
得分: ['5', '5', '3']
论文链接: openreview.net/forum?id=bf…
[235]. Refine and Imitate: Reducing Repetition and Inconsistency in Dialogue Generation via Reinforcement Learning and Human Demonstration
平均得分: 4.33
得分: ['3', '6', '4']
论文链接: openreview.net/forum?id=Jt…
[236]. An Examination of Preference-based Reinforcement Learning for Treatment Recommendation
平均得分: 4.33
得分: ['4', '4', '5']
论文链接: openreview.net/forum?id=ux…
[237]. Adaptive Dataset Sampling by Deep Policy Gradient
平均得分: 4.33
得分: ['5', '3', '5']
论文链接: openreview.net/forum?id=t2…
[238]. Convergence Proof for Actor-Critic Methods Applied to PPO and RUDDER
平均得分: 4.25
得分: ['5', '4', '4', '4']
论文链接: openreview.net/forum?id=0h…
[239]. Q-Value Weighted Regression: Reinforcement Learning with Limited Data
平均得分: 4.25
得分: ['4', '6', '3', '4']
论文链接: openreview.net/forum?id=rd…
[240]. ScheduleNet: Learn to Solve MinMax mTSP Using Reinforcement Learning with Delayed Reward
平均得分: 4.25
得分: ['5', '4', '3', '5']
论文链接: openreview.net/forum?id=P6…
[241]. Benchmarking Multi-Agent Deep Reinforcement Learning Algorithms
平均得分: 4.25
得分: ['4', '4', '3', '6']
论文链接: openreview.net/forum?id=t5…
[242]. Reinforcement Learning for Sparse-Reward Object-Interaction Tasks in First-person Simulated 3D Environments
平均得分: 4.25
得分: ['3', '4', '4', '6']
论文链接: openreview.net/forum?id=7A…
[243]. Model-Free Energy Distance for Pruning DNNs
平均得分: 4.25
得分: ['5', '2', '5', '5']
论文链接: openreview.net/forum?id=k2…
[244]. D4RL: Datasets for Deep Data-Driven Reinforcement Learning
平均得分: 4.25
得分: ['2', '3', '6', '6']
论文链接: openreview.net/forum?id=px…
[245]. Exploring Transferability of Perturbations in Deep Reinforcement Learning
平均得分: 4.25
得分: ['3', '4', '6', '4']
论文链接: openreview.net/forum?id=in…
[246]. Alpha-DAG: a reinforcement learning based algorithm to learn Directed Acyclic Graphs
平均得分: 4.25
得分: ['4', '5', '4', '4']
论文链接: openreview.net/forum?id=0j…
[247]. Visual Explanation using Attention Mechanism in Actor-Critic-based Deep Reinforcement Learning
平均得分: 4.25
得分: ['5', '5', '4', '3']
论文链接: openreview.net/forum?id=Y0…
[248]. Knapsack Pruning with Inner Distillation
平均得分: 4.25
得分: ['4', '4', '5', '4']
论文链接: openreview.net/forum?id=O9…
[249]. Reinforcement Learning for Flexibility Design Problems
平均得分: 4.25
得分: ['5', '4', '4', '4']
论文链接: openreview.net/forum?id=oA…
[250]. Model-based Navigation in Environments with Novel Layouts Using Abstract $2$-D Maps
平均得分: 4.25
得分: ['6', '4', '4', '3']
论文链接: openreview.net/forum?id=_l…
[251]. Model-Based Robust Deep Learning: Generalizing to Natural, Out-of-Distribution Data
平均得分: 4.25
得分: ['5', '5', '4', '3']
论文链接: openreview.net/forum?id=Rg…
[252]. Structure and randomness in planning and reinforcement learning
平均得分: 4.2
得分: ['5', '3', '6', '3', '4']
论文链接: openreview.net/forum?id=UO…
[253]. Trust, but verify: model-based exploration in sparse reward environments
平均得分: 4
得分: ['4', '2', '6', '4']
论文链接: openreview.net/forum?id=DE…
[254]. Play to Grade: Grading Interactive Coding Games as Classifying Markov Decision Process
平均得分: 4
得分: ['4', '3', '5']
论文链接: openreview.net/forum?id=GJ…
[255]. Graph Convolutional Value Decomposition in Multi-Agent Reinforcement Learning
平均得分: 4
得分: ['5', '3', '4', '4']
论文链接: openreview.net/forum?id=gD…
[256]. Regret Bounds and Reinforcement Learning Exploration of EXP-based Algorithms
平均得分: 4
得分: ['4', '4', '4']
论文链接: openreview.net/forum?id=-5…
[257]. MDP Playground: Controlling Dimensions of Hardness in Reinforcement Learning
平均得分: 4
得分: ['4', '3', '4', '5']
论文链接: openreview.net/forum?id=ax…
[258]. Intrinsically Guided Exploration in Meta Reinforcement Learning
平均得分: 4
得分: ['4', '4', '4', '4']
论文链接: openreview.net/forum?id=Rw…
[259]. Adaptive N-step Bootstrapping with Off-policy Data
平均得分: 4
得分: ['4', '4', '3', '5']
论文链接: openreview.net/forum?id=bh…
[260]. FORK: A FORward-looKing Actor for Model-Free Reinforcement Learning
平均得分: 4
得分: ['5', '3', '5', '3']
论文链接: openreview.net/forum?id=lX…
[261]. Measuring Progress in Deep Reinforcement Learning Sample Efficiency
平均得分: 4
得分: ['4', '5', '5', '2']
论文链接: openreview.net/forum?id=_Q…
[262]. Advantage-Weighted Regression: Simple and Scalable Off-Policy Reinforcement Learning
平均得分: 4
得分: ['6', '3', '4', '3']
论文链接: openreview.net/forum?id=To…
[263]. Joint State-Action Embedding for Efficient Reinforcement Learning
平均得分: 3.8
得分: ['5', '1', '4', '3', '6']
论文链接: openreview.net/forum?id=5U…
[264]. Deep Reinforcement Learning for Optimal Stopping with Application in Financial Engineering
平均得分: 3.75
得分: ['2', '4', '4', '5']
论文链接: openreview.net/forum?id=RE…
[265]. Playing Atari with Capsule Networks: A systematic comparison of CNN and CapsNets-based agents.
平均得分: 3.75
得分: ['2', '4', '5', '4']
论文链接: openreview.net/forum?id=Ge…
[266]. Robust Constrained Reinforcement Learning for Continuous Control with Model Misspecification
平均得分: 3.75
得分: ['4', '3', '3', '5']
论文链接: openreview.net/forum?id=e-…
[267]. Decorrelated Double Q-learning
平均得分: 3.75
得分: ['4', '3', '5', '3']
论文链接: openreview.net/forum?id=jc…
[268]. Learning to Dynamically Select Between Reward Shaping Signals
平均得分: 3.75
得分: ['5', '2', '4', '4']
论文链接: openreview.net/forum?id=Nr…
[269]. Empirically Verifying Hypotheses Using Reinforcement Learning
平均得分: 3.75
得分: ['3', '3', '5', '4']
论文链接: openreview.net/forum?id=Xb…
[270]. Self-Supervised Continuous Control without Policy Gradient
平均得分: 3.75
得分: ['3', '4', '4', '4']
论文链接: openreview.net/forum?id=pN…
[271]. Dynamic Relational Inference in Multi-Agent Trajectories
平均得分: 3.75
得分: ['2', '4', '5', '4']
论文链接: openreview.net/forum?id=UV…
[272]. Greedy Multi-Step Off-Policy Reinforcement Learning
平均得分: 3.75
得分: ['2', '4', '4', '5']
论文链接: openreview.net/forum?id=rA…
[273]. Addressing Extrapolation Error in Deep Offline Reinforcement Learning
平均得分: 3.67
得分: ['3', '4', '4']
论文链接: openreview.net/forum?id=OC…
[274]. Offline Policy Optimization with Variance Regularization
平均得分: 3.67
得分: ['3', '4', '4']
论文链接: openreview.net/forum?id=P3…
[275]. Fine-Tuning Offline Reinforcement Learning with Model-Based Policy Optimization
平均得分: 3.6
得分: ['3', '4', '4', '5', '2']
论文链接: openreview.net/forum?id=wi…
[276]. Learning to communicate through imagination with model-based deep multi-agent reinforcement learning
平均得分: 3.5
得分: ['3', '4', '4', '3']
论文链接: openreview.net/forum?id=bo…
[277]. A Robust Fuel Optimization Strategy For Hybrid Electric Vehicles: A Deep Reinforcement Learning Based Continuous Time Design Approach
平均得分: 3.5
得分: ['3', '5', '4', '2']
论文链接: openreview.net/forum?id=LF…
[278]. Deep Reinforcement Learning With Adaptive Combined Critics
平均得分: 3.5
得分: ['3', '3', '5', '3']
论文链接: openreview.net/forum?id=gt…
[279]. FSV: Learning to Factorize Soft Value Function for Cooperative Multi-Agent Reinforcement Learning
平均得分: 3.4
得分: ['2', '6', '2', '3', '4']
论文链接: openreview.net/forum?id=ij…
[280]. Success-Rate Targeted Reinforcement Learning by Disorientation Penalty
平均得分: 3.25
得分: ['2', '3', '4', '4']
论文链接: openreview.net/forum?id=rQ…
[281]. Explainable Reinforcement Learning Through Goal-Based Explanations
平均得分: 3.25
得分: ['3', '3', '4', '3']
论文链接: openreview.net/forum?id=Il…
[282]. Hierarchical Meta Reinforcement Learning for Multi-Task Environments
平均得分: 3.25
得分: ['3', '3', '4', '3']
论文链接: openreview.net/forum?id=u9…
[283]. Interpretable Meta-Reinforcement Learning with Actor-Critic Method
平均得分: 3.2
得分: ['4', '3', '4', '2', '3']
论文链接: openreview.net/forum?id=-R…
[284]. Reinforcement Learning Based Asymmetrical DNN Modularization for Optimal Loading
平均得分: 3
得分: ['3', '2', '3', '4']
论文链接: openreview.net/forum?id=_q…
[285]. Stochastic Inverse Reinforcement Learning
平均得分: 2.8
得分: ['2', '2', '4', '3', '3']
论文链接: openreview.net/forum?id=l3…
[286]. Using Deep Reinforcement Learning to Train and Evaluate Instructional Sequencing Policies for an Intelligent Tutoring System
平均得分: 2.67
得分: ['2', '4', '2']
论文链接: openreview.net/forum?id=eI…
[287]. Guiding Representation Learning in Deep Generative Models with Policy Gradients
平均得分: 2.5
得分: ['2', '4', '3', '1']
论文链接: openreview.net/forum?id=sg…
完
往期精彩回顾
适合初学者入门人工智能的路线及资料下载机器学习及深度学习笔记等资料打印机器学习在线手册深度学习笔记专辑《统计学习方法》的代码复现专辑
AI基础下载机器学习的数学基础专辑
获取本站知识星球优惠券,复制链接直接打开:
https://t.zsxq.com/y7uvZF6
本站qq群704220115。
加入微信群请扫码: