LIAO YONG Technology Space
Drive AI
HOME
Mathematics
Optimization
Numerical Computation
Probabilistic Graphical Models
MACHINE LEARNING
Supervised Learning
Classification
Regression
Unsupervised Learning
Clustering
Dimensionality Reduction
Meta Learning
DEEP LEARNING
Models
Techniques
Toolkits/Libs
Topics
Applications
Computer Vision
Natural Language Processing
REINFORCEMENT LEARNING
Model Free
Value Based
Policy Based
Hybrid Methods
Model Based
Learned Model
Given Model
Multi-agent RL
Extended Topics
Imitation Learning
Inverse RL
Meta-RL
Offline RL
Transfer/Multitask RL
Toolkits/Libs
Dynamic Programming
COMPUTING ADVERTISING
CTR Prediction
Online Learning
Multi-Task Learning
COMPUTER SCIENCE
Linux
Big Data
Programming
Data Structure
Toolkits/Libs
ABOUT
×
Search for:
Notification
2019-10-03
Welcome to this new website!
COMPUTER SCIENCE
About Me
Focus, Life-time Devotion
Recent Posts
Loss Weighting in Multi-task Learning
Overview of Industrial CTR Prediction
LOSS: Wasserstein Distance
Natural Policy Gradient
Trust Region Policy Optimization (TRPO)
Prioritized Double DQN
Deep Q-Learning Series (DQN)
Temporal Difference Learning from Scratch
Dynamic Programming in Reinforcement Learning
Asynchronous Advantage Actor Critic (A3C)
Archives
Archives
Select Month
January 2022 (2)
May 2021 (1)
February 2020 (2)
January 2020 (2)
December 2019 (2)
November 2019 (3)
October 2019 (3)
Categories
Categories
Select Category
Comprehension (1)
CTR Prediction (1)
Dynamic Programming (1)
Featured Posts (12)
Hilighted Posts (1)
Model Free (9)
Hybrid Methods (3)
Policy Based (3)
Value Based (3)
Multi-Task Learning (1)
Notification (1)
Topics (1)
Tags
actor-critic
(2)
CTR
(1)
dqn
(1)
MTL
(1)
Contact
Email
: mail@liaoyong.net
京ICP备17032968号-1