Dirichlet policies for reinforced factor portfolios.

Eric André, Guillaume Coqueret

2020

This article combines factor investing and reinforcement learning (RL). The agent learns through sequential random allocations that depend on firms' characteristics. Using Dirichlet distributions as the driving policy, we derive closed forms for the policy gradients and analytical properties of the performance measure. This enables the implementation of REINFORCE methods, which we apply to a large dataset of US equities. Across a wide range of implementation choices, our results indicate that RL-based portfolios are very close to the equally-weighted (1/N) allocation. This implies that the agent learns to be agnostic with regard to factors. This is partly consistent with cross-sectional regressions showing strong time variation in the relationship between returns and firm characteristics.
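
To make the mechanics concrete, the following is a minimal sketch of one REINFORCE update under a Dirichlet policy. It is an illustration under stated assumptions, not the authors' implementation: the link alpha_i = exp(theta . x_i) between characteristics and concentration parameters, the reward (the one-period portfolio return), the learning rate, and all function names are hypothetical choices made for this example. The only ingredient taken as fact is the standard score of the Dirichlet density, d log f / d alpha_i = psi(sum_j alpha_j) - psi(alpha_i) + log w_i, where psi is the digamma function.

```python
import math
import random


def digamma(x):
    """Digamma function for x > 0: upward recurrence, then asymptotic series."""
    result = 0.0
    while x < 6.0:
        result -= 1.0 / x
        x += 1.0
    inv = 1.0 / x
    inv2 = inv * inv
    # ln x - 1/(2x) - 1/(12x^2) + 1/(120x^4) - 1/(252x^6)
    result += math.log(x) - 0.5 * inv - inv2 * (1.0 / 12 - inv2 * (1.0 / 120 - inv2 / 252))
    return result


def sample_dirichlet(alpha, rng):
    """Draw portfolio weights from Dirichlet(alpha) by normalizing gamma variates."""
    g = [rng.gammavariate(a, 1.0) for a in alpha]
    total = sum(g)
    return [v / total for v in g]


def grad_log_dirichlet(w, alpha):
    """Gradient of the Dirichlet log-density with respect to each alpha_i."""
    psi_sum = digamma(sum(alpha))
    # Floor the weights to avoid log(0) if a gamma variate underflows.
    return [psi_sum - digamma(a) + math.log(max(wi, 1e-300)) for a, wi in zip(alpha, w)]


def reinforce_step(theta, X, returns, lr, rng):
    """One plain REINFORCE update (no baseline); returns (new_theta, weights).

    ASSUMPTION: alpha_i = exp(theta . x_i), with X[i] the characteristics of asset i.
    """
    alpha = [math.exp(sum(t * x for t, x in zip(theta, xi))) for xi in X]
    w = sample_dirichlet(alpha, rng)
    reward = sum(wi * ri for wi, ri in zip(w, returns))
    g_alpha = grad_log_dirichlet(w, alpha)
    # Chain rule: d alpha_i / d theta_k = alpha_i * x_ik under the exponential link.
    grad_theta = [
        sum(g * a * xi[k] for g, a, xi in zip(g_alpha, alpha, X))
        for k in range(len(theta))
    ]
    new_theta = [t + lr * reward * g for t, g in zip(theta, grad_theta)]
    return new_theta, w


if __name__ == "__main__":
    rng = random.Random(0)
    # Toy inputs, purely illustrative: 3 assets, 2 characteristics each.
    X = [[0.5, -1.0], [0.0, 0.3], [-0.5, 0.7]]
    returns = [0.01, -0.02, 0.03]
    theta, weights = reinforce_step([0.0, 0.0], X, returns, lr=0.1, rng=rng)
    print(theta, weights)
```

With theta = 0 every concentration parameter equals 1, so the policy is uniform on the simplex and its mean is the 1/N allocation, which is the benchmark the abstract refers to. A practical implementation would likely average the gradient over many sampled allocations and subtract a baseline to reduce variance; both are omitted here for brevity.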

Keywords:

Econometrics
Dirichlet distribution
Reinforcement learning
large range
Computer science

