### Abstract

We develop a model in which investors must learn the distribution of asset returns over time. The process of learning is made more difficult by the fact that the distributions are not constant through time. We consider risk-neutral investors who have quadratic utility and are selecting between two risky assets. We determine the time at which it is optimal to update the distribution estimate and hence, alter portfolio weights. Our results deliver an optimal policy for asset allocation, that is, the sequence of time intervals at which it is optimal to switch between assets, based on stochastic optimal control theory. In addition, we determine the time intervals in which asset switching leads to a loss with high probability. We provide estimates of the effectiveness of the optimal policy.

Original language | English |
---|---|

Pages (from-to) | 269-312 |

Number of pages | 44 |

Journal | Journal of Computational Analysis and Applications |

Volume | 4 |

Issue number | 4 |

DOIs | |

State | Published - Dec 1 2002 |

### Fingerprint

### Keywords

- Asset allocation
- Optimal policy
- Stochastic control
- Two-armed Bandit problem

### Cite this

*Journal of Computational Analysis and Applications*,

*4*(4), 269-312. https://doi.org/10.1023/A:1019954626395