Abstract
In this paper, we consider the problem of the optimal timing to initiate a medical treatment. In the absence of treatment, we model the disease evolution as a semi-Markov process. The optimal time to initiate the treatment is a stopping time, which maximizes the total expected reward for the patient. We propose a stochastic dynamic programming formulation to find this stopping time. Under some plausible conditions, we show that the maximum total expected reward at the start of a health state will be smaller when the patient is in a more severe state. We then prove that the optimal policy for initializing the treatment is determined by a time threshold for each given health state. That is, in each health state, the treatment should be planned to start, when the patient’s duration time in the health state reaches (or exceeds, in the case of a late observation of the patient’s health status) a certain threshold level. We also present numerical examples to illustrate our model and to provide managerial insights.







Similar content being viewed by others
References
Kousignian, I., Abgrall, S., Duval, X., Descamps, D., Matheron, S., Costagliola, D.: Modeling the time course of CD4 T-lymphocyte counts according to the level of virologic rebound in HIV-1-infected patients on highly active antiretroviral therapy. J. Acquir. Immune Defic. Syndr. 34(1), 50–57 (2003). URL http://www.ncbi.nlm.nih.gov/pubmed/14501793
Mathieu, E., Loup, P., Dellamonica, P., Daures, J.P.: Markov modelling of immunological and virological states in HIV-1 infected patients. Biom. J. 47(6), 834–846 (2005)
Pérez-Ocón, R., Ruiz-Castro, J.E., Gámiz-Pérez, M.L.: A multivariate model to measure the effect of treatments in survival to breast cancer. Biom. J. 40(6), 703–715 (1998)
Sweeting, M.J., Farewell, V.T., Angelis, D.D.: Multi-state Markov models for disease progression in the presence of informative examination times: an application to hepatitis C. Stat. Med. 29(11), 1161–1174 (2010)
Xiong, C., Miller, J.P., Gao, F., Yan, Y., Morris, J.C.: Testing increasing hazard rate for the progression time of dementia. Discrete Contin. Dyn. Syst. Ser. B 4(3), 813–821 (2004)
Foucher, Y., Mathieu, E., Philippe, S.P., Durands, J.F., Daures, J.P.: A semi-Markov model based on generalized Weibull distribution with an illustration for HIV disease. Biom. J. 47(6), 825–833 (2005)
Foucher, Y., Giral, M., Soulillou, J.P., Daures, J.P.: A semi-Markov model for multistate and interval-censored data with multiple terminal events: application in renal transplantation. Stat. Med. 26(30), 5381–5393 (2007)
Friedman, A.: Optimal stopping problems in stochastic control. SIAM Rev. 21(1), 71–80 (1979)
Chow, Y.S., Robbins, H., Siegmund, D.: Great Expectations: The Theory of Optimal Stopping. Houghton Mifflin Company, Boston (1971)
Ferguson, T.S.: Optimal stopping and applications. Lecture Notes. Department of Mathematics, University of California, Los Angeles
Boshuizen, F.A., Gouweleeuw, J.M.: General optimal stopping theorems for semi-Markov processes. Adv. Appl. Probab. 25(4), 825–846 (1993)
Oh, S.: Optimal Stopping Problems in Operations Management. Ph.D. Thesis, Stanford University (2010)
Kurt, M., Denton, B.T., Schaefer, A.J., Shah, N.D., Smith, S.A.: The structure of optimal statin initiation policies for patients with type 2 diabetes. IIE Trans. Healthc. Syst. Eng. 1, 49–65 (2011)
Saithna, A., Pynsent, P.B., Grimer, R.J.: Retrospective analysis of the impact of symptom duration on prognosis in soft tissue sarcoma. Int. Orthop. 32(3), 381–384 (2008)
Maillart, L.M., Ivy, J.S., Ransom, S., Diehl, K.: Assessing dynamic breast cancer screening policies. Oper. Res. 56(6), 1411–1427 (2008)
Zhang, J., Denton, B.T., Balasubramanian, H., Shah, N.D., Inman, B.A.: Optimization of PSA screening policies: a comparison of the patient and societal perspectives. Med. Decis. Mak. 32(2), 337–349 (2012)
Denton, B.T., Kurt, M., Shah, N.D., Bryant, S.C., Smith, S.A.: Optimizing the start time of statin therapy for patients with diabetes. Med. Decis. Mak. 29, 351–367 (2009)
Mason, J.E., England, D.A., Denton, B.T., Smith, S.A., Kurt, M., Shah, N.D.: Optimizing statin treatment decisions for diabetes patients in the presence of uncertain future adherence. Med. Decis. Mak. 32(1), 154–166 (2012)
Shechter, S.M., Bailey, M.D., Schaefer, A.J.: A modeling framework for replacing medical therapies. IIE Trans. 40(9), 861–869 (2008)
Shechter, S.M., Bailey, M.D., Schaefer, A.J., Roberts, M.S.: Are threshold policies sufficient for the optimal timing of HIV treatment. Working Paper, Department of Industrial Engineering, University of Pittsburgh (2008)
Shechter, S.M., Alagoz, O., Roberts, M.S.: Irreversible treatment decisions under consideration of the research and development pipeline for new therapies. IIE Trans. 42(9), 632–642 (2010)
Lavieri, M.S., Puterman, M.L., Tyldesley, S., Morris, W.J.: When to treat prostate cancer patients based on their PSA dynamics. IIE Trans. Healthc. Syst. Eng. 2(1), 62–77 (2012)
Kirkizlar, H.E., Faissol, D.M., Griffin, P.M., Swann, J.L.: Timing of testing and treatment for asymptomatic diseases. Math. Biosci. 226(1), 28–37 (2010)
Denton, B., Alagoz, O., Holder, A., Lee, E.K.: Medical decision making: open research challenges. IIE Trans. Healthc. Syst. Eng. 1(3), 161–167 (2011)
Zhang, J., Mason, J.E., Denton, B.T., Pierskalla, W.P.: Applications of operations research to the prevention, detection, and treatment of disease (2013). To appear in Wiley Encyclopedia of Operations Research and Management Science, 3rd Edition
Pascucci, A.: Free boundary and optimal stopping problems for American Asian options. Finance Stat. 12(1), 21–41 (2008)
Wu, R., Fu, M.C.: Optimal exercise policies and simulation-based valuation for American-Asian options. Oper. Res. 51(1), 52–66 (2003)
Liu, X., Li, J., Al-Khalifa, K.N., Hamouda, A.S., Coit, D.W., Elsayed, E.A.: Condition-based maintenance for continuously monitored degrading systems with multiple failure modes. IIE Trans. 45, 422–435 (2013)
Martagan, T., Krishnamurthy, A., Maravelias, C.T.: Optimal condition-based harvesting policies for biomanufacturing operations with failure risks. IIE Trans. 48, 440–461 (2016)
Shechter, S.M., Bailey, M.D., Schaefer, A.J., Roberts, M.S.: The optimal time to initiate HIV therapy under ordered health states. Oper. Res. 56(1), 20–33 (2008)
Alagoz, O., Maillart, L.M., Schaefer, A.J., Roberts, M.S.: The optimal timing of living-donor liver transplantation. Manag. Sci. 50(10), 1420–1430 (2004)
Crowder, M.: A distributional model for repeated failure time measurements. J. R. Stat. Soc. Ser. B (Methodol.) 47(3), 447–452 (1985)
Kelle, P., Silver, E.A.: Safety stock reduction by order splitting. Nav. Res. Logist. 37(5), 725–743 (1990)
Kao, E.P.C.: Modeling the movement of coronary patients within a hospital by semi-Markov processes. Oper. Res. 22, 683–699 (1974)
Lawless, J.F.: Statistical Models and Methods for Lifetime Data, 2nd edn. Wiley Interscience, Hoboken, NJ (2003)
Acknowledgements
Dr. Mabel C. Chou is supported by grant N-311-000-464-022 NPRO10/NH060. Dr. Mahmut Parlar, is supported by the Natural Sciences and Engineering Research Council of Canada. The authors are grateful to the Editor-in-Chief, associate editor, and an anonymous referee’s comments, which lead to significant improvement of the paper.
Author information
Authors and Affiliations
Corresponding author
Additional information
Communicated by Alberto d’Onofrio.
Appendices
Appendix A: Proof of Proposition 2.1
Proof
According to Lemma 2.1 in Boshuizen and Gouweleeuw [11], for any \(k\ge 1\) there exists a random variable \(\tau _{k}\ge 0\) such that \( {\mathcal {T}}\wedge T_{k}=\left( T_{k-1}+\tau _{k}\right) \wedge T_{k}\), and \( \tau _{k}\) is \({\mathcal {F}}(T_{k})\)-measurable with \({\mathcal {F}}(s)\) standing for the \(\sigma \)-algebra generated by \(\left\{ Z(t),t\le s\right\} \). (That is, \(\tau _{k}\) reduces to a deterministic number when \(T_{k}\) is known.) It is equivalent to use the objective function \(P({\mathcal {T}})-P(T_{n}+t)\) in place of \(P({\mathcal {T}})\).
Let \(\tilde{V}(i,t)=\max _{{\mathcal {T}}\ge T_{n}+t}\left\{ P({\mathcal {T}} )-P(T_{n}+t)\mid X_{n}=i,S_{n}>t\right\} \text {.}\)
Then, \(\tilde{V}(i,t)=V(i,t)-E\left\{ P(T_{n}+t)\mid X_{n}=i,S_{n}>t\right\} =V(i,t)-R(i)\). We have
In order to maximize \(E\left[ P({\mathcal {T}})\mid X_{n}=i,S_{n}>t\right] \) with respect to \({\mathcal {T}}\), we can maximize with respect to \(\tau _{n+1}\) and \({\mathcal {T}}\vee T_{n+1}\) separately. Note that an arbitrary stopping time \(\tilde{\mathcal {T}}\ge T_{n+1}\) can be written into the form \({\mathcal {T}}\vee T_{n+1}\).
The term \(P\left( \left( T_{n}+\tau _{n+1}\right) \wedge T_{n+1}\right) -P(T_{n}+t)\) is the reward difference between initializing treatment at difference times. If \(S_{n}>\tau _{n}\), then the health state equals i when treatment starts for both starting time epochs. Otherwise, the health state is \(X_{n}\) at \(\left( T_{n-1}+\tau _{n}\right) \wedge T_{n}=T_{n}\) and is i at \(T_{n-1}+t\). Therefore, we have
Because \(\tilde{V}(j,t)=V(j,t)-R(j)\) for all \(j\in {\mathcal {H}}\), we can write (6) in terms of V, as in Eqs. (4)–(5). \(\square \)
Appendix B: Proof of Lemma 3.1
Proof
To prove the lemma, it is equivalent to show that \(\Pr \left( X_{n+1}(i,^{\le }s)\ge j\right) \) increases in s for all \(j\in {\mathcal {H}}\).
It is easy to verify that
By Assumption 3.3, \(\Pr \left( X_{n+1}(i,z)\ge j\right) \) increases in z. Then,
It follows that
Therefore, \(\Pr \left( X_{n+1}(i,^{\le }s)\ge j\right) \) is increasing in s, implying that \(X_{n+1}(i,^{\le }s)\) is stochastically increasing in s. \(\square \)
Appendix C: Proof of Proposition 3.1
Proof
By Lemma 3.1 and Assumption 3.4, we have \( X_{n+1}(i+1,^{\le }s)\ge _{\text {st}}X_{n+1}(i+1,1)\ge _{\text {st} }X_{n+1}(i)\ge _{\text {st}}X_{n+1}(i,^{\le }z)\) for all s and z. The optimality Eqs. (4)–(5) can be solved by a value iteration process. Let \(V_{0}(i,t)\equiv 0\) be the initial value function, \(V_{m}(i,t)\) the value function generated by the mth value iteration for \(m\ge 1\), and \(J_{m}(i,t,\tau )\) the value of function \( J(i,t,\tau )\) when V is replaced by \(V_{m}\) in (5).
If we can prove \(V_{m}(i,0)\) decreases in i for all \(m\ge 0\), then \( V(i,0)=\lim _{m\rightarrow \infty }V_{m}(i,0)\) also decreases in i. In the followings, we prove \(V_{m}(i,0)\) decreases in i for all \(m\ge 0\) by induction.
Suppose that \(V_{m}(i,0)\) is decreasing in i. Note that
By the inductive hypothesis that \(V_{m}(i,0)\) decreases in i, and the fact that \(X_{n+1}(i+1,^{\le }s)\ge _{st}X_{n+1}(i,^{\le }z)\) for all s and z, for each \(i\in {\mathcal {H}}\) there exists a(i) such that \(E\left[ V_{m}\left( X_{n+1}(i+1,^{\le }\tau ),0\right) \right] \le a(i)\le E\left[ V_{m}\left( X_{n+1}(i,^{\le }\tilde{\tau }),0\right) \right] \) for all \(\tilde{\tau }\ge 0\) and \(\tau \ge 0\).
For an arbitrary \(\tau \), let us select \(\tilde{\tau }\) such that \( F_{S_{n}(i)}(\tilde{\tau })\le F_{S_{n}(i+1)}(\tau )\le F_{S_{n}(i)}(\tilde{ \tau }+1)\).
If \(R(i)\ge a(i)\), by (7) and the condition \(R(i)\ge R(i+1)\) , then
Otherwise, \(R(i+1)\le R(i)<a(i)\). We can replace \(\tilde{\tau }\) with \( \tilde{\tau }+1\) in (7) and obtain
Finally, \(V_{m+1}(i,0)=\max _{\tau \ge 0}J_{m}(i,0,\tau )\ge \max \left\{ J_{m}(i,0, \tilde{\tau }),J_{m}(i,0,\tilde{\tau }+1)\right\} \ge J_{m}(i+1,0,\tau )\). Taking maximization with respect to \(\tau \) causes \(V_{m+1}(i,0)\ge V_{m+1}(i+1,0)\), proving the monotonicity of \(V_{m+1}(i,0)\) in i. \(\square \)
Appendix D: Proof of Lemma 3.2
Proof
It is easy to see that the following three equalities hold
By (5) and the definition of conditional probability, we have
where the function \(\gamma _{ij}\) is defined as \(\gamma _{ij}(t):=\Pr \left( X_{n+1}=j\mid X_{n}=i,S_{n}=t\right) \) and the last equality holds because
Moreover, \(\sum _{j\in {\mathcal {H}}}\gamma _{ij}(\tau +1)V(j,0)=E\left[ V\left( X_{n+1}(i,\tau +1),0\right) \right] \) is decreasing in \(\tau \), according to Assumption 3.3 and the fact that V(i, 0) decreases in i. Let \(u(i,\tau )=\frac{r(i)}{H_{i}(\tau )}-R(i)+\sum _{j\in {\mathcal {H}}}\gamma _{ij}(\tau +1)V(j,0)\). Then, \(u(i,\tau )\) is decreasing in \(\tau \). Let \(\tau ^{*}(i)=\min \left\{ \tau \ge 0\mid u(i,\tau )\le 0\right\} \). Based on the above analysis, we can conclude that \(J(i,t,\tau +1)-J(i,t,\tau )>0\) for \(\tau <\tau ^{*}(i)\), and \(J(i,t,\tau +1)-J(i,t,\tau )\le 0\) for \(\tau \ge \tau ^{*}(i)\). This proves the quasi-concavity of \(J(i,t,\tau )\) with respect to \(\tau \). \(\square \)
Rights and permissions
About this article
Cite this article
Chou, M.C., Parlar, M. & Zhou, Y. Optimal Timing to Initiate Medical Treatment for a Disease Evolving as a Semi-Markov Process. J Optim Theory Appl 175, 194–217 (2017). https://doi.org/10.1007/s10957-017-1139-7
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10957-017-1139-7