WEIGHT SPEEDY Q-LEARNING FOR FEEDBACK STABILIZATION OF PROBABILISTIC BOOLEAN CONTROL NETWORKS
In this paper, a scalable reinforcement learning (RL)-based technique is presented for the control of probabilistic Boolean control networks (PBCNs). In particular, we propose an improved Q-learning (QL) algorithm: weight speedy Q-learning (WSQL). Based on WSQL, the feedback stabilization problem of PBCNs is solved, and a state feedback controller is designed that stabilizes the PBCN at a given equilibrium point. Depending on the controller design, the PBCN can achieve either finite-time stability or asymptotic stability. The presented method is model-free and scalable. We also verify the convergence of the proposed algorithm. Finally, simulation results illustrate that, compared with QL, the proposed algorithm converges to the fixed point faster.
Keywords: probabilistic Boolean control networks, feedback stabilization, weight speedy Q-learning, model-free technique.
Received: February 3, 2023; Accepted: March 17, 2023; Published: April 19, 2023
How to cite this article: Yangyang Chen, Weight speedy Q-learning for feedback stabilization of probabilistic Boolean control networks, Far East Journal of Applied Mathematics 116(2) (2023), 149-171. http://dx.doi.org/10.17654/0972096023009
This open access article is licensed under the Creative Commons Attribution 4.0 International License.
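For readers unfamiliar with speedy Q-learning, the sketch below illustrates the style of update the abstract refers to. It is a minimal tabular implementation of the speedy Q-learning rule of Azar et al. (2011) with a weight parameter `w` on the acceleration term (w = 1 recovers standard speedy Q-learning); the actual WSQL weighting scheme, the `ToyPBCN` dynamics, the environment interface (`reset`/`step`), and all hyperparameter values are illustrative assumptions, not the construction given in the paper.

```python
import numpy as np

class ToyPBCN:
    """Hypothetical 2-gene PBCN used only to exercise the learner:
    at each step one of two Boolean update rules is sampled with
    probability p / (1 - p). The state packs (x1, x2) into an
    integer 0..3; the reward is 1 at the target equilibrium 0."""

    def __init__(self, p=0.7, target=0, seed=0):
        self.p, self.target = p, target
        self.rng = np.random.default_rng(seed)

    def reset(self):
        return int(self.rng.integers(4))

    def step(self, s, a):
        x1, x2 = s >> 1, s & 1
        if self.rng.random() < self.p:      # constituent network 1
            y1, y2 = x2 & a, x1 | a
        else:                               # constituent network 2
            y1, y2 = x1 ^ a, x2 & (1 - a)
        s2 = (y1 << 1) | y2
        return s2, float(s2 == self.target)

def weighted_speedy_q_learning(env, n_states, n_actions, gamma=0.95,
                               w=1.0, episodes=2000, horizon=30):
    """Tabular speedy Q-learning with a weight w on the acceleration
    term; w = 1 recovers the update of Azar et al. (2011)."""
    Q_prev = np.zeros((n_states, n_actions))  # Q_{k-1}
    Q = np.zeros((n_states, n_actions))       # Q_k
    visits = np.zeros((n_states, n_actions))  # per-pair update counts

    for _ in range(episodes):
        s = env.reset()
        for _ in range(horizon):
            a = np.random.randint(n_actions)  # uniform exploration
            s2, r = env.step(s, a)
            visits[s, a] += 1
            alpha = 1.0 / visits[s, a]        # SQL step size 1/(k+1)

            # Empirical Bellman operator applied to Q_{k-1} and Q_k
            tq_prev = r + gamma * Q_prev[s2].max()
            tq_curr = r + gamma * Q[s2].max()

            # Speedy Q-learning update with weighted acceleration term
            q_new = (Q[s, a] + alpha * (tq_prev - Q[s, a])
                     + w * (1.0 - alpha) * (tq_curr - tq_prev))

            Q_prev[s, a], Q[s, a] = Q[s, a], q_new
            s = s2

    # Greedy policy = state feedback controller u = pi(x)
    return Q.argmax(axis=1), Q

pi, Q = weighted_speedy_q_learning(ToyPBCN(), n_states=4, n_actions=2)
print("state feedback law:", pi)
```

The returned argmax policy plays the role of the state feedback controller described in the abstract; whether the resulting closed loop is finite-time or asymptotically stable depends, as the abstract notes, on how that controller is designed.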