AUTONOMOUS VISUAL SERVOING OF A ROBOT MANIPULATOR USING REINFORCEMENT LEARNING

Mehdi Sadeghzadeh, David Calvert, and Hussein A. Abdullah

References

  1. [1] C.J.C.H. Watkins and P. Dayan, “Q-learning , Machine Learning, 8, 1992, 279–292.
  2. [2] J.J. Craig, Introduction to robotics mechanics and control, 3rd ed. (Upper Saddle River, USA, NJ: Pearson Education Inc., 2005).
  3. [3] S. Hutchinson, G.D. Hager, and P.I. Corke, A tutorial on visual servo control, IEEE Transactions on Robotics and Automation, 12(5), 1996, 651–670.
  4. [4] A.C. Sanderson, L.E. Weiss, and C.P. Neuman, Dynamicsensor-based control of robots with visual feedback, IEEETransactions on Robotics and Automation, R.A-3, October1987, 404–417.
  5. [5] Y. Wang, H. Lang, and C.W. de Silva, A robust mobile robot manipulation system using a switch-based visual-servo controller, Int. Conf. on Industrial Mechatronics and Automation, Chengdu, 15–16 May 2009, 364–367.
  6. [6] L. Kaelbling, Learning in embedded systems (Cambridge, Massachusetts: MIT Press, 1993).
  7. [7] T.M. Mitchell, Machine learning (New York, USA: McGraw-Hill, 1997).
  8. [8] R. Sutton, Integrated architectures for learning, planning and reacting based on approximating dynamic programming, Proc. Seventh Int. Conf. of Machine Learning, (Austin, Texas, USA: Morgan Kaufmann, 1990), 216–224.
  9. [9] G.A. Rummery and M. Niranjan, On-line Q-learning us-ing connectionist systems (Cambridge University, September1994).
  10. [10] Jyh-Shing Roger Jang, ANFIS: Adaptive-network-based fuzzy inference system, IEEE Transaction on Systems, Man and Cybernetics, 23, May/June 1993, 665–685.
  11. [11] B. Widrow and M.A. Lehr, 30 years of adaptive neural networks perception, Madaline, and back propagation, Proceedings of IEEE, 9(78), September 1990, 1415–1442.
  12. [12] Y.-X. Yang, D. Liu, and H. Liu, Robot end-effector 2Dvisual positioning using neural networks, Proceeding of theSecond International Conference on Machine Learning andCybernetics, 5, November 2003, 2940–2943.
  13. [13] B.J.A. Krose, M.J. Van der Krose, and F.C.A. Groen, Learning strategies for a vision based neural controller for a robot arm, Proceedings of International IEEE Workshop on Intelligent Motion Control, 1, 1990, 199–203.
  14. [14] Q. Zhao, Z. Sun, F. Sun, and J. Zhu, Appearance-basedrobot visual servo via a wavelet neural network, International Journal of Control, Automation and Systems, 6(4), August 2008, 607–612.
  15. [15] S.K. Nayar, H. Murase, and S.A. Nene, Learning, positioning, and tracking visual appearance, Proceedings of the IEEE International Conference on Robotics and Automation, 4, 1994, 3237–3244.
  16. [16] B.H. Dinh, M.W. Dunnigan, and D.S. Reay, Position con-trol of a robotic manipulator using a radial basis func-tion network and simple vision system, IEEE InternationalSymposium on Industrial Electronics, 2008, 1371–1376.DOI: 10.1109/ISIE.2008.4677070.
  17. [17] J. Zhang, R. Schmidt, and A. Knoll, Appearance-based visual learning in neuro-fuzzy model for fine-positioning of manipulator, Proceeding of the IEEE International Conference on Robotics and Automation, 2, May 1999, 1164–1169.
  18. [18] M. Hao, Z. Sun, and M. Fujii, Image based visual servoing using Takagi-Sugeno fuzzy neural network controller, 22nd IEEE International Symposium on Intelligent Control, October 2007, 53–58.
  19. [19] H. Liu, D. Liu, and Y.-X. Yang, Research of real time robot visual servoing based on Genetic algorithm, Proceeding of the First International Conference on Machine Learning and Cybernetics, Beijing, 1, 4–5 November 2002, 87–90.
  20. [20] S.A. Mazhari and S. Kumar, Hybrid GA tuned RBF basedneuro-fuzzy controller for robotic manipulator, International Journal of Electrical, Computer, and Systems Engineering, 2(4), Fall 2008, 268–276.
  21. [21] Y. Wang, H. Lang, and C.W. de Silva, A robust mobile robot manipulation system using a switch-based visual-servo controller, International Conference on Industrial Mechatronics and Automation, Chengdu, 15–16 May 2009, 364–367.
  22. [22] Y. Wang, H. Lang, and C.W. de Silva, A hybrid visualservo controller for robust grasping by wheeled mobile robots, IEEE/ASME Transactions on Mechatronics, 15(5), October 2010, 757–769.
  23. [23] A. Anglani, F. Taurisano, R.D. Giuseppe, and C. Dis-tante, Learning to grasp by using visual information, IEEEInternational Symposium on Computational Intelligencein Robotics and Automation, Monterey, CA, 1999, 7–14.DOI: 10.1109/CIRA.1999.809933.
  24. [24] Z. Miljkovic, M. Mitic, M. Lazarevic, and B. Babic, Neural network reinforcement learning for visual control of robot manipulators, Expert Systems with Applications, 40 (Elsevier, 2013), 1721–1736.
  25. [25] V. Ganapathy, S.C. Yun, and H.K. Joe, Neural Q-learning controller for mobile robot, IEEE/ASME International Conference on Advanced Intelligent Mechatronics, 14–17 July 2009, 863–868.
  26. [26] J. Qiao, Z. Hou, and X. Ruan, Q-learning based on neural network in learning action selection of mobile robot, Proceedings of the IEEE, International Conference on Automation and Logistics, 18–21 August 2007, 263–267.
  27. [27] S.-C. Wang, Z.-X. Song, H. Ding, and H.-B. Shi, An Improved reinforcement Q-learning method with BP neural networks in robot soccer, Fourth International Symposium on Computational Intelligence and Design, Hangzhou, China, 2011, 177–180.
  28. [28] B. Huang, G. Cao, and M. Guo, Reinforcement learningneural network to the problem of autonomous mobile robotobstacle avoidance, Proc. 4th Int. Conf. Machine Learning and Cybernetics, Guangzhou, 2005.
  29. [29] E. Cervera and A.P. del. Pobil, Sensor-based learning for practical planning of fine motions in robotics, Information Sciences, 145, 2002, 147–168.
  30. [30] C. Harris and M. Stephen, A combined corner and edgedetector, In 4th Alvey Cision Conference, 1988, 147–151.

Important Links:

Go Back