Submitted:
30 March 2026
Posted:
31 March 2026
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Materials and Methods
2.1. Search Strategy
2.2. Search Execution and Screening Process
2.2.1. Inclusion Criteria
- Published 2025 or later
- Written in English
- Proposed or applied AI-based techniques explicitly for control of RMs
2.2.2. Exclusion Criteria
- Soft robots, hydraulic robot, continuum systems, or quadrotor
- Simulation platforms evaluations using robot learning methods
- General/high-level control discussions without AI approaches
- Book chapters, reviews, editorials, patents, letters, abstracts, or other scientific journal formats
2.3. Data Extraction
- Authors and year of publication
-
The functional role of implemented strategy within the robot manipulator control system, categorized as one of the following:
- o
- Perception and Estimation: outputs state estimates such as pose, velocity, maps, or object identity for downstream use
- o
- Planning: outputs a plan/trajectory/waypoints/goals or select actions at a symbolic/task level that are then tracked by a controller.
- o
- Learning Control: outputs low-level commands or a control policy that directly drives the robot’s motion (joint velocities/torques/actions each timestep).
- o
- Interaction and Safety: outputs safety constraints or regulates contact/force/compliance, including human-aware limits and safe-set filtering
- o
- Learning and Adaptation: updates models or policies online/continually across conditions, improving with new deployment data
- Robot type
- Application area
- Control techniques
- AI/ML methods
- Learning paradigm (supervised, unsupervised, reinforcement learning, hybrid)
- Evaluation method (simulation, experiments, both)
2.4. Research Question
3. Review of Publications
3.1. Publications
3.2. Research Interest
3.3. Robot Types
3.4. Applications
3.5. Control Techniques
3.4. AI-based Methods
3.5. Learning Paradigm
3.6. Trends in Evaluation Method
4. Discussion
4.1. Limitations in AI-Based Robot Manipulator Control
4.2. General Challenges in AI-Based Robot Manipulator Control
4.3. Future Directions
5. Conclusions
- Uneven functional development: The literature was concentrated predominantly in learning control, with substantially lower representation of planning, perception and estimation, learning and adaptation, and especially interaction and safety.
- Methodological concentration: Learning-based control overwhelmingly dominates the field, particularly RL and DRL. Although many distinct AI methods were identified overall, recurrent use remained centered on a relatively small core of RL oriented techniques.
- Narrow robot and application coverage: The reviewed studies focused primarily on serial manipulators, while many specialized robotic configurations remained sparsely represented. A large proportion of studies also lacked a clearly specified application setting. Among the explicitly reported domains, manufacturing was the most represented.
- Strong reliance on simulation-based evaluation: Most studies relied simulation only or combined simulation and experiment for validation, whereas fully experimental evaluation remained comparatively limited.
Supplementary Materials
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Hockstein, N.G.; Gourin, C.G.; Faust, R.A.; Terris, D.J. A History of Robots: From Science Fiction to Surgical Robotics. J. Robot. Surg. 2007, 1, 113–118. [Google Scholar] [CrossRef]
- Lin, P.; Bekey, G.; Abney, K. Autonomous Military Robotics: Risk, Ethics, and Design; Defense Technical Information Center: Fort Belvoir, VA, 2008. [Google Scholar]
- Nguyen, C.C.; Ngo, H.T.T.; Duong, T.T.C.; Nabili, A. Mechanical Design and Kinematic Analysis of an Autonomous Wrist with DC Motor Actuators for Space Assembly. Actuators 2025, 14, 542. [Google Scholar] [CrossRef]
- Halder, S.; Afsari, K. Robots in Inspection and Monitoring of Buildings and Infrastructure: A Systematic Review. Appl. Sci. 2023, 13, 2304. [Google Scholar] [CrossRef]
- Jin, T.; Han, X. Robotic Arms in Precision Agriculture: A Comprehensive Review of the Technologies, Applications, Challenges, and Future Prospects. Comput. Electron. Agric. 2024, 221, 108938. [Google Scholar] [CrossRef]
- Aggarwal, S.; Gupta, D.; Saini, S. A Literature Survey on Robotics in Healthcare. In Proceedings of the 2019 4th International Conference on Information Systems and Computer Networks (ISCON), November 2019; IEEE: Mathura, India; pp. 55–58. [Google Scholar]
- Ngo, H.T.T.; Nguyen, C.C.; Duong, T.T.C.; Nguyen, T.T. Trends in Control Strategies of Parallel Robot Manipulators for Robot-Assisted Rehabilitation. Eng 2026, 7, 44. [Google Scholar] [CrossRef]
- Slotine, J.-J.E.; Li, W. Applied Nonlinear Control; Prentice Hall: Englewood Cliffs, NJ, 1991; ISBN 978-0-13-040890-7. [Google Scholar]
- Kroemer, O.; Niekum, S.; Konidaris, G. A Review of Robot Learning for Manipulation: Challenges, Representations, and Algorithms. [CrossRef]
- Zomaya, A.Y.; Suddaby, M.E.; Morris, A.S. Direct Neuro-Adaptive Control of Robot Manipulators. In Proceedings of the Proceedings 1992 IEEE International Conference on Robotics and Automation; IEEE Comput. Soc. Press: Nice, France, 1992; pp. 1902–1907. [Google Scholar]
- Harib, M.; Chaoui, H.; Miah, S. Evolution of Adaptive Learning for Nonlinear Dynamic Systems: A Systematic Survey. Intell. Robot. 2022. [Google Scholar] [CrossRef]
- Yagi, S.; Morimoto, J. Learning-Based Joint Control With Hierarchical Reinforcement Learning and On-Device Execution. IEEE Robot. Autom. Lett. 2025, 10, 12493–12500. [Google Scholar] [CrossRef]
- Elsisi, M.; Mahmoud, K.; Lehtonen, M.; Darwish, M.M.F. An Improved Neural Network Algorithm to Efficiently Track Various Trajectories of Robot Manipulator Arms. IEEE Access 2021, 9, 11911–11920. [Google Scholar] [CrossRef]
- Lu, Y.; Wu, C.; Yao, W.; Sun, G.; Liu, J.; Wu, L. Deep Reinforcement Learning Control of Fully-Constrained Cable-Driven Parallel Robots. IEEE Trans. Ind. Electron. 2023, 70, 7194–7204. [Google Scholar] [CrossRef]
- Reuther, A.; Michaleas, P.; Jones, M.; Gadepally, V.; Samsi, S.; Kepner, J. AI and ML Accelerator Survey and Trends. In Proceedings of the 2022 IEEE High Performance Extreme Computing Conference (HPEC), Waltham, MA, USA, September 19 2022; IEEE; pp. 1–10. [Google Scholar]
- Zhu, S.; Yu, T.; Xu, T.; Chen, H.; Dustdar, S.; Gigan, S.; Gunduz, D.; Hossain, E.; Jin, Y.; Lin, F.; et al. Intelligent Computing: The Latest Advances, Challenges, and Future. Intell. Comput. 2023, 2, 0006. [Google Scholar] [CrossRef]
- Waseem, S.; Adnan, M.; Iqbal, M.S.; Amin, A.A.; Shah, A.; Tariq, M. From Classical to Intelligent Control: Evolving Trends in Robotic Manipulator Technology. Comput. Electr. Eng. 2025, 127, 110559. [Google Scholar] [CrossRef]
- Swetha Danthala Et Al., S.D.E.Al.; TJPRC Robotic Manipulator Control by Using Machine Learning Algorithms, A Review. Int. J. Mech. Prod. Eng. Res. Dev. 2018, 8, 305–310. [CrossRef]
- Benotsmane, R.; Dudás, L.; Kovács, G. Survey on Artificial Intelligence Algorithms Used in Industrial Robotics. Multidiszcip. Tudományok 2020, 10, 194–205. [Google Scholar] [CrossRef]
- Cho, J.; Jung, S. Reinforcement Learning-Based Motion Planning for Robotic Manipulators in Smart Industry. In Proceedings of the 2024 15th International Conference on Information and Communication Technology Convergence (ICTC), October 16 2024; IEEE: Jeju Island, Korea, Republic of; pp. 1166–1168. [Google Scholar]
- Fernández Mareco, E.R.; Pinto-Roa, D. Application of Artificial Intelligence in Control Systems: Trends, Challenges, and Opportunities. AI 2025, 6, 326. [Google Scholar] [CrossRef]
- Nahavandi, S.; Alizadehsani, R.; Nahavandi, D.; Lim, C.P.; Kelly, K.; Bello, F. Machine Learning Meets Advanced Robotic Manipulation. Inf. Fusion 2024, 105, 102221. [Google Scholar] [CrossRef]
- Page, M.J.; Moher, D.; Bossuyt, P.M.; Boutron, I.; Hoffmann, T.C.; Mulrow, C.D.; Shamseer, L.; Tetzlaff, J.M.; Akl, E.A.; Brennan, S.E.; et al. PRISMA 2020 Explanation and Elaboration: Updated Guidance and Exemplars for Reporting Systematic Reviews. BMJ 2021, n160. [Google Scholar] [CrossRef] [PubMed]
- Bertino, A.; Bagheri, M.; Krstić, M.; Naseradinmousavi, P. Experimental Autonomous Deep Learning-Based 3D Path Planning for a 7-DOF Robot Manipulator. In Proceedings of the Volume 2: Modeling and Control of Engine and Aftertreatment Systems; Modeling and Control of IC Engines and Aftertreatment Systems; Modeling and Validation; Motion Planning and Tracking Control; Multi-Agent and Networked Systems; Renewable and Smart Energy Systems; Thermal Energy Systems; Uncertain Systems and Robustness; Unmanned Ground and Aerial Vehicles; Vehicle Dynamics and Stability; Vibrations: Modeling, Analysis, and Control; American Society of Mechanical Engineers: Park City, Utah, USA, 8 October 2019; Volume 2, p. V002T14A002. [Google Scholar]
- Catalán, J.M.; Trigili, E.; Nann, M.; Blanco-Ivorra, A.; Lauretti, C.; Cordella, F.; Ivorra, E.; Armstrong, E.; Crea, S.; Alcañiz, M.; et al. Hybrid Brain/Neural Interface and Autonomous Vision-Guided Whole-Arm Exoskeleton Control to Perform Activities of Daily Living (ADLs). J. NeuroEngineering Rehabil. 2023, 20, 61. [Google Scholar] [CrossRef]
- Chen, C.-S.; Hu, N.-T. Eye-in-Hand Robotic Arm Gripping System Based on Machine Learning and State Delay Optimization. Sensors 2023, 23, 1076. [Google Scholar] [CrossRef]
- Chen, X.; Guhl, J. Industrial Robot Control with Object Recognition Based on Deep Learning. Procedia CIRP 2018, 76, 149–154. [Google Scholar] [CrossRef]
- Ghiasvand, S.; Xie, W.-F.; Mohebbi, A. Deep Neural Network-Based Robotic Visual Servoing for Satellite Target Tracking. Front. Robot. AI 2024, 11, 1469315. [Google Scholar] [CrossRef] [PubMed]
- Gul, A.N.A.; Sahin, M.; Kandis, T.; Kavrak, K.; Oner, Z. Robotic Arm Control Using Machine Learning-Based EOG Signal Classifier. In Proceedings of the 2023 Medical Technologies Congress (TIPTEKNO), November 10 2023; IEEE: Famagusta, Cyprus; pp. 1–4. [Google Scholar]
- Kazemzadeh Heris, P.; Khamesee, M. Design and Fabrication of a Magnetic Actuator for Torque and Force Control Estimated by the ANN/SA Algorithm. Micromachines 2022, 13, 327. [Google Scholar] [CrossRef]
- Kirda, A.W.; Majewski, P.; Bursy, G.; Bartoszuk, M.; Yassin, H.; Królczyk, G.; Akbar, N.A.; Caesarendra, W. Integrating YOLOv5, Jetson Nano Microprocessor, and Mitsubishi Robot Manipulator for Real-Time Machine Vision Application in Manufacturing: A Lab Experimental Study. Adv. Sci. Technol. Res. J. 2025, 19, 248–270. [Google Scholar] [CrossRef]
- Kondratenko, Y.; Atamanyuk, I.; Sidenko, I.; Kondratenko, G.; Sichevskyi, S. Machine Learning Techniques for Increasing Efficiency of the Robot’s Sensor and Control Information Processing. Sensors 2022, 22, 1062. [Google Scholar] [CrossRef]
- Kruzic, S.; Music, J.; Kamnik, R.; Papic, V. Estimating Robot Manipulator End-Effector Forces Using Deep Learning. In Proceedings of the 2020 43rd International Convention on Information, Communication and Electronic Technology (MIPRO), September 28 2020; IEEE: Opatija, Croatia; pp. 1163–1168. [Google Scholar]
- Liu, J.; Balatti, P.; Ellis, K.; Hadjivelichkov, D.; Stoyanov, D.; Ajoudani, A.; Kanoulas, D. Garbage Collection and Sorting with a Mobile Manipulator Using Deep Learning and Whole-Body Control. In Proceedings of the 2020 IEEE-RAS 20th International Conference on Humanoid Robots (Humanoids), July 19 2021; IEEE: Munich, Germany; pp. 408–414. [Google Scholar]
- Liu, Y.; Xu, H.; Liu, D.; Wang, L. A Digital Twin-Based Sim-to-Real Transfer for Deep Reinforcement Learning-Enabled Industrial Robot Grasping. Robot. Comput.-Integr. Manuf. 2022, 78, 102365. [Google Scholar] [CrossRef]
- Luo, Y.; Li, S.; Li, D. Intelligent Perception System of Robot Visual Servo for Complex Industrial Environment. Sensors 2020, 20, 7121. [Google Scholar] [CrossRef]
- Marchionna, L.; Pugliese, G.; Martini, M.; Angarano, S.; Salvetti, F.; Chiaberge, M. Deep Instance Segmentation and Visual Servoing to Play Jenga with a Cost-Effective Robotic System. Sensors 2023, 23, 752. [Google Scholar] [CrossRef]
- Martin, J.B.; Moutarde, F. Real-Time Gestural Control of Robot Manipulator Through Deep Learning Human-Pose Inference. In Computer Vision Systems; Lecture Notes in Computer Science; Tzovaras, D., Giakoumis, D., Vincze, M., Argyros, A., Eds.; Springer International Publishing: Cham, 2019; Vol. 11754, pp. 565–572. ISBN 978-3-030-34994-3. [Google Scholar]
- Panasiuk, J. Controlling an Industrial Robot Using Stereo 3D Vision Systems with AI Elements. Sensors 2025, 25, 6402. [Google Scholar] [CrossRef]
- Piltan, F.; Prosvirin, A.E.; Sohaib, M.; Saldivar, B.; Kim, J.-M. An SVM-Based Neural Adaptive Variable Structure Observer for Fault Diagnosis and Fault-Tolerant Control of a Robot Manipulator. Appl. Sci. 2020, 10, 1344. [Google Scholar] [CrossRef]
- Piltan, F.; Prosvirin, A.E.; Kim, J.-M. Robot Manipulator Active Fault-Tolerant Control Using a Machine Learning-Based Automated Robust Hybrid Observer. J. Intell. Fuzzy Syst. 2020, 39, 6443–6463. [Google Scholar] [CrossRef]
- Sacchi, N.; Incremona, G.P.; Ferrara, A. Sliding Mode Based Fault Diagnosis with Deep Reinforcement Learning Add-ons for Intrinsically Redundant Manipulators. Int. J. Robust Nonlinear Control 2023, 33, 9109–9127. [Google Scholar] [CrossRef]
- Shukla, P.; Kumar, H.; Nandi, G.C. Robotic Grasp Manipulation Using Evolutionary Computing and Deep Reinforcement Learning. Intell. Serv. Robot. 2021, 14, 61–77. [Google Scholar] [CrossRef]
- Wang, Z.; Huang, W.; Qi, Z.; Yin, S. MS-CLSTM: Myoelectric Manipulator Gesture Recognition Based on Multi-Scale Feature Fusion CNN-LSTM Network. Biomimetics 2024, 9, 784. [Google Scholar] [CrossRef] [PubMed]
- Ahn, W.-J.; Choi, K.; Kang, S.-W.; Rho, C.-K.; Pae, D.-S.; Lim, M.-T. Practical Mixed Palletizing Manipulator System: Incorporating Practical Reinforcement Learning and Configuration-Space Motion Planning. IEEE Trans. Autom. Sci. Eng. 2026, 23, 455–469. [Google Scholar] [CrossRef]
- Ak, A.; Topuz, V.; Midi, I. Motor Imagery EEG Signal Classification Using Image Processing Technique over GoogLeNet Deep Learning Algorithm for Controlling the Robot Manipulator. Biomed. Signal Process. Control 2022, 72, 103295. [Google Scholar] [CrossRef]
- Andersen, T.T.; Amor, H.B.; Andersen, N.A.; Ravn, O. Measuring and Modelling Delays in Robot Manipulators for Temporally Precise Control Using Machine Learning. In Proceedings of the 2015 IEEE 14th International Conference on Machine Learning and Applications (ICMLA), Miami, FL, USA, December 2015; IEEE; pp. 168–175. [Google Scholar]
- Andriyanov, N. Development of Apple Detection System and Reinforcement Learning for Apple Manipulator. Electronics 2023, 12, 727. [Google Scholar] [CrossRef]
- Azizi, A. Applications of Artificial Intelligence Techniques to Enhance Sustainability of Industry 4.0: Design of an Artificial Neural Network Model as Dynamic Behavior Optimizer of Robotic Arms. Complexity 2020, 2020, 1–10. [Google Scholar] [CrossRef]
- Batzianoulis, I.; Iwane, F.; Wei, S.; Correia, C.G.P.R.; Chavarriaga, R.; Millán, J.D.R.; Billard, A. Customizing Skills for Assistive Robotic Manipulators, an Inverse Reinforcement Learning Approach with Error-Related Potentials. Commun. Biol. 2021, 4, 1406. [Google Scholar] [CrossRef]
- Bucinskas, V.; Dzedzickis, A.; Sumanas, M.; Sutinys, E.; Petkevicius, S.; Butkiene, J.; Virzonis, D.; Morkvenaite-Vilkonciene, I. Improving Industrial Robot Positioning Accuracy to the Microscale Using Machine Learning Method. Machines 2022, 10, 940. [Google Scholar] [CrossRef]
- Cheng, S.; Jin, Y.; Wang, H. Deep Learning-Based Control Framework for Dynamic Contact Processes in Humanoid Grasping. Front. Neurorobotics 2024, 18, 1349752. [Google Scholar] [CrossRef]
- Chen, Y.-H.; Yang, W.-T.; Chen, B.-H.; Lin, P.-C. Manipulator Trajectory Optimization Using Reinforcement Learning on a Reduced-Order Dynamic Model with Deep Neural Network Compensation. Machines 2023, 11, 350. [Google Scholar] [CrossRef]
- Chi, W.; Liu, J.; Abdelaziz, M.E.M.K.; Dagnino, G.; Riga, C.; Bicknell, C.; Yang, G.-Z. Trajectory Optimization of Robot-Assisted Endovascular Catheterization with Reinforcement Learning. In Proceedings of the 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), October 2018; IEEE: Madrid; pp. 3875–3881. [Google Scholar]
- Ebert, F.; Finn, C.; Dasari, S.; Xie, A.; Lee, A.; Levine, S. Visual Foresight: Model-Based Deep Reinforcement Learning for Vision-Based Robotic Control 2018.
- Emamzadeh, M.M.; Sadati, N. A Fuzzy Based Model Coordination for Two-Level Optimal Control of Robot Manipulators. In Proceedings of the 2015 IEEE 27th International Conference on Tools with Artificial Intelligence (ICTAI), November 2015; IEEE: Vietri sul Mare, Italy; pp. 1122–1128. [Google Scholar]
- Jaquier, N.; Rozo, L.; Caldwell, D.G.; Calinon, S. Geometry-Aware Manipulability Learning, Tracking, and Transfer. Int. J. Robot. Res. 2021, 40, 624–650. [Google Scholar] [CrossRef] [PubMed]
- Karimi, M.; Ahmadi, M. A Reinforcement Learning Approach in Assignment of Task Priorities in Kinematic Control of Redundant Robots. IEEE Robot. Autom. Lett. 2022, 7, 850–857. [Google Scholar] [CrossRef]
- Kim, M.; Han, D.-K.; Park, J.-H.; Kim, J.-S. Motion Planning of Robot Manipulators for a Smoother Path Using a Twin Delayed Deep Deterministic Policy Gradient with Hindsight Experience Replay. Appl. Sci. 2020, 10, 575. [Google Scholar] [CrossRef]
- Kwon, Y.-T.; Kim, H.; Mahmood, M.; Kim, Y.-S.; Demolder, C.; Yeo, W.-H. Printed, Wireless, Soft Bioelectronics and Deep Learning Algorithm for Smart Human–Machine Interfaces. ACS Appl. Mater. Interfaces 2020, 12, 49398–49406. [Google Scholar] [CrossRef] [PubMed]
- Lee, J.Y.; Lee, S.; Mishra, A.; Yan, X.; McMahan, B.; Gaisford, B.; Kobashigawa, C.; Qu, M.; Xie, C.; Kao, J.C. Brain–Computer Interface Control with Artificial Intelligence Copilots. Nat. Mach. Intell. 2025, 7, 1510–1523. [Google Scholar] [CrossRef] [PubMed]
- Liu, H.; Li, X.; Dong, M.; Gu, Y.; Shen, B. Robotic Motion Planning Based on Deep Reinforcement Learning and Artificial Neural Networks. IEEE Trans. Autom. Sci. Eng. 2025, 22, 8465–8479. [Google Scholar] [CrossRef]
- Liu, W.; Niu, H.; Pan, W.; Herrmann, G.; Carrasco, J. Sim-and-Real Reinforcement Learning for Manipulation: A Consensus-Based Approach. In Proceedings of the 2023 IEEE International Conference on Robotics and Automation (ICRA), May 29 2023; pp. 3911–3917. [Google Scholar]
- Liu, W.; Niu, H.; Skilton, R.; Carrasco, J. Sim-to-Real Deep Reinforcement Learning with Manipulators for Pick-and-Place; 2023; Vol. 14136, pp. 240–252. [Google Scholar]
- Li, X.; Liu, H.; Dong, M. A General Framework of Motion Planning for Redundant Robot Manipulator Based on Deep Reinforcement Learning. IEEE Trans. Ind. Inform. 2022, 18, 5253–5263. [Google Scholar] [CrossRef]
- Li, Q.; Nie, J.; Wang, H.; Lu, X.; Song, S. Manipulator Motion Planning Based on Actor-Critic Reinforcement Learning. In Proceedings of the 2021 40th Chinese Control Conference (CCC), July 26 2021; IEEE: Shanghai, China; pp. 4248–4254. [Google Scholar]
- Li, H.; Gong, D.; Yu, J. An Obstacles Avoidance Method for Serial Manipulator Based on Reinforcement Learning and Artificial Potential Field. Int. J. Intell. Robot. Appl. 2021, 5, 186–202. [Google Scholar] [CrossRef]
- Parag, A.; Mansard, N.; Misimi, E. Optimizing Complex Control Systems with Differentiable Simulators: A Hybrid Approach to Reinforcement Learning and Trajectory Planning. In Proceedings of the 2025 IEEE International Conference on Robotics and Automation (ICRA), Atlanta, GA, USA, May 19 2025; IEEE; pp. 9214–9220. [Google Scholar]
- Parák, R.; Kůdela, J.; Matoušek, R.; Juříček, M. Deep-Reinforcement-Learning-Based Motion Planning for a Wide Range of Robotic Structures. Computation 2024, 12, 116. [Google Scholar] [CrossRef]
- Prianto, E.; Park, J.-H.; Bae, J.-H.; Kim, J.-S. Deep Reinforcement Learning-Based Path Planning for Multi-Arm Manipulators with Periodically Moving Obstacles. Appl. Sci. 2021, 11, 2587. [Google Scholar] [CrossRef]
- Sacchi, N.; Sangiovanni, B.; Incremona, G.P.; Ferrara, A. Scenario-Based Collision Avoidance Control with Deep Q-Networks for Industrial Robot Manipulators. In Proceedings of the 2021 60th IEEE Conference on Decision and Control (CDC), December 14 2021; IEEE: Austin, TX, USA; pp. 4388–4393. [Google Scholar]
- Solowjow, E.; Ugalde, I.; Shahapurkar, Y.; Aparicio, J.; Mahler, J.; Satish, V.; Goldberg, K.; Claussen, H. Industrial Robot Grasping with Deep Learning Using a Programmable Logic Controller (PLC). In Proceedings of the 2020 IEEE 16th International Conference on Automation Science and Engineering (CASE), August 2020; IEEE: Hong Kong, Hong Kong; pp. 97–103. [Google Scholar]
- Sunwoo, Y.; Lee, W.C. Optimal Path Search for Robot Manipulator Using Deep Reinforcement Learning. In Smart Process. Comput.; IEIE, Translator; 2021; Volume 10, pp. 424–430. [Google Scholar] [CrossRef]
- Wang, Y.; Beltran-Hernandez, C.C.; Wan, W.; Harada, K. An Adaptive Imitation Learning Framework for Robotic Complex Contact-Rich Insertion Tasks. Front. Robot. AI 2022, 8, 777363. [Google Scholar] [CrossRef]
- Wang, X.; Cao, J.; Cao, Y.; Zou, F. Energy-Efficient Trajectory Planning for a Class of Industrial Robots Using Parallel Deep Reinforcement Learning. Nonlinear Dyn. 2025, 113, 8491–8511. [Google Scholar] [CrossRef]
- Wang, H.; Zhu, H.; Cao, F. Trajectory Planning Algorithm of Manipulator in Small Space Based on Reinforcement Learning. In Proceedings of the 2023 China Automation Congress (CAC), November 17 2023; IEEE: Chongqing, China; pp. 5780–5785. [Google Scholar]
- Sara Wilson, K.; Saravanan, K.K. Artificial Intelligent Based Control Strategy for Reach and Grasp of Multi-Objects Using Brain-Controlled Robotic Arm System. Netw. Comput. Neural Syst. 2025, 36, 1253–1281. [Google Scholar] [CrossRef]
- Wong, C.-C.; Feng, H.-M.; Lai, Y.-C.; Yu, C.-J. Ant Colony Optimization and Image Model-Based Robot Manipulator System for Pick-and-Place Tasks. J. Intell. Fuzzy Syst. 2019, 36, 1083–1098. [Google Scholar] [CrossRef]
- Yang, C.; Chen, C.; He, W.; Cui, R.; Li, Z. Robot Learning System Based on Adaptive Neural Control and Dynamic Movement Primitives. IEEE Trans. Neural Netw. Learn. Syst. 2019, 30, 777–787. [Google Scholar] [CrossRef]
- Zhang, B. Research on Trajectory Optimization and Adaptive Control of Manipulator Based on Deep Reinforcement Learning. 2025, 14. [Google Scholar] [CrossRef]
- Zhao, B.; Wu, Y.; Wu, C.; Sun, R. Deep Reinforcement Learning Trajectory Planning for Robotic Manipulator Based on Simulation-Efficient Training. Sci. Rep. 2025, 15, 8286. [Google Scholar] [CrossRef] [PubMed]
- Zhong, J.; Wang, T.; Cheng, L. Collision-Free Path Planning for Welding Manipulator via Hybrid Algorithm of Deep Reinforcement Learning and Inverse Kinematics. Complex Intell. Syst. 2022, 8, 1899–1912. [Google Scholar] [CrossRef]
- Ahmed, S.; Azar, A.T.; Tounsi, M. Design of Adaptive Fractional-Order Fixed-Time Sliding Mode Control for Robotic Manipulators. Entropy 2022, 24, 1838. [Google Scholar] [CrossRef] [PubMed]
- Robotic Arm Pick-and-Place Tasks.
- Al-Shanoon, A.; Lang, H.; Wang, Y.; Zhang, Y.; Hong, W. Learn to Grasp Unknown Objects in Robotic Manipulation. Intell. Serv. Robot. 2021, 14, 571–582. [Google Scholar] [CrossRef]
- Alhousani, N.; Saveriano, M.; Sevinc, I.; Abdulkuddus, T.; Kose, H.; Abu-Dakka, F.J. Geometric Reinforcement Learning for Robotic Manipulation. IEEE Access 2023, 11, 111492–111505. [Google Scholar] [CrossRef]
- Aljalbout, E.; Frank, F.; Karl, M.; Van Der Smagt, P. On the Role of the Action Space in Robot Manipulation Learning and Sim-to-Real Transfer. IEEE Robot. Autom. Lett. 2024, 9, 5895–5902. [Google Scholar] [CrossRef]
- Alles, M.; Aljalbout, E. Learning to Centralize Dual-Arm Assembly. Front. Robot. AI 2022, 9, 830007. [Google Scholar] [CrossRef]
- Amaya, L.; Von Arnim, A. Neurorobotic Reinforcement Learning for Domains with Parametrical Uncertainty. Front. Neurorobotics 2023, 17, 1239581. [Google Scholar] [CrossRef] [PubMed]
- An, T.; Zhu, X.; Ma, B.; Jiang, H.; Dong, B. Hierarchical Approximate Optimal Interaction Control of Human-Centered Modular Robot Manipulator Systems: A Stackelberg Differential Game-Based Approach. Neurocomputing 2024, 585, 127573. [Google Scholar] [CrossRef]
- Avhad, P.; Kumar, G.P.; Sunil, A.T.; Rallapalli, M.; Kumar, B.H.; Kumar, V. Adaptive Deep Reinforcement Learning for Robotic Manipulation in Dynamic Environments. In Proceedings of the 2024 International Conference on Data Science and Network Security (ICDSNS), July 26 2024; IEEE: Tiptur, India; pp. 1–5. [Google Scholar]
- Azimirad, V.; Khodkam, S.Y.; Bolouri, A. A New Hybrid Learning Control System for Robots Based on Spiking Neural Networks. Neural Netw. 2024, 180, 106656. [Google Scholar] [CrossRef]
- Valarezo Añazco, E.; Rivera Lopez, P.; Park, N.; Oh, J.; Ryu, G.; Al-antari, M.A.; Kim, T.-S. Natural Object Manipulation Using Anthropomorphic Robotic Hand through Deep Reinforcement Learning and Deep Grasping Probability Network. Appl. Intell. 2021, 51, 1041–1055. [Google Scholar] [CrossRef]
- Baek, S.; Baek, J.; Choi, J.; Han, S. A Reinforcement Learning-Based Adaptive Time-Delay Control and Its Application to Robot Manipulators. In Proceedings of the 2022 American Control Conference (ACC), Atlanta, GA, USA, June 8 2022; IEEE; pp. 2722–2729. [Google Scholar]
- Barnoy, Y.; Erin, O.; Raval, S.; Pryor, W.; Mair, L.O.; Weinberg, I.N.; Diaz-Mercado, Y.; Krieger, A.; Hager, G.D. Control of Magnetic Surgical Robots With Model-Based Simulators and Reinforcement Learning. IEEE Trans. Med. Robot. Bionics 2022, 4, 945–956. [Google Scholar] [CrossRef]
- Bashabsheh, M. Reinforcement Learning for Multi-Task Manipulation in Robotic Arm Systems Operating in Dynamic Environments.
- Bejar, E.; Moran, A. Predictive Control of a Robot Manipulator with Deep Reinforcement Learning. In Proceedings of the 2021 7th International Conference on Control, Automation and Robotics (ICCAR), April 23 2021; IEEE: Singapore; pp. 127–130. [Google Scholar]
- Blaise, J.; Bazzocchi, M.C.F. Space Manipulator Collision Avoidance Using a Deep Reinforcement Learning Control. Aerospace 2023, 10, 778. [Google Scholar] [CrossRef]
- Brito, T.; Queiroz, J.; Piardi, L.; Fernandes, L.A.; Lima, J.; Leitão, P. A Machine Learning Approach for Collaborative Robot Smart Manufacturing Inspection for Quality Control Systems. Procedia Manuf. 2020, 51, 11–18. [Google Scholar] [CrossRef]
- Calderón-Cordova, C.; Sarango, R.; Castillo, D.; Lakshminarayanan, V. A Deep Reinforcement Learning Framework for Control of Robotic Manipulators in Simulated Environments. IEEE Access 2024, 12, 103133–103161. [Google Scholar] [CrossRef]
- Cao, S.; Sun, L.; Jiang, J.; Zuo, Z. Reinforcement Learning-Based Fixed-Time Trajectory Tracking Control for Uncertain Robotic Manipulators With Input Saturation. IEEE Trans. Neural Netw. Learn. Syst. 2023, 34, 4584–4595. [Google Scholar] [CrossRef]
- Carron, A.; Arcari, E.; Wermelinger, M.; Hewing, L.; Hutter, M.; Zeilinger, M.N. Data-Driven Model Predictive Control for Trajectory Tracking With a Robotic Arm. IEEE Robot. Autom. Lett. 2019, 4, 3758–3765. [Google Scholar] [CrossRef]
- Castelli, F.; Michieletto, S.; Ghidoni, S.; Pagello, E. A Machine Learning-Based Visual Servoing Approach for Fast Robot Control in Industrial Setting. Int. J. Adv. Robot. Syst. 2017, 14, 1729881417738884. [Google Scholar] [CrossRef]
- Chen, D.; Zhang, Y.; Li, S. Zeroing Neural-Dynamics Approach and Its Robust and Rapid Solution for Parallel Robot Manipulators against Superposition of Multiple Disturbances. Neurocomputing 2018, 275, 845–858. [Google Scholar] [CrossRef]
- Chen, H. Robotic Manipulation with Reinforcement Learning, State Representation Learning, and Imitation Learning (Student Abstract). Proc. AAAI Conf. Artif. Intell. 2021, 35, 15769–15770. [Google Scholar] [CrossRef]
- Chen, L.; Jiang, Z.; Cheng, L.; Knoll, A.C.; Zhou, M. Deep Reinforcement Learning Based Trajectory Planning Under Uncertain Constraints. Front. Neurorobotics 2022, 16, 883562. [Google Scholar] [CrossRef] [PubMed]
- Chen, Y.; Ma, G.; Lin, S.; Ning, S.; Gao, J. Computed-Torque plus Robust Adaptive Compensation Control for Robot Manipulator with Structured and Unstructured Uncertainties. IMA J. Math. Control Inf. 2016, 33, 37–52. [Google Scholar] [CrossRef]
- Chen, P.; Pei, J.; Lu, W.; Li, M. A Deep Reinforcement Learning Based Method for Real-Time Path Planning and Dynamic Obstacle Avoidance. Neurocomputing 2022, 497, 64–75. [Google Scholar] [CrossRef]
- Chen, P.; Lu, W. Deep Reinforcement Learning Based Moving Object Grasping. Inf. Sci. 2021, 565, 62–76. [Google Scholar] [CrossRef]
- Chen, S.; Wen, J.T. Industrial Robot Trajectory Tracking Control Using Multi-Layer Neural Networks Trained by Iterative Learning Control. Robotics 2021, 10, 50. [Google Scholar] [CrossRef]
- Chen, X. Robotic Arm Control Method Based on Reinforcement Learning Optimization. In Proceedings of the 2025 IEEE 5th International Conference on Electronic Technology, Communication and Information (ICETCI), Changchun, China, May 23 2025; IEEE; pp. 221–225. [Google Scholar]
- Chen, Y.-L.; Cai, Y.-R.; Cheng, M.-Y. Vision-Based Robotic Object Grasping—A Deep Reinforcement Learning Approach. Machines 2023, 11, 275. [Google Scholar] [CrossRef]
- Chen, Y.; Wu, T.; Wang, S.; Feng, X.; Jiang, J.; McAleer, S.M.; Dong, H.; Lu, Z.; Zhu, S.-C.; Yang, Y. Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning.
- Chen, Y.; Geng, Y.; Zhong, F.; Ji, J.; Jiang, J.; Lu, Z.; Dong, H.; Yang, Y. Bi-DexHands: Towards Human-Level Bimanual Dexterous Manipulation. IEEE Trans. Pattern Anal. Mach. Intell. 2024, 46, 2804–2818. [Google Scholar] [CrossRef]
- Chen, Y.; Zeng, C.; Wang, Z.; Lu, P.; Yang, C. Zero-Shot Sim-to-Real Transfer of Reinforcement Learning Framework for Robotics Manipulation with Demonstration and Force Feedback. Robotica 2023, 41, 1015–1024. [Google Scholar] [CrossRef]
- Chen, Y.; Su, S.; Ni, K.; Li, C. Integrated Intelligent Control of Redundant Degrees-of-Freedom Manipulators via the Fusion of Deep Reinforcement Learning and Forward Kinematics Models. Machines 2024, 12, 667. [Google Scholar] [CrossRef]
- Christen, S.; Stevsic, S.; Hilliges, O. Demonstration-Guided Deep Reinforcement Learning of Control Policies for Dexterous Human-Robot Interaction. In Proceedings of the 2019 International Conference on Robotics and Automation (ICRA), May 2019; IEEE: Montreal, QC, Canada; pp. 2161–2167. [Google Scholar]
- Chu, C.; Takahashi, K.; Hashimoto, M. Comparison of Deep Reinforcement Learning Algorithms in a Robot Manipulator Control Application. In Proceedings of the 2020 International Symposium on Computer, Consumer and Control (IS3C), November 2020; IEEE: Taichung City, Taiwan; pp. 284–287. [Google Scholar]
- Cotrim, L.P.; José, M.M.; Cabral, E.L.L. Reinforcement Learning Control of Robot Manipulator. Rev. Bras. Comput. Apl. 2021, 13, 42–53. [Google Scholar] [CrossRef]
- Cui, Y.; Xu, Z.; Zhong, L.; Xu, P.; Shen, Y.; Tang, Q. A Task-Adaptive Deep Reinforcement Learning Framework for Dual-Arm Robot Manipulation. IEEE Trans. Autom. Sci. Eng. 2025, 22, 466–479. [Google Scholar] [CrossRef]
- Cutler, E.; Xing, Y.; Cui, T.; Zhou, B.; van Rijnsoever, K.; Hart, B.; Valencia, D.; Ong, L.V.C.; Gee, T.; Liarokapis, M.; et al. Benchmarking Reinforcement Learning Methods for Dexterous Robotic Manipulation with a Three-Fingered Gripper 2024.
- Ding, Z.; Tsai, Y.-Y.; Lee, W.W.; Huang, B. Sim-to-Real Transfer for Robotic Manipulation with Tactile Sensory. In Proceedings of the 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), September 27 2021; IEEE: Prague, Czech Republic; pp. 6778–6785. [Google Scholar]
- Dong, H. Intelligent Adaptive Control Algorithm Based on Reinforcement Learning in the Field of Robotics. In Proceedings of Fifth Doctoral Symposium on Computational Intelligence; Lecture Notes in Networks and Systems; Swaroop, A., Kansal, V., Fortino, G., Hassanien, A.E., Eds.; Springer Nature Singapore: Singapore, 2024; Vol. 1085, pp. 143–152. ISBN 978-981-97-6725-0. [Google Scholar]
- Dong, R.; Du, J.; Liu, Y.; Heidari, A.A.; Chen, H. An Enhanced Deep Deterministic Policy Gradient Algorithm for Intelligent Control of Robotic Arms. Front. Neuroinformatics 2023, 17, 1096053. [Google Scholar] [CrossRef]
- Salt Ducaju, J.M.; Olofsson, B.; Johansson, R. Iterative Reference Learning for Cartesian Impedance Control of Robot Manipulators. In Proceedings of the 2024 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), October 14 2024; IEEE: Abu Dhabi, United Arab Emirates; pp. 11171–11178. [Google Scholar]
- Du, Z.; Wang, W.; Yan, Z.; Dong, W.; Wang, W. Variable Admittance Control Based on Fuzzy Reinforcement Learning for Minimally Invasive Surgery Manipulator. Sensors 2017, 17, 844. [Google Scholar] [CrossRef] [PubMed]
- Ehrlich, M.; Zaidel, Y.; Weiss, P.L.; Melamed Yekel, A.; Gefen, N.; Supic, L.; Ezra Tsur, E. Adaptive Control of a Wheelchair Mounted Robotic Arm with Neuromorphically Integrated Velocity Readings and Online-Learning. Front. Neurosci. 2022, 16, 1007736. [Google Scholar] [CrossRef] [PubMed]
- Enayati, A.M.S.; Dershan, R.; Zhang, Z.; Richert, D.; Najjaran, H. Facilitating Sim-to-Real by Intrinsic Stochasticity of Real-Time Simulation in Reinforcement Learning for Robot Manipulation. IEEE Trans. Artif. Intell. 2024, 5, 1791–1804. [Google Scholar] [CrossRef]
- Fareh, R.; Siddique, T.; Choutri, K.; Dylov, D.V. Physics-Informed Reward Shaped Reinforcement Learning Control of a Robot Manipulator. Ain Shams Eng. J. 2025, 16, 103595. [Google Scholar] [CrossRef]
- Filho, E.; Campos, M.F.M.; Neto, A.A. Kinematic Control of Manipulators Using Multi Deep Q-Learning. In Proceedings of the 2025 Brazilian Conference on Robotics (CROS), Belo Horizonte, Brazil, April 28 2025; IEEE; pp. 1–6. [Google Scholar]
- Franceschetti, A.; Tosello, E.; Castaman, N.; Ghidoni, S. Robotic Arm Control and Task Training through Deep Reinforcement Learning 2020.
- Fu, Z.; Cheng, X.; Pathak, D. Deep Whole-Body Control: Learning a Unified Policy for Manipulation and Locomotion 2022.
- Ganie, I.; Sarangapani, J. Lifelong Deep Learning-based Control of Robot Manipulators. Int. J. Adapt. Control Signal Process. 2023, 37, 3169–3192. [Google Scholar] [CrossRef]
- Gao, X. Design of Intelligent Control System of Manipulator Based on Deep Learning. In Proceedings of the 2022 2nd International Conference on Computers and Automation (CompAuto), Paris, France, August 2022; IEEE; pp. 99–102. [Google Scholar]
- Garcia-Hernando, G.; Johns, E.; Kim, T.-K. Physics-Based Dexterous Manipulations with Estimated Hand Poses and Residual Reinforcement Learning. In Proceedings of the 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), October 24 2020; IEEE: Las Vegas, NV, USA; pp. 9561–9568. [Google Scholar]
- Gawali, M.B.; Gawali, S.S.; Patil, M.; Khandare, A. A Novel Human-to-Robot Interaction Model Based on Transfer Expert Reinforcement Learning with Recurrent Neural Network. J. Auton. Intell. 2023, 7. [Google Scholar] [CrossRef]
- Ghediri, A.; Lamamra, K. Adaptive PID Computed-Torque Control of Robot Manipulators Based on DDPG Reinforcement Learning. [CrossRef]
- Grandesso, G. Reinforcement Learning and Trajectory Optimization for the Concurrent Design of High-Performance Robotic Systems.
- Gupta, R.K.; Chauhan, S. Comparision of PID Controller & Adaptive Neuro Fuzzy Controller for Robot Manipulator. In Proceedings of the 2015 IEEE International Conference on Computational Intelligence and Computing Research (ICCIC), December 2015; IEEE: Madurai, India; pp. 1–4. [Google Scholar]
- Gu, S.; Holly, E.; Lillicrap, T.; Levine, S. Deep Reinforcement Learning for Robotic Manipulation with Asynchronous Off-Policy Updates. In Proceedings of the 2017 IEEE International Conference on Robotics and Automation (ICRA), May 2017; IEEE: Singapore, Singapore; pp. 3389–3396. [Google Scholar]
- Haiderbhai, M.; Gondokaryono, R.; Wu, A.; Kahrs, L.A. Sim2Real Rope Cutting With a Surgical Robot Using Vision-Based Reinforcement Learning. IEEE Trans. Autom. Sci. Eng. 2025, 22, 4354–4365. [Google Scholar] [CrossRef]
- Hardman, D.; George Thuruthel, T.; Iida, F. Manipulation of Free-Floating Objects Using Faraday Flows and Deep Reinforcement Learning. Sci. Rep. 2022, 12, 335. [Google Scholar] [CrossRef]
- Hazem, Z.B.; Saidi, F.; Guler, N.; Altaif, A.H. Reinforcement Learning-Based Intelligent Trajectory Tracking for a 5-DOF Mitsubishi Robotic Arm: Comparative Evaluation of DDPG, LC-DDPG, and TD3-ADX. Int. J. Intell. Robot. Appl. 2025, 9, 1982–2002. [Google Scholar] [CrossRef]
- Heaton, J.; Givigi, S. A Deep Reinforcement Learning Solution for the Low Level Motion Control of a Robot Manipulator System. In Proceedings of the 2023 IEEE International Systems Conference (SysCon), Vancouver, BC, Canada, April 17 2023; IEEE; pp. 1–7. [Google Scholar]
- He, W.; Huang, H.; Ge, S.S. Adaptive Neural Network Control of a Robotic Manipulator With Time-Varying Output Constraints. IEEE Trans. Cybern. 2017, 47, 3136–3147. [Google Scholar] [CrossRef] [PubMed]
- He, W.; Gao, H.; Zhou, C.; Yang, C.; Li, Z. Reinforcement Learning Control of a Flexible Two-Link Manipulator: An Experimental Investigation. IEEE Trans. Syst. Man Cybern. Syst. 2021, 51, 7326–7336. [Google Scholar] [CrossRef]
- He, W.; Yan, Z.; Sun, Y.; Ou, Y.; Sun, C. Neural-Learning-Based Control for a Constrained Robotic Manipulator With Flexible Joints. IEEE Trans. Neural Netw. Learn. Syst. 2018, 29, 5993–6003. [Google Scholar] [CrossRef]
- Al Homsi, M.; Trumić, M.; Fagiolini, A.; Cirrincione, G. Comparative Analysis of Deep Q-Learning Algorithms for Object Throwing Using a Robot Manipulator. Front. Robot. AI 2025, 12, 1567211. [Google Scholar] [CrossRef]
- Hosny, Z.; Nassar, A.; AboElyazeed, A.; Mohamed, M.; Abouheaf, M.; Gueaieb, W. An Online Model-Free Reinforcement Learning Approach for 6-DOF Robot Manipulators. In Proceedings of the 2023 IEEE International Symposium on Robotic and Sensors Environments (ROSE), November 6 2023; IEEE: Tokyo, Japan; pp. 1–7. [Google Scholar]
- Huang, Y.; Liu, D.; Liu, Z.; Wang, K.; Wang, Q.; Tan, J. A Novel Robotic Grasping Method for Moving Objects Based on Multi-Agent Deep Reinforcement Learning. Robot. Comput.-Integr. Manuf. 2024, 86, 102644. [Google Scholar] [CrossRef]
- Hu, J.; Wang, F.; Yi, J.; Li, X.; Xie, Z. Trajectory Tracking Control Based on Deep Reinforcement Learning and Ensemble Random Network Distillation for Robotic Manipulator. J. Phys. Conf. Ser. 2024, 2850, 012007. [Google Scholar] [CrossRef]
- Hu, J.; Mao, J.; Zhou, X.; Zhang, C. Real-Time Obstacle Avoidance and Pathfinding for Robot Manipulators Based on Deep Reinforcement Learning. In Intelligent Robotics and Applications; Lecture Notes in Computer Science; Lan, X., Mei, X., Jiang, C., Zhao, F., Tian, Z., Eds.; Springer Nature Singapore: Singapore, 2025; Vol. 15203, pp. 154–166. ISBN 978-981-96-0794-5. [Google Scholar]
- Hu, Y.; Si, B. A Reinforcement Learning Neural Network for Robotic Manipulator Control. Neural Comput. 2018, 30, 1983–2004. [Google Scholar] [CrossRef]
- Hu, Y.; Wang, W.; Liu, H.; Liu, L. Reinforcement Learning Tracking Control for Robotic Manipulator With Kernel-Based Dynamic Model. IEEE Trans. Neural Netw. Learn. Syst. 2020, 31, 3570–3578. [Google Scholar] [CrossRef]
- Hu, Z.; Zheng, Y.; Pan, J. Grasping Living Objects With Adversarial Behaviors Using Inverse Reinforcement Learning. IEEE Trans. Robot. 2023, 39, 1151–1163. [Google Scholar] [CrossRef]
- Hwang, J.-H.; Kang, Y.-C.; Park, J.-W.; Kim, D.W. Advanced Interval Type-2 Fuzzy Sliding Mode Control for Robot Manipulator. Comput. Intell. Neurosci. 2017, 2017, 1–11. [Google Scholar] [CrossRef]
- Incremona, G.P.; Sacchi, N.; Sangiovanni, B.; Ferrara, A. Experimental Assessment of Deep Reinforcement Learning for Robot Obstacle Avoidance: A LPV Control Perspective. IFAC-Pap. 2021, 54, 89–94. [Google Scholar] [CrossRef]
- Iqdymat, A.; Stamatescu, G. Reinforcement Learning of a Six-DOF Industrial Manipulator for Pick-and-Place Application Using Efficient Control in Warehouse Management. Sustainability 2025, 17, 432. [Google Scholar] [CrossRef]
- Iriondo, A.; Lazkano, E.; Susperregi, L.; Urain, J.; Fernandez, A.; Molina, J. Pick and Place Operations in Logistics Using a Mobile Manipulator Controlled with Deep Reinforcement Learning. Appl. Sci. 2019, 9, 348. [Google Scholar] [CrossRef]
- Iwasaki, H.; Okuyama, A. Development of a Reference Signal Self-Organizing Control System Based on Deep Reinforcement Learning. In Proceedings of the 2021 IEEE International Conference on Mechatronics (ICM), March 7 2021; IEEE: Kashiwa, Japan; pp. 1–5. [Google Scholar]
- James, S.; Johns, E. 3D Simulation for Robot Arm Control with Deep Q-Learning 2016.
- Jeong, J.-H.; Shim, K.-H.; Kim, D.-J.; Lee, S.-W. Brain-Controlled Robotic Arm System Based on Multi-Directional CNN-BiLSTM Network Using EEG Signals. IEEE Trans. Neural Syst. Rehabil. Eng. 2020, 28, 1226–1238. [Google Scholar] [CrossRef]
- Jiang, D.; Wang, H.; Lu, Y. Mastering the Complex Assembly Task With a Dual-Arm Robot: A Novel Reinforcement Learning Method. IEEE Robot. Autom. Mag. 2023, 30, 57–66. [Google Scholar] [CrossRef]
- Jiang, H.; An, T.; Zhang, Z.; Zhu, M.; Li, Y.; Dong, B. Adaptive Fuzzy Optimal Control of Modular Robot Manipulators Systems via Integral Reinforcement Learning-Based Value Iteration Algorithm. In Proceedings of the 2024 IEEE 14th International Conference on CYBER Technology in Automation, Control, and Intelligent Systems (CYBER), July 16 2024; IEEE: Copenhagen, Denmark; pp. 388–393. [Google Scholar]
- Jiang, R.; He, B.; Wang, Z.; Zhou, Y.; Xu, S.; Li, X. A Novel Simulation-Reality Closed-Loop Learning Framework for Autonomous Robot Skill Learning. IEEE Trans. Cogn. Dev. Syst. 2022, 14, 1520–1531. [Google Scholar] [CrossRef]
- Jin, L.; Huang, R.; Liu, M.; Ma, X. Cerebellum-Inspired Learning and Control Scheme for Redundant Manipulators at Joint Velocity Level. IEEE Trans. Cybern. 2024, 54, 6297–6306. [Google Scholar] [CrossRef]
- Joshi, S.; Kumra, S.; Sahin, F. Robotic Grasping Using Deep Reinforcement Learning. In Proceedings of the 2020 IEEE 16th International Conference on Automation Science and Engineering (CASE), August 2020; IEEE: Hong Kong, Hong Kong; pp. 1461–1466. [Google Scholar]
- Josifovski, J.; Malmir, M.; Klarmann, N.; Žagar, B.L.; Navarro-Guerrero, N.; Knoll, A. Analysis of Randomization Effects on Sim2Real Transfer in Reinforcement Learning for Robotic Manipulation Tasks. In Proceedings of the 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), October 23 2022; IEEE: Kyoto, Japan; pp. 10193–10200. [Google Scholar]
- Kalashnikov, D.; Irpan, A.; Pastor, P.; Ibarz, J.; Herzog, A.; Jang, E.; Quillen, D.; Holly, E.; Kalakrishnan, M.; Vanhoucke, V.; et al. Scalable Deep Reinforcement Learning for Vision-Based Robotic Manipulation. 2018. [Google Scholar] [CrossRef]
- Kamali, K.; Bonev, I.A.; Desrosiers, C. Real-Time Motion Planning for Robotic Teleoperation Using Dynamic-Goal Deep Reinforcement Learning. In Proceedings of the 2020 17th Conference on Computer and Robot Vision (CRV), Ottawa, ON, Canada, May 2020; IEEE; pp. 182–189. [Google Scholar]
- Kang, E.; Qiao, H.; Gao, J.; Yang, W. Neural Network-Based Model Predictive Tracking Control of an Uncertain Robotic Manipulator with Input Constraints. ISA Trans. 2021, 109, 89–101. [Google Scholar] [CrossRef] [PubMed]
- Kankashvar, M.R.; Kharrati, H.; Asl, R.M.; Sadeghiani, A.B. Designing PID Controllers for a Five-Bar Linkage Robot Manipulator Using BBO Algorithm. In Proceedings of the 2015 6th International Conference on Modeling, Simulation, and Applied Optimization (ICMSAO), May 2015; IEEE: Istanbul, Turkey; pp. 1–6. [Google Scholar]
- Kataoka, S.; Ghasemipour, S.K.S.; Freeman, D.; Mordatch, I. Bi-Manual Manipulation and Attachment via Sim-to-Real Reinforcement Learning 2022.
- Katyal, K.; Wang, I.-J.; Burlina, P. Leveraging Deep Reinforcement Learning for Reaching Robotic Tasks. In Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), July 2017; IEEE: Honolulu, HI, USA; pp. 490–491. [Google Scholar]
- Kaur, C.; Balobaid, A. Artificial Intelligence in Robotics: Revolutionizing Industrial Automation and Beyond. In Proceedings of the 2025 International Conference on Frontier Technologies and Solutions (ICFTS), Chennai, India, March 27 2025; IEEE; pp. 1–7. [Google Scholar]
- Khan, A.H.; Li, S.; Luo, X. Obstacle Avoidance and Tracking Control of Redundant Robotic Manipulator: An RNN-Based Metaheuristic Approach. IEEE Trans. Ind. Inform. 2020, 16, 4670–4680. [Google Scholar] [CrossRef]
- Khodamipour, G.; Khorashadizadeh, S.; Farshad, M. Observer-Based Adaptive Control of Robot Manipulators Using Reinforcement Learning and the Fourier Series Expansion. Trans. Inst. Meas. Control 2021, 43, 2307–2320. [Google Scholar] [CrossRef]
- Kilinc, O.; Montana, G. Reinforcement Learning for Robotic Manipulation Using Simulated Locomotion Demonstrations. Mach. Learn. 2022, 111, 465–486. [Google Scholar] [CrossRef]
- Kim, M.; Yang, S.; Kim, B.; Kim, J.; Kim, D. Human-to-Robot Handover Based on Reinforcement Learning. Sensors 2024, 24, 6275. [Google Scholar] [CrossRef]
- Kuang, Y. Deep Reinforcement Learning and Transfer Learning of Robot In-Hand Dexterous Manipulation; 2023. [Google Scholar]
- Kumar, S.; Rani, K.; Banga, K.V. Robotic Arm Movement Optimization Using Soft Computing. IAES Int. J. Robot. Autom. IJRA 2017, 6, 1. [Google Scholar] [CrossRef]
- Kumar, V.; Hoeller, D.; Sundaralingam, B.; Tremblay, J.; Birchfield, S. Joint Space Control via Deep Reinforcement Learning. In Proceedings of the 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), September 27 2021; IEEE: Prague, Czech Republic; pp. 3619–3626. [Google Scholar]
- Kunal, K.; Kathiravan, M.; Madeshwaren, V.; Chandrakala, T.; Ganesan, V.; Gupta, S. AI-Driven Intelligent Control Strategies for Industrial Robotics: A Reinforcement Learning Approach. Int. J. Basic Appl. Sci. 2025, 14, 429–440. [Google Scholar] [CrossRef]
- Kurrek, P.; Zoghlami, F.; Jocas, M.; Stoelen, M.; Salehi, V. Q-Model: An Artificial Intelligence Based Methodology for the Development of Autonomous Robots. J. Comput. Inf. Sci. Eng. 2020, 20, 061006. [Google Scholar] [CrossRef]
- Lahmann, J.; Erwin, A. Series Elastic Actuation Improves Dynamic Performance after Reinforcement Learning. IFAC-Pap. 2025, 59, 479–484. [Google Scholar] [CrossRef]
- Lee, C.; An, D. AI-Based Posture Control Algorithm for a 7-DOF Robot Manipulator. Machines 2022, 10, 651. [Google Scholar] [CrossRef]
- Lee, D.; Kim, H.; Kim, S.; Park, C.-W.; Park, J.H. Learning Control Policy with Previous Experiences from Robot Simulator. In Proceedings of the 2020 International Conference on Information and Communication Technology Convergence (ICTC), October 21 2020; IEEE: Jeju, Korea (South); pp. 863–865. [Google Scholar]
- Lee, J.Y.; Lee, S.; Mishra, A.; Yan, X.; McMahan, B.; Gaisford, B.; Kobashigawa, C.; Qu, M.; Xie, C.; Kao, J.C. Non-Invasive Brain-Machine Interface Control with Artificial Intelligence Copilots 2024.
- Liang, X.; Yao, Z.; Ge, Y.; Yao, J. Reinforcement Learning Based Adaptive Control for Uncertain Mechanical Systems with Asymptotic Tracking. Def. Technol. 2024, 34, 19–28. [Google Scholar] [CrossRef]
- Lin, T.; Sachdev, K.; Fan, L.; Malik, J.; Zhu, Y. Sim-to-Real Reinforcement Learning for Vision-Based Dexterous Manipulation on Humanoids 2025.
- Lin, X.; Liu, R. A Distributed Cerebellar-Inspired Learning Model for Robotic Arm Control. In Proceedings of the 2017 39th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), July 2017; IEEE: Seogwipo; pp. 929–932. [Google Scholar]
- Lin, Y.; Church, A.; Yang, M.; Li, H.; Lloyd, J.; Zhang, D.; Lepora, N.F. Bi-Touch: Bimanual Tactile Manipulation With Sim-to-Real Deep Reinforcement Learning. IEEE Robot. Autom. Lett. 2023, 8, 5472–5479. [Google Scholar] [CrossRef]
- Lin, Y.; Lloyd, J.; Church, A.; Lepora, N.F. Tactile Gym 2.0: Sim-to-Real Deep Reinforcement Learning for Comparing Low-Cost High-Resolution Robot Touch. IEEE Robot. Autom. Lett. 2022, 7, 10754–10761. [Google Scholar] [CrossRef]
- Liu, A.; Zhao, H.; Song, T.; Liu, Z.; Wang, H.; Sun, D. Adaptive Control of Manipulator Based on Neural Network. Neural Comput. Appl. 2021, 33, 4077–4085. [Google Scholar] [CrossRef]
- Liu, D.; Wang, Z.; Lu, B.; Cong, M.; Yu, H.; Zou, Q. A Reinforcement Learning-Based Framework for Robot Manipulation Skill Acquisition. IEEE Access 2020, 8, 108429–108437. [Google Scholar] [CrossRef]
- Liu, J.; Yap, H.J.; Khairuddin, A.S.M. Path Planning for the Robotic Manipulator in Dynamic Environments Based on a Deep Reinforcement Learning Method. J. Intell. Robot. Syst. 2024, 111, 3. [Google Scholar] [CrossRef]
- Liu, N.; Lu, T.; Cai, Y.; Wang, R.; Wang, S. Real-World Robot Reaching Skill Learning Based on Deep Reinforcement Learning. In Proceedings of the 2020 Chinese Control And Decision Conference (CCDC), August 2020; IEEE: Hefei, China; pp. 4780–4784. [Google Scholar]
- Liu, N.; Li, L.; Hao, B.; Yang, L.; Hu, T.; Xue, T.; Wang, S. Modeling and Simulation of Robot Inverse Dynamics Using LSTM-Based Deep Learning Algorithm for Smart Cities and Factories. IEEE Access 2019, 7, 173989–173998. [Google Scholar] [CrossRef]
- Liu, W.; Niu, H.; Mahyuddin, M.N.; Herrmann, G.; Carrasco, J. A Model-Free Deep Reinforcement Learning Approach for Robotic Manipulators Path Planning. In Proceedings of the 2021 21st International Conference on Control, Automation and Systems (ICCAS), October 12 2021; IEEE: Jeju, Korea, Republic of; pp. 512–517. [Google Scholar]
- Liu, X.; Wang, G.; Liu, Z.; Liu, Y.; Liu, Z.; Huang, P. Hierarchical Reinforcement Learning Integrating With Human Knowledge for Practical Robot Skill Learning in Complex Multi-Stage Manipulation. IEEE Trans. Autom. Sci. Eng. 2024, 21, 3852–3862. [Google Scholar] [CrossRef]
- Li, C.; Liu, Z.; Li, L.; Ji, Z.; Li, C.; Liang, J.; Li, Y. Improved PPO Optimization for Robotic Arm Grasping Trajectory Planning and Real-Robot Migration. Sensors 2025, 25, 5253. [Google Scholar] [CrossRef]
- Li, Q.; Pang, Y.; Wang, Y.; Han, X.; Li, Q.; Zhao, M. CBMC: A Biomimetic Approach for Control of a 7-Degree of Freedom Robotic Arm. Biomimetics 2023, 8, 389. [Google Scholar] [CrossRef] [PubMed]
- Li, R. Image Features for Vision-Based Robot Manipulation Based on Deep Reinforcement Learning. In Proceedings of the 2021 IEEE 17th International Conference on Intelligent Computer Communication and Processing (ICCP), October 28 2021; IEEE: Cluj-Napoca, Romania; pp. 359–364. [Google Scholar]
- Li, X.; Serlin, Z.; Yang, G.; Belta, C. A Formal Methods Approach to Interpretable Reinforcement Learning for Robotic Planning. Sci. Robot. 2019, 4, eaay6276. [Google Scholar] [CrossRef]
- Li, X.; Zhong, J.; Kamruzzaman, M.M. Complicated Robot Activity Recognition by Quality-Aware Deep Reinforcement Learning. Future Gener. Comput. Syst. 2021, 117, 480–485. [Google Scholar] [CrossRef]
- Li, X.; Zhao, J.; Wang, A.; Huang, Y. Sliding Mode Control for Uncertain Robot Manipulators Based on Reinforcement Learning. In Proceedings of the 2024 6th International Conference on Robotics, Intelligent Control and Artificial Intelligence (RICAI), December 6 2024; IEEE: Nanjing, China; pp. 74–79. [Google Scholar]
- Li, Y.; Chen, L.; Tee, K.P.; Li, Q. Reinforcement Learning Control for Coordinated Manipulation of Multi-Robots. Neurocomputing 2015, 170, 168–175. [Google Scholar] [CrossRef]
- Li, Y.; Hao, X.; She, Y.; Li, S.; Yu, M. Constrained Motion Planning of Free-Float Dual-Arm Space Manipulator via Deep Reinforcement Learning. Aerosp. Sci. Technol. 2021, 109, 106446. [Google Scholar] [CrossRef]
- Li, Z.; Li, S. Neural Network Model-Based Control for Manipulator: An Autoencoder Perspective. IEEE Trans. Neural Netw. Learn. Syst. 2023, 34, 2854–2868. [Google Scholar] [CrossRef] [PubMed]
- Li, Z.; Zhao, T.; Chen, F.; Hu, Y.; Su, C.-Y.; Fukuda, T. Reinforcement Learning of Manipulation and Grasping Using Dynamical Movement Primitives for a Humanoidlike Mobile Manipulator. IEEEASME Trans. Mechatron. 2018, 23, 121–131. [Google Scholar] [CrossRef]
- Lobbezoo, A.; Kwon, H.-J. Simulated and Real Robotic Reach, Grasp, and Pick-and-Place Using Combined Reinforcement Learning and Traditional Controls. Robotics 2023, 12, 12. [Google Scholar] [CrossRef]
- Luo, J.; Solowjow, E.; Wen, C.; Ojea, J.A.; Agogino, A.M. Deep Reinforcement Learning for Robotic Assembly of Mixed Deformable and Rigid Objects. In Proceedings of the 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), October 2018; IEEE: Madrid; pp. 2062–2069. [Google Scholar]
- Luo, J.; Xu, C.; Wu, J.; Levine, S. Precise and Dexterous Robotic Manipulation via Human-in-the-Loop Reinforcement Learning 2025.
- Luo, Y.; Cong, Y.; Du, H.; Zhu, W.; Shen, Z. Finite-Time Active Disturbance Rejection Trajectory Tracking Control of Robot Manipulators via PPO Algorithm. In Proceedings of the 2025 44th Chinese Control Conference (CCC), Chongqing, China, July 28 2025; IEEE; pp. 552–557. [Google Scholar]
- Majumder, S.; Sahoo, S.R. Enhancing 3D Trajectory Tracking of Robotic Manipulator Using Sequential Deep Reinforcement Learning with Disturbance Rejection. In Proceedings of the 2024 European Control Conference (ECC), Stockholm, Sweden, June 25 2024; IEEE; pp. 2512–2517. [Google Scholar]
- Majumder, S.; Sahoo, S.R. A Reinforcement-Learning Approach to Control Robotic Manipulator Based on Improved DDPG. In Proceedings of the 2023 Ninth Indian Control Conference (ICC), December 18 2023; IEEE: Visakhapatnam, India; pp. 281–286. [Google Scholar]
- Maldonado-Ramirez, A.; Rios-Cabrera, R.; Lopez-Juarez, I. A Visual Path-Following Learning Approach for Industrial Robots Using DRL. Robot. Comput.-Integr. Manuf. 2021, 71, 102130. [Google Scholar] [CrossRef]
- Malik, A.; Lischuk, Y.; Henderson, T.; Prazenica, R. A Deep Reinforcement-Learning Approach for Inverse Kinematics Solution of a High Degree of Freedom Robotic Manipulator. Robotics 2022, 11, 44. [Google Scholar] [CrossRef]
- Mannaa, A.S.; Zarubin, A.O. Manipulators with Machine Learning-Based Control with Reinforcement. In Proceedings of the 2023 V International Conference on Control in Technical Systems (CTS), September 21 2023; IEEE: Saint Petersburg, Russian Federation; pp. 232–234. [Google Scholar]
- Mao, J.; Hu, J.; Zhou, X.; Zhang, C.; Yang, J.; Wang, H. Real-Time Safe and Smooth Visual Servoing for Robot Manipulators via Reinforcement Learning. IEEE Trans. Ind. Electron. 2025, 1–12. [Google Scholar] [CrossRef]
- Matas, J.; James, S.; Davison, A.J. Sim-to-Real Reinforcement Learning for Deformable Object Manipulation.
- Mazzaglia, P.; Backshall, N.; Ma, X.; James, S. Redundancy-Aware Action Spaces for Robot Learning. IEEE Robot. Autom. Lett. 2024, 9, 6912–6919. [Google Scholar] [CrossRef]
- Ma, X.; Wen, Z.; Hu, T.; Zheng, X. Bridging Embodiment and Learning: A Machine Learning Framework for Next-Generation Embodied Intelligence. In Proceedings of the 2025 7th International Conference on Machine Learning, Big Data and Business Intelligence (MLBDBI), October 24 2025; IEEE: Hangzhou, China; pp. 68–71. [Google Scholar]
- Mellatshahi, N.; Mozaffari, S.; Saif, M.; Alirezaee, S. Inverted Pendulum Control with a Robotic Arm Using Deep Reinforcement Learning. In Proceedings of the 2021 International Symposium on Signals, Circuits and Systems (ISSCS), Iasi, Romania, July 15 2021; IEEE; pp. 1–6. [Google Scholar]
- Meyes, R.; Tercan, H.; Roggendorf, S.; Thiele, T.; Büscher, C.; Obdenbusch, M.; Brecher, C.; Jeschke, S.; Meisen, T. Motion Planning for Industrial Robots Using Reinforcement Learning. Procedia CIRP 2017, 63, 107–112. [Google Scholar] [CrossRef]
- Molina, A.M. Learning Robotic Manipulation Tasks Using Relational Reinforcement Learning and Human Demonstrations.
- Moon, J.; Bae, S.-H.; Cashmore, M. Meta Reinforcement Learning Based Underwater Manipulator Control. In Proceedings of the 2021 21st International Conference on Control, Automation and Systems (ICCAS), October 12 2021; IEEE: Jeju, Korea, Republic of; pp. 1473–1476. [Google Scholar]
- Mueangprasert, M.; Chermprayong, P.; Boonlong, K. Robot Arm Movement Control by Model-Based Reinforcement Learning Using Machine Learning Regression Techniques and Particle Swarm Optimization. In Proceedings of the 2023 Third International Symposium on Instrumentation, Control, Artificial Intelligence, and Robotics (ICA-SYMP), January 18 2023; IEEE: Bangkok, Thailand; pp. 83–86. [Google Scholar]
- Naranjo-Campos, F.J.; Victores, J.G.; Balaguer, C. Method for Bottle Opening with a Dual-Arm Robot. Biomimetics 2024, 9, 577. [Google Scholar] [CrossRef] [PubMed]
- Nguyen Nguyen, L.H.; Dao, M.K.; Hua, H.Q.B.; Pham, P.-T.; Ngo, X.-K.; Nguyen, Q.C. Motion Navigation Algorithm Based on Deep Reinforcement Learning for Manipulators. In Proceedings of the 2023 IEEE 3rd International Conference on Electronic Communications, Internet of Things and Big Data (ICEIB), April 14 2023; IEEE: Taichung, Taiwan; pp. 537–541. [Google Scholar]
- Rahimi Nohooji, H.; Zaraki, A.; Voos, H. Actor–Critic Learning Based PID Control for Robotic Manipulators. Appl. Soft Comput. 2024, 151, 111153. [Google Scholar] [CrossRef]
- Ouyang, Y.; He, W.; Li, X. Reinforcement Learning Control of a Single-link Flexible Robotic Manipulator. IET Control Theory Appl. 2017, 11, 1426–1433. [Google Scholar] [CrossRef]
- Pane, Y.P.; Nageshrao, S.P.; Babuska, R. Actor-Critic Reinforcement Learning for Tracking Control in Robotics. In Proceedings of the 2016 IEEE 55th Conference on Decision and Control (CDC), December 2016; IEEE: Las Vegas, NV, USA; pp. 5819–5826. [Google Scholar]
- Pane, Y.P.; Nageshrao, S.P.; Kober, J.; Babuška, R. Reinforcement Learning Based Compensation Methods for Robot Manipulators. Eng. Appl. Artif. Intell. 2019, 78, 236–247. [Google Scholar] [CrossRef]
- Pantoja-Garcia, L.; Garcia-Rodriguez, R.; Parra-Vega, V. A Novel Adaptive Actor-Critic Reinforcement Learning Controller for Constrained Robots. In Proceedings of the 2022 IEEE International Conference on Automation/XXV Congress of the Chilean Association of Automatic Control (ICA-ACCA), October 24 2022; IEEE: Curicó, Chile; pp. 1–6. [Google Scholar]
- Pan, S.; Cao, F. Deep Reinforcement Learning-Based Control Strategy for Underwater Manipulator Systems. IFAC-Pap. 2025, 59, 430–435. [Google Scholar] [CrossRef]
- Pan, Z.; Zhou, J.; Fan, Q.; Feng, Z.; Gao, X.; Su, M. Robotic Control Mechanism Based on Deep Reinforcement Learning. In Proceedings of the 2023 2nd International Symposium on Control Engineering and Robotics (ISCER), February 2023; IEEE: Hangzhou, China; pp. 70–74. [Google Scholar]
- Park, S.-Y.; Lee, C.; Kim, H.; Ahn, S.-H. Enhancement of Control Performance for Degraded Robot Manipulators Using Digital Twin and Proximal Policy Optimization. IEEE Access 2024, 12, 19569–19583. [Google Scholar] [CrossRef]
- Pavlichenko, D.; Behnke, S. Real-Robot Deep Reinforcement Learning: Improving Trajectory Tracking of Flexible-Joint Manipulator with Reference Correction. In Proceedings of the 2022 International Conference on Robotics and Automation (ICRA), Philadelphia, PA, USA, May 23 2022; IEEE; pp. 2671–2677. [Google Scholar]
- Pedersen, O.-M.; Misimi, E.; Chaumette, F. Grasping Unknown Objects by Coupling Deep Reinforcement Learning, Generative Adversarial Networks, and Visual Servoing. In Proceedings of the 2020 IEEE International Conference on Robotics and Automation (ICRA), Paris, France, May 2020; IEEE; pp. 5655–5662. [Google Scholar]
- Perrusquia, A.; Yu, W. Human-Behavior Learning for Infinite-Horizon Optimal Tracking Problems of Robot Manipulators. In Proceedings of the 2021 60th IEEE Conference on Decision and Control (CDC), Austin, TX, USA, December 14 2021; IEEE; pp. 57–62. [Google Scholar]
- Perrusquia, A.; Yu, W.; Soria, A. Large Space Dimension Reinforcement Learning for Robot Position/Force Discrete Control. In Proceedings of the 2019 6th International Conference on Control, Decision and Information Technologies (CoDIT), April 2019; IEEE: Paris, France; pp. 91–96. [Google Scholar]
- Pham, P.-C.; Kuo, Y.-L. Online Reinforcement Learning Based Real-Time Robust Adaptive Control Design for Robot Manipulators. Int. J. Control Autom. Syst. 2025, 23, 3048–3059. [Google Scholar] [CrossRef]
- Polydoros, A.S.; Nalpantidis, L.; Kruger, V. Real-Time Deep Learning of Robotic Manipulator Inverse Dynamics. In Proceedings of the 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), September 2015; IEEE: Hamburg, Germany; pp. 3442–3448. [Google Scholar]
- Popov, I.; Heess, N.; Lillicrap, T.; Hafner, R.; Barth-Maron, G.; Vecerik, M.; Lampe, T.; Tassa, Y.; Erez, T.; Riedmiller, M. Data-Efficient Deep Reinforcement Learning for Dexterous Manipulation. [CrossRef]
- Qiao, D.; Zhong, Z.; Zhang, H.; Zhao, Y. Trajectory Planning of Manipulator Based on DQN Algorithm Guided by MPC Sampling. In Proceedings of the 2021 3rd International Symposium on Robotics & Intelligent Manufacturing Technology (ISRIMT), September 24 2021; IEEE: Changzhou, China; pp. 319–323. [Google Scholar]
- Qin, Y.; Huang, B.; Yin, Z.-H.; Su, H.; Wang, X. DexPoint: Generalizable Point Cloud Reinforcement Learning for Sim-to-Real Dexterous Manipulation.
- Qi, G.; Li, Y. Reinforcement Learning Control for Robot Arm Grasping Based on Improved DDPG. In Proceedings of the 2021 40th Chinese Control Conference (CCC), July 26 2021; IEEE: Shanghai, China; pp. 4132–4137. [Google Scholar]
- Quillen, D.; Jang, E.; Nachum, O.; Finn, C.; Ibarz, J.; Levine, S. Deep Reinforcement Learning for Vision-Based Robotic Grasping: A Simulated Comparative Evaluation of Off-Policy Methods. In Proceedings of the 2018 IEEE International Conference on Robotics and Automation (ICRA), Brisbane, QLD, May 2018; IEEE; pp. 6284–6291. [Google Scholar]
- Rajeswaran, A.; Kumar, V.; Gupta, A.; Vezzani, G.; Schulman, J.; Todorov, E.; Levine, S. Learning Complex Dexterous Manipulation with Deep Reinforcement Learning and Demonstrations 2018.
- Ramirez, J.; Yu, W. Redundant Robot Control with Learning from Expert Demonstrations. In Proceedings of the 2022 IEEE Symposium Series on Computational Intelligence (SSCI), December 4 2022; IEEE: Singapore, Singapore; pp. 715–720. [Google Scholar]
- Ramirez, J.; Yu, W. Reinforcement Learning from Expert Demonstrations with Application to Redundant Robot Control. Eng. Appl. Artif. Intell. 2023, 119, 105753. [Google Scholar] [CrossRef]
- Ren, B.; Wang, Y.; Chen, J. Trajectory-Tracking-Based Adaptive Neural Network Sliding Mode Controller for Robot Manipulators. J. Comput. Inf. Sci. Eng. 2020, 20, 031009. [Google Scholar] [CrossRef]
- Ren, H.; Ben-Tzvi, P. Learning Inverse Kinematics and Dynamics of a Robotic Manipulator Using Generative Adversarial Networks. Robot. Auton. Syst. 2020, 124, 103386. [Google Scholar] [CrossRef]
- Rizzardo, C.; Chen, F.; Caldwell, D. Sim-to-Real via Latent Prediction: Transferring Visual Non-Prehensile Manipulation Policies. Front. Robot. AI 2023, 9, 1067502. [Google Scholar] [CrossRef]
- Rubagotti, M.; Sangiovanni, B.; Nurbayeva, A.; Incremona, G.P.; Ferrara, A.; Shintemirov, A. Shared Control of Robot Manipulators With Obstacle Avoidance: A Deep Reinforcement Learning Approach. IEEE Control Syst. 2023, 43, 44–63. [Google Scholar] [CrossRef]
- Saeed, M.; Nagdi, M.; Rosman, B.; Ali, H.H.S.M. Deep Reinforcement Learning for Robotic Hand Manipulation. In Proceedings of the 2020 International Conference on Computer, Control, Electrical, and Electronics Engineering (ICCCEEE), February 26 2021; IEEE: Khartoum, Sudan; pp. 1–5. [Google Scholar]
- Sahu, U.K.; Patra, D.; Subudhi, B. Deep Reinforcement Learning Controller for Vision-Based Serial Flexible Link Manipulator. In Proceedings of the 2021 International Symposium of Asian Control Association on Intelligent Robotics and Industrial Automation (IRIA), September 20 2021; IEEE: Goa, India; pp. 331–336. [Google Scholar]
- Saidi, K.; Boumediene, A.; Boubekeur, D. An Optimal GA-Based Backstepping Control Scheme for a MIMO Nonlinear System. J. Eur. Systèmes Autom. 2023, 56, 89–96. [Google Scholar] [CrossRef]
- Said, A.; Khalil, A.; Petra, R.; Yunus, S.; Peng, A.S.; Khan, S. PID Controller Optimization of Teleoperated 2DOF Robot Manipulator Using Artificial Bee Colony Algorithm. In Proceedings of the 2019 IEEE 6th International Conference on Engineering Technologies and Applied Sciences (ICETAS), December 2019; IEEE: Kuala Lumpur, Malaysia; pp. 1–6. [Google Scholar]
- Sajadi, S.M.; Karbasi, S.M.; Brun, H.; Tørresen, J.; Elle, O.J.; Mathiassen, K. Towards Autonomous Robotic Biopsy—Design, Modeling and Control of a Robot for Needle Insertion of a Commercial Full Core Biopsy Instrument. Front. Robot. AI 2022, 9, 896267. [Google Scholar] [CrossRef]
- Sangiovanni, B.; Incremona, G.P.; Piastra, M.; Ferrara, A. Self-Configuring Robot Path Planning With Obstacle Avoidance via Deep Reinforcement Learning. IEEE Control Syst. Lett. 2021, 5, 397–402. [Google Scholar] [CrossRef]
- Sangiovanni, B.; Rendiniello, A.; Incremona, G.P.; Ferrara, A.; Piastra, M. Deep Reinforcement Learning for Collision Avoidance of Robotic Manipulators. In Proceedings of the 2018 European Control Conference (ECC), June 2018; IEEE: Limassol; pp. 2063–2068. [Google Scholar]
- Sangiovanni, B.; Incremona, G.P.; Ferrara, A.; Piastra, M. Deep Reinforcement Learning Based Self-Configuring Integral Sliding Mode Control Scheme for Robot Manipulators. In Proceedings of the 2018 IEEE Conference on Decision and Control (CDC), December 2018; IEEE: Miami Beach, FL; pp. 5969–5974. [Google Scholar]
- Scheikl, P.M.; Tagliabue, E.; Gyenes, B.; Wagner, M.; Dall’Alba, D.; Fiorini, P.; Mathis-Ullrich, F. Sim-to-Real Transfer for Visual Reinforcement Learning of Deformable Object Manipulation for Robot-Assisted Surgery. IEEE Robot. Autom. Lett. 2023, 8, 560–567. [Google Scholar] [CrossRef]
- Sekkat, H.; Tigani, S.; Saadane, R.; Chehri, A. Vision-Based Robotic Arm Control Algorithm Using Deep Reinforcement Learning for Autonomous Objects Grasping. Appl. Sci. 2021, 11, 7917. [Google Scholar] [CrossRef]
- Shahid, A.A.; Piga, D.; Braghin, F.; Roveda, L. Continuous Control Actions Learning and Adaptation for Robotic Manipulation through Reinforcement Learning. Auton. Robots 2022, 46, 483–498. [Google Scholar] [CrossRef]
- Shahna, M.H.; Adel Alizadeh Kolagar, S.; Mattila, J. Integrating DeepRL with Robust Low-Level Control in Robotic Manipulators for Non-Repetitive Reaching Tasks. In Proceedings of the 2024 IEEE International Conference on Mechatronics and Automation (ICMA), August 4 2024; IEEE: Tianjin, China; pp. 329–336. [Google Scholar]
- Shao, H.; Hu, W.; Yang, L.; Wang, W.; Suzuki, S.; Gao, Z. Intelligent Impedance Strategy for Force–Motion Control of Robotic Manipulators in Unknown Environments via Expert-Guided Deep Reinforcement Learning. Processes 2025, 13, 2526. [Google Scholar] [CrossRef]
- Shetty, M.; Vishishta, B.; Choragi, S.; Subramanian, K.; George, K. Continuous Control of a Robot Manipulator Using Deep Deterministic Policy Gradient. In Proceedings of the 2021 Seventh Indian Control Conference (ICC), December 20 2021; IEEE: Mumbai, India; pp. 213–218. [Google Scholar]
- Shiferaw, B.A.; Agidew, T.F.; Alzahrani, A.S.; Srinivasagan, R. Synergistic Pushing and Grasping for Enhanced Robotic Manipulation Using Deep Reinforcement Learning. Actuators 2024, 13, 316. [Google Scholar] [CrossRef]
- Shin, C.; Ferguson, P.W.; Pedram, S.A.; Ma, J.; Dutson, E.P.; Rosen, J. Autonomous Tissue Manipulation via Surgical Robot Using Learning Based Model Predictive Control. In Proceedings of the 2019 International Conference on Robotics and Automation (ICRA), Montreal, QC, Canada, May 2019; IEEE; pp. 3875–3881. [Google Scholar]
- Siddique, T.; Choutri, K.; Fareh, R.; Dylov, D.; Bettayeb, M. Tracking Control Using Standalone Reinforcement Learning for a Robot Manipulator. In Proceedings of the 2024 Advances in Science and Engineering Technology International Conferences (ASET), June 3 2024; IEEE: Abu Dhabi, United Arab Emirates; pp. 1–6. [Google Scholar]
- Simon, J.; Gogolák, L.; Sárosi, J. Deep Reinforcement Learning-Assisted Teaching Strategy for Industrial Robot Manipulator. Appl. Sci. 2024, 14, 10929. [Google Scholar] [CrossRef]
- Sivertsvik, M.; Sumskiy, K.; Misimi, E. Learning Active Manipulation to Target Shapes with Model-Free, Long-Horizon Deep Reinforcement Learning. In Proceedings of the 2024 IEEE International Conference on Robotics and Automation (ICRA), May 13 2024; IEEE: Yokohama, Japan; pp. 5411–5418. [Google Scholar]
- Song, X.; Xu, P.; Xu, W.; Li, B. Skill Acquisition Framework in Multi-Robot Precision Assembly Based on Cooperative Compliant Control. ISA Trans. 2024, 155, 319–336. [Google Scholar] [CrossRef]
- Song, Y.; Li, Z.; Li, B.; Wen, G. Optimized Leader-follower Consensus Control Using Combination of Reinforcement Learning and Sliding Mode Mechanism for Multiple Robot Manipulator System. Int. J. Robust Nonlinear Control 2024, 34, 5212–5228. [Google Scholar] [CrossRef]
- Staley, E.W.; Katyal, K.D.; Burlina, P. DRL Based Intelligent Joint Manipulator and Viewing Camera Control for Reaching Tasks and Environments with Obstacles and Occluders. In Proceedings of the 2018 International Joint Conference on Neural Networks (IJCNN), July 2018; IEEE: Rio de Janeiro; pp. 1–7. [Google Scholar]
- Sun, C.; Ovtcharova, J. AI-Based Regression Analysis for Optimizing the Performance of Robot Manipulator Trajectory Tracking. In Proceedings of the 2021 21st International Conference on Control, Automation and Systems (ICCAS), Jeju, Korea, Republic of, October 12 2021; IEEE; pp. 489–495. [Google Scholar]
- Sun, X.; Li, J.; Kovalenko, A.V.; Feng, W.; Ou, Y. Integrating Reinforcement Learning and Learning From Demonstrations to Learn Nonprehensile Manipulation. IEEE Trans. Autom. Sci. Eng. 2023, 20, 1735–1744. [Google Scholar] [CrossRef]
- Sun, Z.; Yuan, K.; Hu, W.; Yang, C.; Li, Z. Learning Pregrasp Manipulation of Objects from Ungraspable Poses. In Proceedings of the 2020 IEEE International Conference on Robotics and Automation (ICRA), May 2020; IEEE: Paris, France; pp. 9917–9923. [Google Scholar]
- Su, E.; Jia, C.; Qin, Y.; Zhou, W.; Macaluso, A.; Huang, B.; Wang, X. Sim2Real Manipulation on Unknown Objects with Tactile-Based Reinforcement Learning. In Proceedings of the 2024 IEEE International Conference on Robotics and Automation (ICRA), Yokohama, Japan, May 13 2024; IEEE; pp. 9234–9241. [Google Scholar]
- Su, H.; Zhang, Z.; Zhang, W. Adaptive Backstepping Optimal Tracking Control of Interconnected Robotic Manipulator System Based on Reinforcement Learning. IEEE Trans. Autom. Sci. Eng. 2025, 22, 19555–19567. [Google Scholar] [CrossRef]
- Su, H.; Qi, W.; Gao, H.; Hu, Y.; Shi, Y.; Ferrigno, G.; Momi, E.D. Machine Learning Driven Human Skill Transferring for Control of Anthropomorphic Manipulators. In Robot. Mechatron.; 2020. [Google Scholar]
- Tajdari, F.; Kabganian, M.; Khodabakhshi, E.; Golgouneh, A. Design, Implementation and Control of a Two-Link Fully-Actuated Robot Capable of Online Identification of Unknown Dynamical Parameters Using Adaptive Sliding Mode Controller. In Proceedings of the 2017 Artificial Intelligence and Robotics (IRANOPEN), April 2017; IEEE: Qazvin, Iran; pp. 91–96. [Google Scholar]
- Takeda, S.; Yamamori, S.; Yagi, S.; Morimoto, J. Hierarchically Connecting Modularly-Learned Policies to Generate a Controller for a Combined Robot System. In Proceedings of the 2025 IEEE 21st International Conference on Automation Science and Engineering (CASE), Los Angeles, CA, USA, August 17 2025; IEEE; pp. 1722–1727. [Google Scholar]
- Tang, W.; Cheng, C.; Ai, H.; Chen, L. Dual-Arm Robot Trajectory Planning Based on Deep Reinforcement Learning under Complex Environment. Micromachines 2022, 13, 564. [Google Scholar] [CrossRef] [PubMed]
- Uchibe, E.; Doya, K. Forward and Inverse Reinforcement Learning Sharing Network Weights and Hyperparameters. Neural Netw. 2021, 144, 138–153. [Google Scholar] [CrossRef] [PubMed]
- Valencia, D.; Jia, J.; Li, R.; Hayashi, A.; Lecchi, M.; Terezakis, R.; Gee, T.; Liarokapis, M.; MacDonald, B.A.; Williams, H. Comparison of Model-Based and Model-Free Reinforcement Learning for Real-World Dexterous Robotic Manipulation Tasks. In Proceedings of the 2023 IEEE International Conference on Robotics and Automation (ICRA), May 29 2023; IEEE: London, United Kingdom; pp. 871–878. [Google Scholar]
- Vijay, M.; Jena, D. Backstepping Terminal Sliding Mode Control of Robot Manipulator Using Radial Basis Functional Neural Networks. Comput. Electr. Eng. 2018, 67, 690–707. [Google Scholar] [CrossRef]
- Vu, V.T.; Dao, P.N.; Loc, P.T.; Huy, T.Q. Sliding Variable-Based Online Adaptive Reinforcement Learning of Uncertain/Disturbed Nonlinear Mechanical Systems. J. Control Autom. Electr. Syst. 2021, 32, 281–290. [Google Scholar] [CrossRef]
- J.K., V; Elumalai, V.K. A Proximal Policy Optimization Based Deep Reinforcement Learning Framework for Tracking Control of a Flexible Robotic Manipulator. Results Eng. 2025, 25, 104178. [Google Scholar] [CrossRef]
- Wang, C.; Zhang, Q.; Tian, Q.; Li, S.; Wang, X.; Lane, D.; Petillot, Y.; Wang, S. Learning Mobile Manipulation through Deep Reinforcement Learning. Sensors 2020, 20, 939. [Google Scholar] [CrossRef]
- Wang, F.; Hu, J.; Qin, Y.; Guo, F.; Jiang, M. Trajectory Tracking Control Based on Deep Reinforcement Learning for a Robotic Manipulator with an Input Deadzone. Symmetry 2025, 17, 149. [Google Scholar] [CrossRef]
- Wang, G.; Xin, M.; Wu, W.; Liu, Z.; Wang, H. Learning of Long-Horizon Sparse-Reward Robotic Manipulator Tasks With Base Controllers. IEEE Trans. Neural Netw. Learn. Syst. 2024, 35, 4072–4081. [Google Scholar] [CrossRef] [PubMed]
- Wang, H.; Dong, Z.; Zhu, T.; Lei, H.; Shi, W.; Zhang, Z.; Luo, W.; Wan, W.; Chen, X.; Huang, J. Robot Deformable Object Manipulation via NMPC-Generated Demonstrations in Deep Reinforcement Learning. IEEE Trans. Autom. Sci. Eng. 2025, 22, 23566–23578. [Google Scholar] [CrossRef]
- Wang, J.; Zhao, W.; Cao, J.; Park, J.H.; Shen, H. Reinforcement Learning-Based Predefined-Time Tracking Control for Nonlinear Systems Under Identifier-Critic–Actor Structure. IEEE Trans. Cybern. 2024, 54, 6345–6357. [Google Scholar] [CrossRef] [PubMed]
- Wang, M.; Yang, A. Dynamic Learning From Adaptive Neural Control of Robot Manipulators With Prescribed Performance. IEEE Trans. Syst. Man Cybern. Syst. 2017, 47, 2244–2255. [Google Scholar] [CrossRef]
- Wang, Q.; Sanchez, F.R.; McCarthy, R.; Bulens, D.C.; McGuinness, K.; O’Connor, N.; Wüthrich, M.; Widmaier, F.; Bauer, S.; Redmond, S.J. Dexterous Robotic Manipulation Using Deep Reinforcement Learning and Knowledge Transfer for Complex Sparse Reward-based Tasks. Expert Syst. 2023, 40, e13205. [Google Scholar] [CrossRef]
- Wang, S.; Shao, X.; Yang, L.; Liu, N. Deep Learning Aided Dynamic Parameter Identification of 6-DOF Robot Manipulators. IEEE Access 2020, 8, 138102–138116. [Google Scholar] [CrossRef]
- Wang, T.; Wang, F.; Xie, Z.; Qin, F. Curiosity Model Policy Optimization for Robotic Manipulator Tracking Control with Input Saturation in Uncertain Environment. Front. Neurorobotics 2024, 18, 1376215. [Google Scholar] [CrossRef]
- Wang, Y.; An, T.; Dong, B.; Zhu, M.; Li, Y. A-C-I Fuzzy Logic System Structure-Based Reconfigurable Robot Manipulators Finite-Time Optimal Backstepping Force/Position Control. IEEE Robot. Autom. Lett. 2025, 10, 8730–8737. [Google Scholar] [CrossRef]
- Wang, Y.; Wu, Z. Machine Learning Model-based Optimal Tracking Control of Nonlinear Affine Systems with Safety Constraints. Int. J. Robust Nonlinear Control 2025, 35, 511–535. [Google Scholar] [CrossRef]
- Wang, Y.; Sagawa, R.; Yoshiyasu, Y. A Hierarchical Robot Learning Framework for Manipulator Reactive Motion Generation via Multi-Agent Reinforcement Learning and Riemannian Motion Policies. IEEE Access 2023, 11, 126979–126994. [Google Scholar] [CrossRef]
- Weber, J.; Schmidt, M. An Improved Approach for Inverse Kinematics and Motion Planning of an Industrial Robot Manipulator with Reinforcement Learning. In Proceedings of the 2021 Fifth IEEE International Conference on Robotic Computing (IRC), Taichung, Taiwan, November 2021; IEEE; pp. 10–17. [Google Scholar]
- Wu, P.; Su, H.; Dong, H.; Liu, T.; Li, M.; Chen, Z. An Obstacle Avoidance Method for Robotic Arm Based on Reinforcement Learning. Ind. Robot Int. J. Robot. Res. Appl. 2025, 52, 9–17. [Google Scholar] [CrossRef]
- Wu, T.; Zhong, F.; Geng, Y.; Wang, H.; Zhu, Y.; Wang, Y.; Dong, H. GraspARL: Dynamic Grasping via Adversarial Reinforcement Learning 2022.
- Wu, Y.; Niu, W.; Kong, L.; Yu, X.; He, W. Fixed-Time Neural Network Control of a Robotic Manipulator with Input Deadzone. ISA Trans. 2023, 135, 449–461. [Google Scholar] [CrossRef]
- Wu, Y.; Zhang, L.; Wang, Z.; Feng, Y. Output Feedback Control for Uncertain Robot Manipulators Based on Reinforcement Learning. In Proceedings of the Proceedings of the 2024 7th International Conference on Artificial Intelligence and Pattern Recognition, September 20 2024; ACM: Xiamen China; pp. 1136–1140. [Google Scholar]
- Wu, Y.; Pan, L.; Wu, W.; Wang, G.; Miao, Y.; Xu, F.; Wang, H. RL-GSBridge: 3D Gaussian Splatting Based Real2Sim2Real Method for Robotic Manipulation Learning. In Proceedings of the 2025 IEEE International Conference on Robotics and Automation (ICRA), Atlanta, GA, USA, May 19 2025; IEEE; pp. 192–198. [Google Scholar]
- Xhin, O.Y.; Jo, H.S. Self-Learning Robot Manipulator Controller Using Reinforcement Learning. In Proceedings of the TENCON 2024 - 2024 IEEE Region 10 Conference (TENCON), December 1 2024; IEEE: Singapore, Singapore; pp. 707–710. [Google Scholar]
- Xiao, R.; Yang, C.; Jiang, Y.; Zhang, H. One-Shot Sim-to-Real Transfer Policy for Robotic Assembly via Reinforcement Learning with Visual Demonstration. Robotica 2024, 42, 1074–1093. [Google Scholar] [CrossRef]
- Xie, J.; Shao, Z.; Li, Y.; Guan, Y.; Tan, J. Deep Reinforcement Learning With Optimized Reward Functions for Robotic Trajectory Planning. IEEE Access 2019, 7, 105669–105679. [Google Scholar] [CrossRef]
- Xu, B.; Hassan, T.; Hussain, I. Seg-CURL: Segmented Contrastive Unsupervised Reinforcement Learning for Sim-to-Real in Visual Robotic Manipulation. IEEE Access 2023, 11, 50195–50204. [Google Scholar] [CrossRef]
- Xu, S.; Wu, Z. Adaptive Learning Control of Robot Manipulators via Incremental Hybrid Neural Network. Neurocomputing 2024, 568, 127045. [Google Scholar] [CrossRef]
- Yang, D.; Miao, C.; Liu, Y.; Du, X.; Zheng, Y. Improved DQN-Based Intelligent Trajectory Control for Coal Gangue Sorting Robotic Manipulators. IEEE Sens. J. 2025, 25, 40632–40650. [Google Scholar] [CrossRef]
- Yang, M.; Lin, Y.; Church, A.; Lloyd, J.; Zhang, D.; Barton, D.A.W.; Lepora, N.F. Sim-to-Real Model-Based and Model-Free Deep Reinforcement Learning for Tactile Pushing. IEEE Robot. Autom. Lett. 2023, 8, 5480–5487. [Google Scholar] [CrossRef]
- Yang, P.; Zhang, S.; Yu, X.; He, W. Reinforcement-Learning-Based Finite Time Fault Tolerant Control for a Manipulator With Actuator Faults. IEEE Trans. Cybern. 2025, 55, 2621–2632. [Google Scholar] [CrossRef]
- Yang, Q.; Zhang, F.; He, W.; Wang, C. Deterministic Learning-Based Knowledge Fusion Neural Control for Robot Manipulators with Predefined Performance. In Intelligent Robotics and Applications; Lecture Notes in Computer Science; Lan, X., Mei, X., Jiang, C., Zhao, F., Tian, Z., Eds.; Springer Nature Singapore: Singapore, 2025; Vol. 15208, pp. 174–188. ISBN 978-981-96-0782-2. [Google Scholar]
- Yang, Y.; Ding, Z.; Wang, R.; Modares, H.; Wunsch, D.C. Data-Driven Human-Robot Interaction Without Velocity Measurement Using Off-Policy Reinforcement Learning. IEEECAA J. Autom. Sin. 2022, 9, 47–63. [Google Scholar] [CrossRef]
- Yuan, Z.; Wei, T.; Gu, L.; Hua, P.; Liang, T.; Chen, Y.; Xu, H. HERMES: Human-to-Robot Embodied Learning from Multi-Source Motion Data for Mobile Dexterous Manipulation 2025.
- Zhang, F.; Leitner, J.; Milford, M.; Upcroft, B.; Corke, P. Towards Vision-Based Deep Reinforcement Learning for Robotic Motion Control 2015.
- Zhang, J.; Zhou, X.; Zhou, J.; Qiu, S.; Liang, G.; Cai, S.; Bao, G. A High-Efficient Reinforcement Learning Approach for Dexterous Manipulation. Biomimetics 2023, 8, 264. [Google Scholar] [CrossRef]
- Zhang, J. Research on Precise Control and Optimization of Intelligent Robotic Arm Based on Artificial Intelligence. In Proceedings of the 2025 IEEE 7th International Conference on Power, Intelligent Computing and Systems (ICPICS), Shenyang, China, August 29 2025; IEEE; pp. 197–203. [Google Scholar]
- Zhang, P.; Zhang, J.; Kan, J. A Research on Manipulator-Path Tracking Based on Deep Reinforcement Learning. Appl. Sci. 2023, 13, 7867. [Google Scholar] [CrossRef]
- Zhang, S.; Xia, Q.; Chen, M.; Cheng, S. Multi-Objective Optimal Trajectory Planning for Robotic Arms Using Deep Reinforcement Learning. Sensors 2023, 23, 5974. [Google Scholar] [CrossRef] [PubMed]
- Zhang, T.; Zhang, K.; Lin, J.; Louie, W.-Y.G.; Huang, H. Sim2real Learning of Obstacle Avoidance for Robotic Manipulators in Uncertain Environments. IEEE Robot. Autom. Lett. 2022, 7, 65–72. [Google Scholar] [CrossRef]
- Zhang, Z.; Chen, Z. Modeling and Control of Robotic Manipulators Based on Symbolic Regression. IEEE Trans. Neural Netw. Learn. Syst. 2023, 34, 2440–2450. [Google Scholar] [CrossRef]
- Zhang, Z.; Zheng, C. Simulation of Robotic Arm Grasping Control Based on Proximal Policy Optimization Algorithm. J. Phys. Conf. Ser. 2022, 2203, 012065. [Google Scholar] [CrossRef]
- Zhan, A.; Zhao, R.; Pinto, L.; Abbeel, P.; Laskin, M. A Framework for Efficient Robotic Manipulation 2022.
- Zhao, C.; Wei, Y.; Xiao, J.; Sun, Y.; Zhang, D.; Guo, Q.; Yang, J. Inverse Kinematics Solution and Control Method of 6-Degree-of-Freedom Manipulator Based on Deep Reinforcement Learning. Sci. Rep. 2024, 14, 12467. [Google Scholar] [CrossRef]
- Zhao, W.; Queralta, J.P.; Qingqing, L.; Westerlund, T. Towards Closing the Sim-to-Real Gap in Collaborative Multi-Robot Deep Reinforcement Learning. In Proceedings of the 2020 5th International Conference on Robotics and Automation Engineering (ICRAE), November 20 2020; IEEE: Singapore, Singapore; pp. 7–12. [Google Scholar]
- Zheng, L.; Wang, Y.; Yang, R.; Wu, S.; Guo, R.; Dong, E. An Efficiently Convergent Deep Reinforcement Learning-Based Trajectory Planning Method for Manipulators in Dynamic Environments. J. Intell. Robot. Syst. 2023, 107, 50. [Google Scholar] [CrossRef]
- Zhou, J.; Zheng, H.; Zhao, D.; Chen, Y. Intelligent Control of Manipulator Based on Deep Reinforcement Learning. In Proceedings of the 2021 12th International Conference on Mechanical and Aerospace Engineering (ICMAE), Athens, Greece, July 16 2021; IEEE; pp. 275–279. [Google Scholar]
- Zhu, A.; Ai, H.; Chen, L. A Fuzzy Logic Reinforcement Learning Control with Spring-Damper Device for Space Robot Capturing Satellite. Appl. Sci. 2022, 12, 2662. [Google Scholar] [CrossRef]
- Zhu, H.; Gupta, A.; Rajeswaran, A.; Levine, S.; Kumar, V. Dexterous Manipulation with Deep Reinforcement Learning: Efficient, General, and Low-Cost. In Proceedings of the 2019 International Conference on Robotics and Automation (ICRA), Montreal, QC, Canada, May 2019; IEEE; pp. 3651–3657. [Google Scholar]
- Zhu, Q.; Bian, J. Reinforcement Learning-Based Optimized Tracking Control for Uncertain Robot Manipulators with Composite Constraints. In Proceedings of the 2025 6th International Conference on Artificial Intelligence and Electromechanical Automation (AIEA), Hefei, China, August 1 2025; IEEE; pp. 432–439. [Google Scholar]
- Zong, X.; Tan, S. Deep Reinforcement Learning-Based Multi-Task Optimization Algorithm for Industrial Robots. In Proceedings of the 2025 IEEE 2nd International Conference on Electronics, Communications and Intelligent Science (ECIS), Yueyang, China, May 23 2025; IEEE; pp. 1–5. [Google Scholar]
- Modares, H.; Ranatunga, I.; Lewis, F.L.; Popa, D.O. Optimized Assistive Human–Robot Interaction Using Reinforcement Learning. IEEE Trans. Cybern. 2016, 46, 655–667. [Google Scholar] [CrossRef]
- Perrusquía, A.; Yu, W.; Soria, A. Position/Force Control of Robot Manipulators Using Reinforcement Learning. Ind. Robot Int. J. Robot. Res. Appl. 2019, 46, 267–280. [Google Scholar] [CrossRef]
- Sasaki, M.; Muguro, J.; Kitano, F.; Njeri, W.; Matsushita, K. Sim–Real Mapping of an Image-Based Robot Arm Controller Using Deep Reinforcement Learning. Appl. Sci. 2022, 12, 10277. [Google Scholar] [CrossRef]
- Wang, X.; Guo, S.; Xu, Z.; Zhang, Z.; Sun, Z.; Xu, Y. A Robotic Teleoperation System Enhanced by Augmented Reality for Natural Human–Robot Interaction. Cyborg Bionic Syst. 2024, 5, 0098. [Google Scholar] [CrossRef] [PubMed]
- AlAttar, A.; Chappell, D.; Kormushev, P. Kinematic-Model-Free Predictive Control for Robotic Manipulator Target Reaching With Obstacle Avoidance. Front. Robot. AI 2022, 9, 809114. [Google Scholar] [CrossRef] [PubMed]
- Baselizadeh, A.; Khaksar, W.; Torresen, J. Motion Planning and Obstacle Avoidance for Robot Manipulators Using Model Predictive Control-Based Reinforcement Learning. In Proceedings of the 2022 IEEE International Conference on Systems, Man, and Cybernetics (SMC), October 9 2022; IEEE: Prague, Czech Republic; pp. 1584–1591. [Google Scholar]
- Ding, Z.; Song, C.; Xu, J.; Dou, Y. Human-Robot Interaction System Design for Manipulator Control Using Reinforcement Learning. In Proceedings of the 2021 36th Youth Academic Annual Conference of Chinese Association of Automation (YAC), May 28 2021; IEEE: Nanchang, China; pp. 660–665. [Google Scholar]
- Haddad, G.S.Q.; Akkar, R.H.A. Intelligent Swarm Algorithms for Optimizing Nonlinear Sliding Mode Controller for Robot Manipulator. Int. J. Electr. Comput. Eng. IJECE 2021, 11, 3943. [Google Scholar] [CrossRef]
- Jeong, R.; Aytar, Y.; Khosid, D.; Zhou, Y.; Kay, J.; Lampe, T.; Bousmalis, K.; Nori, F. Self-Supervised Sim-to-Real Adaptation for Visual Robotic Manipulation. In Proceedings of the 2020 IEEE International Conference on Robotics and Automation (ICRA), Paris, France, May 2020; IEEE; pp. 2718–2724. [Google Scholar]
- Jhan, Z.-Y.; Lee, C.-H.; Lin, C.-M. A New Adaptive Fuzzy Neural Force Controller for Robots Manipulator Interacting with Environments. In Proceedings of the 2015 International Conference on Machine Learning and Cybernetics (ICMLC), July 2015; IEEE: Guangzhou, China; pp. 572–577. [Google Scholar]
- Kallel, H.; Iqbal, K. Online Estimation of Manipulator Dynamics for Computed Torque Control of Robotic Systems. Sensors 2025, 25, 6831. [Google Scholar] [CrossRef]
- Kuang, C.; Han, L.; Liang, K.; Mao, J.; Zhang, C. Self-Optimization Composite Dynamic Control for Trajectory Tracking of Robot Manipulators via Deep Reinforcement Learning. J. Control Decis. 2025, 1–18. [Google Scholar] [CrossRef]
- Li, C.; Liu, F.; Wang, Y.; Buss, M. Concurrent Learning-Based Adaptive Control of an Uncertain Robot Manipulator With Guaranteed Safety and Performance. IEEE Trans. Syst. Man Cybern. Syst. 2022, 52, 3299–3313. [Google Scholar] [CrossRef]
- Li, J.; Peng, X.; Li, B.; Sreeram, V.; Wu, J.; Chen, Z.; Li, M. Model Predictive Control for Constrained Robot Manipulator Visual Servoing Tuned by Reinforcement Learning. Math. Biosci. Eng. 2023, 20, 10495–10513. [Google Scholar] [CrossRef] [PubMed]
- Li, Z.; Liu, J.; Huang, Z.; Peng, Y.; Pu, H.; Ding, L. Adaptive Impedance Control of Human–Robot Cooperation Using Reinforcement Learning. IEEE Trans. Ind. Electron. 2017, 64, 8013–8022. [Google Scholar] [CrossRef]
- Lu, T.; Zhang, K.; Shi, Y. Robust Data-Driven Model Predictive Control via On-Policy Reinforcement Learning for Robot Manipulators. In Proceedings of the 2024 IEEE 7th International Conference on Industrial Cyber-Physical Systems (ICPS), St. Louis, MO, USA, May 12 2024; IEEE; pp. 1–6. [Google Scholar]
- Pradhan, S.K.; Subudhi, B. Position Control of a Flexible Manipulator Using a New Nonlinear Self-Tuning PID Controller. IEEECAA J. Autom. Sin. 2020, 7, 136–149. [Google Scholar] [CrossRef]
- Sacchi, N.; Incremona, G.P.; Ferrara, A. Integral Sliding Modes Generation via DRL-Assisted Lyapunov-Based Control for Robot Manipulators. In Proceedings of the 2023 European Control Conference (ECC), Bucharest, Romania, June 13 2023; IEEE; pp. 1–6. [Google Scholar]
- Shcherbakov, V.; Bragina, A.; Shiryaev, V. Robot Manipulator Control Using Predictive Model under Conditions of Incomplete Information. In Proceedings of the 2020 Global Smart Industry Conference (GloSIC), November 17 2020; IEEE: Chelyabinsk, Russia; pp. 281–286. [Google Scholar]
- Wu, H.; Yang, J.; Huang, S.; Li, J. Adaptive Optimal Admittance Control for Robotic Precision Grinding Based on Improved Normalized Advantage Function. IEEEASME Trans. Mechatron. 2025, 30, 5166–5178. [Google Scholar] [CrossRef]
- Xie, Z.; Sun, T.; Kwan, T.; Wu, X. Motion Control of a Space Manipulator Using Fuzzy Sliding Mode Control with Reinforcement Learning. Acta Astronaut. 2020, 176, 156–172. [Google Scholar] [CrossRef]
- Yousef, H.; Siddique, T.; Fareh, R.; Choutri, K.; Dylov, D.; Khadraoui, S. Reinforcement Learning-Tuned Active Disturbance Rejection Controller for Tracking Control of a 4-DoF Robot Manipulator. In Proceedings of the 2024 12th International Conference on Control, Mechatronics and Automation (ICCMA), November 11 2024; IEEE: London, United Kingdom; pp. 125–130. [Google Scholar]
- Zhang, Y.; Mo, K.; Shen, F.; Xu, X.; Zhang, X.; Yu, J.; Yu, C. Self-Adaptive Robust Motion Planning for High DoF Robot Manipulator Using Deep MPC. In Proceedings of the 2024 3rd International Conference on Robotics, Artificial Intelligence and Intelligent Control (RAIIC), July 5 2024; IEEE: Mianyang, China; pp. 139–143. [Google Scholar]






| Review ID | Summary | Contribution |
|---|---|---|
| Mareco et al. [21] | This review analyzes the use of AI enabled methods in control systems, focusing on control-oriented techniques such as fuzzy logic, neuro fuzzy control, neural networks, and swarm or evolutionary optimization. The analysis is mainly qualitative and centered on methodological characteristics within specific application settings. | Provides a structured mapping of 188 studies across renewable energy, robotics, agriculture, and industrial processes, highlighting how AI has been adopted in application specific control problems. Its value lies in identifying major methodological trends, limitations, and research gaps in intelligent control design. |
| Nahavandi et al. [22] | This review evaluates machine learning for robotic manipulators, covering deep learning, RL, and imitation learning. It analyzes sim-to-real transfer and practical deployment across industrial, medical, and space sectors. | Provides a qualitative synthesis of machine learning strategies in robotic manipulation, utilizing algorithmic complexity estimates and tabulated comparisons instead of formal statistical analysis. It offers researchers a structured framework for evaluating the computational trade-offs and practical requirements of various control architectures across diverse operational domains. |
| Waseem et al. [17] | This qualitative review surveys the evolution of robotic manipulator control technologies from 2016 to 2024, spanning methods from classical PID and linear models to intelligent AI, hybrid, and quantum-inspired frameworks. No statistical analysis was performed; the study taxonomizes these strategies by their underlying architecture and optimization logic to highlight current operational trend | Provides a foundational mapping of control theories against emerging paradigms such as digital twins, federated learning, and blockchain to address persistent challenges in non-linear dynamics and real-time performance. It informs future research directions by identifying critical gaps in multi-agent system integration and scalable control for unstructured environments. |
| Our review | This review analyzes 343 studies on AI-based robot manipulator control published between 2015 and 2025, focusing on research trends across functional roles, application domains, robot types, and evaluation approaches. It highlights major patterns in the distribution and development of AI methods within the manipulator control pipeline. | Provides the first structured quantitative mapping of AI based robot manipulator control over the past decade, together with a unified taxonomy of functional AI roles in the control pipeline. The review identifies major research concentrations and persistent gaps, providing a data driven reference for future development in underrepresented control functions. |
| Databases | Search Strings |
|---|---|
| IEEEXplore, PubMed, Scopus | ("robot manipulator" OR "robotic arm" OR "industrial robot" OR manipulator) AND (AI OR "artificial intelligence" OR "machine learning" OR "deep learning" OR "reinforcement learning") AND ("control" OR "adaptive control" OR "intelligent control" OR "learning-based control") |
| Google Scholar | ("robot manipulator" OR "robotic arm" OR "industrial robot" OR manipulator) AND (AI OR "artificial intelligence" OR "machine learning" OR "deep learning" OR "reinforcement learning") AND ("control" OR "adaptive control" OR "intelligent control" OR "learning-based control") "robot manipulator" AND ("reinforcement learning" OR DRL) AND ("trajectory optimization" OR "motion planning" OR control) "robot manipulator" OR "robotic arm" OR "robotic manipulation" AND ("reinforcement learning" OR DRL OR "deep reinforcement learning") AND (grasping OR "dexterous manipulation" OR "sim-to-real" OR sim2real) |
| Functional Role | Authors | Year | Robot Types | Application Area | Control Technique | Methods | Learning Paradigm | Evaluation Method | |||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Perception and Estimation | Bertono et al. [24] | 2019 | Serial | NS | Learning-based pose estimation | CNN | Supervised | Experiment | |||||||
| Catalán et al. [25] | 2023 | Exoskeleton | Rehabilitation | Vision-guided learning-based | EEG/EOG classification, RGB-D object pose estimation, DMP | Supervised | Experiment | ||||||||
| Chen et al. [26] | 2023 | Serial | Manufacturing | Learning-based grasp planning | Min-Pnet, 1D CNN | Supervised | Both | ||||||||
| Chen et al. [27] | 2018 | Serial | Manufacturing | Vision-guided learning-based | Faster R-CNN | Supervised | Experiment | ||||||||
| Ghiasvand et al. [28] | 2024 | Serial | Aerospace | DNN-based visual servoing | DNN | Supervised | Both | ||||||||
| Gul et al.[29] | 2023 | Serial | Rehabilitation | Learning-based | KNN, LSTM, SVM, DT | Supervised | Experiment | ||||||||
| Heris et al. [30] | 2022 | Magnetic | Healthcare | Learning-based magnetic field estimation | ANN, Simulated Annealing | Supervised | Both | ||||||||
| Kirda et al. [31] | 2025 | Serial | Manufacturing | Vision-guided | YOLOv5 | Supervised | Experiment | ||||||||
| Kondratenko et al. [32] | 2022 | Mobile | NS | ML-based sensor processing | Fuzzy logic, neuro-fuzzy, YOLOv2, ResNet34 | Supervised | Experiment | ||||||||
| Kružić et al. [33] | 2020 | Serial | NS | Sensorless force estimation for interaction control | MLP, 1D CNN, LSTM | Supervised | Experiment | ||||||||
| Liu et al. [34] | 2021 | Mobile | Garbage Management | Learning-based | YOLACT, GPD | Supervised | Experiment | ||||||||
| Liu et al. [35] | 2022 | Serial | Manufacturing | Vision-guided DRL-based | DQN, FCN (DenseNet-121) | RL | Both | ||||||||
| Luo et al. [36] | 2020 | Industrial | Manufacturing | Vision-guided learning-based | YOLO-v2-ROI | Supervised | Experiment | ||||||||
| Marchionna et al. [37] | 2023 | Serial | Dexterity Game | Vision-guided visual servoing | YOLACT++ | Supervised | Experiment | ||||||||
| Martin et al. [38] | 2018 | Serial | Manufacturing | Pose estimation-based | OpenPose, HMR | Supervised | Experiment | ||||||||
| Panasiuk [39] | 2025 | NS | Manufacturing | Vision-guided positioning | YOLOv8 | Supervised | Experiment | ||||||||
| Piltan [40] | 2020 | Serial | Multiple | Neuro-fuzzy-based fault-tolerant | SVM, NN, Fuzzy logic | Supervised | Experiment | ||||||||
| Pitan et al. [41] | 2020 | Serial | Multiple | Adaptive fuzzy fault-tolerant | DT, TSK fuzzy | Supervised | Experiment | ||||||||
| Sacchi et al. [42] | 2023 | Serial | NS | DRL-based fault estimation | TD3 | RL | Simulation | ||||||||
| Shukla et al. [43] | 2021 | Serial | Multiple | Learning-based grasp pose estimation | GA, DQN | RL, Supervised, Evolutionary | Experiment | ||||||||
| Wang et al. [44] | 2024 | Myoelectric | Rehabilitation | sEMG gesture recognition | MS-CLSTM, CNN-LSTM | Supervised | Experiment | ||||||||
| Planning | Ahn et al. [45] | 2025 | Serial | Logistics | Learning-based planning | PPO, CMPNet | RL, IL | Both | |||||||
| Ak et al. [46] | 2022 | Serial | Rehabilitation | Learning-based | GoogLeNet | Supervised | Experiment | ||||||||
| Andersen et al. [47] | 2015 | Serial | Manufacturing | Learning-based predictive | NN, RT, GP | Supervised | Experiment | ||||||||
| Andriyanov [48] | 2023 | Serial | Agriculture | RL-based planning | Q-learning, YOLOv5 | RL, Supervised | Both | ||||||||
| Azizi [49] | 2020 | Serial | Manufacturing | Learning-based optimization | ANN, GA | Supervised | Simulation | ||||||||
| Batzianouis et al. [50] | 2021 | Serial | Rehabilitation | IRL-based planning | GP IRL, LDA | RL, Supervised | Experiment | ||||||||
| Bucinskas et al. [51] | 2022 | Serial | Manufacturing | DRL-based | DQN | RL | Experiment | ||||||||
| Chen et al. [52] | 2023 | Serial | Heatlhcare | DRL-based planning | PPO, DNN | RL, Supervised | Both | ||||||||
| Cheng et al. [53] | 2024 | Humanoid | NS | Learning-based planning | CNN | Supervised | Both | ||||||||
| Chi et al. [54] | 2018 | Platform | Healthcare | RL-based | PI2, DMP | RL, IL | Both | ||||||||
| Ebert et al. [55] | 2018 | Serial | NS | Learning-based | Video prediction model, Registration network, Meta-learned classifier | Self-supervised | Both | ||||||||
| Emamzadeh et al. [56] | 2015 | Serial | NS | Fuzzy-based hierarchical | TSK fuzzy predictor | Supervised | Simulation | ||||||||
| Jaquier et al. [57] | 2021 | Humanoid | NS | Learning-based | Tensor-based GMM, GMR | IL, Unsupervised | Both | ||||||||
| Karimi et al. [58] | 2022 | Redundant Serial | NS | DRL-based | DQN | RL | Simulation | ||||||||
| Kim et al. [59] | 2020 | Serial | Manufacturing | DRL-based planning | TD3, HER | RL | Both | ||||||||
| Kwon et al. [60] | 2020 | Robotic Hand | Rehabilitation | Learning-based | CNN | Supervised | Experiment | ||||||||
| Lee et al. [61] | 2025 | serial | Rehabilitation | Copilot-assisted | EEGNet, PPO, Grounding DINO | RL | Experiment | ||||||||
| Liu et al. [62] | 2022 | Serial | Manufacturing | Learning-based | TAC-CA, ANN | RL, Hybrid | Both | ||||||||
| Liu et al. [63] | 2021 | Serial | Manufacturing | RL-based | AC, Double Q-Learning | RL | Both | ||||||||
| Liu et al. [64] | 2021 | Serial | NS | RL-based | RL, Q-learning | RL, Hybrid | Both | ||||||||
| Li et al. [65] | 2024 | Redundant Serial | Manufacturing | DRL-based planning | AC DRL, ANN | RL, Hybrid | Both | ||||||||
| Li et al. [66] | 2023 | Serial | Manufacturing | DRL-based | DQN, Consensus-based training | RL | Both | ||||||||
| Li et al. [67] | 2023 | Serial | NS | DRL-based | DRL, ResNet-50 | RL, Self-supervised | Both | ||||||||
| Parag et al. [68] | 2025 | Serial | NS | Learning-based | Sobolev regression, NN | Hybrid | Simulation | ||||||||
| Parák et al. [69] | 2024 | Serial | Manufacturing | DRL-based | DDPG, TD3, Soft AC | RL | Simulation | ||||||||
| Prianto et al. [70] | 2021 | Serial | Manufacturing | Learning-based planning | Soft AC, HER | RL, Hybrid | Both | ||||||||
| Sacchi et al. [71] | 2021 | Serial | Manufacturing | DRL-based | DQN, Q-learning | RL | Simulation | ||||||||
| Solowjow et al. [72] | 2020 | Serial | Multiple | Learning-based grasping planning | MobileNet-SSD, FC-GQ-CNN | Supervised | Both | ||||||||
| Sunwoo et al. [73] | 2021 | Serial | Manufacturing | DRL-based | D3QN, PER | RL | Simulation | ||||||||
| Wang et al. [74] | 2022 | Serial | Manufacturing | Hybrid imitation-RL | HGCIL, DMP, Soft AC | RL, IL, Hybrid | Both | ||||||||
| Wang et al. [75] | 2025 | Serial | Manufacturing | DRL-based trajectory planning | DDPG, Autoencoder | RL, Unsupervised | Both | ||||||||
| Wang et al. [76] | 2023 | Serial | Manufacturing | DRL-based planning | DDQN, Q-learning | RL | Simulation | ||||||||
| Wilson et al. [77] | 2025 | NS | Rehabilitation | DRL-based | DRL-ANN, Improved ResNet, MSBO | RL, Supervised | Both | ||||||||
| Wong et al. [78] | 2019 | Serial | Manufacturing | ACO-based planning | ACO, k-means clusterin | Unsupervised | Both | ||||||||
| Yang et al. [79] | 2019 | Serial | NS | Learning-based | DMP, GMM, GMR, RBFNN | IL, Supervised, Hybrid | Experiment | ||||||||
| Zeng et al. [79] | 2018 | Serial | NS | DRL-based with MP | Q-learning, FCN, DenseNet | RL, Self-supervised | Both | ||||||||
| Zhang [80] | 2025 | Serial | NS | Hierarchical DRL-based | TCN-attention, dDRL, RL parameter adaptatio | RL | Both | ||||||||
| Zhao et al. [81] | 2025 | serial | Multiple | DRL-based planning | M2ACD | Supervised | Both | ||||||||
| Zhong et al. [82] | 2022 | serial | Manufacturing | DRL-based | DDPG | RL, Hybrid | Simulation | ||||||||
| Learning Control | Ahmed et al. [83] | 2022 | Serial | NS | Adaptive sliding mode | Adaptive Law | - | Simulation | |||||||
| Aiello [84] | 2020 | Serial | Service | DRL-based | DDPG, HER | RL | Simulation | ||||||||
| Al-Shanoon et al. [85] | 2021 | Serial | NS | DRL-based | DRL, Q-learning, Fully convolutional network | RL, Self-supervised | Both | ||||||||
| Alhousani et al. [86] | 2023 | Serial | NS | Geometric RL-based | Geometric RL, Soft AC, PPO, PoWER, CMA-ES | RL | Both | ||||||||
| Aljalbout et al. [87] | 2024 | Serial | NS | RL-based | PPO, RL | RL | Both | ||||||||
| Alles et al. [88] | 2022 | Serial | Manufacturing | DRL-based | Soft AC, HER | RL | Both | ||||||||
| Amaya et al. [89] | 2023 | Redundant Serial | Manufacturing | Neuromorphic RL-based | Soft AC, SNN | RL | Simulation | ||||||||
| An et al. [90] | 2024 | Serial | Rahabilitation | ADP-based optima | ADP, Critic NN, RBFNN | RL | Experiment | ||||||||
| Avhad et al. [91] | 2024 | Serial | NS | Adaptive DRL-based | DRL, Model ensembles, Q-networks | RL | Both | ||||||||
| Azimirad et al. [92] | 2024 | Mobile Serial | NS | Learning-based | SNN, RL, STDP | RL | Simulation | ||||||||
| Añazco et al. [93] | 2021 | Anthropomorphic Hand | Service | DRL-based | DRL, 3D CNN | RL, Supervised | Simulation | ||||||||
| Baek et al. [94] | 2022 | Serial | NS | DRL-based | Soft AC | RL | Simulation | ||||||||
| Barnoy et al. [95] | 2022 | Mobile magnetic needle | Healthcare | Learning-based | TD3, NN dynamics model | RL, Supervised | Both | ||||||||
| Bashabsheh [96] | 2025 | Serial | NS | RL-based | Policy-gradient RL | RL | Simulation | ||||||||
| Bejar et al. [97] | 2021 | Serial | NS | DRL-based | DDPG | RL | Simulation | ||||||||
| Blaise et al. [98] | 2023 | Serial | Aerospace | DRL-based | DDPG | RL | Simulation | ||||||||
| Brito et al. [99]3/28/2026 4:13:00 AM | 2020 | Serial | Manufacturing | RL-based | AC, LSTM | RL, Supervised | Experiment | ||||||||
| Calderón-Cordova et al. [100] | 2024 | Serial | Industrial | DRL-based | DDPG, DRL | RL | Simulation | ||||||||
| Cao et al. [101] | 2023 | Serial | NS | Learning-based | RL, AC, RBFNN | RL | Both | ||||||||
| Carron et al. [102] | 2019 | Compliant Serial | NS | Learning-based model predictiv | GP | Supervised | Experiment | ||||||||
| Castelli et al. [103] | 2017 | Serial | Manufacturing | Learning-based visual servoing | GMM, GMR | Supervised | Experiment | ||||||||
| Chen et al. [104] | 2018 | Parallel | NS | Neural-dynamics-based | Zeroing neural-dynamics, ZND model | - | Simulation | ||||||||
| Chen [105] | 2021 | Dexterous Hand | Multiple | Learning-based | DQN, PPO, Soft AC, DAPG | RL, IL, Self-supervised | Simulation | ||||||||
| Chen et al. [106] | 2022 | Redundant Serial | NS | DRL-based | Soft AC, DDPG | RL | Simulation | ||||||||
| Chen et al. [107] | 2016 | Serial | NS | Robust adaptive compensation | Adaptive fuzzy | Hybrid | Both | ||||||||
| Chen et al. [108] | 2022 | Redundant Serial | NS | DRL-based | Soft AC, Prioritized Experience Replay | RL | Simulation | ||||||||
| Chen et al. [109] | 2021 | Redundant Serial | Manufacturing | DRL-based | Soft AC | RL | Both | ||||||||
| Chen et al. [110] | 2021 | Serial | Manufacturing | Learning-based | multi-layer NN | Supervised | Both | ||||||||
| Chen [111] | 2025 | Serial | NS | DRL-based | DDPG | RL | Simulation | ||||||||
| Chen et al. [112] | 2023 | Serial | NS | Vision-guided DRL-based | YOLOv3, Soft AC | RL | Both | ||||||||
| Chen et al. [113] | 2021 | Dexterous Hand | NS | RL-based | PPO, TRPO, Soft AC, MAPPO, HAPPO | RL | Simulation | ||||||||
| Chen et al. [114] | 2024 | Dexterous Hand | NS | Learning-based | PPO, Soft AC, TRPO, DAPG, HAPPO, HATRPO, MAPPO, BCQ, TD3+BC, IQL, ProMP | RL, IL | Simulation | ||||||||
| Chen et al. [115] | 2023 | Serial | NS | DRL-based | Soft AC, MLP | RL, IL | Both | ||||||||
| Chen et al. [116] | 2024 | Redundant Serial | NS | DRL-based | DDPG, TD3, Soft AC | RL | Both | ||||||||
| Christen et al. [117] | 2019 | Dexterous Hand | NS | DRL-based | DDPG, DRL | RL, IL | Simulation | ||||||||
| Chu et al. [118] | 2020 | Redundant Serial | NS | DRL-based | DDPG, D4PG | RL | Simulation | ||||||||
| Cotrim et al. [119] | 2021 | Serial | Manufacturing | RL-based | REINFORCE, DQN | RL | Simulation | ||||||||
| Cui et al. [120] | 2025 | Serial | Multiple | DRL-based | Soft AC | RL | Both | ||||||||
| Cutler et al. [121] | 2024 | Serial | NS | DRL-based | TD3, Soft AC, DDPG | RL | Experiment | ||||||||
| Ding et al. [122] | 2021 | Serial | NS | DRL-based | TD3 | RL | Both | ||||||||
| Dong [123] | 2024 | NS | NS | RL-based adaptive | RL, PG | RL | Both | ||||||||
| Dong et al. [124] | 2023 | Redundant Serial | NS | DRL-based | DDPG | RL | Simulation | ||||||||
| Ducaju et al. [125] | 2024 | Redundant Serial | Manufacturing | Iterative learning-based | Iterative Learning Control | - | Experiment | ||||||||
| Du et al. [126] | 2017 | Serial with RCM | Healthcare | Fuzzy RL-based admittance | Fuzzy Sarsa(λ), RL | RL | Experiment | ||||||||
| Ehrlich et al. [127] | 2022 | Serial | Rehabilitation | Neuromorphic learning-based | NEF, SNN, PES | Supervised | Both | ||||||||
| Enayati et al. [128] | 2024 | Serial | NS | Learning-based | PPO, RL | RL | Both | ||||||||
| Fareh et al. [129] | 2025 | Serial | NS | DRL-based | DDPG, PINNs | RL | Both | ||||||||
| Filho et al. [130] | 2025 | Serial | NS | Multi-agent DRL-based | Multi DQN, DQN | RL | Both | ||||||||
| Franceschetti et al. [131] | 2020 | Serial | NS | DRL-based | TRPO, DQN-NAF | RL | Both | ||||||||
| Fu et al. [132] | 2022 | Quadruped | NS | RL-based whole-body | PPO, RL | RL | Both | ||||||||
| Ganie et al. [133] | 2023 | Serial | NS | Learning-based | DNN, Elastic weight consolidation | Hybrid | Simulation | ||||||||
| Gao [134] | 2022 | Serial | NS | DRL-based | DDPG | RL | Both | ||||||||
| Garcia-Hernando et al. [135] | 2020 | Dexterous Hand | NS | Residual RL-based | PPO, Adversarial IL | RL, IL, Hybrid | Simulation | ||||||||
| Gawali et al. [136] | 2023 | NS | NS | Learning-based | TERL, RNN, RL | RL | Simulation | ||||||||
| Ghediri et al. [137] | 2022 | Serial | NS | DRL-based | DDPG, DRL | RL | Simulation | ||||||||
| Grandesso [138] | 2023 | Redundant Serial | NS | RL-based | AC RL, DDPG variant | RL | Simulation | ||||||||
| Gupta et al. [139] | 2015 | Serial | Manufacturing | Neuro-fuzzy-based | ANFIS | Hybrid | Simulation | ||||||||
| Gu et al. [140] | 2017 | Serial | NS | DRL-based | NAF, DDPG | RL | Both | ||||||||
| Haiderbhai et al. [141] | 2025 | Serial | Healthcare | Vision-guided learning-based | PPO, RL | RL | Both | ||||||||
| Hardman et al. [142] | 2022 | Serial | NS | DRL-based | DDPG | RL | Experiment | ||||||||
| Hazem et al. [143] | 2025 | Serial | NS | DRL-based | DDPG, LC-DDPG, TD3-ADX | RL | Simulation | ||||||||
| Heaton et al. [144] | 2023 | Redundant Serial | NS | DRL-based | Soft AC | RL | Simulation | ||||||||
| He et al. [145] | 2017 | Serial | NS | Learning-based | RBFNN | - | Simulation | ||||||||
| He et al. [146] | 2021 | Serial | NS | RL-based | AC, RBFNN | RL | Both | ||||||||
| He et al. [147] | 2018 | Flexible-joint Serial | Manufacturing | Learning-based | NN | Hybrid | Both | ||||||||
| Homsi et al. [148] | 2025 | Serial | Logistics | DRL-based | DQN, DQN variants, Self-attention | RL | Simulation | ||||||||
| Hosny et al. [149] | 2023 | Serial | Manufacturing | Learning-based | RL, AC, Value iteration | RL | Simulation | ||||||||
| Huang et al. [150] | 2024 | Serial | Manufacturing | DRL-based | Multi-agent TD3, H-memory | RL | Both | ||||||||
| Hu et al. [150] | 2024 | Serial | NS | DRL-based | Soft AC, GAIL, LSTM | RL, IL | Simulation | ||||||||
| Hu et al. [151] | 2024 | Serial | NS | DRL-based | Soft AC, ERND | RL | Simulation | ||||||||
| Hu et al. [152] | 2024 | Serial | NS | DRL-based | Soft AC, HER | RL | Simulation | ||||||||
| Hu et al. [153] | 2018 | Serial | NS | RL NN | RL, AC NN | RL | Simulation | ||||||||
| Hu et al. [154] | 2020 | Serial | NS | MBRL-based | RL, kernel methods | RL | Simulation | ||||||||
| Hu et al. [155] | 2023 | Dexterous Hand | NS | Learning-based | IRL, RL, Graph convolutional network | RL | Both | ||||||||
| Hwang et al. [156] | 2017 | Serial | NS | Neuro-fuzzy-based | Interval type-2 fuzzy logic | - | Simulation | ||||||||
| Incremona et al. [157] | 2021 | Serial | Manufacturing | DRL-based | DQN, NAF | RL | Both | ||||||||
| Iqdymat et al. [158] | 2025 | Serial | Logistics | DRL-based | DDPG, AC | RL | Both | ||||||||
| Iriondo et al. [159] | 2019 | Mobile | Logistics | DRL-based | DDPG, PPO | RL | Simulation | ||||||||
| Iwasaki et al. [160] | 2021 | Inverted Pendulum | NS | DRL-based | DDPG | RL | Simulation | ||||||||
| James et al. [161] | 2016 | Serial | NS | DRL-based | DQN | RL | Both | ||||||||
| Jeong et al. [162] | 2020 | Serial | Rehabilitation | Learning-based | MDCBN, CNN-BiLSTM | Supervised | Experiment | ||||||||
| Jiang et al. [163] | 2023 | Serial | Manufacturing | DRL-based | PPO, GAIL, Transformer | RL, IL | Both | ||||||||
| Jiang et al. [164] | 2024 | Serial | NS | Fuzzy RL-based optimal | fuzzy logic system, Integral RL, Value iteration | RL | Experiment | ||||||||
| Jiang et al. [165] | 2022 | Serial | Service | Vision-guided DRL-based | Asym-DDPG, Position-CycleGAN, Supervised learning | RL, Supervised, Hybrid | Both | ||||||||
| Jin et al. [166] | 2024 | Redundant Serial | NS | Learning-based | Echo State Network, Kalman Filter | Supervised | Both | ||||||||
| Joshi et al. [167] | 2020 | Hybrid | NS | DRL-based | DDQN, Grasp-Q-Network | RL, Off-policy | Both | ||||||||
| Josifovski et al. [168] | 2022 | Serial | NS | Learning-based | PPO | RL | Both | ||||||||
| Kalashnikov et al. [169] | 2018 | Serial | NS | Vision-guided DRL-based | QT-Opt, DQN, CEM | RL, Self-supervised | Both | ||||||||
| Kamali et al. [170] | 2020 | Serial | Industrial | DRL-based | PPO, DRL | RL | Both | ||||||||
| Kang et al. [171] | 2021 | Serial | NS | NN-based MPC | RBFNN, AC NN | RL, Hybrid | Simulation | ||||||||
| Kankashvar et al. [172] | 2015 | Parallel | NS | BBO-based PID | modified BBO | - | Simulation | ||||||||
| Kataoka et al. [173] | 2022 | Serial | NS | Learning-based | PPO, MLP | RL | Both | ||||||||
| Katyal et al. [174] | 2017 | Serial | NS | DRL-based | DRL, DQN | RL | Simulation | ||||||||
| Kaur et al. [175] | 2025 | Serial | Industrial | DRL-based | DRL, DQN | RL | Simulation | ||||||||
| Khan et al. [176] | 2020 | Redundant Serial | NS | Metaheuristic-based | BAORNN, RNN, Beetle antennae olfactory | - | Simulation | ||||||||
| Khodamipour et al. [177] | 2021 | Serial | NS | RL-based adaptive | RL, Fourier series expansion | RL | Simulation | ||||||||
| Kilinc et al. [178] | 2022 | Dexterous Hand | NS | RL-based | DDPG, HER | RL | Simulation | ||||||||
| Kim et al. [179] | 2024 | Anthropomorphic Gripper | NS | Learning-based | PPO | RL | Both | ||||||||
| Kuang [180] | 2023 | Anthropomorphic Gripper | NS | Learning-based | DDPG+HER, GAIL, GDP, PL-CGS, Goal-SGAIL | RL, IL | Simulation | ||||||||
| Kumar et al. [181] | 2017 | Serial | Multiple | GA-optimized fuzzy | GA, Fuzzy logic, Neuro-fuzzy | Unsupervised, Hybrid | Simulation | ||||||||
| Kumar et al. [182] | 2021 | Redundant Serial | NS | DRL-based | PPO, Deep neural policy | RL | Both | ||||||||
| Kunal et al. [183] | 2025 | Serial | Manufacturing | DRL-based | PPO, DNN | RL | Both | ||||||||
| Kurrek et al. [184] | 2020 | Serial | Manufacturing | DRL-based | Q-learning, DQN, PPO | RL | Simulation | ||||||||
| Lahmann et al. [185] | 2025 | Serial | NS | DRL-based | PPO | RL | Simulation | ||||||||
| Lee et al. [186] | 2022 | Redundant Serial | NS | Learning-based | RL, ANN | RL, Supervised | Simulation | ||||||||
| Lee et al. [187] | 2020 | Serial | NS | DRL-based | HER, DDPG | RL | Simulation | ||||||||
| Lee et al. [188] | 2024 | Serial | Rehabilitation | Learning-based | CNN-KF, PPO, Grounding DINO | RL, Supervised | Experiment | ||||||||
| Liang et al. [189] | 2024 | Serial | NS | RL-based adaptive | AC, NN | RL | Both | ||||||||
| Lin et al. [190] | 2025 | Humanoid | Service | Vision-guided learning-based | RL, PPO, Diffusion policy, Policy distillation | RL, IL | Both | ||||||||
| Lin et al. [191] | 2017 | Serial | NS | Cerebellar-inspired learning-based | Distributed cerebellar model, Spike-timing-dependent plasticity | RL | Both | ||||||||
| Lin et al. [192] | 2023 | Serial | NS | DRL-based | PPO, DRL, Pix2Pix GAN | RL | Both | ||||||||
| Lin et al. [193] | 2022 | Serial | NS | DRL-based | PPO, GAN | RL | Both | ||||||||
| Liu et al. [194] | 2021 | Serial | Multiple | Learning-based | RBFNN | Supervised, Hybrid | Simulation | ||||||||
| Liu et al. [195] | 2020 | Serial | NS | RL-based | MAPPO | RL | Both | ||||||||
| Liu et al. [196] | 2024 | Serial | NS | DRL-based | TD3 | RL | Simulation | ||||||||
| Liu et al. [197] | 2020 | Serial | NS | DRL-based | DQN | RL | Both | ||||||||
| Liu et al. [198] | 2019 | Redundant Serial | Multiple | Learning-based | LSTM | Supervised | Simulation | ||||||||
| Liu et al. [199] | 2021 | Serial | NS | DRL-based | Off-policy AC, DRL | RL | Simulation | ||||||||
| Liu et al. [200] | 2024 | Serial | NS | HRL-based | Hierarchical RL, PGPE | RL | Both | ||||||||
| Li et al. [201] | 2025 | Serial | Manufacturing | DRL-based | PPO, Simulated Annealing | RL | Both | ||||||||
| Li et al. [202] | 2023 | Redundant Serial | NS | Biomimetic learning-based | SNN, DDPG, STDP | RL, Unsupervised | Both | ||||||||
| Li [203] | 2021 | Serial | Manufacturing | DRL-based | DQN | RL | Simulation | ||||||||
| Li et al. [204] | 2019 | Serial | NS | Automaton-guided RL | RL, PPO | RL | Both | ||||||||
| Li et al. [205] | 2021 | Serial | Service | DRL-based | DRL, CNN, Policy search | RL | Experiment | ||||||||
| Li et al. [206] | 2024 | Serial | NS | RL-based sliding mode | RL, AC NN, RBFNN | RL | Simulation | ||||||||
| Li et al. [207] | 2015 | Serial | NS | Learning-based | AC, RBFNN | RL | Simulation | ||||||||
| Li et al. [208] | 2021 | Serial | Aerospace | DRL-based | DDPG | RL | Simulation | ||||||||
| Li et al. [209] | 2023 | Redundant Serial | NS | Autoencoder-based kinematic | sparse autoencoder, RNN | - | Simulation | ||||||||
| Li et al. [210] | 2018 | Humanoid | NS | RL-based | RL, DMP | RL | Experiment | ||||||||
| Lobbezoo et al. [211] | 2023 | Serial | Manufacturing | Learning-based | PPO, Soft AC | RL | Both | ||||||||
| Luo et al. [212] | 2018 | Serial | Manufacturing | DRL-based | MDGPS, DNN | RL | Experiment | ||||||||
| Luo et al. [213] | 2025 | Serial | Manufacturing | Vision-guided learning-based | Off-policy RL, RLPD, Soft AC | RL, IL, Hybrid | Experiment | ||||||||
| Luo et al. [214] | 2025 | Serial | NS | RL-based | PPO | RL | Simulation | ||||||||
| Lu et al. [14] | 2023 | Parallel | NS | DRL-based | DRL, Soft AC | RL | Both | ||||||||
| Majumder et al. [215] | 2024 | Serial | NS | DRL-based | DDPG, AC | RL | Simulation | ||||||||
| Majumder et al. [216] | 2023 | Serial | NS | DRL-based | Improved DDPG | RL | Simulation | ||||||||
| Maldonado-Ramirez et al. [217] | 2021 | Serial | Manufacturing | Vision-guided DRL-based | PPO, A2C, TRPO | RL | Both | ||||||||
| Malik et al. [218] | 2022 | Redundant Serial | NS | DRL-based | DQN | RL | Both | ||||||||
| Mannaa et al. [219] | 2023 | Serial | Manufacturing | DRL-based | DDPG, HER | RL | Both | ||||||||
| Mao et al. [220] | 2025 | Serial | NS | Vision-guided learning-based | Soft AC | RL | Both | ||||||||
| Matas et al. [221] | 2018 | Serial | Service | DRL-based | DDPG | RL, IL | Both | ||||||||
| Mazzaglia et al. [222] | 2024 | Redundant Serial | NS | Learning-based | RL, IL | RL, IL | Both | ||||||||
| Ma et al. [223] | 2025 | Serial | Multiple | Learning-based | Multimodal contrastive learning, Diffusion models | RL, Self-supervised | Simulation | ||||||||
| Mellatshahi et al. [224] | 2021 | Serial | NS | DRL-based | DQN | RL | Simulation | ||||||||
| Meyes et al. [225] | 2017 | Serial | Manufacturing | RL-based | Q-learning | RL | Both | ||||||||
| Molina [226] | 2018 | Serial | Service | RL-based | Relational RL, Q-Learning | RL, IL | Both | ||||||||
| Moon et al. [227] | 2021 | Redundant Serial | Underwater Construction | Meta-RL-based control | meta-RL, Model-based RL | RL | Simulation | ||||||||
| Mueangprasert et al. [228] | 2023 | Serial | NS | MBRL-based | GPR, ANN, SVR, PSO | RL | Simulation | ||||||||
| Naranjo-Campos et al. [229] | 2024 | Mobile | Rehabilitation | DRL-based | PPO, DRL | RL | Both | ||||||||
| Nguyen et al. [230] | 2023 | Serial | NS | DRL-based | DDPG, HER | RL | Both | ||||||||
| Nohooji et al. [231] | 2024 | Serial | Multiple | RL-based PID | AC, RBFNN | RL | Simulation | ||||||||
| Ouyang et al. [232] | 2017 | Serial | NS | RL-based | RBFNN, AC | RL | Both | ||||||||
| Pane et al. [233] | 2016 | Serial | Manufacturing | RL-based | AC | RL | Experiment | ||||||||
| Pane et al. [234] | 2019 | Serial | Manufacturing | RL-based compensation | AC, RBFNN | RL | Experiment | ||||||||
| Pantoja-Garcia et al. [235] | 2022 | Serial | NS | AC-based | AC, NN | RL | Simulation | ||||||||
| Pan et al. [236] | 2025 | Serial | Multiple | DRL-based | TD3 | RL | Simulation | ||||||||
| Pan et al. [237] | 2023 | Serial | NS | DRL-based | DRL | RL | Simulation | ||||||||
| Park et al. [238] | 2024 | Serial | NS | DRL-based | PPO | RL | Both | ||||||||
| Pavlichenko et al. [239] | 2022 | Redundant Serial | NS | DRL-based | Soft AC, Beta policy | RL | Both | ||||||||
| Pedersen et al. [240] | 2020 | Serial | NS | DRL-based | PPO, CycleGAN | RL, Unsupervised | Experiment | ||||||||
| Perrusquia et al. [241] | 2021 | Serial | NS | RL-based optimal tracking | NN, RL, Experience replay | RL | Simulation | ||||||||
| Perrusquía et al. [242] | 2019 | Pan and tilt | NS | RL-based impedance | RL, Q-learning, TD-learning, NRBF, K-means | RL | Experiment | ||||||||
| Pham et al. [243] | 2025 | Parallel | NS | RL-based adaptive | AC RL, NN | RL | Experiment | ||||||||
| Polydoros et al. [244] | 2015 | Serial | Manufacturing | Learning-based | PC-ESN, Echo State Network, Generalized Hebbian Learning, Bayesian linear regression | Supervised | Experiment | ||||||||
| Popov et al. [245] | 2017 | Serial | NS | DRL-based | DDPG | RL | Both | ||||||||
| Qiao et al. [246] | 2021 | Serial | NS | MPC-guided DRL-based | DQN | RL | Simulation | ||||||||
| Qin et al. [247] | 2022 | Dexterous Hand | NS | DRL-based | PPO, PointNet | RL | Both | ||||||||
| Qi et al. [248] | 2021 | Serial | NS | DRL-based | improved DDPG | RL | Simulation | ||||||||
| Quillen et al. [249] | 2018 | Hybrid | NS | Vision-guided DRL-based | DQN, DDPG, Path consistency learning, Monte Carlo policy evaluation, Corrected Monte Carlo | RL, Supervised | Simulation | ||||||||
| Rajeswaran et al. [250] | 2018 | Anthropomorphic Gripper | NS | DRL-based | DRL, DAPG, Behavior cloning, NPG | RL, IL, Hybrid | Simulation | ||||||||
| Ramirez et al. [251] | 2022 | Redundant Serial | NS | DRL-based | TD3, Supervised learning | RL, IL, Supervised | Simulation | ||||||||
| Ramirez et al. [252] | 2023 | Redundant Serial | NS | RLED-based | TD3, Supervised learning | RL, IL, Hybrid | Simulation | ||||||||
| Ren et al. [253] | 2020 | Serial | NS | Learning-based | RBFNN | Supervised | Simulation | ||||||||
| Ren et al. [254] | 2020 | Serial | NS | GAN-based inverse modeling | CGAN, LSGAN, BiGAN, DualGAN | Supervised | Experiment | ||||||||
| Rizzardo et al. [255] | 2023 | Serial | NS | DRL-based | Soft AC, Variational Autoencoder | RL, Unsupervised | Both | ||||||||
| Rubagotti et al. [256] | 2023 | Serial | NS | DRL-based | DRL, Q-learning, NAF | RL | Both | ||||||||
| Saeed et al. [257] | 2021 | Dexterous Hand | NS | DRL-based | DDPG, DDPG+HER, PPO | RL | Simulation | ||||||||
| Sahu et al. [258] | 2021 | Serial | NS | DRL-based | DDPG, AC | RL | Simulation | ||||||||
| Saidi et al. [259] | 2023 | Serial | NS | GA-optimized backstepping | GA | - | Simulation | ||||||||
| Said et al. [260] | 2019 | Serial | NS | ABC-optimized PID | ABC, PSO | - | Simulation | ||||||||
| Sajadi et al. [261] | 2022 | Serial | Healthcare | DRL-based | DDPG | RL | Both | ||||||||
| Sangiovanni et al. [262] | 2021 | Redundant Serial | Manufacturing | DRL-based | DRL, NAF | RL | Simulation | ||||||||
| Sangiovanni et al. [263] | 2018 | Anthropomorphic | Manufacturing | DRL-based | DRL, NAF | RL | Simulation | ||||||||
| Sangiovanni et al. [264] | 2018 | Serial | Manufacturing | DRL-based | DRL, NAF | RL | Simulation | ||||||||
| Scheikl et al. [265] | 2023 | Serial | Healthcare | Vision-guided learning-based | PPO, Contrastive GAN, DCL | RL | Both | ||||||||
| Sekkat et al. [266] | 2021 | Serial | NS | DRL-based | DDPG, YOLOv5 | RL | Simulation | ||||||||
| Shahid et al. [267] | 2022 | Redundant Serial | Manufacturing | RL-based | PPO, Soft AC | RL | Both | ||||||||
| Shahna et al. [268] | 2024 | Serial | NS | DRL-based | Soft AC | RL | Simulation | ||||||||
| Shao et al. [269] | 2025 | Serial | NS | DRL-based impedance | DDPG, Behavior cloning | RL, IL, Hybrid | Simulation | ||||||||
| Shetty et al. [270] | 2021 | Serial | NS | DRL-based | DDPG | RL | Simulation | ||||||||
| Shiferaw et al. [271] | 2024 | Serial | Multiple | DRL-based | DQN, DenseNet-121 | RL, Self-supervised | Simulation | ||||||||
| Shin et al. [272] | 2019 | Cable-driven | Healthcare | Learning-based model predictiv | RL, Learning from demonstration, NN | RL, IL | Both | ||||||||
| Siddique et al. [273] | 2024 | Serial | Rehabilitation | DRL-based | DDPG | RL | Simulation | ||||||||
| Simon et al. [274] | 2024 | Serial | Multiple | DRL-based | DQN, DMP | RL, IL | Both | ||||||||
| Sivertsvik et al. [275] | 2024 | Hybrid | Multiple | DRL-based | PPO, PPG | RL | Both | ||||||||
| Song et al. [276] | 2024 | Serial | Manufacturing | RL-based | Q-learning | RL | Both | ||||||||
| Song et al. [277] | 2024 | Serial | NS | RL-based consensus | RL, AC, NN | RL | Simulation | ||||||||
| Staley et al. [278] | 2018 | Serial | NS | DRL-based | DQN, DRL | RL | Simulation | ||||||||
| Sun et al. [279] | 2021 | Redundant Serial | Manufacturing | Learning-based | Fully connected NN | Supervised | Simulation | ||||||||
| Sun et al. [280] | 2023 | Redundant Serial | NS | Hybrid LfD-RL | DMP, RL, Policy network, Q-network | RL, IL, Hybrid | Experiment | ||||||||
| Sun et al. [281] | 2020 | Serial | NS | DRL-based | DRL, Soft AC | RL | Both | ||||||||
| Su et al. [282] | 2024 | Serial | NS | RL-based | PPO | RL | Both | ||||||||
| Su et al. [283] | 2025 | Serial | NS | RL-based backstepping | AC NN, RL | RL | Simulation | ||||||||
| Su et al. [284] | 2020 | Anthropomorphic | Healthcare | Learning-based | DCNN | Supervised | Both | ||||||||
| Tajdari et al. [285] | 2017 | Serial | Manufacturing | Adaptive sliding-mode | Model reference adaptive control, Adaptive sliding mode | - | Experiment | ||||||||
| Takeda et al. [286] | 2025 | Serial | NS | Hierarchical RL-based | Soft AC, Goal-conditioned RL | RL | Simulation | ||||||||
| Tang et al. [287] | 2022 | Serial | Rehabilitation | DRL-based | PPO, DRL | RL | Simulation | ||||||||
| Uchibe et al. [288] | 2021 | Humanoid | NS | IL-based | ERIL, Soft AC, IRL | RL, IL | Both | ||||||||
| Valencia et al. [289] | 2023 | Serial | NS | RL-based | TD3, Probabilistic NN, Gaussian mixture | RL | Simulation | ||||||||
| Vijay et al. [290] | 2018 | Serial | Power Transmission | Learning-based | RBFNN | Supervised, Hybrid | Simulation | ||||||||
| Vu et al. [291] | 2021 | Serial | NS | ARL-based | AC NN, ADP | RL | Simulation | ||||||||
| V et al. [292] | 2025 | Flexible Joint | NS | DRL-based | CPPO, PPO, CNN | RL | Experiment | ||||||||
| Wang et al. [293] | 2020 | Mobile | NS | DRL-based | PPO, DRL | RL | Both | ||||||||
| Wang et al. [294] | 2025 | Serial | NS | DRL-based | Soft AC, LSTM, Random Network Distillation, attention mechanism | RL | Simulation | ||||||||
| Wang, et al. [295] | 2024 | Serial | NS | DRL-based | DDPG | RL | Both | ||||||||
| Wang et al. [296] | 2025 | Serial | Multiple | DRL-based | HTSK Fuzzy system, GABC, Rainbow-DDPG, CPL | RL, IL, Hybrid | Both | ||||||||
| Wang et al. [297] | 2024 | Serial | NS | RL-based | RL, NN | RL | Simulation | ||||||||
| Wang, et al. [298] | 2017 | Serial | NS | Learning-based | RBFNN | - | Simulation | ||||||||
| Wang et al. [299] | 2023 | Dexterous | NS | DRL-based | DDPG, HER, Knowledge Transfer | RL | Both | ||||||||
| Wang et al. [300] | 2020 | Serial | NS | Learning-based | LSTM, Attention mechanism | Supervised | Experiment | ||||||||
| Wang et al. [301] | 2024 | Serial | NS | MBRL-based | Model-based RL, Curiosity-driven RL, Soft AC | RL, Self-supervised | Simulation | ||||||||
| Wang et al. [302] | 2025 | Redundant Serial | NS | Neuro-fuzzy-based | Fuzzy Logic System, AC-Identify, ADP | RL, Hybrid | Experiment | ||||||||
| Wang et al. [303] | 2025 | Serial | Multiple | Learning-based optimal | Multilayer FNN, RL | RL | Simulation | ||||||||
| Wang et al. [304] | 2023 | Redundant Serial | NS | Hierarchical multi-agent RL-based | Multi-agent PPO, hierarchical RL, PPO | RL | Simulation | ||||||||
| Weber et al. [305] | 2021 | Serial | NS | DRL-based | DDPG | RL | Both | ||||||||
| Wu et al. [306] | 2025 | Serial | Manufacturing | RL-based | Soft AC, DDPG, gated feature extractor | RL | Both | ||||||||
| Wu et al. [307] | 2022 | Serial | NS | Adversarial RL-based | Adversarial RL | RL | Both | ||||||||
| Wu et al. [308] | 2023 | Serial | NS | Learning-based | RBFNN | - | Both | ||||||||
| Wu et al. [309] | 2024 | Serial | NS | RL-based output feedback | RL, RBFNN | RL | Simulation | ||||||||
| Wu et al. [310] | 2025 | Serial | NS | DRL-based | Soft AC, Soft ACwB | RL | Both | ||||||||
| Xhin et al. [311] | 2024 | Serial | NS | DRL-based | DDPG | RL | Both | ||||||||
| Xiao et al. [312] | 2024 | Serial | Manufacturing | Learning-based | PPO, IL | RL, IL | Both | ||||||||
| Xie et al. [313] | 2019 | Serial | NS | DRL-based | DDPG, A3C, DPPO | RL | Simulation | ||||||||
| Xu et al. [314] | 2023 | Serial | NS | Learning-based | CURL, Soft AC, U-Net | RL, Self-supervised, Unsupervised | Both | ||||||||
| Xu et al. [315] | 2024 | Serial | NS | Learning-based | Hybrid NN, RBFNN, DiffNEA | Supervised | Experiment | ||||||||
| Yagi et al. [12] | 2025 | Serial | NS | Hierarchical RL-based | Hierarchical RL, PPO | RL | Experiment | ||||||||
| Yang et al. [316] | 2025 | Serial | Mining | DRL-based | IM-DQN, Prioritized experience replay, ICM | RL | Both | ||||||||
| Yang et al. [317] | 2023 | Serial | NS | Hybrid model-based model-free RL, broader industrial applications | Soft AC, PETS, CEM | RL | Both | ||||||||
| Yang et al. [318] | 2025 | Serial | NS | Learning-based fault-tolerant | AC RL, RBFNN | RL | Both | ||||||||
| Yang et al. [319] | 2025 | Serial | NS | Learning-based | Deterministic learning, RBFNN, Knowledge fusion | - | Simulation | ||||||||
| Yang et al. [320] | 2022 | Serial | NS | RL-based impedance | Off-policy reinforcement | RL | Both | ||||||||
| Yuan et al. [321] | 2025 | Dexterous Hand | NS | Vision-guided learning-based | PPO, DrM, DAgger | RL, IL | Both | ||||||||
| Zhang et al. [322] | 2015 | Serial | NS | DRL-based | DQN | RL | Both | ||||||||
| Zhang et al. [323] | 2023 | Dexterous Hand | NS | DRL-based | Soft AC, GAN | RL | Both | ||||||||
| Zhang [324] | 2025 | Serial | Manufacturing | Learning-based | MLP, CNN, LSTM, Q-learning, AC | RL, Supervised, Hybrid | Both | ||||||||
| Zhang et al. [325] | 2023 | Redundant Serial | NS | DRL-based | DQN, Soft AC | RL | Simulation | ||||||||
| Zhang et al. [326] | 2023 | Serial | NS | DRL-based trajectory | PPO, DRL | RL | Both | ||||||||
| Zhang et al. [327] | 2022 | Serial | NS | DRL-based | Mask R-CNN, Soft AC | RL | Both | ||||||||
| Zhang et al. [328] | 2023 | Serial | NS | symbolic regression-based | Symbolic regression, Genetic programming | Supervised | Simulation | ||||||||
| Zhang et al. [329] | 2022 | Serial | NS | DRL-based | PPO, CNN | RL | Simulation | ||||||||
| Zhan et al. [330] | 2022 | Serial | NS | Vision-guided learning-based | Soft AC, Contrastive learning, Data augmentation | RL, IL, Self-supervised | Both | ||||||||
| Zhao et al. [331] | 2024 | Serial | Manufacturing | DRL-based | MAPPO, PPO | RL | Both | ||||||||
| Zhao et al. [332] | 2020 | Serial | NS | DRL-based | PPO, DRL | RL | Simulation | ||||||||
| Zheng et al. [333] | 2023 | Serial | NS | DRL-based trajectory | DDPG, TD3, Soft AC | RL | Simulation | ||||||||
| Zhou et al. [334] | 2021 | Serial | NS | DRL-based | DDPG | RL | Simulation | ||||||||
| Zhu et al. [335] | 2022 | Serial | Aerospace | Fuzzy RL-based | Fuzzy wavelet network, RL | RL | Simulation | ||||||||
| Zhu et al. [336] | 2019 | Dexterous Hand | NS | DRL-based | DRL, DAPG, natural policy gradient | RL, IL, Hybrid | Both | ||||||||
| Zhu et al. [337] | 2025 | Serial | Multiple | RL-based optimal tracking | AC-identifier NN, RL | RL | Simulation | ||||||||
| Zong et al. [338] | 2025 | Dexterous Hand | Manufacturing | DRL-based | TD3, D2SR, pruning | RL | Simulation | ||||||||
| Modares et al. [339] | 2016 | Serial | NS | Learning-based | Integral RL, NN | RL | Both | ||||||||
| Interaction and Safety | Perrusquía et al. [340] | 2019 | Serial | NS | Learning-based position/force | Q-learning, Sarsa | RL | Both | |||||||
| Sasaki et al. [341] | 2022 | Serial | NS | DRL-based | DDQN, CNN | RL, Supervised | Both | ||||||||
| Wang et al. [342] | 2024 | Serial | Multiple | Learning-based teleoperation | 1D CNN, Multiview multitask | Supervised | Experiment | ||||||||
| AlAttar et al. [343] | 2022 | Serial | NS | Learning-based predictive | Local linear models, Least squares regression | Self-supervised | Both | ||||||||
| Learning and Adaptation | Baselizadeh, et al. [344] | 2022 | Serial | NS | Learning-based | Q-Learning, RL | RL | Simulation | |||||||
| Ding et al. [345] | 2021 | Serial | NS | NS | Model-based off-policy RL | RL | Simulation | ||||||||
| Elsisi et al. [13] | 2021 | Serial | NS | NS | Modified NN Algorithm, polynomial mutation | - | Simulation | ||||||||
| Haddad et al. [346] | 2021 | Serial | NS | Swarm-optimized | PSO, SSO | - | Simulation | ||||||||
| Jeong et al. [347] | 2020 | Serial | NS | Vision-guided learning-based | MPO, behavioral cloning, Contrastive Forward Dynamics | RL, IL, Self-supervised | Both | ||||||||
| Jhan et al. [348] | 2015 | Serial | NS | Fuzzy neural-based adaptive impedance force | fuzzy NN, FNS | Hybrid | Simulation | ||||||||
| Kallel et al. [349] | 2025 | Serial | NS | Learning-based | MLP Regressor, Random Forest Regressor, PINNs | Hybrid, Supervised | Simulation | ||||||||
| Kuang et al. [350] | 2025 | Serial | NS | DRL-based | DRL, Soft AC | RL | Both | ||||||||
| Li et al. [351] | 2022 | Serial | NS | Learning-based adaptive | Concurrent learning | - | Both | ||||||||
| Li et al. [352] | 2023 | Serial | NS | RL-tuned MPC visual servoing | DDPG | RL | Simulation | ||||||||
| Li et al. [353] | 2017 | Exoskeleton | Rehabilitation | RL-based adaptive impedance | Integral RL | RL | Experiment | ||||||||
| Lu et al. [354] | 2024 | Serial | NS | Learning-based | SARSA, RL | RL | Simulation | ||||||||
| Pradhan et al. [355] | 2020 | Serial | NS | Nonlinear self-tuning PID | NARMAX, RLS | - | Both | ||||||||
| Sacchi et al. [356] | 2023 | Serial | NS | DRL-assisted ISM | DNN, TD3, DRL | RL | Simulation | ||||||||
| Shcherbakov et al. [357] | 2020 | Serial | Manufacturing | Digital-twin based adaptive | Kalman filtering, System identification, Predictive modeling | - | Simulation | ||||||||
| Wu et al. [358] | 2025 | Serial | Multiple | DRL-based admittance | DRL, NAF | RL | Both | ||||||||
| Xie et al. [359] | 2020 | Serial | Aerospace | RL-based fuzzy sliding mode | Fuzzy logic, RL, Q-learning | RL | Simulation | ||||||||
| Yousef et al. [360] | 2024 | Serial | NS | DRL-based | DDPG | RL | Simulation | ||||||||
| Zhang et al. [361] | 2024 | Serial | NS | Deep MPC-based | Deep MPC, NN | RL, Adaptive Learning | Simulation | ||||||||
| Robot Type | Frequency of Use | Percentage |
|---|---|---|
| Serial | 255 | 74.3% |
| Robotic Hand | 32 | 9.3% |
| Others | 18 | 5.2% |
| Flexible Joint | 15 | 4.4% |
| Anthropomorphic | 6 | 1.7% |
| Parallel | 5 | 1.5% |
| Magnetic | 5 | 1.5% |
| Mobile | 3 | 0.9% |
| Hybrid | 3 | 0.9% |
| Not Specified | 1 | 0.3% |
| Application | Frequency of Use | Percentage |
|---|---|---|
| Not Specified | 207 | 60.3% |
| Manufacturing | 64 | 18.7% |
| Multiple | 21 | 6.1% |
| Rehabilitation | 16 | 4.7% |
| Healthcare | 10 | 2.9% |
| Service | 7 | 2.0% |
| Aerospace | 5 | 1.5% |
| Logistics | 4 | 1.2% |
| Industrial | 3 | 0.9% |
| Others | 6 | 1.7% |
| Control Technical Family | Frequency of Use | Percentage |
|---|---|---|
| Learning-based | 311 | 90.7% |
| Fuzzy-based | 8 | 2.3% |
| Optimization-based | 6 | 1.7% |
| Adaptive / Robust classical control | 6 | 1.7% |
| MPC-based / Hybrid MPC | 3 | 0.9% |
| Perception / Estimation-guided | 6 | 1.7% |
| Assistive / Copilot-based | 1 | 0.3% |
| Not Specified | 2 | 0.6% |
| Learning Paradigm | Frequency of Use | Percentage |
|---|---|---|
| Reinforcement Learning | 202 | 58.9% |
| Supervised | 42 | 12.2% |
| Self-supervised | 2 | 0.6% |
| Unsupervised | 1 | 0.3% |
| Hybrid | 48 | 14.0% |
| Multi-paradigm | 29 | 8.5% |
| Not Specified | 19 | 5.5% |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).