Submitted:
14 November 2024
Posted:
18 November 2024
Read the latest preprint version here
Abstract

Keywords:
1. Introduction
1.1. Background
1.2. Issue
1.3. Our Idea
1.4. Contributions
- Optimization in Defect Detection: This paper advances the field of injection molding by focusing on AI-driven modeling to enhance the accuracy and efficiency of defect detection, aiming to optimize manufacturing quality and process reliability.
- Reduced training time: Since Additional learning for new data becomes unnecessary, the required learning time is reduced and the maintenance period of the product life cycle can be extended.
- Securing corporate competitiveness: By reducing the cost of quality inspection due to an advanced detection of defects, the company can secure its own quality competitiveness of the overall product development.
- Reduction of resource waste: When modeling is performed for each individual setting, optimizing model parameters in training process becomes more simple due to distinguishable data characteristics. This reduces the cost of inference. When additional data is received, the cost of retraining is also reduced. Both aspects can be implied as the reduction of resource waste and lead to eco-friendly product yield.
2. Literature Review
2.1. Initial Experimentation and Predictive Modeling (Early 2000s)
2.2. Integration with Industry 4.0 and Real-Time Monitoring (Mid-2010s)
2.3. Advanced Quality Prediction and Zero-Defect Ambitions (2020s)
2.4. This Work
3. Methodology
3.1. Data Collection
- Collected data size is D with N features.
- dataset consists of F facilities and P products.
- Each product consists of C cavities.
- S setting features are used for classifying data into recipes.
- I input features are used for modeling Autoencoders.
3.2. Recipe Separation
- First, the number of clusters K is decided for how many clusters the dataset should be divided into.
- Next, initial centroids per each cluster are given. Data points which are close to each centroid are assigned to each cluster by using distance based metrics.
- After grouping data, initial centroids move to the center of the data points per each assigned cluster. This process is repeated until every data are assigned to each cluster and centroids are adjusted.
Classifying data into setting based recipes is as follows.
- Among Data size of D, select data with features of the existing N setting parameter values.
- Drop duplicated data and C unique combinations of setting values are left.
- Set the parameter n clusters as K.
- Apply standard-scaler to values for unifying different feature measures.
- Train K-Means Clustering with data size of d rows and N features
- Return to the original data size D and predict each data of setting parameter variables.
- Cluster Number from 1 to K for each data is now designated.
- Define cluster numbers as setting based recipes.
3.3. Train/Test Data Definition
- Suppose a lot process of the test data consists of multiple recipes and data of recipes also exist in the train data. At first, it seems predictable. However, note that only the real defect ratio of the entire lot process itself is known. This becomes impossible to compare predicted defect ratios per each classified recipe. In other words, defect ratio per each recipe is untrackable.
- Suppose a recipe in the train data does not exist in the test data. In this case, only partial prediction of the lot process can be done leading to incomplete quality measurement.
3.4. Min-Max Regularization
3.5. Predictive Modeling
3.5.1. Autoencoder Application
- Inputs are compressed through the encoder into the latent vector.
- In the latent vector, non linear correlations between features are captured which can effectively learn important components of inputs.
- Throughout the latent vector, reconstructed outputs are predicted with decoder.
3.5.2. Evaluation Metrics
3.5.2.1
3.6. Adaptable Learning
- P is defined as each setting value of the trained recipe.
- Q is defined as each setting value of new recipe. And is interpreted as whether it can properly infer P.
3.6.0.2
- Select trained recipes R,...., R+K.
- Select new recipes r,...., r+k . ( Recipe numbers are predicted with the trained K-Means in 3.2. )
- Select N setting parameter values per each trained recipe.
- Select N setting parameter values per each new recipe.
- Calculate the KL-Divergence value of each setting value between each new recipe and trained recipes individually.
- Calculate the sum of KL-Divergence of the setting values.
- Select trained recipe with the lowest KL-Divergence compared to the new recipe.
- Select model of trained recipe and predict data of new recipe.
- Define I input features used for prediction.
4. Experiment Setup
4.1. Data Collection
4.2. Recipe Separation
4.3. Select Experiment Data
- Remove data which facility does not consist of any defects.
- Select data from a specific facility where most defect ratios exist.
- Select data of a specific product the largest size.
4.4. Train / Test Data Organization
4.5. Results
4.5.1. Autoencoder Configuration
4.5.2. Threshold optimization
4.5.3. Prediction Comparisons
4.6. Adaptable Learning
- For recipe 6, there exist two lot process each with lot defect ratio of 0.0% and >0.0%. The first lot process have only good cavities and the second lot have good and bad cavities.
- For Recipe 7, there exist one lot process with lot defect ratio of >0.0%. The lot process have good and bad cavities.
- Find the nearest trained recipe per each new recipe 6 and 7 with KL-Divergence calculation.
- Select trained Autoencoders based on the nearest recipe.
- Predict new data of recipe 6 and 7 with each selected Autoencoder.
- Predict new data of recipe 6 and 7 as a whole with the integrated Autoencoder.
- Compare prediction results of integrated model and recipe models.
4.6.1. KL-Divergence Calculation
4.6.2. Data Organization
- From recipe 1, select the closest recipe among remaining recipe 2 and 3.
- Inputs for procedure of 3.6 are designated as R=2, K=1, r=1, k=0, N=76 , I=6.
- Through calculation, the closest setting values to recipe 1 is recipe 3.
- The existing train data of recipe 3 is decided as the validation data for setting the optimal threshold.
4.6.3. Prediction Comparisons
5. Discussion
6. Conclusions
References
- Cioffi, Raffaele, et al. "Artificial intelligence and machine learning applications in smart production: Progress, trends, and directions." Sustainability 12.2 (2020): 492. [CrossRef]
- Tseng, Ming-Lang, et al. "Sustainable industrial and operation engineering trends and challenges Toward Industry 4.0: A data driven analysis." Journal of Industrial and Production Engineering 38.8 (2021): 581-598. [CrossRef]
- Waltersmann, Lara, et al. "Artificial intelligence applications for increasing resource efficiency in manufacturing companies—a comprehensive review." Sustainability 13.12 (2021): 6689. [CrossRef]
- Chen, Yixuan, and Shanyue Jin. "Artificial intelligence and carbon emissions in manufacturing firms: The moderating role of green innovation." Processes 11.9 (2023): 2705. [CrossRef]
- Rakholia, Rajnish, et al. "Advancing Manufacturing Through Artificial Intelligence: Current Landscape, Perspectives, Best Practices, Challenges and Future Direction." IEEE Access (2024). [CrossRef]
- Peres, Ricardo Silva, et al. "Industrial artificial intelligence in industry 4.0-systematic review, challenges and outlook." IEEE access 8 (2020): 220121-220139. [CrossRef]
- Arinez, Jorge F., et al. "Artificial intelligence in advanced manufacturing: Current status and future outlook." Journal of Manufacturing Science and Engineering 142.11 (2020): 110804. [CrossRef]
- Espina-Romero, Lorena, et al. "Challenges and Opportunities in the Implementation of AI in Manufacturing: A Bibliometric Analysis." Sci 6.4 (2024): 60. [CrossRef]
- Charest, Meaghan, Ryan Finn, and Rickey Dubay. "Integration of artificial intelligence in an injection molding process for on-line process parameter adjustment." 2018 Annual IEEE International Systems Conference (SysCon). IEEE, 2018. [CrossRef]
- Silva, Bruno, et al. "Enhance the injection molding quality prediction with artificial intelligence to reach zero-defect manufacturing." Processes 11.1 (2022): 62. [CrossRef]
- Rousopoulou, Vaia, et al. "Predictive maintenance for injection molding machines enabled by cognitive analytics for industry 4.0." Frontiers in Artificial Intelligence 3 (2020): 578152. [CrossRef]
- Xu, Jiawen, et al. "A review on AI for smart manufacturing: Deep learning challenges and solutions." Applied Sciences 12.16 (2022): 8239. [CrossRef]
- Jung, Hail, et al. "Application of machine learning techniques in injection molding quality prediction: Implications on sustainable manufacturing industry." Sustainability 13.8 (2021): 4120. [CrossRef]
- Thabtah, Fadi, et al. "Data imbalance in classification: Experimental evaluation." Information Sciences 513 (2020): 429-441. [CrossRef]
- K. P. Sinaga and M. -S. Yang. "Unsupervised K-Means Clustering Algorithm." IEEE Access, vol. 8, pp. 80716-80727, 2020. [CrossRef]
- Deepa, Bharathi and K. Ramesh. “Epileptic seizure detection using deep learning through min max scaler normalization.” International journal of health sciences (2022). [CrossRef]
- Nayak, S. C., Bijan B. Misra, and Himansu Sekhar Behera. "Impact of data normalization on stock index forecasting." International Journal of Computer Information Systems and Industrial Management Applications 6 (2014): 13-13.
- S. Al-Abri, T. X. Lin, M. Tao and F. Zhang. "A Derivative-Free Optimization Method With Application to Functions With Exploding and Vanishing Gradients." IEEE Control Systems Letters, vol. 5, no. 2, pp. 587-592, April 2021. [CrossRef]
- Sullivan, Joe H., and William H. Woodall. "Change-point detection of mean vector or covariance matrix shifts using multivariate individual observations." IIE transactions 32.6 (2000): 537-549. [CrossRef]
- Song, Yang, Alexander Schwing, and Raquel Urtasun. "Training deep neural networks via direct loss minimization." International conference on machine learning. PMLR, 2016.
- Hosen, Mohammed Shahadat, et al. "Data-Driven Decision Making: Advanced Database Systems for Business Intelligence." Nanotechnology Perceptions (2024): 687-704.
- Berahmand, Kamal, et al. "Autoencoders and their applications in machine learning: a survey." Artificial Intelligence Review 57.2 (2024): 28. [CrossRef]
- Yang, Tianyi, et al. "Deep learning model-driven financial risk prediction and analysis." (2024).
- Bariah, Lina, and Merouane Debbah. "The interplay of ai and digital twin: Bridging the gap between data-driven and model-driven approaches." IEEE Wireless Communications (2024).
- Chen, Zhaomin, et al. "Autoencoder-based network anomaly detection." 2018 Wireless telecommunications symposium (WTS). IEEE, 2018. [CrossRef]
- Sewak, Mohit, Sanjay K. Sahay, and Hemant Rathore. "An overview of deep learning architecture of deep neural networks and autoencoders." Journal of Computational and Theoretical Nanoscience 17.1 (2020): 182-188. [CrossRef]
- Hodson, Timothy O. "Root mean square error (RMSE) or mean absolute error (MAE): When to use them or not." Geoscientific Model Development Discussions 2022 (2022): 1-10. [CrossRef]
- Givnan, Sean, et al. "Anomaly detection using autoencoder reconstruction upon industrial motors." Sensors 22.9 (2022): 3166. [CrossRef]
- F. Perez-Cruz, "Kullback-Leibler divergence estimation of continuous distributions." 2008 IEEE International Symposium on Information Theory, Toronto, ON, Canada, 2008, pp. 1666-1670.
- Yu, Miao, et al. "A model-based collaborate filtering algorithm based on stacked AutoEncoder." Neural Computing and Applications (2022): 1-9. [CrossRef]
- Liu, Tong, et al. "High-ratio lossy compression: Exploring the autoencoder to compress scientific data." IEEE Transactions on Big Data 9.1 (2021): 22-36. [CrossRef]
- Wang, Jianlong, et al. "Parameter selection of Touzi decomposition and a distribution improved autoencoder for PolSAR image classification." ISPRS Journal of Photogrammetry and Remote Sensing 186 (2022): 246-266. c. [CrossRef]
- Awwad, Ameen, Ghaleb A. Husseini, and Lutfi Albasha. "AI-Aided Robotic Wide-Range Water Quality Monitoring System." Sustainability 16.21 (2024): 9499. [CrossRef]
- Tynchenko, Vadim, et al. "Predicting Tilapia Productivity in Geothermal Ponds: A Genetic Algorithm Approach for Sustainable Aquaculture Practices." Sustainability 16.21 (2024): 9276. [CrossRef]
- Liu, Yang, et al. "Ada-XG-CatBoost: A Combined Forecasting Model for Gross Ecosystem Product (GEP) Prediction." Sustainability 16.16 (2024): 7203. [CrossRef]
- Nañez Alonso, Sergio Luis, et al. "Digitalization, circular economy and environmental sustainability: The application of Artificial Intelligence in the efficient self-management of waste." Sustainability 13.4 (2021): 2092. [CrossRef]
- Zhao, Lin, et al. "Application of artificial intelligence to wastewater treatment: A bibliometric analysis and systematic review of technology, economy, management, and wastewater reuse." Process Safety and Environmental Protection 133 (2020): 169-182. [CrossRef]
- Zhou, Lei, et al. "Emergency decision making for natural disasters: An overview." International journal of disaster risk reduction 27 (2018): 567-576. [CrossRef]
- Wu, Carole-Jean, et al. "Sustainable ai: Environmental implications, challenges and opportunities." Proceedings of Machine Learning and Systems 4 (2022): 795-813.
- Khakurel, Jayden, et al. "The rise of artificial intelligence under the lens of sustainability." Technologies 6.4 (2018): 100. [CrossRef]
- Tanveer, Muhammad, Shafiqul Hassan, and Amiya Bhaumik. "Academic policy regarding sustainability and artificial intelligence (AI)." Sustainability 12.22 (2020): 9435. [CrossRef]
- Yigitcanlar, Tan, and Federico Cugurullo. "The sustainability of artificial intelligence: An urbanistic viewpoint from the lens of smart and sustainable cities." Sustainability 12.20 (2020): 8548. [CrossRef]








| Lot Information with Specific Setting Recipe | |||||
| Lot Process Number | 20231214 | ||||
| Data Shape | 438 Data Size and 132 Features | ||||
| Setting Features | 76 Setting Values | ||||
| Input Features | 6 Input Features | ||||
| Number of Products | 438 | ||||
| Recipe(Cluster) Number | 10 | ||||
| Good Cavity Counts(Total) | 432 | ||||
| Defect Cavity Counts(Total) | 6 | ||||
| Overall Defect Ratio | Approximately 1.37 % | ||||
| Feature | Injection Time | Switch Position | Cushion Distance | Weight Time | Max Injection Press | Peak Pressure |
| Mean | 2.33 | 13.46 | 11.29 | 24.10 | 151.52 | 13321.01 |
| Std | 0.08 | 2.83 | 2.72 | 2.76 | 2.52 | 12.31 |
| Min | 2.19 | 7.99 | 5.88 | 17.33 | 118.73 | 13260.5 |
| 25% | 2.30 | 12.00 | 9.81 | 23.85 | 149.91 | 13312.6 |
| 50% | 2.340 | 13.50 | 11.39 | 24.26 | 151.40 | 13323.0 |
| 75% | 2.35 | 15.99 | 13.69 | 24.52 | 153.17 | 13330.4 |
| Max | 6.39 | 21.0 | 18.56 | 172.03 | 171.26 | 13358.9 |
| Train Dataset | Test Appliable | |||||
| Lot Defect Ratio | =0.0% | O | ||||
| Integrated Data Shape | 8190 Data Size and 6 Features | O | ||||
| Recipe1 Data Shape | 2654 Data Size and 6 Features | O | ||||
| Recipe2 Data Shape | 1996 Data Size and 6 Features | O | ||||
| Recipe3 Data Shape | 3073 Data Size and 6 Features | O | ||||
| Recipe4 Data Shape | 435 Data Size and 6 Features | X | ||||
| Recipe5 Data Shape | 31 Data Size and 6 Features | X | ||||
| Recipe6 Data Shape | 1 Data Size and 6 Features | X | ||||
| Test Dataset | |||||
| Lot Defect Ratio | >0.0% | ||||
| Integrated Data Shape | 5842 Data Size and 6 Features | ||||
| Recipe1 Data Shape | 4396 Data Size and 6 Features | ||||
| Recipe2 Data Shape | 475 Data Size and 6 Features | ||||
| Recipe3 Data Shape | 971 Data Size and 6 Features | ||||
| Applied Parameters | Integrated Data | Recipe 1 | Recipe 2 | Recipe 3 |
| Outer Layer Size | 128 | 128 | 64 | 64 |
| Inner Layer Size | 64 | 64 | 32 | 32 |
| Latent Vector Size | 16 | 4 | 16 | 16 |
| Dropout Ratio | 0.15 | 0.1 | 0.15 | 0.15 |
| Activation Function | ’Tanh’ | |||
| Optimizer | ’Adam’ | |||
| Batch Size | 100 | 10 | 10 | 10 |
| Number of Epochs | 200 | |||
| Validation Split Ratio | 0.15 | 0.1 | 0.15 | 0.15 |
| Learning Rate | 0.01 | |||
| Early Stopping | 200 | |||
| Loss | Mean Absolute Error | |||
| Autoencoder | Validation Maximum Threshold |
| Integrated | 0.125 |
| Recipe1 | 0.0273 |
| Recipe2 | 0.0437 |
| Recipe3 | 0.0635 |
| Lot No | Real Defect Ratio | Integrated pred | Recipe Pred |
| 20240322 | 0.514(5/971) | 0.102(1/971) | 0.617(6/971) |
| 20240402 | 0.319(1/313) | 0.00(0/313) | 0.319(1/313) |
| 20240429 | 0.879(6/682) | 0.00(0/682) | 1.173(8/682) |
| 20240430 | 0.833(7/835) | 0.00(0/835) | 1.556(13/835) |
| 20240502 | 0.611(5/817) | 0.00(0/817) | 1.22(10/817) |
| 20240507 | 1.920(7/368) | 0.00(0/368) | 0.271(1/368) |
| 20240614 | 1.312(5/381) | 0.262(1/381) | 2.099(8/381) |
| 20240617 | 0.647(2/309) | 0.00(0/309) | 0.647(2/309) |
| 20240623 | 0.289(2/691) | 0.00(0/691) | 0.723(5/691) |
| 20240717 | 0.210(1/475) | 0.00(0/475) | 1.473(7/475) |
| Sum | 0.701%(41/5842) | 0.034%(2/5842) | 1.04%(61/5842) |
| KL-Divergence Value | |||
| New Recipes | Trained Recipe1 | Trained Recipe2 | Trained Recipe3 |
| Recipe6 | 40.1 | 41.63 | 41.12 |
| Recipe7 | 11.13 | 12.66 | 12.15 |
| Train | Validation | Prediction |
| Recipe1 | Recipe6(Normal) | Recipe6(Defect) |
| Recipe1 | Recipe3 | Recipe 7 |
| New Data | Lot No | Defect Ratio | Integrated | Adaptable | Additional |
| Recipe 6 | 20240903 | 0.293(2/682) | 0.00(0/682) | 4.692(32/682) | 6.158(42/682) |
| Recipe 7 | 20240902 | 0.546(4/728) | 0.0(0/728) | 0.683(5/728) | X |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).