Submitted:
17 December 2024
Posted:
18 December 2024
Read the latest preprint version here
Abstract

Keywords:
1. Introduction
1.1. Background
1.2. Issue
1.3. Our Idea
1.4. Contributions
- Optimization in Defect Detection: This paper advances the field of injection molding by focusing on AI-driven modeling to enhance the accuracy and efficiency of defect detection, aiming to optimize manufacturing quality and process reliability.
- Reduced training time: Since additional learning for new data becomes unnecessary, the required learning time is reduced and the maintenance period of the product life cycle can be extended.
- Securing corporate competitiveness: By reducing the cost of quality inspection due to advanced defect detection, the company can secure its own quality competitiveness.
- Reduction of resource waste: When modeling is performed for each individual setting, optimizing model parameters during training process becomes more simple due to distinguishable data characteristics. This reduces the cost of inference. This is also held with additionally collected data which reduces cost of retraining models.. Both aspects can be implied as the reduction of resource waste and lead to eco-friendly product yield.
2. Literature Review
2.1. Initial Experimentation and Predictive Modeling (Early 2000s)
2.2. Integration with Industry 4.0 and Real-Time Monitoring (Mid-2010s)
2.3. Advanced Quality Prediction and Zero-Defect Ambitions (2020s)
2.4. Similar Studies
2.5. This Work
3. Methodology
3.1. Data Collection
- Collected data size is D with N features.
- The dataset consists of F facilities and P products.
- Each product consists of C cavities.
- S setting features are used for classifying data into recipes.
- I input features are used for building Autoencoder models.
3.2. Recipe Separation
- First, the number of clusters K is decided for how many clusters the data should be divided into.
- Initial centroids for each cluster are given. Data close to each centroid are assigned to each cluster with distance-based metrics such as Euclidean-Distance.
- Initial centroids move to the center of the data points of each assigned cluster. This process is repeated until every data are assigned to clusters and centroids are finally adjusted.
- Among dataset size of D, select data with only N setting features.
- From the selected data, find C unique combinations of setting features.
- Set the parameter n clusters as K which is the same number of C.
- Apply standard-scaling method to unify different measures of setting features.
- Train K-Means Clustering with final data size of d rows and N features
- Return to the original data size D and predict each data based on the setting features.
- Throughout prediction, designate the cluster number from 1 to K for each data.
- Define the final cluster numbers as setting parameter-based recipes.
- When new data is collected, the cluster(recipe) number of setting features are predicted with the trained K-Means.
3.3. Train/Test Data Organization
- Suppose a batch process consists of multiple setting-based recipes which also exist in the train data. Although it seems appropriate for prediction, there is a setback. Since only the overall defect ratio per batch process is known, it is inappropriate to compare predicted defect ratios of multiple recipes and the existing ratio. In other words, in a batch process, defect ratio per each recipe is untrackable. Accordingly, batch process which have one unique setting recipe are left in the test data.
- Suppose the test data consists of an unseen recipe information. In this case, only partial prediction can be done leading to incomplete quality measurement of the batch process.
3.4. Min-Max Regularization
3.5. Predictive Modeling
3.5.1. Autoencoder Application
- Inputs are compressed through the encoder layer into the latent vector.
- In the latent vector, non-linear correlations between features are captured which can effectively learn important components of inputs.
- Throughout the latent vector, reconstructed outputs are predicted with the decoder layer.
- Inputs for training Autoencoders of the recipes and the entire normal data(integrated data) are the I input features.
3.5.2. Threshold Optimization with Evaluation Metrics
3.6. Adaptable Learning
- P is defined as each setting feature of the trained recipe.
- Q is defined as each setting feature of new recipe.
- The range of KL-Divergence is from zero to infinity. If the divergence is lower, the distributions are interpreted to be more similar.
- As a result, the goal is to make sure to find Q which can properly infer P.
- Select recipes R,...., R+K of the trained dataset.
- Define recipes r,...., r+k of the new dataset. (Recipe numbers are predicted by the trained K-Means model in Section 3.2.)
- Select N setting features per each trained recipe and new recipe.
- Calculate the KL-Divergence value of each setting feature between the new and trained recipes.
- Calculate the sum of the calculated KL-Divergence values.
- Select the trained recipe with the lowest KL-Divergence value compared to the new recipe.
- Select model of trained recipe and predict data of new recipe.
- Define I input features used for prediction.
4. Experiment Setup
4.1. Data Collection
4.2. Recipe Separation
4.3. Select Experiment Data
- Remove facility data which does not have any defect.
- Select data from a specific facility where most defect ratios exist.
- Select data of a specific product with the largest size.
4.4. Train/Test Data Organization
4.5. Results
4.5.1. Autoencoder Configuration
4.5.2. Threshold Optimization
4.5.3. Prediction Comparisons
4.6. Adaptable Learning
- For recipe 6, there exist process of two unique batches each with no defects(defect ratio=0.0%) and existing defects(defect ratio>0.0%). In other words, the first batch consists of only good cavities and the second consists of both good and bad cavities.
- For Recipe 7, there exist process of one unique batch with existing defects.
- Find each nearest trained recipe data and its prediction model(AutoEncoder) of recipe number 6 and 7 via KL-Divergence calculation.
- Optimize thresholds with validation data.
- Predict new data with each selected Autoencoder and the integrated AutoEncoder referred to Table 5.
- Compare prediction results of integrated and recipe-based models.
4.6.1. KL-Divergence Calculation
4.6.2. Data Organization
- Based on recipe 1, select the closest data among the remaining recipe 2 and 3.
- With KL-Divergence, the closest data of recipe 1 result in recipe 3.
- The existing train data of recipe 3 is decided as the validation dataset.
4.6.3. Prediction Comparisons
5. Discussion
6. Conclusions
References
- Cioffi, Raffaele, et al. "Artificial intelligence and machine learning applications in smart production: Progress, trends, and directions." Sustainability 12.2 (2020): 492.
- Tseng, Ming-Lang, et al. "Sustainable industrial and operation engineering trends and challenges Toward Industry 4.0: A data driven analysis." Journal of Industrial and Production Engineering 38.8 (2021): 581-598.
- Waltersmann, Lara, et al. "Artificial intelligence applications for increasing resource efficiency in manufacturing companies—a comprehensive review." Sustainability 13.12 (2021): 6689. [CrossRef]
- Chen, Yixuan, and Shanyue Jin. "Artificial intelligence and carbon emissions in manufacturing firms: The moderating role of green innovation." Processes 11.9 (2023): 2705. [CrossRef]
- Rakholia, Rajnish, et al. "Advancing Manufacturing Through Artificial Intelligence: Current Landscape, Perspectives, Best Practices, Challenges and Future Direction." IEEE Access (2024).
- Peres, Ricardo Silva, et al. "Industrial artificial intelligence in industry 4.0-systematic review, challenges and outlook." IEEE access 8 (2020): 220121-220139.
- Arinez, Jorge F., et al. "Artificial intelligence in advanced manufacturing: Current status and future outlook." Journal of Manufacturing Science and Engineering 142.11 (2020): 110804. [CrossRef]
- Espina-Romero, Lorena, et al. "Challenges and Opportunities in the Implementation of AI in Manufacturing: A Bibliometric Analysis." Sci 6.4 (2024): 60. [CrossRef]
- Rousopoulou, Vaia, et al. "Predictive maintenance for injection molding machines enabled by cognitive analytics for industry 4.0." Frontiers in Artificial Intelligence 3 (2020): 578152. [CrossRef]
- Xu, Jiawen, et al. "A review on AI for smart manufacturing: Deep learning challenges and solutions." Applied Sciences 12.16 (2022): 8239. [CrossRef]
- Jung, Hail, et al. "Application of machine learning techniques in injection molding quality prediction: Implications on sustainable manufacturing industry." Sustainability 13.8 (2021): 4120. [CrossRef]
- Charest, Meaghan, Ryan Finn, and Rickey Dubay. "Integration of artificial intelligence in an injection molding process for on-line process parameter adjustment." 2018 Annual IEEE International Systems Conference (SysCon). IEEE, 2018.
- Silva, Bruno, et al. "Enhance the injection molding quality prediction with artificial intelligence to reach zero-defect manufacturing." Processes 11.1 (2022): 62. [CrossRef]
- Thabtah, Fadi, et al. "Data imbalance in classification: Experimental evaluation." Information Sciences 513 (2020): 429-441. [CrossRef]
- Mao, Ting, et al. "Data driven injection molding process monitoring using sparse auto encoder technique." 2015 IEEE International Conference on Advanced Intelligent Mechatronics (AIM). IEEE, 2015.
- K. P. Sinaga and M. -S. Yang. "Unsupervised K-Means Clustering Algorithm." IEEE Access, vol. 8, pp. 80716-80727, 2020.
- Chen, Zhaomin, et al. "Autoencoder-based network anomaly detection." 2018 Wireless telecommunications symposium (WTS). IEEE, 2018.
- F.Perez-Cruz, "Kullback-Leibler divergence estimation of continuous distributions." 2008 IEEE International Symposium on Information Theory, Toronto, ON, Canada, 2008, pp. 1666-1670.
- Deepa, Bharathi and K. Ramesh. “Epileptic seizure detection using deep learning through min max scaler normalization.” International journal of health sciences (2022). [CrossRef]
- Nayak, S. C., Bijan B. Misra, and Himansu Sekhar Behera. "Impact of data normalization on stock index forecasting." International Journal of Computer Information Systems and Industrial Management Applications 6 (2014): 13-13.
- S. Al-Abri, T. X. Lin, M. Tao and F. Zhang. "A Derivative-Free Optimization Method With Application to Functions With Exploding and Vanishing Gradients." IEEE Control Systems Letters, vol. 5, no. 2, pp. 587-592, April 2021. [CrossRef]
- Sullivan, Joe H., and William H. Woodall. "Change-point detection of mean vector or covariance matrix shifts using multivariate individual observations." IIE transactions 32.6 (2000): 537-549.
- Song, Yang, Alexander Schwing, and Raquel Urtasun. "Training deep neural networks via direct loss minimization." International conference on machine learning. PMLR, 2016.
- Hosen, Mohammed Shahadat, et al. "Data-Driven Decision Making: Advanced Database Systems for Business Intelligence." Nanotechnology Perceptions (2024): 687-704. [CrossRef]
- Berahmand, Kamal, et al. "Autoencoders and their applications in machine learning: a survey." Artificial Intelligence Review 57.2 (2024): 28. [CrossRef]
- Yang, Tianyi, et al. "Deep learning model-driven financial risk prediction and analysis." (2024). [CrossRef]
- Bariah, Lina, and Merouane Debbah. "The interplay of ai and digital twin: Bridging the gap between data-driven and model-driven approaches." IEEE Wireless Communications (2024). [CrossRef]
- Sewak, Mohit, Sanjay K. Sahay, and Hemant Rathore. "An overview of deep learning architecture of deep neural networks and autoencoders." Journal of Computational and Theoretical Nanoscience 17.1 (2020): 182-188. [CrossRef]
- Hodson, Timothy O. "Root mean square error (RMSE) or mean absolute error (MAE): When to use them or not." Geoscientific Model Development Discussions 2022 (2022): 1-10. [CrossRef]
- Givnan, Sean, et al. "Anomaly detection using autoencoder reconstruction upon industrial motors." Sensors 22.9 (2022): 3166. [CrossRef]
- Yu, Miao, et al. "A model-based collaborate filtering algorithm based on stacked AutoEncoder." Neural Computing and Applications (2022): 1-9. [CrossRef]
- Liu, Tong, et al. "High-ratio lossy compression: Exploring the autoencoder to compress scientific data." IEEE Transactions on Big Data 9.1 (2021): 22-36. [CrossRef]
- Wang, Jianlong, et al. "Parameter selection of Touzi decomposition and a distribution improved autoencoder for PolSAR image classification." ISPRS Journal of Photogrammetry and Remote Sensing 186 (2022): 246-266. [CrossRef]
- Awwad, Ameen, Ghaleb A. Husseini, and Lutfi Albasha. "AI-Aided Robotic Wide-Range Water Quality Monitoring System." Sustainability 16.21 (2024): 9499. [CrossRef]
- Tynchenko, Vadim, et al. "Predicting Tilapia Productivity in Geothermal Ponds: A Genetic Algorithm Approach for Sustainable Aquaculture Practices." Sustainability 16.21 (2024): 9276. [CrossRef]
- Liu, Yang, et al. "Ada-XG-CatBoost: A Combined Forecasting Model for Gross Ecosystem Product (GEP) Prediction." Sustainability 16.16 (2024): 7203.
- Nañez Alonso, Sergio Luis, et al. "Digitalization, circular economy and environmental sustainability: The application of Artificial Intelligence in the efficient self-management of waste." Sustainability 13.4 (2021): 2092.
- Khakurel, Jayden, et al. "The rise of artificial intelligence under the lens of sustainability." Technologies 6.4 (2018): 100. [CrossRef]
- Wu, Carole-Jean, et al. "Sustainable ai: Environmental implications, challenges and opportunities." Proceedings of Machine Learning and Systems 4 (2022): 795-813.
- Tanveer, Muhammad, Shafiqul Hassan, and Amiya Bhaumik. "Academic policy regarding sustainability and artificial intelligence (AI)." Sustainability 12.22 (2020): 9435. [CrossRef]
- Yigitcanlar, Tan, and Federico Cugurullo. "The sustainability of artificial intelligence: An urbanistic viewpoint from the lens of smart and sustainable cities." Sustainability 12.20 (2020): 8548. [CrossRef]








| Data Information of a Specific Recipe | |||||
|---|---|---|---|---|---|
| Batch Number | 20231214 | ||||
| Data Shape | 438 Data Size and 132 Features | ||||
| Setting Features | 76 Setting Features | ||||
| Input Features | 6 Input Features | ||||
| Number of Products | 438 | ||||
| Predicted Recipe(Cluster) Number | 10 | ||||
| Good Cavity Counts(Total) | 432 | ||||
| Defect Cavity Counts(Total) | 6 | ||||
| Overall Defect Ratio | Approximately 1.37 % | ||||
| Mean | Std | Min | 25% | 50% | 75% | Max | |
|---|---|---|---|---|---|---|---|
| Injection Time | 2.33 | 0.08 | 2.19 | 2.30 | 2.340 | 2.35 | 6.39 |
| Switch Position | 13.46 | 2.83 | 7.99 | 12.00 | 13.50 | 15.99 | 21.0 |
| Cushion Distance | 11.29 | 2.72 | 5.88 | 9.81 | 11.39 | 13.69 | 18.56 |
| Weight Time | 24.10 | 2.76 | 17.33 | 23.85 | 24.26 | 24.52 | 172.03 |
| Max Injection Press | 151.52 | 2.52 | 118.73 | 149.91 | 151.40 | 153.17 | 171.26 |
| Peak Pressure | 13321.01 | 12.31 | 13260.5 | 13312.6 | 13323.0 | 13330.4 | 13358.9 |
| Defect Ratio of Batch Process | =0.0% | Test Appliable | |||||||
|---|---|---|---|---|---|---|---|---|---|
| Integrated Data Shape | 8190 Data Size and 6 Features | O | |||||||
| Recipe1 Data Shape | 2654 Data Size and 6 Features | O | |||||||
| Recipe2 Data Shape | 1996 Data Size and 6 Features | O | |||||||
| Recipe3 Data Shape | 3073 Data Size and 6 Features | O | |||||||
| Recipe4 Data Shape | 435 Data Size and 6 Features | X | |||||||
| Recipe5 Data Shape | 31 Data Size and 6 Features | X | |||||||
| Recipe6 Data Shape | 1 Data Size and 6 Features | X | |||||||
| Defect Ratio of Batch Process | >0.0% | ||||||
|---|---|---|---|---|---|---|---|
| Integrated Data Shape | 5842 Data Size and 6 Features | ||||||
| Recipe1 Data Shape | 4396 Data Size and 6 Features | ||||||
| Recipe2 Data Shape | 475 Data Size and 6 Features | ||||||
| Recipe3 Data Shape | 971 Data Size and 6 Features |
| Applied Parameters | Integrated Data | Recipe 1 | Recipe 2 | Recipe 3 |
|---|---|---|---|---|
| Loss | Mean Absolute Error | |||
| Activation Function | Tanh | |||
| optimizer | Adam | |||
| Learning Rate | 0.01 | |||
| Number of Epochs | 200 | |||
| Early Stopping | 200 | |||
| Output Layer Size | 128 | 128 | 64 | 64 |
| Inner Layer Size | 64 | 64 | 32 | 32 |
| Latent Vector Size | 16 | 4 | 16 | 16 |
| Dropout Ratio | 0.15 | 0.1 | 0.15 | 0.15 |
| Batch Size | 100 | 10 | 10 | 10 |
| Validation Split Ratio | 0.15 | 0.1 | 0.15 | 0.15 |
| Autoencoder | Validation Maximum Threshold | ||||||
|---|---|---|---|---|---|---|---|
| Integrated | 0.125 | ||||||
| Recipe1 | 0.0273 | ||||||
| Recipe2 | 0.0437 | ||||||
| Recipe3 | 0.0635 |
| Batch No | Existing Defect Ratio | Integrated pred | Recipe Pred | Recipe No. | ||||
|---|---|---|---|---|---|---|---|---|
| 20240322 | 0.514(5/971) | 0.102(1/971) | 0.617(6/971) | 3 | ||||
| 20240402 | 0.319(1/313) | 0.00(0/313) | 0.319(1/313) | 1 | ||||
| 20240429 | 0.879(6/682) | 0.00(0/682) | 1.173(8/682) | 1 | ||||
| 20240430 | 0.833(7/835) | 0.00(0/835) | 1.556(13/835) | 1 | ||||
| 20240502 | 0.611(5/817) | 0.00(0/817) | 1.22(10/817) | 1 | ||||
| 20240507 | 1.920(7/368) | 0.00(0/368) | 0.271(1/368) | 1 | ||||
| 20240614 | 1.312(5/381) | 0.262(1/381) | 2.099(8/381) | 1 | ||||
| 20240617 | 0.647(2/309) | 0.00(0/309) | 0.647(2/309) | 1 | ||||
| 20240623 | 0.289(2/691) | 0.00(0/691) | 0.723(5/691) | 1 | ||||
| 20240717 | 0.210(1/475) | 0.00(0/475) | 1.473(7/475) | 2 | ||||
| Total | 0.701%(41/5842) | 0.034%(2/5842) | 1.04%(61/5842) | |||||
| New Recipes | Trained Recipe1 | Trained Recipe2 | Trained Recipe3 | ||||
|---|---|---|---|---|---|---|---|
| Recipe6 | 40.1 | 41.63 | 41.12 | ||||
| Recipe7 | 11.13 | 12.66 | 12.15 |
| Train Data | Validation Data | Prediction Data | |||
|---|---|---|---|---|---|
| Recipe1 | 6(Normal) | Recipe6(Defect) | |||
| Recipe1 | Recipe3 | Recipe 7 |
| Recipe No. | Batch No | Defect Ratio | Integrated | Adaptable | Additional | ||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| 6 | 20240903 | 0.293(2/682) | 0.00(0/682) | 4.692(32/682) | 6.158(42/682) | ||||||
| 7 | 20240902 | 0.546(4/728) | 0.0(0/728) | 0.683(5/728) | X |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).