Preprint
Article

This version is not peer-reviewed.

A Hybrid Empirical-AI Model for Multi-Product Yield Prediction and Optimization from Cocoa Residues: A Stochastic Approach for Sustainable Valorization

Submitted:

19 May 2026

Posted:

20 May 2026

You are already at the latest version

Abstract
The inefficient management of agro-industrial residues, particularly cocoa pod husk and mucilage, represents a critical environmental and economic challenge in cocoa-producing regions such as Santander and Norte de Santander, Colombia. These by-products, constituting approximately 70% of the fruit’s total weight, are currently underutilized, generating pollution and wasting resources with high valorization potential. This article proposes the design and rigorous experimental validation of an empirical model based on artificial intelligence capable of predicting quantities of valuable compounds, including bioethanol, essential oils, paraffins, antioxidants, and pectins, obtained from cocoa residues. The model integrates critical variables such as cocoa variety, extraction methods, and process conditions, incorporating advanced machine learning techniques trained on a 100% empirical database of eighty-four (84) laboratory trials, combined with a post-inference sensitivity analysis via the Monte Carlo method with 10,000 simulations. Preliminary results demonstrate significant varietal differences; for instance, the CCN-51 variety achieves a mean bioethanol yield of 79.30 ± 4.96 mL/kg with a 95% confidence interval of (69.44–88.93) mL/kg, while the Criollo variety reaches 43.55 ± 2.72 mL/kg (38.14–48.84 mL/kg), both exhibiting identical coefficients of variation (6.25%). Furthermore, the integration of an optimized extraction sequence combined with neural networks allows for maximizing by-product yields while reducing final residue generation by 40%. This tool not only contributes to the circular economy and alignment with the Sustainable Development Goals (SDGs 9 and 12) but also offers a tangible pathway to improve the competitiveness of the Colombian cocoa industry through data-driven decision-making and sustainable technology adoption.
Keywords: 
;  ;  ;  ;  ;  
Subject: 
Engineering  -   Bioengineering
Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.
Prerpints.org logo

Preprints.org is a free preprint server supported by MDPI in Basel, Switzerland.

Subscribe

Disclaimer

Terms of Use

Privacy Policy

Privacy Settings

© 2026 MDPI (Basel, Switzerland) unless otherwise stated