Introduction and research overview
The rapid evolution of telecommunication systems, particularly the transition from 5G to 6G networks, imposes increased demands on data rates, network capacity, reliability, and overall efficiency [
1,
2]. These requirements introduce new challenges in the design of microwave components, devices, and systems to achieve the desired propagation characteristics, minimal losses, and efficient operation over wide frequency bandwidths.
Microstrip technology remains widely used in the design of microwave devices and components because of its compact size, low cost, as well as compatibility with printed circuit fabrication technologies and lumped components. Additionally, a solid theoretical foundation in transmission line theory and microwave engineering provides reliable design guidance [
3,
4,
5]. As a result, microstrip components are commonly used in mobile communications, satellite systems, radar, and sensing applications [
6,
7,
8].
Traditionally, microstrip circuits have been designed using empirical formulas, such as the Hammerstad equations [
9], along with iterative electromagnetic (EM) simulations [
10,
11,
12,
13,
14,
15]. Although these methods generally provide accurate results, they are often computationally intensive, particularly in multiparameter designs. Additionally, in certain cases, the obtained results are suboptimal, requiring parameter tuning, which is a time-consuming procedure. In such cases, global optimization is necessary to achieve the best possible performance and automate the design process.
Artificial neural networks (ANNs) are nowadays used in microwave engineering, replacing slow physics-based models, for fast, accurate modeling and simulation of various radiofrequency (RF) and microwave components (such as couplers, power dividers, filters, antennas, and amplifiers), thus accelerating computer-aided design (CAD). Additionally, they have been developed into physical “microwave neural network” (MNN) chips that process data using microwave signals for ultrafast, low-power computation in broadband communications, essentially employing EM simulations in artificial intelligence (AI) tasks.
The applications of ANNs and AI in Microwave Engineering include the following:
Device modeling: Training ANNs on EM simulation data to create fast surrogate models for complex passive (inductors, vias) and active (microwave transistor) components, thus substantially reducing simulation/design time.
Circuit optimization: Optimizing circuit performance (e.g., filters and matching networks) by employing ANN models, instead of performing traditional, time-consuming simulations.
Knowledge-based ANNs: Incorporating the existing microwave theory (empirical/analytical models) into ANN structures to improve accuracy, especially when limited data are available.
The applications of the emerging MNNs include the following:
Hardware implementation: Building actual chips that perform computations using microwave signals, operating in the frequency domain.
Ultrafast processing: MNNs can manipulate broadband signals (gigabits/sec) with low-speed controls, potentially achieving speeds near the speed of light, thus surpassing digital limitations.
Signal processing, decoding, and potentially replacing conventional RF front-end components (mixers, ADCs) in wireless systems to achieve low-latency communications.
The advanced algorithms provided by AI and machine learning (ML) technologies can accelerate the design process and produce highly accurate results for a given EM problem. Early studies by Horng et al. [
16] employed ANNs to efficiently design microstrip circuits. Based on a finite set of pairs of input and output parameter vectors obtained from a full-wave analysis, they employed a three-layer ANN that learned the input/output vector mappings and provided accurate approximations for the output vectors for any arbitrary input vector. Patnaik et al. [
17] demonstrated that a backpropagation ANN can be employed for the calculation of the effective dielectric constant of microstrip lines by comparing their results with those obtained using the spectral-domain (SD) technique. Wang et al. [
18] employed a hierarchical ANN to incorporate microwave functional knowledge and library inherent structural knowledge into neural models, substantially reducing the computational cost of the library development. Zhang and Gupta [
19] demonstrated that ANNs can approximate nonlinear EM behavior, indicating that data-driven methods could provide an efficient alternative. Subsequently, Devabhaktuni et al. [
20] employed ANNs for modeling and designing microwave devices, addressing model development and nonlinear modeling issues. They systematically discussed key aspects of ANN modeling, including data generation, the range and distribution of samples in the model input parameter space, data scaling, weight initialization, and the prevention of overlearning and underlearning. Additionally, they reviewed techniques for the partial automation of ANN model development, such as the multilayer perceptron (MLP) architecture, adaptive controller, and training-driven adaptive sampling, demonstrating their effectiveness in efficiently generating accurate models for practical microwave applications, including transistors, DC, small-s and large-signal modeling, as well as nonlinear circuit behavior using recurrent neural networks (RNNs). A sequential flow diagram summarizing the main steps of ANN model development, including data generation, preprocessing, network selection, training, and validation, is presented in
Figure 1 [
20].
Zhang et al. [
21] proposed an ANN-based algorithm that achieves user-specified accuracy for automating the RF/microwave design process. During training, the algorithm dynamically generates new data samples by automatically interfacing with simulation tools such as OSA90, Ansoft-HFSS, and Agilent ADS, enabling ANN models to efficiently learn the behavior of passive and active components, including coplanar waveguide discontinuities and MESFET devices. This approach achieves fast and accurate component modeling, circuit simulation, and yield optimization, significantly reducing the computation time compared with traditional physics-based simulation methods.
Figure 2 shows the training and evaluation workflow of the ANN-based modeling approach proposed in [
21]. The flowchart includes the selection of an MLP architecture, gradient-based weight optimization, as well as the systematic use of training, validation, and dataset testing to ensure accuracy and generalization. This workflow forms the basis of many surrogate modeling strategies adopted in RF and microwave design automation.
These early studies [
16,
20,
21] established ANNs as effective modeling tools for microwave design and laid the foundation for surrogate-based modeling and automated design workflows.
Furthermore, the application of evolution and optimization strategies to electromagnetics, antennas, and microwave filters has been reported in several studies. Yang et al. [
22] proposed an enhanced population-based incremental learning (PBIL) algorithm by introducing multiple probability vectors for individual solutions to improve the global search capability. Additionally, they refined the updating strategies for these vectors, incorporated negative learning to avoid poor solutions, introduced mutation operators for both individual and probability vectors to enhance diversity, and implemented an adaptive learning rate scheme to effectively balance exploration and exploitation. Goudos and Sahalos [
23] employed the generalized differential evolution (GDE3) algorithm, which is a multiobjective extension of the classical differential evolution (DE) algorithm, in the design of multilayer dielectric and open-loop ring resonator (OLRR) microwave filters under multiple constraints. They demonstrated that the GDE3 algorithm effectively generates well-distributed Pareto-optimal solutions, outperforming other evolutionary multiobjective algorithms, such as NSGA-II, MOPSO, and MOPSO-fs, in terms of solution diversity, convergence, and computational efficiency. Their results verified the applicability of GDE3 to practical filter design, including the selection of optimal layer materials from a predefined database, while satisfying strict passband, stopband, and total thickness constraints. Rocca et al. [
24] provided a comprehensive review of conventional, modified, and hybrid DE strategies for solving complex, high-dimensional EM optimization problems. They demonstrated the efficiency of DE in iteratively refining solutions via mutation, crossover, and selection, and discussed variants such as dynamic DE, group-based DE, and hybrid real/integer-coded DE. Their review focused on key EM applications, including antenna synthesis, planar and linear array design, inverse scattering, and microwave component optimization, emphasizing the impact of control parameters on convergence and the ability of DE to handle multiobjective and constrained optimization problems. Deb et al. [
25] evaluated the performance of different optimization techniques, such as conventional DE, particle swarm optimization (PSO), and genetic algorithms (GAs), in the impedance matching of a circularly polarized high-gain microstrip antenna over a predefined frequency range. Their results showed that
DE often provides faster solutions than the other two methods, whereas PSO provides stable optimization. Furthermore, they demonstrated that real-coded GAs (employing crossover/mutation) outperform basic versions but encounter difficulties in complex problems. Feng et al. [
26] proposed a novel parallel EM optimization approach for microwave filter design using feature-assisted neuro-transfer functions to address the problem of poor starting points. They employed surrogate models in EM optimization, enabling feature zeros to be simultaneously identified and adjusted to the desired frequency bands while ensuring that the responses meet the design specifications. Compared with conventional TF-based optimization, coarse/fine mesh space mapping, and direct EM optimization, their approach performs better in avoiding local minima and reaching the optimal EM solution in fewer iterations. Overall, these evolutionary optimization strategies have demonstrated strong global search capability in complex EM design problems, motivating their integration with surrogate modeling techniques in later studies.
More recently, ML-assisted design has attracted attention. Surrogate models trained on EM simulation data can capture complex relationships between geometry and frequency response, enabling fast evaluation and inverse design. Combined with evolutionary optimization, surrogate models are effective in complex microwave design structures. Liu et al. [
27] proposed a surrogate model-assisted evolutionary algorithm for microwave filter optimization. Specifically, they combined Gaussian process surrogate modeling, differential evolution operators, and Gaussian local search to efficiently obtain high-quality results, addressing the challenges of numerous local optima and narrow optimal regions in filter design landscapes while significantly reducing computational cost compared with standard global optimization methods.
Figure 3 shows the two main phases of the surrogate-assisted evolutionary optimization method presented in [
27]. In the exploration phase, differential evolution and local Gaussian process models are used to scan the design space and identify promising regions. In the exploitation phase, the best candidate solutions are further refined using surrogate-assisted local search; as a result, the number of required full-wave EM simulations is reduced.
Chen et al. [
28] proposed a manifold Gaussian process (MGP) ML method based on the DE algorithm for microwave filter parameter extraction. They achieved a significant reduction in the test errors of a fourth- and sixth-order coupling-filter surrogate model, verifying the efficiency of their method. Zhang et al. [
29] proposed a surrogate-based multiphysics optimization technique for microwave devices by incorporating ANNs and a trust-region algorithm. They applied an accurate and efficient ANN surrogate model that employed multiple multiphysics training samples around the optimized solution obtained from the previous iteration to a waveguide filter featuring a piezoactuator. Zhang et al. [
30] proposed yield optimization for microwave filters by constructing a single high-accuracy offline surrogate model, which fully replaces EM simulations in the entire yield optimization process. They introduced a hybrid ML approach by combining radial basis function neural network (RBFNN) regression and SVM classification using features extracted via vector fitting. Then, they efficiently employed global optimization using differential evolution to obtain designs with substantial yield improvement, demonstrating the effectiveness of their approach in filters with more than 10 sensitive design variables. An example of a surrogate-assisted yield optimization methodology is shown in
Figure 4 [
30], where a hybrid ML surrogate model fully replaces EM simulations during the optimization process.
Wei et al. [
31] combined convolutional autoencoders with multiobjective evolutionary algorithms for efficient filter design using the 1D convolutional autoencoder (CAE) network as a surrogate model to predict the scattering (S) parameters, reduce reliance on full-wave EM simulations, and optimize multiple design objectives while significantly reducing the design time. The workflow shown in
Figure 5 illustrates EM data acquisition, training of a 1D convolutional autoencoder surrogate model for S-parameter prediction, and integration with a multiobjective evolutionary optimization algorithm to efficiently satisfy the design requirements.
Sagar et al. [
32] presented an overview of ML and its application in electromagnetics, communications, radar, and sensing, discussing recent advances in intelligent algorithms for antenna design, synthesis, EM inverse scattering, SAR target recognition, and fault detection systems. Additionally, they highlighted limitations and future research directions. Yu et al. [
33] reviewed state-of-the-art filter design methodologies, including surrogate modeling methods, such as ANNs, radial basis function networks, and Gaussian process regression, along with advanced optimization algorithms within surrogate-model-assisted optimization frameworks. They discussed smart sampling strategies, feature-based surrogate modeling, and the decomposition of design problems based on filter variable sensitivities. A representative example of surrogate-model-assisted optimization frameworks, where EM simulations are coupled with ML-based surrogate models and adaptive sampling strategies, enabling efficient inverse design, is presented in
Figure 6.
Sahu et al. [
34] introduced an RNN-based surrogate modeling framework, where frequency is treated as a sequential parameter, enabling the learning of inter-frequency dependencies in the S-parameter responses. By employing bidirectional long short-term memory (LSTM) and gated recurrent unit (GRU) neural network architectures, as well as sequential processing of the spectral characteristics (
Figure 7), they achieved improved prediction accuracy and robustness compared with conventional regression-based surrogate models.
Wang et al. [
35] developed a generative adversarial network (GAN)-based inverse design methodology for microstrip filters, where binary pixel representations of the resonator geometries are generated and evaluated using a CNN-based surrogate model of the S-parameters, combined with a genetic algorithm optimizer. This hybrid generative–optimization framework enables the synthesis of single- and dual-band filter responses that closely match predefined spectral targets. Mashayekhi et al. [
36] proposed a multistage deep-learning (DL) framework for the inverse design of multimode Ku-band substrate-integrated waveguide (SIW) filters by combining a feedforward inverse model with hybrid inverse–forward residual refinement and iterative correction (
Figure 8) to systematically reduce the prediction errors and improve convergence.
These studies [
27,
28,
29,
30,
31,
32,
33,
34,
35,
36] have marked the transition from purely forward modeling toward data-driven inverse-design and multifunctional microwave components, reflecting the increasing use of ML techniques in RF and microwave engineering for the efficient design of complex filter structures.
From a broad methodological perspective, most surrogate modeling approaches for passive microwave circuits reported in the literature are based on EM datasets generated using a single full-wave solver and tailored to specific circuit topologies or operating bands [
16,
17,
18,
19,
20,
21,
26,
27,
28,
29,
30,
31,
33]. These studies have mainly focused on achieving accurate forward modeling or optimization performance within predefined parameter spaces and for fixed structural configurations.
Despite the recent advances in convolutional and attention-based neural architectures, MLPs remain the dominant modeling paradigm in microwave surrogate modeling [
16,
17,
18,
19,
20,
21,
26,
27,
28,
29,
33]. This preference is largely driven by the characteristics of the underlying design problems, where the input parameter space is relatively low-dimensional and governed by well-defined physical relationships, rather than by spatial patterns or long-range dependencies. Moreover, the generation of training data relies on computationally intensive EM simulations, which typically constrain the size of the available training datasets. Consequently, MLPs provide a balance between modeling capacity, data efficiency, and stable training behavior.
Recent AI-based surrogate and inverse-design approaches have emphasized the inherent challenges of microwave inverse modeling, particularly the ill-posed and highly nonlinear relationship between S-parameter responses and geometric design parameters. As reported in [
34,
35,
36], different network architectures and learning strategies are required to address these challenges, including sequence-based S-parameter modeling [
34], nonlinear inverse mapping [
35], and multistage residual refinement frameworks [
36]. These studies have shown that single-pass inverse models often suffer from systematic prediction errors, necessitating the use of constrained design spaces, hybrid inverse–forward formulations, or iterative correction mechanisms to improve accuracy and convergence.
Another important modeling issue is the treatment of frequency. Surrogate modeling of frequency-domain responses can be formulated either by treating frequency as an explicit input variable or by representing the full spectral response as a structured output. Although frequency-as-input formulations enable pointwise predictions, they do not explicitly capture inter-frequency dependencies. In contrast, frequency-as-output representations can learn the overall spectral behavior of the device, thus improving modeling reliability and inverse-design performance in broadband microwave applications [
35,
36].
In summary, early neural-network-based surrogate models for microwave components [
16,
17,
18,
19,
20,
21] have shown that data-driven approaches can significantly reduce the computational cost of EM analysis; however, they are generally restricted to single topologies and narrowly defined parameter spaces. Subsequent studies incorporating evolutionary optimization and surrogate modeling [
22,
23,
24,
25,
26,
27,
28,
29,
30] have shown that the global optimization of complex microwave structures is feasible; however, these approaches typically rely on topology-specific datasets and solver-dependent modeling assumptions. More recent DL-based methods [
34,
35,
36] have improved modeling flexibility and inverse-design capability by addressing spectral correlations and nonlinear response behaviors, often at the expense of increased data requirements and architectural complexity. Furthermore, Gaussian process and manifold-based surrogate models [
27,
28] provide uncertainty-awareness, which is advantageous for optimization and refinement; however, their scalability to multifrequency or multitopology settings has not yet been widely reported; multitopology surrogate modeling and systematic active learning strategies have received limited attention, with only preliminary discussions or partial implementations reported in the literature [
32,
36].
Overall, the existing studies have independently addressed individual stages of the microwave design process, necessitating a unified and reproducible framework that integrates data generation, surrogate modeling, inverse optimization, and validation across multiple microwave topologies.