1. Introduction
Super-resolution (SR) has gained significant attention in recent years, particularly in medical imaging applications, where the resolution of acquired images is often limited by hardware constraints, time limitations, and patient comfort considerations. Traditional medical imaging modalities such as Magnetic Resonance Imaging (MRI) and Computed Tomography (CT) produce images at a resolution that can restrict the level of detail observable for diagnostic purposes. Increasing the resolution of these images through hardware improvements is often costly and impractical. As a result, computational techniques like SR have emerged as a powerful alternative, allowing high-resolution (HR) images to be reconstructed from low-resolution (LR) inputs without the need for expensive hardware [1,2,3].
The principle behind SR methods is to overcome these hardware limitations by leveraging redundant information from multiple LR images or sequences, often involving complex algorithms such as regularization methods and machine learning models [4]. Various approaches, from classic interpolation methods to more advanced neural network-based models, have been employed to enhance the quality of medical images in terms of spatial resolution, signal-to-noise ratio (SNR), and edge preservation [5].
In medical imaging, SR is particularly valuable because it enhances the quality of images used in diagnostic processes. For instance, MRI scans are used to assess various medical conditions, and improving their resolution can lead to more accurate diagnoses. Using SR techniques like Wiener filter regularization [1] or edge-preserving high-frequency regularization [3] allows for better visual quality in images without increasing acquisition costs or hardware requirements. Furthermore, neural network-based SR methods [2] have demonstrated promising results in improving image quality with reduced computational time, making them suitable for real-time applications in clinical settings.
The concept of SR in medical imaging has evolved significantly over the years, with various methodologies proposed to tackle the challenges of resolution enhancement. One of the early applications of SR to MRI was proposed by Peled et al. [6], where an Iterative Back-Projection (IBP) method was used to enhance MRI images of human white matter fiber tracts. While this method showed some promise, it was limited by the use of synthetic image data, which does not fully capture the complexities of real-world medical imaging. Subsequent work by Scheffler [7] addressed this limitation by highlighting the importance of utilizing original image data for more reliable SR reconstruction.
More recently, the integration of machine learning techniques into SR models has shown great promise. For example, a method combining iterative regularization with feed-forward neural networks was proposed by Babu et al. [2], yielding improved results over previous methods due to its capability to handle noise and produce clearer, higher-resolution images. This method demonstrates the potential of neural networks to enhance SR models by reducing computational complexity while maintaining high image quality.
Bayesian methods have also been a major area of exploration in SR research [8,9,10,11,12]. Aguena et al. [1] introduced a Bayesian approach to MRI SR, which employed a Wiener filter to regularize the iterative solution. This method achieved notable improvements in both noise reduction and edge preservation. Similarly, Ben-Ezra et al. [4] proposed a regularized SR framework for brain MRI, incorporating domain-specific knowledge to improve the quality of SR reconstructions. Their approach outperformed traditional maximum a posteriori (MAP) estimators in terms of both edge clarity and overall image quality.
Moreover, Ahmadi and Salari [3] proposed a high-frequency regularization technique that combines edge-preserving methods with traditional SR models. Their approach allows for enhanced edge definition in MRI images without the need for image segmentation, offering a computationally efficient solution suitable for clinical applications.
The Accelerated Proximal Gradient Method (APGM), as outlined in [13], is a well-known optimization technique commonly used for solving inverse problems in imaging, including SR. APGM accelerates the convergence of proximal gradient methods, which are widely adopted for SR tasks involving regularization. Its primary strength lies in its speed: it converges more quickly than traditional gradient methods, making it suitable for large-scale imaging problems. However, APGM’s effectiveness is heavily dependent on the choice of regularizer, which influences how well the method balances smoothness and sharpness in the reconstructed image. Poorly chosen regularizers can introduce artifacts or excessively smooth the image. While APGM is flexible and powerful, it requires careful tuning to achieve optimal results, especially when handling high-frequency details.
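For concreteness, the sketch below illustrates the accelerated proximal gradient iteration with Nesterov-style momentum. It is a generic outline rather than the implementation of [13]; `grad_f`, `prox_g`, the step size, and the iteration count are placeholders. For SR, `grad_f` would be the gradient of the data-fidelity term and `prox_g` the proximal operator of the chosen regularizer.

```python
import numpy as np

def apgm(grad_f, prox_g, x0, step, n_iter=100):
    """Accelerated proximal gradient: x_{k+1} = prox_g(z_k - step * grad_f(z_k)),
    where z_k is an extrapolated point built with Nesterov momentum."""
    x_prev = x0.copy()
    z = x0.copy()
    t = 1.0
    for _ in range(n_iter):
        x = prox_g(z - step * grad_f(z), step)          # proximal (regularization) step
        t_next = 0.5 * (1.0 + np.sqrt(1.0 + 4.0 * t * t))
        z = x + ((t - 1.0) / t_next) * (x - x_prev)     # momentum extrapolation
        x_prev, t = x, t_next
    return x_prev
```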
Block-matching and 3D filtering (BM3D), a renowned denoising algorithm discussed in [14], uses a collaborative filtering approach to reduce noise while preserving image structures. For super-resolution tasks, BM3D can act as a regularizer that effectively manages noise without compromising edges and textures. Its block-matching mechanism compares similar patches in the image, applying 3D filtering to reduce noise in these matched blocks. Although BM3D excels at preserving textures and fine details in natural images, its computational complexity can be high, particularly when dealing with large images or complex noise patterns. Additionally, the block-matching process may struggle in scenarios where image structures do not align well with the blocks, leading to potential loss of detail in areas with intricate textures.
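As an illustration of using BM3D as an off-the-shelf denoiser, the snippet below relies on the community `bm3d` Python package (not necessarily the implementation referenced above); the image and noise level are made-up values.

```python
import numpy as np
import bm3d  # pip install bm3d

rng = np.random.default_rng(0)
clean = rng.random((128, 128))                        # stand-in grayscale image in [0, 1]
noisy = clean + rng.normal(0.0, 0.05, clean.shape)    # additive Gaussian noise

# Collaborative filtering on groups of similar blocks; sigma_psd is the assumed noise std.
denoised = bm3d.bm3d(noisy, sigma_psd=0.05)
```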
Gu et al. [15] built on BM3D’s block-matching idea by introducing weighted nuclear norm minimization (WNNM), which further improved denoising performance and reinforced block matching as a versatile and widely adopted tool in image processing. Although BM3D is primarily an image denoising algorithm rather than a typical regularization technique, denoising often relies on regularization to reduce noise and improve image quality. BM3D utilizes collaborative filtering and 3D transform-domain techniques to achieve denoising, making it more aligned with advanced signal processing than conventional regularization methods.
Total variation (TV) regularization, a widely used technique in inverse problems, aims to promote sparsity in image gradients, leading to smoother regions while preserving sharp edges. TV regularization is known for its simplicity and its ability to retain edge information, making it a popular choice in SR tasks. However, TV regularization often suffers from the staircasing effect, where smooth regions of the image appear blocky or exhibit artificial edges. Over-regularization can further result in a loss of fine details, which limits the technique’s applicability in images with rich textures or high-frequency content [16].
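As a brief illustration, TV-regularized denoising is available off the shelf in scikit-image; the snippet below uses an arbitrary example weight rather than a tuned value.

```python
import numpy as np
from skimage.restoration import denoise_tv_chambolle

rng = np.random.default_rng(0)
noisy = rng.random((128, 128)) + rng.normal(0.0, 0.05, (128, 128))

# Larger `weight` -> stronger smoothing (and more risk of staircasing / detail loss).
tv_denoised = denoise_tv_chambolle(noisy, weight=0.1)
```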
Rapid and Accurate Image Super Resolution (RAISR), introduced in [17], is a learning-based SR method that is both computationally efficient and fast. It works by learning filters that are adaptive to local image features, such as gradients and edge orientations. Unlike deep learning-based methods that often require significant computational resources, RAISR is lightweight and quick, making it an attractive option for real-time applications. Despite its efficiency, RAISR tends to fall short when compared to more advanced SR methods like deep neural networks in terms of recovering high-frequency details. Its performance is highly dependent on the quality of the learned filters, and it may struggle with images that have complex structures or varying noise levels.
PPPV1, as described in [18], is a video super-resolution method based on the Plug-and-Play (PnP) framework. This method iteratively refines images, ensuring the gradual recovery of fine details over multiple iterations. A key feature of PPPV1 is its reliance on a denoising module, originally based on DnCNN (Denoising Convolutional Neural Network), to remove noise from the image during each step of the reconstruction process. While DnCNN is effective at reducing noise, it sometimes introduces oversmoothing, especially in high-frequency regions where texture and fine details are critical. The proposed improvement to this method involves replacing DnCNN with a custom prior for denoising. This change allows for more control over detail preservation and texture recovery, potentially reducing the risk of oversmoothing. A well-designed custom prior can provide a better balance between noise suppression and sharpness, which could lead to more accurate and visually appealing results, particularly in areas with intricate patterns or high-frequency details.
In addition to neural network-based and Bayesian approaches, convex optimization methods have been explored. Kawamura et al. [5] applied convex optimization techniques to MRI SR, producing state-of-the-art results by carefully balancing noise suppression and detail preservation.
Overall, the field of SR in medical imaging is rapidly advancing, with numerous approaches showing great potential in improving diagnostic imaging and reducing the need for high-cost imaging hardware. The next phase of research will likely focus on integrating these various techniques into more robust, real-time systems suitable for clinical environments.
In this paper, we propose an improved PPP regularization method for MRI super-resolution, utilizing an effective prior specifically designed for denoising and handling motion between frames. Building upon the foundation of our previous method (PPPV1 [18]), our approach incorporates an innovative denoiser that significantly enhances performance. Unlike traditional methods that primarily focus on denoising, our method integrates these advances into an MRI super-resolution framework.
2. Materials and Methods
The acquisition model we are assuming is
$$
\mathbf{y} = \mathbf{W}\mathbf{x} + \mathbf{n},
$$
where:
- $\mathbf{y}$ is the full set of low-resolution (LR) frames, described as $\mathbf{y} = [\mathbf{y}_1^T, \mathbf{y}_2^T, \ldots, \mathbf{y}_p^T]^T$, where $\mathbf{y}_1, \ldots, \mathbf{y}_p$ are the $p$ LR images. Each observed LR image is of size $N_1 \times N_2$, and the $k$th LR image is denoted in lexicographic notation as the vector $\mathbf{y}_k$, for $k = 1, \ldots, p$.
- $\mathbf{x}$ is the desired high-resolution (HR) image, of size $L_1 N_1 \times L_2 N_2$, written in lexicographic notation as the vector $\mathbf{x}$, where $L_1$ and $L_2$ represent the up-sampling factors in the horizontal and vertical directions, respectively.
- $\mathbf{n} = [\mathbf{n}_1^T, \ldots, \mathbf{n}_p^T]^T$, where $\mathbf{n}_k$ is the noise vector for frame $k$ and contains independent zero-mean Gaussian random variables.
- $\mathbf{W}$ is the degradation matrix, which performs the operations of blur, rigid transformation, and subsampling.

Assuming that each LR image is corrupted by additive noise, we can then represent the observation model as [19]
$$
\mathbf{y}_k = \mathbf{S}\mathbf{H}\mathbf{T}_k\mathbf{x} + \mathbf{n}_k,
$$
where $\mathbf{T}_k$ is the matrix that performs the rigid transformation, $\mathbf{H}$ represents a blur matrix, and $\mathbf{S}$ is a subsampling matrix. In our case $\mathbf{H} = \mathbf{I}$, since we assumed no added blur on the video frames.
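For intuition, the following NumPy/SciPy sketch (our own illustration, not code from this work; the image size, rotation angle, shift, subsampling factor, and noise level are arbitrary example values) generates one LR frame from an HR image according to this observation model with $\mathbf{H} = \mathbf{I}$:

```python
import numpy as np
from scipy.ndimage import rotate, shift

def degrade(hr, angle_deg, translation, factor, noise_std, rng):
    """Apply a rigid transform (rotation + translation), subsample by `factor`,
    and add zero-mean Gaussian noise, i.e., y_k = S T_k x + n_k with H = I."""
    warped = rotate(hr, angle_deg, reshape=False, order=1)       # T_k: rotation
    warped = shift(warped, translation, order=1)                 # T_k: translation
    lr = warped[::factor, ::factor]                              # S: subsampling
    return lr + rng.normal(0.0, noise_std, size=lr.shape)        # n_k: Gaussian noise

rng = np.random.default_rng(0)
hr = rng.random((256, 256))                                      # stand-in HR image
lr_k = degrade(hr, angle_deg=1.5, translation=(0.7, -0.3), factor=2,
               noise_std=0.01, rng=rng)
```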
The goal is to find the estimate $\hat{\mathbf{x}}$ of the HR image $\mathbf{x}$ from the $p$ LR images $\mathbf{y}_k$ by minimizing the cost function
$$
J(\mathbf{x}) = f(\mathbf{x}) + \lambda R(\mathbf{x}),
$$
where $f(\mathbf{x})$ is the “fidelity to the data” term and $R(\mathbf{x})$ is the regularization term, which offers some prior knowledge about $\mathbf{x}$. In this study, we adopt the Plug-and-Play Priors approach, in which the ADMM algorithm is modified so that the proximal operator related to $R(\mathbf{x})$ is replaced by a denoiser that solves the problem of Eq. (5). The denoiser used is based on the work by Chantas et al. [20].
The following outlines the algorithm we propose:
- 1. The first step of our algorithm is to evaluate the term $\mathbf{T}_k$ from Equation (3) by using rigid registration. Rigid registration, also known as rigid body registration or rigid transformation, is a fundamental technique in medical image processing and computer vision. It is used to align two images by performing translations and rotations while preserving the shape and size of the structures within the images [22].
In a 2D plane, a rigid transformation can be represented using a $3 \times 3$ matrix, often referred to as the transformation matrix. For example, a 2D translation can be represented as [23]:
$$
\mathbf{T}_{\mathrm{trans}} = \begin{bmatrix} 1 & 0 & t_x \\ 0 & 1 & t_y \\ 0 & 0 & 1 \end{bmatrix},
$$
where $t_x$ and $t_y$ are the displacements along the horizontal and vertical axes. Rotation and reflection matrices can also be formulated similarly. The result of the rigid transformation is represented as an affine transformation matrix. This matrix captures the translation and rotation parameters applied to the original image [23].
We assume that one of the LR images, $\mathbf{y}_c$ (typically the middle one), is produced from the HR image $\mathbf{x}$ by applying only downsampling, without transformation. Thus, $\mathbf{W}_c = \mathbf{S}$. The rigid transformation is calculated between $\mathbf{y}_c$ and each of the remaining LR images. Following that, we get $\mathbf{W}_k = \mathbf{S}\mathbf{T}_k$ for the remaining images.
- 2. The second step employs the PnP-ADMM technique. We execute PnP-ADMM, following the procedure outlined in Algorithm 1 until convergence, in order to minimize the problem described by Eq. (4). The initial HR image guess, $\mathbf{x}^{(0)}$, is generated from the observed LR data using the pseudo-inverse of the degradation matrix. Here, $D$ represents the denoising operator, introduced and discussed in Section 2.1, and $g$ is formulated as the data-fidelity term, $g(\mathbf{x}) = \sum_{k=1}^{p}\|\mathbf{y}_k - \mathbf{W}_k\mathbf{x}\|^{2}$.
|
Algorithm 1 PnP-ADMM [24] |
- 1: Initialize $\mathbf{x}^{(0)}$, $\mathbf{v}^{(0)} = \mathbf{x}^{(0)}$, $\mathbf{u}^{(0)} = \mathbf{0}$, and $\rho > 0$
- 2: for $t = 0, 1, 2, \ldots$ until convergence do
- 3: $\mathbf{x}^{(t+1)} = \arg\min_{\mathbf{x}} \; g(\mathbf{x}) + \frac{\rho}{2}\|\mathbf{x} - (\mathbf{v}^{(t)} - \mathbf{u}^{(t)})\|^{2}$
- 4: $\mathbf{v}^{(t+1)} = D(\mathbf{x}^{(t+1)} + \mathbf{u}^{(t)})$
- 5: $\mathbf{u}^{(t+1)} = \mathbf{u}^{(t)} + \mathbf{x}^{(t+1)} - \mathbf{v}^{(t+1)}$
- 6: end for
- 7: return $\mathbf{v}^{(t+1)}$
|
We next explain the modification made to the standard ADMM algorithm to obtain PnP-ADMM. Line 4 of the standard ADMM is
$$
\mathbf{v}^{(t+1)} = \operatorname{prox}_{R/\rho}\big(\mathbf{x}^{(t+1)} + \mathbf{u}^{(t)}\big).
$$
In the PnP-ADMM, this proximal operator is replaced by a denoiser $D$ that solves the problem of removing zero-mean Gaussian noise of variance $\sigma^{2}$ from the intermediate image $\mathbf{z} = \mathbf{x}^{(t+1)} + \mathbf{u}^{(t)}$. It can be shown that the Maximum A Posteriori (MAP) estimator of the clean image under the prior $p(\mathbf{v}) \propto \exp\{-R(\mathbf{v})\}$ is exactly this proximal operator:
$$
\hat{\mathbf{v}} = \arg\min_{\mathbf{v}} \frac{1}{2\sigma^{2}}\|\mathbf{v} - \mathbf{z}\|^{2} + R(\mathbf{v}) = \operatorname{prox}_{\sigma^{2}R}(\mathbf{z}),
$$
for $\sigma^{2} = 1/\rho$.
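To make the structure of the iteration concrete, the following NumPy sketch of Algorithm 1 is a simplified illustration rather than the SCICO-based implementation used in this work; the `denoise` callable, the operator lists `W_list`/`Wt_list`, the inner gradient solver, and the step sizes are placeholders.

```python
import numpy as np

def pnp_admm(y_list, W_list, Wt_list, denoise, x0, rho=1.0, n_iter=30,
             n_inner=10, step=1e-3):
    """Skeleton of PnP-ADMM: line 4 of the standard ADMM (the proximal step of
    the regularizer) is replaced by the plug-in `denoise` callable.
    W_list / Wt_list hold the per-frame forward operators and their adjoints."""
    x, v, u = x0.copy(), x0.copy(), np.zeros_like(x0)
    for _ in range(n_iter):
        # Line 3: approximately minimize g(x) + (rho/2)||x - (v - u)||^2 by gradient descent
        for _ in range(n_inner):
            grad = sum(Wt(W(x) - y) for y, W, Wt in zip(y_list, W_list, Wt_list))
            x = x - step * (2.0 * grad + rho * (x - (v - u)))
        v = denoise(x + u)   # Line 4: denoising step (the plug-and-play prior)
        u = u + x - v        # Line 5: scaled dual-variable update
    return v
```

In practice, the x-update (line 3) is solved more accurately than with this naive gradient loop, and `denoise` stands in for the operator $D$ of Section 2.1.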
2.1. The Denoising Algorithm
In this section, we describe the algorithm we use to implement the denoising step of Eq. (6). The algorithm is a simplification of that proposed in [20]: it is formulated in a probabilistic (Variational Bayes) context and utilizes an effective prior distribution, which we briefly describe next.
2.1.1. The Prior Distribution
The prior distribution we employ for the denoising step was proposed in [20] for single-image super-resolution. Its distribution parameters are real-positive, and it is built around a similarity measure between two patches with center pixels $w$ and $w + d$, respectively. This distribution is obtained after integrating out the hidden variables of the prior in [20]. However, this form is never explicitly used (it is not necessary) in the optimization algorithm; we show it here only for simplicity of presentation. Indeed, it enables us to interpret the prior in a deterministic context, analogous to the penalty function imposed on the video frames; see Equation (6).
We introduce a similarity measure between two image patches whose central pixels are $w$ and $w + d$, respectively. The complete set of pixel coordinates is represented by $\mathcal{W}$, and $d$ denotes the integer displacement between the center pixels of the two patches. For measuring similarity, we employ a weighted Euclidean norm to quantify the difference between the patch centered at $w$ and the patch centered at $w + d$, as follows:
$$
\rho(w, d) = \|\mathbf{Q}_d\mathbf{x}\|_{\mathbf{A}_w}^{2},
$$
where the weighted norm is defined by
$$
\|\mathbf{z}\|_{\mathbf{A}_w}^{2} = \mathbf{a}_w^{T}\mathbf{z}^{2},
$$
and $\mathbf{z}^{2}$ indicates the vector obtained by squaring each element of $\mathbf{z}$.
$\mathbf{Q}_d$ represents the difference operator, an $N \times N$ matrix, such that the $i$-th component of $\mathbf{Q}_d\mathbf{x}$ equals $x_i - x_{i+d}$ for all $i \in \mathcal{W}$, with circular handling of the image boundaries. The matrix $\mathbf{A}_w$ is an $N \times N$ diagonal matrix whose diagonal elements corresponding to the pixels of the patch centered at $w$, denoted $\mathcal{N}_w$, are the only non-zero values; specifically, $[\mathbf{A}_w]_{ii} = 0$ for all $i$ not in $\mathcal{N}_w$. Lastly, we denote by $\mathbf{a}_w$ the $N \times 1$ vector containing the diagonal of $\mathbf{A}_w$, i.e., the weights of the weighted norm: the closer a pixel is to the central pixel of the patch, the larger its weight.
The norm defined by (8) retains its value even if the summation in (8) runs over only the subset $\mathcal{N}_w$ instead of the full set $\mathcal{W}$, since the weights are zero for pixels outside $\mathcal{N}_w$. However, we use the full summation range over $\mathcal{W}$ to enable fast computations with the Fast Fourier Transform, as explained next.
The distance between the patch centered at $w$ and an arbitrary patch centered at $w + d$ is $\rho(w, d)$. The image circularly shifted by $d$ is denoted simply by $\mathbf{x}_d$ from now on, so that $\mathbf{Q}_d\mathbf{x} = \mathbf{x} - \mathbf{x}_d$. The formula (8) for calculating $\rho(w, d)$, expressed in terms of $\mathbf{x}_d$, is
$$
\rho(w, d) = \mathbf{a}_w^{T}(\mathbf{x} - \mathbf{x}_d)^{2}.
$$
Clearly, the values of $\rho(w, d)$ for all $w$'s are the result of the correlation between the patch weights and $(\mathbf{x} - \mathbf{x}_d)^{2}$, since their indices always differ by the constant $w$. To calculate the correlation required for the super-resolution technique discussed in the following section, we use the Fast Fourier Transform (FFT). This approach decreases the computational complexity of the algorithm from $O(N^{2})$, typical for correlation calculations, to $O(N\log N)$, which is the complexity of multiplication in the DFT (Discrete Fourier Transform) domain.
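A minimal NumPy sketch of this FFT-based computation follows; it is an illustration only, and the Gaussian weight kernel, patch size, and displacement used below are arbitrary example choices rather than the settings of our algorithm.

```python
import numpy as np

def patch_distances(x, d, a_kernel):
    """For every center pixel w, compute rho(w, d): the weighted sum of squared
    differences between the patch around w and the patch around w + d.
    `a_kernel` is an image-sized array holding the patch weights, centered at
    the origin (with wrap-around), so the computation is a circular correlation."""
    x_d = np.roll(x, shift=d, axis=(0, 1))             # circular shift of the image by d
    e = (x - x_d) ** 2                                 # element-wise squared difference
    # circular correlation via the FFT: O(N log N) instead of O(N^2)
    return np.real(np.fft.ifft2(np.conj(np.fft.fft2(a_kernel)) * np.fft.fft2(e)))

# example: 5x5 Gaussian-like weights, larger near the patch center
x = np.random.default_rng(0).random((128, 128))
offs = np.arange(-2, 3)
g = np.exp(-0.5 * (offs[:, None] ** 2 + offs[None, :] ** 2))
a_kernel = np.zeros_like(x)
a_kernel[np.ix_(offs, offs)] = g / g.sum()             # weights placed around the origin
rho = patch_distances(x, d=(3, 1), a_kernel=a_kernel)  # rho(w, d) for every center w
```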
2.1.2. Denoising in PnP-ADMM
Next, we describe the algorithm we employ in the PnP-ADMM context of Algorithm 1, and specifically for the denoising step (line 4). The algorithm we employ, as a denoising sub-problem of the general super-resolution algorithm (Algorithm 1), is in essence a special case of the VBPS algorithm in [
20], where there is no blurring nor decimation. Mathematically speaking, this means that the imaging operator
is the
identity matrix
, as shown in line 8 of Algorithm 2.
|
Algorithm 2: Variational Bayes Patch Similarity Denoising |
Input: Noisy image.
Output: Denoised image.
Initialization: Image initial estimate: set the regularization parameter obtained from [21] and take as the initial estimate the corresponding regularized (super-resolved) image. Parameter selection: set the prior parameters, the maximum number of iterations, and the error tolerance err.
- 1: while the maximum number of iterations has not been reached AND the error exceeds err do
- 2: for every displacement $d$ do
- 3:
- 4: for every $w$ do
- 5: calculate the expectations of the model's hidden random variables, using the $\rho(w, d)$ in (8) computed with the image estimate of the previous iteration,
- 6: calculate the corresponding quantities for all $w$ and $d$,
- 7: form the associated term by convolution,
- 8: set the imaging operator to the $N \times N$ identity matrix $\mathbf{I}$,
- 9: obtain the new image estimate by solving the resulting linear system with the Conjugate Gradients algorithm.
- 10: end for
- 11: end for
- 12: end while
- 13: $T = t$; return the final image estimate.
|
More specifically, the imaging model assumed for the denoising step is a simplified form of Eq. (2.1) in [20], because the imaging operator is now just the identity matrix (i.e., no blur or decimation). Also, in this form, the intermediate image $\mathbf{x}^{(t+1)} + \mathbf{u}^{(t)}$ has the role of the “noisy image”, and the uncorrupted image is the one to be estimated by the denoising algorithm. In parallel with the imaging model, we assume the image model, i.e., the prior distribution introduced above and given by Eq. (5); this is in essence the prior distribution for the uncorrupted image to be estimated via the denoising procedure. This means that Algorithm 2 is the result of adopting both the imaging model mentioned above and the prior (5). Lastly, note that the denoising Algorithm 2 selects the noise variance automatically in the initialization step, among other parameters.
We implemented our method in SCICO [25], an open-source library for computational imaging that includes implementations of several algorithms.
To evaluate our method and compare our results to the previously proposed method, we used the widely used, publicly available dataset The Cancer Imaging Archive (TCIA) [26]. Specifically, we conducted experiments using a dataset of LR brain MRI images and a corresponding HR reference dataset.
3. Results
Our method with the effective prior achieved notable improvements in image quality, as demonstrated by Figure 1 and Figure 2.
To objectively evaluate the effectiveness of our improved technique, we calculated the PSNR and conducted comparisons with both alternative approaches and enhanced versions of our own method. Specifically, we compared against PPPV1 [18], APGM (accelerated proximal gradient method) [13], BM3D (block-matching and 3D filtering) [14], total variation [16], RAISR (Rapid and Accurate Image Super Resolution) [17], and MIRNetv2 [27], as well as the pseudo-inverse and the denoised pseudo-inverse images. The difference between PPPV1 and the currently proposed method is that we now use a custom prior instead of DnCNN for the denoising, while the rigid transformation step remains the same. The outcomes, detailed in Table 1, demonstrate that our method surpasses the others in delivering higher image quality.
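For reference, PSNR here denotes the standard peak signal-to-noise ratio; a minimal implementation (assuming images scaled to a known data range) is:

```python
import numpy as np

def psnr(reference, estimate, data_range=1.0):
    """Peak signal-to-noise ratio (dB) between an HR reference and an SR estimate."""
    mse = np.mean((reference - estimate) ** 2)
    return 10.0 * np.log10(data_range ** 2 / mse)
```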
The Wilcoxon signed-rank test was used to compare the PSNR values of the proposed method with the respective values for the PPPV1, pseudo-inverse, denoised pseudo-inverse, APGM, BM3D, and TV methods; RAISR and MIRNetv2 were excluded because no per-slice data was available for them. The results of these statistical tests are shown in Figure 3 and indicate statistically significant differences between the proposed PPP method and the other six methods.
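A paired comparison of this kind can be reproduced with SciPy's implementation of the test; the sketch below uses made-up PSNR values rather than the study's data.

```python
from scipy.stats import wilcoxon

# Paired per-slice PSNR values (dB) -- illustrative numbers only
psnr_proposed = [31.2, 30.8, 32.1, 31.7, 30.9, 31.5]
psnr_pppv1    = [30.1, 29.9, 31.0, 30.6, 30.2, 30.4]

stat, p_value = wilcoxon(psnr_proposed, psnr_pppv1)  # paired, non-parametric test
print(f"Wilcoxon statistic = {stat:.2f}, p = {p_value:.4f}")
```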
Considering the perceptual quality of the frames, Table 2 shows that our method gives the best results, outperforming the other methods. This outcome demonstrates the robustness and effectiveness of our method in enhancing the perceptual quality of the super-resolved videos for this specific dataset.