Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Global optimization of default phases for parallel transmit coils for ultra-high-field cardiac MRI

  • Maxim Terekhov ,

    Roles Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Project administration, Software, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing

    Terekhov_M@ukw.de

    Affiliation Chair of Cellular and Molecular Imaging, Comprehensive Heart Failure Center, University Hospital Wuerzburg, Wuerzburg, Germany

  • Ibrahim A. Elabyad,

    Roles Investigation, Methodology, Resources, Software, Validation, Writing – original draft, Writing – review & editing

    Affiliation Chair of Cellular and Molecular Imaging, Comprehensive Heart Failure Center, University Hospital Wuerzburg, Wuerzburg, Germany

  • Laura M. Schreiber

    Roles Funding acquisition, Project administration, Resources, Supervision, Writing – review & editing

    Affiliation Chair of Cellular and Molecular Imaging, Comprehensive Heart Failure Center, University Hospital Wuerzburg, Wuerzburg, Germany

Abstract

The development of novel multiple-element transmit-receive arrays is an essential factor for improving B1+ field homogeneity in cardiac MRI at ultra-high magnetic field strength (B0 > = 7.0T). One of the key steps in the design and fine-tuning of such arrays during the development process is finding the default driving phases for individual coil elements providing the best possible homogeneity of the combined B1+-field that is achievable without (or before) subject-specific B1+-adjustment in the scanner. This task is often solved by time-consuming (brute-force) or by limited efficiency optimization methods. In this work, we propose a robust technique to find phase vectors providing optimization of the B1-homogeneity in the default setup of multiple-element transceiver arrays. The key point of the described method is the pre-selection of starting vectors for the iterative solver-based search to maximize the probability of finding a global extremum for a cost function optimizing the homogeneity of a shaped B1+-field. This strategy allows for (i) drastic reduction of the computation time in comparison to a brute-force method and (ii) finding phase vectors providing a combined B1+-field with homogeneity characteristics superior to the one provided by the random-multi-start optimization approach. The method was efficiently used for optimizing the default phase settings in the in-house-built 8Tx/16Rx arrays designed for cMRI in pigs at 7T.

Introduction

The implementation of MRI-scanners, operating at ultra-high field static magnetic fields (UHF of B0≥7T, promises a significant increase of both SNR and the spatial resolution (up to ~200μm in-plane) [1] of clinical MR-images. Therefore, despite technical challenges related to both B0 and B1+ fields’ inhomogeneities, the interest in the application of UHF scanners for cardiovascular MRI (cMRI) grows [24]. At the Larmor frequency of protons ≈ 300MHz (B0≈7T) and electrical permittivity of muscle tissue (ε≈60), the wavelength of (10-12cm) is smaller than the dimensions of a human thorax. This leads to the establishment of a standing wave regime and creates interferences of a B1+-field across an imaged field-of-view (FOV). Additionally, the homogeneity of the B1+-field suffers from strong differences in the permittivity of intra-thoracic structures, in particular, of the lung parenchyma and cardiac muscle tissue. Thus, the spatial distribution of contrast in MR images becomes essentially heterogeneous rising demand for methods capable to neutralize these negative factors.

During the last decades, multiple elements transmit-receive arrays (mTx-arrays) combined with parallel transmit (pTX) technology became an emerging tool for the improvement of the B1+-field homogeneity in UHF cMRI. Starting with standard birdcage coils with individual control of the phases of the driving ports, the technology development has led to customized arrays with up to 32 transceiver elements [5]. The type of mTX-arrays elements also evolved from classical magnetic loops to electrical dipoles, microstrips, and sophisticated loop-dipole hybrids [511]. The novel hardware allows for shaping the optimal spatial distribution of the B1+-field to excite a targeted transversal magnetization with the best possible smoothness of both magnitude and phase within a region-of-interest (ROI). This procedure is usually called “B1+-shimming”. Additionally, using the RF-power amplifier (RFPA) operating in pTX mode allows for driving individual TX-arrays elements dynamically, i.e. with varying magnitude and phase of the RF-pulse waveforms to shape excitation profiles (including 2D and 3D) using different optimization criteria [1216]. Last but not least, both techniques allow for including safety margins regarding specific adsorption rate (SAR) of electromagnetic energy.

Despite the significant progress in the field of subject-specific static and dynamic B1+-shimming, it remains to be a quite complex research tool. It requires deep expertise in MR-physics, pulse sequence programming, and optimization methods, to build a pipeline including (i) B1+ -mapping, (ii) B1+ -shimming parameters computation taking into account SAR margins, and (iii) integration on the scanner system.

For cardiac MRI additional significant efforts for reliable absolute B1+-mapping are required to overcome limitations related to breathing and cardiac motion [17]. An important step in the development of mTx-arrays for UHF applications is the optimization and fine-tuning of the coil ensuring reasonable image quality and SAR safety for an average subject without additional adjustment procedures. In particular, this includes finding the default set of phase shifts required for the driving voltages of the individual TX-elements such that a uniform B1+-distribution can be achieved for the largest possible Field-of-View (FOV). This set, usually called a “phase vector”, can be integrated into the hardware as permanent or adjustable phase shifters implemented e.g in the form of coaxial cables of defined length. Alternatively, for the last generation of MR-scanners with support of pTX-mode (e.g. Magnetom™ “Terra", Siemens Healthineers), the default phase vector can be fixed in the coil configuration file to set the phases of driving voltages generated by RFPA. This provides operation of the mTX-array in the so-called “pTX Compatibility Mode”. The default phase vector should shape a reasonable combined B1+-profile for an average subject making possible a straightforward application of the array in the single-Tx mode without RF-shimming as well as simplifying the B1-shimming process for pTX mode. For the birdcage volume resonators or dipole antenna arrays with cylindrical symmetry of elements arrangement, this usually corresponds to the “circularly polarized” (“CP”) phases distribution based on the geometry of elements allocation [18]. However, the straightforward geometry-based approach for setting up default phases is not possible for the surface cardiac mTX-arrays having sophisticated shapes of individual elements and their allocation on a thorax. Therefore, for this type of mTX-arrays, the problem of default phasing is formulated as an optimization problem for a cost function maximizing homogeneity of the B1+-field with certain constraints or regularization on SAR and efficiency of using an available RF-power [5, 8, 11, 19, 20]. For typical numbers of Tx-elements (N = 8…16), the high-dimensional cost function has numerous local extremes. Many computational methods were applied in the context of both static and dynamic B1+ manipulations which include linear and non-linear solvers approaches. However, most of these methods are targeted for the application of the B1+-shimming during the scan session, to deliver practical solutions under a strong time limitation (~1 min computation time). Therefore in most cases, the result represents a local optimum dependent on starting point used for the search process initialization. Moreover, these methods are mostly developed for the B1+-shimming using pTX-RFPA and manipulating with both phases and magnitudes of the driving voltages. Due to this reason, the optimization techniques developed for the subject-specific B1+-shimming providing quick but locally optimal solution within limited FOV is suboptimal for the optimization of default array elements phases as a part of the coil development and final fine-tuning. In this case, the computation time could be extended to hours and the main goal is to achieve the globally optimal characteristics of a shaped B1+ in a frame of the formulated optimization problem.

For the small (N = 4–6) number of TX-elements, the most straightforward and universal solution of the discussed “phase only” optimization problem is the global brute force search over the complete phase space raster defined with reasonable discretization over each vector coordinate [21]. However, because the number of values in the raster grows exponentially with the number of elements this requires an extremely long computation time even for a moderate number of phase elements N≥8 and discretization step (e.g 5°). Therefore, often a prior experience and pragmatic restrictions on the searched phase vector components range are used to accelerate these computations. As an example, in the work [8] the brute force approach was used in a highly restricted phase space raster for 12 elements array (363≈40000 vectors was used). This procedure was considered to be sufficient for a symmetrical rectilinear array geometry and suggesting that phases should not vary along the z-direction. Alternatively, in the work [6], the search of optimal phases was performed using a non-linear solver (NLS) approach with multiple random starting points (referred to further as “random multi-start”). However, for keeping computation time reasonably short, the number of starting vectors for the NLS-optimization was deliberately limited to 1000. For the mTX array with 16 elements, this number corresponds to ≈10−24 parts of the whole phase vector space gridded with 10° steps. As it’ll be shown further in this paper, in an arbitrary array elements configuration, the straightforward usage of random-multi-start strategy may not allow us to detect a sufficiently good approximation of a global optimum of the typical optimization cost functions. This paper aims to propose a flexible pragmatic approach for searching for an optimal phase setting to shape a targeted default B1+-field of mTX arrays. The proposed numerical optimization strategy allows for a flexible trade-off between (i) sufficiently short computation time and (ii) probability of reaching the global optimum of the targeted B1+ characteristics. It does not involve a priori constraints on the searched phase space (e.g. originating from the symmetry of the coil’s geometry). In the current work, the proposed technique was validated using an in-house developed 8TX/16RX dual part array for 7T cardiac MRI in pigs described in [22].

Theory

The combined field created by an mTX array with the constant driving amplitudes is expressed as: (1)

Here, are complex phasors of the driving current for channels and are spatial B1+-maps of the individual coil elements. The phasing of an array is performed by control of the phasor vector {Φ} = {Φ1⋯ΦN} to achieve the targeted spatial homogeneity of combined field B1c+(r).

Cost functions and optimization problem.

In general, the goal of B1+-shimming is to achieve a uniform excitation pulse flip-angle (FA) distribution within а whole imaged FOV. However, using static and manipulating only with phases of individual elements this task would be either non-accomplishable or put very strong limitations on the range of achievable FA. Therefore, a pragmatic approach demands a certain degree of homogeneity characterized by statistical metrics of a distribution uniformity to be reached in the limited region-of-interest (ROI) Δr = (Δrx,Δry,Δrz).

One of the widely used uniformity metrics used for the B1+-field shimming is the coefficient of variation: (2)

Here mean() and std() denote the mean value and the standard deviation calculated over the voxels within Δr. Similar to the work [23] we introduce a regularization factor Δm to enhance the demand on homogeneity and compose the spatial uniformity cost function Fu as: (3)

Among an achievable B1+-homogeneity, the essential characteristic of an mTX-array is the efficient usage of the available radiofrequency power. This can be controlled in the optimization procedure via an array transmit efficiency ratio of “sum-of-magnitudes” (SOM) and “magnitude-of-sum” (MOS) determined as [18, 21, 23]: (4)

Taking values in the range [0… 1] it characterizes the efficiency of power usage at the specific combination of phases Φk producing .

Finally, the optimization problem for the combined cost function Fc to be solved for the determining phasor vector can be formulated as follows: (5)

Method

Computation over sub-sampled phase space

The essential difficulty of solving the problem (5) is the periodic influence of each component of the phasor vector Φk on the cost function Fc leading to multiple local extremes (S1 Fig). In practice, this means that an optimization search initiated at a specific point {Φ0} ends up in the nearest local extremum {Φopt}. This often makes a straightforward application of both derivative-based and other types of local NLS inefficient.

The most straightforward and universal solution for the problem is using a global exhaustive search varying all possible combinations of phasor vector components with a discrete step. However, for the given step δΦ and N Tx-elements, the global full-exhaustive search requires computing the cost function (360/δΦ)N times which for the δΦ≈10° and N = 16 would lead to more than 1025 multiplications of complex matrices of size (Δrx·Δry·Δrz). The estimated computation time (up to several months on a high-performance workstation with multicores CPU) makes such an approach technically non-practical. In this work, we propose the computation strategy with an intelligent fusion of the brute-force approach and usage of local solvers to find a sufficiently good approximation of the global maximum of the cost function and, thus, globally optimal combined B1+-profile.

The complete flowchart scheme of the proposed computation method is shown in Fig 1. The goal of the first stage (Stage I) is to find a reasonable finite discrete representation of Fc(Δr,{Φ}) using the N-dimensional discrete phase space {Φ}. For this purpose, the discrete grid of phase vector with N components should be formed as [6] providing the “full-grid” discrete form of the cost function , where L denotes the total number of nodes in the grid.

thumbnail
Fig 1. Flowchart of the phase optimization algorithm.

At stage II, phase vectors {Φtop} providing cost function values Fc above 90% of maximum are selected as initial condition {Φ0} for iterative NLS-search and selection of the optimum vectors {Φopt}. The vector sets found by this algorithm undergo final selection by (additional) user-defined criteria in Stage III to provide final set(s) {Φfin}.

https://doi.org/10.1371/journal.pone.0255341.g001

The further approach relies on two observations: (i) the cost function Fc({Φ}) is periodic by each phase vector component and (ii) the set of values {Φtop} providing values above 90% of its top (Fctop} >0.9·max(Fc)) is formed by relatively tiny (<10−6) part of all vectors in full grid (see S1 Fig). Therefore, as the second step, the is randomly sub-sampled by the sampling operator Rs with the uniform probability density function (PDF). This operation selects Ls = LN/s vectors from , where s denotes the scaling factor of the effective nodes number reduction. In this work, the sub-sampling was performed using the uniform PDF. The usage of alternative sampling schemes (e.g. with Poisson disk PDF) for a further improvement of the computation efficiency of the proposed method can be also considered. As the result, this creates a “sub-sampled” grid: (7)

The phasor vector sub-space with the reasonably chosen subsampling by factor s≈103 for N = 8 can be used to find the great majority of the summit points of the cost function within practically usable computation time on the middle-class CPU or GPU.

Further, in Stage II, the vectors {Φtop} such as are used as starting points in the NLS-optimization of the problem (5) to find phasor vector {Φopt}. The number of vector Ntop forming {Φtop} typically varies in the range of several dozens depending on the under-sampling factor s and can be reduced if necessary by setting up a higher threshold for selecting the summit values (for example ). Depending on starting point and volume of the optimization region it requires NI≈5-10·103 optimization solver iterations to find the local optimum of the cost function for a single starting point in {Φtop}. This procedure is repeated iteratively (see Fig 1) to reduce the set of phasors {Φopt} to a sufficiently small amount (N<10) which can be considered as a good approximation of the global optimum. The B1c+ field shaped by these vectors {Φfin} can be analyzed based on various preference criteria for (e.g. minimal B1c+ gradients, maximal peak FA, best possible slice profile, etc) to select the variant used for the array default setup. In this work Stage II was performed using “fminsearch” Matlab function implementing the Nelder-Mead algorithm of optimization and known as less dependent from the starting point in comparison to other local solvers [24]. The performed in the Stage I preselection of the starting points allows for starting NLS-search close to the summits of the cost function and, thus, essentially increases the probability that the found solution vector will be globally optimal for the problem (5).

B1+-profiles of the transmit elements

The described B1+ optimization strategy relies on the knowledge of the complex B1+-maps of each Tx-element b1k(r). However, to acquire the experimental B1+-maps a fully functional prototype of the array is required. Moreover, measuring these maps experimentally with sufficient reconstruction quality is often a non-trivial problem. The destructive interferences lead to attenuation of the MR-signal and producing artifacts of reconstruction or voids in the B1+-maps. If this takes place in the region where B1+-shimming is targeted, the reliable optimization becomes problematic or requires significant efforts for data curation.

The alternative efficient approach is to use numeric electromagnetic simulations to compute the 3D distribution of the electromagnetic fields created by individual coil elements. After validating the simulated data by the experimental MR-measurements it can be further employed for the computation of the combined B1c+ in any arbitrarily placed region of interest. Moreover, this allows to test the capabilities of the specific array design in terms of tailoring desired B1+ profiles “in-silico” and perform necessary optimizations of both physical elements arrangement and electrical circuits [25].

In this work, the electromagnetic fields of the mTX-arrays were calculated using CST Studio Suite (3DS, Dassault Systems). The time-domain solver was used for the EM-simulation of the array’s structures and the whole phantom volume within the coil using 4·107 mesh cells. The average computation time using a dual-core Intel Xeon(TM) E2650 CPU and two NVIDIA Tesla(TM) K80 GPUs was 48 hours for each simulated structure. The simulated data have been exported as 3D complex H-field values with the isotropic spatial resolution voxels of 4x4x4mm. Further computations were performed using in-house developed Matlab scripts (Matlab 2017a/b, Mathworks, USA).

Testing and validation of the phase space under-sampling strategy

To test and validate the efficiency of Stage I strategy in terms of a sub-sampling factor the optimization of phase vector for the six paired elements (Tx1-Tx4, Tx6, Tx7) was performed. These elements provide ~90% of the total contribution in the combined B1+ field in the center of the ROI targeted for the testing. Reducing dimension made it possible to perform computation over a full phase space grid and grid undersampled by low factors s within a reasonable time. The full phase space grid GL was built according to [6] using L = 17 nodes. With the span of {ϕk} components within [-180°..180°] it provides the discretization step δϕ ≈ 200. The full grid was sampled according to [7] to create the sub-sampled grids Us with densities varying in a range of s = [1..104]. The grids with the different sub-sampling factors were used to test the fidelity of approximation of the cost function Fc by the sub-sampled version . This was done by comparison of the maximal values Fc and . These values were computed for the varied size of regions Δr and values of sub-sampling factor s to test if the sufficient amount of top values of the “full-grid” cost function Fc are preserved in the sub-sampled . The profiles of at different values were analyzed.

Usage of CPU and GPU

In general, the efficiency of GPU computations depends on the GPU performance index and dimensionality of the problem, and the size of processed arrays. For N = 8 elements no advantage of GPU usage was found. However, the situation changes significantly for a higher-dimensional problem with N = 16 elements optimization. For the CPU the computation time grows roughly proportional to the number of voxels in the optimized ROI. At the same time for the GPU, the computation rate for both Stage I and Stage II calculations remains practically independent of the size of computed arrays up to 2.5·104 voxels in the optimized ROI (corresponds to 12x12x12cm volume). S2 Fig shows that computation time with a modern GPU may become drastically shorter compared to the CPU already starting from a targeted volume of ~1300 voxels. Therefore, in practice, the decision of usage of GPUs within the computation pipeline should be based on benchmarking of small portions of the actual phase grid Us (e.g ~0.5%) for the particular dimensionality of the problem and optimization volume.

Comparison to a traditional optimal phase computation method

To analyze the improvement of the homogeneity and mean value of B1+ which can be obtained with our optimization method we compared it to the random multi-start optimization [6]. For N = 8 optimization a randomly selected set of 1000 phasor seed values with uniform distribution of components φk in the range of [-180°..180°] was used as a starting point. The NLS-search was performed using the function (5). To provide sufficient statistics for the NLS-search comparison, the number of initial phasors {Φ0} for the pre-optimized start search was set to be 250. This corresponded to a cost function cut-off level of ≈70% from the top (the cut-off at 90% typically provides ~20–30 phasors for Stage II). The comparison of the optimization methods was performed for an ROI within a spherical phantom P1 (described below). The ROI with dimensions Δr = 50x50x50mm was placed as shown in Fig 5. Because the major intrinsic B1+ inhomogeneity for the surface array is the gradient in the anterior-posterior (“Y-axis”) direction, the optimized B1c+ profiles were compared using the relative standard deviation of the mean intensity projection computed as and its relative absolute gradient . These projections were computed in z-direction for the slab Δz = 50mm.

As the final step, the comparison of an optimized and random multi-start was done for a high dimensional problem (N = 16 elements). The optimal phase vector search was done using the simulated elements B1+-maps of the array loaded by the numerical phantom P2 (described below). The dependence of the achieved optimization result from the number of starting phasors Nstart was analyzed using 3 repetitions of NLS-optimization with Nstart = 7500. The cumulative maximum of the cost function was computed as max ( [1..Nstart]), where Nstart increments from 1 to 7500.

Coils, phantoms and MRI validation of electromagnetic simulation data

The mTX-array used for the validation consists of two physically independent parts each comprised of eight loop elements. The array is connected to the scanner via a dedicated RF-interface (Rapid Biomedical, Rimpar, Germany) which allows for using connected coils both in sTx and pTX modes. A detailed description of the array design and hardware specifications can be found in [22]. The driving voltage phases of elements can be adjusted in two ways:

  1. Individually for each of 16 elements using phase-shifting coaxial cables connected to the dedicated sockets of the RF-interface.
  2. Pairwise for every two elements, using 8 RFPA channels available on the scanner in pTX regime.

Fig 2 shows a sketch of the array elements’ configuration and location in both anterior and posterior parts along with the elements pairing scheme for driving by 8TX channels in pTX regime. Two phantoms we used for testing the proposed method both in the numerical simulations and in the experimental validation. Phantom P1 comprised six rectangular cross-section PE bottles loading the posterior part, and a plexiglass sphere (diameter, 160mm) positioned inside the anterior part (Fig 2 left panel). The bottles and the sphere P1 were filled with a solution of sugar and NaCl mixed in an experimentally defined proportion providing permittivity close to the typical muscle tissue (ε≈58). The second phantom P2 represents an acryl glass sphere with a diameter of 200mm filled with PVP solution prepared as described in [22] and [26].

thumbnail
Fig 2.

(a) Experimental 8TX/16RX array with antisymmetric L-shaped elements used for demonstration of the described phase optimization method efficiency. The position of the 160mm (P1) and 200mm (P2) spherical phantoms used in both computation and MRI-measurement are shown. Panel (b) shows the coil elements of both parts (top array, bottom array) on the maximal intensity projection images (MIP) of the CST-computed B1+ field.

https://doi.org/10.1371/journal.pone.0255341.g002

The spherical phantom was chosen based on practical experience to minimize the number of destructive RF-interferences within the ROIs corresponding to the typical position of an animal’s heart relative to the array. Simultaneously, this mimics real animal studies where some of the anterior array elements are more distant from the loading tissue than others.

All MRI measurements were performed using a 7T Magnetom(TM) Terra whole body 7T scanner (Siemens Healthineers, Erlangen, Germany) equipped with an 8TX channels RFPA. The FA measurement for the B1+-map reconstruction was done with the double angle mapping (DAM) technique based on a GRE sequence [27]. Additionally, the B1+-maps were cross-validated with the vendor’s B1+-mapping pulse sequence (turbo-FLASH with magnetization preparation [28]) available on the scanner. For the GRE-DAM measurements, the parameters were: TE/TR = 1.8/4000ms, pixel resolution 2mm3, slice thickness 4mm. The sequence parameters for the Turbo-FLASH B1+-mapping were TE/TR = 1.7/9000ms, FA = 100 for the same spatial resolution. The coronal slice providing the best overview of the Tx-profiles of the anterior array elements was chosen for comparison with the CST-simulated maps. The FA-map reconstruction was performed using an in-house developed Matlab script. The agreement of simulated and measured data was checked numerically by linear regression between normalized 1D-profiles computed along the lines crossing the most prominent features of simulated and measured 2D FA-maps in each channel.

Testing the efficiency of the computed phases for B1+ optimization was done using the same GRE-sequence as for DAM measurements. The initial “zero” phasеs of the array were set by manual adjustment geometrically guessed phase distribution [22]. To visualize the effect of B1+ optimization the volumes inside the iso-surface SNR>30 were compared using 3D stack of GRE images. SNR was computed on pixel bases as SNR(x,y,z) = Si(x,y,z)/σm. The Si denotes signal intensity in pixel and σm is the mean standard deviation of the intensity of the pixels in the “noise pool” represented by 20x20 pixel regions in each corner of the FOV for all acquired slices.

Results

Fig 2A demonstrates the schematic of array elements and the position of the spherical phantom P1 in EM-simulations and MRI-measurements. Both panels of Fig 2B show the maximal intensity projection (MIP) of simulated magnitudes of B1+-fields for each element pair combined to be driven by each of eight Tx-channel (labeled accordingly). The enhanced lines on MIP of B1+-maps show the position of the element’s loops.

Fig 3A and 3B demonstrates exemplary results from the validation of the CST-computed B1+ maps for individual Tx-channels using MRI measurements. The plotted contour diagrams demonstrate a very good qualitative agreement in the 2D spatial distribution of the B1+-field between experiment and simulation. Additionally, the 1D profiles computed across characteristic features of the B1+-maps show good quantitative agreement. The computed coefficients of determination of the linear regressions between experimental and measured profiles are in the range R2 = [0.97–0.99]. This confirms the possibility of using the CST-simulated B1+ profiles for the optimization of the combined B1c+. The comprehensive validation of the array CST model with full 2D channel-by-channel B1+ maps can be found in

thumbnail
Fig 3. Comparison of simulated in CST (“CST”-label) and experimentally measured (“EXP”-label) B1+ fields of pig cardiac array using phantom P1.

Central coronal slice B1+-maps are simulated and measured in the spherical phantom. Labels A, P, H, F, L, R denotes corresponding spatial directions in the scanner coordinate system. Two normalized maps of individual Tx-channels of the array are shown as an example. The top row shows good agreement of 2D maps (in particular, positions of maxima and minima). The bottom row demonstrates 1D profiles (1) and (2) extracted along directions marked by dashed lines on 2D maps. The close agreement of simulated and measured profiles is confirmed by high coefficient of determination of linear regression between both data.

https://doi.org/10.1371/journal.pone.0255341.g003

Fig 4A demonstrates the results of testing the capability of using in the search of the top values a sub-sampled cost function to capture the summit points of the fully sampled function Fc. Plot 4a shows values computed using Us-spaces with sub-sampling factor in the range of s = [1– 104]. One can see that for s<103 the values of are very close to 1. The value Ntop denotes the number of vectors in the set{Φtop} forming the summit of , such as . This number reduces proportional to the sub-sampling factor (Fig 4B). That way, the preselection of vectors from a space performed in the Stage I should provide the same set of optimal starting points for the NLS-optimization as if a full grid would be used. Therefore, choosing sub-sampling factor s in the range 102..104 should provide the set of vectors {Φtop} which due to the sufficient proximity to the global maximum of the cost function will ensure high efficiency of the NLS-based optimization in Stage II.

thumbnail
Fig 4. Detection of the top decile of the cost function Fc using sub-sampled phase space Us.

Panel (a) shows the value computed using Us-grids with sub-sampling varied in the range of s = [1– 104]. Panel (b) demonstrates that the number of the vectors Ntop selected by Stage I is fairly inverse proportional to the size of the searched grid (defined by the factor s).

https://doi.org/10.1371/journal.pone.0255341.g004

Fig 5A demonstrates CST-simulated combined B1+-map in the spherical phantom P1. Маp were computed with a default zero phasor {φk} = 0. The marked square ROI was selected for the comparison of the standard (random-multi-start) and proposed novel optimization approaches. Fig 5B shows the B1+ field and the absolute value of its gradient demonstrating the existing B1-inhomogeneity prior to the optimization of the elements phases.

thumbnail
Fig 5.

Panel (a). The solid square marks the position (xy-projection) of the targeted optimization region within the spherical phantom. The region is used to compare two optimization methods efficiently. Panel (b) shows the mean intensity projection of B1c+ through z-direction computed before optimization (denoted as b1m+). The zoomed target region labeled with a dashed line on panel (a) is plotted. Panel (c) shows the absolute gradient of the b1m.+. An essential inhomogeneity of b1m.+ before the optimization is observed.

https://doi.org/10.1371/journal.pone.0255341.g005

Fig 6 depicts the results of the optimization of the phases for N = 8 paired elements performed with the standard random-multi-start approach and the proposed novel dual-stage method of phase optimization. Panel (a) demonstrates volume-rendered B1c+ cuts at the center of the optimization volume. Panel (b) shows the mean intensity projection of b1m over z-dimension within the 50mm slab. The optimized dual-stage multi-start approach provides better”focusing” of the homogeneous area of the B1c+ within the optimized volume in comparison to the random multi-start. The improved homogeneity metrics of <b1m> achieved with the dual-stage optimized multi-start method confirm the optical impression.

thumbnail
Fig 6. Comparison of the new phase-optimization methodology (“optimized multi-start”) with the conventional one (“random multi-start”).

Top panel (a) shows the 3D rendering of 50mm thick volumes of interest in different orientations. Panel (b) presents a mean intensity projection through the z-axis direction of the resulting B1c+. The optimized starting points of the phase search result in a significantly higher homogeneity of the B1c+-field.

https://doi.org/10.1371/journal.pone.0255341.g006

Fig 7A demonstrates a difference in the distribution of the final optimized phasor components found by the NLS optimization using (i) the pre-optimized starting vectors selected by Stage I and (ii) random starting vectors with uniformly distributed components. In both cases, the angular histograms for each phase component were built using the 250 phasors which provided the largest values of the cost function. One can see that pre-optimized phase vectors used as starting points for NLS-optimization provide very high coherence of the final resulted vectors. The individual components of the phasors are spread in relatively narrow angular sectors with the standard deviation not exceeding 200 (top panel). At the same time, the NLS-optimization with random starting points (bottom panel) results in the spread of phasor’s components in the range of up to 1800.

thumbnail
Fig 7.

Panel (a) shows the distribution of the optimized phasor components obtained by the optimized and random multi-start. The standard deviation does not exceed 200 (top panel) for the optimized multi-start. At the same time, the NLS-optimization with random starting points (bottom panel) gives rise to distribution with the spread reaching 1800. Panel (b) shows the dependence of the cost function grow-up rate Fc({Φopt})/Fc({Φ0}) from the norm of the difference of the starting and an optimal phasor vector ‖Φopt−Φ0‖.

https://doi.org/10.1371/journal.pone.0255341.g007

Fig 7B shows the dependence of the cost function progression ratio in the optimization process, defined as Fc({Φopt})/Fc({Φ0}), from the norm of the initial and the final phasor vector difference δΦopt = ‖Φopt−Φ0‖. One can see that the length of the “search trajectory” δΦopt is nearly constant (≈2 rad) for the pre-optimized starting phasors. At the same time, for the random starting phasors one observes a widely spread distribution of δΦopt in the range of 2 to 10 rad. The median value of the cost function grow-up rate performed with the pre-optimized starting phasors is ≈25% higher than that of random starting points (marked by dashed lines of corresponding colors).

Fig 8 demonstrates the efficiency of the random multi-start NLS-optimization for high dimensional (N = 16) problem (5) performed with 7500 starting vectors. Panel (a) shows the index of starting vector for the 25 highest final values of the cost function achieved by 3 repetitions of random multi-start NLS-optimization. For the convenience of comparison, the final function is normalized to the value at zero phase vector . The approximate timings of the computations for ~50% and full starting random vectors set are shown. The cost function value achieved with the optimized multi-start within comparable (~5hrs on GPU) computation time is shown. Panel (b) shows the cumulative value of the maximum of the normalized optimized cost function depending on the starting vector index. The dynamic of the reached optimized value demonstrates that optimization with a random multi-start for a high dimensional optimization problem (N = 16) requires the number of starting vectors essentially exceeding 1000.

thumbnail
Fig 8. Efficiency of the random multi-start NLS-optimization for high dimensional (N = 16) problem using 7500 random starting phasors.

Panel (a) shows the index of starting vector for the 25 top final values of the normalized optimized cost function . The approximate timings of the computations for ~50% and 100% of starting random vectors set are shown. Panel (b) shows the cumulative maximum of the normalized starting and optimized cost functions and depending on the starting vector index (red and blue lines respectively). The dashed blue line shows the level of the same value achieved using optimized multi-start with ~5·107 vectors in Stage I.

https://doi.org/10.1371/journal.pone.0255341.g008

Fig 9 demonstrates the results of optimization of the combined B1c+ shaped by N = 16 individual array elements in the marked white rectangle region of the phantom P2. The B1c+-field in the central sagittal and coronal slices of the optimized region for zero phase vector (left) and phase vectors found by random (middle) and proposed optimized multi-start optimization (right) are shown. The value of the normalized cost function , coefficient of variation and mean gradient demonstrate an improvement of B1c+ homogeneity achieved by the proposed method.

thumbnail
Fig 9.

Combined B1c+ in the targeted region (marked by the white lines) shaped with zero phase vector (left) and vectors found by random (middle) and optimized multi-start (right) optimization. Sagittal and coronal slices at positions respectively marked with dashed white lines are shown. The value of the normalized cost function , coefficient of variation and mean gradient demonstrate an improvement of B1c+ homogeneity achieved by the proposed method.

https://doi.org/10.1371/journal.pone.0255341.g009

Fig 10A and 10B show the GRE MR-images acquired using phase vector settings optimized for N = 8 channels in P1 phantom. Phases vector were set using the pTX adjustment platform of the scanner simulating usage of the array in the “pTX Compatibility Mode”. The mean SNR value in the optimized coronal region is improved by ≈60%. For the labeled on the panel (a) optimization region the ≈40% increase of the volume with SNR>30 was achieved as shown on panel (b). The average local gradient of SNR in the anterior-posterior direction was reduced by ≈50%.

thumbnail
Fig 10. GRE MR-images acquired using phase vector settings optimized for N = 8 channels in P1 phantom for default and optimized phase vectors.

SNR map with labeled optimization region is shown on panel (a). Panel (b) demonstrates iso-surface labeling volume of SNR>30 for images acquired before and after application of the optimized vector.

https://doi.org/10.1371/journal.pone.0255341.g010

Fig 11 shows the experimental result of the optimization of 16 components phase vector for the phantom P2. An optimized phase vector for driving voltages of array elements was set using phase shifters (coaxial cables) connected to the dedicated sockets of the RF-interface. B1+-map acquired in the central coronal slice of the optimization ROI labeled on the left panel. The central panel and right panels show B1+-map before and after setting optimized phases.

thumbnail
Fig 11. Result of optimization of 16 components phase vector for the phantom P2.

B1+-map acquired in the central coronal slice of the optimization ROI labeled on the left panel. The center and right panels show B1+-maps before and after setting optimized phases respectively.

https://doi.org/10.1371/journal.pone.0255341.g011

Discussion

In this paper, we demonstrated a relatively simple way to balance the computation time with the depth of optimization search targeted to reach the best possible homogeneity of the combined B1+-field achievable by the fine-tuning of the array by the developer before (or without) subject-specific adjustments in the scanner. The efficiency of the approaching to a globally optimal for the formulated optimization problem phase vector is expected to be close to the full phase space exhaustive search, however, achieved with a realistic computation time even for high-dimensional problems (N>16). To the best of our knowledge, the groups reporting the development of in-house designed transceiver arrays for UHF MRI usually use implicit or explicit pragmatic limitations on the depth of exploring of possible phase combinations in their approaches to the adjustment of the default phases. The results of this work demonstrate that the arbitrary set limitations on the number of starting vectors for the plain random multi-start optimization may lead to achieving essentially lower B1+-field homogeneity than using the proposed optimized multi-start strategy with adequately defined phase space raster for Stage I. In particular, with Nstart = 1000 starting vectors and N = 8 optimized phase components the random multi-start optimization using the cost function based on the coefficient-of-variation may lead to the relative gradient which is by factor 10 higher compared to the one achieved in the optimized multi-start. The analysis of the high dimensional problem (N = 16) confirms that arbitrary set limitations of the starting vectors set may limit the potentially achievable B1+-homogeneity. For example, as shown by Fig 8, for Nstart = 1000, the maximal cost function value achieved by the random-multi-start would be 20% lower than for the described strategy. The increased depth of the random multi-start search (up to the Nstart = 7500 initial vectors,) may be still insufficient to reach the same value of the cost function as provided by the proposed strategy with comparable computation time.

The problems of higher dimensions (N = 24..32 elements) should be preferably solved by stepwise iterative optimization. On each step, the dominant array’s elements for specific ROI can be selected, optimized, and combined using found phase vector. This combined section will be considered as a single element for the optimization of the remaining (or next part of) elements as it was proposed in the work [29]. The experimental validation of the optimized phasing by MRI measurements confirms increased uniformity of the B1+ leading to the improved homogeneity of the SNR in the optimized region. The results of testing the arrays for pig cardiac MRI with default B1-profiles optimized using the described technique were demonstrated both ex-vivo and in-vivo for the dedicated pigs arrays [22, 26] and as well as for the human arrays [30, 31]. The limitation of the current approach is that found globally optimal phase settings are valid in the mathematical context of the specific optimization problems i.e. “global maximum/minimum of the specific cost function(s) used in the optimization”. The achieved B1 is not necessarily optimal across the “global population” (animals or humans). Moreover, as we demonstrate in the work [30] the human model-based fixed phases optimization is most probably limited to a specific cohort (specific patient weight range and sex). Solution globally optimal to the population of human or animal subjects for the static B1-shimming would require using a machine or deep learning approach with a neuronal network trained to a large number of B1-maps representing the population [32]. Another approach providing B1+ optimization which is global regarding a population is a dynamic pTX-based B1+-shimming with using universal 3D kT pTX-pulses designed based on the multiple B1-maps acquired from the population [33]. Both these approaches, however, require the preliminary acquisition of patient-specific B1-maps using a fully operational array validated for SAR safety, whereas the described method is primarily targeted to the optimization at the hardware development and construction stage.

Conclusion

In this paper, we propose a time-efficient technique for the calculation of globally optimized hardware phases to be used for the construction and the subsequent optimization of transceiver arrays for UHF MRI. The proposed methodology combines the advantages of full coverage of phase space by an exhaustive search with high computational time efficiency of the solver-based optimization. Besides a final hardware adjustment of phases, the proposed approach allows for a rapid evaluating of the B1+-shimming capability of different array architectures “in silico”. The proposed technique has demonstrated its efficiency in the development of the transceiver arrays with the complex antisymmetric geometry of the elements which does not allow for simple geometry-based guessing of the default hardware phases.

Supporting information

S1 Fig.

(a) Cost function computed for the B1-fields generated by each of 8 individual Tx-channels combined from the array element pairs as shown by Fig 2B. Panel (b): Cost function Fc computed for the combined B1c+ of 6 dominant Tx-channels (Tx1-Tx4, Tx6, Tx7) using a fragment of fully sampled phase space grid GLN with δϕ = 1°. The numerous local extremums of Fc makes it difficult to find the global optimum using local optimization solver search. Newertheless, the periodicity of Fc provides possibility to find a sufficient number of phase vectors providing the Fc values close to the global maximum (above the dashed line). This can be performed by the a relatively quick brute-force scanning over the phase grid ULN randomly sub-sampled from GLN with adjustable density factor s.

https://doi.org/10.1371/journal.pone.0255341.s001

(TIF)

S2 Fig. Computation performance of both stages of the optimized multi-start process.

In both stages using of GPU brings acceleration of the computations starting from the number of voxels in the optimization region exceeding ~1300. One can notice, that the speed of CPU computation remains practically constant by a 6-fold increase of the optimization volume whereas for CPU computation the time increases linearly up to the number of voxels ~2200 and even faster by larger arrays size. The CPU type used was AMD Ryzen 9 3950x/16 cores. GPU type: GeForce RTX 2080 Titan, 68 multiprocessors, 1.545GHz, Matlab compute capability index = 7.5.

https://doi.org/10.1371/journal.pone.0255341.s002

(TIF)

References

  1. 1. Trattnig S, Springer E, Bogner W, Hangel G, Strasser B, Dymerska B, et al. Key clinical benefits of neuroimaging at 7T. Neuroimage. 2018;168:477–89. Epub 2016/11/17. pmid:27851995; PubMed Central PMCID: PMC5832016.
  2. 2. Niendorf T, Schulz-Menger J, Paul K, Huelnhagen T, Ferrari VA, Hodge R. High Field Cardiac Magnetic Resonance Imaging: A Case for Ultrahigh Field Cardiac Magnetic Resonance. Circ Cardiovasc Imaging. 2017;10(6). Epub 2017/06/15. pmid:28611118.
  3. 3. Niendorf T, Schulz-Menger J. [Cardiovascular ultrahigh field magnetic resonance imaging: challenges, technical solutions and opportunities]. Radiologe. 2013;53(5):422–8. Epub 2013/04/25. pmid:23613023.
  4. 4. Niendorf T, Graessl A, Thalhammer C, Dieringer MA, Kraus O, Santoro D, et al. Progress and promises of human cardiac magnetic resonance at ultrahigh fields: a physics perspective. J Magn Reson. 2013;229:208–22. Epub 2013/01/08. pmid:23290625.
  5. 5. Graessl A, Renz W, Hezel F, Dieringer MA, Winter L, Oezerdem C, et al. Modular 32-channel transceiver coil array for cardiac MRI at 7.0T. Magnetic resonance in medicine. 2014;72(1):276–90. Epub 2013/08/02. pmid:23904404.
  6. 6. Oezerdem C, Winter L, Graessl A, Paul K, Els A, Weinberger O, et al. 16-channel bow tie antenna transceiver array for cardiac MR at 7.0 tesla. Magnetic resonance in medicine. 2016;75(6):2553–65. Epub 2015/07/18. pmid:26183320.
  7. 7. Erturk MA, Raaijmakers AJ, Adriany G, Ugurbil K, Metzger GJ. A 16-channel combined loop-dipole transceiver array for 7 Tesla body MRI. Magnetic resonance in medicine. 2017;77(2):884–94. Epub 2016/02/19. pmid:26887533; PubMed Central PMCID: PMC4988950.
  8. 8. Hosseinnezhadian S, Frass-Kriegl R, Goluch-Roat S, Pichler M, Sieg J, Vit M, et al. A flexible 12-channel transceiver array of transmission line resonators for 7T MRI. J Magn Reson. 2018;296:47–59. Epub 2018/09/12. pmid:30205313.
  9. 9. Erturk MA, Li X, Van de Moortele PF, Ugurbil K, Metzger GJ. Evolution of UHF Body Imaging in the Human Torso at 7T: Technology, Applications, and Future Directions. Top Magn Reson Imaging. 2019;28(3):101–24. Epub 2019/06/13. pmid:31188271; PubMed Central PMCID: PMC6587233.
  10. 10. Snyder CJ, Delabarre L, Moeller S, Tian J, Akgun C, Van de Moortele PF, et al. Comparison between eight- and sixteen-channel TEM transceive arrays for body imaging at 7 T. Magnetic resonance in medicine. 2012;67(4):954–64. Epub 2011/11/22. pmid:22102483; PubMed Central PMCID: PMC3290686.
  11. 11. Thalhammer C, Renz W, Winter L, Hezel F, Rieger J, Pfeiffer H, et al. Two-dimensional sixteen channel transmit/receive coil array for cardiac MRI at 7.0 T: design, evaluation, and application. Journal of magnetic resonance imaging: JMRI. 2012;36(4):847–57. Epub 2012/06/19. pmid:22706727; PubMed Central PMCID: PMC3445730.
  12. 12. Schmitter S, DelaBarre L, Wu X, Greiser A, Wang D, Auerbach EJ, et al. Cardiac imaging at 7 Tesla: Single- and two-spoke radiofrequency pulse design with 16-channel parallel excitation. Magnetic resonance in medicine. 2013;70(5):1210–9. Epub 2013/09/17. pmid:24038314; PubMed Central PMCID: PMC3960017.
  13. 13. Schmitter S, Moeller S, Wu X, Auerbach EJ, Metzger GJ, Van de Moortele PF, et al. Simultaneous multislice imaging in dynamic cardiac MRI at 7T using parallel transmission. Magnetic resonance in medicine. 2017;77(3):1010–20. Epub 2016/03/08. pmid:26949107.
  14. 14. Schmitter S, Schnell S, Ugurbil K, Markl M, Van de Moortele PF. Towards high-resolution 4D flow MRI in the human aorta using kt-GRAPPA and B1+ shimming at 7T. Journal of magnetic resonance imaging: JMRI. 2016;44(2):486–99. Epub 2016/02/04. pmid:26841070; PubMed Central PMCID: PMC4947022.
  15. 15. Schmitter S, Wu X, Ugurbil K, Van de Moortele PF. Design of parallel transmission radiofrequency pulses robust against respiration in cardiac MRI at 7 Tesla. Magnetic resonance in medicine. 2015;74(5):1291–305. Epub 2014/11/21. pmid:25411131; PubMed Central PMCID: PMC4437923.
  16. 16. Setsompop K, Wald LL, Alagappan V, Gagoski BA, Adalsteinsson E. Magnitude least squares optimization for parallel radio frequency excitation design demonstrated at 7 Tesla with eight channels. Magnetic resonance in medicine. 2008;59(4):908–15. Epub 2008/04/03. pmid:18383281; PubMed Central PMCID: PMC2715966.
  17. 17. Dietrich S, Aigner CS, Kolbitsch C, Mayer J, Ludwig J, Schmidt S, et al. 3D Free-breathing multichannel absolute B 1 + Mapping in the human body at 7T. Magnetic resonance in medicine. 2020. Epub 2020/12/08. pmid:33283915.
  18. 18. Metzger GJ, Snyder C, Akgun C, Vaughan T, Ugurbil K, Van de Moortele PF. Local B1+ shimming for prostate imaging with transceiver arrays at 7T based on subject-dependent transmit phase measurements. Magnetic resonance in medicine. 2008;59(2):396–409. Epub 2008/01/30. pmid:18228604; PubMed Central PMCID: PMC4406352.
  19. 19. Grassl A, Winter L, Thalhammer C, Renz W, Kellman P, Martin C, et al. Design, evaluation and application of an eight channel transmit/receive coil array for cardiac MRI at 7.0 T. Eur J Radiol. 2013;82(5):752–9. Epub 2011/09/17. pmid:21920683.
  20. 20. Dieringer MA, Renz W, Lindel T, Seifert F, Frauenrath T, von Knobelsdorff-Brenkenhoff F, et al. Design and application of a four-channel transmit/receive surface coil for functional cardiac imaging at 7T. Journal of magnetic resonance imaging: JMRI. 2011;33(3):736–41. Epub 2011/03/26. pmid:21438067.
  21. 21. Weinberger O, Winter L, Dieringer MA, Els A, Oezerdem C, Rieger J, et al. Local Multi-Channel RF Surface Coil versus Body RF Coil Transmission for Cardiac Magnetic Resonance at 3 Tesla: Which Configuration Is Winning the Game? PLoS One. 2016;11(9):e0161863. Epub 2016/09/07. pmid:27598923; PubMed Central PMCID: PMC5012568.
  22. 22. Elabyad IA, Terekhov M, Stefanescu MR, Lohr D, Fischer M, Schreiber LM. Design of a novel antisymmetric coil array for parallel transmit cardiac MRI in pigs at 7T. J Magn Reson. 2019;305:195–208. Epub 2019/07/16. pmid:31306985.
  23. 23. Metzger GJ, Auerbach EJ, Akgun C, Simonson J, Bi X, Ugurbil K, et al. Dynamically applied B1+ shimming solutions for non-contrast enhanced renal angiography at 7.0 Tesla. Magnetic resonance in medicine. 2013;69(1):114–26. Epub 2012/03/24. pmid:22442056; PubMed Central PMCID: PMC3387512.
  24. 24. Nelder JA, Mead R. A Simplex-Method for Function Minimization. Comput J. 1965;7(4):308–13. WOS:A1965CLA3200015.
  25. 25. Kozlov M, Turner R. Fast MRI coil analysis based on 3-D electromagnetic and RF circuit co-simulation. J Magn Reson. 2009;200(1):147–52. Epub 2009/07/03. pmid:19570700.
  26. 26. Elabyad IA, Terekhov M, Lohr D, Stefanescu MR, Baltes S, Schreiber LM. A Novel Mono-surface Antisymmetric 8Tx/16Rx Coil Array for Parallel Transmit Cardiac MRI in Pigs at 7T. Sci Rep. 2020;10(1):3117. Epub 2020/02/23. pmid:32080274; PubMed Central PMCID: PMC7033245.
  27. 27. Cunningham CH, Pauly JM, Nayak KS. Saturated double-angle method for rapid B1+ mapping. Magnetic resonance in medicine. 2006;55(6):1326–33. Epub 2006/05/10. pmid:16683260.
  28. 28. Chung S, Kim D, Breton E, Axel L. Rapid B1+ mapping using a preconditioning RF pulse with TurboFLASH readout. Magnetic resonance in medicine. 2010;64(2):439–46. Epub 2010/07/29. pmid:20665788; PubMed Central PMCID: PMC2929762.
  29. 29. Tao Y, Hess A., Robson , editor A two-stage RF shimming method for 7T human first-pass myocardial perfusion. International Society of Magnetic Resonance in Medicine Annual Meeting; 2016; Singapore.
  30. 30. Terekhov M, Elabyad IA, Resmer F, Lanz T, Reiter T, Lohr D, et al., editors. Customized B1+-Shaping using Multi-Channel Transceiver Array Prototype for 7T Cardiac MRI with Central Elements Symmetry. ISMRM Anual Meeting; 2019; Virtual Meeting.
  31. 31. Elabyad IA, Terekhov M, Bille M, Schreiber LM. Elabyad IA, Terekhov M, et al. Design and implementation of two 16-element antisymmetric transceiver coil arrays for parallel transmission human cardiac MRI at 7 Tesla. IEEE Transactions on Microwave Theory and Technic. 2021.
  32. 32. Ianni JD, Cao Z, Grissom WA. Machine learning RF shimming: Prediction by iteratively projected ridge regression. Magnetic resonance in medicine. 2018;80(5):1871–81. Epub 2018/03/25. pmid:29572990; PubMed Central PMCID: PMC6107437.
  33. 33. Aigner CS, Dietrich S, Schmitter S. Three-dimensional static and dynamic parallel transmission of the human heart at 7 T. NMR in biomedicine. 2021;34(3):e4450. Epub 2020/12/17. pmid:33325581.