(image) (image) [Previous] [Next]

Modeling of Defect Related Reliability Phenomena
in SiC Power-MOSFETs

Chapter 5 Measurements, Simulations and Results

BTI and gate-leakage currents are considered major reliability threats responsible for device parameter degradation in SiC MOSFETs. Their characterization and simulation by accurate models are presented within this section. The Si-face interface of SiC to its deposited or grown native oxide has been intensively studied in the past. Thus, investigations of charge trapping by inelastic tunneling processes as presented here, mainly focusing on lateral MOS test structures and commercially available DMOS devices.

First, BTI is analyzed on lateral MOSFET test-structures by employing a bias pulse technique to separate fast and slow degradation components for both pBTI and nBTI degradation. Based on these findings different DMOS technologies are characterized by a large measurement data set and, using the newly developed ESiD algorithm, therefrom defect parameters are extracted which reveal similarities, but also differences among the defect bands in the SiC MOSFETs.

In the second section, gate leakage currents are analyzed based on enhanced temperature activated TAT currents observed at low to medium field strengths employing a SiC/SiO2 MOSCAP, a MOSFET, and a MIM capacitor with ZrO2, as used in RAM applications. The newly proposed TAT model is demonstrated and reveals details about the charge trapping kinetics within these structures and by a comparison of the defect parameters with ab-initio computations, a class of defects is identified as TAT transition centers in both binary oxides.

5.1 Bias Temperature Instabilities in SiC MOSFETs

The results within this section have been previously published in [CSC8],[CSJ6] and [CSC1].

This section aims to reveal the underlying physical mechanisms that cause the main features of BTI as observed in SiC MOSFETs, which are frequently compared to BTI in Si technologies [88, CSC5, 90] by applying the same characterization and data analysis methods. For this purpose, defect parameters are extracted by device scale simulations and restricted to be consistent within bounds defined by ab-initio calculations of likely defect candidates. Similarities and differences among different SiC DMOSFET technologies and Si based MOSFETs in terms of the origin of BTI are also discussed.

5.1.1 Lateral Test Structures

Some of the advantages of studying the impact of charge trapping as the main reason for (math image) and Ron shifts on lateral structures include:

  • • A uniform electric field distribution across the oxide can be assumed for small enough drain bias.

  • • Geometrical effects, e.g. edge effects due to trench sidewalls, can be neglected.

  • • A small contact resistance is assumed and an additional series resistance as in vertical device architectures is not expected to influence on the ∆Vth extraction.

  • • The band offsets and other material parameters, e.g. transition region width between SiC and SiO2, are best studied for the Si-face SiC/SiO2 interface, compared to a-face, c-face or m-face surface planes.

Hence, planar SiC/SiO2 MOSFETs with channel dimensions of W \( \times \)L = 100 \( \times \)2 µm2, 4 µm2 and 6 µm2 and an oxide layer with a thickness of about 70 nm have been investigated. After the oxide deposition, the devices have received an annealing process step in NO ambient.

An initial (math image)( (math image)) curve to convert the measured drain current into ∆Vth is recorded as shown in Figure 2.2 on a pristine device for each BTI stress sequence. To determine the threshold voltage shift with a reduced perturbation of the (math image)( (math image))-curve due to charge trapping during the gate bias sweep, these time-zero transfer characteristics are measured at a rate of \( R \) = 50 V/s and within a narrow bias range of only up to about 1 V above the threshold voltage. The selected sweep rate is the fastest for the chosen current limit of \( I_\mathrm {max} = \SI {1}{\micro \ampere } \) for the ultra-low noise Defect Probing Instrument (DPI) used for this characterization. A constant current criterion of ID = 100 nA is used to convert ID to ∆Vth and the drain bias has been scaled with L by VD = 0.1 V, 0.2 V and 0.3 V. Without any additional delay after recording the pristine (math image)( (math image)) characteristics, an eMSM scheme is applied with stress times ranging from tstr = 1 × 10−6 s to 1 × 104 s and recovery times trec = 10 s to 1 × 105 s with the first measurement point extracted after \( t^\mathrm {delay} \) = 100 µs. For the PBTI characterization, the stress bias has been selected as \( V_\mathrm {G}^\mathrm {s} \) = 30 V, 37.5 V and 45 V, resulting in approximate oxide stress fields of \( E_\mathrm {ox}^\mathrm {str} \approx \) 3.5 MV/cm, 4.6 MV/cm and 5.7 MV/cm. The recovery bias \( V_\mathrm {G}^\mathrm {r} \) is chosen according to the extracted \( V_\mathrm {th,0} \approx \) 4.4 V. All measurements have been conducted on the custom-built low-noise DPI presented in Section 2.3 and each eMSM scheme is recorded on a fresh n-MOSFET.

The material parameters that were used to calculate the electrostatic quantities in Comphy (c.f. Section 4.6.1) are listed in Table 5.1 for the channel substrate and in Table 5.2 for the insulating layer. The transition of the band edges between the channel and the oxide is assumed to be abrupt and the band edges are thus linearly interpolated for the simulation within the first 5 Å as shown in the band diagrams in Figure 5.1.

quantity ref. value unit
\( E_\text {G,0} \) band gap at 0 K [22] 3.36 eV
\( E_\text {G,1} \) temperature coefficient [247, 248] −3.3 × 10−4 eV/K
\( m_\mathrm {l} \) lateral el. eff. mass [249, 29] 0.33 \( m_\mathrm {e} \) kg
\( m_\text {t,0} \) transversal el. eff. mass [249, 29] 0.42 \( m_\mathrm {e} \) kg
\( m\sub {a}...m\sub {i} \) coeff. for eff. mass val. band [33] see ref 1
\( N_\text {cv0} \) band weight [22] 2.54 × 1019 1/cm3
\( M_\text {c} \) conduction band minima [22] 3 1
\( \varepsilon _\text {r,chan} \) relative permittivity channel [22] 9.76 1
\( \Delta E_\mathrm {w,0} \) channel/gate work. func. diff. [22] −1.2 eV
\( N\sub {a,chan} \) acceptor doping concentration 2 × 1017 1/cm3
\( N\sub {d,chan} \) donor doping concentration 1 × 1010 1/cm3

Table 5.1: Channel parameters to calculate the electrostatic quantities in Comphy. Note that \( m_\mathrm {e} \) donates the electron mass.

quantity ref. value unit
\( E\sub {G} \) band gap 9.0 eV
\( E\sub {off} \) channel/oxide \( E_\mathrm {v} \) offset [22] \( -4.68 \) eV
\( m\sub {t} \) tunneling mass [250] 0.42 \( m_\mathrm {e} \) kg
\( \varepsilon _\text {r,ox} \) rel. permittivity oxide 3.9 1
tox thickness of the oxide layer 70 nm

Table 5.2: Input quantities used for the SiO \( _\mathrm {2} \) layer. Note that \( m_\mathrm {e} \) donates the electron mass.

For the extraction of the defect parameters, defects are considered uniformly distributed on an equidistant spatial grid \( x_\mathrm {T} \) and normally distributed on a equidistant energetic grid ( \( E_\mathrm {R} \), \( E_\mathrm {T} \)). Each grid point ( \( x_\mathrm {T} \), \( E_\mathrm {R} \), \( E_\mathrm {T} \)) is weighed by this distribution to yield the selected total defect density \( N_\mathrm {T} \), with the grid parameters given in Table 5.3.

parameter value unit
\( \Delta E\sub {T} \) trap energy level 0.1 eV
\( \Delta S \) relaxation energy level 0.1 eV
\( \Delta x\sub {T} \) spatial distribution 0.05 nm
\( p\sub {tol} \) cut-off probability 1 × 10−4 1

Table 5.3: Grid parameters used in Comphy. The distance between the points are given for the three dimensional grid together with a cut-off probability \( p_\mathrm {tol} \) for the Gaussian tails.

Using these settings, the defect parameters of two bands are extracted by minimizing the difference between the simulation results and the measurement data by a Nelder-Mead least-square minimization algorithm [245] as implemented in scipy. As initial guess, a shallow defect band extracted on Si/SiO2 technology [235] has been used and with ET close to \( E_\mathrm {c,SiC} \) that is sampled up to 3 nm into the oxide. A good agreement with the measurement data for the tails of the recovery traces is revealed in Figure 5.1 (top). For the calculation of the remaining fast recovering ∆Vth defects with small relaxation energies and located within the transition region at a distance of up to 5 Å from the interface have been introduced, resulting in a fast electron trap band (EB). These defects define the charge transfer kinetics as required to explain the fast recovering component of ∆Vth and lead to degradation of ∆Vth \( > \) 2 V for the longest stress phases. Note that the small \( E_\mathrm {R} \) used in the fast EB effectively yield transition rates that can also be approximated by employing the extended-SRH model, as discussed in Chapter 4 and in detail by Ruch et. al [208].

(image)

Figure 5.1: The measured positive ∆Vth (circles) of three MSM measurement schemes employing three different positive stress biases at a temperature of \( T \) = 30 °C, is well described by the calibrated simulation (lines) (bottom). Thereby, two defect bands are used, as indicated in the band-diagrams, of which one (shallow electron trap band) accounts for the slowly degrading and recovering ∆Vth part (top), and the other (fast electron trap band) for the fast component of ∆Vth degradation and recovery (center). Note that the shallow defects which explain the tails of the recovery traces employ similar parameters as extracted for Si-based devices in [235]. (taken from [CSC8])

parameter fast EB shallow EB HB 1 HB 2 unit
\( \overline {E}\sub {T} \pm \sigma _{E_\text {T}} \) \( 1.87\pm 0.08 \) \( 1.73\pm 0.17 \) \( -1.12\pm 0.24 \) \( -1.77\pm 0.04 \) eV
\( \overline {E}_\mathrm {R} \pm \sigma _{E_\text {R}} \) \( 0\phantom {.00}+0.85 \) \( 4.93\pm 1.95 \) \( 5.2\phantom {-}\phantom {0}\pm 4.89 \) \( 0\phantom {-}\phantom {.00}+1.55 \) eV
\( R \) 0.9 0.437 1.19 0.6 1
\( N\sub {T} \) 2.8 × 1019 3.44 × 1019 1.26 × 1019 7.3 × 1018 1/cm3

Table 5.4: Four defect bands are employed for reproducing a large variation of PBTI and NBTI characteristics of lateral SiC-MOSFETs. The electron traps are concentrated around the conduction band edge, while the whole traps reach further into the band gap from the valence band edge. For both types, a fast defect band (fast EB and HB2) with low \( E_\mathrm {R} \) is used to reproduce the interfacial defect characteristics.

(image)

Figure 5.2: A pulsed MSM scheme is used to separate the contributions of fast and slow recovering ∆Vth.To this end a pulse with (math image) = 0 V or −5 V is applied after the stress sequence for 10 s and before the recovery trace is recorded. The resulting recovery traces show a reverse characteristic for the depletion regime pulse (VG= 0 V), which is completely reconstructed by the shallow electron trap band in the simulation (top, left). The fast traps are emptied by both pulse biases (top, right). The accu- mulation regime pulse enables hole trapping close to the valence band, that shifts the recovery traces to more negative ∆Vth, which is accounted for by two additional hole trapping bands (bottom, left). A superposition of all bands explains the characteristic recovery traces for all conditions (bottom, right). (taken from [CSC8])

To investigate the impact of bias switches to the depletion and accumulation regime, as for instance proposed in [87] for an evaluation of Vth shifts in parallel for many devices, pulsed MSM sequences have also been measured and included in the fitting routine, as shown in Figure 5.2. Thereby, a pulse at a bias level in the depletion ( \( V_\mathrm {G}^\mathrm {p} \) = 0 V) or accumulation ( \( V_\mathrm {G}^\mathrm {p} \) = −5 V) regime is applied after the stress phase before the recovery phase is started for \( t^\mathrm {p} \) = 10 s. Initially, only the pulse is applied without stress to investigate the impact of the pulse on the pristine device.

When splitting the contributions of the defect bands in the simulation, it is revealed that the shallow EB solely accounts for the characteristic recovery traces for a pulse bias of \( V_\mathrm {G}^\mathrm {p} \) = 0 V, shown in Figure 5.2 (top). Note the degradation of ∆Vth at the beginning of the recovery traces. This increasing ∆Vth at recovery conditions is a result of electron capture of a small fraction of defects in the shallow EB, that previously emitted charge during the pulse phase. Quite to the contrary, all defects in the fast band emit the previously captured electrons during the pulse duration. Furthermore, the application of an accumulation pulse results in a different recovery characteristic, starting at negative ∆Vth. This NBTI characteristics is a result of hole capture and can be accounted for by the introduction of two hole trap bands (HB1, HB2) with parameters listed in Table 5.4. These bands together with the two EBs are able to reproduce the peculiar ∆Vth recovery at all pulse conditions shown in Figure 5.2 (bottom).

(image) (image)

Figure 5.3: A ramped voltage stress (RVS) MSM scheme to capture the bias dependence of NBTI at \( T = \SI {30}{\celsius } \) (left). Initially a readout is performed without prior stress for 1 × 105 s (black) showing that electron capture is already happening at the readout voltage close to Vth. The temperature activation of the charge transfer kinetics in the simulation (lines) for both positive and negative bias stress is confirmed by a good agreement of the recovery trends when compared to the measurement data (circles) (right). (taken from [CSC8])

Moreover, the NBTI bias dependence is extracted by measuring a RVS MSM scheme with an initial readout at the recovery bias and stress bias variation in the range of \( V_\mathrm {G}^\mathrm {s} \) = −2.5 V to −25 V at fixed stress time of \( t^\mathrm {s} \) = 1 ks and readout time of \( t^\mathrm {r} \) = 10 ks, as shown in Figure 5.3 (left). It has to be noted that even if the extraction of NBTI in an nMOS is operation relevant when bipolar gate drivers are used, a large fraction of defects capturing holes at negative bias already have emitted their charge before the read-out phase. Hence, only a small fraction of defects contributing to NBTI is experimentally accessible with MSM sequences conducted in n-channel MOSFETs. Note that the NBTI results on experimental pMOS structures investigated in [251] show significantly larger (math image) shifts compared to those of the n-MOSFETs presented in this work.

The accurate reproduction of the temperature activation in the simulation is mainly influenced by the relaxation energy \( E_\mathrm {R} \) within the NMP model as shown in Figure 5.3 (right) for both PBTI and NBTI MSM sequences. Again, in Figure 5.3, the impact of an initial readout phase at VG \( \approx \) Vth prior to the stress phase is shown by the black line. This readout shows that already the application of the readout bias leads to electron trapping and hence (math image) shifts of more than ∆Vth = 0.1 V for tread = 10 ks.

(image)

Figure 5.4: Recovery traces on a trench MOS can be reconstructed with the same defect parameters for the long-term tails, but require an increased defect density for the fast EB. Although, a higher mobility and \( \mathit {SS} \) (c.f. (math image)( (math image)) curve in inset) is reached on the a-face interface compared to lateral MOSFETs, the short term degradation is slightly increased.(taken from [CSC8])

To further strengthen the hypothesis of the shallow EB band being an intrinsic property of the oxide, a PBTI MSM sequence has been measured on a commercially available trench MOSFET [252, 253]. The inset of Figure 5.4 shows the initial transfer characteristics used to extract (math image) with steeper \( \mathit {SS} \) and lower (math image) compared to the lateral device. Thus, the device electrostatics in Comphy have been adapted for the reduced initial (math image) by changing the channel doping concentration to compensate for the difference. The first point of the recovery trace could not be resolved below 100 ms due to the large input capacitance of the trench device. As shown, the long-term recovery behavior is again captured well by the simulation and can solely be described by the shallow EB with the same parameters as given in Table 5.4. Increasing the defect density of the fast EB by about a factor of 1.5, finally leads to a reasonable agreement of the computed short term ∆Vth with the measurement. The increased density thereby compensates for the different interfacial properties obtained on Si-face when compared to a-face terminated SiC MOSFETs. It has to be emphasized that the comparatively large PBTI shifts in SiC MOSFETs does not contradict the fact that PBTI is almost negligible in Si/SiO2 technologies, as the active defects are not accessible in the Si based devices due to the difference of about 0.3 eV in the conduction band offsets between Si/SiO2 and SiC/SiO2 [254]. A comparison of the PBTI degradation of different SiC technologies [CSC5] to Si-based MOSFETs is shown in Figure 5.5 together with a comparison of the AERs available in both systems. In order to acquire only the long-term BTI component for the SiC characterization, a preconditioning scheme employing negative pulses has been used to remove the fast degrading and recovering components. Thus, the long-term degradation shows a similar trend as in the Si technology with an increased absolute ∆Vth of up to a factor of 200.

(image) (image)

Figure 5.5: PBTI quantified by ∆Vth over stress time of different SiC MOSFETs compared to a Si MOSFET with 8 to 200 times larger degradation in the SiC MOSFETs with a similar time evolution trend (left) (reproduced from [CSC5]). Assuming that the same bulk SiO2 defects are present in both device types, the enhanced PBTI can be explained as a result of the energetically elevated AER within the oxide for positive bias operation (right) (taken from [CSC8]).

5.1.2 Comparison of different DMOS Technologies

As large differences in (math image) shifts have been reported for different SiC MOSFETs [255, 88] for both negative and positive BTI, three different n-channel DMOSFETs from two vendors are analyzed in terms of their BTI recovery trends and defect distributions that explain each peculiar ∆Vth recovery characteristics. The differences may arise from various influences during device manufacturing such as POA under different nitrogen containing precursors (NO, NO \( _\mathrm {2} \), NH \( _\mathrm {3} \)) and annealing temperatures or channel counter-doping for increased mobility [52, 53] and off-axis epitaxial growth.

With the same argument as in Section 5.1.1 that the Si-face SiC/SiO2 interface is best studied compared to other SiC surface terminations, three different DMOSFET devices are analyzed. Thereby two technologies are the second (T1/G2) [256] and third generation (T1/G3) [257] of the same vendor, and the third one is the first generation of a different vendor (T2/G1) [258]. In order to guarantee comparability, devices with similar oxide thicknesses were chosen, differing only in a few nanometers, so that the applied stress biases result in similar electric field strengths.

(image)

(image)

Figure 5.6: A comparison of recovery traces of Vth of measurements (circles) and simulations (lines) of DC MSM sequence (top) for stress times of \( t^\mathrm {str} \) = 1 × 10−7 s to 1 × 104 s at constant stress bias of (math image) = 25 V and two \( T \) = 298 K and 448 K is shown for varying maximum recovery times \( t^\mathrm {rec} \). Note that the first point of recovery is measured at \( t^\mathrm {delay} \) = 1 µs only after the stress period. The first phase shown is a relaxation phase with constant (math image) forced through the channel via the feedback loop, showing no significant ∆Vth. Recovery traces after short term bipolar AC stress at a constant stress time of 100 ms at 50 kHz and varying VGH and VGL are reproduced in detail by the simulation (bottom). The increasing intensity of the background color indicates the advanced time within the last AC period of the stress signal before the interruption for VGL (blue) and VGH (red). (taken from [CSJ6]).

Contrary to the lateral test structures, the measurement setup used to extract (math image) shifts [147] consists of a feedback loop, in which an operational amplifier forces a constant drain current of ID = 1 mA through the channel by regulating VG at fixed VD (c.f. Figure 2.7 (right)). Besides the direct readout of ∆Vth \( (t) \) = \( V_\mathrm {G}(t) - V_\mathrm {G}(0) \), an additional advantage is that the first read-out can be conducted after only \( t^\mathrm {delay} \approx \) 1 µs. Two types of MSM schemes are used to characterize the short and long-term ∆Vth shifts. For operation relevant short term shifts, an AC MSM scheme [83] is used (c.f. Figure 2.4 (top)), in which an AC stress signal is applied for tstr = 100 ms at a frequency of \( f = \SI {50}{\kilo \hertz } \) at a constant high VGH and low bias VGL. This stress phase is interrupted at different points within the last AC signal period, and a recovery trace is recorded for 10 ms by forcing the constant threshold current through the channel as described above. This scheme is repeated for 20 different interruption points that are logarithmically distributed across the high and low phase of the last AC period of the stress phase. The total AC stress MSM sequence has then been permuted for different elevated operation voltages of VGH = 15 V and 20 V and VGL = −2 V, −5 V and −7 V. While this sequence covers all the short term drifts of (math image) expected during an AC signal period at regular operation, additionally DC CVS MSM sequences (c.f. Figure 2.4) with increased stress bias ( \( V_\mathrm {G}^\mathrm {str} = \SI {25}{\volt } \)) have been measured to capture the long-term degradation. In order to justify the extrapolation of ∆Vth beyond the experimental time window and to obtain the temperature activation of charge trapping, all sequences have been repeated at \( T \) = 448 K.

Symbol

Quantity

T1/G2

T1/G3

T2/G1

Unit

Qf

fixed Charge

2 × 1011

1.4 × 1012

0

C/cm2

tox

oxide thickness

37

33

33

nm

Table 5.5: Parameters used for calibrating the electrostatic of each device. A fixed positive charge rigidly shifts the electrostatic parameters (math image) and \( V_\mathrm {fb} \) towards more negative values.

For calculating ∆Vth, as described in Section 5.1.1, Comphy is calibrated to the device electrostatics of each device using the channel material parameters given in Table 5.1. Additionally, as also proposed by Ito et. al [259], fixed positive charges, as given in Table 5.5, have to be used to compensate for the deviation from an ideal device characteristic, c.f. CV measurements in Figure 2.3. It should be noted that the exact origin of these fixed positive charges is subject of ongoing research. In the work of Rescher [193], process splits were performed during device manufacturing and thereby it has been revealed by measuring CV curves after each processing step during the gate stack formation that the poly-Si gate deposition and its subsequent annealing step results in the largest deviations from the ideal CV characteristics. This led to the the hypothesis that the passivation of Si-dangling bonds at the poly-Si/SiO2 interface in forming gas leads to H \( ^{+} \) accumulation within the SiO2 layer and thus to a positive charge build up.

Layer

ET range

\( \Delta \)ET

ER range

\( \Delta \)ER

xT range

\( \Delta \)xT

oxide

−3 eV–3 eV

50 meV

0.1 eV–5 eV

70 meV

0.6 nm–3 nm

0.1 nm

interface

−2.2 eV–2.2 eV

34 meV

0.1 eV–3 eV

70 meV

0.0 nm–0.5 nm

0.1 nm

Table 5.6: Grid parameters used for the ESiD extraction method. An oxide layer accounts for border traps considered as a bulk SiO2 property, while the interface layer accounts for defects within the transition region from bulk SiC to bulk SiO2.

Contrary to the simplex optimization approach used for the lateral devices, the ESiD algorithm is employed to extract defect parameters that can explain the indicated (math image) recovery over the bias and temperature space. As discussed in Section 4.8, a major advantage of this algorithm is that no initial distribution has to be assumed for the defect parameters and respective densities, and thus no initial guess is required. Besides this improvement over the iterative optimization scheme, the computational effort is significantly reduced, as only a few iterations for varying the regularization parameter \( \gamma \), c.f. Figure 4.15 are required. Quite to the contrary, for the bands optimized by the simplex method, the parameter space increases by four parameters, ET and ER mean and standard deviations when assuming Gaussian distributed defect bands, for each additional band. Thus, the optimization in this large parameter space rapidly becomes cumbersome. Here it has to be emphasized that only the application of the ESiD allows to extract defect parameters for the large amount of data acquired by AC and DC MSM schemes with bias and temperature variations. The computational expenses for ESiD are, however, larger than for Si based MOSFETs as an increased defect parameter grid is required due to the AER covering a larger fraction of the entire SiO2 bandgap. As can be seen in Table 5.6, the parameter grid spanned for the ESiD extraction has been chosen to be split into two layers for electron and hole traps each. The first layer (interfacial layer) spans the range from 0 nm to 0.5 nm from the SiC/SiO2 interface and is introduced to account for the changing stoichiometric composition and therefore interface defect properties as well as to allow for increased defect densities in the transition region from bulk-SiC to bulk-SiO2, c.f. Figure 3.3. Additionally, a second layer (oxide trap layer) covering the range of 0.6 nm to 3.0 nm accounts for border traps, as a bulk-SiO2 property. Note that any further extension to larger distances within the oxide is not meaningful, as the tunneling probability in the transitions rates (4.29) and (4.30) decreases exponentially with increasing distance, as shown in [CST1].

(image)

Figure 5.7: The first points of the AC MSM recovery periods are shown for different VGL with each symbol (measurement) interrupted at different times during the last AC stress period, e.g. the first point refers to an interruption of the AC signal after 1 × 10−8 s during the VGL phase. The lines indicate the points of simulation, which are connected for a better guidance to the eye. Large differences in the quantitative characteristics can be seen, i.e. T1/G3 shows no negative (math image) shifts for the applied VGL, while T2/G1 exhibits large shifts of ∆Vth \( \approx \) 2 V in positive and negative bias range. (taken from [CSJ6])

The optimization of the defect parameters by the simulation with the calibrated device and material parameters allows to achieve an excellent agreement between the transient simulation with the measurement data, as exemplary shown for T1/G2 in Figure 5.6. A comparison of the short-term (math image) characteristics is shown in Figure 5.7 by comparing the first point of each AC MSM recovery trace at the different interruption times \( t_\mathrm {AC,interrupt} \). The largest degradation can be observed in T2/G1 for both negative and positive stress bias, as well as for the long-term DC PBTI characterization. More negative ∆Vth at more negative VGL can be seen in both T1/G2 and T2/G1, however, negative ∆Vth is not noticeable in T1/G3 at all. The positive (math image) shift values of T1/G3 are well below 0.5 V and slightly smaller than those observed for its preceding technology generation (T1/G2).

5.1.3 Defects Parameters

In order to explain the observed positive (math image) in the lateral test MOSFETs, as well as for all three DMOSFETs, acceptor-like defects in the vicinity of the conduction band edge are required, as the AER spans from \( E_\mathrm {c,SiC} \) towards larger energies at PBTI conditions. All technologies therefore show distinct peaks, above \( E_\mathrm {c,SiC} \) with varying densities, i.e. NT for electron traps in T2/G1 and lateral MOSFET are about four times higher compared to those in T1/G2 and T1/G3, as can be seen in Figure 5.8 together with their assignments to the employed layers. Also donor-like defects in the lower half of the SiC band-gap are prevalent and distributed over a wide energetic area towards \( E_\mathrm {v,SiC} \), except for T1/G3, which does not require a notable amount of hole traps, consistent with the absence of NBTI in this technology. The extracted relaxation energies are relatively low compared to those extracted for Si-based technologies employing a plasma-nitrided oxide [CSJ8, CSC4].

(image) (image)

Figure 5.8: The thermodynamic trap levels according to the Gaussian distributions that were used for reproducing ∆Vth in the lateral MOSFETs (left). A comparison of the distributions of the extracted de- fect parameters of the three DMOSFET technologies is shown in the right panel. (taken from [CSC8] and [CSJ6])

For the relaxation energies it has to be mentioned that the extracted values in Table 5.4 are considered as too large. This arises from not pinning R to a value of 1 which together with the parameter correlation of R and ER , as mentioned in Section 4.8, leads to the overestimated values for ER . By fixing R to 1 and correcting for the correlation, the mean relaxation energy of the shallow electron trap band reduces to \( E_\mathrm {R} \) = 1.82 eV, which is comparable to those for the electron traps extracted in the ESiD spectra with larger ER , shown in Figure 5.8 that are considered as oxide defects. Additionally, the fast EB as well as the ESiD spectra with ER values smaller than 1 eV are associated with interface states, as these defects exhibit significantly smaller ER  [208] compared to structures considered for oxide traps [77]. The thermodynamic trap levels of several defect candidates, which have been extracted from DFT calculations found in the literature, are shown in Figure 5.9 together with the AERs for an intermediate stress oxide field of 6 MV/cm, which covers all presented CTLs. It has to be noted, however, that the CTL only represents a thermodynamic property and the charge transfer kinetics depend on the relaxation energy of the defect as well. Also, what is not represented by this graph is the formation energy of the defect candidates, which determines the probability of formation and therefore is correlated to the defect densities in a real device. Finally, the (ET , ER ) maps for the DMOSFETs shown in Figure 5.10 show a strong correlation for the electron traps extracted with two major distributions, i.e. defect types, and in part also for the donor-like defects within the lower half of the SiC bandgap. A possible origin for the different densities extracted for the electron traps with low ER values situated at the conduction band edge is charge trapping at N-complexed defects, as the amount of accumulated N at the interface has been correlated with increasing numbers of interface states at \( E_\mathrm {c,SiC} \) [201]. This seems also plausible when considering the fact that N is a suitable dopand with relatively small activation energy in SiC.

(image)

Figure 5.9: A comparison of defect CTLs extracted from DFT calculation in literature are shown for bulk-SiO2 (dark-green), the interface layer SiO \( _\mathrm {x} \)C \( _\mathrm {y} \) and bulk SiC. All candidates lie within or close to the trap-levels extracted by our simulations, c.f. Figure 5.8. The AER is shown for a maximum interface distance of 3 nm at \( \vert E_\mathrm {ox} \vert \) = 6 MV/cm and covers all shown CTLs at PBTI and NBTI conditions. The defect levels were extracted from literature as follows: polaron [177, CSJ3]; OV, HB, H-E \( ^\prime \) [77, 172]; Si \( _\mathrm {2} \)-C-O [186], C \( _\mathrm {O} \)=C \( _\mathrm {O} \), C \( _\mathrm {i} \)=C \( _\mathrm {i} \) [184]; \( P_\mathrm {b,C} \) [188]; N \( _\mathrm {C} \)V \( _\mathrm {Si} \) [197]

(image)

Figure 5.10: The extracted defect distributions are shown in a (ET , ER ) parameter map. A detailed visualization is achieved by scaling the defect densities by \( \mathrm {log}_\mathrm {10}\left (1 + \kappa N_\mathrm {T} / N_\mathrm {T}^\mathrm {max} \right ) \) / \( \mathrm {log}_\mathrm {10}\left (1 + \kappa \right ) \) with \( \kappa \) as denoted in the panels. Compared to its preceding technology T1/G2 (top), T1/G3 exhibits a negligible number of defects in the lower half of EG , while similar densities are seen for the two distributions close to EC . This two-fold distribution is also extracted on T2/G1 (bottom), with increased density, and additionally a widespread hole trap contribution compared to T1/G2 and T1/G3. The two electron trap distributions represent a commonality among all three technologies. (adapted from [CSJ6]).

(image)

Figure 5.11: The dependence of ET (left) and ER (right) on a variation of the capture cross section \( \sigma \) is shown for the electron trap interfacial layer of T1/G2. A clear compensation of the rate kinetics with lower ER for smaller \( \sigma \) can be observed for the large ER defect distribution, while smaller ER distributions and both ET peaks remain largely un-affected.

As the only left “free" parameters in calculating the transition rates (4.29) and (4.30) are the capture-cross sections for electron and hole trapping \( \sigma _{n,p} \), their relevance on the parameter extraction is briefly discussed in the following. Therefore, the impact of a \( \sigma \) variation is studied, in steps of one order of magnitude to 1 × 10−14 cm2 and 1 × 10−16 cm2 [260] from the originally used values of \( \sigma _{n,p} \) = 1 × 10−15 cm2 [68, 208]. For both alterations of \( \sigma \), the ESiD extraction has been repeatedly applied, with the results shown in Figure 5.11. It becomes evident that the thermodynamic trap level is not significantly influenced by the selection of \( \sigma \), with only minor deviations of the two main distributions around \( E_\mathrm {c,SiC} \). On the other hand, the distribution of ER with larger mean values shifts by about 0.5 eV towards smaller values with decreasing \( \sigma \), while the lower ER distribution is less affected. The change of ER is effectively compensated by the change of charge transfer kinetics due to the altered \( \sigma \) value, with no loss in the ESiD accuracy. Even though a large variation of two orders of magnitude in \( \sigma \) is shown, the resulting deviation of ER is low, which is a result of the exponential dependence of the rates on ER . As the relaxation energy extracted within the simulation is typically compared to ab-initio calculations, the compensation effect seen between ER and \( \sigma \) can be minimized by targeting consistency between the two computational methods.

5.1.4 Temperature Activation of Electron Emission

An non-intuitive feature of the (math image) recovery previously reported by Puschkarsky et. al, is the less pronounced degradation observed for higher \( T \) [83], as shown in detail in Figure 5.12. Such an inverse recovery temperature acceleration contradicts observations from NBTI on Si/SiO2 MOS devices, however, has also been reported for PBTI on n-channel Si/SiON [CSJ8]. As for the Si/SiON technology, in the SiC/SiO2 device this phenomenon can be explained by a pronounced temperature activation for the electron emission, compared to that for the capture process, which does lead to a higher degradation during the stress phase, as shown in Figure 5.12. For the long-term recovery, a crossover between the recovery curves is observed which leads to larger observable degradation at the higher \( T \) after longer recovery times. This phenomena is also well captured by the NMP charge transfer kinetics, with the extracted defect parametrization for the DMOSFETs.

(image)

Figure 5.12: The first and the last stress and recovery phases of the full MSM sequence are shown. An interesting feature of the recovery is the enhanced \( T \)-activation for electron emission, resulting in a crossover point of the recovery traces of the two temperatures after tstr =10 ks. (taken from [CSJ6])

In Figure 5.13 the time to reach ∆Vth= 0.6 V on the lateral devices is plotted for different stress bias and temperatures. The same inverse \( T \) behaviour is observed, however, with a strong dependence on the readout time \( t^\mathrm {delay} \) and \( V_\mathrm {G}^\mathrm {s} \), as the effect is most pronounced for an intermediate \( V_\mathrm {G}^\mathrm {s} \) = 37.5 V ( \( E_\mathrm {ox} \approx \) 5 MV/cm), and decays for lower and higher field strengths.

(image)

Figure 5.13: The time \( t_\mathrm {\SI {0.6}{\volt }} \) to reach ∆Vth = 0.6 V is shown for different readout delays for the lateral MOSFETs. The unambigous trend that for most of the ( \( t_\mathrm {delay} \), \( V_\mathrm {G}^\mathrm {s} \)) combinations the degradation of 0.6 V (arbitrary chosen) is exceeded after a longer period is again a result of more pronounced temperature activation of the emission compared to electron capture. However, the effect also strongly depends on the selected \( t_\mathrm {delay} \). (taken from [CSC8])

5.1.5 Capture and Emission Time Maps

(image) (image)

Figure 5.14: The experimentally extracted CET map of lateral MOSFETs (left) agrees with the simulation (right) in the main features for the noted \( V_\mathrm {G}^\mathrm {L} ~\approx V_\mathrm {th}, V_\mathrm {G}^\mathrm {H} = \SI {30}{\volt } \) and \( T = \SI {30}{\celsius } \). A peak in the time constant distribution is visible above the \( \tau _\mathrm {c}^{H} \) = \( \tau _\mathrm {e}^{L} \) diagonal with a decreasing density towards larger \( \tau _\mathrm {c}^{H} \) and \( \tau _\mathrm {e}^{L} \). For the calculated CET map, \( \mathrm {log}_\mathrm {10}\left (1 + \kappa V_\mathrm {th} / V_\mathrm {th}^\mathrm {max} \right ) \) / \( \mathrm {log}_\mathrm {10}\left (1 + \kappa \right ) \) with \( \kappa \) = 1000 has been used to visualize the relatively small contribution of the defects with larger charge transition time constants. The experimental window is represented by the shaded area in the simulated map. (taken from [CSC8])

While the (ET ,ER ) map, c.f. Figure 5.10, represents a full picture of the defect properties, which in turn allows to compute ∆Vth for arbitrary input signals VG( \( t \), \( T \)), a direct readout of ∆Vth from the map for a specific input signal is not possible. Therefore, Capture Emission Time (CET) maps can be used for a direct representation of the degradation for a specific \( V_\mathrm {G}^\mathrm {L}, V_\mathrm {G}^\mathrm {H} \) and \( T \). A large set of experimentally extracted CET and activation energy maps is shown in previous works of Puschkarsky et. al [83, 90, 261]. The main features within these maps are consistent with the previous explanation of a slow and fast degrading and recovering BTI component, transferring into two peaks within these maps. These peculiarities in CET maps extracted from SiC MOSFETs are also prevalent in the lateral test structures, as shown in Figure 5.14 for the experimentally extracted and simulated CET maps.

(image) (image)

Figure 5.15: A comparison of unipolar AC (VGL = 0 V) (left) with a bipolar operation VGL = −5 V (right) shows the shift of the distribution of charge transtion time constants towards lower emission times, leading to less observable ∆Vth. The fast degrading component recovers fast enough and parts of the slow-degrading component is also shifted towards smaller emission times, leading to a smaller overall PBTI due to charge accumulated in electron traps.(taken from [CSC1])

As a bipolar operation of SiC DMOSFETs is recommended [262] as it increases the long term stability of (math image), i.e. it reduces long-term BTI, the CET maps of T1/G2 are shown for VGL = 0 V compared to VGL = −5 V in Figure 5.15. The electron trapping time distributions shift towards smaller \( \tau _\mathrm {e}^{L} \), which leads to the ascribed effect of enhanced electron emission at the fast electron traps, c.f. pulsed MSM in Section 5.1.1. This observation has led to the introduction of preconditioning schemes [87, 88] to primarily measure long term BTI effects.

5.1.6 Reliable Prediction of ∆Vth

The extrapolation of ∆Vth under arbitrary bias and temperature operating conditions is in some works still performed based on fitting a power-law to the measured ∆Vth [263]. However, it lacks of any physical justification, e.g. ∆Vth never saturates, and in the case of SiC MOSFETs even more than one branch of a power law is needed to describe the different regimes of degradation. This fact is emphasized in Figure 5.16 for a constant voltage DC stress bias, leading to a severe overestimation of the degradation. The application of a Gaussian distribution of charge capture and emission times [54] might be justified in the case of only one defect type accounting for the observed ∆Vth, however, in the case of different distributions present, c.f. Section 5.1.4, it leads to an underestimation of ∆Vth. Both empirical methods require in any case different parameters for different read-out delays when measuring ∆Vth. On the contrary, with a carefully calibrated physical defect model, ∆Vth can be predicted for arbitrary input signals and readout delays accurately, shown in Figure 5.16 (right).

(image) (image)

Figure 5.16: Extrapolation of ∆Vth with a simple empirical power-law (solid lines) (left) lacks not only of physical justification, but also severely overestimates the (math image) degradation, as the slopes over stress time start to decay after 1 × 10−1 s to 1 × 101 s. The strong dependence of the readout time is also not captured by a single power-law exponent. A fit with a simple Gaussian distribution for charge transition times (dashed lines) explains the extracted (math image) more accurate, however, underestimates degradation at lifetimes. On the contrary, the calculation of (math image) with a calibrated physical defect model captures the degradation and slopes well for arbitrary bias conditions and measurement delays (right). (taken from [CSC8])

A precondition for the calculation of a valid extrapolation of ∆Vth at operating conditions of a MOSFET is the careful design of the stress conditions of the defect parameter extraction. The temperature and stress bias need to be increased to allow to capture charge transition times that are accelerated enough to be shifted into the experimental time window. This acceleration is shown in Figure 5.17 for the harshest PBTI stress conditions (VGH = 25 V, \( T \)=448 K), which shows a larger defect occupation after the interpolation time of about 100 ks, compared to the occupation after a device lifetime of 10 years at operation conditions (VGH = 20 V, \( T \)=300 K).

(image)

Figure 5.17: A comparison of charge transition time distribution of defects active at harshest stress condition (top) with the according distribution for recommended operation of the MOSFET  (bottom) shows a larger occupation of the defects, i.e. more defects with smaller \( \tau _\mathrm {c}^\mathrm {H} \) that are likely occupied after the interpolation time, compared to a lifetime of about 10 years at the operation condition. This renders the computation of ∆Vth after such a long lifetime period at operation condition valid.(taken from [CSC1])

(image)

(image)

Figure 5.18: An extrapolation of ∆Vth analytically computed for bipolar AC stress (f=50 kHz, duty cycle = 0.5) for T1/G2 is shown up to a device lifetime of 10 years for different (VGL, VGH) (bottom, left). A maximum of ∆Vth \( \approx \) 2 V is observed for VGL = 0 V, which is reduced by a factor of 2 for recommended operation (VGL = −5 V, VGH = 15 V). A large fraction of the degradation recovers within a short time for readout at VG \( \approx \) Vth (bottom, right). The comparison of stress and read-out Vth extrapolation of the three technologies (top) suggests highest stability for T1/G3, as a result of low electron trap densities and the absence of hole traps. (taken from [CSJ6]).

Based on the calibration of the DMOSFET technologies in Comphy, a lifetime prediction is presented in Figure 5.18 for each DMOSFET for different digital AC gate signals, as typically applied, considering a continuous operation. For this, the occupation of the defects has been calculated based on (4.60). It becomes clear that the wide distributions of defects with increased densities for T2/G1 leads to the most pronounced drifts of (math image), as well as to the largest variation. In contrast, the small densities and absence of hole traps in T1/G3 leads to relatively small drifts during stress, and as low as about 0.2 V only after 10 years, when read out after 1 × 10−4 µs. Exemplarily shown for T1/G2 is the variation for different (VGL,VGH) bias combinations with a maximum of about 2 V degradation for unipolar positive bias operation, which refers to the value at the end of the VGH phase during the AC period. For all three devices bipolar operation leads to a significantly reduced (math image) shift when an AC signal is applied and readout is performed at VG \( \approx \) (math image).

(image)

Figure 5.19: A comparison of the overdrive voltage dependence of the Vth and Ron degradation for a duty cycle of \( d \) = 0.5 for T1/G2 after 10 years and 10 µs readout time at (math image). An initial (math image)( (math image)) characteristic is used to calculate Ron and ∆Ronwithin a range of ∆Ron \( \approx \) 8 mΩ to 13 mΩ for different VGL at the largest \( V_\mathrm {ov} \). This translates to a minor static on state loss increase of less than 1 W considering a constant maximum current as given in the datasheet.(taken from [CSC1])

When comparing the overdrive voltage dependence \( V_\mathrm {ov} = V_\mathrm {G} - V_\mathrm {th,0} \) dependence of ∆Vth and on-state resistance shift ∆Ron at different VGL, as shown in Figure 5.19, a minimum of about 8 mΩ increase in Ron is predicted after 10 years of continuous operation at the maximum recommended bias conditions. Ron is thereby computed from an initially recorded (math image)( (math image))-curve with only a small deviation depending on the method of (math image)( (math image)) recording, e.g. continuous sweep or pulsed (math image)( (math image)) measurement, as discussed and shown in [CSJ10]. The continuous (math image)( (math image)) characteristics is used for the Ron calculation in Figure 5.19, as a readout time of 10 µs after the stress phase, as used for the extrapolation of (math image) is expected to compensate for potential accumulation pulses during the pulsed (math image)( (math image)) sweep. With the small static degradation of Ron predicted by the simulation, a minor increase of the static on-state losses \( P_\mathrm {on} \) of less than 1 W at continuous current operation at the maximum value according to the datasheet of 10 A can be expected. Finally, in Figure 5.20 the duty cycle \( d = t^\mathrm {H} / \left ( t^\mathrm {H} + t^\mathrm {L} \right ) \) dependence of ∆Vth shows only a minor variation of about 0.2 V for relevant \( d \) in the range 0.1 to 0.9 for all VGL, VGH permutations, with larger deviations only obtained for the DC operation cases at constant VG = VGL and VG = VGH. This leads to the conclusion that no major deviation due to BTI is predicted for variable high / low voltage conversion operation within a power converter circuit.

(image)

Figure 5.20: Duty cycle \( d \) dependence of (math image) after 10 years of operation at varying VGL, VGH shows a minor variation over \( d \) except for the DC cases. A VGL of −5 V significantly reduces the degradation by more than 0.5 V for any VGH, when compared to non-negative VGL.(taken from [CSC1])

5.1.7 Summary

For the first time, BTI in different SiC MOSFETs has been reproduced in detail by a device scale model employing physical defect parameters. The parameters for defects causing the long-term degradation are similar to those extracted in Si-based MOSFETs. However, the obtained defect densities are increased due to the different conduction band offsets in SiC/SiO2, which enables a larger fraction of shallow electron traps to be charged during device operation. These defects are considered an intrinsic property of SiO2. Together with fast electron traps, the defects close to the SiC conduction band are a common feature extracted in three commercially available DMOSFETs and a lateral test MOSFET, however, vary quantitatively in the extracted densities. The two components, i.e. fast and shallow EB, have been demonstrated to be separable by accumulation pulses and the optimum operation condition employing bipolar gate drive bias has been explained by the extracted defect charge transition times. These electron trap properties have recently also been confirmed by TCAD simulations employing a detailed four-state NMP model [264]. Furthermore, based on carefully calibrated simulation parameters and including data from accelerated stress experiments, static degradation extrapolation is performed. It predicts that the low on-state resistance and power loss increase can be considered as no major threat in SiC MOSFETs. Although a unique defect candidate cannot be identified with the presented methods, a parameter range is defined in the presented work that can help to further pin down suspected defect structures by single defect characterization in future works.