Why should I care about BJTs? This is the CMOS age. If you design full TX/RX chains in industry, chances are you’re using CMOS. But if you look at dedicated RF front-end blocks (like LNAs and PAs) you’ll notice they often use non-CMOS tech. Sometimes SOI, but more often, HBTs (a variant of BJTs). Turns out, BJTs have survived the test of time.
It wouldn’t be an exaggeration to say BJTs are still the best transistors for RFIC applications, offering higher \(f_{max}\) and \(f_\tau\) than CMOS. And let’s be real, CMOS hasn’t been making big moves beyond 22nm in terms of RF performance.
Either way, CMOS \(f_{max}\) isn’t improving the way we hoped. Meanwhile, BJTs are accelerating, pushing towards \(f_{max}\) in the THz range. And here’s the thing: junctions are everywhere whether you use CMOS or non-CMOS. If you really want to understand how transistors work, BJTs are the best place to start. They show you how to use junctions cleverly, and once you get BJTs, you get a much deeper feel for device physics.
In last post, we learned how a BJT works. Let’s dig into details and see how key metrics like gain, \(f_{max}\), breakdown and linearity are traded off in BJT design.
Let’s understand how current flows in BJT before we dive deep. The base–emitter (B–E) junction is forward biased which injects electrons from the emitter across the B–E junction into the base. These injected electrons create an excess concentration of minority carriers in the base. The base–collector (B–C) junction is reverse biased, so the minority carrier electron concentration at the edge of the B–C junction is ideally zero. The large gradient in the electron concentration means that electrons injected from the emitter will diffuse across the base region into the B–C space charge region, where the electric field will sweep the electrons into the collector. This completes the flow of current as shown in the image below. Let’s understand different components of current first.
Collector and base current consist of many components. Let’s breakdown them. The current densities are defined as follows:
To maximize BJT current gain, we need to:
These goals can be achieved if we can make \(\alpha=\frac{I_C}{I_E}\) aka common base current gain to be unity. This will ensure that \(\beta=\frac{I_C}{I_B}\) aka common emitter current gain is maximized since they are related as:
\(\alpha\) can be broken down into three components which all needs to be unity in order to make \(\alpha\) unity.
Let’s understand these components one-by-one:
The emitter injection efficiency factor \(\gamma\) takes into account the minority carrier hole diffusion current in the emitter. This current is part of the emitter current, but does not contribute to the transistor action in that \(J_{pE}\) is not part of the collector current. It is defined as:
Putting in expression for diffusion current and some mathematical manipulations will reveal that we can write \(\gamma\) as [1]:
where,
Above equation suggests, for the emitter injection efficiency to be close to unity:
The emitter doping must be much larger than the base doping. This ensures that far more electrons from the n-type emitter are injected into the base than holes from the p-type base into the emitter.
The base transport factor \(\alpha_T\) takes into account any recombination of excess minority carrier electrons in the base. Ideally, we want no recombination in the base. It is defined as:
Putting in expression for diffusion current and some mathematical manipulations will reveal that we can write \(\alpha_T\) as [1]:
where \(L_B\) is diffusion length. Above equation suggests, for the base transport factor to be close to unity, the base width must be much smaller than diffusion length. This makes sures minority carriers don’t spend too much time passing through base and getting lost in recombination. For \(x_B<<L_B\), we may expand the cosh function in a Taylor series such that
The recombination factor \(\delta\) takes into account the recombination in forward biased B–E junction. The current \(J_R\) contributes to the emitter current but does not contribute to collector current. Recombination factor can be defined as:
It can be further simplified to, as detailed in [1]:
where \(c\) is some constant which depends on base width, doping, lifetime etc. We don’t need to worry about it. What we learn is that the recombination factor is a function of the B–E voltage. As \(V_{BE}\) increases, the recombination current becomes less dominant, and the recombination factor approaches unity.
Assume \(D_E=D_B\) and \(x_E=x_B\) for simplicity. To get \(\gamma\) to be \(0.9967\), we need:
This shows that emitter doping concentration must be 300 times larger than the base doping concentration to achieve a high emitter injection efficiency.
Assume \(L_B=1\mu m\). To get \(\alpha_T\) to be \(0.9967\), we need:
This shows that base width has to be very small to achieve a high base transport factor.
Assume \(c=1000\), to get \(\gamma\) to be \(0.9967\), we need:
This example demonstrates that recombination factor may become dominant limiting factor if bias voltage is lower than 0.654V.
Our analysis of current gain did not consider frequency effects. We focused on current collection by making sure all the injected current is collected. Next, we may ask how long does it take to collect the injected carriers because that will ultimately determine how fast can we switch the device. The time carriers take to traverse the device is known as transit time \(\tau\), and corresponding frequency is known as transit frequency \(f_\tau\). They are related as:
Note that there is \(2\pi\) factor because delay is inverse of angular frequency, on the other hand time periods are inverse of linear frequency.
\(\tau\) can be broken down into three main components:
We can then write total transit time as:
where,
Since most components are negligible, transit time is often approximated as:
Image below provides step-by-step account of these delays as carriers (electrons in our example) encounter while they travel from emitter to collector.
As the frequency increases, this transit time \(\tau\) can become comparable to the period of the input signal. At this point, the output response will no longer be in phase with the input and the magnitude of the current gain will decrease. One would then want to define a frequency where collection is 3dB less than injection. Therefore, we can re-write \(\alpha\) as:
where \(\alpha_0\) is DC common-base current gain and \(f_\tau\) is 3dB cutoff frequency related to \(\tau\) as we know \(f_\tau=\dfrac{1}{2\pi\tau}\)
This shows that at \(f=f_\tau\;\), \(\beta=1\). This is the origin for the definition of transit frequency, where we say \(f_\tau\) is unity (common-emitter) current gain frequency or 3dB common-base current gain frequency.
So far, we’ve examined \(f_\tau\) from a purely device-level perspective. Now, let’s develop a small-signal circuit model to bridge the gap between the device and circuit domains.
Let’s drive \(C_\pi\) in terms of transit-time. Say, it takes time \(\tau\) to transit charge \(Q\) from emitter to collector (i.e., \(\tau\) includes all diffusion delays base, emitter, collector). We can then write:
Also note that, this only expresses \(C_\pi\) in terms of \(\tau\), it does not mean that diffusion delay only results from \(C_\pi\) and \(r_\pi\) has no role. \(r_\pi\) is hidden inside \(g_m\) as we know \(\beta=g_mr_\pi\).
Since we are interested in current IN (injection) and current OUT (collection), we can short-circuit output (C-E) to collect all the current irrespective of load. The base and collector currents can be written as:
Ignoring 1 as sum of other terms in denominator is much bigger than unity at high frequencies. Also, ignoring \(j\) to arrive get magnitude:
Putting \(\beta=1\):
This means \(\beta\) can also be written as:
Let’s figure out \(f_\tau\) in terms of \(\tau\), consider from Eq (2):
This shows dependence of \(f_\tau\) on \(g_m\). At lower currents, \(g_m\) is low, the second term in denominator is much bigger than \(\tau\), hence \(f_\tau\) increases with collector current. Physically this means, when \(g_m\) is low it takes more time to charge junction capacitances which increases the delay. At medium currents the depletion capacitance term becomes smaller than \(\tau\), and hence \(f_\tau\) ceases to rise with collector current, reaching peak \(f_{\tau MAX}\), and is given by:
This means peak \(f_\tau\) is limited by diffusion delay which as mentioned before is dominated by base transit time.
At high collector currents the cut-off frequency decreases markedly due to Kirk effect. Therefore, it is important to select collector current correctly: too low or too high both not good, there is an optimal for peak \(f_\tau\).
While \(f_\tau\) is a great metric for device designers, it is not very meaningful circuit designers. For example, series parasitic at input of device and shunt parasitic at output of device are ignored in \(f_\tau\) whereas they can seriously degrade operating frequency in a real circuit. \(f_{max}\) was introduced to account for such parasitics and is more realistic measure of how high in frequency a device can really go. We have a dedicated post on \(f_\tau\) vs \(f_{max}\) here, where we derive \(f_{max}\) to be:
\[\LARGE {\textcolor{#40CE7F} {f_{max} = \sqrt{\frac{f_\tau}{8 \pi r_b C_{\mu}}}}}\]
here \(r_b\) is the series resistance at input before \(r_{\pi}\) as shown in image below. This shows that \(r_b\) is so crucial to \(f_{max}\). If \(r_b\) were zero, we would get infinite \(f_{max}\) even though \(f_\tau\) would remain same. \(r_b\) originates from finite base doping (device designer controls this) and base contact resistance (layout designer can mitigate this). Base doping is dominant source of \(r_b\) as base is lightly doped to inject less holes current and thus increase \(\beta\). This shows there is direct trade-off between \(\beta\) and \(f_{max}\).
\(r_b, \; r_c\) and \(r_e\) are base, collector and emitter resistances from contact and finite dopings, \(r_o\) is from base-width modulation and \(C_s\) is capacitance between collector and substrate as collector sits on top of substrate as show in cross-section here.
We know a PN junction breaks down when reverse bias voltage is increased. A BJT has two PN junctions:
Zener breakdown happens when doping concentrations are high. Since base and collector are lightly doped, this is usually not a concern for BJT.
We learned that a reverse bias to a PN junction causes the depletion region to extend. When BJT is in forward active mode, B-C junction is reverse biased. This causes B-C junction to extend. If reverse bias is increased, there comes a time that junction has extended so much that it occupies whole basewidth and joins up B-E depletion region. The emitter and collector are then connected together by a single depletion region, as shown in image. This is known as punch-through, and when it occurs B-E potential barrier is lowered which results in very large current flow between emitter and collector. Intuitively, this can be understood as collector strong electric field now reaching the base and pulling the electrons from emitter as if B-E forward bias was increased and henc more current flows for same B-E voltage. While not a breakdown in the traditional avalanche sense, the electrical behavior is similar to junction breakdown. Punch-through sets a fundamental limit on how much the base width of a bipolar transistor can be scaled down. A thinner base increases gain and speed but also lowers the breakdown voltage, making the device more susceptible to punch-through.
Avalanche multiplication or impact ionization is by far the most common breakdown mechanism in junctions. In bipolar transistors, the avalanche breakdown voltage depends on the base termination impedance. If base is terminated with low impedance as in common-base connection where base is grounded, the breakdown voltage obtained is the same as that of a normal PN junction. If base is terminated with higher impedance as in common-emitter connection, the breakdown voltage is considerably lower, as shown in image. Therefore, in practice, two breakdown voltages are reported:
Above equation shows that the common emitter breakdown voltage \(BV_{CEO}\) is inversely proportional to the common emitter current gain \(\beta\) of the transistor. There is therefore a trade-off between gain and breakdown voltage. Clearly a high gain and a high breakdown voltage cannot be obtained simultaneously, so a compromise must be reached between reasonable values of gain and breakdown voltage.
The BJT is built on a p-type substrate which serves as the reference potential (e.g., ground). An n⁺ buried layer is placed on top of the substrate to reduce the collector’s series resistance; without it, the relatively lightly doped collector region would suffer from excessive voltage drop and delays. Above the buried layer is an n-type epitaxial layer, which is grown specifically to form the collector region with controlled doping. The p-type base region is diffused or implanted into this epitaxial layer, and an n⁺ region is subsequently introduced to create the emitter. Heavily doped p⁺ and n⁺ regions at the surface make low-resistance contacts to the base, emitter and collector. The p⁺ polysilicon seen in the diagram is commonly used as an intermediary between metal and semiconductor contacts; it provides a high doping source, prevents unintentional junction formation, easier to pattern etc. etc. Finally, isolation regions on either side electrically separate this transistor from other devices on the chip, ensuring proper functionality and preventing unwanted current flow.
Quasi-saturation is an effect that occurs at high currents due to the internal collector resistance \(r_c\) of the transistor. It occurs when the voltage drop across the collector resistance is large enough to forward bias the B-C junction. The quasi-saturation is seen as a soft transition into the saturation region of the transistor’s IV curve. Its effect is greater at higher collector currents, because the voltage drop across the internal collector resistor is larger.
At high currents there can be a significant voltage drop across the intrinsic base resistance \(r_b\) of a transistor due to the lateral flow of base current from the base contact. As a result of this voltage drop across the intrinsic base resistance, the potential in the base close to the base contact is higher than that away from the base contact. Consequently, the B-E junction is more forward biased at the edge of the emitter that is closest to the base contact. This effect is known as current crowding since the current crowds to the edge of the emitter that is closest to the base contact. The larger current density near the emitter edge may cause localized heating effects as well as localized high-injection effects. If a bipolar transistor is required to deliver a high current, it is therefore necessary to maximize the emitter perimeter for a given emitter area. This can be done by partitioning the emitter into narrow emitter fingers and interdigitation.
When the mobile charge in the B–Cdepletion region exceeds the fixed ionized donor charge, the net space charge is reduced, weakening the electric field in the collector. As a result, the neutral base region extends into the collector, effectively widening the base. This phenomenon, known as base widening or base pushout or the Kirk effect, occurs at high current densities and degrades transistor performance by increasing base transit time and reducing current gain.
As \(V_{BE}\) increases, the injection of electrons from the emitter into the base becomes more intense. At sufficiently high bias, the concentration of these injected minority carriers (electrons in the p-type base) can become comparable to the base’s majority carrier concentration (holes). When this happens, the base region starts to behave more like an intrinsic region, disrupting the normal operation of the BJT. This shift in carrier dynamics reduces the emitter injection efficiency, because the advantage of the heavily doped emitter over the lightly doped base is diminished. As a result, fewer of the total carriers crossing the E-B junction come from the emitter, degrading transistor gain.
Assume you have a big BJT device. It is like as if you have connected many smaller ones in parallel. When several junctions operate in parallel, a slight temperature increase in one can lower its built-in potential, causing it to conduct disproportionately more current than its neighbors. This localized current hogging can drive the junction into high-level injection, further increasing its temperature and leading to a destructive feedback loop. To mitigate this effect, resistors are often placed in series with the base (called base-blasting) or emitter (called emitter blasting). These resistors help to drop a portion of the voltage externally, ensuring that no single junction is forced to handle excessive current, thereby maintaining uniform current distribution and preventing thermal runaway and subsequent damage.
Effect Name | What it Varies | Explanation | Optimization/Mitigation Techniques |
---|---|---|---|
Base Width Modulation | Output resistance \(r_o\) | The B–C depletion region expands and contracts as output voltage (and hence B–C voltage) changes. This leads to change in base-width which results in change in collector current. | Shield your \(g_m\) transistor from voltage swings e.g., by using cascode. |
Exponential Current | Transconductance \(g_m\) | The exponential I–V relationship of the B–E junction causes \(g_m\) to vary steeply with \(V_{BE}\), making it very non-linear. | Use degeneration, feedback, feedforward, and pre-distortion techniques as listed here. |
High Injection Effects | Current gain \(\beta\) | At high current densities, the injected minority carrier concentration can rival the majority, disrupting exponential behavior and reducing gain. | Optimize layout e.g., use interdigitation to mitigate current crowding. |
Junction Capacitance Variation | Input/output capacitances | Capacitance of the junctions changes with applied voltage, affecting frequency response and introducing phase distortion. | Use feedback neutralization techniques and reduce device size. |
Series Resistance Variation | Output current and region of operation | Series resistances create current-dependent voltage drop. For example, if the collector draws a large current, the drop across \(r_c\) can be so high to put the device into saturation. | Optimize geometry and contact layout. |
Thermal Runaway | Gain and current | Localized heating lowers \(V_{BE}\), causing that device to draw more current, which increases heating further, leading to high injection and potential damage. | Use base or emitter ballasting resistors. |
Impact Ionization | Current multiplication and breakdown | High electric fields can generate additional carriers, resulting in nonlinear multiplication and possible breakdown. | Operate below breakdown limits. |
Note: In device context, term width is used so we kept it but for circuit designers its actually “length” (something that comes fixed from foundry). So, table below is mainly from device designer perspective, we (circuit designers) don’t control these things.
Parameter | Effect on Current Gain | Effect on High Frequency Performance | Effect on Breakdown Voltage |
---|---|---|---|
Base Width 🔼 | 🔽 (more recombination lowers gain) | 🔽 (longer transit time slows switching) | 🔼 (wider base reduces punch-through risk) |
Emitter Width 🔼 | 🔼 (improves injection efficiency) | 🔽 (increased diffusion delay) | minimal direct effect |
Collector Width 🔼 | little direct effect | 🔽 (longer carrier path increases delay) | 🔼 (spreads electric field, improving breakdown) |
Base Doping 🔼 | 🔽 (more hole injection/recombination) | 🔽 \(f_\tau\) (reduced diffusion constant hampers speed) 🔼 \(f_{max}\) (reduced base series resistance) | 🔽 (narrower depletion region lowers breakdown voltage) |
Emitter Doping 🔼 | 🔼 (enhances injection efficiency) | some improvement seen as \(\beta\) increases but bandgap narrowing offsets gains | 🔽 (bandgap narrowing lowers built-in potential which affects indirectly, there is no direct effect since breakdown is mostly B-C junction phenomena) |
Collector Doping 🔼 | minimal effect on gain | little effect on carrier transit | 🔽 (higher doping increases electric field intensity, lowering breakdown voltage) |
RFInsights
Published: 13 April 2025
Last Edit: 13 April 2025