IMPORTANT:
Irrational Analysis is heavily invested in the semiconductor industry.
Please check the ‘about’ page for a list of active positions.
Positions will change over time and are regularly updated.
Opinions are authors own and do not represent past, present, and/or future employers.
All content published on this newsletter is based on public information and independent research conducted since 2011.
This newsletter is not financial advice and readers should always do their own research before investing in any security.
AEHR 0.00%↑ just made an acquisition that is suspicious at best, nonsensical at worst.
VICR 0.00%↑ , a company seemingly left for dead, is oddly showing signs of life.
And INTC 0.00%↑, the most beaten-up large-cap semi name, is going up while the entire semiconductor industry is a sea of red.
Believe it or not, these three investments are connected in some way.
Before getting the investment talk (dessert), you must first suffer through the technical material (vegetables) I have prepared for you today.
Contents:
Electromigration
[H]igh [T]emperature [O]perating [L]ife
AEHR’s Legendary Gaslighting
Reading the VICR Tea Leaves
Bonus: INTC Possible Tsunami-Level Tail Risk
[1] Electromigration
Electromigration (EM) is the process by which semiconductors degrade over time.
High current, high voltage, and high temperature (in particular) accelerate EM.
Determining the reliability of a chip design depends on two things:
The design team simulating (correctly) and making sure all their circuits meet a minimum reliability criterion, typically 10-years for logic like consumer CPUs.
The post-silicon team needs to run HTOL testing and analyze all failures, adjusting settings to make sure production chips don’t fail early.
{2} [H]igh [T]emperature [O]perating [L]ife
How do you quickly find out if your chip will last, on average, 10 years under normal operating conditions?
The answer is simple: HTOL burn-in testing.
Take a batch of chips (80-240 units typically) and run them at full load for 1000 hours at 125C.
Note: Specialty parts may need longer testing at a higher target accelerated degradation temperatures.
Conceptually, HTOL testing is simple. Running your chip at an abusively high temperature for one hour is equivalent to MANY hours of normal operation.
There are machines specifically designed for this type of testing…
Think of these machines as giant ovens that feed power to boards that contain many chips each. In addition to power delivery and temperature control, these HTOL systems support digital and/or analog telemetry to track precisely when a device is in a failed state and needs to be pulled out for analysis.
HTOL testing is a tedious, someone annoying process but is absolutely vital for making sure designs will survive as long as they are supposed to.
Thankfully, once a design has passed HTOL, it does not need to be re-tested unless a major change is made.
Examples of changes that would warrant new round of HTOL testing:
Porting the design to a new process node.
Non-trivial architectural changes.
Request by customers for qualification in a new market that has materially different nominal operating conditions.
[3] AEHR’s Legendary Gaslighting
Aehr Test Systems is a specialty supplier or silicon-carbide (SiC), gallium-nitride (GaN) and silicon-photonics wafer-level burn-in testing equipment.
These three semiconductor niche segments need production-scale burn-in testing to make sure the weak chips die quickly in the factory.
The goal of this style of burn-in testing is to weed out the weak chips and sell the survivors, knowing they are unlikely to fail early in the field.
For detailed information on what Aehr does and why their solution is cool, check out this SemiAnalysis post.
TLDR;
Aehr makes test machines that can take in entire wafers at a time. Saves a lot of money because you don’t need to chop-up wafer into chips, package, then test. Strong recurring-revenue stream at high gross margin from consumable probe cards.
Aehr’s recent stock performance has been due to an acquisition they just announced, Incal.
Surprise! I did not waste your time explain random engineering terms. You already know what HTOL is and thus know exactly what Incal does.
Here is a comparison sheet I made.
These two businesses are nothing alike.
Completely different engineering goals.
Completely different financial and business models.
Zero synergy.
The press release and earnings call transcript has some of the most gaslighting, nonsensical, misleading, bullshit I have ever had the misfortune to read.
Gayn Erickson, President and CEO of Aehr Test Systems, commented, “The rapidly growing artificial intelligence semiconductor market is still in the early stages, and we see a significant opportunity in this market for wafer level burn-in using our new high-power FOX multi-wafer production system. In addition, given the unique challenges of testing very high-power devices related to AI processors, there is a very real need for a significant amount of engineering qualification and process development as well as a significant new opportunity for production reliability screening at the packaged part level.
“We believe that Incal’s new family of ultra-high-power solutions have significant technological and commercial advantages over other solutions, enabling Aehr to capture an even larger share of the AI semiconductor test market. In addition, Incal is in a unique position with intimate knowledge and working relationships with a significant number of AI industry leaders, providing a front-row seat to the technology needs of these customers. Incal is shipping systems today for use by a broad range of companies and their subcontract manufacturing and test houses for reliability testing for AI, GPU, network, and HPC processors, with many of these companies projecting needs to move to high volume, production level burn-in of these devices.
Throughout the press-release and earnings call transcript, Aehr management repeatedly imply that AI semiconductor testing will soon need high-volume, production level burn-in.
Multiple participants on the call…. called bullshit on this acquisition as a contrived way of boosting next-years revenue guidance, which would have missed street expectations significantly without the $12M in Incal revenue.
From an engineering perspective, this acquisition by Aehr is obvious bullshit. Folks on the financial side are already suspicious. Do some more digging and this could be a juicy short.
[4] Reading the VICR Tea Leaves
Vicor makes power components. Converting DC to DC, AC to DC and what not. There are plenty of other companies that make these components such as MPWR 0.00%↑ ADI 0.00%↑ and TXN 0.00%↑ .
For background on Vicor, check this SemiAnalysis piece:
E N H A N C E
WE MUST GO DEEPER.
E N H A N C E !!!!!!!!!
Some degenerate pod monkeys have clearly bid this garbage up. If any readers want to really know what is going on with Vicor, ask around for HTOL reports.
Vicor has pissed off basically every major customer they had due to reliability issues. Nobody is going to be dumb enough to trust them with a high-volume program without a detailed HTOL report for the part in question.
Find the HTOL report and you have an answer if this stock is about to plummet or goto moon.
[5] Bonus: INTC Possible Tsunami-Level Tail Risk
Certain Intel chips (high-core count 13th and 14th gen Raptor Lake) have been failing alarmingly early en-masse.
These are desktop chips that also get deployed in small servers.
Datacenter providers are hiking warranty prices.
Based on information from a variety of sources, I believe the following is an accurate summary of the current situation at the time of writing:
High-core count desktop CPUs from Intel’s 13th and 14th generation are failing as early as a few months of life, typically 6 months of operation.
The datacenter motherboards do not support overclocking and are deployed in well-cooled environments. (not a PC gamer overclocking problem)
Early EM induced failure appears to be occurring in the ring bus. (the part that connects all cores, PCIe PHY, iGPU, and so on together)
From a financial perspective, this is a nothingburger so long as the problem is limited to high-end gaming SKUs.
If this EM problem is latent in the laptop SKUs…. it could be a massive problem.
TLDR;
Intel may have an unexpected, multi-billion-dollar pre-tax charge if this electromigration problem is in all Raptor Lake, not just the high-core count desktop SKUs.
Intel has just blamed motherboard voltage again. They later updated their PR statement to admit the CPU's have an oxidation problem...hours after the press ran with the voltage excuse.