
    Nvidia Raises Ante in AI Chip Game With New Blackwell Architecture

    Nvidia is pumping up the power of its line of artificial intelligence chips with Monday’s announcement of its Blackwell GPU architecture at its first in-person GPU Technology Conference (GTC) in five years.
    According to Nvidia, the chip, designed for use in large data centers (the kind that power the likes of AWS, Azure, and Google), delivers 20 petaflops of AI performance; it is 4x faster on AI-training workloads, 30x faster on AI-inferencing workloads, and up to 25x more power efficient than its predecessor.
    Compared to its predecessor, the H100 “Hopper,” the B200 Blackwell is both more powerful and more energy efficient, Nvidia maintained. Training an AI model the size of GPT-4, for example, would take 8,000 H100 chips and 15 megawatts of power. The same task would take only 2,000 B200 chips and four megawatts of power.
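    A quick back-of-the-envelope check, using only the figures Nvidia cited, shows where the efficiency claim comes from. The sketch below simply restates the article’s numbers in Python; it is not an independent benchmark.

        # Back-of-the-envelope comparison using Nvidia's cited figures
        # (8,000 H100s at 15 MW vs. 2,000 B200s at 4 MW for a GPT-4-scale run).

        h100_chips, h100_megawatts = 8_000, 15
        b200_chips, b200_megawatts = 2_000, 4

        chip_reduction = h100_chips / b200_chips           # 4.0x fewer chips
        power_reduction = h100_megawatts / b200_megawatts  # 3.75x less total power

        # Implied power draw per chip during the run (kW per chip).
        h100_kw_per_chip = h100_megawatts * 1_000 / h100_chips  # ~1.9 kW
        b200_kw_per_chip = b200_megawatts * 1_000 / b200_chips  # ~2.0 kW

        print(f"{chip_reduction:.2f}x fewer chips, {power_reduction:.2f}x less power")
        print(f"per-chip draw: H100 ~{h100_kw_per_chip:.2f} kW, B200 ~{b200_kw_per_chip:.2f} kW")

    Under these figures, each B200 draws roughly the same power as an H100 (about 2.0 kW vs. about 1.9 kW) but does four times the work, which is the substance of the power-efficiency claim.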
    “This is the company’s first big advance in chip design since the debut of the Hopper architecture two years ago,” Bob O’Donnell, founder and chief analyst of Technalysis Research, wrote in his weekly LinkedIn newsletter.
    Repackaging Exercise
    However, Sebastien Jean, CTO of Phison Electronics, a Taiwanese electronics company, called the chip “a repackaging exercise.”
    “It’s good, but it’s not groundbreaking,” he told TechNewsWorld. “It will run faster, use less power, and allow more compute into a smaller area, but from a technologist perspective, they just squished it smaller without really changing anything fundamental.”
    “That means that their results are easily replicated by their competitors,” he said. “Though there is value in being first because while your competition catches up, you move on to the next thing.”
    “When you force your competition into a permanent catch-up game, unless they have very strong leadership, they will fall into a ‘fast follower’ mentality without realizing it,” he said.
    “By being aggressive and being first,” he continued, “Nvidia can cement the idea that they are the only true innovators, which drives further demand for their products.”
    Although Blackwell may be a repackaging exercise, he added, it has a real net benefit. “In practical terms, people using Blackwell will be able to do more compute faster for the same power and space budget,” he noted. “That will allow solutions based on Blackwell to outpace and outperform their competition.”
    Plug-Compatible With the Past
    O’Donnell asserted that the Blackwell architecture’s second-generation transformer engine is a significant advancement because it reduces AI floating-point calculations to four bits from eight bits. “Practically speaking, by reducing these calculations down from 8-bit on previous generations, they can double the compute performance and model sizes they can support on Blackwell with this single change,” he said.
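    A rough illustration of why halving the bit width matters: weights stored at four bits occupy half the memory of eight-bit weights, so the same hardware can hold a model twice the size. The parameter count below is a made-up example for illustration, not an Nvidia figure.

        # Illustrative only: memory footprint of model weights at 8-bit vs. 4-bit
        # precision. The parameter count is hypothetical, not an Nvidia figure.

        params = 1_800_000_000_000  # hypothetical 1.8-trillion-parameter model

        bytes_fp8 = params * 8 / 8  # 8 bits per weight -> 1 byte each
        bytes_fp4 = params * 4 / 8  # 4 bits per weight -> half a byte each

        print(f"fp8 weights: {bytes_fp8 / 1e12:.1f} TB")  # 1.8 TB
        print(f"fp4 weights: {bytes_fp4 / 1e12:.1f} TB")  # 0.9 TB

    Halving the bytes per weight also halves the memory traffic for the same number of operations, which is how a single precision change can double both the supported model size and the effective throughput O’Donnell describes.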
    The new chips are also compatible with their predecessors. “If you already have Nvidia’s systems with the H100, Blackwell is plug-compatible,” observed Jack E. Gold, founder and principal analyst with J.Gold Associates, an IT advisory firm in Northborough, Mass.

    “In theory, you could just unplug the H100s and plug the Blackwells in,” he told TechNewsWorld. “Although you can do that theoretically, you might not be able to do that financially.” For example, Nvidia’s H100 chip costs $30,000 to $40,000 each. Although Nvidia did not reveal the pricing of its new AI chip line, it will probably be along those lines.
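    Assuming, as Gold speculates, that Blackwell lands in the same price range as the H100, the training fleets from Nvidia’s GPT-4-scale example translate into rough dollar figures as follows; the like-for-like pricing is an assumption, not a disclosed price.

        # Assumption: B200 priced in the same $30,000 to $40,000 range as the H100.
        # The article gives H100 pricing only; Blackwell pricing was not disclosed.

        low, high = 30_000, 40_000

        h100_fleet = 8_000  # chips Nvidia cited for a GPT-4-scale training run
        b200_fleet = 2_000

        print(f"H100 fleet: ${h100_fleet * low / 1e6:.0f}M to ${h100_fleet * high / 1e6:.0f}M")
        print(f"B200 fleet: ${b200_fleet * low / 1e6:.0f}M to ${b200_fleet * high / 1e6:.0f}M")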
    Gold added that the Blackwell chips could help developers produce better AI applications. “The more data points you can analyze, the better the AI gets,” he explained. “What Nvidia is talking about with Blackwell is instead of being able to analyze billions of data points, you can analyze trillions.”
    Also announced at GTC were Nvidia Inference Microservices (NIM). “NIM tools are built on top of Nvidia’s CUDA platform and will enable businesses to bring custom applications and pretrained AI models into production environments, which should aid these firms in bringing new AI products to market,” Brian Colello, an equity strategist with Morningstar Research Services, in Chicago, wrote in an analyst’s note Tuesday.
    Helping Deploy AI
    “Big companies with data centers can adopt new technologies quickly and deploy them faster, but most human beings are in small and medium businesses that don’t have the resources to buy, customize, and deploy new technologies. Anything like NIM that can help them adopt new technology and deploy it more easily will be a benefit to them,” explained Shane Rau, a semiconductor analyst with IDC, a global market research firm.
    “With NIM, you’ll find models specific to what you want to do,” he told TechNewsWorld. “Not everyone wants to do AI in general. They want to do AI that’s specifically relevant to their company or enterprise.”
    While NIM may not be as exciting as the latest hardware designs, O’Donnell noted that it is significantly more important in the long run for several reasons.

    “First,” he wrote, “it’s supposed to make it faster and more efficient for companies to move from GenAI experiments and POCs (proof of concepts) into real-world production. There simply aren’t enough data scientists and GenAI programming experts to go around, so many companies who’ve been eager to deploy GenAI have been limited by technical challenges. As a result, it’s great to see Nvidia helping ease this process.”
    “Second,” he continued, “these new microservices allow for the creation of an entire new revenue stream and business strategy for Nvidia because they can be licensed on a per GPU/per hour basis (as well as other variations). This could prove to be an important, long-lasting, and more diversified means of generating income for Nvidia, so even though it’s early days, this is going to be important to watch.”
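    For a sense of what moving pretrained models into production looks like from a developer’s seat, here is a minimal sketch of querying a locally hosted inference microservice. The endpoint URL, port, model name, and request schema are illustrative assumptions; the article does not document NIM’s actual interface.

        # Hypothetical sketch: querying a locally deployed inference microservice.
        # The endpoint, port, and model name are assumptions for illustration;
        # the article does not specify NIM's actual API.

        import json
        import urllib.request

        payload = {
            "model": "example-llm",  # placeholder model identifier
            "messages": [{"role": "user", "content": "Summarize our Q3 support tickets."}],
            "max_tokens": 256,
        }

        req = urllib.request.Request(
            "http://localhost:8000/v1/chat/completions",  # assumed local endpoint
            data=json.dumps(payload).encode("utf-8"),
            headers={"Content-Type": "application/json"},
        )

        with urllib.request.urlopen(req) as resp:
            print(json.load(resp)["choices"][0]["message"]["content"])

    The appeal Rau describes is that a business consumes the model through a plain web request like this one, without needing in-house data scientists to assemble the serving stack underneath it.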
    Entrenched Leader
    Rau predicted that Nvidia will remain entrenched as the AI processing platform of choice for the foreseeable future. “But competitors like AMD and Intel will be able to take modest portions of the GPU market,” he said. “And because there are different chips you can use for AI (microprocessors, FPGAs, and ASICs), those competing technologies will be competing for market share and growing.”
    “There are very few threats to Nvidia’s dominance in this market,” added Abdullah Anwer Ahmed, founder of Serene Data Ops, a data management company in San Francisco.
    “On top of their superior hardware, their software solution CUDA has been the foundation of the underlying AI segments for over a decade,” he told TechNewsWorld.
    “The main threat is that Amazon, Google, and Microsoft/OpenAI are working on building their own chips optimized around these models,” he continued. “Google already has their ‘TPU’ chip in production. Amazon and OpenAI have hinted at similar projects.”
    “In any case, building one’s own GPUs is an option only available to the absolute largest companies,” he added. “Most of the LLM industry will continue to buy Nvidia GPUs.”
