tag

Monday, June 01, 2026 | Daily Newspaper published by GPPC Doha, Qatar.

Tag Results for "data center" (2 articles)

Nvidia CEO Jensen Huang.
Business

Nvidia tells sceptical investors AI is ready to go mainstream

Nvidia Corp, facing more investor scepticism, used its latest quarterly report to tout progress in diversifying the company, which aims to rely less on the giant data center operators that have fueled its runaway growth.Though spending has continued to surge from large data center clients — a group known as hyperscalers — Nvidia predicted that a vast array of other businesses and governments would soon become a bigger source of revenue. They’re poised to snap up Nvidia’s chips and other computing products to support their own artificial intelligence ambitions.Down the road, so-called physical AI will bring a colossal new opportunity in the form of robots and automated vehicles, Chief Executive Officer Jensen Huang said on a conference call with analysts. “We’ve got it all covered,” he said.But investors have become harder to impress. Even after the company beat analysts’ estimates with its results and forecast, the shares were little changed after markets opened in New York on Thursday. Shareholders weren’t swayed by an expansion of investor rewards, including a massive increase to the company’s dividend.Sales in the three months ending in July will be about $91bn, the company said in its quarterly report. That topped the average estimate of $87bn, though analysts’ projections ranged as high as $96bn, according to data compiled by Bloomberg.At the same time, the company is facing the first major challenges to its dominance in AI computing, with a variety of chipmakers trying to carve out a piece of the business. And major buyers of Nvidia’s technology are developing their own in-house components.Nvidia shares had gained 20% this year heading into the report. That increase outpaced the S&P 500 but trailed most major chip peers.Nvidia is the top seller of AI accelerators, chips used to develop artificial intelligence models. But it faces growing competition from across Silicon Valley. Advanced Micro Devices Inc has rival processors, and Broadcom Inc and Alphabet Inc’s Google are attacking the market with their own technology.Nvidia remains in an enviable position, with Wall Street predicting that the company’s revenue will account for more than a third of the entire semiconductor sector’s sales this year.“The build-out of AI factories — the largest infrastructure expansion in human history — is accelerating at extraordinary speed,” Huang said in a statement.Data center spending, which is the main source of Nvidia’s revenue, hasn’t shown signs of letting up. Hyperscalers plan to shell out a combined total of roughly $725bn on AI this year. And based on Nvidia’s latest results, revenue from those companies continues to outpace other sources.That hasn’t just buoyed sales of accelerators. General-purpose CPUs, or central processing units, also are in greater demand. That’s lifted results for Intel Corp. and AMD. Chip upstarts are getting a boost as well: Cerebras Systems Inc, which offers a novel product based on large pieces of silicon, had the year’s biggest initial public offering last week.Santa Clara, California-based Nvidia doesn’t just sell accelerators. It offers a range of chips, as well as networking, software, AI models and even complete computer systems. That helps make its reach and capabilities unassailable, Nvidia management has argued. The company has said it has more orders than it can fill and is investing to add supply to meet that demand.In the three months ended April 26, Nvidia’s sales gained 85% to $81.6bn. Analysts had estimated $79.2bn on average. Profit, minus certain items, climbed to $1.87 a share. That beat a projection of $1.77. Adjusted gross margin, the percentage of revenue remaining after deducting costs of production, was 75%.Nvidia boosted its quarterly dividend to 25 cents a share from a penny. And the chipmaker announced $80bn in stock repurchases.Nvidia’s all-important data center unit generated revenue of $75.2bn, compared with an estimate of $73.5bn. Networking, part of the data center division, delivered $14.8bn in sales, versus an estimate of $12.7bn.As part of its report, Nvidia said it was transitioning to a new framework that would better reflect “its current and future growth drivers.” Its data centre sales figures will now separate hyperscalers from a group it calls ACIE, for AI clouds, industrial and enterprise customers.The company is on course to record total revenue of more than $370bn this year, according to estimates. By that measure, it will be roughly 22 times the size it was in fiscal 2021. Nvidia easily chalks up more sales in a quarter than its next three largest rivals combined.Huang has just returned from a trip with President Donald Trump to China, the largest market for semiconductors overall. US export rules have stymied Nvidia’s growth in that country by restricting sales of AI accelerators on national security grounds.The Trump administration has begun allowing older Nvidia products to be sold to Chinese customers. But Beijing, trying to cultivate local suppliers, has resisted that initiative. That’s left Nvidia mostly locked out of a market that it has said could generate $50bn a year. 

Gulf Times
Qatar

Positron targets Middle East’s AI infrastructure with energy-efficient inference

Across the Gulf, governments are investing billions in sovereign AI initiatives, data center expansions, and smart city ecosystems. From NEOM’s greenfield infrastructure to Abu Dhabi’s G42-backed cloud services and Dubai’s push to embed AI in public services, the region is making AI central to its long-term economic leadership. But as these ambitions scale, they demand energy-efficient, high-performance infrastructure capable of running AI at national levels—without breaking the grids or the budgets. This challenge is prompting a closer look at energy-efficient alternatives to running AI models.This shift is already underway in the U.S., where trillions of dollars are being poured into data center infrastructure. With it, a wave of new players is emerging with hardware optimized for efficiency, scalability, and cost-effectiveness in AI workloads. One of the most promising is Positron, a startup with deep connections to capital partners behind well-known brands—two of the most compute-intensive sectors in AI. The company recently raised $51M to scale its inference accelerators, designed for high power efficiency, strong performance-per-dollar, and optimized memory bandwidth, key factors for large-scale inference deployment.As Middle Eastern nations plant their flags and chart the course of their AI futures, it’s worth looking for solutions beyond obvious hardware, like expensive, non-specialized GPUs. Positron’s recent deployments with leading US neoclouds like Cloudflare might be an indicator of the powerful and efficient infrastructure that will soon be deployed in many countries in the Middle East.Where AI Gets ExpensiveWhile hardware choices often focus on raw power or training benchmarks, the real costs and energy demands come after the model is trained. Inference, the phase where AI models are actually deployed and used, is what makes or breaks infrastructure strategies. And it’s where the gap between traditional GPUs and purpose-built systems becomes most apparent.In the training phase, models learn from data to make predictions and perform tasks. Inference is the subsequent phase where a trained model uses its knowledge to process new data and generate outputs like user prompts and synthetic content, or power customer-facing AI assistants. Where training is a one-time cost, inference is a continuous process that scales with every user interaction on every AI platform. And today, inference has become the largest and fastest-growing source of AI’s ongoing energy consumption.Positron is directly addressing this challenge with its inference system, designed from the ground up to maximize memory capacity and support large-scale deployment across neocloud platforms and enterprise environments. By focusing on efficiency, memory optimization, and compatibility, Positron is helping AI builders and infrastructure providers improve performance while working toward more energy-efficient operations.Rethinking AI’s Cost CurveToday, inference accounts for nearly 90 percent of all model-related workloads across both consumer and enterprise applications. Despite this, most inference is still being run on power-hungry general-purpose GPUs, which were originally designed to train large, flexible models, not for the demands of real-time, high-throughput inference workloads.This misalignment has created a hidden cost structure in which enterprises and cloud providers are forced to rely on expensive GPU hardware to perform repetitive, memory-bound inference tasks, often utilizing only a fraction of each chip’s capacity. This inefficiency inflates operating costs and puts additional strain on power grids in regions where energy management and infrastructure optimization are critical concerns.Positron’s answer to this problem is Atlas, a full-stack inference system that leverages reprogrammable FPGAs to maximize memory bandwidth and throughput. Atlas is designed to make more efficient use of memory bandwidth than standard GPUs, with a focus on improving cost-effectiveness and reducing energy consumption. The platform is already in use by neocloud companies, including Cloudflare and Parasail, where it’s powering large-scale inference workloads with no code modifications required.Strategic Funding to Scale AI AccessPositron just closed a $51.6 million Series A funding round led by Valor Equity Partners, Atreides Management, and DFJ Growth with participation from 1517 Fund, Flume Ventures, and Unless. The round brings the company’s total funding to $75 million and reflects growing investor confidence in inference-optimized infrastructure as the next major wave in AI.This funding will support the continued deployment of Positron’s first-gen product and accelerate the rollout of Titan, the company’s upcoming ASIC-based inference platform slated for release in 2026. Titan is being developed to support some of the largest and most complex AI models in use today, with a planned capacity of up to 16 trillion parameters and two terabytes of high-speed memory per chip. Its modular, high-throughput design is intended to address the growing demand for scalable, sovereign AI infrastructure in both enterprise and government deployments.Unlike traditional solutions, Positron’s hardware supports existing model binaries and APIs, minimizing the need for code rewrites or major system changes.Enabling Innovation in Emerging MarketsAs AI deployment expands beyond Big Tech and into broader enterprise and public-sector applications, the economic burden of inference is becoming increasingly visible. AI-driven tools are now reaching users across every geography and industry—but their accessibility still hinges on the cost and efficiency of the underlying computer infrastructure.By making inference more affordable and energy-efficient, AI will become accessible beyond the walls of Big Tech and elite research labs, which will open the door for a broader range of users to fine-tune open-source models and run them locally. This unlocks entirely new categories of use cases, from healthcare diagnostics in rural clinics to real-time translation in schools——applications that only become viable in power-constrained environments with energy-efficient inference solutions.This transformation is especially relevant in the Middle East, where governments are prioritizing both AI leadership and sustainability. By decoupling inference performance from power-hungry GPUs, Positron offers a path forward for building AI infrastructure that is both economically and environmentally aligned with the region’s strategic goals.The Future of AI InfrastructureLooking ahead, the next phase of AI evolution will not be determined by who can train the largest model, but by who can deploy it most efficiently. Inference is already the dominant workload—and its impact on cost, access, and infrastructure design is only accelerating. As AI becomes a foundational layer across sectors, the ability to serve models at scale—securely, affordably, and sustainably—will shape the region’s global economic posture.Positron’s inference-first architecture presents a credible alternative to GPU-dominated infrastructure. By designing systems tailored to how AI is actually used in production, the company is unlocking new opportunities for cloud providers, national governments, and enterprise platforms alike. In the Middle East, a region increasingly focused on energy efficiency, technological development, and AI infrastructure, this shift in architecture could power the next chapter of AI growth in digitally ambitious, resource-aware economies like those across the Middle East.