Sunday, October 13, 2024
HomeautomobileNVIDIA and Oracle to Speed up AI, Knowledge Processing for Enterprises

NVIDIA and Oracle to Speed up AI, Knowledge Processing for Enterprises



NVIDIA and Oracle to Speed up AI, Knowledge Processing for Enterprises

Enterprises are searching for more and more highly effective compute to assist their AI workloads and speed up knowledge processing. The effectivity gained can translate to raised returns for his or her investments in AI coaching and fine-tuning, and improved consumer experiences for AI inference.

On the Oracle CloudWorld convention right this moment, Oracle Cloud Infrastructure (OCI) introduced the primary zettascale OCI Supercluster, accelerated by the NVIDIA Blackwell platform, to assist enterprises prepare and deploy next-generation AI fashions utilizing greater than 100,000 of NVIDIA’s latest-generation GPUs.

OCI Superclusters permit prospects to select from a variety of NVIDIA GPUs and deploy them anyplace: on premises, public cloud and sovereign cloud. Set for availability within the first half of subsequent 12 months, the Blackwell-based programs can scale as much as 131,072 Blackwell GPUs with NVIDIA ConnectX-7 NICs for RoCEv2 or NVIDIA Quantum-2 InfiniBand networking to ship an astounding 2.4 zettaflops of peak AI compute to the cloud. (Learn the press launch to study extra about OCI Superclusters.)

On the present, Oracle additionally previewed NVIDIA GB200 NVL72 liquid-cooled bare-metal situations to assist energy generative AI functions. The situations are able to large-scale coaching with Quantum-2 InfiniBand and real-time inference of trillion-parameter fashions inside the expanded 72-GPU NVIDIA NVLink area, which may act as a single, large GPU.

This 12 months, OCI will provide NVIDIA HGX H200 — connecting eight NVIDIA H200 Tensor Core GPUs in a single bare-metal occasion through NVLink and NVLink Change, and scaling to 65,536 H200 GPUs with NVIDIA ConnectX-7 NICs over RoCEv2 cluster networking. The occasion is accessible to order for purchasers seeking to ship real-time inference at scale and speed up their coaching workloads. (Learn a weblog on OCI Superclusters with NVIDIA B200, GB200 and H200 GPUs.)

OCI additionally introduced common availability of NVIDIA L40S GPU-accelerated situations for midrange AI workloads, NVIDIA Omniverse and visualization. (Learn a weblog on OCI Superclusters with NVIDIA L40S GPUs.)

For single-node to multi-rack options, Oracle’s edge choices present scalable AI on the edge accelerated by NVIDIA GPUs, even in disconnected and distant places. For instance, smaller-scale deployments with Oracle’s Roving Edge System v2 will now assist as much as three NVIDIA L4 Tensor Core GPUs.

Firms are utilizing NVIDIA-powered OCI Superclusters to drive AI innovation. Basis mannequin startup Reka, for instance, is utilizing the clusters to develop superior multimodal AI fashions to develop enterprise brokers.

“Reka’s multimodal AI fashions, constructed with OCI and NVIDIA know-how, empower next-generation enterprise brokers that may learn, see, hear and communicate to make sense of our advanced world,” stated Dani Yogatama, cofounder and CEO of Reka. “With NVIDIA GPU-accelerated infrastructure, we are able to deal with very massive fashions and in depth contexts with ease, all whereas enabling dense and sparse coaching to scale effectively at cluster ranges.”

NVIDIA acquired the 2024 Oracle Know-how Answer Companion Award in Innovation for its full-stack strategy to innovation.

Accelerating Generative AI Oracle Database Workloads

Oracle Autonomous Database is gaining NVIDIA GPU assist for Oracle Machine Studying notebooks to permit prospects to speed up their knowledge processing workloads on Oracle Autonomous Database.

At Oracle CloudWorld, NVIDIA and Oracle are partnering to reveal three capabilities that present how the NVIDIA accelerated computing platform could possibly be used right this moment or sooner or later to speed up key parts of generative AI retrieval-augmented technology pipelines.

The primary will showcase how NVIDIA GPUs can be utilized to speed up bulk vector embeddings instantly from inside Oracle Autonomous Database Serverless to effectively carry enterprise knowledge nearer to AI. These vectors may be searched utilizing Oracle Database 23ai’s AI Vector Search.

The second demonstration will showcase a proof-of-concept prototype that makes use of NVIDIA GPUs, NVIDIA cuVS and an Oracle-developed offload framework to speed up vector graph index technology, which considerably reduces the time wanted to construct indexes for environment friendly vector searches.

The third demonstration illustrates how NVIDIA NIM, a set of easy-to-use inference microservices, can increase generative AI efficiency for textual content technology and translation use circumstances throughout a variety of mannequin sizes and concurrency ranges.

Collectively, these new Oracle Database capabilities and demonstrations spotlight how NVIDIA GPUs can be utilized to assist enterprises carry generative AI to their structured and unstructured knowledge housed in or managed by an Oracle Database.

Sovereign AI Worldwide

NVIDIA and Oracle are collaborating to ship sovereign AI infrastructure worldwide, serving to deal with the info residency wants of governments and enterprises.

Brazil-based startup Extensive Labs educated and deployed Amazonia IA, one of many first massive language fashions for Brazilian Portuguese, utilizing NVIDIA H100 Tensor Core GPUs and the NVIDIA NeMo framework in OCI’s Brazilian knowledge facilities to assist guarantee knowledge sovereignty.

“Growing a sovereign LLM permits us to supply purchasers a service that processes their knowledge inside Brazilian borders, giving Amazônia a novel market place,” stated Nelson Leoni, CEO of Extensive Labs. “Utilizing the NVIDIA NeMo framework, we efficiently educated Amazônia IA.”

In Japan, Nomura Analysis Institute, a number one world supplier of consulting companies and system options, is utilizing OCI’s Alloy infrastructure with NVIDIA GPUs to reinforce its monetary AI platform with LLMs working in accordance with monetary rules and knowledge sovereignty necessities.

Communication and collaboration firm Zoom will likely be utilizing NVIDIA GPUs in OCI’s Saudi Arabian knowledge facilities to assist assist compliance with native knowledge necessities.

And geospatial modeling firm RSS-Hydro is demonstrating how its flood mapping platform — constructed on the NVIDIA Omniverse platform and powered by L40S GPUs on OCI — can use digital twins to simulate flood impacts in Japan’s Kumamoto area, serving to mitigate the affect of local weather change.

These prospects are amongst quite a few nations and organizations constructing and deploying home AI functions powered by NVIDIA and OCI, driving financial resilience by sovereign AI infrastructure.

Enterprise-Prepared AI With NVIDIA and Oracle

Enterprises can speed up job automation on OCI by deploying NVIDIA software program comparable to NIM microservices and NVIDIA cuOpt with OCI’s scalable cloud options. These options allow enterprises to shortly undertake generative AI and construct agentic workflows for advanced duties like code technology and route optimization.

NVIDIA cuOpt, NIM, RAPIDS and extra are included within the NVIDIA AI Enterprise software program platform, out there on the Oracle Cloud Market.

Study Extra at Oracle CloudWorld 

Be a part of NVIDIA at Oracle CloudWorld 2024 to find out how the businesses’ collaboration is bringing AI and accelerated knowledge processing to the world’s organizations.

Register to the occasion to observe periods, see demos and be part of Oracle and NVIDIA for the answer keynote, “Unlock AI Efficiency with NVIDIA’s Accelerated Computing Platform” (SOL3866), on Wednesday, Sept. 11, in Las Vegas.

RELATED ARTICLES

LEAVE A REPLY

Please enter your comment!
Please enter your name here

- Advertisment -
Google search engine

Most Popular

Recent Comments