OpenAI has signed a five-year, $300 billion contract with Oracle, under which Oracle will provide 4.5 gigawatts of data center capacity for Project Stargate starting in 2027 to support the development of AI models. This historic deal, involving partners like SoftBank, aims to boost U.S. AI infrastructure amid industry competition and regulatory scrutiny.
Google Cloud has launched the GKE Inference Gateway and Inference Quickstart for efficient AI inference on Kubernetes. The tools enhance routing, load balancing, and autoscaling while reducing deployment time and latency by up to 30%. They support scalable generative AI applications in sectors like healthcare and finance, positioning GKE as a leader in AI orchestration.