AWS SageMaker Accelerates AI Inference with G7e Instances
AWS ML Blog
Amazon SageMaker AI now offers G7e instances built on NVIDIA RTX PRO 6000 Blackwell Server Edition GPUs for generative AI inference. Their large GDDR7 memory makes it possible to deploy large open-source foundation models such as GPT-OSS-120B and Nemotron-3-Super-120B. AWS positions the instances as a cost-effective, high-performance option for organizations running complex AI workloads on AWS.
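To see why GPU memory decides which models fit, here is a rough back-of-the-envelope sketch of weight memory at different precisions. The 120B parameter count is read off the model names, and the 96 GB per-GPU figure and 20% headroom for KV cache and activations are assumptions for illustration; check the actual instance and model specifications before sizing a deployment.

```python
import math


def weight_footprint_gb(num_params: float, bits_per_param: float) -> float:
    """Approximate memory for model weights alone, in decimal gigabytes."""
    return num_params * bits_per_param / 8 / 1e9


def min_gpus(footprint_gb: float, gpu_memory_gb: float, usable_fraction: float) -> int:
    """Smallest GPU count whose usable memory covers the weight footprint."""
    usable_per_gpu = gpu_memory_gb * usable_fraction
    return math.ceil(footprint_gb / usable_per_gpu)


# Assumptions (not from the article): nominal parameter count, per-GPU memory,
# and the fraction of memory left for weights after KV cache and activations.
NUM_PARAMS = 120e9
GPU_MEMORY_GB = 96
USABLE_FRACTION = 0.8

if __name__ == "__main__":
    for bits in (16, 8, 4):
        footprint = weight_footprint_gb(NUM_PARAMS, bits)
        gpus = min_gpus(footprint, GPU_MEMORY_GB, USABLE_FRACTION)
        print(f"{bits}-bit weights: ~{footprint:.0f} GB, at least {gpus} GPU(s)")
```

This estimate covers weights only. Real deployments also need memory for the KV cache, activations, and runtime overhead, which grow with batch size and context length.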
Tags
ai
product
Original Source
AWS ML Blog — aws-ml.amazon.com