Red Hat Enterprise Linux AI is an optimized platform for running LLMs. It includes the Red Hat AI Inference Server for fast, cost-effective inference.
Red Hat Enterprise Linux AI is a foundation model platform for running LLMs in individual server environments. The solution includes the Red Hat AI Inference Server, which provides an immutable, purpose-built appliance optimized for inference. Packaging the OS and application together, Red Hat Enterprise Linux AI facilitates Day 1 operations to optimize model inference across the hybrid cloud. Its vLLM runtime maximizes throughput and minimizes latency. This is complemented by an LLM compressor for further model optimization and a pre-optimized model repository, ensuring fast and cost-effective deployments.
Deployment and Pricing
The hourly pricing for RHEL AI is $0.05 per GPU and there are four plans to choose from for both AMD and NVIDIA instances - 1 GPU, 2 GPU, 4 GPU, and 8 GPU. Please choose the plan that matches the instance type you want to use based on the number of included GPUs.
Contact Red Hat Sales for volume discount inquiries.
Support
This offer includes direct support from Red Hat with 24x7 access to Red Hat support engineers 24x7 for high-severity issues, and access to our Knowledgebase and other tools available in our Customer Portal.
You will need to activate your support subscription for this product. We have created a no cost Azure SaaS offering to help automate the support activation process. Please review the details in the Red Hat Subscription Support Registration listing overview and then subscribe to the offering.
Availability
This offer is only for EMEA countries. Click here if you are outside of EMEA.