Overview
Alibaba Cloud’s full-stack solution for generative AI (GenAI) provides whole-process services for foundation models (FMs) and other AI development tasks. This solution helps you build and optimize FMs, fine-tune them according to your business preferences, and deploy them easily as online services, all on purpose-built AI infrastructure optimized for performance and efficiency. Regardless of the scale and stage of your business, this solution enables you to create new and intelligent customer experiences and drive business transformation with innovations in generative AI.








• Unified Hardware and Software Acceleration for AI
1. GPUs for Model Training and Inference
Model Training: gn7 series of ECS instances power large-scale training tasks with high-performance GPUs.
Model Inference: gn6 series of ECS instances provide a cost-effective choice for model inference tasks
2. AI Acceleration
You can leverage GPU Accelerator AIACC to speed up AI training tasks by up to 70% and inference tasks by 2-3 times according to Stanford DAWN Deep Learning Benchmark.
Data Preparation
Ready your data for model training with intelligent, customizable, and highly efficient multimodal data labeling services
Model Development
Build foundation models with our one-stop visualized modeling tool - PAI-Designer, or perform interactive development with Notebook in PAI-DSW
Model Training
Train models with PAI-DLC, our one-stop platform for cloud-native deep learning and training compatible with predefined and customized algorithm frameworks
Model Deployment
Deploy your model as an online service or a web app with PAI-EAS, which supports push-button deployment of large-scale complex models
Tongyi Qianwen (Qwen)
Alibaba Cloud provides a series of open-source Tongyi Qianwen models: Qwen, the LLM; Qwen-VL, the large vision and language model; Qwen-Audio, the large audio language model; Qwen-Coder, the coding model; and Qwen-Math, the mathematical model. Qwen models are pre-trained on multilingual data covering various industries and domains, and offer a wide range of capabilities, including multimodal understanding and generation, state-of-the-art image processing, and fully managed APIs to support your innovation in generative AI. Qwen2.5 further improves its overall performance, especially in coding and mathematics, and supports up to 29 languages including English and Chinese.
You can easily fine-tune Qwen models with your enterprise data and deploy them as online services that understand your business.
Wan2.1
Wan2.1 is an open-source, versatile video foundation model suite for text-to-video and image-to-video generation. It excels at generating realistic visuals by accurately handling complex movements, enhancing pixel quality, adhering to physical principles, and optimizing the precision of instruction execution.
DeepSeek
DeepSeek-V3 is a high-performence LLM featuring a mixture of experts (MoE) architecture. DeepSeek-R1 is trained based on DeepSeek-V3-Base. The Model Gallery of PAI provides accelerated deployment options, such as BladeLLM, SGLang, and vLLM, so you can deploy DeepSeek models with one click.
Llama 3
LLaMA 3 is a powerful open-source LLM with a large set of training data. It focuses on innovation, scalability, and simplicity with several architectural improvements over its predecessor, LLaMA 2. You can access, fine-tune, and deploy LLaMA 3 with Platform for AI (PAI) in a few simple steps.
Built-In Model Inference and Evaluation Workflows
Speed up model development workflows with comprehensive tools designed to support SFT and LoRA, built-in model compression and inference acceleration, multi-dimensional model evaluation in visualized templates, and one-click model deployment
One-Click RAG Setup with AnalyticDB for PostgreSQL
Model Studio jointly applies in-depth retrieval optimization with AnalyticDB for PostgreSQL, which provides retrieval capability of over 10 billion vectors and is compatible with a variety of Alibaba Cloud AI products. Learn more about RAG >
Simplified GenAI Application Development
Accelerate generative AI application development with pre-built workflows on visualized canvas, highly customizable orchestration, template-based prompt engineering, and a rich set of APIs for easy integration with your business system
Comprehensive Security Measures
Secure your enterprise data in storage and transmission by completing model and app development in your dedicated Virtual Private Cloud (VPC) network and accessing data with PrivateLink, apply customizable content governance to prompts and content, and combine responsible AI principles with tools for human accountability

AI Doc
AI Doc revolutionizes digital document management for enterprises with the power of LLMs, including Qwen models. It efficiently parses various documents, accurately extracts information based on business requirements, and swiftly generates tailored documents.

Automatic Speech Recognition
This solution offers high-precision capabilities to convert speech from audio and video files to text in complex environments and identify English, Mandarin, and Cantonese speech across multilingual contexts, with more languages to come.

Lingma
Alibaba Cloud's Lingma is a coding copilot powered by Qwen models. It provides features such as intelligent code generation, AI Chat for developers, multi-file code modification, and automatic code execution capabilities.
New Ways of Smart Work
Tongyi Qianwen Transforms Work with High-Level Automation and Optimization
New Experience of Smart Shopping
Tongyi Qianwen Revolutionizes Retail with Accurate Recommendations and Intelligent Services
New Ways of Smart Living
Tongyi Qianwen Customizes Lifestyles with Highly Personalized Services




