Cloud Native 12 min read

Build an AI‑Powered English Speaking Coach with Alibaba Cloud Function Compute

This guide walks you through creating an AI English‑speaking companion by deploying a web app with Function Compute, integrating Alibaba's AI model platform, Intelligent Media Service, and real‑time audio (ARTC), covering architecture, workflow setup, service configuration, deployment steps, and validation.

Alibaba Cloud Native
Alibaba Cloud Native
Alibaba Cloud Native
Build an AI‑Powered English Speaking Coach with Alibaba Cloud Function Compute

Solution Overview

The solution provides an AI‑driven English speaking practice platform that combines pronunciation training, dialogue simulation, instant feedback, and personalized scenario modeling. It uses Alibaba Cloud Function Compute to host a web app, the AI model platform (Bailian) for large‑model services, Intelligent Media Service for real‑time interaction, and ARTC for audio streaming.

Technical Architecture

The core cloud services involved are:

Function Compute (FC) : Deploys the web application.

Bailian Large‑Model Service : Provides one‑stop large‑model development and inference.

Intelligent Media Service (IMS) : Supplies audio processing, AI agent interaction, and speech synthesis.

Realtime Audio‑Video (ARTC) : Enables real‑time audio calls between the AI agent and the user.

These components are orchestrated to deliver a seamless AI‑assistant that can converse in English, assess user responses, and suggest improvements.

Creating the Bailian Workflow Application

Log in to the Bailian console and click My Applications → Add Application → Workflow Application → Create Dialogue Workflow .

In the workflow editor, rename the two default start‑node parameters to difficulty and scene to represent level and scenario.

Drag a large‑model node onto the canvas, connect it to the start node, and configure the model parameters (e.g., Model: Tongyi Qianwen‑Max, Temperature: default, Max Tokens: 1024, Search: off).

Set the System Prompt and User Prompt (e.g., ${sys.query}) and enable context as needed.

Connect the large‑model node to an End node, configure the output mode to Text, and map the model result to the next node.

Save the workflow.

Creating the Real‑Time Audio (ARTC) Application

Enable the Video Live service (billing by traffic usage) and open the Video Cloud console.

Navigate to Live + → Realtime Audio‑Video → Application Management and click Create Application .

Enter a custom instance name, accept the service agreement, and purchase the app.

After activation, note the generated AppId and AppKey for later configuration.

Creating the AI Agent

Open the AI Real‑Time Interaction console → Agent Management and click Create Agent .

Name the agent (e.g., Agent‑English) and bind it to the previously created workflow ( Workflow‑English) and ARTC application.

Submit the agent configuration.

Deploying the Web Application with Function Compute

Use the provided Function Compute template ( ai-english-coach) and configure the following parameters before creating the default environment:

Deployment Type : Direct deployment.

Application Name : Auto‑generated (default).

Role Name : Default role (grant additional permissions if prompted).

Realtime Audio‑Video AppId / AppKey : Values obtained from the ARTC console.

Agent Id : ID of the Agent‑English created earlier.

RAM Role ARN : Role used by Function Compute to access IMS.

Region : Default to China East 1 (Hangzhou).

Namespace : Default (can be customized for multiple instances).

Verification and Usage

After deployment, locate the example app’s domain name in the environment details and open it in a browser (HTTPS is required for microphone access). The app will prompt for microphone permission; once granted, you can select a scenario and difficulty level, start a conversation, and receive real‑time scoring and feedback from the AI assistant.

Note that the temporary domain provided by the Serverless Devs sandbox does not have a valid SSL/TLS certificate; for production use, bind a proper certificate to ensure secure HTTPS connections.

Summary

By following the steps above, you can build and deploy an AI‑enhanced English speaking coach that leverages Function Compute, Bailian large‑model services, Intelligent Media Service, and ARTC, offering personalized, real‑time language practice.

Original Source

Signed-in readers can open the original source through BestHub's protected redirect.

Sign in to view source
Republication Notice

This article has been distilled and summarized from source material, then republished for learning and reference. If you believe it infringes your rights, please contactadmin@besthub.devand we will review it promptly.

AIFunction ComputeEnglish learningRealtime Audio
Alibaba Cloud Native
Written by

Alibaba Cloud Native

We publish cloud-native tech news, curate in-depth content, host regular events and live streams, and share Alibaba product and user case studies. Join us to explore and share the cloud-native insights you need.

0 followers
Reader feedback

How this landed with the community

Sign in to like

Rate this article

Was this worth your time?

Sign in to rate
Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.