This app is powered by the technology of Realtime Communication, Realtime Transcription, a Large Language Model (LLM), and Text to Speech extensions. The TEN Framework makes the workflow super easy! The iOS Demo resembles the web demo and acts as the mobile frontend to the AI Agent. You may ask the Agent any general question.
For more details of the project, view the guide.
- Agora Developer account
- TEN Frameworks Agent
- XCode
- iOS Device
- Text to Speech Support (e.g. API Key from Azure)
- LLM Support (e.g. OpenAI API key)
First you should have gotten the TEN Agent working in your environment. The playground part is optional and it can be stopped for the test for this application. You will just need the Server running.
- Clone this GitHub project.
- Open the project with Xcode and import the Agora SDK plugin. The project includes the Agora Video SDK as a dependency package. Once Xcode loads the project, the package download should start automatically and finish in a few minutes.
- Enter your AppID in the Config file. An AppID with token security enabled is required. Note that this should match your AppID in the .env file of the TEN Agent setup.
- Enter your TEN server's URL. If you are running it locally, it is http://localhost:8080.
Build to the iOS Device.
You may also run the app from the Simulator. However, it is recommended to use headphones in this setup, as the Agent’s conversation could be captured through the computer microphone as input and cause confusion.
The App will generate a random channel name whenever the home screen is loaded. You may change the text for your channel name. Then press Join to chat with the Agent.
For reference, it is worthwhile to check out the following resources: