Pinecall's architecture is designed to provide a seamless voice AI experience through multiple input channels and specialized components working together. Understanding this architecture will help you better integrate and leverage the platform's capabilities.
Pinecall supports multiple input channels to provide flexibility in how users interact with the platform:
The telephony gateway connects to traditional telephone networks and VoIP services, managing inbound and outbound calls. It handles call setup, teardown, DTMF processing, and voice data transmission. This layer abstracts the complexities of telecom protocols and ensures high-quality voice connections across both traditional telephony and web-based audio channels.
This component converts between audio and text in real-time. It uses advanced speech-to-text (STT) engines to transcribe caller speech and text-to-speech (TTS) engines to generate natural-sounding voice responses. Pinecall employs state-of-the-art neural TTS models for human-like pronunciation, intonation, and emotion. The speech processing layer handles audio from both phone calls and browser-based web calls with consistent quality.
The orchestration layer coordinates the flow of information between all system components. It handles session management, routes messages, manages state, and ensures timely processing of all events. This central component maintains system coherence and enables complex multi-turn conversations by directing data between speech processing, LLM interface, and integration points.
This layer interfaces with large language models (LLMs) like GPT-4 and manages the conversational logic of AI agents. It processes transcribed speech, maintains conversation context, generates appropriate responses, and handles specialized functionality like information retrieval and API calls. The agent logic is channel-agnostic, providing consistent intelligence whether the user is calling from a phone or connecting through a web browser.
Pinecall exposes RESTful APIs for management operations and WebSocket interfaces for real-time updates. These interfaces allow developers to create, configure, and monitor voice AI systems, as well as integrate with external applications and data sources. The API layer provides unified access to both phone and web-based calling functionality, allowing seamless integration with your existing systems.
Pinecall's architecture provides several integration points for developers:
Create and manage agents, phone numbers, web call endpoints, and calls. Configure webhooks, retrieve analytics, and more.
Language-specific libraries for easier integration with your applications, including web client SDKs for browser-based calling.
Receive notifications about call events and agent actions across both phone and web channels.
Real-time updates and monitoring of ongoing calls and agent activities for both phone and web-based interactions.
Now that you understand Pinecall's architecture, you might want to explore: