AI TECH
Human-Like Voice Engine / TTS (Text-to-Speech)
The best-sounding voice engines currently on the market are ElevenLabs and Play.HT.
Tortoise TTS is reportedly the foundation of ElevenLabs, which has fine-tuned it for more accurate, clearer-sounding voices. Some engines sound better than others, but we aim to develop the most realistic voice.
Our vision at Real Voice AI is to deliver a voice agent/assistant so human that you'll forget you are talking to a robot. Here are the elements needed to create a realistic, human-like experience:
1. Understand Context, Not Just Keywords
2. Use Conversational Fillers & Small Talk Naturally
3. Vary Tone & Inflection
4. Give It an Appropriate Personality
5. Implement Randomness & Variation
6. Incorporate Interruption Handling & Repeat Responses
7. Admit Mistakes & Ignorance
8. Integrate Empathetic Response
9. Continuously Expand Its Knowledge Base
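As a small illustration of items 2 and 5, the sketch below rotates through a pool of acknowledgement phrases and never repeats the one it just used. The class name `PhraseVariator` and the phrase list are hypothetical and chosen only for this example; they do not describe the production implementation.

```python
import random


class PhraseVariator:
    """Pick a phrase at random, never repeating the previous pick."""

    def __init__(self, phrases, rng=None):
        # Drop duplicates while keeping order, so "no repeat" is always possible.
        self._phrases = list(dict.fromkeys(phrases))
        if len(self._phrases) < 2:
            raise ValueError("need at least two distinct phrases to vary")
        self._rng = rng or random.Random()
        self._last = None

    def next(self):
        # Exclude the most recent phrase from the candidates.
        candidates = [p for p in self._phrases if p != self._last]
        self._last = self._rng.choice(candidates)
        return self._last


# Hypothetical filler phrases; a seeded generator keeps the demo repeatable.
acknowledgements = PhraseVariator(
    ["Got it.", "Makes sense.", "Understood.", "Okay, great."],
    rng=random.Random(42),
)
picked = [acknowledgements.next() for _ in range(10)]
```

The same idea extends to greetings, transitions, and closing lines, so that two calls rarely sound identical.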
Emotional Prosody
Wikipedia Definition: Emotional prosody or affective prosody is the various non-verbal aspects of language that allow people to convey or understand emotion. It includes an individual's tone of voice in speech that is conveyed through changes in pitch, loudness, timbre, speech rate, and pauses.
In the realm of human communication, the nuances of how we say something often carry as much weight as the words themselves.
Vector Database - Infinite Memory
In the context of an AI voice agent, a vector database for "infinite memory" is a specialized database designed to store and manage data in the form of vectors.
The stored data can include your company FAQ, a knowledge base, all the content from your website, and data on customers.
This type of database is optimized for high-speed search and retrieval of complex, multidimensional data, making it particularly suited for applications in AI and machine learning, where such data representations are common.
In an AI voice agent, the concept of "infinite memory" gives the agent the ability to access and leverage data efficiently, enabling the agent to understand, process, and respond to voice inputs with remarkable accuracy and relevance.
The advantage of using a vector database in this context lies in understanding queries and generating contextually appropriate responses. The AI voice agent can provide responses that are not only accurate but also drawn from a broader knowledge base, essentially mimicking an "infinite memory."
This setup significantly enhances the AI voice agent's capabilities, enabling it to deliver more personalized, context-aware, and intelligent responses, thereby improving the user experience.
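The retrieval idea described above can be sketched in a few lines. This is a minimal illustration, not the actual system: `embed` is a toy bag-of-words stand-in for a real embedding model, `VectorStore` is a hypothetical in-memory class rather than a real vector database, and the stored sentences are invented sample data.

```python
import math
from collections import Counter


def embed(text: str) -> Counter:
    """Toy stand-in for an embedding model: a bag-of-words count vector."""
    return Counter(text.lower().split())


def cosine_similarity(a: Counter, b: Counter) -> float:
    """Cosine of the angle between two sparse count vectors."""
    dot = sum(a[token] * b[token] for token in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


class VectorStore:
    """Hypothetical in-memory store: keeps (text, vector) pairs."""

    def __init__(self):
        self._items = []

    def add(self, text: str) -> None:
        self._items.append((text, embed(text)))

    def search(self, query: str, top_k: int = 1) -> list:
        """Return the top_k stored texts most similar to the query."""
        query_vector = embed(query)
        ranked = sorted(
            self._items,
            key=lambda item: cosine_similarity(query_vector, item[1]),
            reverse=True,
        )
        return [text for text, _ in ranked[:top_k]]


# Invented sample knowledge-base entries.
store = VectorStore()
store.add("Our office hours are 9am to 5pm on weekdays")
store.add("Refunds are processed within 14 days of the request")
store.add("We offer viewings for properties in the downtown area")

best_match = store.search("what are your office hours", top_k=1)
```

In a real deployment the toy `embed` function would be replaced by a learned embedding model and the list scan by an indexed vector database, but the flow of embedding, comparing, and returning the closest entries stays the same.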
Previously stored conversations represent a wealth of information that a salesperson, despite their best efforts, could never remember across all leads. Traditionally, sales professionals would rely on manually entering and referring to notes in a Customer Relationship Management (CRM) system to recall specific details from past interactions. However, this method is inherently limited by the salesperson's diligence in note-taking and their ability to efficiently navigate and interpret these notes during or in preparation for future interactions.
In contrast, our AI voice agent revolutionizes this process by remembering all data from previous conversations. This capability means that every detail, no matter how minor, from past interactions with leads and customers is instantly accessible to the agent. Unlike humans, who may forget or overlook critical pieces of information, the AI agent can recall and leverage this vast repository of data in real-time, ensuring that every interaction is informed by the complete history of conversations.
Large Language Model
Leveraging smaller language models, such as Mistral 7B, offers a practical and efficient alternative to heavier models such as those behind ChatGPT. Mistral 7B, with its compact size, is designed to deliver strong performance and accuracy while requiring significantly fewer computational resources. This efficiency translates to faster processing and lets applications run more cost-effectively, with lower energy consumption. Such models are particularly advantageous where rapid response times are critical, since they can respond faster than larger counterparts while keeping output quality suitable for the task.
Telephony
Phone number rental, porting, and inbound and outbound calls. We can integrate with most telephony providers that have an API. Here are some examples:
Twilio
Genesys
Telnyx
Vonage
RingCentral
Talkdesk
Five9
Hardware / LPU Inference Engine
NVIDIA GPUs have long been at the forefront of AI innovation, offering robust computational power that accelerates the processing of complex algorithms and large datasets, which are fundamental to machine learning and deep learning applications. Their highly parallel architecture and efficiency in handling many tasks simultaneously have made them the go-to hardware for researchers and developers aiming to push the boundaries of artificial intelligence.
However, Groq's introduction of the LPU (Language Processing Unit) inference engine, built on the company's Tensor Streaming Processor (TSP) architecture, represents a different approach to computational processing for AI. Groq's architecture, designed from the ground up for machine learning tasks, focuses on streamlining data flow and minimizing latency. This approach significantly reduces the time required for AI models to make inferences, increasing the speed at which AI applications can operate and opening new avenues for real-time processing in AI voice agents.
STT (Speech-to-Text)
The STT engine must listen to audio in real time and transcribe it into text with minimal latency, so that the agent can formulate a response and speak it back to the user using TTS (text-to-speech).
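The turn-taking flow described above (audio in, transcription, response, audio out) can be sketched as a simple pipeline. This is an illustrative outline under stated assumptions: `VoicePipeline` and the three `fake_*` functions are hypothetical stand-ins, and a real deployment would plug in actual STT, language model, and TTS services and stream audio rather than process whole buffers.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class VoicePipeline:
    """Chains the three stages of a single conversational turn."""

    transcribe: Callable[[bytes], str]      # STT: audio -> text
    generate_reply: Callable[[str], str]    # language model: text -> reply text
    synthesize: Callable[[str], bytes]      # TTS: reply text -> audio

    def handle_turn(self, audio_in: bytes) -> bytes:
        user_text = self.transcribe(audio_in)
        reply_text = self.generate_reply(user_text)
        return self.synthesize(reply_text)


# Stub stages so the sketch runs without any external service.
def fake_stt(audio: bytes) -> str:
    return audio.decode("utf-8")


def fake_llm(text: str) -> str:
    return f"You said: {text}"


def fake_tts(text: str) -> bytes:
    return text.encode("utf-8")


pipeline = VoicePipeline(fake_stt, fake_llm, fake_tts)
audio_out = pipeline.handle_turn(b"hello")
```

Because each stage adds to the delay the caller hears, the latency of the slowest stage tends to dominate how natural the conversation feels.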
Expanding Integrations
REST API
Webhooks
iCal Integration - Lark Calendar, Apple Calendar, Microsoft Exchange, Outlook, Google Calendar
Zapier Integration
Make.com Integration
Salesforce Plugin?
Prompt Engineering Structure
Background Information
Product Information
Target Audience
Value Proposition
Objection Handling
Rules
Example Script
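The structure above can be assembled programmatically. The sketch below joins the seven sections in a fixed order under `##` headings. The function name `build_prompt`, the heading style, and the placeholder contents are assumptions made for illustration, not a description of how the product builds its prompts.

```python
from typing import Mapping

# Section names taken from the structure listed above, in order.
SECTION_ORDER = (
    "Background Information",
    "Product Information",
    "Target Audience",
    "Value Proposition",
    "Objection Handling",
    "Rules",
    "Example Script",
)


def build_prompt(sections: Mapping[str, str]) -> str:
    """Join the sections in a fixed order, each under a '## HEADING' title."""
    missing = [name for name in SECTION_ORDER if not sections.get(name, "").strip()]
    if missing:
        # Fail loudly rather than send an incomplete prompt to the agent.
        raise ValueError("Missing prompt sections: " + ", ".join(missing))
    blocks = [f"## {name.upper()}\n{sections[name].strip()}" for name in SECTION_ORDER]
    return "\n\n".join(blocks)


# Placeholder content; real prompts would contain the actual business details.
example_sections = {name: f"[{name} goes here]" for name in SECTION_ORDER}
prompt = build_prompt(example_sections)
```

Validating that every section is present before a call starts helps catch incomplete prompts early, instead of discovering a missing rule or objection response mid-conversation.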
Real Estate AI - Buyer Inquiry and Lead Qualification Call Script Example
## BACKGROUND INFO
**Company Info:** [Your Real Estate Agency] is renowned for its dedication to matching clients with their ideal properties. Our team combines local market expertise with a personal touch to ensure every buyer's journey is smooth and successful.
**Value Proposition:** We offer personalized, data-driven property recommendations, ensuring our clients make informed decisions. Our extensive portfolio includes properties that cater to a wide range of preferences and budgets.
**Agent Information:**
- **Name:** [Your AI Assistant's Name]
- **Role:** AI Real Estate Assistant
- **Objective:** To engage potential buyers, understand their specific needs, and guide them towards finding their perfect property.
**Target Audience:** Individuals or families looking to purchase residential properties, from first-time homebuyers to seasoned investors seeking new opportunities.
## RULES
1. **Thorough Qualification:** Aim to collect detailed information to understand the buyer's exact needs.
2. **Active Listening:** Allow the buyer to express their preferences and requirements fully.
3. **Buyer-Centric Approach:** Focus the conversation on understanding and addressing the buyer's property needs.
4. **Professional and Friendly Tone:** Communicate in a manner that is both professional and welcoming.
5. **Confidentiality and Trust:** Assure the buyer that their information is confidential and valued.
6. **Subject Adherence:** If the conversation veers off-topic, politely steer it back to discussing the buyer's property preferences and the home-buying process.
## OBJECTION HANDLING
- **Concerns About Market Conditions:** Provide insights into current market trends and reassure the buyer that [Your Real Estate Agency] is equipped to find great options regardless of market fluctuations.
- **Budget Limitations:** Emphasize the variety of properties within our portfolio and our commitment to finding the best match within the buyer's budget.
- **Uncertainty about Location:** Offer to provide more information about different neighborhoods, including amenities, schools, and community lifestyle.
- **Hesitation to Proceed:** Reassure the buyer of the no-pressure environment and the value of viewing properties that meet their criteria.
## SCRIPT
**START SCRIPT/**
1. You: "Hi, this is [Your AI Assistant's Name] from [Your Real Estate Agency], where we specialize in finding your dream home. I understand you're exploring property options. Could I have your name, please?"
*Wait for the potential buyer to respond. Do not interrupt them.*
2. You: "Nice to meet you, [Buyer's Name]. To best assist you, could you tell me more about the type of property you're looking for? What are your must-have features?"
*Wait for the potential buyer to respond. Do not interrupt them.*
3. You: "Great choices! How about the preferred location? Are you looking at specific neighborhoods or areas? If you could spell it out for me, that would ensure we have the correct information."
*Wait for the potential buyer to respond. Do not interrupt them.*
4. You: "Understood. And what is your budget range for this purchase?"
*Wait for the potential buyer to respond. Do not interrupt them.*
5. You: "Thank you for that information. We have several properties that might fit your criteria. Would you be interested in scheduling a viewing or perhaps a consultation to discuss these options in more detail?"
*Wait for the potential buyer to respond. Do not interrupt them.*
6. You: "Perfect, [Buyer's Name]. I'll arrange that for you. Please expect a follow-up from our team shortly. In the meantime, feel free to visit our website or reach out with any questions. We're excited to help you find your perfect home. Have a great day!"