Level 3 Written Exam

Certified Voice AI Architect

📝 Take Exam: Start Level 3 Written Exam
Accept the assignment to get your exam repository. Submit answers by creating an issue.


Duration	75 minutes
Questions	50
Passing Score	80% (40 correct)

Section A: Architecture Patterns (12 questions)

Question 1: Which pattern uses a central entry point to distribute calls to specialized agents?

A) Skill Composition
B) Gateway Router
C) Context State Machine
D) DataMap Integration

Question 2: When should you use the Skill Composition pattern?

A) When you need call routing logic
B) When you have common functionality across agents
C) When you need complex state transitions
D) When you need serverless API integration

Question 3: Which is an example of the “Monolithic Agent” anti-pattern?

A) Using environment variables for configuration
B) One agent with 50 functions covering all domains
C) Using multiple specialized agents
D) Implementing health checks

Question 4: What is the recommended approach for sharing state in horizontally scaled agents?

A) Store state in local memory
B) Use global variables
C) Externalize state to Redis or a database
D) Store state in the SWML document

Question 5: Which anti-pattern exposes implementation details to callers?

A) Hardcoded Configuration
B) Synchronous External Calls
C) Exposing Internal Errors
D) Monolithic Agent

Question 6: What is an Architecture Decision Record (ADR)?

A) A log of all API calls
B) A document explaining why a decision was made
C) A list of system errors
D) A performance benchmark

Question 7: Which principle suggests agents should have focused, specific responsibilities?

A) Defense in Depth
B) Single Responsibility
C) Fail Secure
D) Least Privilege

Question 8: What is the primary benefit of the Context State Machine pattern?

A) Better API integration
B) Structured multi-step workflows
C) Faster response times
D) Easier deployment

Question 9: Which scaling pattern requires stateless agents?

A) Vertical scaling
B) Horizontal scaling
C) Cold start scaling
D) Serverless scaling

Question 10: What should be documented in agent documentation?

A) Only the agent name
B) Endpoints, functions, dependencies, and configuration
C) Just the code
D) Marketing materials

Question 11: Which is NOT a recommended practice for agent architecture?

A) Use environment variables for secrets
B) Implement health checks
C) Put all business logic in one agent
D) Document architectural decisions

Question 12: What is the purpose of a Gateway Router in a multi-agent system?

A) Process payments
B) Route calls to appropriate specialized agents
C) Store customer data
D) Generate reports

Section B: Knowledge and RAG (8 questions)

Question 13: What does RAG stand for?

A) Rapid Agent Generation
B) Retrieval-Augmented Generation
C) Remote API Gateway
D) Real-time Audio Gateway

Question 14: Which CLI tool builds search indexes for knowledge bases?

A) swaig-test
B) sw-agent-init
C) sw-search
D) sw-index

Question 15: What is the purpose of “chunking” in knowledge base preparation?

A) Compress the data
B) Break content into searchable segments
C) Encrypt sensitive information
D) Validate document format

Question 16: Which parameter controls how many search results are returned?

A) distance
B) count
C) limit
D) max_results

Question 17: What is the distance parameter in search configuration?

A) Network latency
B) Similarity threshold for results
C) Maximum document size
D) Search timeout

Question 18: How can you combine static knowledge with live data?

A) You cannot combine them
B) Use search skill for static, DataMap for live
C) Put all data in the knowledge base
D) Only use live APIs

Question 19: What should you do when the knowledge base doesn’t have an answer?

A) Make up an answer
B) Admit limits and offer alternatives
C) Hang up the call
D) Repeat the question

Question 20: What is the recommended chunk size for FAQ content?

A) 50 tokens
B) 200 tokens
C) 1000 tokens
D) 5000 tokens

Section C: Performance Optimization (8 questions)

Question 21: What is a “cold start” in serverless deployments?

A) Running without cache
B) Initial container/runtime startup latency
C) Server reboot
D) Network timeout

Question 22: Which technique keeps users engaged during slow operations?

A) Hangup
B) Fillers
C) Transfer
D) Recording

Question 23: What is the recommended timeout for external API calls in voice agents?

A) 30 seconds
B) 2-5 seconds
C) 1 minute
D) No timeout needed

Question 24: Which caching decorator is built into Python?

A) @cache
B) @lru_cache
C) @memoize
D) @cached

Question 25: How can you run multiple warehouse checks in parallel?

A) Sequential loops
B) ThreadPoolExecutor
C) Longer timeouts
D) Multiple agents

Question 26: What makes prompts “optimized” for performance?

A) Using more words
B) Being concise and focused
C) Including all possible scenarios
D) Using complex language

Question 27: What is the purpose of the end_of_speech_timeout parameter?

A) Call duration limit
B) Time to wait after user stops speaking
C) Function timeout
D) Recording duration

Question 28: Which metric type is best for measuring function latency?

A) Counter
B) Gauge
C) Histogram
D) Summary

Section D: Security and Compliance (10 questions)

Question 29: What should happen to call recording when collecting credit card numbers?

A) Continue recording
B) Pause recording
C) Stop recording permanently
D) Increase volume

Question 30: What is the maximum number of PIN attempts before lockout (best practice)?

A) 1
B) 3
C) 10
D) Unlimited

Question 31: Which parameter marks a function as handling sensitive data?

A) private=True
B) secure=True
C) sensitive=True
D) protected=True

Question 32: What should be stored in logs regarding credit card numbers?

A) Full card number
B) Encrypted card number
C) Last 4 digits only
D) Nothing about cards

Question 33: Which type of authentication protects SWML endpoints?

A) OAuth
B) Basic Auth
C) API Keys only
D) No authentication needed

Question 34: What is tokenization in payment processing?

A) Breaking text into words
B) Replacing card numbers with secure tokens
C) Encrypting all data
D) Validating card format

Question 35: Which input pattern might indicate prompt injection?

A) “What time is it?”
B) “ignore previous instructions”
C) “What’s my balance?”
D) “Transfer to sales”

Question 36: What should happen when an account is locked due to failed PIN attempts?

A) Keep trying
B) End the call with guidance
C) Transfer to sales
D) Reset the PIN automatically

Question 37: What is the purpose of timing-safe comparison for PIN verification?

A) Faster verification
B) Prevent timing attacks
C) Better logging
D) Simpler code

Question 38: Which data should NEVER be in logs?

A) Call ID
B) Timestamps
C) Full credit card numbers
D) Function names

Section E: Monitoring and Troubleshooting (12 questions)

Question 39: What are the three pillars of observability?

A) Speed, Size, Security
B) Logs, Metrics, Traces
C) CPU, Memory, Network
D) Input, Output, Errors

Question 40: Which log format enables easy querying and analysis?

A) Plain text
B) Structured JSON
C) CSV
D) XML

Question 41: What is a correlation ID used for?

A) Measuring latency
B) Tracking requests across systems
C) Counting errors
D) Storing state

Question 42: Which Prometheus metric type tracks current values (like active calls)?

A) Counter
B) Gauge
C) Histogram
D) Summary

Question 43: What should a health check endpoint verify?

A) Only that the server responds
B) Agent, database, and external API status
C) Just CPU usage
D) Only memory

Question 44: Which alert would indicate potential AI handling issues?

A) High CPU usage
B) High transfer rate
C) Low memory
D) High disk usage

Question 45: What does histogram_quantile(0.95, ...) calculate?

A) Average latency
B) 95th percentile latency
C) Maximum latency
D) Minimum latency

Question 46: What is the purpose of the swaig-test command --dump-swml?

A) Test a function
B) Output the SWML document
C) List all tools
D) Start the server

Question 47: How do you test a specific function with swaig-test?

A) --exec function_name
B) --test function_name
C) --call function_name
D) --run function_name

Question 48: What indicates a “degraded” health status?

A) All checks pass
B) Some checks fail but system works
C) Complete system failure
D) Network timeout

Question 49: Which pattern helps handle temporary external service failures?

A) Immediate failure
B) Circuit breaker
C) Ignore errors
D) Infinite retry

Question 50: What should logs EXCLUDE for security?

A) Timestamps
B) Sensitive data (passwords, card numbers)
C) Function names
D) Call IDs

Submission

Submit your answers via GitHub Classroom. Your exam will be auto-graded.

Scoring

45-50 correct: Excellent
40-44 correct: Good (Pass)
Below 40: Review required

Next: Level 3 Practical Exam