01 · Two interview brains
Agent-controlled, or autonomous.
In Doubao mode, Claude Sonnet drives the interview through eight explicit tools (speak, ask_question, probe_deeper, …); the voice engine is just a mouth. OpenAI / Gemini mode is autonomous: the voice model has semantic VAD (voice activity detection) and runs the conversation itself, while Claude monitors only as a safety supervisor. Both modes share the same role-adaptive system prompt and rubric, so a recruiter never has to know which one ran the call.
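One way to picture the split is as a small routing step that picks who drives the conversation while handing both modes the same prompt and rubric. The sketch below is illustrative only: `SharedConfig`, `SessionPlan`, `plan_session`, and the role labels are hypothetical names, not the product's actual code, and only the three tool names listed above are included because the other five are not named here.

```python
from dataclasses import dataclass
from enum import Enum


class Mode(Enum):
    AGENT_CONTROLLED = "agent_controlled"  # Doubao: Claude drives via tools
    AUTONOMOUS = "autonomous"              # OpenAI / Gemini: voice model drives


# Only the tools named in the text; the remaining five are left unspecified.
KNOWN_TOOLS = ("speak", "ask_question", "probe_deeper")


@dataclass(frozen=True)
class SharedConfig:
    """Prompt and rubric that both modes receive unchanged (hypothetical shape)."""
    system_prompt: str
    rubric: tuple


@dataclass(frozen=True)
class SessionPlan:
    mode: Mode
    conversation_driver: str  # who decides what is said next
    claude_role: str          # "controller" or "safety_supervisor"
    claude_tools: tuple       # tools exposed to Claude in this mode
    config: SharedConfig


def plan_session(provider: str, config: SharedConfig) -> SessionPlan:
    """Map a voice provider to a mode; the shared config passes through as-is."""
    name = provider.lower()
    if name == "doubao":
        return SessionPlan(Mode.AGENT_CONTROLLED, "claude", "controller",
                           KNOWN_TOOLS, config)
    if name in ("openai", "gemini"):
        return SessionPlan(Mode.AUTONOMOUS, "voice_model", "safety_supervisor",
                           (), config)
    raise ValueError(f"unknown voice provider: {provider!r}")
```

Because both branches return the same `config` object, anything scored against the rubric downstream does not need to know which mode ran the call.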




