eva 2.0 by TrajectoryL · Pull Request #36 · AutoArk/EVA-OS

TrajectoryL · 2026-05-29T07:00:28Z

This commit introduces a major upgrade to the Python SDK, establishing parity between WebSocket and WebRTC (LiveKit) transports, alongside immediate architectural and bug fixes.

Key Features & Enhancements:

- feat: Added `eva_ws_client.py` for lightweight WebSocket transport.
- feat: Introduced `TaskFSM` for client-side task state coordination and edge-command routing.
- feat: Integrated VOSK-based background `WakeWordRunner` for hands-free activation.
- refactor: Extracted common audio buffer, state machine, and RTVI logic into `shared.py`.
- fix: Corrected audio channel output truncation in stereo scenarios.
- fix: Eliminated thread-safety deadlocks by releasing FSM locks prior to invoking external callbacks.
- perf: Improved wake word detection to zero-latency using `threading.Event` instead of sleep polling.
- refactor: Decoupled `livekit`, making it a truly optional dependency for WS-only deployments.
- refactor: Exposed `auto_switch_confidence_threshold` as a configurable initialization parameter.
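
For readers unfamiliar with the deadlock fix mentioned in the list above, here is a minimal sketch of the general pattern: mutate state under the lock, then invoke callbacks only after the lock is released. `TinyFSM`, `subscribe`, and `transition` are hypothetical names for illustration only; they are not the SDK's `TaskFSM` API, whose actual implementation is not shown in this PR description.

```python
import threading
from typing import Callable, List, Tuple


class TinyFSM:
    """Hypothetical stand-in for a task state machine (not the SDK's TaskFSM)."""

    def __init__(self, initial: str = "idle") -> None:
        # A plain (non-reentrant) Lock: re-acquiring it from a callback
        # on the same thread would block forever.
        self._lock = threading.Lock()
        self._state = initial
        self._listeners: List[Callable[[str, str], None]] = []

    @property
    def state(self) -> str:
        with self._lock:
            return self._state

    def subscribe(self, listener: Callable[[str, str], None]) -> None:
        with self._lock:
            self._listeners.append(listener)

    def transition(self, new_state: str) -> None:
        # Update the state and snapshot the listeners while holding the lock.
        with self._lock:
            old_state = self._state
            self._state = new_state
            listeners = list(self._listeners)
        # Call out only after the lock is released, so a listener can
        # safely call back into the FSM (here: read .state).
        for listener in listeners:
            listener(old_state, new_state)


seen: List[Tuple[str, str, str]] = []
fsm = TinyFSM()
# This listener re-enters the FSM; it would deadlock if called under the lock.
fsm.subscribe(lambda old, new: seen.append((old, new, fsm.state)))
fsm.transition("listening")
print(seen)
```

If the listeners were invoked inside the `with self._lock:` block, the `fsm.state` read in the callback would try to take the same lock and hang.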


chenbin11200 · 2026-06-02T12:51:53Z

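+The cloud acts as the central brain. With ample compute, it continuously receives the audio and video streams and orchestrates complex business logic:
+1. **Global VAD and ASR:** Determines when the user has finished speaking and converts the audio to text.
+2. **Intent understanding and business flow:** Executes complex workflows and responds to user requests.
+3. **Task scheduling authority:** The cloud is the final decision-maker for state transitions. It evaluates the switch suggestions sent by the edge, and the edge actually enters a new interaction task only when the cloud, based on context, issues `TaskSwitchResult (approved=True)`.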


The edge side is the one that decides the state.

chenbin11200 · 2026-06-02T12:52:20Z

-        self.mic_index = self._find_device_index(mic_index, input=True)
-        self.spk_index = self._find_device_index(spk_index, input=False)
+        # Resolve audio device indices using shared function
+        self.mic_index = find_audio_device_index(self.pa, mic_index, is_input=True)


Code style.

chenbin11200 requested changes Jun 2, 2026

TrajectoryL wants to merge 1 commit into main from feat/eva2.0

2 participants
