Perplexity has unveiled a new hybrid AI orchestrator designed to automatically determine whether specific tasks should be processed on a user’s local device or offloaded to cloud infrastructure. This system, set for integration into the Always-on agent product Personal Computer starting in July, aims to simultaneously optimise accuracy, privacy, and energy efficiency. Developed in partnership with Intel, the model-agnostic framework is also compatible with other hardware such as Nvidia’s RTX Spark. The architecture routes sensitive data, including financial documents and health records, to local processing units while directing compute-intensive operations to powerful cloud models. Perplexity states that this approach reduces reliance on centralised computing infrastructure and addresses data sovereignty concerns by keeping private information on-premise.
The significance of this development lies in the shifting paradigm of artificial intelligence deployment, moving away from exclusive cloud dependence towards a more balanced hybrid model. By prioritising correct answers over raw compute consumption, Perplexity aligns its business incentives with efficiency, potentially lowering operational costs and carbon footprints. This strategy empowers users to maintain control over their data while still leveraging the vast capabilities of remote servers for complex queries. As the market competes for local compute power, this system establishes a practical framework for balancing privacy requirements with the need for advanced processing capabilities in everyday applications.
- The new orchestrator automatically decides whether to run AI tasks locally or in the cloud based on task requirements.
- Sensitive data remains on the user’s device, enhancing privacy and addressing data sovereignty issues.
- Perplexity’s business model rewards accuracy rather than high compute usage, driving efficiency in AI deployment.
Stay ahead of AI. Get the most important stories delivered to your inbox — no spam, no noise.




