Carlos Santana Predicts Full Jarvis Mode AI is Within Reach

Autonomous agents like OpenClaw and Anthropic's Dispatch bring Iron Man's AI assistant closer to reality, with voice as the final frontier.

March 24, 2026 at 9:20 PM·5 min read

The future depicted in science fiction, where an AI assistant effortlessly manages your digital life, is no longer distant. Spanish computer engineer and AI popularizer Carlos Santana, known as @DotCSV, recently declared we are on the cusp of a "Jarvis mode" AI. He pointed to two groundbreaking developments—OpenClaw and Cowork+Dispatch—as the key enablers, noting that only a significant leap in voice technology remains.

The Pillars of Autonomy: OpenClaw and Remote Command

The first pillar of this Jarvis-like future is OpenClaw, an open-source, autonomous AI agent that burst onto the scene in late 2025. This project, which quickly garnered over 60,000 GitHub stars in just 72 hours, runs locally on your machine, using large language models to interpret commands and execute tasks across various applications. Developers have hailed it as "the closest thing to JARVIS we've seen," allowing users to control an AI agent through familiar chat apps.

OpenClaw, initially named Clawdbot and then Moltbot, earned its final moniker from developer Peter Steinberger because "Moltbot never quite rolled off the tongue." It's been aptly described as "ChatGPT without the chatbox" or an "AI that can actually do things." It provides "AgentSkills" for shell commands, file management, and web automation, truly empowering AI to act on your behalf. The remarkable success of OpenClaw even led to Steinberger being "acqui-hired" by OpenAI, signaling its profound impact.

The second critical piece is Anthropic's Cowork+Dispatch. Cowork is Anthropic's desktop platform for AI-assisted productivity, offering a persistent conversational layer. In March 2026, they launched Dispatch, a new feature allowing users to remotely control their desktop AI agent from a mobile device. This creates a continuous conversation thread between your Claude mobile app and desktop, letting you assign tasks asynchronously and return to finished work.

"The pitch is 'text Claude from your phone, come back to finished work.' When that actually works reliably, it changes how you structure your entire workday," one user observed. This integration transforms your smartphone into a remote control for deep work, letting you manage research or automate background tasks from anywhere, fundamentally shifting AI interaction from synchronous to asynchronous.

The Voice Revolution and the Road Ahead

Carlos Santana's final piece of the puzzle is a "good upgrade" in voice capabilities. In 2026, Voice AI is seeing dramatic progress towards human-like conversational intelligence, multimodal interaction, and edge computing. The conversational AI market is booming, projected to reach $41.39 billion by 2030, expanding at a 23.7% CAGR from $14.29 billion in 2025. The expectation now is for natural, context-aware conversations, far beyond robotic responses.

Companies like OpenAI are actively restructuring to prioritize real-time, natural conversation in AI interfaces, aiming for advanced audio models capable of handling interruptions gracefully. This push for low-latency, emotionally intelligent, and proactive voice agents will make interacting with AI as seamless as talking to another person. This capability will unlock the true potential of autonomous agents, allowing them to truly integrate into our daily lives without friction.

However, the path to a complete Jarvis mode isn't without its challenges. Autonomous agents like OpenClaw raise security concerns due to their system-level access, demanding robust safeguards. Features like Claude Dispatch are still in a "research preview" phase, with reports of 50/50 reliability for complex tasks, highlighting the need for consistent performance. The ultimate goal for voice AI—true conversational fluency with natural prosody and context-awareness—requires overcoming significant technical hurdles. But as history shows, from GUIs to touchscreens, technology constantly evolves to make interaction more intuitive. The "Jarvis mode" vision is now rapidly transitioning from science fiction to tangible reality, poised to redefine how we interact with technology and structure our work.

The Voice Revolution and the Road Ahead — Photo: picture-alliance.com

Jarvis Mode AI Closer

Keep reading

Cursor Accelerates Agentic Coding With New Instant Grep Search Engine

Cursor's new Instant Grep tool is an architectural leap that allows AI agents to scan vast codebases in milliseconds, fundamentally changing how autonomous coding tools process information.

March 24, 2026 at 9:25 AM

Elon Musk Bets on Brute Force to Supercharge xAI’s Development

In a high-stakes effort to close the gap on industry leaders, xAI is banking on rapid iteration and massive infrastructure scaling—a strategy Musk insists is already paying off in development velocity.

March 23, 2026 at 11:11 AM