Wednesday, June 17, 2026

DataRobot for Builders — integrating with the Google Antigravity CLI


Antigravity CLI is the latest agentic coding CLI from Google, changing the now-deprecated Gemini CLI. It inherits the asynchronous subagent mannequin that makes Antigravity stand out from the sphere, syncs bidirectionally with Antigravity Desktop, and is optimized for pace on Gemini 3.5 Flash.

DataRobot ships a full plugin for Antigravity CLI straight from the identical open supply repository that powers our Cursor, Claude Code, and Gemini CLI integrations. One set up provides you the entire DataRobot talent set inside Antigravity’s agent and slash-command interface.

Set up the DataRobot plugin with a single command:

agy plugin set up https://github.com/datarobot-oss/datarobot-agent-skills.git

If you happen to’re nonetheless on Gemini CLI, the identical repository installs there too:

gemini extensions set up https://github.com/datarobot-oss/datarobot-agent-skills.git

Already utilizing the DataRobot extension in Gemini CLI and switching to Antigravity? Migrate it straight:

agy plugin import gemini

As soon as put in, the complete DataRobot talent set is out there together with datarobot-setup and datarobot-agent-assist and could be invoked with slash instructions like /datarobot-skills:datarobot-agent-assist

Debugging brokers is tough. LLM calls return plausible-sounding output even when one thing has gone improper, device calls fail silently, and latency issues are invisible within the remaining response. With out structured hint information, the one possibility is log-hunting and guesswork.

To point out how the DataRobot tracing talent works in observe, right here’s a concrete instance: a LangGraph agent in a single most important.py file that manages bike exercises. It has a number of instruments, produces inconsistent solutions, and the foundation trigger isn’t apparent from the conversational output alone.

Including production-grade tracing to this agent takes a single talent invocation: /datarobot-skills:datarobot-external-agent-monitoring.

Invoking the datarobot-external-agent-monitoring skill in Antigravity CLI

The talent provisions a brand new DataRobot Use Case, devices the agent to emit traces by way of OpenTelemetry, and writes a monitoring_setup.md artifact with the runtime configuration steps.

The monitoring skill inspecting the project and provisioning a DataRobot Use Case
Summary of actions the skill completed, including the monitoring_setup.md artifact

With instrumentation in place, run the agent and ship it a query — on this case, “What’s the schedule this week?”

Running the bike training agent and asking for the week's schedule

The talent generates setup directions that embrace the Use Case entity ID and the surroundings variables wanted to route traces to DataRobot:

The generated monitoring setup instructions with runtime variables and telemetry steps

The DataRobot tracing interface surfaces the complete request historical past. Every hint reveals end-to-end latency, whole token consumption, and the entire span tree:

The DataRobot tracing interface listing the agent's traced requests

Drilling into the “schedule this week” request reveals the complete image: 2,700 tokens consumed, tool-level latency for every name, LLM invocation rely, and any customized attributes emitted by way of normal OTel instrumentation. That is the info that makes debugging tractable, not inference from remaining output.

The trace detail view showing span hierarchy, latency, and token counts

For native growth, the DataRobot CLI surfaces hint updates in actual time: dr plugin set up xp adopted by dr xp --entity-id=. This creates a decent iteration loop — run the agent, examine the hint, repair the problem, repeat.

On this case, the span output makes the foundation trigger specific: the agent lacks calendar entry, which is why it couldn’t reply the scheduling query. That failure wasn’t surfaced within the agent’s conversational response:

A span's output showing the agent explaining it lacks calendar access

As an alternative, the agent responded with generic steering:

2. **Construct every week from scratch** - If you happen to inform me a number of issues, I can sketch out a balanced week for you:

   - Your aim (normal health, an occasion/race, constructing endurance, and so on.)
   - What number of days/hours you possibly can prepare
   - Your present health stage and any FTP 

A strong normal week would possibly seem like:

- **Mon** - Relaxation or straightforward restoration spin
- **Tue** - Intervals
- **Wed** - Endurance trip (zone 2)
- **Thu** - Restoration or relaxation
- **Fri** - Tempo/threshold work
- **Sat** - Lengthy endurance trip
- **Solar** - Simple trip or relaxation

The hint made the hole between anticipated and precise agent habits instantly actionable. This identical sample applies at enterprise scale: whether or not the agent is operating on a laptor or in manufacturing on a cloud supplier, DataRobot traces the complete execution tree and surfaces what the agent really did — not simply what it mentioned.

The hole between an agent prototype and an agent in manufacturing is generally operational context. Your coding agent writes the code. DataRobot provides the observability layer and the ruled deployment goal. One plugin set up, one talent execution — and you’ve got production-grade hint information from the primary run.

Related Articles

Latest Articles