More

willydouhard · 2026-04-11T13:25:19 1775913919

We see developers build their own setup over SSH with success, so in that sense I agree with you.

However, once you want to trigger tasks from Slack, Linear, or GitHub issues or onboard teammates who aren't comfortable wiring up LXC + tmux + agent forwarding, a managed layer is needed.

I think we're at a moment where builders with great setups like yours and products like ours are feeding each other good ideas. The patterns you figure out in your zshrc inform what we productize, and the workflows we ship give you new things to try. It's a virtuous circle. Everyone should use the right-sized solution for their situation.

willydouhard · 2026-04-11T13:16:29 1775913389

Definitely the same category of product. The main difference with cursor is that we reuse raw harnesses from ai-labs (claude code, codex). Cursor rebuilds its own harness. We believe nothing will beat the "natural" harness of each model because of RL.

You are also free to swap/combine these harnesses as you please, which is something Anthropic can't do. For instance claude code implements and codex reviews.

willydouhard · 2026-04-11T13:12:12 1775913132

By default each plan has a limit (https://twill.ai/pricing). Then you can manually set an overage limit if you want to.

The ralph loop mode also has the concept of a budget per task.

willydouhard · 2026-04-11T13:10:34 1775913034

The main difference is that you can pick/combine coding agent CLIs (claude code, codex, open code). There is no vendor lock-in.

dbbk · 2026-04-12T14:02:22 1776002542

That's literally just GitHub agents?

danoandco · 2026-04-13T02:38:00 1776047880

Yes, broadly. The main structural difference is that we’re agent-agnostic, so we can combine lab-native CLIs in one workflow. GitHub will likely struggle there because they have direct partnerships with Anthropic and OpenAI.

On the features themselves, we have a better UX across integrations, and more advanced features like video recording.

dbbk · 2026-04-13T19:03:32 1776107012

This seems like a weak argument. GitHub is already agent (not just model) agnostic, they have Copilot and Claude Code. I just don't see how this is a business, sorry.

willydouhard · 2026-04-11T10:03:10 1775901790

This definitely feels like the end state. We need to improve models, agent/human UX and make the transition from cloud work to local work seemless to fully get there

ibrahimhossain · 2026-04-11T10:26:38 1775903198

Cloud to local seamless transition is exactly where the real value lies. Local setups with zero API cost and full privacy make the biggest difference

willydouhard · 2026-04-11T09:59:40 1775901580

Yes or when you get good feedback/idea talking to someone, being able to spawn tasks from your phone makes everything much faster

willydouhard · 2026-04-11T09:49:14 1775900954

We still need to get an oauth token to connect to GitHub. We started with the GitHub mcp but migrated to giving the gh cli to the agent directly.

One learning we had is that most of the time CLI > MCP

willydouhard · 2026-04-11T09:47:18 1775900838

There is room for plan adaptation but the agent has to justify and highlight it in the PR.

Defining the plan/acceptance criteria for long running task is the hard part.

We recently added a Ralph loop mode in that spirit. The implementation won't start until the human and agent align on verifiable criteria and a different agent judges if criteria are met at the end of each run.

Overall I think this problem is not yet completely solved and improvement on both the UX and model judgement are needed

willydouhard · 2026-04-11T09:40:36 1775900436

This is very convenient but has limitations. GitHub actions are not built to resume state (conversations in our case) and handle multi player experiences.

However reusing the GitHub workflows out of the box feels really nice

willydouhard · 2026-04-10T23:05:41 1775862341

Thank you for the feedback, we are working on it!