Proprietary · Autonomous · Cognition AI

Devin

Name: Devin Review
Item: Devin
Rating: 3
Author: Automation Switch

The pioneer of autonomous AI software engineering. Works independently in sandboxed environments to turn task descriptions into pull requests.

Editorial Score

3/5

Visit Devin →← All Tools

3/5

Editorial score

MCP SUPPORTED

Mode

Autonomous

IDE Support

Web

License

Proprietary

Pricing

Paid

MCP Support

Yes

Vendor

Cognition AI

Core $20/mo (250 ACUs) | Team $30/user/mo | Enterprise custom

// Key Numbers

67%

PR Merge Rate

$73M

ARR (June 2025)

$10.2B

Valuation

Goldman Sachs, Palantir, Cisco, Dell

Enterprise Users

~15 min of active work per unit

ACU Duration

$400M+ raised

Total Funding

// Our Verdict

The pioneer of autonomous AI software engineering. Impressive in demos and enterprise pilots, but the gap between autonomous promise and practical reliability remains significant.

Best for: Teams with well-defined, repeatable tasks where fully autonomous execution is viable.

// Pricing

Core

$20/month base + $2.25/ACU

Pay-as-you-go access, full autonomous agent, cloud IDE and browser, Linear/Jira/Slack integrations, API access

Team

$500/month

250 ACUs included ($2.00/ACU overage), team dashboards, shared knowledge base, priority support

Enterprise

Custom

VPC deployment, SAML SSO, Enterprise ACUs with stricter planning and testing, dedicated account management, custom SLAs

// Our Assessment

Devin is the most ambitious bet in AI coding: a fully autonomous agent that takes a Jira ticket, disappears into its own cloud IDE, and comes back with a pull request. Cognition AI's $10.2 billion valuation and Goldman Sachs deployment validate the concept, and the numbers back it up, 67% of Devin's PRs now merge successfully, up from 34% a year ago, while task completion speed has improved 4x. For well-scoped tickets with clear acceptance criteria, Devin genuinely acts like a junior engineer you can assign work to overnight.

The reality check is equally important. Independent evaluations consistently put Devin's autonomous success rate on complex, ambiguous tasks at roughly 14-15%. The ACU-based billing model means costs are unpredictable, a task that hits edge cases can burn through credits fast, and at $2.00-$2.25 per ACU (each representing about 15 minutes of compute), a stubborn bug can quietly cost more than a contractor. Enterprise teams at Goldman Sachs report 20% efficiency gains, but those gains come from carefully curating which tasks Devin handles, not from handing it the backlog wholesale.

The Windsurf acquisition in mid-2025 expanded Cognition's reach into IDE-integrated assistance, but Devin itself remains a standalone autonomous agent, you delegate, you don't collaborate. That makes it a powerful force multiplier for teams with large volumes of well-defined work (migrations, test generation, boilerplate features) and a frustrating black box for anything requiring creative judgment or iterative human feedback.

// Ideal For

+Engineering teams with large backlogs of well-defined tickets (migrations, test writing, boilerplate CRUD) who want to delegate entire tasks overnight

+Enterprise organisations piloting hybrid human-AI workforces for clearly scoped feature work and legacy code refactoring

+Teams that need async autonomous coding without tying up developer machines or IDE seats

// Not Ideal For

–Solo developers or small teams who need real-time pair-programming, Devin's async delegation model means no live collaboration

–Projects with ambiguous requirements or heavy creative problem-solving where the 14-15% autonomous success rate on complex tasks becomes a liability

–Budget-conscious teams without predictable workloads, ACU costs can spike unpredictably on edge-case-heavy tasks

// Strengths

+Category pioneer, defined the autonomous AI engineer concept

+Goldman Sachs pilot alongside 12K developers

+Price dropped from $500 to $20/mo, accessible entry point

+Parallel session support (Feb 2026)

// Weaknesses

–Only 14-15% success rate on complex autonomous tasks

–Fully autonomous mode requires significant trust and well-scoped tasks

–Proprietary model, no model flexibility

–Community skepticism after overhyped launch

// Models Supported

Proprietary (not user-selectable)

// Related Comparisons

Devin vs OpenHands vs Copilot Agent: Autonomous Coding Compared →

How the three leading autonomous agents compare on task delegation, cost per completion, and enterprise readiness.

Agentic vs Autocomplete: Understanding the New AI Coding Spectrum →

Why autonomous agents like Devin and IDE copilots solve fundamentally different problems, and when each approach wins.

The Enterprise Guide to AI Coding Assistants →

Evaluating Devin, Copilot, and Cursor for enterprise deployment: compliance, cost modelling, and rollout strategies.

Stacking AI Coding Tools: When Two Tools Beat One →

How teams pair Devin's async task execution with an IDE assistant for real-time work, and whether the combined cost is justified.

// AI Coding Audit

NOT SURE IF DEVIN
FITS YOUR WORKFLOW?

We map your development workflow, evaluate which AI coding tools fit your team size, stack, and security requirements, and give you a prioritised adoption plan.

Book an AI Coding Audit →Or visit Devin directly →

← Back to all AI Coding Assistants

Devin

Spot something stale, broken, or unclear?