Azure DevOps extension · v1.14 · BYOK

From backlog quality
to production-ready tests

Score user stories with INVEST, get AI-powered analysis, improve acceptance criteria, and generate tests in 8 frameworks — all inside ADO. Your data never leaves your tenant.

Install on Marketplace Read the docs

Test frameworks

LLM providers

Data to editor

Tests per story max

How it works

Four steps from story
to tested feature

Score instantly

Open any work item. INVEST score in under a second — offline, no API call needed.

Analyse & improve

Run LLM analysis. Criterion-level feedback, improve description and AC, side-by-side diff before writeback.

Generate tests

Manual tests with Gherkin per typology. Push to ADO via Tested By — no Test Plans licence required.

Ship code

Generate test code in your framework. 5-dimension quality score shown before push to Repos.

Features

Everything a QA team
needs, nothing extra

Instant INVEST scoring

Heuristic score on every work item panel load. Offline-first — no API key required for the score itself.

PO / BA

LLM-powered deep analysis

Criterion-level INVEST analysis with suggestions linked directly to I, N, V, E, S, or T.

PO / BA

One-click writeback

Improved description and AC — bullets or Gherkin per AC — written back to the work item via ADO API.

PO / BA

Manual tests + Gherkin

Given/When/Then per typology. Pushed via Tested By — no Test Plans licence required.

Automated test generation

Playwright TS/Python/.NET, Selenium Java, Cypress JS, Cucumber. Code quality score before push.

Data sovereignty by design

BYOK — all LLM calls go browser-direct to your endpoint. Zero data to TestForge servers.

Platform

Security & privacy

Your data stays
in your tenant

Designed for regulated industries. Every architecture decision starts with data sovereignty.

Zero editor data processing

TestForge servers never receive your user stories, AC, or generated tests.

Browser-direct LLM calls

All AI requests go directly from your browser tab to your endpoint, under your credentials.

ADO-scoped key storage

Your LLM config is stored in ADO Extension Data Service, scoped to your user account.

Open-source Anthropic proxy

Azure Function or Cloudflare Worker — deployed in your own tenant. Fully auditable.

Minimum ADO scopes

vso.work_full, vso.code_write, vso.extension.data_write — nothing more.

On-prem ADO Server

Air-gapped deployment with Ollama coming 2027 for organisations with no cloud connectivity.

ROI calculator

See the numbers
for your team

QA engineers 5

User stories per sprint 15

Hours writing tests per story today 2h

Hourly cost loaded 75€

Time reduction with TestForge 70%

Your estimated impact

Hours saved per sprint

—hrs

Value per sprint

—€

Annual savings

—€ / year

—

LLM token cost ≈ €2/sprint already included.

Why TestForge

Not just another
test generator

Quality over quantity

Xray claims 60 tests. We generate the right number per typology, controlled by you. A QA expert knows 8 well-targeted tests beat 30 generic ones.

Full pipeline, one tool

From US quality score to Gherkin to Playwright code — inside ADO. No context switching, no duplicate tooling.

Regulatory-grade privacy

BakeQA and CasePilot process your stories on their servers. TestForge never does. For banking, public sector, healthcare — this is the only option.

Backlog quality over time

Coming Q4 2026 — track INVEST score evolution sprint-by-sprint. The only metric you can present at steering committee level.

Roadmap

Built in the open,
shipping fast

Now · v1.14

Foundation

Available today

INVEST scoring

Heuristic offline + LLM deep analysis

US improvement & writeback

Description + Gherkin AC in one click

Manual tests + Gherkin

No Test Plans licence required

Auto tests · 8 frameworks

Playwright, Selenium, Cypress, Cucumber

Q3 2026

Breadth

In progress

Quality Gate on US

Configurable Definition of Ready enforced

Domain templates

Banking, e-commerce, HR test patterns

Q4 2026

Depth

Planned

Backlog quality dashboard

INVEST score by team/sprint

Audit trail export

Timestamped PDF — score + tests + proof

Platform

2027

Scale

Enterprise

On-prem ADO Server

Air-gapped + Ollama

Enterprise

Regulatory templates

DORA, NIS2, PCI-DSS traceability

Enterprise

Pricing

Transparent pricing,
no surprises

You pay for TestForge. LLM token costs go directly to your provider — typically €2/sprint. No hidden fees.

Free

€0

Unlimited heuristic scoring. 10 LLM analyses/month.

Unlimited INVEST scoring (offline)
10 LLM INVEST analyses/month
BYOK — your own endpoint

Ready to stop writing tests
from scratch?

Install TestForge free on the Azure DevOps Marketplace. No credit card. Your data, your tenant, your rules.

Install free on Marketplace →

Also available: Documentation · Contact sales

From backlog qualityto production-ready tests

Four steps from storyto tested feature

Score instantly

Analyse & improve

Generate tests

Ship code

Everything a QA teamneeds, nothing extra

Instant INVEST scoring

LLM-powered deep analysis

One-click writeback

Manual tests + Gherkin

Automated test generation

Data sovereignty by design

Your data staysin your tenant

Zero editor data processing

Browser-direct LLM calls

ADO-scoped key storage

Open-source Anthropic proxy

Minimum ADO scopes

On-prem ADO Server

See the numbersfor your team

Your estimated impact

Not just anothertest generator

Quality over quantity

Full pipeline, one tool

Regulatory-grade privacy

Backlog quality over time

Built in the open,shipping fast

Transparent pricing,no surprises

Ready to stop writing testsfrom scratch?

From backlog quality
to production-ready tests

Four steps from story
to tested feature

Everything a QA team
needs, nothing extra

Your data stays
in your tenant

See the numbers
for your team

Not just another
test generator

Built in the open,
shipping fast

Transparent pricing,
no surprises

Ready to stop writing tests
from scratch?