Azure DevOps extension · v1.14 · BYOK

From backlog quality
to production-ready tests

Score user stories with INVEST, get AI-powered analysis, improve acceptance criteria, and generate tests in 8 frameworks — all inside ADO. Your data never leaves your tenant.

Install on Marketplace · Read the docs
8
Test frameworks
6
LLM providers
0
Data sent to vendor
32
Tests per story max

Four steps from story
to tested feature

01

Score instantly

Open any work item. INVEST score in under a second — offline, no API call needed.

02

Analyse & improve

Run LLM analysis for criterion-level feedback, improve the description and AC, and review a side-by-side diff before writeback.

03

Generate tests

Manual tests with Gherkin per typology. Push to ADO via Tested By — no Test Plans licence required.

04

Ship code

Generate test code in your framework. 5-dimension quality score shown before push to Repos.

Everything a QA team
needs, nothing extra

Instant INVEST scoring

Heuristic score on every work item panel load. Offline-first — no API key required for the score itself.

PO / BA
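
To illustrate how an offline heuristic of this kind could work, here is a minimal Python sketch. The rules, the word list, the thresholds, and the function name `invest_heuristic_score` are all assumptions made for this example; they are not TestForge's actual scoring logic.

```python
import re
from typing import Dict, List

# Hypothetical rules: each one checks a cheap, offline signal of story quality.
# These are illustrative assumptions, not TestForge's real heuristics.
STORY_PATTERN = re.compile(
    r"\bas an? .+?\bi want .+?\bso that\b", re.IGNORECASE | re.DOTALL
)
VAGUE_WORDS = {"fast", "easy", "user-friendly", "etc", "somehow", "various"}


def invest_heuristic_score(
    title: str, description: str, acceptance_criteria: List[str]
) -> Dict[str, object]:
    """Return a 0-100 score plus per-check results, with no network call."""
    words = re.findall(r"[a-z\-]+", description.lower())
    checks = {
        # Valuable: the story states who wants what, and why.
        "has_role_goal_benefit": bool(STORY_PATTERN.search(description)),
        # Testable: at least two acceptance criteria are written down.
        "has_acceptance_criteria": len(acceptance_criteria) >= 2,
        # Small: a very long description suggests the story should be split.
        "is_small": len(words) <= 120,
        # Estimable: no vague wording that hides scope.
        "avoids_vague_words": not any(w in VAGUE_WORDS for w in words),
        # Negotiable: the title is short rather than a full specification.
        "has_short_title": 0 < len(title.split()) <= 12,
    }
    score = round(100 * sum(checks.values()) / len(checks))
    return {"score": score, "checks": checks}
```

Because every check is a string operation, a score like this can be computed on each work item panel load without an API key; the per-check dictionary gives the panel something to display next to the number.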

LLM-powered deep analysis

Criterion-level INVEST analysis with suggestions linked directly to I, N, V, E, S, or T.

PO / BA

One-click writeback

Improved description and AC — bullets or Gherkin per AC — written back to the work item via ADO API.

PO / BA

Manual tests + Gherkin

Given/When/Then per typology. Pushed via Tested By — no Test Plans licence required.

QA
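
As a rough sketch of what Given/When/Then output per typology could look like, the Python snippet below renders scenarios as Gherkin text. The `Scenario` class, the `render_feature` function, and the use of the typology as a Gherkin tag are assumptions for illustration, not the extension's real data model.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Scenario:
    name: str
    typology: str  # assumed label, e.g. "functional" or "negative"
    given: List[str] = field(default_factory=list)
    when: List[str] = field(default_factory=list)
    then: List[str] = field(default_factory=list)


def _steps(keyword: str, steps: List[str]) -> List[str]:
    # The first step takes the keyword; later steps of the same kind use "And".
    return [
        f"    {keyword if i == 0 else 'And'} {text}"
        for i, text in enumerate(steps)
    ]


def render_feature(feature: str, scenarios: List[Scenario]) -> str:
    """Render scenarios as a Gherkin feature, tagging each with its typology."""
    lines = [f"Feature: {feature}"]
    for sc in scenarios:
        lines += ["", f"  @{sc.typology}", f"  Scenario: {sc.name}"]
        lines += (
            _steps("Given", sc.given)
            + _steps("When", sc.when)
            + _steps("Then", sc.then)
        )
    return "\n".join(lines) + "\n"
```

Tagging each scenario with its typology keeps the grouping visible in the feature file and lets a Gherkin runner filter by tag.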

Automated test generation

Playwright TS/Python/.NET, Selenium Java, Cypress JS, Cucumber. Code quality score before push.

QA

Data sovereignty by design

BYOK — all LLM calls go browser-direct to your endpoint. Zero data to TestForge servers.

Platform

Your data stays
in your tenant

Designed for regulated industries. Every architecture decision starts with data sovereignty.

Zero vendor-side data processing

TestForge servers never receive your user stories, AC, or generated tests.

Browser-direct LLM calls

All AI requests go directly from your browser tab to your endpoint, under your credentials.

ADO-scoped key storage

Your LLM config is stored in ADO Extension Data Service, scoped to your user account.

Open-source Anthropic proxy

Azure Function or Cloudflare Worker — deployed in your own tenant. Fully auditable.

Minimum ADO scopes

vso.work_full, vso.code_write, vso.extension.data_write — nothing more.

On-prem ADO Server

Air-gapped deployment with Ollama, coming in 2027, for organisations with no cloud connectivity.

See the numbers
for your team

Your estimated impact

Hours saved per sprint (hrs)
Value per sprint (€)
Annual savings (€ / year)

LLM token cost (≈ €2/sprint) is already included.
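
The arithmetic behind an estimate like this can be sketched in a few lines of Python. The formula, the hourly rate input, and the default of 26 two-week sprints per year are assumptions for illustration; only the ≈ €2/sprint LLM token cost comes from this page.

```python
from dataclasses import dataclass

LLM_COST_PER_SPRINT_EUR = 2.0  # from the page: LLM token cost ≈ €2/sprint


@dataclass
class Impact:
    value_per_sprint_eur: float
    annual_savings_eur: float


def estimate_impact(
    hours_saved_per_sprint: float,
    hourly_rate_eur: float,      # assumed input: loaded cost of one QA hour
    sprints_per_year: int = 26,  # assumed default: two-week sprints
) -> Impact:
    """Net value of the saved hours, after subtracting the LLM token cost."""
    value = hours_saved_per_sprint * hourly_rate_eur - LLM_COST_PER_SPRINT_EUR
    return Impact(
        value_per_sprint_eur=value,
        annual_savings_eur=value * sprints_per_year,
    )
```

For example, 20 hours saved per sprint at €60 per hour gives €1,198 per sprint and €31,148 per year under these assumptions.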

Not just another
test generator

Quality over quantity

Xray claims 60 tests. We generate the right number per typology, controlled by you. A QA expert knows 8 well-targeted tests beat 30 generic ones.

Full pipeline, one tool

From user story quality score to Gherkin to Playwright code, all inside ADO. No context switching, no duplicate tooling.

Regulatory-grade privacy

BakeQA and CasePilot process your stories on their servers. TestForge never does. For banking, public sector, healthcare — this is the only option.

Backlog quality over time

Coming Q4 2026 — track INVEST score evolution sprint-by-sprint. The only metric you can present at steering committee level.

Built in the open,
shipping fast

Now · v1.14 · Foundation · Available today
  • INVEST scoring: Heuristic offline + LLM deep analysis (PO)
  • User story improvement & writeback: Description + Gherkin AC in one click (PO)
  • Manual tests + Gherkin: No Test Plans licence required (QA)
  • Auto tests · 8 frameworks: Playwright, Selenium, Cypress, Cucumber (QA)

Q3 2026 · Breadth · In progress
  • Quality Gate on user stories: Configurable Definition of Ready enforced (PO)
  • Domain templates: Banking, e-commerce, HR test patterns (QA)

Q4 2026 · Depth · Planned
  • Backlog quality dashboard: INVEST score by team/sprint (PO)
  • Audit trail export: Timestamped PDF with score, tests, and proof (Platform)

2027 · Scale · Enterprise
  • On-prem ADO Server: Air-gapped + Ollama (Enterprise)
  • Regulatory templates: DORA, NIS2, PCI-DSS traceability (Enterprise)

Transparent pricing,
no surprises

You pay for TestForge. LLM token costs go directly to your provider — typically €2/sprint. No hidden fees.

Free
€0

Unlimited heuristic scoring. 10 LLM analyses/month.

  • Unlimited INVEST scoring (offline)
  • 10 LLM INVEST analyses/month
  • BYOK — your own endpoint
Business
€990/month

Unlimited users. All 8 frameworks.

  • Everything in Team
  • All 8 frameworks incl. Cucumber
  • Backlog quality dashboard
  • Audit trail export PDF
Enterprise
On request

On-prem, Azure AD SSO, regulatory templates.

  • Everything in Business
  • On-prem + Ollama (2027)
  • DORA / NIS2 / PCI-DSS
  • SLA 8h · Customer Success

Ready to stop writing tests
from scratch?

Install TestForge free on the Azure DevOps Marketplace. No credit card. Your data, your tenant, your rules.

Install free on Marketplace →

Also available: Documentation · Contact sales