Automated COBOL Documentation Done Right
The COBOL Documentation Factory

Your COBOL Experts Are Retiring.
Their Knowledge Doesn't Have To.

Automated Program Logic Manuals (PLM) for legacy COBOL systems. Deterministic fact extraction meets constrained AI. Result: production-ready documentation with zero fabricated entities.

IBM 3270 · SESSION A · COBOL-85 READY COBOL Documentation Factory CALL 'DOC-GEN' USING BREVIDY-REC.
Request a Demonstration →
The Mainframe Knowledge Cliff
Financial institutions and government organizations run trillions of dollars in daily transactions on COBOL. The people who understand that code are leaving — and taking decades of institutional knowledge with them.
⚠️

The Talent Exodus

The average COBOL programmer is nearing retirement age. With 85% of universities having dropped COBOL from their curriculum, the pipeline of replacements has dried up. Organizations report that finding qualified mainframe talent is their top challenge.

📄

Undocumented Critical Systems

Millions of lines of COBOL run banking, insurance, and government systems with little or no documentation. When the developers who understand a program retire, the knowledge goes with them, leaving maintenance teams guessing.

🧠

AI Tools Hallucinate

Generic AI documentation tools fabricate field names, invent paragraph references, and create plausible-sounding but wrong descriptions. In mission critical environments, fabricated documentation is worse than no documentation.

Two-Stage Pipeline. Zero Entity Fabrications.
The COBOL Documentation Factory (CDF) combines deterministic parsing with constrained AI synthesis. Every fact is verified against source code. Every entity is cross-referenced. Every claim cites line numbers.
STAGE 1

Deterministic Fact Extraction

// No AI involved

Three independent parser layers break each COBOL program into a Structured Technical Baseline (STB), a machine-readable JSON containing every verifiable fact in the source.

  • Data structures and copybook hierarchies
  • Control flow graphs
  • I/O operations and file definitions
  • CICS commands, BMS screen layouts
  • JCL job configurations
  • Business logic patterns
// STB output (verified facts)
{ "program": "CBACT01C",
  "archetype": "CICS_INQUIRY",
  "paragraphs": 14,
  "cics_commands": ["SEND MAP", "RECEIVE MAP"...],
  "fabrications": 0 }
STAGE 2

Constrained AI Synthesis

// Grounded in verified facts

The AI reads source code independently and uses the STB as a cross-reference, grounding every statement in specific source lines. Archetype-aware specifications ensure each program type receives appropriate treatment.

  • Narrative business logic explanations
  • Step-by-step program flow descriptions
  • Maintenance guidance and impact analysis
  • Line-number citations for every claim
  • Archetype-specific documentation structure
  • Production-ready Program Logic Manuals
// Maintenance impact analysis
field: ACCT-BAL PIC S9(11)V99
referenced in:
  CBACT01C.cbl lines 145, 312
  CBACT04C.cbl line 78
  ACCT-REC.cpy line 23
  ACCTBATCH.JCL line 14
impact: 4 files, 7 references
AWS CardDemo Benchmark Results
The CDF pipeline was validated against AWS CardDemo, an open-source mainframe reference application used industry-wide to benchmark COBOL modernization tools.

Overall Quality Score

98.5 / 100
— mean rubric score across 42 programs spanning 8 archetypes
Zero entity fabrications detected

Results by Program Archetype

Archetype Score
CICS Other99.7
CICS Inquiry98.9
CICS Data Entry98.6
Batch File Maintenance98.3
Batch Utility98.3
CICS Menu98.2
Called Subprogram98.2
Batch Report96.7
What The Scores Mean

What Maintenance Programmers Get

A maintenance programmer receiving these PLMs can trust the field names, control flow descriptions, I/O operations, and business logic explanations without needing to cross-reference source code for basic comprehension.

Where the 1.5 Points Go

Minor deductions below 100 typically reflect inferences about business context that cannot be mechanically verified from source code alone, such as who consumes a report or when a batch job is scheduled. These are appropriately qualified in the PLMs rather than stated as fact.

The Bottom Line

A score of 98.5 across 42 programs spanning 8 archetypes means the generated documentation is consistently accurate enough for production maintenance use.

Built for Regulated Environments
The COBOL Documentation Factory was designed from the ground up for organizations where accuracy is non-negotiable.
🔒

Auditable by Design

Every fact traced to source lines. Every entity evaluated against the Structured Technical Baseline. Both the deterministic scaffold and AI narrative are delivered, giving compliance teams a verifiable evidence trail.

⚖️

Hallucination Hard-Gate

Every named reference — field names, paragraph names, CICS commands, file names, dataset names — is cross-referenced. An output containing a name that doesn't appear in the source is flagged before delivery. The PLM is rejected, not patched. There are no fabricated facts.

🛠️

Archetype-Aware

Batch reports, CICS data entry, file maintenance, called subprograms; each program type receives specialized treatment with rubrics tailored to its maintenance patterns.

📈

Gap Detection

When onboarding a new repository, a gap detector identifies COBOL language patterns the parsers haven't yet encountered, ensuring extraction quality is evaluated before documentation is delivered.

📑

Dual Deliverables

Clients receive both the deterministic PLM (verified factual scaffold with zero AI) and the AI-synthesized PLM (narrative documentation with business context). A verified baseline alongside readable documentation.

Full Pipeline Automation

Source expansion, deterministic parsing, context pack generation, constrained AI synthesis, and automated evaluation. Our entire documentation pipeline runs without manual intervention.

From Compliance to Maintenance to Modernization
Production-ready documentation that serves multiple strategic objectives.

Compliance and Audit

Give regulators and auditors verifiable, line-referenced documentation for critical financial systems. Every fact is traceable to source.

Maintenance Acceleration

Cut onboarding time for new developers. PLMs explain business logic, control flow, and data structures without requiring source code archaeology.

Modernization Readiness

Comprehensive program documentation is the prerequisite for any migration initiative whether refactoring to microservices or converting to modern languages.

Sample Output
See What You'll Receive
A real CDF deliverable from the AWS CardDemo benchmark — the same format, depth, and style of documentation your team would receive. Browse an annotated walkthrough, see how the pipeline reads a program, or open the full 30-page PLM.
See Sample CDF Documentation Output →
ANNOTATED TOUR

Anatomy of a Program Logic Manual

An annotated walkthrough of a real PLM section by section — the program overview, every file touched, control flow, status checks and failure paths, copybooks, and how it runs in production. Callouts explain what each block means.

Open annotated PLM →
PIPELINE WALKTHROUGH

How Brevidy Reads a COBOL Program

A scroll-through tour showing how deterministic parsing builds the Structured Truth Base, and how constrained AI synthesis turns that base into the readable Program Logic Manual — with zero entity fabrications.

Open walkthrough →
FULL PDF · CBTRN03C

Complete Gold-Standard PLM

The full production-ready Program Logic Manual for CBTRN03C, the daily transaction posting batch program from AWS CardDemo. Same format your engineers and auditors would receive on day one of an engagement.

Download PDF →
Get Started
See CDF Run Against Your Code
We'll process a sample of your COBOL repository through the full CDF pipeline and deliver production-ready Program Logic Manuals so you can evaluate the output quality against your systems.

Or reach out directly to: tony.materna@brevidy.com
Or Phone: 818-613-7600