EXPERTS STILL MATTER

Why Domain Expertise Still Matters in an AI world

Summary: 
AI is now materially useful in software and capability delivery.  It can summarize workshops, draft requirements, generate user stories, propose designs, create test cases, and accelerate documentation.  Yet these gains should not be confused with business understanding.  Requirements engineering and business analysis are still grounded in stakeholder needs, operational context, value, and validation across the life cycle. The central issue is not whether AI can produce artefacts quickly; it plainly can.  The issue is whether those artefacts describe the right problem, reflect real workflows, account for exceptions, and support a successful deployment.  The evidence suggests that domain expertise remains indispensable because needs, solutions, and value are context-dependent; tacit knowledge is only partly codified; and human-AI combinations work best when people contribute contextual judgement while AI contributes speed and scale.  For Requirementum QGR, this reinforces a clear position:  AI should accelerate delivery work, but domain-led analysis must still govern meaning, priority, and fitness for operational use.

multi-layers

Generative AI has entered the mainstream of solution delivery. Teams now use it to draft requirement statements, transform meeting notes into user stories, propose acceptance criteria, generate regression tests, summarize current-state processes, and support solution design.  That matters because delivery work is often delayed by the time needed to convert fragments of stakeholder knowledge into structured artefacts.  AI reduces that production cost dramatically.  At the same time, the core standards of the field still define deliverhy as a contextual, stakeholder-based discipline rrather than a document-production exercise.   The IIBA defines business analysis as enabling change by defining needs and recommending solutions that delivery value to stakeholders within a given context, and its standard treats requirements, validation, prioritization, modelling, and solution evaluation as integral parts of that work. ISO/IEC/IEEE 29148 likewise frames requirements engineering as a life-cycle activity concerned with stakeholder satisfaction in the operational or business environment.

Traditional Artefact

AI-consumable reformulation

Typical downstream AI outputs

BRD or SRS section

Modular intent packets with goals, constraints, terminology, examples, and trace IDs

Feasibility analysis, backlog decomposition, implementation plans

User story and acceptance criteria

Well-scoped issue or agent task with explicity success conditions

Branch changes, pull requests, test generation, iteration comments

Business rules and NFRs

Source-controlled instructions, policy constraints, security requirements, schemas

Policy-aware design review, code review findings, complaint code / configuration

Workflow or process description

Structured flow description with states, decisions, data, and exceptions

Prototypes, UI flows, orchestration logic, task automation

Data and interface requirements

JSON-schema, typed objects, API or contact definitions

Structured outputs, API stubs, validation logic, test fixtures

Behavior examples

Example-based specs or Gherkin scenarios in version control

Executable acceptance tests, living documentation, regression checks

The comparison reflects current platform and practice patterns in GitHub Copilot, OpenAI and Azure structured outputs, AWS Security Agent, Figma Make, and Cucumber rather than a single proprietary method (GitHub Docs, 2026c; GitHub Docs, 2026d; OpenAi, 2025b; Microsoft, 2026a; AWS, 2025c; Cucumber, n.d.; Figma, n.d.).

Why Structured Makes Requirements Executable
Structure improves AI output because it reduces interpretive freedom at exactly the point where large language models are otherwise likely to improvise.  OpenAI recommends placing instructions clearly, separating instruction from context, and using the latest capable models; Microsoft recommends specificity, ordering, consistent output formats, and a "when unsure" policy; both OpenAI and Microsoft now support schema-bound structured outputs so downstream systems can consume responses reliably rather than parsing brittle free text (OpenAI, 2025a; OpenAi, 2025b; Microsoft, 2025a; Microsoft, 2026a; Microsoft, 2026b).

Recent research aligns strongly with that practice guidance.  Midolo et al. (2026) found that prompts improved when they specified I/O, pre- and post-conditions, examples, detail, and ambiguity clarification. Yang et al. (2026) showed that ambiguity consistently degraded function-level code generation and led models to produce divergent implementations from the same requirement. Vogelsang et al. (2025) found that requirement smells can affect LLM-enabled traceability performance. Conversely, Wei (2024) showed that well-structured requirements can be processed progressively into functional requirements, object models, unit tests, and code, while Krishna et al. (2024) found that GPT-4 could generate complete and consistent SRS drafts and help validate requirements documents.  The evidence is therefore clear: structure is not documentation overhead in AI delivery; it is a direct quality control mechanism (Midolo et al., 2026; Yang et al., 2026; Vogelsang et al., 2025; Wei, 2024; Krishna et al., 2024).

A useful historical bridge survives in behavior-driven development.  Gherkin remains plain text "structured enough for Cucumber to understand," and Cucumber describes it as both executable specification and documentation of real behavior.  The contemporary difference is scale and scope: the same principle now stretches beyond acceptance testing into planning, coding, review, and deployment agents across mainstream delivery platforms (Cucumber, n.d.; GitHub, 2025b; Thoughtworks, 2025).

Delivery Compression Across the Lifecycle
Vendor tooling already demonstrates the reduction of hand-offs.  Figma Make turns natural-language descriptions into interactive prototypes with logic, layout, and data; GitHub Spark and v0 take natural language to full-stack applications and deployment pathways; GitHub Copilot cloud agent can research a repository, create a plan, change code on a branch, and open a pull request;  Microsoft's Copilot testing can generate, build, fix, and rerun unit tests; AWS and OpenAI now describe AI-enabled SDLC patterns that span planning, design, coding, testing, deployment, and operations (Figma, n.d.; GitHub, 2025a; Vercel, n.d.; GitHub Docs, 2026a; Microsoft, 2026c; AWS, 2025a; OpenAI, 2025c).

Figure 1 synthesizes the emerging flow described across GitHub, OpenAI, Microsoft, AWS and design-tool documentation:  intent is increasingly decomposed once, then re-used many times across delivery (GitHub Docs, 2026a; GitHub Docs, 2026f; OpenAi, 2025c; AWS, 2025a; Microsoft, 2026c).

The deeper change is that requirements become living operational assets. Cucumber notes that .feature files are versioned alongside software.  GitHub stores repository instructions, path-specific instructions, agent profiles and AGENTS.md files in source control. MCP extends those artefacts by connecting agents to external tools and data sources  AWS Security Agent similarly treats organisational security requirements as reusable checks applied in both design review and code review.  This is the clearest contemporary meaning of "executable requirements": business intent becomes a governed control surfact for delivery, not just a memo that precedes it (Cucumber, n.d.; GitHub Docs, 2026c; GitHub Docs, 2026d; GitHub Docs, 2026e; GitHub Docs, 2026f; GitHub Docs, 2026g; AWS, 2025c).

Governance, Risk, and Human Judgement
AI does not eliminate requirements engineering; it raises its stakes. GitHub recommends not assigning broadly scoped, ambiguous, deeply domain-specific, or highly sensitive tasks to coding agents.  Microsoft warns that prompt techniques still require validation and may fail to generalise.  OpenAI states that true ownership of code, escpecially in new or ambiguous problems, remains with engineers. NIST's generative-AI profile further frames trustworthiness, legal requirements, lifecycle risk management, and governance as essential rather than optional (GitHub Docs, 2026b; Microsoft, 2025a; OpenAI, 2025c; Autio et al., 2024).

The risk picture is now well defined.  NIST catalogues risks unique to or exacerbated by generative AI, including confabulation, information integrity, data privacy, intellectual property, and value-chain/component integration.  Research in 2026 also suggests that agent-generated code can introduce redundancy and maintainability debt that reviewers may underestimate. Accordingly, mitigation must combine better requirements with better controls: schema-constrained outputs, explicity uncertainty policies, source-controlled instructions, design and code review against policies, security scanning, and CI/CD gates that preserve architectural coherence (Autio et al., 2024; Huang et al., 2026; GitHub, 2026a; AWS 2025c; AWS, n.d.; Thoughtworks, 2025).

Risk

Typical Failure Mode

Practical Mitigation

Ambiguous Intent

Divergent or incorrect implementations

Clarify terms, add examples, specify pre/post conditions

Hidden assumptions

Model silently fills gaps

Add "when unsure" behaviour, force clarification and plan review

Fragile prompts or context loss

Inconsistent outputs across runs and tools

Store instructions in version control; use schemas and path-specific guidance

Compliance and security gaps

Fast code violates policy or design controls

Apply policy-aware design review, code review, scanning, and guardrails

Architectural drift and debt

Passing tests but degraded maintainability

Human architecture review, smaller scopes, deterministic CI/CD checks

Weak traceability

Poor auditability from intent to release

Trace IDs across specs, issues, PRs, tests, and deployment records

The mitigation pattern in Table2 is supported by current platform guidance and recent empirical work rather than be a single tool vendor (Yang et al., 2026; Midolo et al.,; Microsoft, 2026b; GitHub Docs, 2026c; GitHub Docs, 2026d; AWS, 2025c; Autio et al., 2024).

Conclusion
The requirements document is becoming executable because AI-enabled delivery increasingly treats structured requirements as operational inputs, not as terminal hand-off artefacts. The decisive move is from descriptive documentation to governed intent: goals, rules, constraints, examples,  acceptance criteria, schemas, and validation logic that can be consumed directly by design tools, coding agents, test generators, security reviewers, and deployment workflows.  The stronger the structure, the more reliable the automation; the weaker the structure, the faster the error propagation (Thoughtworks, 2025; GitHub Docs, 2026b; OpenAI, 2025b; Microsoft, 2026a; Yang et al., 2026).

Requirementum QGR, Intentium™, and Intent-Centered Delivery
For Requirementum QGR, the practical lesson is clear: the winning delivery model is not "AI-first" in the sense of unconstrained generation; it is intent-centred. Intentium™ should therefore be positioned as adisciplined method for transforming business intent into AI-ready delivery inputs: structured requirements, rules, workflows, constraints, examples, acceptance criteria, validation logic, traceability links, and governance artefacts.  That proposition aligns directly with the evidence in this paper, which shows that AI delivery improves when context is explicit, versioned, testable, and bounded by policy and review.

In that sense, Requirementum QGR's delivery approach is a strong fit for the emerging reality. Intentium™ can act as the bridge between business meaning and machine execution, while preserving human oversigh where it matters most:  prioritisation, domain judgement, architectural integrity, compliance,and final accountability.  Its value is therefore not simply speed.  Its value is that it helps clients move from traditional requirements documentation to AI-ready, governed, traceable delivery in whichbusinessintent remains visible and testable throughout the lifecycle. That is precicely what an executable requirements discipline now requires.

REFERENCES
Autio, C., Schwartz, R., Dunietz, J., Jain, S., Stanley, M., Tabassi, E., Hall, P. and Roberts, K. (2024) Artificial Intelligence Risk Management Framework: Generative Artificial Intelligence Profile. NIST AI 600-1. Gaithersburg, MD: National Institute of Standards and Technology.
AWS (2025a) ‘Transforming the Software Development Lifecycle (SDLC) with Generative AI’, AWS Partner Network Blog, 16 January.
AWS (2025b) ‘AI-Driven Development Life Cycle: Reimagining Software Engineering’, AWS DevOps & Developer Productivity Blog, 31 July.
AWS (2025c) ‘New AWS Security Agent secures applications proactively from design to deployment’, AWS News Blog, 2 December.
AWS (n.d.) Amazon Bedrock Guardrails. Amazon Web Services.
Cucumber (n.d.) Introduction. Cucumber Documentation.
Figma (n.d.) AI Prototype Generator - Build Interactive Prototypes in Seconds. Figma.
GitHub (2025a) ‘GitHub Spark in public preview for Copilot Pro+ subscribers’, GitHub Changelog, 23 July.
GitHub (2025b) ‘How to build reliable AI workflows with agentic primitives and context engineering’, GitHub Blog, 13 October.
GitHub (2026a) ‘What’s new with GitHub Copilot coding agent’, GitHub Blog, 26 February.
GitHub (2026b) ‘Agent pull requests are everywhere. Here’s how to review them’, GitHub Blog, 7 May.
GitHub Docs (2026a) About GitHub Copilot cloud agent. GitHub Documentation.
GitHub Docs (2026b) Best practices for using GitHub Copilot to work on tasks. GitHub Documentation.
GitHub Docs (2026c) Adding repository custom instructions for GitHub Copilot. GitHub Documentation.
GitHub Docs (2026d) About custom agents. GitHub Documentation.
GitHub Docs (2026e) Support for different types of custom instructions. GitHub Documentation.
GitHub Docs (2026f) About Model Context Protocol. GitHub Documentation.
GitHub Docs (2026g) Connect agents to external tools. GitHub Documentation.
Hemmat, A.H., Sharbaf, M.S., Kolahdouz-Rahimi, S.K., Lano, K. and Tehrani, S.Y. (2025) ‘Research directions for using LLM in software requirement engineering: a systematic review’, Frontiers in Computer Science, 7.
Huang, H., Jaisri, P., Shimizu, S., Chen, L., Nakashima, S. and Rodríguez-Pérez, G. (2026) ‘More Code, Less Reuse: Investigating Code Quality and Reviewer Sentiment towards AI-generated Pull Requests’, arXiv, arXiv:2601.21276.
JetBrains (2025a) ‘The State of Developer Ecosystem 2025: Coding in the Age of AI, New Productivity Metrics, and Changing Realities’, JetBrains Research Blog, 15 October.
JetBrains (2025b) ‘Coding Guidelines for Your AI Agents’, IntelliJ IDEA Blog, 21 May.
JetBrains (2026a) ‘Which AI Coding Tools Do Developers Actually Use at Work?’, JetBrains Research Blog, 2 April.
Krishna, M., Gaur, B., Verma, A. and Jalote, P. (2024) ‘Using LLMs in Software Requirements Specifications: An Empirical Evaluation’, 2024 IEEE 32nd International Requirements Engineering Conference.
McKinsey (2023) ‘Unleashing developer productivity with generative AI’, McKinsey Digital, 27 June.
McKinsey (2024) ‘How generative AI could accelerate software product time to market’, McKinsey, 31 May.
Microsoft (2025a) Prompt engineering techniques.
Microsoft Learn.Microsoft (2026a) How to use structured outputs with Azure OpenAI in Microsoft Foundry Models. Microsoft Learn.
Microsoft (2026b) System message design for Azure OpenAI. Microsoft Learn.
Microsoft (2026c) Generate and run unit tests using GitHub Copilot testing. Microsoft Learn.
Midolo, A., Giagnorio, A., Zampetti, F., Tufano, R., Bavota, G. and Di Penta, M. (2026) ‘Guidelines to Prompt Large Language Models for Code Generation: An Empirical Characterization’, arXiv, arXiv:2601.13118.
OpenAI (2025a) Best practices for prompt engineering with the OpenAI API. OpenAI Help Centre.
OpenAI (2025b) Structured model outputs. OpenAI Developers.
OpenAI (2025c) Building an AI-Native Engineering Team. OpenAI Developers.
Thoughtworks (2025) ‘Spec-driven development: Unpacking one of 2025’s key new AI-assisted engineering practices’, Thoughtworks Insights, 4 December.
Vercel (n.d.) What is v0? Vercel Documentation.
Vogelsang, A., Korn, A., Broccia, G., Ferrari, A., Fischbach, J. and Arora, C. (2025) ‘On the Impact of Requirements Smells in Prompts: The Case of Automated Traceability’, arXiv, arXiv:2501.04810.
Wei, B. (2024) ‘Requirements are All You Need: From Requirements to Code with LLMs’, arXiv, arXiv:2406.10101.
Yang, D., Xie, X., Yang, X., Hu, M., Huang, Y., Zhang, Y., Miao, W., Su, T., Wan, C. and Pu, G. (2026) ‘Assessing the Impact of Requirement Ambiguity on LLM-based Function-Level Code Generation’, arXiv, arXiv:2604.21505

Ready to bring structure to AI-enabled delivery?

Thank you.  Your inquiry has been received.  We appreciate your interest in Requirementum QGR.  We will review your message and respond where there appears to be a strong fit for a conversation.
Submission failed. Please try again.