From Specs to Stories: AI-Powered Language for Clear Requirements

Author: Denis Avetisyan

A new approach leverages artificial intelligence to translate formal specifications into easily understandable natural language, ensuring consistency and reducing ambiguity in requirement engineering.

This review details a method for systematically constructing controlled natural language requirements from formal specifications using AI assistants and pattern-based techniques.

Despite the increasing need for precise and unambiguous requirements in complex systems, bridging the gap between formal specification and natural language remains a significant challenge. This paper, ‘Developing controlled natural language for formal specification patterns using AI assistants’, introduces a novel method for systematically constructing controlled natural language from formal specifications by leveraging AI assistance to generate requirement patterns and formalize their syntax. The approach focuses on event-driven temporal requirements, creating a corpus of patterns reduced through partial evaluation of attributes within a generalized template. Could this pattern-based, AI-driven technique offer a scalable solution for improving clarity and consistency in requirements engineering across diverse domains?

The Illusion of Clarity: Why Natural Language Fails

The prevalent practice of detailing system requirements through natural language, while seemingly intuitive, frequently introduces a critical impediment to advanced development techniques. Traditional specifications, often penned in everyday prose, suffer from inherent imprecision – terms are open to interpretation, and logical relationships remain implicit rather than explicitly defined. This lack of formalization prevents automated reasoning tools from effectively analyzing a system’s intended behavior, hindering crucial processes like verification and validation. Consequently, ambiguities can persist undetected until late in the development lifecycle, leading to costly rework, integration issues, and ultimately, systems that fail to meet intended objectives. A truly robust engineering approach demands a shift towards requirement definitions that are not only understandable by humans but also rigorously interpretable by machines, enabling a higher degree of confidence in system reliability and correctness.

The pervasive use of natural language in systems requirements introduces significant vulnerabilities to the development process. While seemingly convenient, imprecise wording and inherent ambiguities within everyday language often lead to multiple interpretations of a single requirement. This lack of clarity can result in developers implementing features that deviate from the intended functionality, necessitating costly rework and potentially causing system failures. Inconsistency further exacerbates the problem, as contradictory statements within the requirements documentation create confusion and necessitate time-consuming resolution. Studies have demonstrated that errors stemming from ambiguous or inconsistent requirements represent a substantial portion of overall project costs, highlighting the critical need for more precise and formal methods of specification to ensure reliable system development.

The pursuit of truly reliable and trustworthy systems necessitates a shift towards formally defined requirements, moving beyond the inherent limitations of natural language. Traditional specifications, while seemingly comprehensive, often harbor ambiguities and inconsistencies that can propagate through the development lifecycle, leading to costly errors and system failures. A robust framework addresses this by employing mathematical or logical notations to express requirements with absolute precision, enabling automated reasoning, verification, and validation. This formalization isn’t merely about preventing errors; it establishes a solid foundation for building systems where behavior is predictable, safety is assured, and trust is earned. By translating stakeholder needs into a precise, unambiguous language that machines can understand, developers can create systems that consistently meet expectations and operate with unwavering reliability, particularly crucial in safety-critical applications like aerospace, medicine, and autonomous vehicles.

Building Blocks: Defining System Attributes

A formal specification utilizes a defined set of attributes to comprehensively detail system requirements. These attributes – Trigger, Reaction, Invariant, Delay, Release, and Final – each serve a distinct purpose in specifying behavior. The Trigger attribute denotes the condition that initiates the requirement; the Reaction attribute defines the system’s expected response. Invariant attributes establish conditions that must remain consistently true throughout the requirement’s execution. Temporal aspects are addressed by Delay, which specifies a waiting period before execution, and Release, which defines conditions for pausing execution. Finally, the Final attribute indicates the conditions that signify the requirement’s completion or termination, ensuring a well-defined lifecycle.

The Trigger Attribute within a formal specification defines the initiating condition or event that activates a given requirement; this attribute precisely specifies what must occur to begin the execution of the requirement’s defined functionality. Correspondingly, the Reaction Attribute details the expected outcome or action that should occur in direct response to the Trigger; it outlines the system’s behavior once the triggering condition is met. These attributes establish a clear cause-and-effect relationship, ensuring that the system responds predictably to specific inputs or events and providing a basis for verification and testing of the requirement’s correct operation.

Invariant attributes within a formal specification define conditions that remain consistently true during the entire execution of a requirement. These attributes are not time-dependent and serve as continuous constraints on the system’s state. Violations of invariant conditions indicate a failure in the requirement’s implementation or a breach of its intended behavior. They are crucial for maintaining system stability and predictability, as they establish a baseline for acceptable operational parameters throughout the requirement’s lifecycle and are typically expressed as logical or mathematical statements that must always evaluate to true.

The Delay attribute specifies the time that must elapse before a requirement’s reaction can begin following trigger activation. The Release attribute defines the conditions under which a requirement is considered complete and no longer active, potentially involving a specific event or time period. Finally, the Final attribute establishes any post-completion conditions or actions that must occur after the Release condition is met, such as resource deallocation or state resets; these three attributes collectively govern the temporal lifecycle of a requirement, ensuring predictable and controlled execution and termination.

From Attributes to Patterns: A Surprisingly Systematic Approach

The CNL Construction Method initiates formal specification development by explicitly identifying and defining core attributes relevant to the system being modeled. This process involves decomposing high-level requirements into fundamental characteristics, each representing a specific property or aspect of the system’s entities and their relationships. These attributes are then formally defined with precise data types and value ranges, establishing a clear and unambiguous foundation for subsequent specification stages. The initial attribute definition phase aims to capture the essential characteristics necessary to express system behavior and constraints, serving as the building blocks for more complex requirement patterns and ultimately, a complete formal specification.

Attribute Evaluation is a technique used within the CNL Construction Method to manage requirement complexity by assigning fixed, constant values to specific attributes. This process directly reduces ambiguity inherent in variable or undefined attributes, leading to a more precise specification. By pre-defining these values, the computational overhead associated with evaluating these attributes during formal verification or automated reasoning is significantly decreased. This simplification streamlines the subsequent derivation of Requirement Patterns and facilitates more efficient processing by the AI Assistant used in pattern generation and grammar construction.

Requirement Patterns are constructed by combining defined attributes to represent common requirement types, enabling a standardized approach to specification. These patterns encapsulate specific constraints and behaviors, such as the Simplified Universal (SUP) Pattern, which asserts a condition must hold true for all possible values within a defined domain. The SUP pattern, for example, can be formally expressed as $\forall x : \phi(x)$ , where $\phi(x)$ represents the requirement condition. By predefining these patterns, the CNL Construction Method facilitates the efficient creation of complex specifications from a limited set of core attributes, promoting consistency and reducing the potential for errors.

The CNL Construction Method incorporates an AI Assistant to automate the generation of Requirement Patterns and associated formal grammars. This automation significantly reduces the manual effort required for specification development and minimizes the potential for human error during pattern creation and grammar construction. The AI Assistant functions by translating the defined attributes and derived patterns into a formal grammar capable of generating a diverse range of natural language sentences that accurately reflect the initial, compact representation of the requirements. This capability allows for efficient creation of comprehensive and consistent specifications from a limited set of core attributes, and facilitates validation through the generation of testable statements.

Formalizing Language: The Illusion of Control

The construction of a controlled natural language (CNL) grammar begins with a foundation of pre-defined requirement patterns, effectively acting as building blocks for formal expression. This isn’t simply imposing rules onto existing language; rather, the grammar is derived from the patterns themselves, ensuring a direct correspondence between the expressed requirement and its underlying logic. By systematically analyzing these patterns, a formal grammar is built – one that limits ambiguity and promotes consistency. This carefully constructed grammar then enables the translation of complex, technical specifications into statements that, while formally precise, remain readily understandable to human stakeholders. The result is a language engineered for clarity, bridging the gap between rigorous specification and accessible communication, and laying the groundwork for automated verification processes.

The foundation of the CNL grammar’s reliability lies in its rigorous logical underpinnings, specifically Event-Driven Temporal Logic (EDTL). EDTL allows for the precise definition of how events unfold over time, capturing not just what happens, but also when and in what order. This temporal aspect is crucial for requirements specification, where the sequence of actions often dictates correct system behavior. Supporting EDTL are Linear Temporal Logic (LTL) formulas, which provide a formal language for expressing these temporal properties. $G(p \rightarrow F q)$ , for example, formally states that “always (globally), if p is true, eventually q will be true.” By grounding the CNL grammar in these established logical systems, the resulting language avoids ambiguity and ensures that statements possess a consistent, unambiguous meaning – a critical feature for both human understanding and automated verification processes.

The process of transforming complex formal specifications into understandable requirements relies heavily on a Generalized Natural Language Pattern, which acts as an intermediary bridge between machine logic and human comprehension. This pattern doesn’t simply map keywords; instead, it defines a structured framework for representing the core meaning of each formal element in a way that aligns with natural language syntax. By adhering to this pattern, the system can consistently generate Controlled Natural Language (CNL) statements that are both precise and readable. This ensures that requirements aren’t just logically sound, but also readily understood by all stakeholders – from engineers to end-users – fostering clear communication and minimizing ambiguity throughout the development lifecycle. The resulting CNL statements offer a human-interpretable version of the formal logic, allowing for validation and verification without needing specialized expertise in formal methods.

The construction of controlled natural language (CNL) requirements from formal specification patterns offers a systematic pathway to bridge the communication gap between technical teams and stakeholders. This methodology doesn’t simply translate specifications; it formalizes them into a language that is both human-readable and machine-interpretable, fostering clearer understanding and reducing ambiguity. Critically, this allows for automated verification of requirements against the original formal specifications, identifying inconsistencies or errors that might otherwise slip through traditional review processes. By embedding logical precision within a natural language framework, the approach moves beyond mere documentation towards a robust system for managing and validating complex system requirements, ultimately enhancing project reliability and reducing the potential for costly misunderstandings.

The pursuit of perfectly formalized requirements, as detailed in this work on controlled natural language, feels predictably optimistic. The article proposes leveraging AI assistants to translate formal specifications into understandable patterns, aiming for clarity and consistency. Yet, history suggests these patterns, no matter how elegantly constructed, will eventually succumb to the realities of production systems. As Ken Thompson observed, “Debugging is twice as hard as writing the code in the first place. Therefore, if you write the code as cleverly as possible, you are, by definition, not smart enough to debug it.” The effort to build these systems, while valuable, merely shifts the complexity-and the inevitable technical debt-elsewhere. This pattern-based specification, like all attempts at perfect foresight, will ultimately be a temporary reprieve from chaos, not a permanent solution.

What’s Next?

The pursuit of translating intent into machine-readable form via ‘controlled natural language’ feels…familiar. It always does. One suspects that after a few successful deployments, the ‘control’ will begin to erode, inevitably yielding a language that is ‘natural’ in all the wrong ways. The system, once a tightly governed set of patterns, will gradually accommodate the ambiguities of human expression. They’ll call it ‘AI-powered flexibility’ and raise funding. It’s a predictable arc.

The current focus on formalizing syntax is, at least, a pragmatic acknowledgement of reality. Event-driven temporal logic is all well and good, but production systems rarely adhere to theoretical purity. The real challenge isn’t creating a formal specification; it’s maintaining it when the business logic inevitably shifts. One imagines the system, initially a simple bash script, now resembling a sprawling, undocumented codebase.

Future work will undoubtedly explore ‘smarter’ AI assistants, capable of inferring intent and resolving ambiguity. But ambiguity is, often, a feature, not a bug. The more likely outcome is a proliferation of ‘specification debt’ – the accumulated cost of imprecise or incomplete requirements. Tech debt is just emotional debt with commits, after all. And the documentation? It lied again.

Original article: https://arxiv.org/pdf/2512.24159.pdf

Contact the author: https://www.linkedin.com/in/avetisyan/

The Illusion of Clarity: Why Natural Language Fails

Building Blocks: Defining System Attributes

From Attributes to Patterns: A Surprisingly Systematic Approach

Formalizing Language: The Illusion of Control

What’s Next?

See also: