Submitted:
14 May 2023
Posted:
18 May 2023
You are already at the latest version
Abstract
Keywords:
1. Introduction
1.1. Requirements Definition and Requirements Engineering
1.2. Shift towards Model-based Systems Engineering (MBSE)
1.3. Semi-Machine-Readable Requirements
1.3.1. Requirement Table
1.3.2. Requirement Boilerplates
1.3.3. Leveraging Language Models
1.4. Research Focus and Expected Contributions
2. Background
2.1. Overview of Enablers
2.1.1. Natural Language Processing for Requirements Engineering (NLP4RE)
2.1.2. Large Language Models (LLMs) Fine-Tuned for Use in Aerospace Domain
- aeroBERT-NER [23] can identify named entities belonging to five distinct categories (as shown in Table 1), which were selected based on their frequency and importance to aerospace texts. The fine-tuning process involved using annotated aerospace text from various sources, as well as requirements from Parts 23 and 25 of Title 14 of the Code of Federal Regulations (CFRs), to fine-tune BERT and generate aeroBERT-NER.
- aeroBERT-Classifier [24] can classify aerospace requirements into three categories: design, functional, and performance requirements. To train aeroBERT-Classifier, the authors fine-tuned BERT using annotated aerospace certification requirements from Parts 23 and 25 of Title 14 of the CFRs. The ability to classify requirements is crucial when analyzing NL requirements, particularly in large-scale systems where there are a significant number of requirements to be defined [43].
2.1.3. Usefulness of Sentence Chunks in Requirements Standardization
2.2. Ways to Standardize Requirements
2.2.1. Requirement Table in SysML
2.2.2. Requirement Boilerplates
Conformance to Boilerplate Templates
The Need for Tailored Boilerplate Templates
3. Research Gaps & Objectives
- Creation of requirements tables: The proposed requirement table contains columns populated by outputs obtained from aeroBERT-Classifier [24] and aeroBERT-NER [23]. This table can further aid in the creation of model-based (e.g., SysML) requirement objects by automatically extracting relevant words (system names, resources, quantities, etc.) from free-form NL requirements.
-
Identification and creation of requirements boilerplates: Distinct requirement types have unique linguistic patterns that set them apart from each other. “Patterns” in this case means the order of types of tags (sentence chunks and NEs) in sequences representing the original requirement.To ensure the boilerplate templates are tailored to each requirement type in a swift and efficient manner, it is crucial to adopt an agile approach based on dynamically identified syntactic patterns in requirements, which is a more adaptive approach when compared to their rule-based counterparts. This study leverages language models to detect the linguistic patterns in requirements, which in turn aid in creating bespoke boilerplates.The aeroBERT-Classifier is used for requirement classification, aeroBERT-NER for identifying named entities relevant to the aerospace domain, flair/chunk-english for extracting text chunks present in each requirement type. The extracted named entities and text chunks are analyzed to identify different elements present in a requirement sentence, and their presence and order in a requirement are used to determine distinct boilerplate templates. The templates identified might resemble the following formats:
- (a)
- <Each/The/All> <system/systems> shall <action>.
- (b)
- <Each/The/All> <system/systems> shall allow <entity> to be <state> for at least <value>.
4. Methodology
4.1. Creating Requirement Tables
4.2. Observation and Analysis of Linguistic Patterns in Aerospace Requirements for Boilerplate Creation
4.2.1. General Patterns Observed in Requirements
5. Results
5.1. Requirement Table
5.2. Boilerplate Templates
5.2.1. Design Requirements
5.2.2. Functional Requirements
5.2.3. Performance Requirements
6. Conclusions & Future Work
7. Other Details
Author Contributions
- Archana Tikayat Ray: Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Software, Validation, Visualization, Writing—original draft preparation, Writing—review and editing
- Bjorn F. Cole: Conceptualization, Writing—review and editing
- Olivia J. Pinon Fischer: Conceptualization, Writing—review and editing
- Anirudh Prabhakara Bhat: Data curation, Formal analysis, Investigation, Methodology, Software, Validation, Visualization, Writing—review and editing
- Ryan T. White: Writing—review and editing
- Dimitri N. Mavris: Writing—review and editing
Data Availability Statement
- The annotated aerospace requirements dataset can be found on the Hugging Face platform. URL: https://huggingface.co/datasets/archanatikayatray/aeroBERT-classification.
- The annotated aerospace NER dataset can be found on the Hugging Face platform. URL: https://huggingface.co/datasets/archanatikayatray/aeroBERT-NER.
Abbreviations
| BERT | Bidirectional Encoder Representations from Transformers |
| CFR | Code of Federal Regulations |
| FAA | Federal Aviation Administration |
| FAR | Federal Aviation Regulations |
| GPT | Generated Pre-trained Transformer |
| INCOSE | International Council on Systems Engineering |
| LM | Language Model |
| LLM | Large Language Model |
| MBSE | Model-Based Systems Engineering |
| NE | Named Entity |
| NER | Named Entity Recognition |
| NL | Natural Language |
| NLP | Natural Language Processing |
| NLP4RE | Natural Language Processing for Requirements Engineering |
| ORG | Organization (Entity label) |
| RE | Requirements Engineering |
| RES | Resource (Entity label) |
| SME | Subject Matter Expert |
| SYS | System (Entity label) |
| SysML | Systems Modeling Language |
References
- Guide to the Systems Engineering Body of Knowledge; BKCASE Editorial Board, INCOSE, 2020; p. 945.
- INCOSE. Incose Infrastructure Working Group Charter; pp. 3–5.
- NASA. Appendix C: How to Write a Good Requirement; pp. 115–119.
- Regnell, B.; Svensson, R.B.; Wnuk, K. Can we beat the complexity of very large-scale requirements engineering? In Proceedings of the International Working Conference on Requirements Engineering: Foundation for Software Quality; Springer, 2008; pp. 123–128.
- NASA. 2.1 The Common Technical Processes and the SE Engine. J. Object Technol. 4.
- Nuseibeh, B.; Easterbrook, S. Requirements Engineering: A Roadmap. In Proceedings of the Conference on The Future of Software Engineering; Association for Computing Machinery: New York, NY, USA, 2000; ICSE ’00; pp. 35–46. [CrossRef]
- Firesmith, D. Common Requirements Problems, Their Negative Consequences, and the Industry Best Practices to Help Solve Them. J. Object Technol. 2007, 6, 17–33. [Google Scholar]
- Haskins, B.; Stecklein, J.; Dick, B.; Moroney, G.; Lovell, R.; Dabney, J. 8.4. 2 error cost escalation through the project life cycle. In Proceedings of the INCOSE International Symposium; Wiley Online Library, 2004; Volume 14, pp. 1723–1737.
- Bell, T.E.; Thayer, T.A. Software requirements: Are they really a problem? In Proceedings of the 2nd International Conference on Software Engineering, 1976; pp. 61–68.
- Estefan, J.A.; others. Survey of model-based systems engineering (MBSE) methodologies. Incose MBSE Focus Group 2007, 25, 1–12. [Google Scholar]
- Jacobson, L.; Booch, J.R.G. The unified modeling language reference manual. 2021.
- Ballard, M.; Peak, R.; Cimtalay, S.; Mavris, D.N. Bidirectional Text-to-Model Element Requirement Transformation. IEEE Aerosp. Conf. 2020, 1–14. [Google Scholar]
- Lemazurier, L.; Chapurlat, V.; Grossetête, A. An MBSE approach to pass from requirements to functional architecture. IFAC-PapersOnLine 2017, 50, 7260–7265. [Google Scholar]
- Needs, Requirements, Verification, Validation Lifecycle Manual; BKCASE Editorial Board, INCOSE, 2022; p. 457.
- Wheatcraft, L.; Ryan, M.; Llorens, J.; Dick, J. The Need for an Information-based Approach for Requirement Development and Management. In Proceedings of the INCOSE International Symposium; Wiley Online Library, 2019; Volume 29, pp. 1140–1157.
- Requirement Table. Available online: https://docs.nomagic.com/display/SYSMLP182/Requirement+Table. (accessed on 10 February 2023).
- Modeling Requirements with SysML. Available online: https://re-magazine.ireb.org/articles/modeling-requirements-with-sysml. (accessed on 10 February 2023).
- Vallejo, P.; Mazo, R.; Jaramillo, C.; Medina, J.M. Towards a new template for the specification of requirements in semi-structured natural language. J. Softw. Eng. Res. Dev. 2020, 8, 1–3. [Google Scholar]
- Arora, C.; Sabetzadeh, M.; Briand, L.; Zimmer, F. Automated checking of conformance to requirements templates using natural language processing. IEEE Trans. Softw. Eng. 2015, 41, 944–968. [Google Scholar]
- Arora, C.; Sabetzadeh, M.; Briand, L.C.; Zimmer, F. Requirement boilerplates: Transition from manually-enforced to automatically-verifiable natural language patterns. In Proceedings of the 2014 IEEE 4th International Workshop on Requirements Patterns (RePa) 2014; pp. 1–8.
- Rupp, C. Requirements-Engineering und-Management: Professionelle, iterative Anforderungsanalyse für die Praxis; Hanser Verlag, 2007.
- Mavin, A.; Wilkinson, P.; Harwood, A.; Novak, M. Easy approach to requirements syntax (EARS). In Proceedings of the 2009 17th IEEE International Requirements Engineering Conference, 2009; pp. 317–322.
- Ray, A.T.; Pinon-Fischer, O.J.; Mavris, D.N.; White, R.T.; Cole, B.F., aeroBERT-NER: Named-Entity Recognition for Aerospace Requirements Engineering using BERT. In Proceedings of the AIAA SCITECH 2023 Forum. [CrossRef]
- Ray, A.T.; Cole, B.F.; Pinon-Fischer, O.J.; White, R.T.; Mavris, D.N., aeroBERT-Classifier: Classification of Aerospace Requirements using BERT. Preprints. [CrossRef]
- Akbik, A.; Blythe, D.; Vollgraf, R. Contextual String Embeddings for Sequence Labeling. In Proceedings of the COLING 2018, 27th International Conference on Computational Linguistics, 2018; pp. 1638–1649.
- Ferrari, A.; Dell’Orletta, F.; Esuli, A.; Gervasi, V.; Gnesi, S. Natural Language Requirements Processing: A 4D Vision. IEEE Softw. 2017, 34, 28–35. [Google Scholar]
- Abbott, R.J.; Moorhead, D. Software requirements and specifications: A survey of needs and languages. J. Syst. Softw. 1981, 2, 297–316. [Google Scholar]
- Manning, C.; Surdeanu, M.; Bauer, J.; Finkel, J.; Bethard, S.; McClosky, D. The Stanford CoreNLP Natural Language Processing Toolkit. In Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics: System Demonstrations; Association for Computational Linguistics: Baltimore, Maryland, 2014; pp. 55–60. [CrossRef]
- Natural Language Toolkit. Available online: https://www.nltk.org/ (accessed on 10 January 2023).
- spaCy. Availabke online: https://spacy.io/ (accessed on 10 January 2023).
- Vaswani, A.; Shazeer, N.; Parmar, N.; Uszkoreit, J.; Jones, L.; Gomez, A.N.; Kaiser. ; Polosukhin, I. Attention is all you need. Adv. Neural Inf. Process. Syst. 2017, 30. [Google Scholar]
- Devlin, J.; Chang, M.W.; Lee, K.; Toutanova, K. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. arXiv 2019, arXiv:1810.04805. [Google Scholar]
- Lewis, M.; Liu, Y.; Goyal, N.; Ghazvininejad, M.; Mohamed, A.; Levy, O.; Stoyanov, V.; Zettlemoyer, L. BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension. CoRR 2019, abs/1910.13461.
- Sebastiani, F. Machine Learning in Automated Text Categorization. ACM Comput. Surv. 2002, 34, 1–47. [Google Scholar] [CrossRef]
- Zhao, L.; Alhoshan, W.; Ferrari, A.; Letsholo, K.J.; Ajagbe, M.A.; Chioasca, E.V.; Batista-Navarro, R.T. Natural language processing for requirements engineering: A systematic mapping study. ACM Comput. Surv. (CSUR) 2021, 54, 1–41. [Google Scholar]
- Dalpiaz, F.; Ferrari, A.; Franch, X.; Palomares, C. Natural language processing for requirements engineering: The best is yet to come. IEEE Softw. 2018, 35, 115–119. [Google Scholar]
- Zhao, L.; Alhoshan, W.; Ferrari, A.; Letsholo, K.J. Classification of Natural Language Processing Techniques for Requirements Engineering, 2022. [CrossRef]
- Bengio, Y.; Ducharme, R.; Vincent, P. A Neural Probabilistic Language Model. In Advances in Neural Information Processing Systems; Leen, T.; Dietterich, T.; Tresp, V., Eds. MIT Press, 2000; Volume 13.
- Raffel, C.; Shazeer, N.; Roberts, A.; Lee, K.; Narang, S.; Matena, M.; Zhou, Y.; Li, W.; Liu, P.J. Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer. J. Mach. Learn. Res. 2020, 21, 1–67. [Google Scholar]
- Radford, A.; Wu, J.; Child, R.; Luan, D.; Amodei, D.; Sutskever, I.; others. Language models are unsupervised multitask learners. OpenAI blog 2019, 1, 9. [Google Scholar]
- Brown, T.; Mann, B.; Ryder, N.; Subbiah, M.; Kaplan, J.D.; Dhariwal, P.; Neelakantan, A.; Shyam, P.; Sastry, G.; Askell, A.; et al. Language Models are Few-Shot Learners. In Advances in Neural Information Processing Systems; Larochelle, H.; Ranzato, M.; Hadsell, R.; Balcan, M.; Lin, H., Eds. Curran Associates, Inc., 2020; Volume 33, pp. 1877–1901.
- Hugging Face. Available online: https://huggingface.co/ (accessed on 10 January 2023).
- Dalpiaz, F.; Dell’Anna, D.; Aydemir, F.B.; Çevikol, S. Requirements Classification with Interpretable Machine Learning and Dependency Parsing. In Proceedings of the 2019 IEEE 27th International Requirements Engineering Conference (RE), 2019; pp. 142–152. [CrossRef]
- Sonbol, R.; Rebdawi, G.; Ghneim, N. The Use of NLP-Based Text Representation Techniques to Support Requirement Engineering Tasks: A Systematic Mapping Review. IEEE Access 2022, 10, 62811–62830. [Google Scholar] [CrossRef]
- Jurafsky, D.; Martin, J.H. Speech and language processing (draft). 2021.
- Syntax. https://webspace.ship.edu/cgboer/syntax.html. (accessed on 21 February 2023).
- Riesener, M.; Dölle, C.; Becker, A.; Gorbatcheva, S.; Rebentisch, E.; Schuh, G. Application of natural language processing for systematic requirement management in model-based systems engineering. INCOSE International Symposium 2021, 31, 806–815. Available online: https://incose.onlinelibrary.wiley.com/doi/pdf/10.1002/j.2334-5837.2021.00871.x. [CrossRef]
- Rajan, A.; Wahl, T. CESAR: Cost-efficient methods and processes for safety-relevant embedded systems; Number 978-3709113868, Springer, 2013.
- Ruiz, A.; Sabetzadeh, M.; Panaroni, P.; others. Challenges for an open and evolutionary approach to safety assurance and certification of safety-critical systems. In Proceedings of the 2011 First International Workshop on Software Certification. IEEE, 2011, pp. 1–6.
- Arora, C.; Sabetzadeh, M.; Briand, L.; Zimmer, F.; Gnaga, R. Automatic checking of conformance to requirement boilerplates via text chunking: An industrial case study. In Proceedings of the 2013 ACM/IEEE International Symposium on Empirical Software Engineering and Measurement. IEEE, 2013, pp. 35–44.
- Arora, C.; Sabetzadeh, M.; Briand, L.; Zimmer, F. Automated extraction and clustering of requirements glossary terms. IEEE Trans. Softw. Eng. 2016, 43, 918–945. [Google Scholar]
- FAA. Title 14 Code of Federal Regulations; FAA: 2023.
- Tikayat Ray, A. Standardization of Engineering Requirements Using Large Language Models; Georgia Institute of Technology: 2023.































| Category | NER Tags | Example |
|---|---|---|
| System | B-SYS, I-SYS | nozzle guide vanes, flight recorder, fuel system |
| Value | B-VAL, I-VAL | 5.6 percent, 41000 feet, 3 seconds |
| Date time | B-DATETIME, I-DATETIME | 2017, 2014, Sept. 19,1994 |
| Organization | B-ORG, I-ORG | DOD, NASA, FAA |
| Resource | B-RES, I-RES | Section 25-341, Section 25-173 through 25-177, Part 25 subpart C |
| Sentence Chunk | Definition |
|---|---|
| Noun Phrase (NP) | Consists of a noun and other words modifying the noun (determinants, adjectives, etc.); |
| Example:The airplane design must protect the pilot and flight controls from propellers. | |
| Verb Phrase (VP) | Consists of a verb and other words modifying the verb (adverbs, auxiliary verbs, prepositional phrases, etc.); |
| Example: The airplane design must protect the pilot and flight controls from propellers. | |
| Subordinate Clause (SBAR) | Provides more context to the main clause and is usually introduced by subordinating conjunction (because, if, after, as, etc.) |
| Example: There must be a means to extinguish any fire in the cabin such that the pilot, while seated, can easily access the fire extinguishing means. | |
| Adverbial Clause (ADVP) | Modifies the main clause in the manner of an adverb and is typically preceded by subordinating conjunction; |
| Example: The airplanes were grounded until the blizzard stopped. | |
| Adjective Clause (ADJP) | Modifies a noun phrase and is typically preceded by a relative pronoun (that, which, why, where, when, who, etc.); |
| Example: I can remember the time when air-taxis didn’t exist. |
| Requirement type | Count |
|---|---|
| Design | 149 |
| Functional | 99 |
| Performance | 62 |
| Total | 310 |
| Column Name | Description | Method used to populate |
|---|---|---|
| Name | System (SYS named entity) that the requirement pertains to | aeroBERT-NER [23] |
| Text | Original requirement text | Original requirement text |
| Type of Requirement | Classification of the requirement as design, functional, or performance | aeroBERT-Classifier [24] |
| Property | Identified named entities belonging to RES, VAL, DATETIME, and ORG categories present in a requirement related to a particular system (SYS) | aeroBERT-NER [23] |
| Related to | Identified system named entity (SYS) that the requirement properties are associated with | aeroBERT-NER [23] |



| Requirement type | Count | Boilerplate Count | % of requirements covered |
|---|---|---|---|
| Design | 149 | 2 | ∼55% |
| Functional | 100 | 5 | 63% |
| Performance | 61 | 3 | ∼58% |

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).