Thesis
Software documentation classification in open-source applications
Washington State University
Master of Science (MS), Washington State University
05/2020
DOI:
https://doi.org/10.7273/000004235
Handle:
https://hdl.handle.net/2376/124620
Abstract
With the increasing popularity of open-source software development, there is a tremendous growth of software artifacts that provide insight into how people build software. Researchers are always looking for large-scale and representative software artifacts to produce systematic and unbiased validation of novel and existing techniques. For example, in the domain of software requirements traceability, researchers often use software applications with multiple types of artifacts, such as requirements, system elements, verifications, or tasks to develop and evaluate their traceability analysis techniques. However, the manual identification of rich software artifacts is very labor-intensive. In this work, we first conduct a large-scale study to identify which types of software artifacts are produced by a wide variety of open-source projects at different levels of granularity. Then we propose an automated approach based on Machine Learning techniques to identify various types of software artifacts. Through a set of experiments, we report and compare the performance of these algorithms when applied to software artifacts.
Metrics
6 File views/ downloads
29 Record Views
Details
- Title
- Software documentation classification in open-source applications
- Creators
- Michael Anthony Christensen
- Contributors
- Venera Arnaoudova (Advisor) - Washington State University, Electrical Engineering and Computer Science, School of
- Awarding Institution
- Washington State University
- Academic Unit
- Electrical Engineering and Computer Science, School of
- Theses and Dissertations
- Master of Science (MS), Washington State University
- Publisher
- Washington State University
- Identifiers
- 99900896424001842
- Language
- English
- Resource Type
- Thesis