Term of Award

Spring 2022

Degree Name

Master of Science in Computer Science (M.S.)

Document Type and Release Option

Thesis (restricted to Georgia Southern)

Copyright Statement / License for Reuse

Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 License.


Department of Computer Science

Committee Chair

Andrew A. Allen

Committee Member 1

Gursimran S. Walia

Committee Member 2

Lixin Li


Agile methodologies such as Scrum have been increasingly used in software development processes to create high-quality software products more quickly and adaptively. Identifying faults early and analyzing the impact of changing requirements (user stories) is time-consuming and tedious. This research presents a novel approach to supporting the change impact analysis of user stories. The pairwise semantic similarities of the features (TFIDF vectors Word2Vec) extracted from 147 user stories were computed using Cosine similarity and Word Mover's distance measures. After quantifying the similarities of all pairs of user stories, we applied K-Means, Agglomerative, and Spectral clustering to identify semantically similar clusters of user stories. The experimental results of these combinations of techniques obtained using different clustering techniques show varied performance results compared to the ground truth (from the implementation of these user stories). The research findings indicate that the developers can use our approach to locate the inter-related user stories as they fix requirements, which reduces the time otherwise spent on manual analysis.

Research Data and Supplementary Material