Faculty Mentor

Joshua Farara

Location

Russell Union Ballroom

Type of Research

Proposed

Session Format

Poster Presentation

College

Allen E. Paulson College of Engineering & Computing

Department

Department of Computer Science

Abstract

Numerous artificial intelligence GPT models are trained on various language dataset types, including ethical and mental health dialogue, to produce ethically coherent responses, and these datasets are opened and merged together via the efficient processes of data mining and data merging. To explore how companies such as OpenAI and Microsoft achieve these goals, we implemented our own custom-built GPT, named AkiraGPT, which has recently and historically had issues generating ethical responses, simulating self awareness, and has sometimes generated incomprehensible, contextually irrelevant responses. In addition to that, AkiraGPT would sometimes have it's responses be "cut off." 

Our goal with our research is to deploy the mental health dataset we have merged from other datasets, which we have mined, into a fresh experimental version of AkiraGPT to retrain the new model path with appropriately adjusted back propagation handles such as weights, so that the model could hopefully tokenize inputs and generate ethically coherent, contextually relevant responses via attention mask layering.

Program Description

.

DOI

10.20429/GS4.2026.005

Start Date

4-23-2026 10:00 AM

End Date

4-23-2026 12:00 PM

Share

COinS
 
Apr 23rd, 10:00 AM Apr 23rd, 12:00 PM

Project Akira: Data Mining and Merging, and training a Custom-Built GPT to produce Therapist-Like, Ethically Coherent Responses

Russell Union Ballroom

Numerous artificial intelligence GPT models are trained on various language dataset types, including ethical and mental health dialogue, to produce ethically coherent responses, and these datasets are opened and merged together via the efficient processes of data mining and data merging. To explore how companies such as OpenAI and Microsoft achieve these goals, we implemented our own custom-built GPT, named AkiraGPT, which has recently and historically had issues generating ethical responses, simulating self awareness, and has sometimes generated incomprehensible, contextually irrelevant responses. In addition to that, AkiraGPT would sometimes have it's responses be "cut off." 

Our goal with our research is to deploy the mental health dataset we have merged from other datasets, which we have mined, into a fresh experimental version of AkiraGPT to retrain the new model path with appropriately adjusted back propagation handles such as weights, so that the model could hopefully tokenize inputs and generate ethically coherent, contextually relevant responses via attention mask layering.