Research Methodology: ChatGPT as a Tool for Coding 4,000 Dissertations

Location

Preston 2

Session Format

Presentation

Abstract

This research study is in the process of examining over 4,000 Ed.D. dissertations collected from 56 doctoral programs nationwide. The long-term goal is to conduct a meta-analysis, but the project is currently focused on developing a methodology for large-scale data extraction. Initially, a graduate student was manually reviewing and coding each dissertation into a spreadsheet. Over the course of a year, this yielded data from approximately 400 dissertations. This data includes the population of the study (P-12 or higher education), research methodology (quantitative, qualitative, or mixed methods), chapter lengths, and the number of citations used. While this process was effective, the time to do so makes it impractical to analyze the full dataset. To address this, the researchers have explored using artificial intelligence to automate this process. Using ChatGPT, the data was extracted from the PDFs of the dissertations into tables that can be exported. To ensure the reliability and accuracy of using AI in this process, the ChatGPT-coded data was compared against the 400 manually coded dissertations. The initial findings suggest that AI-assisted coding can accurately extract such data while drastically reducing the time to do so. This presentation will discuss this methodology and findings.

Keywords

Artificial Intelligence (AI), Dissertation Research, Meta-Analysis, Educational Leadership (Ed.D.), Research Methodology

Professional Bio

Steve Tolman: Steven Tolman, Ed.D., is an Associate Professor of Educational Leadership at Georgia Southern University. His previous roles included serving as a Higher Education Administration graduate program director and 12 years as a student affairs administrator in Residence Life, Student Conduct, and Student Life. He holds a Doctorate from Rutgers University, a Master’s from Texas Tech University, and a Bachelor’s from Central Michigan University. The two streams of his scholarly agenda include 1) the profession of student affairs and 2) the professional preparation of educational leaders.

Creative Commons License

Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 License.

This document is currently not available here.

Share

COinS
 
Jan 30th, 9:00 AM Jan 30th, 10:00 AM

Research Methodology: ChatGPT as a Tool for Coding 4,000 Dissertations

Preston 2

This research study is in the process of examining over 4,000 Ed.D. dissertations collected from 56 doctoral programs nationwide. The long-term goal is to conduct a meta-analysis, but the project is currently focused on developing a methodology for large-scale data extraction. Initially, a graduate student was manually reviewing and coding each dissertation into a spreadsheet. Over the course of a year, this yielded data from approximately 400 dissertations. This data includes the population of the study (P-12 or higher education), research methodology (quantitative, qualitative, or mixed methods), chapter lengths, and the number of citations used. While this process was effective, the time to do so makes it impractical to analyze the full dataset. To address this, the researchers have explored using artificial intelligence to automate this process. Using ChatGPT, the data was extracted from the PDFs of the dissertations into tables that can be exported. To ensure the reliability and accuracy of using AI in this process, the ChatGPT-coded data was compared against the 400 manually coded dissertations. The initial findings suggest that AI-assisted coding can accurately extract such data while drastically reducing the time to do so. This presentation will discuss this methodology and findings.