Publication:
Is conversational AI ready for engineering licensure? A ChatGPT performance benchmark and prompting strategy evaluation

dc.contributor.buuauthor: GENÇ, OLCAY
dc.contributor.department: Mühendislik Fakültesi
dc.contributor.department: İnşaat Mühendisliği Ana Bilim Dalı
dc.contributor.researcherid: AFH-5568-2022
dc.date.accessioned: 2025-11-06T16:39:28Z
dc.date.issued: 2025-10-18
dc.description.abstract: The integration of artificial intelligence (AI) into specialized fields like civil engineering presents transformative opportunities. This study provides a comprehensive quantitative benchmark of the ChatGPT-4 model's performance on the Fundamentals of Engineering (FE) Civil Exam. A two-phase methodology was employed. First, the model's baseline performance was evaluated on a dataset of 100 representative exam questions using zero-shot prompting. Second, a follow-up experiment investigated the impact of advanced prompting strategies, including one-shot prompting, on the 51 incorrectly answered questions. The initial findings reveal a significant performance disparity: ChatGPT-4 achieved high accuracy in conceptual domains like "Ethics and Professional Practice" (100%) but struggled in calculation-intensive areas such as "Statics" (14%), with an overall accuracy of 49%. The subsequent prompt engineering experiment, however, demonstrated that providing a single example (one-shot prompting) was the most effective strategy, correctly answering 30 of the 51 previously failed questions. These combined results offer critical evidence that while current LLMs have inherent limitations in analytical reasoning, their effectiveness can be substantially enhanced through strategic user interaction. The study concludes that AI should be implemented as a powerful supplemental tool, with an educational focus on teaching students how to effectively guide these models to achieve desired outcomes.
dc.identifier.doi: 10.1080/13467581.2025.2574556
dc.identifier.issn: 1346-7581
dc.identifier.scopus: 2-s2.0-105019232462
dc.identifier.uri: https://doi.org/10.1080/13467581.2025.2574556
dc.identifier.uri: https://hdl.handle.net/11452/56566
dc.identifier.wos: 001594517800001
dc.indexed.wos: WOS.SCI
dc.indexed.wos: WOS.AHCI
dc.language.iso: en
dc.publisher: Taylor & Francis Ltd
dc.relation.journal: Journal of Asian Architecture and Building Engineering
dc.subject: Artificial intelligence in education
dc.subject: ChatGPT-4
dc.subject: Civil Engineering education
dc.subject: conversational AI
dc.subject: LLM
dc.subject: educational technology integration
dc.subject: Arts & Humanities
dc.subject: Science & Technology
dc.subject: Technology
dc.subject: Architecture
dc.title: Is conversational AI ready for engineering licensure? A ChatGPT performance benchmark and prompting strategy evaluation
dc.type: Article
dspace.entity.type: Publication
local.contributor.department: Mühendislik Fakültesi/İnşaat Mühendisliği Ana Bilim Dalı
local.indexed.at: WOS
local.indexed.at: Scopus
relation.isAuthorOfPublication: 2d57a04e-3183-4474-a6b0-1e54038d3d1c
relation.isAuthorOfPublication.latestForDiscovery: 2d57a04e-3183-4474-a6b0-1e54038d3d1c
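
As an illustrative aside to the zero-shot versus one-shot comparison described in the abstract, the short Python sketch below shows one way such an evaluation could be scripted against a chat-completion API. It is a minimal sketch, not the authors' procedure: the model name "gpt-4", the system instruction, and the helper functions ask() and accuracy() are assumptions introduced here for demonstration only.

from openai import OpenAI  # assumes the OpenAI Python SDK (>= 1.0) is installed

client = OpenAI()  # reads OPENAI_API_KEY from the environment

def ask(question, example=None):
    """Return the model's answer; pass a solved (question, answer) pair for one-shot prompting."""
    # Hypothetical system instruction; the published study does not specify one.
    messages = [{"role": "system",
                 "content": "You are taking the FE Civil exam. Reply with the letter of the correct option."}]
    if example is not None:  # one-shot: prepend a single worked example
        ex_question, ex_answer = example
        messages += [{"role": "user", "content": ex_question},
                     {"role": "assistant", "content": ex_answer}]
    messages.append({"role": "user", "content": question})
    response = client.chat.completions.create(model="gpt-4", messages=messages)
    return response.choices[0].message.content.strip()

def accuracy(questions, answers, example=None):
    """Fraction of questions answered correctly under the chosen prompting strategy."""
    correct = sum(ask(q, example).startswith(a) for q, a in zip(questions, answers))
    return correct / len(questions)

Calling accuracy(questions, answers) corresponds to the zero-shot setting, while accuracy(questions, answers, example=(worked_question, worked_answer)) corresponds to the one-shot strategy evaluated in the study.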

Files

Original bundle

Name:
Genc_2025.pdf
Size:
1.51 MB
Format:
Adobe Portable Document Format