Experimenting with ChatGPT for Spreadsheet Formula Generation: Evidence of Risk in AI Generated Spreadsheets

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution > peer-review


Abstract

Large Language Models (LLMs) have become sophisticated enough that complex computer programs can be created from plain English sentences and implemented in a variety of modern languages such as Python, JavaScript and C++, as well as in spreadsheets. These tools are powerful and relatively accurate, and therefore provide broad access to computer programming regardless of the background or knowledge of the individual using them. This paper presents a series of experiments with ChatGPT that explore the tool's ability to produce valid spreadsheet formulae and related computational outputs in situations where ChatGPT has to deduce, infer and problem-solve to reach the answer. The results show that, in certain circumstances, ChatGPT can produce correct spreadsheet formulae with correct reasoning, deduction and inference. However, when information is limited or uncertain, or the problem is too complex, the accuracy of ChatGPT breaks down, as does its ability to reason, infer and deduce. This can also result in false statements and "hallucinations", all of which subvert the process of creating spreadsheet formulae.
Original language: English
Title of host publication: Proceedings of the EuSpRIG 2023 Conference
Subtitle of host publication: The Spreadsheet Crisis: Regaining Control
Publisher: European Spreadsheet Risks Interest Group
Number of pages: 15
ISBN (Print): 978-1-905404-57-5
Publication status: Published - 1 Jun 2023
Event: European Spreadsheet Risks Interest Group EuSpRIG 2023 Conference: The Spreadsheet Crisis: Regaining Control - The Foundling Museum, London
Duration: 6 Jul 2023
https://eusprig.org/conferences/conference-papers-and-abstracts/

Conference

Conference: European Spreadsheet Risks Interest Group EuSpRIG 2023 Conference
City: London
Period: 6/07/23
