Experimenting with ChatGPT for Spreadsheet Formula Generation: Evidence of Risk in AI Generated Spreadsheets

Allbwn ymchwil: Pennod mewn Llyfr/Adroddiad/Trafodion CynhadleddCyfraniad mewn cynhadleddadolygiad gan gymheiriaid

9 Wedi eu Llwytho i Lawr (Pure)

Crynodeb

Large Language Models (LLM) have become sophisticated enough that complex computer programs can be created through interpretation of plain English sentences and implemented in a variety of modern languages such as Python, Java Script, C++ and Spreadsheets. These tools are powerful and relatively accurate and therefore provide broad access to computer programming regardless of the background or knowledge of the individual using them. This paper presents a series of experiments with ChatGPT to explore the tool's ability to produce valid spreadsheet formulae and related computational outputs in situations where ChatGPT has to deduce, infer and problem solve the answer. The results show that in certain circumstances, ChatGPT can produce correct spreadsheet formulae with correct reasoning, deduction and inference. However, when information is limited, uncertain or the problem is too complex, the accuracy of ChatGPT breaks down as does its ability to reason, infer and deduce. This can also result in false statements and "hallucinations" that all subvert the process of creating spreadsheet formulae.
Iaith wreiddiolSaesneg
TeitlProceedings of the EuSpRIG 2023 Conference
Is-deitlThe Spreadsheet Crisis: Regaining Control
CyhoeddwrEuropean Spreadsheet Risks Interest Group
Nifer y tudalennau15
ISBN (Argraffiad)978-1-905404-57-5
StatwsCyhoeddwyd - 1 Meh 2023
DigwyddiadEuropean Spreadsheet Risks Interest Group EuSpRIG 2023 Conference: The Spreadsheet Crisis: Regaining Control - The Foundling Museum, London
Hyd: 6 Gorff 20236 Gorff 2023
https://eusprig.org/conferences/conference-papers-and-abstracts/

Cynhadledd

CynhadleddEuropean Spreadsheet Risks Interest Group EuSpRIG 2023 Conference
DinasLondon
Cyfnod6/07/236/07/23
Cyfeiriad rhyngrwyd

Dyfynnu hyn