Document Type

Article

Source of Publication

Journal of Innovation and Entrepreneurship

Publication Date

12-1-2025

Abstract

Artificial intelligence (AI) chatbots show remarkable abilities across applications. Despite a growing literature, their capability in the field of entrepreneurship is not fully understood. The aim of this study is to empirically evaluate and compare capabilities of five major AI chatbots—GPT-3.5, GPT-4, Gemini 1.0, Llama 2, and Claude—in the context of entrepreneurship theory, using a benchmark entrepreneurship test. In particular, the performance of the chatbots on a set of multiple-choice questions, short-answer questions, and essay questions related to entrepreneurship is assessed. The results indicate that GPT-4 delivers the strongest overall performance. Meanwhile, Llama 2 offers precise responses with a significantly lower word count compared to the GPT models. Although chatbots do not always provide correct or precise answers to questions or complex prompts, they still prove to be valuable analytical tools for entrepreneurs. While the study offers compelling insights into chatbots’ grasp of entrepreneurship concepts, the findings are somewhat limited by the scarce availability of data.

ISSN

2192-5372

Volume

14

Issue

1

Disciplines

Life Sciences

Keywords

AI, Business, Chatbots, Claude, Comparative analysis, Entrepreneurship, Gemini, Generative AI, GPT, Large language models, Llama, Machine learning, Zero-shot prompting

Scopus ID

05007043176

Indexed in Scopus

yes

Open Access

yes

Open Access Type

Gold: This publication is openly available in an open access journal/series

Included in

Life Sciences Commons

Share

COinS