LLM Usage and Evaluation Survey

This survey aims to understand users’ experiences, preferences, and evaluations of large language models (LLMs), including which models they use, how frequently they use them, and how they perceive their performance.

The survey is anonymous, and all responses will be used for research and analysis purposes only. There are no right or wrong answers, so please feel free to answer honestly based on your own experience.

Section 1. Usage

01. Which large language models have you used?

02. Which large language models have you paid for?

03. Which large language models are you still paying for?

04. Please rank these models by how often you use them

Please provide a rough ranking based on your usage frequency. If some models are tied, the exact order does not matter. Models you have not used can be placed near the end in their original order.

If you have not used any of the listed models, please rank “I have not used any” first. If you use the listed models about equally often, please rank “I use them about equally” first.

Use the up and down arrow keys to change the rank.

ChatGPT (OpenAI)

Claude (Anthropic)

Gemini (Google)

Grok (xAI)

Llama (Meta)

DeepSeek

Doubao / Cici (ByteDance)

Yuanbao (Tencent)

Qwen (Alibaba)

Kimi (Moonshot AI)

Zhipu Qingyan (Zhipu AI)

MiniMax

I have not used any

I use them about equally

05. Please rank these models by how much you like them

Please provide a rough ranking based on how much you like each model. If some models are tied, the exact order does not matter. Models you have not used can be placed near the end in their original order.

If you have not used any of the listed models, please rank “I have not used any” first. If you do not have a clear preference among them, please rank “No clear preference” first.

Use the up and down arrow keys to change the rank.

ChatGPT (OpenAI)

Claude (Anthropic)

Gemini (Google)

Grok (xAI)

Llama (Meta)

DeepSeek

Doubao / Cici (ByteDance)

Yuanbao (Tencent)

Qwen (Alibaba)

Kimi (Moonshot AI)

Zhipu Qingyan (Zhipu AI)

MiniMax

I have not used any

No clear preference

06. Which specific model series or versions do you use right now?

If you are not sure or have never paid attention to the exact model version, just write the name or series you remember. If you do not know, you may leave it blank or write “Not sure.”

07. How often do you use large language models?

Many times a day, not sure of the exact number

A few times a day, and I can roughly count them

Several times a week, but not every day / only on workdays

Several times a month

I do not use them often

08. Do you cross-check answers across different models?

Never

Occasionally

Often

09. Do you feel that you rely on LLMs?

Not at all

A little

Clearly yes

10. How much are you willing to spend per month on paid LLM services?

The following amounts are in US dollars.

11. Which AI IDEs or plugins do you use?

12. Is there any model or feature that particularly impressed you?

Section 2. Model Choice by Use Case

13. Which large language models have you used?

14. When you do in-depth research, write papers, or prepare reports, which LLMs do you prefer?

15. When you write or debug code, which LLMs do you prefer?

16. When you process long texts or read academic papers, which LLMs do you prefer?

17. When you do creative writing or brainstorming, which LLMs do you prefer?

18. What are your thoughts on the current state of large language models?

Feel free to share anything you want.

Section 3. Basic Information

This survey is anonymous. We do not collect any personally identifying information, so please feel free to answer honestly.

19. Which country or region do you currently live in or stay in long-term?

20. What is your gender?

Male

Female

Other

Prefer not to say

21. What is your age group?