Qwen2.5-Coder, a six-model family of LLMs, boasts enhanced code generation, reasoning, and debugging. Trained on 5.5 trillion tokens, its 32B parameter model rivals GPT-4o, offering versatile capabilities for coding and broader applications.
Select the AI model to use for chat
Define the AI assistant's personality and behavior
Maximum length of response (higher = longer replies)
Creativity level (higher = more creative, lower = more focused)
Nucleus sampling threshold
Limit vocabulary choices to top K tokens
Penalize repeated words (higher = less repetition)