Understanding large language models (LLMs)

  • Release version: Yokohama
  • Updated November 20, 2025
  • 2 minutes to read
  • Summarize
    Summarized using AI
    This content was generated using new OpenAI-powered functionality. Results are provided on an as is basis and are not guaranteed to be accurate or complete.

    Summary of Understanding Large Language Models (LLMs)

    Large language models (LLMs) are generative AI systems that create responses dynamically rather than retrieving fixed answers. This characteristic allows for flexibility and creativity in interactions, making them valuable tools in various applications.

    Show full answer Show less

    Key Features

    • Generative Nature: LLMs generate responses one word at a time based on probability, ensuring variability in outputs.
    • Probabilistic Sampling: The model samples from multiple likely words rather than selecting the top choice, leading to diverse responses.
    • Temperature Settings: This parameter controls randomness; higher settings yield more creative responses while lower settings result in more repetitive outputs.
    • Context Sensitivity: Minor changes in input or conversation history can significantly affect the generated output.
    • System-Level Factors: Variations may arise from hardware and backend updates, even with identical prompts.

    Key Outcomes

    Users can expect variations in responses to the same prompts, which enhances creativity and adaptability. The ServiceNow AI Platform incorporates these LLMs in its search tools, where similar searches may yield different results. Understanding this variability helps manage expectations and utilize these models effectively.

    Large language models are generative, not retrieval-based. They create responses dynamically using probability, which means you can’t expect identical outputs every time. This variability is a feature, not a bug, because it allows for flexibility, creativity, and adaptability.

    How LLMs work

    Large language models (LLMs), like ChatGPT or Copilot, are advanced AI systems that are trained on massive amounts of text to understand and generate human-like language. They build a statistical model of language, so they don’t store fixed answers like an encyclopedia. When you ask a question, the model generates an answer one word (or token) at a time, choosing the next most likely word based on probabilities learned during training. This prediction process makes them powerful, but it's also why they are non-deterministic. This means the system does not always produce the exact same result (output) for the same prompt (input).

    Why results may vary

    Even if you provide the same question or prompt twice, the response can differ. Here’s why:
    Probabilistic sampling
    The model doesn’t always pick the single most likely word. It samples from several likely options. This introduces variation.
    Temperature settings
    Temperature controls randomness, and this internal parameter varies among LLM models. A higher temperature delivers more creative responses, while lower temperatures tend to be more repetitive.
    Multiple valid answers
    Many questions have more than one correct way to explain something. The model may choose different phrasing or emphasis each time.
    Context sensitivity
    Tiny changes in punctuation or prior conversation can shift the output.
    System-level factors
    Hardware concurrency, floating-point math, and backend updates can introduce slight variations, even when everything else is fixed.

    For example, think of it like rolling dice to pick words. When you ask a question, the model doesn’t follow a fixed script. Instead, it looks at many possible next words and picks one based on probabilities—like rolling weighted dice. The dice are weighted toward the most likely words, but there’s still a chance for variation. If you roll again (ask the same question), you might get a slightly different sequence, even though the rules didn’t change. This randomness is intentional. It makes the model flexible and creative, rather than rigid and repetitive.

    For more information about supported LLM models, see Large language models on the ServiceNow AI Platform.

    Variations in search

    The ServiceNow AI Platform® offers a variety of search tools, which may return different answers for the same or similar searches. This disparity in results is expected. For more information, see the following topics: