Dechiffrierung von LLM-Parameter, Teil 2: Top-P (Kernsampling)

Tutorials

MM-Parameter

Wie jeder Maschinenlern-Modelle hat auch große Sprachmodelle verschiedene Parameter, die die Varianz der generierten Textausgaben steuern. Wir haben eine mehrteilige Serie begonnen, um den Einfluss dieser Parameter detailliert zu erklären. Wir werden abschließend den perfekten Gleichgewicht zwischen Inhaltsgenerierung erzielen, indem wir alle in unserer mehrteiligen Reihe behandelten Parameter verwenden.

Willkommen beim zweiten Teil, wo wir über ein anderes bekanntes Parameter, „Top-P“ sprechen.

Top-P (Kernauswahl)

Wenn das Ziel die Vielfalt der Modelleausgaben steuern ist, dann ist Top-P das richtige für Sie. Ein niedriger Top-P zwingt das Model dazu, die am ehesten zu erwartenden Wörter zu verwenden, während ein hoher Top-P das Model dazu bringt, vielfältigere Wörter zu verwenden, was die Kreativität erhöht.

Lassen Sie uns Top-P in Aktion sehen, indem wir folgenden Code und Output verwenden.

Python

import torch

from transformers import GPT2LMHeadModel, GPT2Tokenizer

​

# Load GPT-2 model and tokenizer

tokenizer = GPT2Tokenizer.from_pretrained("gpt2")

model = GPT2LMHeadModel.from_pretrained("gpt2")

​

# Add pad token to tokenizer (GPT-2 doesn't have it by default)

tokenizer.pad_token = tokenizer.eos_token

​

# Function to generate response with varying top_p

def generate_with_top_p(prompt, top_p):

    inputs = tokenizer(prompt, return_tensors='pt', padding=True)

​

    # Set the attention_mask and pad_token_id

    outputs = model.generate(

        inputs.input_ids,

        attention_mask=inputs['attention_mask'],

        do_sample=True,

        max_length=200, 

        top_p=top_p,

        pad_token_id=tokenizer.eos_token_id

    )

    return tokenizer.decode(outputs[0], skip_special_tokens=True)

​

​

prompt = "What are some effective ways to manage stress in daily life?"

​

# List of top-p values and their descriptions

top_p_values = {

    0.1: "Very conservative: Generates highly probable and safe responses.",

    0.3: "Conservative: Generates probable responses with less risk.",

    0.5: "Balanced: A mix of safe and creative responses.",

    0.7: "Creative: Generates more diverse and creative responses.",

    0.9: "Very creative: Allows for highly diverse and less probable responses."

}

​

# Test top_p variations

for top_p, description in top_p_values.items():

    print(f"\nTop-p {top_p} ({description}):\n")

    print(generate_with_top_p(prompt, top_p=top_p))

​

Ausgabe:

PowerShell

python test_top_p.py

​

Top-p 0.1 (Very conservative: Generates highly probable and safe responses.):

​

What are some effective ways to manage stress in daily life?

​

The following are some of the most common ways to manage stress in daily life.

​

1. Avoiding stress

​

The most common way to avoid stress is to avoid it.

​

The most common way to avoid stress is to avoid it.

​

2. Avoiding stress

​

The most common way to avoid stress is to avoid it.

​

The most common way to avoid stress is to avoid it.

​

3. Avoiding stress

​

The most common way to avoid stress is to avoid it.

​

The most common way to avoid stress is to avoid it.

​

4. Avoiding stress

​

The most common way to avoid stress is to avoid it.

​

The most common way to avoid stress is to avoid it.

​

5. Avoiding stress

​

The most common way to avoid stress is to avoid it.

​

The most common way to avoid stress

​

Top-p 0.3 (Conservative: Generates probable responses with less risk.):

​

What are some effective ways to manage stress in daily life?

​

What are some effective ways to manage stress in daily life?

​

What are some effective ways to manage stress in daily life?

​

What are some effective ways to manage stress in daily life?

​

What are some effective ways to manage stress in daily life?

​

What are some effective ways to manage stress in daily life?

​

What are some effective ways to manage stress in daily life?

​

What are some effective ways to manage stress in daily life?

​

What are some effective ways to manage stress in daily life?

​

What are some effective ways to manage stress in daily life?

​

What are some effective ways to manage stress in daily life?

​
What are some effective ways to manage stress in daily life?
​
What are some effective ways to manage stress in daily life?
​
What are some effective ways to manage stress in daily life?
​
What are some effective
​
Top-p 0.5 (Balanced: A mix of safe and creative responses.):
​
What are some effective ways to manage stress in daily life?
​
1. Stay on top of your body's natural stress levels
​
When you're stressed, your body's natural stress levels are low.
​
If you're stressed, your body's natural stress levels are high.
​
If you're stressed, your body's natural stress levels are low.
​
2. Avoid excessive exercise
​
Exercise can make you feel better.
​
Exercise can make you feel better.
​
3. Get up early to avoid fatigue
​
Exercise can make you feel better.
​
Exercise can make you feel better.
​
4. Avoid the temptation to take the wrong thing
​
Exercise can make you feel better.
​
Exercise can make you feel better.
​
5. Avoid eating the wrong foods
​
Exercise can make you feel better.
​
Exercise can make you feel better.
​
6. Avoid the temptation to
​
Top-p 0.7 (Creative: Generates more diverse and creative responses.):
​
What are some effective ways to manage stress in daily life?
​
I am talking about a very specific situation. The person I am talking about has been stressed, but has not been doing much work for a long time. I want to tell you, because this person has had a lot of stress in his life, that it is not something you can just go back to. But what I'm trying to say is, that if you don't have a job, you have to go back to work every day, so you can spend more time with your family. So I've been doing that for a long time now. And so, that is a very common occurrence.
​
But what do you think is the best way to deal with the stress?
​
You know, it's not easy to deal with it. It is very difficult to deal with the stress that we experience. So, that is a very good way to deal with it. So, I think it's the
​
Top-p 0.9 (Very creative: Allows for highly diverse and less probable responses.):
​
What are some effective ways to manage stress in daily life?
​
There are many things that can be done by daily meditation and practice. As a general rule of thumb, meditation can help you stay mindful of your own needs, goals, feelings, desires, emotions, and the life and emotions around you.
​
The purpose of meditation is to feel a deep desire to practice more, to be more mindful, and to be more productive. It also serves to enhance your overall well-being.
​
1. Be active, be creative, be mindful, and be optimistic.
​
This is where the first step towards meditation comes from. If we're looking for inspiration, there's a whole section on being "active" and "creative."
​
While I'm not sure I know much about meditation, I know some of its practitioners and some that I never met. My mom used to tell me that she'd always find a way to make her feel more connected and involved.

Nun lassen Sie uns die Ausgabe verstehen.

Top-P 0.1 – Sehr konservativ: Da das Model aus den top 10% der wahrscheinlichsten folgenden Wortwahlauswahl auswählt, gibt es viele Wiederholungen in der generierten Inhalte. Daher fehlt dieser Ausgangsbeitrag Vielfalt und ist auch in der Regel uninformativ.
Top-P 0.3 – Konservativ: Der Modell wählt aus den obersten 30% der wahrscheinlichsten folgenden Wortwahl, somit ist er etwas weniger konservativ als die vorherige Top-P Einstellung. Wie Sie aus dem Ergebnis sehen können, hat das keine Verbesserung der Inhaltserzeugung gebracht und der Auftrag wurde immer wieder im fertigen Inhalt wiederholt. In diesem Fall bedeutet die Wiederholung des Auftrags, dass die am wahrscheinlichsten folgende Fortsetzung nach dem Auftrag für das Modell selbst der Auftrag selbst ist.
Top-P 0.5 – Ausgewogen: Hier sehen Sie das Modell erstmals einige nummerierte Strategien auflisten. Es gibt immer noch eine gewisse Wiederholung in dieser Einstellung. Aber das Ergebnis ist, dass bei dieser Top-P Einstellung das Modell einen breiteren Bereich von Wörtern aufnehmen beginnt. Die Ausgabe ist eine Mischung vonStandard-Ratschlägen mit einigen Unstimmigkeiten. Dieser Top-P-Wert ermöglicht eine verbesserte Kreativität, aber es besteht immer noch eine Schwierigkeit mit der Tiefe der Information.
Top-P 0.7 – Kreativ: In diesem Fall kann das Modell aus einem breiteren Bereich von Wörtern wählen, und wie Sie sehen können, geht die Reaktion in Richtung einer narrativen Art und Weise. Der Inhalt ist kreativer, da er jetzt eine Situation einbezieht, in der ein Mensch mit Stress befasst ist. Der Nachteil besteht darin, dass die Fokussierung verloren geht, da der Schwerpunkt nicht auf die Stressbewältigung lag, sondern auf den Schwierigkeiten, die mit der Stressbewältigung zu kämpfen waren.
Top-P 0.9 – Sehr Kreativ: In dieser Einstellung hat das Modell Zugriff auf einen breiten Bereich von Wörtern und Ideen, einschließlich weniger wahrscheinlicher Wörter und Konzepte. Diese Einstellung ermöglicht dem Modell die Verwendung von expressiveren Sprache. Erneut ist der Nachteil der starken Kreativität, dass das Modell vom Auftrag abweicht, um reichhaltige und variantenreiche Inhalte zu produzieren.

Der wichtigste Punkt, den wir aus dem oben genannten Experiment ableiten können, ist die Änderung des Inhalts mit der Änderung der Top-P Einstellung. Es gibt auch eine Ahnung, dass dieser Parameter nicht der einzige ist, der für die Variation des Inhalts und seiner Relevanz behandelt werden muss.

Nun lassen Sie uns uns anschauen, wie sich Top-P auf einige Anwendungsfälle auswirkt, genauso wie im letzten Teil dieser Reihe zu „kreativer Geschichte Generierung“ und „technischer Erklärung“.

Python

import torch
from transformers import GPT2LMHeadModel, GPT2Tokenizer
​
# Load GPT-2 model and tokenizer
tokenizer = GPT2Tokenizer.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2")
​
# Add pad token to tokenizer (GPT-2 doesn't have it by default)
tokenizer.pad_token = tokenizer.eos_token
​
# Function to generate response based on top_p
def generate_with_top_p(prompt, top_p, max_length=250):
    inputs = tokenizer(prompt, return_tensors='pt')
    outputs = model.generate(
        inputs.input_ids,
        attention_mask=inputs.attention_mask,
        do_sample=True,
        max_length=max_length,
        top_p=top_p,
        pad_token_id=tokenizer.eos_token_id,
        eos_token_id=tokenizer.eos_token_id,
        no_repeat_ngram_size=2  # Prevents repetition of phrases
    )
    return tokenizer.decode(outputs[0], skip_special_tokens=True)
​
### USE CASE 1: CREATIVE STORY GENERATION ###
​
def creative_story_generation():
    prompt = ("In the mystical land of Eldoria, a young warrior found an ancient map "
              "that led to a hidden treasure guarded by dragons. He knew that courage and "
              "wisdom would be his allies on this perilous journey.")
​
    # Negative Impact: Low top_p for creative writing (less creative)
    print("\n=== Creative Story with Low top_p (0.2) - Negative Impact: ===")
    low_top_p_story = generate_with_top_p(prompt, top_p=0.2)
    print(low_top_p_story)
​
    # Perfect Impact: High top_p for creative writing (more creative)
    print("\n=== Creative Story with High top_p (0.95) - Perfect Impact: ===")
    high_top_p_story = generate_with_top_p(prompt, top_p=0.95)
    print(high_top_p_story)
​
### USE CASE 2: TECHNICAL EXPLANATION ###
​
def technical_explanation():
    prompt = ("Explain step by step how the internet works, focusing on how computers "
              "use IP addresses and data packets to communicate with each other.")
​
    # Negative Impact: High top_p for technical writing (less precise)
    print("\n=== Technical Explanation with High top_p (0.95) - Negative Impact: ===")
    high_top_p_explanation = generate_with_top_p(prompt, top_p=0.95)
    print(high_top_p_explanation)
​
    # Perfect Impact: Optimal top_p for technical writing (accurate)
    print("\n=== Technical Explanation with Optimal top_p (0.5) - Perfect Impact: ===")
    optimal_top_p_explanation = generate_with_top_p(prompt, top_p=0.5)
    print(optimal_top_p_explanation)
​
# Run both use cases
creative_story_generation()
technical_explanation()
​

Ausgabe:

PowerShell

python top_p_multiple.py
​
=== Creative Story with Low top_p (0.2) - Negative Impact: ===
In the mystical land of Eldoria, a young warrior found an ancient map that led to a hidden treasure guarded by dragons. He knew that courage and wisdom would be his allies on this perilous journey.
​
The Dragon King
...
 (The Book of the Dragon)
,
-
: The Dragon Lord is a legendary warrior who has been the focus of many legends. The dragon king is the most powerful of all the dragons in the world. In the magical land, he is known as the "Dragon King". He is also known to be the leader of a group of dragons called the Black Dragons. His name is derived from the dragon's name, "the dragon".
"The Black Dragon" is an important symbol of power and powerlessness. It is said that the black dragon is able to create a dragon that can defeat the strongest of his enemies. However, the true power of this dragon lies in his ability to manipulate the minds of others. This ability is called "The Dark Dragon". The Dark dragon has a powerful sense of self-preservation and is capable of manipulating others to his will. When he has control over others, his power is so great that he can destroy entire cities. As a result
​
=== Creative Story with High top_p (0.95) - Perfect Impact: ===
In the mystical land of Eldoria, a young warrior found an ancient map that led to a hidden treasure guarded by dragons. He knew that courage and wisdom would be his allies on this perilous journey.
​
Spirits are like gods. In this world, there are no gods without secrets. There are also no secrets about being a fighter or a thief. But every dragon has a special hidden skill, and he or she can use that skill to destroy and gain strength or hide something hidden in the secret. Many dragons are skilled at their martial arts, while most are unaware of the secrets of their true power. These dragons cannot only use these skills, but that will only allow them to escape the dragons' clutches. Because their training will be tested before they're even born, dragon fighting has never been so hard, even without training, so they should be able to break a dragon's body.
​
=== Technical Explanation with High top_p (0.95) - Negative Impact: ===
Explain step by step how the internet works, focusing on how computers use IP addresses and data packets to communicate with each other. If a person with the same identity as a user on the US government's private network uses the online address bar, then this data is sent to a server on a computer on your local network. Your IP address is a small byte in the string. The IP and network address are identical. Do you remember, you just want to do that instead of using IPs or numbers. In addition, remember that IP can be used to verify a particular IP for you and your computer. For instance, your name does not always match an address on our government network and you should have your public IP in this country. This does seem quite unusual and perhaps a bit bizarre.
​
There was a time in Silicon Valley when you could set your identity out. But in most of today's world, how do you set up your own address and how does one look for it? What about the public? The internet itself was different. It was just a set of rules around data flow that you were supposed to follow. Now, even in today the "internet in general" seems a little more complicated to define. Let's say
​
=== Technical Explanation with Optimal top_p (0.5) - Perfect Impact: ===
Explain step by step how the internet works, focusing on how computers use IP addresses and data packets to communicate with each other.
​
"We've been trying to understand how it works and what it means for the future," says James. "It's not just about the IP address, it's about how people communicate. It's also about what's going on with the data. We want to see how this works. What is the Internet going to look like in the next 10 years?"
, the director of the Computer Science and Artificial Intelligence Laboratory at the University of Michigan, says that while there's still a lot of work to be done, "we've got to start to think about it."

Lassen Sie uns nun die Ausgabe für kreatives Storygenerieren und technische Erklärungen aufgrund der Top-P Einstellungen und wie die Ausgabe beeinflusst wurde, analysieren und zusammenbrechen.

Um die Auswirkungen von Top-P effektiv zu demonstrieren, haben wir bessere Anweisungen integriert, um das Ergebnis in eine Richtung zu lenken, in der die Auswirkungen leichter beobachtet werden können.

Kreatives Storygenerieren

Niedrige Top-P (Negativer Einfluss): Wie Sie sehen können, mit der niedrigen Top-P, ist das Modell auf die Verwendung von Wörtern oder Phrasen beschränkt und verursacht somit Wiederholungen und Redundanzen. Die Kreativität ist in diesem Fall ebenfalls begrenzt, da das Modell versucht, keine neuen Ideen einzubringen. Aber wenn Sie beobachten, wird der logische Fluss dennoch aufrechterhalten und das Modell bleibt auf das Thema fokussiert, was typisch für niedrige Top-P Werte ist.
Hohe Top-P (Perfekter Einfluss): In diesem Fall führt das Modell neue Konzepte ein und bringt eine kreative Note zur Erzählung. Eine breitere Vokabular wird verwendet, was den Text tiefgründiger und reicher macht. Allerdings wurde die logische Fluss durch die verstärkten Kreativität eingeschränkt.

Der Kontrast zwischen den beiden Erzählungen zeigt klarmachen, wie der Top-P Einfluss, es einfach zu verstehen, wie er die kreative Schreibweise beeinflusst.

Technische Erklärung

Hohe Top-P (Negativer Einfluss):Wie Sie sehen können, hat eine hohe Top-P einen negativen Einfluss auf technische Erklärungen, indem sie den logischen Fluss behindert und vom Thema abweicht. Das Modell fügt auch unerwünschte Informationen ein, die nicht mit der Erklärung in Verbindung stehen.
Optimale Top-P (Perfekter Einfluss):Mit optimaler Top-P ist die Erklärung kohärenter und näher am Thema. Der Inhalt entspricht mehr dem Auftrag und besitzt eine gute Balance zwischen Genauigkeit und Ausdruck. Die Glaubwürdigkeit der Informationen wird verbessert, weil das Modell auf mehr wahrscheinliche Wörter begrenzt wird.

Fazit

Mit dieser Experimentierung haben wir erfolgreich die Bedeutung des Top-P-Parameters in der Kontrolle der Randomheit und der Kreativität der generierten Texte gezeigt. Wir untersuchten zunächst ein einzelnes Anfragezeichen und wie sich die Ausgabe mit unterschiedlicher Top-P ändert und anschließend einen eher anwendungsbasierten Ansatz, wie Top-P die Ausgabe basierend auf dem Anwendungsfall steuert.

Allerdings haben wir anhand der vorhergehenden Teile und dieser Serie erkannt, dass einzelne Parameter selbst nicht genügen, um die Qualität der Inhaltsgenerierung zu gewährleisten. Deshalb ist es wichtig, den Einfluss aller dieser Parameter zu betrachten, was wir im abschließenden Teil dieser Serie tun werden.

Source:
https://dzone.com/articles/decoding-llm-parameters-top-p