Bug in the L4 notebook (Optimizing a Model with LLM Compressor)
Hello, Cedric said in the video (9:20) that each window overlaps with the previous one, but actually there is no overlap in the code (see below) and “target_slice[:, :-trg_len] = -100” does nothing. def calculate_perplexity(model, tokenizer, dataset, max_tokens=5000, stride=512): encodings = tokeniz
⚡
Key Insights
10 editorial insights.
AiFeed24 Team·⏱ 1 min read·News
Deep Analysis
Multi-Source Intelligence
Tags:#ai
Found this useful? Share it!

