support for batch size > 1 for single problem generations (n_samples=1) #23

Open
Muennighoff opened this issue Dec 13, 2022 · 5 comments · May be fixed by #36
Labels: enhancement (New feature or request)

Comments

@Muennighoff
Contributor

The command below fails with --batch_size 16, but works when batch_size is set to 1 🧐

```
(bigcode) niklas@hf-dgx-01:~/bigcode-evaluation-harness$ accelerate launch main.py --model bigcode/christmas-models --revision fim --tasks codexglue_code_to_text-python --batch_size 16
The following values were not passed to `accelerate launch` and had defaults used instead:
	`--num_cpu_threads_per_process` was set to `64` to improve out-of-box performance
To avoid this warning pass in values for each of the problematic parameters or run `accelerate config`.
Selected Tasks: ['codexglue_code_to_text-python']
Loading the model and tokenizer
100%|████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 3/3 [00:00<00:00, 840.09it/s]
100%|████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 3/3 [00:00<00:00, 949.22it/s]
number of problems for this task is 14918
0it [00:06, ?it/s]
Traceback (most recent call last):
  File "/home/niklas/bigcode-evaluation-harness/main.py", line 188, in <module>
    main()
  File "/home/niklas/bigcode-evaluation-harness/main.py", line 175, in main
    results[task] = evaluator.evaluate(task)
  File "/home/niklas/bigcode-evaluation-harness/lm_eval/evaluator.py", line 62, in evaluate
    generations, references = self.generate_text(task_name)
  File "/home/niklas/bigcode-evaluation-harness/lm_eval/evaluator.py", line 45, in generate_text
    generations = parallel_generations(
  File "/home/niklas/bigcode-evaluation-harness/lm_eval/generation.py", line 82, in parallel_generations
    generations = complete_code(
  File "/home/niklas/bigcode-evaluation-harness/lm_eval/utils.py", line 83, in complete_code
    for step, batch in tqdm(enumerate(dataloader)):
  File "/home/niklas/miniconda3/envs/bigcode/lib/python3.10/site-packages/tqdm/std.py", line 1195, in __iter__
    for obj in iterable:
  File "/home/niklas/miniconda3/envs/bigcode/lib/python3.10/site-packages/accelerate/data_loader.py", line 491, in __iter__
    observed_batch_size = find_batch_size(batch)
  File "/home/niklas/miniconda3/envs/bigcode/lib/python3.10/site-packages/accelerate/utils/operations.py", line 177, in find_batch_size
    raise TypeError(f"Can only find the batch size of tensors but got {type(data)}.")
TypeError: Can only find the batch size of tensors but got <class 'NoneType'>.
```

Probably related:

```
(bigcode) niklas@hf-dgx-01:~/bigcode-evaluation-harness$ accelerate launch main.py --model bigcode/christmas-models --revision fim --tasks codexglue_code_to_text-python --limit 8 --max_length_generation 512 --do_sample False --n_samples 100 --batch_size 16
The following values were not passed to `accelerate launch` and had defaults used instead:
	`--num_cpu_threads_per_process` was set to `64` to improve out-of-box performance
To avoid this warning pass in values for each of the problematic parameters or run `accelerate config`.
Selected Tasks: ['codexglue_code_to_text-python']
Loading the model and tokenizer
100%|████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 3/3 [00:00<00:00, 782.52it/s]
100%|████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 3/3 [00:00<00:00, 901.68it/s]
number of problems for this task is 8
0it [00:00, ?it/s]
Traceback (most recent call last):
  File "/home/niklas/bigcode-evaluation-harness/main.py", line 188, in <module>
    main()
  File "/home/niklas/bigcode-evaluation-harness/main.py", line 175, in main
    results[task] = evaluator.evaluate(task)
  File "/home/niklas/bigcode-evaluation-harness/lm_eval/evaluator.py", line 62, in evaluate
    generations, references = self.generate_text(task_name)
  File "/home/niklas/bigcode-evaluation-harness/lm_eval/evaluator.py", line 45, in generate_text
    generations = parallel_generations(
  File "/home/niklas/bigcode-evaluation-harness/lm_eval/generation.py", line 82, in parallel_generations
    generations = complete_code(
  File "/home/niklas/bigcode-evaluation-harness/lm_eval/utils.py", line 87, in complete_code
    generated_tokens = accelerator.unwrap_model(model).generate(
  File "/home/niklas/miniconda3/envs/bigcode/lib/python3.10/site-packages/torch/autograd/grad_mode.py", line 27, in decorate_context
    return func(*args, **kwargs)
  File "/home/niklas/miniconda3/envs/bigcode/lib/python3.10/site-packages/transformers/generation/utils.py", line 1513, in generate
    raise ValueError(
ValueError: num_return_sequences has to be 1, but is 16 when doing greedy search.
```
@loubnabnl
Collaborator

loubnabnl commented Dec 14, 2022

We currently parallelize generation only for tasks that require multiple generations per problem, like HumanEval and MBPP. So batch_size should only be increased when the number of candidate solutions n_samples is higher than 1, which is not the case here.
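
In other words, batch_size currently batches copies of the same prompt, so it only has an effect when n_samples > 1. A minimal sketch of that scheme (illustrative only, not the harness's actual code):

```python
# Sketch of the current batching scheme (illustrative only): copies of the
# *same* prompt are grouped into batches, so batch_size only matters when
# n_samples > 1.
def make_batches(prompts, n_samples, batch_size):
    batches = []
    for prompt in prompts:                      # one problem at a time
        copies = [prompt] * n_samples           # n_samples candidate generations
        for i in range(0, len(copies), batch_size):
            batches.append(copies[i:i + batch_size])
    return batches

# With n_samples=1, every batch holds a single prompt regardless of batch_size.
print(make_batches(["def add(a, b):"], n_samples=1, batch_size=16))
# -> [['def add(a, b):']]
```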

@Muennighoff
Contributor Author

👍 Would it also make sense to support batch_size for batching multiple examples even when n_samples is 1?
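
Concretely, batching across problems would mean padding several different prompts to a common length and generating them in one pass; a rough sketch of the idea (illustrative only, with gpt2 as a placeholder model, not a proposed patch to the harness):

```python
# Sketch of cross-problem batching when n_samples == 1 (illustrative only):
# prompts from *different* problems are padded to a common length and
# generated together.
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")
tokenizer.pad_token = tokenizer.eos_token
tokenizer.padding_side = "left"   # left-pad so generation continues each prompt
model = AutoModelForCausalLM.from_pretrained("gpt2")

prompts = ["def add(a, b):", "def fib(n):"]   # two different problems in one batch
inputs = tokenizer(prompts, return_tensors="pt", padding=True)
outputs = model.generate(**inputs, max_new_tokens=32,
                         pad_token_id=tokenizer.eos_token_id)
print(tokenizer.batch_decode(outputs, skip_special_tokens=True))
```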

@loubnabnl
Collaborator

Yes, definitely! Especially since some benchmarks can have thousands of problems.

@infinitylogesh
Collaborator

I am interested in working on this, if nobody else is currently working on it :)

@loubnabnl
Collaborator

Yes, feel free to work on it!

loubnabnl changed the title from "batch size > 1 not working" to "support for batch size > 1 for single problem generations (n_samples=1)" on Apr 30, 2023
loubnabnl added the enhancement (New feature or request) label on Apr 30, 2023