[Feature] Add vllm inference example #863

wheresmyhair · 2024-06-20T04:26:43Z

Description

Add vllm inference example
Rename detokenize to decode_inference_result to make its purpose clearer

Tests

MemorySafeVLLMInference test
Example sh test

Note that the fatal Python error comes from the kill signal. It does not affect the inference, as the results are already saved.

research4pan

The major part looks good to me. A few minor issues to fix before merging into the main branch.

`scripts/run_vllm_inference.sh`

[Bug] line 59: remove absolute path, which contains personal information.
[Style] line 77: maybe use a fixed log name?

`src/lmflow/args.py`

[Style] line 963: better to name it `enable_decode_inference_result` to indicate that this is a flag.
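
To illustrate the naming suggestion, here is a minimal sketch, not the actual code in `args.py`. The class `InferencerArgsSketch`, the help text, the `TOY_VOCAB` table, and the `postprocess` helper are all hypothetical; only the flag name `enable_decode_inference_result` comes from this review.

```python
from dataclasses import dataclass, field

# Toy vocabulary, purely illustrative (a real tokenizer would be used instead).
TOY_VOCAB = {0: "hello", 1: "world"}


@dataclass
class InferencerArgsSketch:
    # Hypothetical stand-in for the real arguments class.
    # The `enable_` prefix signals a boolean switch rather than an action or a value.
    enable_decode_inference_result: bool = field(
        default=False,
        metadata={"help": "Whether to decode the inference results into text."},
    )


def postprocess(token_ids, args):
    """Return decoded text if the flag is set, otherwise the raw token ids."""
    if args.enable_decode_inference_result:
        return " ".join(TOY_VOCAB[i] for i in token_ids)
    return token_ids


if __name__ == "__main__":
    print(postprocess([0, 1], InferencerArgsSketch(enable_decode_inference_result=True)))
    print(postprocess([0, 1], InferencerArgsSketch()))
```

With the prefix, a call site such as `if args.enable_decode_inference_result:` reads as a condition, whereas `decode_inference_result` alone could be mistaken for a method name.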

`src/lmflow/pipeline/utils/memory_safe_vllm_inference.py`

[Style] line 57: better to name it `enable_decode_inference_result` to indicate that this is a flag.

`src/lmflow/pipeline/vllm_inferecer.py`

[Style] lines 87, 102, 112, 117, 128: better to name it `enable_decode_inference_result` to indicate that this is a flag.
[Style] line 204: add a comment, or define a named constant such as `RETURN_CODE_ERROR_BUFFER = 134`, to make it easier to read.
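
A rough sketch of what the named constant could look like, under stated assumptions. The helper `run_and_check` is hypothetical and is not the code in this PR; only the name `RETURN_CODE_ERROR_BUFFER` and the value 134 come from the comment above. By shell convention 134 is 128 + 6 (SIGABRT), but whether that is the exact cause here is an assumption.

```python
import subprocess
import sys

# Named as suggested in the review. Per the PR description, the fatal Python
# error appears after the results are saved, so this code is tolerated.
# Assumption: 134 follows the shell convention 128 + 6 (SIGABRT).
RETURN_CODE_ERROR_BUFFER = 134


def run_and_check(cmd):
    """Run `cmd`; return True on success or on the tolerated return code."""
    result = subprocess.run(cmd)
    if result.returncode == 0:
        return True
    if result.returncode == RETURN_CODE_ERROR_BUFFER:
        print(f"Tolerated return code {result.returncode}", file=sys.stderr)
        return True
    raise RuntimeError(f"Command failed with return code {result.returncode}")


if __name__ == "__main__":
    # Simulate a child process that exits with the tolerated code.
    child = [sys.executable, "-c", "import sys; sys.exit(134)"]
    print(run_and_check(child))
```

Compared with a bare `134` in a comparison, the constant gives the reader a name to search for and one place to document why the code is ignored.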

wheresmyhair · 2024-06-20T07:05:19Z

Changes made, tests passed.

Tests

Example sh test
MemorySafeVLLMInference test

research4pan

LGTM

wheresmyhair added 5 commits June 20, 2024 02:59

[Usability] error handling

397bbef

[Feature] Add vllm inference example

48f26e6

[Usability] rename detokenize to decode_inference_result

9fddcb1

[Bug fix] vllm inference minor bug fix

2fb4f77

[Usability] arg passing fix

1e8b460

research4pan reviewed Jun 20, 2024

wheresmyhair added 5 commits June 20, 2024 14:13

[Style] add err code explanation, detokenize flag name change

20b4722

[Style] detokenize flag name change

514879a

[Doc] vllm inference readme update

76bbf3c

[Style] mem safe vllm inference arg name update

706eeac

[Style] mem safe vllm inf test arg name update

844df44

research4pan approved these changes Jun 20, 2024

research4pan merged commit e5ab2fd into main Jun 20, 2024
2 checks passed

wheresmyhair mentioned this pull request Jun 20, 2024

[Roadmap] LMFlow Roadmap #862

Open

34 tasks

wheresmyhair deleted the yizhenjia-vllm-inferencer branch June 20, 2024 07:14
