Skip to content

Pull requests: InternLM/lmdeploy

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

fix: fail fast on invalid serve parsers
#4701 opened Jun 23, 2026 by CUHKSZzxy Collaborator Loading…
fix _reduce_split_kernel for triton 3.5.1 Bug:P1
#4696 opened Jun 22, 2026 by irexyc Collaborator Loading…
Optimize TTFT
#4695 opened Jun 22, 2026 by grimoire Collaborator Draft
1 task
Remove interactive chat and make inference stateless
#4694 opened Jun 22, 2026 by lvhan028 Collaborator Draft
chore: remove deprecated model support
#4693 opened Jun 22, 2026 by CUHKSZzxy Collaborator Draft
bump version to v0.14.0
#4689 opened Jun 18, 2026 by lvhan028 Collaborator Loading…
Support long-context and MTP prefix-cache hits
#4688 opened Jun 17, 2026 by grimoire Collaborator Loading…
fix: gate multimodal preprocessing concurrency
#4687 opened Jun 17, 2026 by CUHKSZzxy Collaborator Loading…
[Improve]: Remove dlblas from lmdeploy
#4682 opened Jun 16, 2026 by RunningLeon Collaborator Loading…
fix: parse multimodal tool messages Bug:P1
#4680 opened Jun 16, 2026 by CUHKSZzxy Collaborator Loading…
Batch invariant support PART1
#4666 opened Jun 10, 2026 by grimoire Collaborator Draft
refactor: unify interleaved MRoPE rotary embedding
#4644 opened Jun 3, 2026 by CUHKSZzxy Collaborator Draft
Add multimodal and preemption metrics
#4640 opened Jun 1, 2026 by CUHKSZzxy Collaborator Loading…
TEST: Improve tool test
#4632 opened May 28, 2026 by littlegy Contributor Loading…
Interleave long-context prefill chunks with decode
#4631 opened May 28, 2026 by grimoire Collaborator Loading…
1 task done
modify save model in lite module improvement
#4624 opened May 26, 2026 by 43758726 Contributor Loading…
feat(turbomind): support priority schedule policy
#4614 opened May 22, 2026 by 4mengy Loading…
3 of 4 tasks
perf: optimize guided decoding with xgrammar upgrade, batched API, and async D2H overlap
#4605 opened May 21, 2026 by windreamer Collaborator Loading…
1 of 4 tasks
Intern s2 preview lite awq fix bug
#4600 opened May 19, 2026 by 43758726 Contributor Loading…
[WIP]: Support reuse routed experts on eviction
#4599 opened May 19, 2026 by RunningLeon Collaborator Loading…
ProTip! Updated in the last three days: updated:>2026-06-20.