Dynamic Batch Processing with FlexiDecode Scheduler for Efficient LLM Inference in IIoT
journal contribution
posted on 2025-10-06, 04:56 authored by X Jia, B Gu, J Chen, Longxiang GaoLongxiang Gao, W Pang, G Lv, Y Qu, L CuiDynamic Batch Processing with FlexiDecode Scheduler for Efficient LLM Inference in IIoT
Funding
Funder: Taishan Scholar Project of Shandong Province | Grant ID: TSQNZ20230621
History
Related Materials
- 1.
Open access
- Yes
Language
engJournal
Big Data Mining and AnalyticsVolume
8Pagination
1307-1323ISSN
2096-0654eISSN
2097-406XIssue
6Publisher
Tsinghua University PressUsage metrics
Keywords
Licence
Exports
RefWorksRefWorks
BibTeXBibTeX
Ref. managerRef. manager
EndnoteEndnote
DataCiteDataCite
NLMNLM
DCDC


