Abstract: With the widespread popularity of Large Language Models (LLMs), Mixture of Experts (MoE) has not only emerged as a key enabler for scaling up model capacity by significantly reducing ...