PowerInfer
About

Posts

  • Jun 3, 2024

    PowerInfer-2: Fast Large Language Model Inference on a Smartphone

subscribe via RSS

PowerInfer

  • PowerInfer
  • hodlenx@gmail.com
  • SJTU-IPADS

High-speed Large Language Model Serving leveraging activation locality on PC and Mobile devices.