Posted inArtificial Intelligence Python PyTorch Web Dev Offload LLM using Facebook FairScale Posted by By Jacob K. Lo October 1, 2023 What is Fairscale FairScale is an open-source library from Facebook. It help researchers and engineers…
Posted inArtificial Intelligence Python PyTorch DeepSpeed API ZeRO Offload Posted by By Jacob K. Lo September 12, 2023 The biggest problem in Deep learning and training large language models (LLMs) is the limit…
Posted inArtificial Intelligence Python PyTorch Offload LLM custom back propagation Posted by By Jacob K. Lo September 2, 2023 What is Model Offloading? Model offloading, as described in the DeepSpeed ZeRO's paper, refers to…
Posted inArtificial Intelligence PyTorch How to generate prompts with 4x less GPU memory Posted by By Jacob K. Lo August 19, 2023 What is 32-bit full precision training and 16-bit half precision training? This picture pretty much…
Posted inArtificial Intelligence Python PyTorch NanoGPT – My takes Posted by By Jacob K. Lo August 5, 2023 NanoGPT is develop by Andrew Karpathy, a Lead engineer from Tesla. The simplest, fastest repository…