Advances in Lightweight LLMs and On-Device AI for Enhanced Privacy

In the past few weeks, both OpenAI and Google have introduced smaller-scale large language models. OpenAI’s GPT-4o mini reportedly has around 8B parameters (OpenAI has not published an official figure), while Google’s Gemma 2 2B is even more compact at roughly 2B parameters, small enough to run on the free-tier T4 GPUs in Google Colab. It’s exciting to see the advancement towards more

Posts, Thoughts and Reflections