January 4, 2024
More large language models. Always more large language models. Will the new year be any different? But there is a difference in this month’s AI news: there’s an emphasis on tools that make it easy for users to use models. Whether it’s just tweaking a URL so you can ask questions of a paper on arXiv or using LLamafile to run a model on your laptop (make sure you have a lot of memory!) or using the Notebook Language Model to query your own documents, AI is becoming widely accessible—and not just a toy with a web interface.
Artificial Intelligence
- Adding talk2 to the start of any arXiv URL (e.g., talk2arxiv.org) loads the paper into an AI chat application so you can talk to it. This is a very clever application of the RAG pattern.
- Google’s Autonomous Vehicle startup, Waymo, has reported a total of three minor injuries to humans in over 7 million miles of driving. This is clearly not Tesla, not Uber, not Cruise.
- Google’s DeepMind has used a large language model to solve a previously unsolved problem in mathematics. This is arguably the first time a language model has created information that didn’t previously exist.
- The creator of llamafile has offered a set of one-line bash scripts for laptop-powered AI.
- Microsoft has released a small language model named Phi-2. Phi-2 is a 2.7B parameter model that has been trained extensively on “textbook-quality ...
Get Radar Trends to Watch: January 2024 now with the O’Reilly learning platform.
O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.