October 1, 2024
The model release train continues, with Mistral’s multimodal Pixtral 12B, OpenAI’s o1 models, and Roblox’s model for building 3D scenes. We also have another important AI-enabled programming tool: Cursor is an alternative to GitHub Copilot that’s getting rave reviews.
Security will never cease to be a problem, but this month seems particularly problematic. The Mirai botnet is infecting a widely used surveillance camera that is unpatchable; the only known mitigation is to replace the camera. And attackers are targeting participants in GitHub projects, telling them that their project has vulnerabilities and sending them to a malware site to learn more.
Artificial Intelligence
- Simon Willison uses the curl utility to discover how streaming APIs for large language models work.
- Goldfish loss is a new loss function that language models can use to minimize the “memorization” of long passages during training. Models trained this way would be less likely to reproduce their training material verbatim.
- OpenAI has put two models into limited (preview) release: OpenAI o1-mini and o1-preview. Both reduce errors and hallucinations by implementing chain-of-thought reasoning. o1-preview spends more effort reasoning through problems before generating a response; OpenAI describes o1-mini as a cost-effective model that’s more accurate for scientific reasoning.
- Mistral has released Pixtral ...
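The goldfish loss item in the list above can be made more concrete with a small sketch. As we understand the idea, a subset of token positions is left out of the next-token loss, so the model never gets a training signal for reproducing a whole passage exactly. The code below is only an illustration under that assumption: the function names, the "drop every k-th token" rule, and the value of `k` are hypothetical choices for this sketch, not the authors' implementation, and a real training loop would work on tensors rather than Python lists.

```python
import math


def goldfish_mask(num_tokens: int, k: int = 4) -> list[bool]:
    """Hypothetical mask: True keeps a token in the loss, and every k-th token is dropped."""
    if k < 2:
        raise ValueError("k must be at least 2, otherwise every token is dropped")
    return [(position + 1) % k != 0 for position in range(num_tokens)]


def masked_next_token_loss(token_probs: list[float], mask: list[bool]) -> float:
    """Average negative log-likelihood over only the tokens that the mask keeps.

    token_probs[i] is the probability the model assigned to the correct token
    at position i (assumed to be in the range (0, 1]).
    """
    if len(token_probs) != len(mask):
        raise ValueError("token_probs and mask must have the same length")
    kept_losses = [-math.log(p) for p, keep in zip(token_probs, mask) if keep]
    if not kept_losses:
        raise ValueError("the mask dropped every token")
    return sum(kept_losses) / len(kept_losses)


if __name__ == "__main__":
    # Made-up probabilities for an eight-token passage.
    probs = [0.9, 0.8, 0.7, 0.01, 0.6, 0.5, 0.9, 0.02]
    mask = goldfish_mask(len(probs), k=4)
    print("kept positions:", [i for i, keep in enumerate(mask) if keep])
    print("masked loss:", masked_next_token_loss(probs, mask))
```

Because the dropped positions contribute nothing to the loss, the model is never pushed to predict them correctly, which is what would make verbatim recall of long passages less likely. A fixed every-k-th rule is the simplest possible mask; choosing positions pseudo-randomly from the preceding tokens would be another option.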