August 6, 2024
July was a big month for model releases: There are new large models from Mistral and Meta, smaller multilingual models from Mistral and DeepL, another Mistral model that specializes in code generation, and a small version of GPT-4o. The security world saw another software supply chain disaster when CrowdStrike released a bad software update that disabled many Windows machines worldwide. While CrowdStrike’s release wasn’t “hostile,” strictly speaking, it demonstrates that there’s no real difference between a hostile attack or a bug that disables your IT infrastructure. We’re also seeing a surge in malware traffic, along with bogus vulnerability reports in CVE.
Artificial Intelligence
- Google’s AlphaProof and Alpha Geometry solved four of six Math Olympiad problems, a performance that would have earned a silver medal in an actual competition. This is by far the best that an AI has ever achieved. However, it was significantly slower than humans.
- Mistral has released Mistral Large 2, a 123 billion parameter model that (like other models) claims performance similar to GPT-4o. It is particularly strong at code generation. Mistral also highlights its multilingual capabilities. Large 2 is available on Hugging Face.
- Facebook/Meta has released Llama 3.1, a 405 billion parameter model that claims performance superior to GPT-4 and Claude 3.5 Sonnet (at least on benchmarks). ...
Get Radar Trends to Watch: August 2024 now with the O’Reilly learning platform.
O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.