April 4, 2023
In March, it felt like large language models sucked all the air out of the room. There were so many announcements and claims and new waiting lists to join that it was difficult to find news about other important technologies. Those technologies still exist, and are still developing. There’s a world beyond AI.
One important change in the past month: the new cybersecurity strategy for the United States shifts responsibility from customers to software and service providers. If something bad happens, it’s no longer (entirely) your fault; vendors need to build more secure software and services. The use of memory-safe languages, particularly Rust, but also older languages like Java and new contenders like Zig, will help software become more secure.
Artificial Intelligence
- According to Simon Willison, GPT4All is the easiest way to get a (small) large language model running on a laptop. It’s the base LLaMA model with further training on 800,000 questions and answers generated by GPT-3.5.
- Hugging Face has created a tool called Fair Diffusion for de-biasing images generated by generative graphics tools. With minimal changes to the image, Fair Diffusion changes gender and ethnic characteristics to reflect diversity in populations. It’s suggested that similar techniques will work for language models.
- Databricks has released Dolly, a small large language model (6B parameters). ...