Radar Trends to Watch: July 2023
Developments in AI, Security, Quantum Computing, and
More
By Mike Loukides
July 5, 2023
A surprising number of the entries for AI are about generative models that don’t
generate text or artwork—specifically, they generate human voices or music. Is
voice the next frontier for AI? Google’s AudioPaLM, which unites speech
recognition, speech synthesis, and language modeling, may show the direction in
which AI is heading. There’s also increasing concern about the consequences of
training AI on data that was generated by AI. With less input from real humans,
does “model collapse” lead to output that is mediocre at best?
Artificial Intelligence
- RoboCat is an AI
model for controlling robots that learns how to learn. Unlike most
robotics, which are designed to perform a small number of tasks,
RoboCat can learn new tasks after it is deployed, and the learning
process speeds up as it learns more tasks.
- AudioPaLM is a new
language model from Google that combines speech generation, speech
understanding, and natural language processing. It’s a large
language model that understands and produces voice.
- Voicemod is a tool
for turning human speech into AI-generated speech in real time. The
company offers a number of “sonic avatars” that can be further
customized.
- Tree-of-thought
prompting expands on chain-of-thought by causing language models to
consider multiple reasoning
paths in the process of generating an output.
- Facebook/Meta has built a new generative ...