AI research

Meta’s new open source models speak more than 1,100 languages

Summary As part of the Massively Multilingual Speech project, Meta is releasing AI models that can convert spoken language to text and text to speech in 1,100 languages. The new set of models is based on Meta’s wav2vec, as well as a curated dataset of examples for 1,100 languages and another uncurated dataset for nearly …

“System 2”-inspired method enhances GPT-4’s logic capability

Summary The “Tree of Thoughts” framework combines tree search with GPT-4 to dramatically improve the problem-solving capabilities of the language model. “Tree of Thoughts” (ToT) is a new framework from researchers at Princeton University and Google DeepMind for running inference with language models like GPT-4, inspired by prompt engineering methods like Chain of Thought. Unlike those, however, ToT …

Chatbot Arena helps you find the best open-source chatbot

Summary Until now, there has been no easy way to compare the quality of open-source models. An e-sports-inspired system could help. The Large Model System Organization (LMSYS), which is behind the open-source model Vicuna, has launched the benchmark platform “Chatbot Arena” to compare the performance of large language models. Different models compete against each other …

Google researchers make voice a solid smartphone interface

Summary Until now, AI has had a hard time controlling smartphone interfaces. But Google researchers seem to have found a way. To improve voice-based interaction with mobile user interfaces, researchers at Google Research have been investigating the use of large language models (LLMs). Current mobile intelligent assistants are limited in conversational interactions because they cannot …

HumanRF enables photorealistic 3D avatars

Summary HumanRF brings high-resolution 3D avatars to NeRFs. Behind it is an AI startup for synthetic media. Neural Radiance Fields (NeRFs) learn 3D representations from photos or videos and can render individual objects or entire scenes. Some variants specialize in moving scenes or objects, others experiment with editing capabilities, and others attempt to render people …

Starcoder is a performant open-source model for copyright-compliant code

Summary BigCode, a joint initiative of Hugging Face and ServiceNow, introduces Starcoder and StarcoderBase, two large open-source code language models. The researchers place special emphasis on transparent and copyright-compliant data selection. The 15.5-billion-parameter Starcoder models can generate code in 86 programming languages. In a novel approach, the researchers used a method called “multi-query …

Between dietary advice and surveillance dystopia

Summary DetGPT gives a preview of the AI applications that will be possible with multimodal models in the future – and not just the good ones. At the GPT-4 launch, OpenAI demonstrated some multimodal capabilities, including converting a photographed and scribbled web design into code or the ability to answer questions about images, which is …
