AI research

Minecraft bot Voyager programs itself using GPT-4

Summary Voyager uses GPT-4 to guide a learning Minecraft agent through the pixel world. Instead of reinforcement learning, Voyager relies on code generation. Researchers from Nvidia, Caltech, UT Austin, Stanford, and ASU introduce Voyager, the first lifelong learning agent that plays Minecraft. Unlike other Minecraft agents that use classic reinforcement learning techniques, for example, Voyager …

Minecraft bot Voyager programs itself using GPT-4 Read More »

Video-ChatGPT analyzes videos and explains why they might be funny

Summary Video-ChatGPT can describe video over time, solving textual tasks such as describing safety risks in a scene, highlighting humorous aspects, or generating matching ad copy. While companies like Runway ML are making strides in converting text to video, Video-ChatGPT goes the other way, giving a language model the ability to analyze video. Video-ChatGPT can …

Video-ChatGPT analyzes videos and explains why they might be funny Read More »

Open-source language models are no match for GPT-4 and co, study says

Summary The progress of open-source language models is undisputed. But can they really compete with the much pricier, heavily trained language models from OpenAI, Google, and others? Sounds too good to be true: With little training effort and almost no money, open-source language models trained using the Alpaca Formula have set new benchmarks recently, reaching …

Open-source language models are no match for GPT-4 and co, study says Read More »

Study says OpenAI’s business model is sound

Summary The progress of open-source language models is undisputed. But can they really compete with the much pricier, heavily trained language models from OpenAI, Google, and others? Sounds too good to be true: With little training effort and almost no money, open-source language models trained using the Alpaca Formula have set new benchmarks recently, reaching …

Study says OpenAI’s business model is sound Read More »

Guanaco is a ChatGPT competitor trained on a single GPU in one day

Summary A new method named QLoRA enables the fine-tuning of large language models on a single GPU. Researchers used it to train Guanaco, a chatbot that reaches 99% of ChatGPTs performance. Researchers at the University of Washington present QLoRA (Quantized Low Rank Adapters), a method for fine-tuning large language models. Along with QLoRA, the team …

Guanaco is a ChatGPT competitor trained on a single GPU in one day Read More »

Meta’s new open source models speak more than 1,100 languages

Summary As part of the Massively Multilingual Speech project, Meta is releasing AI models that can convert spoken language to text and text to speech in 1,100 languages. The new set of models is based on Meta’s wav2vec, as well as a curated dataset of examples for 1,100 languages ​​and another uncurated dataset for nearly …

Meta’s new open source models speak more than 1,100 languages Read More »

“System 2”-inspired method enhances GPT-4’s logic capability

Summary The “Tree of Thoughts” framework combines tree search with GPT-4 to dramatically improve the problem-solving capabilities of the language model. “Tree of Thoughts” is a new framework from researchers at Princeton University and Google DeepMind for inferencing language models like GPT-4, inspired by prompt engineering methods like Chain of Thought. Unlike those, however, ToT …

“System 2”-inspired method enhances GPT-4’s logic capability Read More »

Chatbot Arena helps you find the best open-source chatbot

Summary Until now, there has been no easy way to compare the quality of open-source models. An e-sports-inspired system could help. The Large Model System Organization (LMSYS), which is behind the open-source model Vicuna, has launched the benchmark platform “Chatbot Arena” to compare the performance of large language models. Different models compete against each other …

Chatbot Arena helps you find the best open-source chatbot Read More »

Scroll to Top