Press "Enter" to skip to content

This model to the never stops learning

Language of great modern modern The models (LLM) could write beautiful sonnets and elegant code, but they also lack a rudimentary ability to learn from experience.

WhatsApp Group Join Now
Telegram Group Join Now

The researchers of the Massachusetts Institute of Technology (MIT) have now conceived a way for LLM to continue improving by changing their parameters in response to new useful information.

The work is a step towards the construction of artificial intelligence models that continuously learn: a longtime objective of the field and something that will be crucial if the machines must imitate human intelligence more faithful. In the meantime, it could give us chatbots and other artificial intelligence tools that are able to incorporate new information including the interests and preferences of a user.

The MIT scheme, called Auto -Adaption Models (SEAL), provides that an LLM generates its synthetic training data based on the input it receives.

“The initial idea was to explore whether the tokens [units of text fed to LLMs and generated by them] It could cause a powerful update to a model, “says Jyothish even, a doctoral student at the MIT involved in the development of Seal. Even says that the idea was to see if the production of a model could be used to train it.

Adam Zweiger, a MIT university researcher involved in the construction of the seal, adds that although the most recent models can “reason” their way for better solutions by performing a more complex inference, the model itself does not benefit from this long -term reasoning.

On the contrary, sealing new intuitions and therefore folds in its weights or parameters. Given a declaration on the challenges faced by the Space Apollo program, for example, the model generated new steps that try to describe the implications of the declaration. The researchers compared this to the way a human student writes and examines the notes to help their learning.

The system then updated the model using these data and has tested how well the new model is able to answer a series of questions. And finally, this provides a reinforcement learning signal that helps to guide the model towards updates that improve its general skills and that help him carry out learning.

The researchers tested their approach on small and medium -sized versions of two open source models, Meta’s Lama and Alibaba’s Qwen. They say that the approach should also work for much larger border models.

The researchers tested the Seal approach on the text and a point of reference called Arc that measures the ability of an AI model to solve the abstract reasoning problems. In both cases they saw that Seal has allowed the models to continue learning well beyond their initial training.

Pulkit Agrawal, professor to MIT who supervised the job, says that the Seal project touches important themes in artificial intelligence, including how to convince the IA to understand on its own what should try to learn. He says he could be used to help make artificial intelligence models more personalized. “The LLM are powerful but we don’t want their knowledge to stop,” he says.

The seal is not yet a way for the ia to improve indefinitely. First of all, as AGRAWAL observes, the tested LLMs suffer from what is known as “catastrophic forgetfulness”, a worrying effect seen when ingesting new information simply makes an ancient knowledge disappear. This can indicate a fundamental difference between artificial and biological neural networks. Even and Zweigler also note that Seal is computationally intensive and it is not yet clear how to better plan new learning periods. A fun idea, mentions Zweigler, is that, like humans, perhaps LLMS could experience “sleep” periods in which new information is consolidated.

However, despite all its limits, Seal is an exciting new path for further artificial intelligence research and could be something that makes its way into the future models of ai border.

What do you think of the AI ​​who is able to continue learning? Send an and -mail to Hello@wired.com to let me know.

Source link


Discover more from Gautam Kalal

Subscribe to get the latest posts sent to your email.

More from TechnologyMore posts in Technology »

Be First to Comment

Leave a Reply