200-year-old mathematics
In his research, Søren Hauberg has therefore sought out mathematical formulas that correct for the errors that can occur in data sets during compression.
“As part of our basic research, we’ve found a systematic solution that allows us to theoretically walk backwards so we can keep track of which patterns are grounded in reality and which ones have been fabricated by the compression process. When we are able to separate these, we humans can gain a better understanding of how artificial intelligence works – while also being reassured that the artificial intelligence isn’t listening to false patterns.”
The mathematical formulas utilized by Søren Hauberg and his colleagues are hardly brand new – they were in fact developed in the 19th century for use in cartography.
“When they tried to draw maps, they were seeking to transfer information from a three-dimensional sphere to a two-dimensional surface. This created a number of distortions: for example, the land masses are not in accurate proportion to one another meaning that Greenland appears to be much bigger than Africa. The mathematical formulas that correct for these distortions can also be used in our research examining the black boxes of artificial intelligence,” says Søren Hauberg.
May prevent ChatGPT’s hallucinations
The researchers have now made sufficient progress that they are able to look inside the black boxes of artificial intelligence models that use data compression.
“These models are typically used in research when researchers try to find out whether there are any underlying patterns in the data they are working with. The prevention of incorrect conclusion is directly relevant to the working processes of academia,” says Søren Hauberg.
He adds that their work is still unable to correct errors that are found in artificial intelligence systems such as ChatGPT. However, he notes that the researchers’ work has the potential to do so in future.
“We’d love to be able to explain why a chatbot like ChatGPT hallucinates. We can’t do that yet, but perhaps we will be able to in a couple of years’ time,” says the professor, who received a new EU grant worth EUR 2 million in early 2024 to support his continued research into black boxes.