
Anthropic released a paper showing that different modals activate the same features. This is the kind of fundamental work I support being done over neural-networks. This might lead to an increase in trust on the use of LLMs and other foundational models.