“What's wrong with LLMs and what we should be building instead” - Tom Dietterich - #VSCF2023
Summary
TL;DR: In this discussion, Tom Dietterich reflects on the rapid evolution and challenges of large language models (LLMs). He highlights the importance of open-source initiatives in improving efficiency and adaptability. While LLMs excel at tasks like syntax manipulation, he cautions against using them blindly in high-risk applications, emphasizing the need for verification mechanisms. He explores how combining LLMs with traditional systems, such as code execution or proof assistants, can enhance their reliability. Ultimately, he envisions a future where LLMs play a key role in both creative fields and critical applications, provided they are appropriately integrated and verified.
Takeaways
- 😀 Large language models (LLMs) have impressive abilities, such as reading and ingesting large amounts of web data, but they are fundamentally flawed because they do not separate knowledge components such as factual knowledge, language understanding, and common sense.
- 🧠 Current LLMs are not equipped with episodic memory or situation modeling, which means they cannot remember past events or build a coherent understanding of ongoing conversations or narratives.
- 🔄 A modular approach to AI systems is proposed, in which components for language, factual knowledge, memory, and reasoning are built separately and then integrated, overcoming the limitations of monolithic LLMs.
- 💡 Knowledge graphs are an important tool for representing factual knowledge; new facts should be added to these graphs as they appear in conversations or documents (see the sketch after this list).
- 🤖 The current architecture of LLMs treats knowledge as a statistical model rather than a knowledge base, making it difficult to ensure accuracy and correctness in outputs.
- 📈 A key challenge for AI systems is improving truthfulness: models do not inherently understand what constitutes correct information, nor can they provide sound justifications for their answers.
- 🔍 The proposal suggests that LLMs could output both answers and arguments, providing justifications for their conclusions to ensure that reasoning is transparent and verifiable.
- 🧳 Current LLMs struggle with epistemic uncertainty (lack of knowledge) and instead treat all uncertainty as randomness (aleatoric uncertainty), leading to overconfidence in their answers.
- 📚 Hybrid systems, combining LLMs with traditional methods like planning systems or proof assistants, can improve reliability by checking the output of LLMs against other trusted tools.
- 💬 For high-risk applications (e.g., autonomous vehicles or high-security software), it's essential to verify LLM outputs before accepting them, while more creative tasks can tolerate higher levels of uncertainty and error.
- 🌍 There is a strong push for open-source collaboration to address the challenges faced by LLMs. By releasing models to the public, small companies and academic researchers can experiment and drive progress in AI development.
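To make the knowledge-graph takeaway concrete, here is a minimal sketch of a triple store to which new facts are appended as they appear. The class and example facts are illustrative, not from the talk; a real system would populate it with an extraction model.

```python
# Minimal in-memory knowledge graph: facts are (subject, relation, object) triples.
from collections import defaultdict

class KnowledgeGraph:
    def __init__(self):
        # subject -> relation -> set of objects
        self.facts = defaultdict(lambda: defaultdict(set))

    def add_fact(self, subject: str, relation: str, obj: str) -> None:
        """Insert a new triple as it is extracted from a conversation or document."""
        self.facts[subject][relation].add(obj)

    def query(self, subject: str, relation: str) -> set:
        """Return everything known for (subject, relation); empty set if nothing."""
        return self.facts[subject][relation]

kg = KnowledgeGraph()
kg.add_fact("Tom Dietterich", "spoke_at", "VSCF2023")
kg.add_fact("VSCF2023", "topic", "large language models")
print(kg.query("Tom Dietterich", "spoke_at"))  # {'VSCF2023'}
```

Keeping stored facts outside the language model in this way is what allows them to be updated or corrected without retraining.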
Q & A
What role has open-source development played in improving large language models (LLMs)?
-Open-source development has been crucial in driving advancements in LLMs. It has spurred innovation from academics, hobbyists, and small companies, leading to improvements in speed, efficiency, and ease of updates, which have accelerated progress in LLM technology.
Why is there a need for a strong open-source push for large language models?
-A strong open-source push is needed to tackle the various challenges faced by LLMs. Open-source efforts allow for broader collaboration, rapid iteration, and solutions to efficiency, accuracy, and updating problems, which are central to improving the capabilities of LLMs.
How can prompt engineering help overcome the limitations of LLMs?
-Prompt engineering can guide LLMs to generate more accurate and relevant outputs by designing specific instructions. It can also be used in combination with verification mechanisms to check whether the answers provided by LLMs are correct, ensuring more reliable results in various applications.
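As a hedged illustration of pairing a prompt with a mechanical check, the sketch below constrains the output format and rejects any response that fails to parse. `call_llm` is a hypothetical placeholder for a real model client.

```python
import json

PROMPT = (
    "Extract the capital city mentioned in the text below. "
    'Respond with JSON of the form {{"capital": "<city>"}} and nothing else.\n\n'
    "Text: {text}"
)

def call_llm(prompt: str) -> str:
    """Hypothetical stand-in for a real model client; returns a canned reply here."""
    return '{"capital": "Paris"}'

def extract_capital(text: str, retries: int = 3) -> str:
    for _ in range(retries):
        raw = call_llm(PROMPT.format(text=text))
        try:
            parsed = json.loads(raw)        # check 1: output is valid JSON
            capital = parsed["capital"]     # check 2: required key is present
            if isinstance(capital, str) and capital:
                return capital
        except (json.JSONDecodeError, KeyError, TypeError):
            pass                            # reject and re-prompt on any failure
    raise ValueError("no verifiable answer after retries")

print(extract_capital("Paris is the capital of France."))  # Paris
```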
What is the importance of combining LLMs with traditional systems like planners and proof assistants?
-Combining LLMs with traditional systems helps ensure the correctness of the LLM's output. For example, planners can check whether a generated plan is feasible, and proof assistants can help verify the correctness of software code, enhancing the reliability of AI applications in high-stakes areas.
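A minimal sketch of the planner idea, assuming a toy domain of actions with preconditions and effects (the domain is invented for illustration): a plan proposed by an LLM is simulated step by step and rejected if any precondition fails.

```python
# Toy STRIPS-style domain: each action has preconditions, add effects, delete effects.
ACTIONS = {
    "pick_up":  {"pre": {"hand_empty"}, "add": {"holding"},    "delete": {"hand_empty"}},
    "put_down": {"pre": {"holding"},    "add": {"hand_empty"}, "delete": {"holding"}},
}

def plan_is_feasible(plan, initial_state):
    """Simulate the plan; True iff every action's preconditions hold when reached."""
    state = set(initial_state)
    for name in plan:
        action = ACTIONS.get(name)
        if action is None or not action["pre"] <= state:
            return False                  # unknown action or unmet precondition
        state = (state - action["delete"]) | action["add"]
    return True

print(plan_is_feasible(["pick_up", "put_down"], {"hand_empty"}))  # True
print(plan_is_feasible(["put_down"], {"hand_empty"}))             # False
```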
Can LLMs be used for code generation, and if so, how can their accuracy be verified?
-Yes, LLMs can generate code, but their accuracy can be verified by executing the generated code to check if it produces the correct results. Additionally, running program analysis over the code can also help identify errors or inaccuracies.
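A minimal sketch of execute-to-verify: a generated function is accepted only if it passes reference tests. The candidate string stands in for model output; real systems should run untrusted code in a sandboxed subprocess rather than with exec.

```python
CANDIDATE = """
def add(a, b):
    return a + b
"""  # stand-in for model-generated code

TESTS = [((2, 3), 5), ((-1, 1), 0)]

def passes_tests(source: str) -> bool:
    namespace = {}
    try:
        exec(source, namespace)    # load the generated definition (unsafe outside a sandbox)
        fn = namespace["add"]
        return all(fn(*args) == expected for args, expected in TESTS)
    except Exception:
        return False               # any crash or wrong answer counts as failure

print(passes_tests(CANDIDATE))  # True
```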
What applications are LLMs particularly well-suited for, according to the transcript?
-LLMs are particularly effective for tasks involving language transformation (e.g., translating between formats or languages), syntactic tasks (e.g., converting JSON to CSV), and creative applications like writing assistance, where errors or stochastic outcomes are more acceptable.
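For syntactic transformations like JSON to CSV, the output can be checked mechanically by parsing it back and comparing against the source, as in this sketch (the llm_output value is illustrative, not real model output):

```python
import csv
import io
import json

source_json = '[{"name": "Ada", "year": "1815"}, {"name": "Alan", "year": "1912"}]'
llm_output = "name,year\nAda,1815\nAlan,1912\n"   # stand-in for a model's CSV answer

def csv_matches_json(json_text: str, csv_text: str) -> bool:
    """Round-trip check: the CSV rows must reproduce the JSON records exactly."""
    records = json.loads(json_text)
    rows = list(csv.DictReader(io.StringIO(csv_text)))
    return rows == records

print(csv_matches_json(source_json, llm_output))  # True
```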
How can LLMs be beneficial for creative writing and entertainment?
-LLMs can assist in creative writing by improving fluency, helping with language translation, and making scientific papers more accessible. They also enable more efficient content generation in entertainment, offering tools for writers to enhance their productivity and creativity.
What is the concern when using LLMs in high-risk applications like autonomous driving?
-The main concern with using LLMs in high-risk applications, such as autonomous driving, is the potential for errors. In these cases, it’s crucial to verify the LLM's interpretation and ensure it correctly understands the input, as trusting LLMs without verification can lead to catastrophic consequences.
Why is verification of LLM outputs critical in high-risk sectors?
-Verification is essential in high-risk sectors because errors in LLM outputs could result in severe consequences. For example, in autonomous vehicles, misinterpreting commands could lead to accidents. Therefore, mechanisms are needed to check the correctness of LLM-generated outputs before they are implemented.
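One way to structure that requirement is a gate that acts on an LLM output only after an independent verifier approves it, and escalates otherwise. This is a sketch of the pattern with placeholder callables, not a production design.

```python
from typing import Callable

def guarded_execute(output: str,
                    verifier: Callable[[str], bool],
                    execute: Callable[[str], None],
                    escalate: Callable[[str], None]) -> None:
    """Act on verified outputs; route everything else to human review."""
    if verifier(output):
        execute(output)       # verified: safe to act on
    else:
        escalate(output)      # unverified: never act, hand off to a person

# Trivial stand-ins to show the wiring:
guarded_execute(
    "reduce speed to 30 km/h",
    verifier=lambda cmd: cmd.endswith("km/h"),   # placeholder check, not a real validator
    execute=lambda cmd: print("executing:", cmd),
    escalate=lambda cmd: print("escalating:", cmd),
)
```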
What advancements were made in program verification for high-security software, as mentioned in the transcript?
-Recent advancements in program verification involve integrating LLMs with proof assistants. These systems can automate the process of generating proofs for software correctness, ensuring that the code meets high security and reliability standards, particularly for high-risk applications.
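For a sense of what a proof assistant checks, here is a trivially small Lean 4 example (not from the talk): the system accepts a theorem only when the supplied proof term type-checks.

```lean
-- `rfl` proves a definitional equality; the checker rejects anything unproven.
example : 2 + 2 = 4 := rfl

-- Reusing a library lemma as the proof of a stated theorem.
theorem my_add_comm (a b : Nat) : a + b = b + a :=
  Nat.add_comm a b
```

Integrating an LLM here would mean having the model propose proof terms or tactics, with the proof assistant's kernel as the final arbiter of correctness.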