Why ChatGPT fails | Language Model Limitations EXPLAINED
How it is possible that ChatGPT makes so many incorrect statements and spits out wrong facts? We explain why ChatGPT isn't a truth oracle.
► Sponsor: Arize AI. Sign up for Arize: https://arize.com/join
Free industry certification in ML observability: https://courses.arize.com/
Learn about embedding drift from Arize research: https://towardsdatascience.com/measuring-embedding-drift-aa9b7ddb84ae
Check out our daily #MachineLearning Quiz Questions: https://www.youtube.com/c/AICoffeeBreak/community
📜 ChatGPT blog: https://openai.com/blog/chatgpt/
📜 Evaluating the Factual Consistency of Large Language Models Through Summarization: https://arxiv.org/abs/2211.08412
📜 WebGPT: Improving the Factual Accuracy of Language Models through Web Browsing: https://openai.com/blog/webgpt/
📜 Behavior cloning is miscalibrated: https://www.alignmentforum.org/posts/BgoKdAzogxmgkuuAt/behavior-cloning-is-miscalibrated
📺 ChatGPT vs. Sparrow: https://youtu.be/SWwQ3k-DWyo
📜 Transformers as Algorithms: Generalization and Stability in In-context Learning: https://arxiv.org/abs/2301.07067
📜 Do Prompt-Based Models Really Understand the Meaning of their Prompts?: https://arxiv.org/abs/2109.01247
Thanks to our Patrons who support us in Tier 2, 3, 4: 🙏
Dres. Trost GbR, Siltax, Edvard Grødem, Vignesh Valliappan, Mutual Information, Mike Ton
➡️ AI Coffee Break Merch! 🛍️ https://aicoffeebreak.creator-spring.com/
Outline:
00:00 ChatGPT spits out wrong facts
02:15 Arize AI (Sponsor)
03:40 How does ChaGPT / a language model work?
05:53 Why ChatGPT generates nonsense
06:38 Confidence and clarifications
07:21 Limits of behavioral cloning
09:04 Phrasing
09:21 Jail breaks
09:45 Is ChatGPT even usable?
▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀
🔥 Optionally, pay us a coffee to help with our Coffee Bean production! ☕
Patreon: https://www.patreon.com/AICoffeeBreak
Ko-fi: https://ko-fi.com/aicoffeebreak
▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀
🔗 Links:
AICoffeeBreakQuiz: https://www.youtube.com/c/AICoffeeBreak/community
Twitter: https://twitter.com/AICoffeeBreak
Reddit: https://www.reddit.com/r/AICoffeeBreak/
YouTube: https://www.youtube.com/AICoffeeBreak
#AICoffeeBreak #MsCoffeeBean #MachineLearning #AI #research
Music 🎵 : Intentions – Anno Domini Beats
Video editing: Nils Trost
社群媒體聆聽
How LlamaIndex Brings Data to LLMs
Jerry Liu is Co-Founder of LlamaIndex. This talk was originally delivered at Arize:Observe 2023, a conference on the intersection of large language models, generative AI, and machine learning observability in the era of LLMops. Get updates from Arize on future events: https://arize.com/community/ Get certified in ML observability: https://courses.arize.com
"Make Agent 10x cheaper, faster & better?" - LLM System Evaluation 101
LLM System Eval 101 - Build better agents Get free HubSpot report of how to land a Job using AI: https://clickhubspot.com/fo2 🔗 Links - Join my community: https://www.skool.com/ai-builder-club/about - Follow me on twitter: https://twitter.com/jasonzhou1993 - Join my AI email list: https://www.ai-jason.com/ - My discord: https://discord.gg/eZXprSaCDE - Langsmith: https://smith.langchain.com/ - Phoenix: https://phoenix.arize.com/ - Arize LLM Evaluation guide: https://arize.com/blog-course/llm-evaluation-the-definitive-guide/ - Web scraping agent video: https://www.youtube.com/watch?v=dSX5eoD4-u4 - Signup for universal web scraper: https://forms.gle/zN9w9UyhMKx59yAE6 ⏱️ Timestamps 0:00 Intro 0:27 Why Eval is important 3:30 LLM as evaluator 5:54 How to build eval system 15:10 Case study - Eval & improve research agent 👋🏻 About Me My name is Jason Zhou, a product designer who shares interesting AI experiments & products. Email me if you need help building AI apps! ask@ai-jason.com #gpt4o #aiagents #rag #llamaparse #llamaindex #gpt5 #autogen #gpt4 #autogpt #ai #artificialintelligence #tutorial #stepbystep #openai #llm #chatgpt #largelanguagemodels #largelanguagemodel #bestaiagent #chatgpt #agentgpt #agent #babyagi #evaluation
Why ChatGPT fails | Language Model Limitations EXPLAINED
How it is possible that ChatGPT makes so many incorrect statements and spits out wrong facts? We explain why ChatGPT isn't a truth oracle. ► Sponsor: Arize AI. Sign up for Arize: https://arize.com/join Free industry certification in ML observability: https://courses.arize.com/ Learn about embedding drift from Arize research: https://towardsdatascience.com/measuring-embedding-drift-aa9b7ddb84ae Check out our daily #MachineLearning Quiz Questions: https://www.youtube.com/c/AICoffeeBreak/community 📜 ChatGPT blog: https://openai.com/blog/chatgpt/ 📜 Evaluating the Factual Consistency of Large Language Models Through Summarization: https://arxiv.org/abs/2211.08412 📜 WebGPT: Improving the Factual Accuracy of Language Models through Web Browsing: https://openai.com/blog/webgpt/ 📜 Behavior cloning is miscalibrated: https://www.alignmentforum.org/posts/BgoKdAzogxmgkuuAt/behavior-cloning-is-miscalibrated 📺 ChatGPT vs. Sparrow: https://youtu.be/SWwQ3k-DWyo 📜 Transformers as Algorithms: Generalization and Stability in In-context Learning: https://arxiv.org/abs/2301.07067 📜 Do Prompt-Based Models Really Understand the Meaning of their Prompts?: https://arxiv.org/abs/2109.01247 Thanks to our Patrons who support us in Tier 2, 3, 4: 🙏 Dres. Trost GbR, Siltax, Edvard Grødem, Vignesh Valliappan, Mutual Information, Mike Ton ➡️ AI Coffee Break Merch! 🛍️ https://aicoffeebreak.creator-spring.com/ Outline: 00:00 ChatGPT spits out wrong facts 02:15 Arize AI (Sponsor) 03:40 How does ChaGPT / a language model work? 05:53 Why ChatGPT generates nonsense 06:38 Confidence and clarifications 07:21 Limits of behavioral cloning 09:04 Phrasing 09:21 Jail breaks 09:45 Is ChatGPT even usable? ▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀ 🔥 Optionally, pay us a coffee to help with our Coffee Bean production! ☕ Patreon: https://www.patreon.com/AICoffeeBreak Ko-fi: https://ko-fi.com/aicoffeebreak ▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀ 🔗 Links: AICoffeeBreakQuiz: https://www.youtube.com/c/AICoffeeBreak/community Twitter: https://twitter.com/AICoffeeBreak Reddit: https://www.reddit.com/r/AICoffeeBreak/ YouTube: https://www.youtube.com/AICoffeeBreak #AICoffeeBreak #MsCoffeeBean #MachineLearning #AI #research Music 🎵 : Intentions – Anno Domini Beats Video editing: Nils Trost
總共有 172 筆社群媒體資料需要解鎖才能查看