This Day in AI Podcast is a podcast all about AI.
Community: https://thisdayinai.com
Show Notes: https://thisdayinai.com/bookmarks/52-ep60
SimTheory with Groq Llama3: https://simtheory.ai
Thanks for listening!
Llama3 Tunes Mentioned on Show:
https://huggingface.co/Orenguteng/Lexi-Llama-3-8B-Uncensored
https://huggingface.co/sherazkhan/Mixllama3-8x8b-Instruct-v0.1
https://huggingface.co/mattshumer/Llama-3-8B-16K
https://huggingface.co/McGill-NLP/Llama-3-8B-Web
CHAPTERS:
=====
00:00 - Rabit r1 Launch Party & Can LAMs Be Useful?
13:40 - Microsoft's Phi-3 Impressions, Use Cases & Will It Kill Someone?
32:50 - Llama3, Gemini 1.5 API Closing in on GPT-4 & Llama3 on Groq
40:07 - A Week Later: SO Many Llama3 Fine Tunes and 16K Context
43:50 - Hume AI Releases AI EVI API: Empathic Voice Interface (and Lie Detector Test)
52:11 - Meta Has Put Llama 3 Everywhere with Meta AI. What is the point?
Show Notes: https://thisdayinai.com/bookmarks/51-ep59
SimTheory: https://simtheory.ai
This Day in AI Community: https://thisdayinai.com
CHAPTERS:
======
00:00 - Meta Llama 3: Chris's Cheese Song & Zuck's Silver Chain
04:07 - Everything Meta Announced with Llama 3: 7B & 40B Model with 400B coming soon
21:31 - Is Groq The Ideal API Host for Llama3?
28:44 - Llama 3 Being Made Available via Meta Apps to 3B Users with Meta AI in Instagram, Whatsapp and via Web
38:01 - Llama 3 Licensing Must Include "Llama 3"
40:52 - Llama 3 400B Model Benchmarks While Still in Training & Potential Unlimited Context? & You Can Eat Llama
1:01:51 - OpenAI Assistants API v2 & Is Tooling Important to Win Devs? Google Gemini's Mistakes
1:15:24 - Conor Update: Using VASA-1 To Deep Fake a Record Label
1:23:07 - SimTheory update: what's next from SimTheory
AI News: https://thisdayinai.com
SimTheory: https://simtheory.ai
Show Notes: https://thisdayinai.com/bookmarks/48-ep58
-------
CHAPTERS:
00:00 - Udio, Udio Examples
10:45 - Will a Record Label Sign an AI Udio Artist?
19:09 - 3 Major LLM Updates/Release in a Single Day
22:58 - Google Gemini 1.5 Pro General Availability, Audio Modality & Impressions
30:20 - Google Cloud Next 2024 AI Announcements Discussion
47:18 - OpenAI Announces "improvements" to GPT-4 Turbo, GPT-4 Turbo Official Release & Vision API JSON & Function Calling
57:35 - Mistral Posts BitTorrent To New Open Source Model Mixtral-8-22B
1:03:00 - Humane's AI Pin Reviews are out... and they aren't great.
Special thanks to AI artist Conor for the great content!
Thanks for listening.
AI News & Discord: https://thisdayinai.com
Try AI on SimTheory: https://simtheory.ai
Show Notes: https://thisdayinai.com/bookmarks/46-ep57
------
CHAPTERS:
00:00 - Mike's Meta Ray Band AI Glasses With No AI
03:52 - OpenAI's Voice Engine & Voice Cloning Safety
14:03 - ChatGPT Now Has Inpainting & Comparison to BrushNet by TencentARC
19:44 - Is There a Business Model for AI Right Now? Is Gary Marcus Right?
44:31 - Cohere's Command R+ Model & Tooling
58:20 - Grok-1.5 & Grok Improving X/Twitter
Thanks for listening and supporting the show.
Show notes: https://thisdayinai.com/bookmarks/45-ep56
Try Gemini 1.5 Pro on SimTheory: https://simtheory.ai/agent/865-google-gemini-15-your-ultimate-assistant
Try Gemini Ultra on SimTheory: https://simtheory.ai/agent/866-google-gemini-ultra-the-apex-of-ai-conversation
Join our community: https://thisdayinai.com
CHAPTERS
=====
00:00 - Fun with Suno v3
10:38 - We Have Google Gemini 1.5 Pro API, Google Ultra API Access!
26:21 - Claude Opus is the King According to LMSYS Chatbot Arena Leaderboard
38:25 - The Sink Sub Coding Challenge with Opus, Gemini 1.5 Pro and Gemini Ultra + Building Salesforce CRM with AI
50:06 - Amazon Invest More Billions in Anthropic
53:03 - Hume AI: Empathic AI Voice & Vision Understanding
1:01:06 - Inflection AI Absorbed into Microsoft, Microsoft is below, above and around all top AI labs.
1:09:28 - Does AI Help Students Learn? Maybe Not?
1:17:37 - Stable Code Instruct 3B, a good local coding model?
1:23:12 - Our AI Songs in Full!
Thanks for listening, please consider subbing, liking, commenting - we love hearing from you.
Show Notes: https://thisdayinai.com/bookmarks/42-ep55
SimTheory Claude Haiku Agent: https://simtheory.ai/agent/795-claude-haiku-chatbot
Sign up for daily AI news: https://thisdayinai.com
====
CHAPTERS
00:00 - OpenAI CTO Mira Murati Sora Interview Train Wreck
16:47 - EU Passes the AI Act
24:25 - 1 year since Greg Brockman Unveiled GPT-4 + Cognition's Devin
52:34 - Anthropic Releases Claude 3: Haiku & It's REALLY GOOD!
1:05:20 - DeepSeek-7B Real World Vision Language Understanding
1:16:09 - It's all about the training data, why Tesla might win Robotics & Vision
1:17:27 - Figure1 Robot with OpenAI for Vision and Language + Discussion on Robot Slavery
====
Please consider subscribing if you like the podcast! Thanks for listening.
Join SimTheory: https://simtheory.ai
Try Claude Opus: https://simtheory.ai/agent/689-claude-opus-your-conversational-companion
Subscribe to This Day in AI Daily News: https://thisdayinai.com
Show Notes: https://thisdayinai.com/bookmarks/41-ep54
Seinfeld Trivia Results: https://docs.google.com/spreadsheets/d/1crRzGE_JbQCIR5dEW_ORAq1QA9Yr8qquonZLILQRUpE/edit#gid=0
====
This week we cover Anthropic's impressive Claude 3 Opus, Sonnet and Haiku releases and play with Google's Gemini 1.5 1M Context using all the Seinfeld episodes ever written. We reluctantly recap and discuss the latest OpenAI drama, the Elon Musk lawsuit and finally cover Inflection's Inflection 2.5 release now available on Pi.
If you like the show sub, like, comment to feed the YouTube gods for us. xo.
CHAPTERS:
====
00:00 - Anthropic Claude 3
36:05 - Is The Future of Programming LLM Function Abstraction?
47:13 - Google Gemini 1.5 1M Context Experiments
1:08:38 - If You Had AGI Tomorrow What Would You Do?
1:12:13 - OpenAI's DramaAI & Elon Musk Lawsuit
1:29:38 - Inflection 2.5 Release on Pi
Show notes: https://thisdayinai.com/bookmarks/39-ep53
Join SimTheory: https://simtheory.ai
Try Mistral Large on SimTheory: https://simtheory.ai/agent/645-mistral-large
Join our community: https://thisdayinai.com
====
This week we talk about the release of Mistral's Large model, Mistral Le Chat, and their deal with Microsoft Azure. We cover papers on Emote Portrait Alive, AI Lip Reading and Cover the Gemini Pile On and how it is distracting from Gemini and the 1M context size break through. We cover the great "data sale" of both Reddit, Tumblr and Stackoverflow data and discuss the Forecasting with LLM paper from Berkeley. We also cover Klarna's 700 support agent replacing AI agents and ask... is Sydney Back with GPT-4.5?
====
CHAPTERS:
00:00 - Cold open
00:44 - A Tough Week for AI Influencers
02:29 - Mistral Large, Mistral Le Chat & Microsoft Azure Partnership
30:31 - EMO: Emote Portrait Alive
36:26 - VSP-LLM: Visual Speech Processing incorporated with LLMs. AI Lip reading tech.
40:06 - The Google Gemini Pile On / Backlash: Is it taking attention away from 1M context breakthrough?
55:25 - The Great AI Training Data Sale: Reddit, Tumblr, Stackoverflow
1:00:34 - Forecasting with LLMs Paper: Can AI Predict The Future?
1:10:15 - Klarna Says They Replace 700 Humans with AI
1:18:07 - Is Microsoft's CoPilot Update Really GPT-4.5?
====
If you like the podcast please consider subscribing, comment, liking and all the things required to feed the YouTube overlords.
Show notes: https://thisdayinai.com/bookmarks/32-ep52
Groq Mixtral: https://simtheory.ai/agent/567-groq-mixtral-edition
Groq Llama: https://simtheory.ai/agent/566-groq-the-speed-oriented-chat-companion
SimTheory: https://simtheory.ai
====
This week we discuss Groq's LPU Chips and the implications of low cost low latency LLMs on custom hardware. We revisit our prank calling to see if Groq's low latency gives an advantage and see if we can improve Air Canada's chatbot. We discuss the launch of Google's Open Source Gamma 7B release and Magic's $148M fundraise for an AI co-worker who can reason. We also cover ChatGPT losing it's mind during the week.
If you like the show, please consider subscribing. Thanks for listening.
====
Chapters:
00:00 - Groq, Groq API and Retell with Groq
32:48 - Google Gemma 7B Open Source Model
39:04 - The 'Magic' Breakthrough on Reasoning and Context
50:19 - Sounds for OpenAI Sora Thanks to ElevenLabs Sound FX
51:59 - ChatGPT Goes Haywire
Show Notes: https://thisdayinai.com/bookmarks/28-ep51/
Sign up for daily This Day in AI: https://thisdayinai.com
Try Stable Cascade: https://simtheory.ai/agent/508-stable-cascade
Join SimTheory: https://simtheory.ai
======
This week we take several shots of vodka before trying to make sense of all the announcements. OpenAI attempted to trump Google's Gemini 1.5 with the announcement of Sora, 1 minute video generation that does an incredible job of keeping track of objects. Google showed us that up to 10M context windows are possible with multi-modal inputs. We discuss if a larger context window could end the need for RAG and take a first look at GraphRAG by Microsoft hoping to improve RAG with a knowledge graph. We road test Nvidia's ChatRTX on our baller graphics cards and Chris tries to delete all of his files using Microsoft UFO, a new open source project that uses GPT-4 vision to navigate and execute tasks on your Windows PC. We cover briefly V-JEPA (will try for next weeks show) and it's ability to learn through watching videos and listening, and finally discuss Stability's Stable Cascade which we've made available for "research" on SimTheory.
If you like the show please consider subscribing and leaving a comment. We appreciate your support.
======
Chapters:
00:00 - OpenAI's Sora That Creates Videos Instantly From Text
13:49 - ChatGPT Memory Released in Limited Preview
23:31 - OpenAI Rumored To Be Building Web Search, Andrej Karpathy Leaves OpenAI, Have OpenAI Slowed Down?
33:04 - Google Announces Gemini Pro 1.5. Huge Breakthrough 10M Context Window!
50:11 - Microsoft Research Publishes GraphRAG: Knowledge Graph Based RAG
1:02:03 - Nvidia's ChatRTX Road Tested
1:07:18 - AI Computers, AI PCs & Microsoft's UFO: An Agent for Window OS Interaction. Risk of AI Computers.
1:18:46 - Meta's V-JEPA: new architecture for self-supervised learning
1:24:26 - Stability AI's Stable Cascade
Subscribe to ThisDayInAI: https://thisdayinai.com
Try AI Agents on SimTheory: https://simtheory.ai
Show notes: https://thisdayinai.com/bookmarks/6-ep50
Tell us your thoughts on Gemini here: https://thisdayinai.com/post/62-your-thoughts-gemini-advanced/
Thanks to everyone for all your support and kind reviews to reach 50 episodes! Please consider leaving us a review wherever you get your podcasts.
=====
This week we cover the launch of Google Gemini Advanced, Gemini Ultra 1.0 and Bard being Renamed to Gemini. We compare GPT-4, Gemini Ultra 1.0 and Qwen 1.5 72B by sports betting $1000 on horse racing.
We celebrate 50 episodes and share our excited for Qwen 1.5 72B's performance at coding and quick refusals. We cover new releases including SyncLabs and Retell AI and Apple's Open Source Guiding Instruction-based Image Editing via Multimodal Large Language Models.
Finally, we discuss GOODY-2 and it's high refusal rate.
=====
CHAPTERS:
00:00 - Betting $1,000 To Compare Gemini Ultra 1.0 to GPT-4 to Qwen 1.5
07:33 - Google Gemini Advanced, Ultra: Details of Announcement and First Impressions
25:48 - OpenAI is Developing Agents to Control Your Devices
27:40 - Celebrating 50 Episodes of This Day in AI
30:34 - Qwen 1.5 72B: We're Impressed!
42:47 - SyncLabs: Tested & Impressions
47:58 - Retell AI: Tested & Impressions
54:18 - Apple's Open Source Guiding Instruction-based Image Editing via Multimodal Large Language Models
58:10 - GOODY-2: The World's Most Responsible AI Model
Your feedback is valuable to us. Should you encounter any bugs, glitches, lack of functionality or other problems, please email us on [email protected] or join Moon.FM Telegram Group where you can talk directly to the dev team who are happy to answer any queries.