This Day in AI Podcast

Michael Sharkey

This Day in AI Podcast is a podcast all about AI.

  • 1 hour 1 minute
    EP60: Rabbit r1 Launch Party, LAMs, Microsoft's Phi-3, Hume AI EVI API, Llama3 Updates & Groq Speed

    Community: https://thisdayinai.com
    Show Notes: https://thisdayinai.com/bookmarks/52-ep60
    SimTheory with Groq Llama3: https://simtheory.ai

    Thanks for listening!

    Llama3 Tunes Mentioned on Show:
    https://huggingface.co/Orenguteng/Lexi-Llama-3-8B-Uncensored
    https://huggingface.co/sherazkhan/Mixllama3-8x8b-Instruct-v0.1
    https://huggingface.co/mattshumer/Llama-3-8B-16K
    https://huggingface.co/McGill-NLP/Llama-3-8B-Web

    CHAPTERS:
    =====
    00:00 - Rabbit r1 Launch Party & Can LAMs Be Useful?
    13:40 - Microsoft's Phi-3 Impressions, Use Cases & Will It Kill Someone?
    32:50 - Llama3, Gemini 1.5 API Closing in on GPT-4 & Llama3 on Groq
    40:07 - A Week Later: SO Many Llama3 Fine Tunes and 16K Context 
    43:50 - Hume AI Releases AI EVI API: Empathic Voice Interface (and Lie Detector Test)
    52:11 - Meta Has Put Llama 3 Everywhere with Meta AI. What is the point?

    24 April 2024, 6:44 am
  • 1 hour 26 minutes
    EP59: Unhinged Meta Llama 3 *Special Edition*

    Show Notes: https://thisdayinai.com/bookmarks/51-ep59
    SimTheory: https://simtheory.ai
    This Day in AI Community: https://thisdayinai.com

    CHAPTERS:
    ======
    00:00 - Meta Llama 3: Chris's Cheese Song & Zuck's Silver Chain
    04:07 - Everything Meta Announced with Llama 3: 8B & 70B Models with 400B coming soon
    21:31 - Is Groq The Ideal API Host for Llama3?
    28:44 - Llama 3 Being Made Available via Meta Apps to 3B Users with Meta AI in Instagram, Whatsapp and via Web
    38:01 - Llama 3 Licensing Must Include "Llama 3" 
    40:52 - Llama 3 400B Model Benchmarks While Still in Training & Potential Unlimited Context? & You Can Eat Llama
    1:01:51 - OpenAI Assistants API v2 & Is Tooling Important to Win Devs? Google Gemini's Mistakes
    1:15:24 - Conor Update: Using VASA-1 To Deep Fake a Record Label
    1:23:07 - SimTheory update: what's next from SimTheory

    19 April 2024, 3:35 am
  • 1 hour 9 minutes
    EP58: We Convinced a Record Label to Sign an AI Artist + Udio AI Music, Gemini 1.5 Pro, GPT-4 TURBO, Mixtral

    AI News: https://thisdayinai.com
    SimTheory: https://simtheory.ai
    Show Notes: https://thisdayinai.com/bookmarks/48-ep58
    -------

    CHAPTERS:
    00:00 - Udio, Udio Examples
    10:45 - Will a Record Label Sign an AI Udio Artist?
    19:09 - 3 Major LLM Updates/Releases in a Single Day
    22:58 - Google Gemini 1.5 Pro General Availability, Audio Modality & Impressions
    30:20 - Google Cloud Next 2024 AI Announcements Discussion
    47:18 - OpenAI Announces "improvements" to GPT-4 Turbo, GPT-4 Turbo Official Release & Vision API JSON & Function Calling
    57:35 - Mistral Posts BitTorrent Link to New Open Source Model Mixtral-8x22B
    1:03:00 - Humane's AI Pin Reviews are out... and they aren't great.

    Special thanks to AI artist Conor for the great content!

    Thanks for listening.

    12 April 2024, 2:29 am
  • 1 hour 9 minutes
    EP57: Is Gary Right? VoiceEngine, Cohere Command R+, Stable Audio 2, Grok 1.5

    AI News & Discord: https://thisdayinai.com
    Try AI on SimTheory: https://simtheory.ai
    Show Notes: https://thisdayinai.com/bookmarks/46-ep57
    ------
    CHAPTERS:
    00:00 - Mike's Meta Ray-Ban AI Glasses With No AI
    03:52 - OpenAI's Voice Engine & Voice Cloning Safety
    14:03 - ChatGPT Now Has Inpainting & Comparison to BrushNet by TencentARC
    19:44 - Is There a Business Model for AI Right Now? Is Gary Marcus Right?
    44:31 - Cohere's Command R+ Model & Tooling
    58:20 - Grok-1.5 & Grok Improving X/Twitter

    Thanks for listening and supporting the show.

    5 April 2024, 2:19 am
  • 1 hour 26 minutes
    EP56: We Wrote a Song! Claude Opus is 👑, Gemini 1.5 Pro & Ultra API Experiments

    Show notes: https://thisdayinai.com/bookmarks/45-ep56
    Try Gemini 1.5 Pro on SimTheory: https://simtheory.ai/agent/865-google-gemini-15-your-ultimate-assistant
    Try Gemini Ultra on SimTheory: https://simtheory.ai/agent/866-google-gemini-ultra-the-apex-of-ai-conversation
    Join our community: https://thisdayinai.com

    CHAPTERS
    =====
    00:00 - Fun with Suno v3
    10:38 - We Have Google Gemini 1.5 Pro API, Google Ultra API Access!
    26:21 - Claude Opus is the King According to LMSYS Chatbot Arena Leaderboard
    38:25 - The Sink Sub Coding Challenge with Opus, Gemini 1.5 Pro and Gemini Ultra + Building Salesforce CRM with AI
    50:06 - Amazon Invests More Billions in Anthropic
    53:03 - Hume AI: Empathic AI Voice & Vision Understanding
    1:01:06 - Inflection AI Absorbed into Microsoft, Microsoft is below, above and around all top AI labs.
    1:09:28 - Does AI Help Students Learn? Maybe Not?
    1:17:37 - Stable Code Instruct 3B, a good local coding model?
    1:23:12 - Our AI Songs in Full! 

    Thanks for listening, please consider subbing, liking, commenting - we love hearing from you.

    28 March 2024, 2:23 am
  • 1 hour 29 minutes
    EP55: Will Devin Take Our Jobs? Sora Interview, Claude Haiku, DeepSeek 7B, Figure1 & Robot Slavery

    Show Notes: https://thisdayinai.com/bookmarks/42-ep55
    SimTheory Claude Haiku Agent: https://simtheory.ai/agent/795-claude-haiku-chatbot
    Sign up for daily AI news: https://thisdayinai.com

    ====
    CHAPTERS
    00:00 - OpenAI CTO Mira Murati Sora Interview Train Wreck
    16:47 - EU Passes the AI Act
    24:25 - 1 year since Greg Brockman Unveiled GPT-4 + Cognition's Devin
    52:34 - Anthropic Releases Claude 3: Haiku & It's REALLY GOOD!
    1:05:20 - DeepSeek-7B Real World Vision Language Understanding
    1:16:09 - It's all about the training data, why Tesla might win Robotics & Vision
    1:17:27 - Figure1 Robot with OpenAI for Vision and Language + Discussion on Robot Slavery
    ====

    Please consider subscribing if you like the podcast! Thanks for listening.

    15 March 2024, 1:42 am
  • 1 hour 36 minutes
    EP54: Claude 3, Gemini 1.5 1M Context Seinfeld Experiment, OpenAI's DramaAI and Inflection 2.5

    Join SimTheory: https://simtheory.ai
    Try Claude Opus: https://simtheory.ai/agent/689-claude-opus-your-conversational-companion
    Subscribe to This Day in AI Daily News: https://thisdayinai.com
    Show Notes: https://thisdayinai.com/bookmarks/41-ep54
    Seinfeld Trivia Results: https://docs.google.com/spreadsheets/d/1crRzGE_JbQCIR5dEW_ORAq1QA9Yr8qquonZLILQRUpE/edit#gid=0

    ====
    This week we cover Anthropic's impressive Claude 3 Opus, Sonnet and Haiku releases and play with Google's Gemini 1.5 1M Context using all the Seinfeld episodes ever written. We reluctantly recap and discuss the latest OpenAI drama, the Elon Musk lawsuit and finally cover Inflection's Inflection 2.5 release now available on Pi.

    If you like the show sub, like, comment to feed the YouTube gods for us. xo.

    CHAPTERS:
    ====
    00:00 - Anthropic Claude 3
    36:05 - Is The Future of Programming LLM Function Abstraction?
    47:13 - Google Gemini 1.5 1M Context Experiments
    1:08:38 - If You Had AGI Tomorrow What Would You Do?
    1:12:13 - OpenAI's DramaAI & Elon Musk Lawsuit
    1:29:38 - Inflection 2.5 Release on Pi

    8 March 2024, 2:34 am
  • 1 hour 23 minutes
    EP53: Mistral Large, Forecasting with LLMs, The Gemini Pile On & Is CoPilot Using GPT-4.5?

    Show notes: https://thisdayinai.com/bookmarks/39-ep53
    Join SimTheory: https://simtheory.ai
    Try Mistral Large on SimTheory: https://simtheory.ai/agent/645-mistral-large
    Join our community: https://thisdayinai.com
    ====

    This week we talk about the release of Mistral's Large model, Mistral Le Chat, and their deal with Microsoft Azure. We cover papers on Emote Portrait Alive and AI lip reading, then cover the Gemini pile-on and how it is distracting from Gemini and the 1M context size breakthrough. We cover the great "data sale" of Reddit, Tumblr and Stackoverflow data and discuss the Forecasting with LLMs paper from Berkeley. We also cover Klarna's AI agents replacing 700 support agents and ask... is Sydney back with GPT-4.5?

    ====

    CHAPTERS:
    00:00 - Cold open
    00:44 - A Tough Week for AI Influencers
    02:29 - Mistral Large, Mistral Le Chat & Microsoft Azure Partnership
    30:31 - EMO: Emote Portrait Alive
    36:26 - VSP-LLM: Visual Speech Processing incorporated with LLMs. AI Lip reading tech.
    40:06 - The Google Gemini Pile On / Backlash: Is it taking attention away from 1M context breakthrough?
    55:25 - The Great AI Training Data Sale: Reddit, Tumblr, Stackoverflow
    1:00:34 - Forecasting with LLMs Paper: Can AI Predict The Future?
    1:10:15 - Klarna Says They Replace 700 Humans with AI
    1:18:07 - Is Microsoft's CoPilot Update Really GPT-4.5?

    ====

    If you like the podcast, please consider subscribing, commenting, liking and all the things required to feed the YouTube overlords.

    1 March 2024, 1:51 am
  • 55 minutes
    EP52: The Groq Breakthrough, Google's Gemma 7B, Unlimited Context, Can 'Magic' Reason?

    Show notes: https://thisdayinai.com/bookmarks/32-ep52
    Groq Mixtral: https://simtheory.ai/agent/567-groq-mixtral-edition
    Groq Llama: https://simtheory.ai/agent/566-groq-the-speed-oriented-chat-companion
    SimTheory: https://simtheory.ai
    ====
    This week we discuss Groq's LPU chips and the implications of low-cost, low-latency LLMs on custom hardware. We revisit our prank calling to see if Groq's low latency gives an advantage and see if we can improve Air Canada's chatbot. We discuss the launch of Google's open source Gemma 7B release and Magic's $148M fundraise for an AI co-worker who can reason. We also cover ChatGPT losing its mind during the week.

    If you like the show, please consider subscribing. Thanks for listening.

    ====
    Chapters:
    00:00 - Groq, Groq API and Retell with Groq
    32:48 - Google Gemma 7B Open Source Model
    39:04 - The 'Magic' Breakthrough on Reasoning and Context
    50:19 - Sounds for OpenAI Sora Thanks to ElevenLabs Sound FX
    51:59 - ChatGPT Goes Haywire

    22 February 2024, 9:08 am
  • 1 hour 29 minutes
    EP51: OpenAI's Sora, Gemini Pro 1.5 10M Context, ChatGPT Memory, GraphRAG, ChatRTX, Microsoft UFO...

    Show Notes: https://thisdayinai.com/bookmarks/28-ep51/
    Sign up for daily This Day in AI: https://thisdayinai.com
    Try Stable Cascade: https://simtheory.ai/agent/508-stable-cascade
    Join SimTheory: https://simtheory.ai
    ======

    This week we take several shots of vodka before trying to make sense of all the announcements. OpenAI attempted to trump Google's Gemini 1.5 with the announcement of Sora, 1 minute video generation that does an incredible job of keeping track of objects. Google showed us that up to 10M context windows are possible with multi-modal inputs. We discuss if a larger context window could end the need for RAG and take a first look at GraphRAG by Microsoft, which hopes to improve RAG with a knowledge graph. We road test Nvidia's ChatRTX on our baller graphics cards and Chris tries to delete all of his files using Microsoft UFO, a new open source project that uses GPT-4 vision to navigate and execute tasks on your Windows PC. We briefly cover V-JEPA (will try for next week's show) and its ability to learn through watching videos and listening, and finally discuss Stability's Stable Cascade, which we've made available for "research" on SimTheory.

    If you like the show please consider subscribing and leaving a comment. We appreciate your support.

    ======
    Chapters:
    00:00 - OpenAI's Sora That Creates Videos Instantly From Text
    13:49 - ChatGPT Memory Released in Limited Preview
    23:31 - OpenAI Rumored To Be Building Web Search, Andrej Karpathy Leaves OpenAI, Have OpenAI Slowed Down?
    33:04 - Google Announces Gemini Pro 1.5. Huge Breakthrough 10M Context Window!
    50:11 - Microsoft Research Publishes GraphRAG: Knowledge Graph Based RAG
    1:02:03 - Nvidia's ChatRTX Road Tested
    1:07:18 - AI Computers, AI PCs & Microsoft's UFO: An Agent for Windows OS Interaction. Risk of AI Computers.
    1:18:46 - Meta's V-JEPA: new architecture for self-supervised learning
    1:24:26 - Stability AI's Stable Cascade

    16 February 2024, 1:07 am
  • 1 hour 1 minute
    EP50: We Bet $1000 Using Gemini Advanced, Qwen1.5 72B, Retell AI, Apple's MGIE & GOODY-2

    Subscribe to ThisDayInAI: https://thisdayinai.com
    Try AI Agents on SimTheory:
    https://simtheory.ai
    Show notes:
    https://thisdayinai.com/bookmarks/6-ep50

    Tell us your thoughts on Gemini here: https://thisdayinai.com/post/62-your-thoughts-gemini-advanced/

    Thanks to everyone for all your support and kind reviews to reach 50 episodes! Please consider leaving us a review wherever you get your podcasts.
    =====

    This week we cover the launch of Google Gemini Advanced, Gemini Ultra 1.0 and Bard being renamed to Gemini. We compare GPT-4, Gemini Ultra 1.0 and Qwen 1.5 72B by sports betting $1000 on horse racing.

    We celebrate 50 episodes and share our excitement for Qwen 1.5 72B's performance at coding and quick refusals. We cover new releases including SyncLabs, Retell AI and Apple's Open Source Guiding Instruction-based Image Editing via Multimodal Large Language Models.

    Finally, we discuss GOODY-2 and its high refusal rate.

    =====
    CHAPTERS:

    00:00 - Betting $1,000 To Compare Gemini Ultra 1.0 to GPT-4 to Qwen 1.5
    07:33 - Google Gemini Advanced, Ultra: Details of Announcement and First Impressions
    25:48 - OpenAI is Developing Agents to Control Your Devices
    27:40 - Celebrating 50 Episodes of This Day in AI
    30:34 - Qwen 1.5 72B: We're Impressed!
    42:47 - SyncLabs: Tested & Impressions
    47:58 - Retell AI: Tested & Impressions
    54:18 - Apple's Open Source Guiding Instruction-based Image Editing via Multimodal Large Language Models
    58:10 - GOODY-2: The World's Most Responsible AI Model

    9 February 2024, 1:52 am