UsefulAINewsBackup29Mar24


Last update: Friday 3/29/24
This weekly news page provides links to reports about major innovations in language models, with emphasis on small language models (SLMs).

Most of the links refer to reports published after the "AI Big Bang", i.e., after Sam Altman's dismissal from and return to OpenAI (after November 20, 2023). The previous edition of this page can be found here: PreBigBang


A. OpenAI | B. Microsoft | C. Google | D. Other Models | E. LLM News | F. SLM News | G. Public Policy | H. Misc | I. Hacks | J. Basics

Top stories  ... 
  1. LLM News
    "Inside the Creation of the World’s Most Powerful Open Source AI Model", Will Knight, Wired, 3/27/24 *** 
    -- Model is called DBRX ... 136 billion parameters
    -- This story also covered by The Information, Business Insider ... and Databricks

  2. Other Models
    "Amazon Adds $2.75 Billion to Its Stake in the A.I. Start-Up Anthropic", Karen Weise, NY Times, 3/27/24 *** 
    -- This story also covered by Bloomberg, VentureBeat, The Information, Wall Street Journal, TechCrunch

  3. Other Models
    "Elon Musk announces Grok-1.5, nearing GPT-4 level performance", Shubham Sharma, VentureBeat, 3/29/24 *** 
    -- This story also covered by TechCrunch, Wall Street Journal, Reuters, Engadget ... and xAI (Musk)

  4. Microsoft
    "Microsoft and OpenAI Plot $100 Billion Stargate AI Supercomputer", Anissa Gardizy and Amir Efrati, The Information, 3/29/24 *** 
    -- This story also covered by Gizmodo, Reuters
Evolving context: Links to our newest stories are printed in red font. From time to time, links to older news stories (black font) whose content has been overcome by events will be deleted. However, enough older links will be retained so that readers can get a quick sense of where we are and how we got here by skimming the headlines of the remaining articles in chronological order. No section will ever include more than 12 stories.

B. Microsoft 
  • "The Inside Story of Microsoft’s Partnership with OpenAI", Charles Duhigg, The New Yorker, 12/1/23 

  • "Microsoft Copilot is now available as a ChatGPT-like app on Android", Tom Warren, The Verge, 12/26/23
    -- This story also covered by Engadget 

  • "Is Microsoft Copilot Free? The Complete Guide to Copilot Pricing", UC Today, 1/16/24

  • "Microsoft Dishes on AI Revenue; Google CEO Says ‘Agents’ Are Coming", Aaron Holmes and Jon Victor, The Information, 1/31/24 ... Link downloads the full paywalled article for 3 free reads
    -- Note: This article also discusses Google's earnings call
    -- Microsoft's quarterly earnings also reported by Reuters, NY Times

  • "Microsoft and Intel strike a custom chip deal that could be worth billions", Wes Davis, The Verge, 2/23/24  
    -- This story also covered by Reuters, Bloomberg, Wall Street Journal, Engadget

  • "MWC: Microsoft pitches ‘AI access principles’ to offset OpenAI competition concerns", Ingrid Lunden, TechCrunch, 2/26/24  
    -- This story also covered by Reuters ... and Microsoft

  • "Microsoft strikes deal with Mistral in push beyond OpenAI", Madhumita Murgia, Financial Times, 2/26/24 
    -- This story also covered by VentureBeat, The Verge, Computerworld, TechCrunch, Fast Company, Bloomberg

  • "Microsoft Hires DeepMind Co-Founder Suleyman to Run Consumer AI", Dina Bass, Bloomberg, 3/19/24
    -- This story also covered by The Verge, TechCrunch, NY Times, Forbes

  • "Microsoft and OpenAI Plot $100 Billion Stargate AI Supercomputer", Anissa Gardizy and Amir Efrati, The Information, 3/29/24 *** 
    -- This story also covered by Gizmodo, Reuters
C. Google 

D. Amazon and other Big Tech models
  • "Amazon Takes a Big Stake [$4 billion] in the A.I. Start-Up Anthropic", Adam Satariano and Cade Metz, NY Times, 9/25/23 
    -- This story also covered by Engadget, Gizmodo, VentureBeat, BBC News, Bloomberg, Reuters
    -- Description of Anthropic in Wikipedia  HERE

  • Amazon's re:Invent 2023 conference (Nov 27-Dec 1, Las Vegas, NV) 
    -- Overview "Here’s everything Amazon Web Services announced at AWS re:Invent", Christine Hall, TechCrunch, 11/29/23
    -- Amazon Q, an AI-powered chatbot for AWS customers ..."Amazon Introduces Q, an A.I. Chatbot for Companies", Karen Weise, NY Times, 11/28/23 ... This story also covered by Bloomberg, VentureBeat, Gizmodo
    -- Image generator "Amazon joins AI image creation fray with new model", Emilia David, The Verge, 11/29/23 ... This story also covered by Bloomberg, Engadget, Gizmodo

  • "Elon Musk’s Grok Represents a Serious Threat to ChatGPT",  Shirin Ghaffary, Bloomberg, 12/14/23 
    -- This story also covered by Wired, Fast Company, VentureBeat, TechCrunch

  • "Amazon Enters Chatbot Fray With Shopping Tool", Karen Weise, NY Times, 2/3/24  

  • "Tim Cook confirms Apple’s generative AI features are coming ‘later this year’", Chris Welch, The Verge, 2/1/24  
    -- This story also covered by CNBC, ZDNet, Computerworld

  • "Adobe’s latest AI experiment generates music from text", Will Shanklin, Engadget, 3/1/24 ... Includes YouTube demo of music generation 
    -- This story also covered by The Verge, Gizmodo, TechCrunch

  • "In Latest A.I. War Escalation, Elon Musk Releases Chatbot Code", Kate Conger and Cade Metz, NY Times, 3/17/24 
    -- This story also covered by TechCrunch, The Verge, Wired, VentureBeat, Mashable, ... and xAI

  • "Amazon Adds $2.75 Billion to Its Stake in the A.I. Start-Up Anthropic", Karen Weise, NY Times, 3/27/24 *** 
    -- This story also covered by Bloomberg, VentureBeat, The Information, Wall Street Journal, TechCrunch

  • "Elon Musk announces Grok-1.5, nearing GPT-4 level performance", Shubham Sharma, VentureBeat, 3/29/24 *** 
    -- This story also covered by TechCrunch, Wall Street Journal, Reuters, Engadget ... and xAI (Musk)
    -- Musk also posted this comment on X on 3/28/24, "Should be available on 𝕏 next week. Grok 2 should exceed current AI on all metrics. In training now."

E. Large Language Model (LLM) news
  • "ChatGPT Helps, and Worries, Business Consultants, Study Finds", David Berreby, NY Times, 12/28/23 
  • "Inside the News Industry’s Uneasy Negotiations With OpenAI", Benjamin Mullin, NY Times, 12/29/23
  • "Generative AI isn’t a home run in the enterprise", Kyle Wiggers, TechCrunch, 1/11/24
  • "OpenAI Doesn’t Want to Train on New York Times Data After Lawsuit, Altman Says", Brad Stone and Jake Rudnitsky, Bloomberg, 1/18/24 
    -- This story also covered by CNN

  • "Here are the key differences between the Samsung Galaxy S24 phones", Sheena Vasani, The Verge, 1/18/24  
    -- This story also covered by Mashable, CNET, The Verge, Wired, Bloomberg ... and Samsung

  • "AI Chip Database --  Top Startups Designing Chips", Stephanie Palazzolo, The Information, 1/30/24
  • "Allen Institute for AI releases ‘truly open source’ LLM to drive ‘critical shift’ in AI development", Sharon Goldman, VentureBeat, 2/1/24 
    -- This story also covered by TechCrunch

  • "Hugging Face launches open source AI assistant maker to rival OpenAI’s custom GPTs", Carl Franzen, VentureBeat, 2/2/24
  • "A.I. Start-Up Anthropic Challenges OpenAI and Google With New Chatbot", Cade Metz, NY Times, 3/4/24 
    -- This story also covered by TechCrunch, CNBC, CNET, Bloomberg

  • "Inside the Creation of the World’s Most Powerful Open Source AI Model", Will Knight, Wired, 3/27/24 *** 
    -- Model is called DBRX ... 136 billion parameters
    -- This story also covered by The Information, Business Insider ... and Databricks
F. Small Language Model (SLM) news 
Note: News reports are classified in this category if and only if they explicitly refer to models with fewer than 30 billion (30B) parameters. Unfortunately, for the time being this means that this section will include small models whose powers are comparable to the powers of the largest models, e.g., Microsoft's Phi-2 ... AND ... small models whose powers are much weaker than the powers of the largest models, e.g., LLaMA 2 and Gemma ... UGH!! ... :-(
  • "LLaMA 2: How to access and use Meta’s versatile open-source chatbot right now", Michael Nuñez, VentureBeat, 7/19/23
    -- This story also covered by Hugging Face, Axios, CNET ... and Meta

  • "Microsoft releases Phi-2, a small language model AI that outperforms Llama 2, Mistral 7B", Carl Franzen, VentureBeat, 12/12/23 
    -- This story also covered by ZDNet, TechRepublic, Medium, Computerworld ... and Microsoft

  • "Stability AI unveils smaller, more efficient 1.6B language model as part of ongoing innovation", Sean Michael Kerner, VentureBeat, 1/19/24
    -- This story also covered by Stability AI

  • "Exclusive: Microsoft has created a new team to build “small” AI that’s cheaper than OpenAI’s.", Aaron Holmes, The Information, 1/23/24  ... Link presents the full paywalled article for 3 free reads 1/31/24

  • "Meet the Creator of Microsoft Phi-2", Mohit Pandey, AIM, 2/15/24
  • "Google goes 'open AI' with Gemma, a free, open-weights chatbot family", Benj Edwards, Ars Technica, 2/21/24 
    -- This story also covered by Reuters, Fast Company, Hugging Face, Fortune, NY Times ... and Google

  • "Generative AI and the big buzz about small language models", Clint Boulton, VentureBeat, 2/29/24

G. Public policy and legal considerations 
  • "Joe Biden’s Big AI Plan Sounds Scary—but Lacks Bite", Matt Laslo, Wired, 10/31/23 ... This event also covered by NY Times, VentureBeat, Forbes
  • "E.U. reaches deal on landmark AI bill, racing ahead of U.S.", Anthony Faiola, Cat Zakrzewski and Beatriz Ríos, Washington Post, 12/8/23
    -- This story also covered by APNews, NY Times, BBC, Forbes

  • "AI cannot be patent 'inventor', UK Supreme Court rules in landmark case", Reuters, 12/20/23
  • "The New York Times sued Microsoft and OpenAI for alleged copyright infringement, touching off a legal fight over generative-AI technologies, with implications for the future of the news business", Alexandra Bruell, Wall Street Journal, 12/27/23 
    -- A related story (podcast) ➡ "How Adobe is managing the AI copyright dilemma, with general counsel Dana Rao", Nilay Patel, The Verge, 1/9/24

  • "Federal Trade Commission Launches Inquiry Into A.I. Deals by Tech Giants", David McCabe, NY Times, 1/25/24
  • "F.C.C. Bans A.I.-Generated Robocalls", Cecilia Kang, NY Times, 2/8/24  
    -- This story also covered by NY Times, CNN, Forbes, NPR ... and the FCC

  • "OpenAI and Other Tech Giants Will Have to Warn the US Government When They Start New AI Projects", Will Knight, Wired, 1/27/24
    -- This story also covered by Mashable
  • "Google, Apple, Meta and other huge tech companies join US consortium to advance responsible AI", Sharon Goldman, Engadget, 2/8/24 
    -- This story also covered by VentureBeat ... and the U.S. Dept of Commerce

  • "Forced to Change: Tech Giants Bow to Global Onslaught of Rules", Adam Satariano and David McCabe, NY Times, 3/4/24
  • "World’s Most Extensive AI Rules Approved in EU Despite Criticism", Jillian Deutsch, Bloomberg, 3/13/24  
    -- This story also covered by TechCrunch, BBC, Wall Street Journal, VentureBeat, The Information, Mashable ... and European Parliament

  • "France Fines Google Amid A.I. Dispute With News Media", Adam Satariano, NY Times, 3/20/24
  • "The White House knows the risks of AI being used by federal agencies. Here's how they're handling it.", Cecily Mauran, Mashable, 3/28/24

H. Misc = Opinions, other news, rumors, long reads ... and humor
  • Two interviews with Demis Hassabis, CEO of Google DeepMind
    -- "Inside Google’s big AI shuffle — and how it plans to stay competitive, with Google DeepMind CEO Demis Hassabis", Nilay Patel, The Verge, 7/10/23 ... audio and transcript of interview
    -- "A.I. Could Solve Some of Humanity's Hardest Problems. It Already Has.", Guest = Demis Hassabis, The Ezra Klein Show (podcast with transcript), 7/11/23 

  • "Why the Godfather of A.I. Fears What He’s Built", Joshua Rothman, The New Yorker, 11/13/23 
  • "Nvidia is now worth more than Amazon and Alphabet", Amrita Khalid, The Verge, 2/14/24
    -- This story also covered by Reuters, Bloomberg
    -- 2/23/24 Nvidia's stock/revenue surge continues as per Bloomberg, BBC, Reuters, NY Times

  • "Inside the Funding Frenzy at Anthropic, One of A.I.’s Hottest Start-Ups",  Erin Griffith and Cade Metz, NY Times, 2/20/24
  • "Amazon, Google Quietly Tamp Down Generative AI Expectations", Aaron Holmes and Anissa Gardizy, The Information, 3/12/24
  • "Nvidia's next-gen AI chips are way more powerful and use a lot less energy", Alex Perry, Mashable, 3/19/24 
    -- This story also covered by Bloomberg, CNBC, VentureBeat, Fast Company, Reuters
  • "Saudi Arabia Plans $40 Billion Push Into Artificial Intelligence", Maureen Farrell and Rob Copeland, NY Times, 3/19/24
  • "In One Key A.I. Metric, China Pulls Ahead of the U.S.: Talent", Paul Mozur and Cade Metz, NY Times, 3/20/24
  • "'He’s a Megalomaniac': VCs Reportedly Fed Up with OpenAI's Sam Altman", Lucas Ropek, Gizmodo, 3/27/24

I.  Language model flaws, hacks, and remedies
  • "Prompt Injection Attack on GPT-4", William Zhang, Robust Intelligence, 4/31/23
  • "The Security Hole at the Heart of ChatGPT and Bing", Matt Burgess, Wired, 5/25/23 
    -- Prompt-injection hack for plugins covered by Mashable

  • "Computer scientists claim to have discovered ‘unlimited’ ways to jailbreak ChatGPT", Clint Rainey, Fast Company, 7/27/23 
    -- This story also covered by NY Times, Mashable, Yahoo Finance, Wired, ZDNet
    -- Note: The research reported in these articles was conducted by Carnegie Mellon University

  • "Stanford study challenges assumptions about language models: Larger context doesn’t mean better understanding", Matt Marshall, VentureBeat, 7/21/23  
  • "Hackers Trick AI With ‘Bad Math’ to Expose Flaws and Biases", Katrina Manson, Bloomberg, 8/12/23 
    -- This story also covered by the NY Times

  • "Uh-oh! Fine-tuning LLMs compromises their safety, study finds", Ben Dickson, VentureBeat, 10/13/23 
  • "Patronus AI finds ‘alarming’ safety gaps in leading AI systems", Michael Nuñez, VentureBeat, 12/19/23
  • "Anthropic researchers find that AI models can be trained to deceive", Emilia David, TechCrunch, 1/13/24 
    -- This story also covered by VentureBeat ... and Anthropic 
  • "OpenAI's GPT Is a Recruiter's Dream Tool. Tests Show There's Racial Bias", Leon Yin, Davey Alba and Leonardo Nicoletti, Bloomberg, 3/7/24 
  • "OpenAI’s chatbot store is filling up with spam", Kyle Wiggers, TechCrunch, 3/20/24
  • "Why Two Models Are Better Than One", Stephanie Palazzolo, The Information, 3/28/24
J. Basics 
  • "Watch an A.I. Learn to Write by Reading Nothing but Shakespeare or Harry Potter or Jane Austen or Star Trek or Moby Dick", Aatish Bhatia, NY Times, 4/27/23 
  • "What Is a Large Language Model, the Tech Behind ChatGPT?", Kurt Muehmel, Dataiku, 6/7/23 
  • "Textbooks Are All You Need II: phi-1.5 technical report", Yuanzhi Li, Sébastien Bubeck, Ronen Eldan, Allie Del Giorno, Suriya Gunasekar, Yin Tat Lee, Microsoft Research, September 2023
  • "Phi-2: The surprising power of small language models", Mojan Javaheripi and Sébastien Bubeck, Microsoft Research, 12/12/23

  • WTF are diffusion transformers 
    -- "Stable Diffusion 3.0 debuts new diffusion transformation architecture to reinvent text-to-image gen AI", Sean Michael Kerner, VentureBeat, 2/22/24
    -- "Diffusion transformers are the key behind OpenAI’s Sora — and they’re set to upend GenAI", Kyle Wiggers, TechCrunch, 2/28/24
    -- "Diffusion Transformer Explained", Mario Namtao Shianti Larcher, Towards Data Science, 2/28/24

  • "Compact Guide to Large Language Models", Databricks, 2023 ... Link to a form that enables download of their eBook (9 pages) as a PDF file.
  • Dozen Basic AI FAQs
    This page contains links to responses by Google's Bard chatbot to 12 questions that should be asked more frequently, but aren't. 

  • "Let's learn about artificial intelligence -- A series about AI, machine learning, ChatGPT, and more", Mark Wiemer, Medium, 3/21/23

___________________________________
Links to some back issues and TL;DRs/podcasts of this news page:  
