,
Last update: Friday 3/29/24
Last update: Friday 3/29/24
This weekly news page provides links to reports about major innovations in language models, with emphasis on small language models (SLMs).
Most of the links refer to reports published after the "AI Big Bang", i.e., after Sam Altman's dismissal from and return to OpenAI, i.e., after November 20, 2023. The previous edition of this page can be found here ➡ PreBigBang
... Our publication schedule ...
A. OpenAI | B. Microsoft | C. Google | D. Other Models | E. LLM News |
F. SLM News | G. Public Policy | H. Misc | I. Hacks | J. Basics
F. SLM News | G. Public Policy | H. Misc | I. Hacks | J. Basics
Top stories ...
- LLM News
"Inside the Creation of the World’s Most Powerful Open Source AI Model", Will Knight, Wired, 3/27/24 ***
-- Model is called DBRX ... 136 billion parameters
-- This story also covered by The Information, Business Insider ... and Databricks - Other Models
"Amazon Adds $2.75 Billion to Its Stake in the A.I. Start-Up Anthropic",Karen Weise, NY Times, 3/27/24 ***
-- This story also covered by Bloomberg, VentureBeat, The Information, Wall Street Journal, TechCrunch, - Other Models
"Elon Musk announces Grok-1.5, nearing GPT-4 level performance", Shubham Sharma, VentureBeat, 3/29/24 ***
-- This story also covered by TechCrunch, Wall Street Journal, Reuters, Endgadget, ... and xAI (Musk) - Microsoft
"Microsoft and OpenAI Plot $100 Billion Stargate AI Supercomputer", Anissa Gardizy and Amir Efrati, The Information, 3/29/24 ***
-- This story also covered by Gizmodo, Reuters,
Evolving context: Links to our newest stories are printed in red font. From time to time, links to older news stories (black font) whose content has been overcome by events will be deleted. However, enough older links will be retained so that readers can get a quick sense of where we are and how we got here by skimming the headlines of the remaining articles in chronological order. No section will ever include more than 12 stories.
A. OpenAI
- "OpenAI Shifts AI Battleground to Software That Operates Devices, Automates Tasks", Stephanie Palazzolo and Amir Efrati, The Information, 2/7/24 ... Link downloads the full exclusive paywalled article for 3 free reads
- "OpenAI’s Sora Turns AI Prompts Into Photorealistic Videos", Steven Levy, Wired, 2/13/24
-- This story also covered by VentureBeat, The Verge, Bloomberg, TechCrunch, Engadget, Mashable, NY Times
- "OpenAI Completes Deal That Values the Company at $80 Billion", Cade Metz and Tripp Mickle, NY Times, 2/16/24
- "Elon Musk Sues OpenAI and Sam Altman for Violating the Company’s Principles", Adam Satariano, Cade Metz and Tripp Mickle, NY Times, 3/1/24
-- A copy of Musk's electronically filed complaint can be found HERE
-- This story also covered by Financial Times, Wired, Bloomberg, Engadget, Washington Post, The Verge, Wall Street Journal, Reuters, BBC, TechCrunch, VentureBeat, NY Times #2 - "Key OpenAI Executive Played a Pivotal Role in Sam Altman’s Ouster", Mike Isaac, Tripp Mickle and Cade Metz, NY Times, 3/7/24
- "Sam Altman Asserts Control of OpenAI as He Rejoins Its Board", Cade Metz, Tripp Mickle and Mike Isaac, NY Times, 3/8/24
-- This story also covered by Engadget, Bloomberg, VentureBeat, TechCrunch, Wired, Wall Street Journal, The Verge, ... and OpenAI - "OpenAI Made AI Videos for Us. These Clips Are Good Enough to Freak Us Out.", Joanna Stern, Wall Street Journal, 3/13/24
-- See also "The 12 OpenAI Sora TikToks That Broke Our Brains", Maxwell Zeff. Gizmodo, 3/14/24 - '"Chatbot App Store Is Off to a Slow Start", Stephanie Palazzolo, The Information, 3/19/24
- "OpenAI is expected to release a 'materially better' GPT-5 for its chatbot mid-year, sources say", Kali Hays and Darius Rafieyan, Business Insider, 3/19/24
-- This story also covered by Mashable, ZDNet, - "OpenAI’s chatbot store is filling up with spam", Kyle Wiggers, TechCrunch, 3/20/24
B. Microsoft
- "The Inside Story of Microsoft’s Partnership with OpenAI", Charles Duhigg, The New Yorker, 12/1/23
- "Microsoft Copilot is now available as a ChatGPT-like app on Android", Tom Warren, The Verge, 12/26/23
-- This story also covered by Engadget - "Is Microsoft Copilot Free? The Complete Guide to Copilot Pricing", UC Today, 1/16/24
- "Microsoft Dishes on AI Revenue; Google CEO Says ‘Agents’ Are Coming", Aaron Holmes and Jon Victor, The Information, 1/31/24 ... Link downloads the full paywalled article for 3 free reads
-- Note: This article also discusses Google's earnings call
-- Microsoft's quarterly earnings also reported by Reuters, NY Times, - "Microsoft and Intel strike a custom chip deal that could be worth billions", Wes Davis, The Verge, 2/23/24
-- This story also covered by Reuters, Bloomberg, Wall Street Journal, Engadget, - "MWC: Microsoft pitches ‘AI access principles’ to offset OpenAI competition concerns", Ingrid Lunden, TechCrunch, 2/26/24
-- This story also covered by Reuters, ... and Microsoft - "Microsoft strikes deal with Mistral in push beyond OpenAI", Madhumita Murgia, Financial Times, 2/26/24
-- This story also covered by VentureBeat, The Verge, Computerworld, TechCrunch, FastCompany, Bloomberg, - "Microsoft Hires DeepMind Co-Founder Suleyman to Run Consumer AI", Dina Bass, Bloomberg, 3/19/24
-- This story also covered by The Verge, TechCrunch, NY Times, Forbes - "Microsoft and OpenAI Plot $100 Billion Stargate AI Supercomputer", Anissa Gardizy and Amir Efrati, The Information, 3/29/24 ***
-- This story also covered by Gizmodo, Reuters,
C. Google
- "Google DeepMind's Demis Hassabis Says Gemini Is a New Breed of AI", Will Knight, Wired, 12/6/23
-- This story also covered by The Verge, MIT Tech Review, NY Tmes, Ars Technica ... and Google DeepMind
-- Users access Gemini Pro via Bard: "How to Use Google’s Gemini AI Right Now in Its Bard Chatbot", Reece Rogers, Wired, 12/6/23
-- Google's new Gemini powered notebook: "Google’s AI note-taking app is now available to users in the US", Emma Roth, The Verge, 12/8/23 ... Also reported by Engadget, Gizmodo,
-- Announcements boosted Google/Alphabet's stock: "Alphabet soars as Wall Street cheers arrival of AI model Gemini", Aditya Soni, Reuters, 12/7/23 - "Google to Team Up With Startup Hugging Face to Host AI Software", Julia Love, Bloomberg, 1/25/24
- "Google’s Ad Sales Fall Short of Wall Street’s Lofty Expectations", Miles Kruppa, Wall Street Journal, 1/30/24
-- This story also covered by NY Times, Forbes, and The Information, 1/31/24 (Link downloads the full paywalled article for 3 free reads) - "Google rebrands Bard chatbot as Gemini, rolls out paid subscription", Jeffrey Dastin, Reuters, 2/8/24
-- This story also covered by VentureBeat, NY Times, ZDNet, Engadget, MIT Tech Review - "Google unveils Gemini 1.5, a next-gen AI model with million-token context window", Michael Nuñez, VentureBeat, 2/15/24
-- This story also covered by The Verge, TechCrunch, Bloomberg, Wired, Mashable - "Google suspends Gemini from making AI images of people after a backlash complaining it was 'woke'", Joshua Zitser, Business Insider, 2/22/24
-- This story also covered byThe Verge, Gizmodo, Wall Street Journal, Fast Company, TechCrunch, Associated Press, NY Times ... and Google (on X/Twitter) - "Google Chrome’s new AI can finish your sentences for you", Jess Weatherbed, The Verge, 2/22/24
-- This story also covered by Engadget, Gizmodo, - "Google won’t let you use its Gemini AI to answer questions about an upcoming election in your country", Jagmeet Singh, TechCrunch, 3/13/24
-- This story also covered by Ars Technica, BBC, Reuters, Engadget, - "Apple Is in Talks to Let Google Gemini Power iPhone AI Features", Mark Gurman. Bloomberg, 3/19/24
-- This story also covered by The Information, Reuters.
D. Amazon and other Big Tech models
- "Amazon Takes a Big Stake [$4 billion] in the A.I. Start-Up Anthropic", Adam Satariano and Cade Metz, NY Times, 9/25/23
-- This story also covered by Engadget, Gizmodo, VentureBeat, BBC News, Bloomberg, Reuters,
-- Description of Anthropic in Wikipedia ➡ HERE - Amazon's re:Invent 2023 conference (Nov 27-Dec 1, Las Vegas, NV)
-- Overview "Here’s everything Amazon Web Services announced at AWS re:Invent", Christine Hall, TechCrunch, 11/29/23
-- Amazon Q, an AI-powered chatbot for AWS customers ..."Amazon Introduces Q, an A.I. Chatbot for Companies", Karen Weise, NY Times, 11/28/23 ... This story also covered by Bloomberg, VentureBeat, Gizmodo,
-- Image generator "Amazon joins AI image creation fray with new model", Emilia David, The Verge, 11/29/23 ... This story also covered by Bloomberg, Engadget, Gizmodo - "Elon Musk’s Grok Represents a Serious Threat to ChatGPT", Shirin Ghaffary, Bloomberg, 12/14/23
-- This story also covered by Wired, Fast Company, VentureBeat, TechCrunch, - "Amazon Enters Chatbot Fray With Shopping Tool", Karen Weise, NY Times, 2/3/24
- "Tim Cook confirms Apple’s generative AI features are coming ‘later this year’", Chris Welch, The Verge, 2/1/24
-- This story also covered by CNBC, ZDNet, Computerworld - "Adobe’s latest AI experiment generates music from text", Will Shanklin, Engadget, 3/1/24 ... Includes YouTube demo of music generation
-- This story also covered by The Verge, Gizmodo, TechCrunch, - "In Latest A.I. War Escalation, Elon Musk Releases Chatbot Code", Kate Conger and Cade Metz, NY Times, 3/17/24
-- This story also covered by TechCrunch, The Verge, Wired, VentureBeat, Mashable, ... and xAI - "Amazon Adds $2.75 Billion to Its Stake in the A.I. Start-Up Anthropic",Karen Weise, NY Times, 3/27/24 ***
-- This story also covered by Bloomberg, VentureBeat, The Information, Wall Street Journal, TechCrunch - "Elon Musk announces Grok-1.5, nearing GPT-4 level performance", Shubham Sharma, VentureBeat, 3/29/24 ***
-- This story also covered by TechCrunch, Wall Street Journal, Reuters, Endgadget, ... and xAI (Musk)
-- Musk also posted this comment on X on 3/28/24, "Should be available on 𝕏 next week. Grok 2 should exceed current AI on all metrics. In training now."
E. Large Language Model (LLM) news
- "ChatGPT Helps, and Worries, Business Consultants, Study Finds", David Berreby, NY Times, 12/28/23
- "Inside the News Industry’s Uneasy Negotiations With OpenAI", Benjamin Mullin, NY Times, 12/29/23
- "Generative AI isn’t a home run in the enterprise", Kyle Wiggers, TechCrunch, 1/11/24
- "OpenAI Doesn’t Want to Train on New York Times Data After Lawsuit, Altman Says", Brad Stone and Jake Rudnitsky, Bloomberg, 1/18/24
-- This story also covered by CNN, - "Here are the key differences between the Samsung Galaxy S24 phones", Sheena Vasani, The Verge, 1/18/24
-- This story also covered by Mashable, CNET, The Verge, Wired, Bloomberg ... and Samsung - "AI Chip Database -- Top Startups Designing Chips", Stephanie Palazzolo, The Information, 1/30/24
- "Allen Institute for AI releases ‘truly open source’ LLM to drive ‘critical shift’ in AI development", Sharon Goldman, VentureBeat, 2/1/24
-- This story also covered by TechCrunch - "Hugging Face launches open source AI assistant maker to rival OpenAI’s custom GPTs", Carl Franzen, VentureBeat, 2/2/24
- "A.I. Start-Up Anthropic Challenges OpenAI and Google With New Chatbot", Cade Metz, NY Times, 3/4/24
-- This story also covered by TechCrunch, CNBC,CNET, Bloomberg, - "Inside the Creation of the World’s Most Powerful Open Source AI Model", Will Knight, Wired, 3/27/24 ***
-- Model is called DBRX ... 136 billion parameters
-- This story also covered by The Information, Business Insider ... and Databricks
F. Small Language Model (SLM) news
Note: News reports are classified in this category if and only if they explicitly refer to models with less than 30 billion (30B) parameters. Unfortunately, for the time being this means that this section will include small models whose powers are comparable to the powers of the largest models, e.g., Microsoft's Phi-2 ... AND ... small models whose powers are much weaker than the powers of the largest models. e.g. LLaMA 2 and Gemma ... UGH!! ... :-(
Note: News reports are classified in this category if and only if they explicitly refer to models with less than 30 billion (30B) parameters. Unfortunately, for the time being this means that this section will include small models whose powers are comparable to the powers of the largest models, e.g., Microsoft's Phi-2 ... AND ... small models whose powers are much weaker than the powers of the largest models. e.g. LLaMA 2 and Gemma ... UGH!! ... :-(
- "LLaMA 2: How to access and use Meta’s versatile open-source chatbot right now", Michael Nuñez, VentureBeat, 7/19/23-- This story also covered by Hugging Face, Axios, CNET ... and Meta
- "Microsoft releases Phi-2, a small language model AI that outperforms Llama 2, Mistral 7B", Carl Franzen, VentureBeat, 12/12/23
-- This story also covered by ZDNet, TechRepublic, Medium, Computerworld ... and Microsoft - "Stability AI unveils smaller, more efficient 1.6B language model as part of ongoing innovation", Sean Michael Kerner, VentureBeat, 1/19/24
-- This story also covered by Stability AI - "Exclusive: Microsoft has created a new team to build “small” AI that’s cheaper than OpenAI’s.", Aaron Holmes, The Information, 1/23/24 ... Link presents the full paywalled article for 3 free reads 1/31/24
- "Meet the Creator of Microsoft Phi-2", Mohit Pandey, AIM, 2/15/24
- "Google goes 'open AI'" with Gemma, a free, open-weights chatbot family", Benj Edwards, Ars Technica, 2/21/24
-- This story also covered by Reuters, Fast Company, Hugging Face, Fortune, NY Times ... and Google - "Generative AI and the big buzz about small language models", Clint Boulton, VentureBeat, 2/29/24
G. Public policy and legal considerations
- "Joe Biden’s Big AI Plan Sounds Scary—but Lacks Bite", Matt Laslo, Wired, 10/31/23 ... This event also coveed by NY Times, VentureBeat, Forbes,
- "E.U. reaches deal on landmark AI bill, racing ahead of U.S.", Anthony Faiola, Cat Zakrzewski and Beatriz Ríos, Washington Post, 12/8/23
-- This story also covered by APNews, NY Times, BBC, Forbes, - "AI cannot be patent 'inventor', UK Supreme Court rules in landmark case", Reuters, 12/20/23
- "The New York Times sued Microsoft and OpenAI for alleged copyright infringement, touching off a legal fight over generative-AI technologies, with implications for the future of the news business", Alexandra Bruell, Wall Street Journal, 12/27/23
-- A related story (podcast) ➡ "How Adobe is managing the AI copyright dilemma, with general counsel Dana Rao", Nilay Patel, TheVerge, 1/9/24 - "Federal Trade Commission Launches Inquiry Into A.I. Deals by Tech Giants", David McCabe, NY Times, 1/25/24
- "F.C.C. Bans A.I.-Generated Robocalls", Cecilia Kang, NYTimes, 2/8/24
-- This story also covered by NY Times, CNN, Forbes, NPR ... and the FCC - "OpenAI and Other Tech Giants Will Have to Warn the US Government When They Start New AI Projects", Will Knight, Wired, Will Knight, 1/27/24
-- This story also covered by Mashable - "Google, Apple, Meta and other huge tech companies join US consortium to advance responsible AI", Sharon Goldman, Engadget 2/8/24
-- This story also covered by VentureBeat ... and the U.S. Dept of Commerce - "Forced to Change: Tech Giants Bow to Global Onslaught of Rules", Adam Satariano and David McCabe, NY Times, 3/4/24
- "World’s Most Extensive AI Rules Approved in EU Despite Criticism", Jillian Deutsch, Bloomberg, 3/13/24
-- This story also covered by TechCrunch, BBC, Wall StreetJournal, VentureBeat, The Information, Mashable ... and European Parliament - "France Fines Google Amid A.I. Dispute With News Media", Adam Satariano, NY Times, 3/20/24
- "The White House knows the risks of AI being used by federal agencies. Here's how they're handling it.", Cecily Mauran, Mashable, 3/28/24
H. Misc = Opinions, other news, rumors, long reads ... and humor
- Two interviews with Demis Hassabis, CEO of Google DeepMind
-- "Inside Google’s big AI shuffle — and how it plans to stay competitive, with Google DeepMind CEO Demis Hassabis", Nilay Patel, The Verge, 7/10/23 ... audio and transcript of interview erge
-- "A.I. Could Solve Some of Humanity's Hardest Problems. It Already Has.". Guest = Demis Hassabis, The Ezra Klein Show (podcast with transcript), 7/11/23 - "Why the Godfather of A.I. Fears What He’s Built", Joshua Rothman, The New Yorker, 11/13/23
- "Nvidia is now worth more than Amazon and Alphabet", Amrita Khalid, The Verge, 2/14/24
-- This story also covered by Reuters, Bloomberg
-- 2/23/24 Nvidia's stock/revenue surge continues as per Bloomberg, BBC, Reuters, NY Times, - "Inside the Funding Frenzy at Anthropic, One of A.I.’s Hottest Start-Ups", Erin Griffith and Cade Metz, NY Times, 2/20/24
- "Amazon, Google Quietly Tamp Down Generative AI Expectations", Aaron Holmes and Anissa Gardizy, The Information, 3/12/24
- "Nvidia's next-gen AI chips are way more powerful and use a lot less energy", Alex Perry, Mashable, 3/19/24
-- This story also covered by Bloomberg, CNBC, VentureBeat, FastCompany, Reuters, - "Saudi Arabia Plans $40 Billion Push Into Artificial Intelligence", Maureen Farrell and Rob Copeland, NY Times, 3/19/24
- "In One Key A.I. Metric, China Pulls Ahead of the U.S.: Talent", Paul Mozur and Cade Metz, NY Times,3/20/24
- "'He’s a Megalomaniac': VCs Reportedly Fed Up with OpenAI's Sam Altman", Lucas Ropek, Gizmodo, 3/27/24
I. Language model flaws, hacks, and remedies
- "Prompt Injection Attack on GPT-4", William Zhang, Robust Intelligence, 4/31/23
- "The Security Hole at the Heart of ChatGPT and Bing", Matt Burgess, Wired, 5/25/23
-- Prompt-injection hack for plugins covered by Mashable - "Computer scientists claim to have discovered ‘unlimited’ ways to jailbreak ChatGPT", Clint Rainey, FastCompany, 7/27/23
-- This story also covered by NY Times, Mashable, Yahoo Finance, Wired, ZDNet,
-- Note: The research reported in these articles was conducted by Carnegie Mellon University - "Stanford study challenges assumptions about language models: Larger context doesn’t mean better understanding", Matt Marshall, VentureBeat, 7/21/23
- "Hackers Trick AI With ‘Bad Math’ to Expose Flaws and Biases", Katrina Manson, Bloomberg, 8/12/23
-- This story also covered by the NY Times - "Uh-oh! Fine-tuning LLMs compromises their safety, study finds", Ben Dickson, VentureBeat, 10/13/23
- "Patronus AI finds ‘alarming’ safety gaps in leading AI systems", Michael Nuñez, VentureBeat, 12/19/23
- "Anthropic researchers find that AI models can be trained to deceive", Emilia David, TechCrunch, 1/13/24
-- This story also covered by VentureBeat ... and Anthropic - "OpenAI's GPT Is a Recruiter's Dream Tool. Tests Show There's Racial Bias", Leon Yin, Davey Alba and Leonardo Nicoletti, TechCrunch, 3/7/24
- "OpenAI’s chatbot store is filling up with spam", Kyle Wiggers, TechCrunch, 3/20/24
- "Why Two Models Are Better Than One", Stephanie Palazzolo, The Information, 3/28/24
J. Basics
- "Watch an A.I. Learn to Write by Reading Nothing but Shakespeare or Harry Potter or Jane Austen or Star Trek or Moby Dick", Aatish Bhatia, NY Times, 4/27/23
- "What Is a Large Language Model, the Tech Behind ChatGPT?", Kurt Muehmel, Data Iku, 6/7/23
- "Textbooks Are All You Need II: phi-1.5 technical report", Yuanzhi Li, Sébastien Bubeck, Ronen Eldan, Allie Del Giorno, Suriya Gunasekar, Yin Tat Lee, Microsoft Research, September 2023
- "Phi-2: The surprising power of small language models", Mojan Javaheripi and Sébastien Bubeck , Microsoft Research, 12/12/24
- WTF are diffusion transformers
-- "Stable Diffusion 3.0 debuts new diffusion transformation architecture to reinvent text-to-image gen AI", Sean Michael Kerner, VentureBeat, 2/22/24
-- "Diffusion transformers are the key behind OpenAI’s Sora — and they’re set to upend GenAI", Kyle Wiggers, TechCrunch, 2/28/24
-- "Diffusion Transformer Explained", Mario Namtao Shianti Larcher, Towards Data Science, 2/28/24 - "Compact Guide to Large Language Models", DataBricks, 2023 ... Link to form that enables download of their eBook (9 pages) pdf file.
- Dozen Basic AI FAQs
This page contains links to responses by Google's Bard chatbot to 12 questions that should be asked more frequently, but aren't. - "Let's learn about artificial intelligence -- A series about AI, machine learning, ChatGPT, and more", Mark Wiemer, Medium, 3/21/23
___________________________________
Links to some back issues and TL;DRs/podcasts of this news page:
- Back issues ...
29 Mar 24
22 Feb 24, 29 Jan2 4, 1 Jan 24, PreBigBang,
6 Oct 23, 27 Aug 23, 18 July 23, 22 June 23,
16 May 23, 5 April 23, 28 Mar 23 - TL;DRs and podcasts...
24Mar24, 18Mar24,
25Feb24, 18Feb24, 12Feb24, 5Feb24
28Jan24, 14Jan24, 9Jan24
21Dec23, 3Dec23,
26Nov23, 19Nov23, 12Nov23, 3Nov23,
27OCT23, 20Oct23,
No comments:
Post a Comment
Your comments will be greatly appreciated ... Or just click the "Like" button above the comments section if you enjoyed this blog note.