Last update:Saturday 3/8/25
Our weekly news page provides links to reports about major innovations in generative AI. Our page is not for AI experts. It's for (1) computer savvy professionals who want to be alert to genAI's potential impact on their careers, and (2) computer savvy citizens who want to be alert to genAI's potential impact on our society.
... Most of the links on this page refer to reports published after the "AI Big Bang", i.e., after Sam Altman's sudden firing and rapid rehiring by OpenAI, i.e., after November 20, 2023. The previous edition of this page can be found here ➡ PreBigBang
A. OpenAI | B. Microsoft | C. Google | D. Other Models | E. Agents |
F. SLM News + Open Source | G. Public Policy | H. OpEds+Misc | I. Hacks | J. Basics
Top 4 stories in past week ...
- Public Policy
"Justice Dept. Doubles Down on Request to Break Up Google", David McCabe, NY Times, 3/7/25 ***
-- This story also covered by Wired, Washington Post, Engadget, - Public Policy
"Judge Denies Musk’s Request to Block OpenAI’s For-Profit Plan", Cade Metz, NY Times, 3/5/25 ***
-- This story also covered by Reuters, CNN, The Guardian, Bloomberg - Microsoft
"Exclusive: Microsoft’s AI Guru Wants Independence From OpenAI. That’s Easier Said Than Done", Aaron Holmes, The Information, 3/7/25 ***
-- This story also covered by TechCrunch, Gizmodo, Bloomberg - Other Models
"Apple Says Some AI-Powered Enhancements to Siri to Be Delayed", Sabela Ojea, Wall Street Journal, 3/5/25 ***
-- This story also covered by TechCrunch, The Verge, 9to5Mac, Bloomberg, Engadget,
Upcoming events ...
Evolving context and connections ...
Fast moving AI news is often exciting, but also confusing. This page is designed to provide readers with an evolving context for the news and reminders of the connections between news items. From time to time, links to older stories whose content has been overcome by newer events will be deleted. However, enough older links will be retained so that readers can get a quick sense of how we got to where we are today by skimming the headlines of the preceding stories in chronological order. Sectionz will usually include 7 to 10 stories.
➡ Our publication schedule
Fast moving AI news is often exciting, but also confusing. This page is designed to provide readers with an evolving context for the news and reminders of the connections between news items. From time to time, links to older stories whose content has been overcome by newer events will be deleted. However, enough older links will be retained so that readers can get a quick sense of how we got to where we are today by skimming the headlines of the preceding stories in chronological order. Sectionz will usually include 7 to 10 stories.
➡ Our publication schedule
A. OpenAI
- "OpenAI Says DeepSeek May Have Improperly Harvested Its Data", Cade Metz, NY Times, 1/28/25
-- This story also covered by TechCrunch, The Verge, Financial Times, Forbes - "OpenAI launches new o3-mini reasoning model with a free ChatGPT version", Tom Warren, The Verge, 1/31/25 ***
--- This story also covered by ZDNet, CNet, The Guardian, .. and OpenAI - "ChatGPT [paid] Subscribers Nearly Tripled to 15.5 Million in 2024", Stephanie Palazzolo and Amir Efrati, The Information, 2/1/25
- "ChatGPT’s agent can now do deep research for you", Richard Lawler, The Verge, 2/2/25-- This story also covered by TechCrunch, The Guardian ... and OpenAI
- "OpenAI Forecast Shows Shift From Microsoft to SoftBank", Cory Weinberg, Jon Victor and Anissa Gardizy, The Information (exclusive), 2/21/25
-- This story also covered by TechCrunch, Windows Forum - "OpenAI unveils GPT-4.5 ‘Orion,’ its largest AI model yet", Maxwell Zeff, TechCrunch, 2/27/25
-- This story also covered by The Verge, Wired, Bloomberg, VentureBeat, Engadget, ...and OpenAI (video) - "OpenAI CEO Sam Altman says the company is ‘out of GPUs’", Kyle Wiggers, TechCrunch, 2/27/25
- "OpenAI Plots Charging $20,000 a Month For PhD-Level Agents", Stephanie Palazzo and Cory Weinberg, The Information, 3/5/25
-- This story also covered by TechCrunch - "OpenAI’s GPT-4.5 AI model comes to more ChatGPT users," Kyle Wiggers, -- TechCrunch, 3/5/25
B. Microsoft
- "The Inside Story of Microsoft’s Partnership with OpenAI", Charles Duhigg, The New Yorker, 12/1/23
- "Microsoft now sees OpenAI as a competitor in AI and search", Mikael Markander, ComputerWorld, 8/2/24
- "Microsoft taps Three Mile Island nuclear plant to power AI", Rebecca Bellan, TechCrunch, 9/20/24
-- This story also covered by Gizmodo, Reuters, Washington Post, Wall Street Journal, Engadget, GeekWire, - "Inside Microsoft's struggles with Copilot", Ashley Stewart, Busisiness Insider, 11/15/24hCrunch, 1/2/25
- "Microsoft to spend $80 billion in FY’25 on data centers for AI", Mary Ann Azevedo, TechCrunch, 1/2/25
-- This story also covered by Bloomberg, Reuters, ... and Microsoft - "Microsoft makes DeepSeek’s R1 model available on Azure AI and GitHub", Tom Warren. The Verge, 1/29/25
-- This story also covered by TechCrunch, - "Microsoft Says It Has Created a New State of Matter to Power Quantum Computers", Cade Metz, NY Times, 2/19/25
-- This story also covered by Wired, Engadget, Bloomberg, GeekWire, and Microsoft (in Nature) - "Microsoft makes Copilot Voice and Think Deeper free with unlimited use", Tom Warren, The Verge, 2/26/25
- "Microsoft releases a Copilot app for Mac", Tom Warren, The Verge, 2/28/25
-- This story also covered by TechCrunch, Ars Technica - "Exclusive: Microsoft’s AI Guru Wants Independence From OpenAI. That’s Easier Said Than Done", Aaron Holmes, The Information, 3/7/25 ***
-- This story also covered by TechCrunch, Gizmodo, Bloomberg
C. Google
- "Geoffrey Hinton, AI pioneer and figurehead of doomerism, wins Nobel Prize" (physics), Will Heaven, MIT Tech Review, 10/8/94
-- This story also covered by NY Times, Ars Technica,Washington Post, ... and Nobel Prize
-- Note: Dr. Hinton is a former Google employee - "Google AI scientists win Nobel Prize in chemistry", Jess Weatherbed, The Verge, 9/9/24
-- This story also covered by Science, Reuters, VentureBeat, NY Times, ... and Nobel Prize, Nobel Prize interview with Hasabis (video) - "Google’s AI weather prediction model is pretty darn good", Justine Calma, The Verge, 12/7/24
-- This story also covered by TechCrunch, Gizmodo - "Quantum Computing Inches Closer to Reality After Another Google Breakthrough", Cade Metz, NY Times, 12/9/24
-- This story also covered by Bloomberg, Wall Street Journal, Forbes, TechCrunch, AI Revolution (video) ... and Google - "Google’s NotebookLM now lets you talk to its AI podcast hosts", Aisha Malik, TechCrunch, 12/12/24
- "Google launches Gemini 2.0 Pro, Flash-Lite and connects reasoning model Flash Thinking to YouTube, Maps and Search", Carl Franzen, VentureBeat, 2/5/25
-- This story also covered by TechCrunch, Thr Verge ... and Google - "Google Search’s new ‘AI Mode’ lets users ask complex, multi-part questions", Aisha Malik, TechCrunch, 3/5/25
-- This story also covered by The Verge, - "Google’s Gemini now lets you ask questions using videos and what’s on your screen", Ivan Mehta, TechCrunch, 3/3/25
D. Amazon, Meta, Anthropic, Apple, and other Big Tech model
- "Meta unveils a new, more efficient Llama model [3.3 70B]", TechCrunch, Kyle Wiggers, 12/6/24
-- This story also covered by VentureBeat, - "Nvidia’s Top Customers Face Delays From Glitchy AI Chip Racks", Qianer Liu and Anissa Gardizy, The Information, 1/13/25
-- This story also covered by Reuters, TechSpot, Bloomberg (YouTube) - "Anthropic Projects Soaring Growth to $34.5 Billion in 2027 Revenue", Jon Victor and Stephanie Palazzolo, The Information, 2/12/25
-- This paywalled exclusive story also covered by Reuters - "Elon Musk’s xAI adds ‘Big Brain’ reasoning to Grok-3", Jess Weatherbed, The Verge, 2/18/25
-- This story also covered by TechCrunch, Computerworld, Bloomberg, TechDivision (video)
-- Earlier stories about how Musk built the Colossus supercomputer were given insufficient notice by this blog. Here's a recent report -- "How xAI turned a factory shell into an AI ‘Colossus’ to power Grok 3 and beyond", Brian Buntz, R&D World, 2/17/25 - "Amazon Unveils Alexa+, Powered by Generative A.I.", Karen Weise, NY Times, 2/26/25
-- This story also covered by TechCrunch, The Verge, Wired, Gizmodo, Engadget, MacRumors. .. and Amazon, - "Apple shareholders vote to keep its diversity policies", Stephen Nellis, Reuters, 2/25/25
-- This story also covered by TechCrunch, Washington Post, The Guardian,
-- President Trump's angry reaction was covered by Gizmodo.
-- This blog's data driven damnation of all Big Tech DEI programs as despicable, hypocritical, sham operations -- except Apple's commendable efforts -- was featured in the introduction to our 13Feb25 podcast. The BS DEI operations won't be missed. They won't be missed. - "Apple Vows to Build A.I. Servers in Houston and Spend $500 Billion in U.S.", Meaghan TobinTripp Mickle and Jason Karaian, NY Times, 2/25/25
-- This story also covered by Forbes, Washington Post, Reuters - "A.I. Start-Up Anthropic Closes Deal That Values It at $61.5 Billion", Cade Metz, NY Times, 3/3/25
- "Apple Says Some AI-Powered Enhancements to Siri to Be Delayed", Sabela Ojea, Wall Street Journal, 3/5/25 ***
-- This story also covered by TechCrunch, The Verge, 9to5Mac, Bloomberg, Engadget,
E. Agents
Editor's note -- This section replaces the old "Large Lange Model News" section because, nowadays, all AI models are large language models ... except the small models that will continue to be reported in the next section.
Editor's note -- This section replaces the old "Large Lange Model News" section because, nowadays, all AI models are large language models ... except the small models that will continue to be reported in the next section.
- "Anthropic’s agentic Computer Use is giving people ‘superpowers", Taryn Plumb, VentureBeat, 10/24/24
-- This story also covered by Wired, TechCrunch, Bloomberg, Corbin Brown's demo (YouTube video) ... and Anthropic - Other recent announcements of agents
-- Salesforce (Bloomberg, Axios) ... and Salesforce
-- Microsoft (Bloomberg, ZDNet) ... and Microsoft - "Microsoft’s new Magentic-One system directs multiple AI agents to complete user tasks", Emilia David, VentureBeat, 11/5/24
- "Google Unveils A.I. Agent That Can Use Websites on Its Own", Cade Metz and Nico Grant, NY Times, 12/11/24
- "OpenAI’s Operator Lets ChatGPT Use the Web for You", Will Knight, Wired, 1/23/25
-- This story also covered by TechCrunch, The Verge, VentureBeat, Bloomberg, ... and OpenAI
F. Small Language Model (SLM) news + Open Source
Editor's note -- News reports are classified in this category if and only if they explicitly refer to models with less than 70 billion (70B) parameters. The pace of development should heat up as small, fast models are hooked up to large models and as the competition to become the dominant self-contained AI model on smart phones intensifies.
Editor's note -- News reports are classified in this category if and only if they explicitly refer to models with less than 70 billion (70B) parameters. The pace of development should heat up as small, fast models are hooked up to large models and as the competition to become the dominant self-contained AI model on smart phones intensifies.
- "Microsoft releases Phi-2, a small language model AI that outperforms Llama 2, Mistral 7B", Carl Franzen, VentureBeat, 12/12/23
-- This story also covered by ZDNet, TechRepublic, Medium, Computerworld ... and Microsoft - "Allen Institute for AI releases ‘truly open source’ LLM to drive ‘critical shift’ in AI development", Sharon Goldman, VentureBeat, 2/1/24 ... OLMo 7B parameters
-- This story also covered by TechCrunch ... and Allen Institute - "Google goes 'open AI'" with Gemma, a free, open-weights chatbot family", Benj Edwards, Ars Technica, 2/21/24 ...7B and 28B
-- This story also covered by Reuters, Fast Company, Hugging Face, Fortune, NY Times ... and Google - "Meta releases Llama 3, claims it’s among the best open models available", Kyle Wiggers, TechCrunch, 4/18/24
-- Meta's announcement of Llama 3 ➡ HERE
-- Another Big Tech's opinion: "Elon Musk’s ‘not bad’ review thrusts Meta’s Llama 3 AI into spotlight", Michael Nuñez, VentureBeat, 4/19/24 - "Microsoft Makes a New Push Into Smaller A.I. Systems", Karen Weise and Cade Metz, NY Times, 4/23/24
-- This story also covered by The Verge, Engadget, Ars Technica, ZDNet ... and AI Revolution YouTube Video... and Microsoft ➡ Microsoft benchmarks pdf - "Why Apple is taking a small-model approach to generative AI", Brian Heater, TechCrunch, 6/11/24
-- Apple's small language model strategy also discussed by VentureBeat, Wired, The Verge, ... and Apple (SLM) plus Apple ("private cloud compute" extension of SLM) - "Meta just beat Google and Apple in the race to put powerful AI on phones", Michael Nuñez, VentureBeat, 20/24/24
- "Meta makes its MobileLLM open for researchers, posting full weights", Carl Franzen, VentureBeat, 10/31/24
- "Microsoft makes powerful Phi-4 model fully open-source on Hugging Face", Carl Franzen, VentureBeat, 1/8/25
- This story also covered by Hugging Face, AI News, ... and Microsoft - "DeepSeek’s new AI model appears to be one of the best ‘open’ challengers yet", Kyle Wiggers, TechCrunch, 1/26/25
-- This story also covered by Nature, VentureBeat, ZDNet, Business Insider, ... and DeepSeek - "DeepSeek claims ‘theoretical’ profit margins of 545%", Anthony Ha, TechCrunch, 3/1/25
-- This story also covered by Bloomberg,
G. Public policy and legal considerations
- "California Passes Election ‘Deepfake’ Laws, Forcing Social Media Companies to Take Action", Stuart A. Thompson, NY Times, 9/17/24
-- This story also covered by Washington Post, Fortune, - "Canada Accuses Google of Creating an Ad Tech Monopoly", Ian Austen, NY Times, 11/28/24
- "UK tribunal green-lights $2.7B Facebook collective action antitrust lawsuit", Ingrid Lunden, TechCrunch, 12/5/24
- "Lawsuit says Mark Zuckerberg approved Meta's use of pirated materials to train Llama AI'', mariella moon, Engadget, 1/10/25
-- This story also covered by TechCrunch, Mashable, Reuters, - "EU puts out guidance on uses of AI that are banned under its AI Act", Natasha Lomas, TechCrunch, 2/4/25
- "US lawmakers want DeepSeek banned from government devices", will shanklin, Engadget, 2/6/25
-- This story also covereed by Wall Street Journal, Forbes, Reuters, ... and House.gov - "OpenAI’s board ‘unanimously rejects’ Elon Musk’s offer to buy the company", Emma Roth, The Verge, 2/14/25
- This story also covered by TechCrunch, CNN. Wall Street Journal and ... OpenAI Board (post on X) - "AI Action Summit", Wikipedia, 2/12/25
-- "The Artificial Intelligence (AI) Action Summit was held at the Grand Palais, Paris, France, from 10 to 11 February 2025.[ The summit was co-chaired by France and India."
-- Outcome
"At the summit, the US and UK refused to sign a declaration on "inclusive and sustainable" AI, which was supported by 60 countries, including France, China, and India."
... Outcome discussed in "Vance tells Europeans that heavy regulation could kill AI", Jeffrey Dastin and Ingrid Melander, Reuters, 2/11/25 ... and NY Times, AP News - "Judge Denies Musk’s Request to Block OpenAI’s For-Profit Plan", Cade Metz, NY Times, 3/5/25 ***
-- This story also covered by Reuters, CNN, The Guardian, Bloomberg - "Justice Dept. Doubles Down on Request to Break Up Google", David McCabe, NY Times, 3/7/25 ***
-- This story also covered by Wired, Washington Post, Engadget,
H. Misc = Opinions, other news, rumors, long reads (max = 15)
- Big Tech Quarterly reports ...
-- Microsoft ... "Microsoft shares slide as cloud forecast, AI spending disappoint", Stephen Nellis and Deborah Mary Sophia, Reuters, 1/31/2
... This story also coveed by Bloomberg, CNBC, ... and Microsoft
-- Meta ... "Meta gains after Zuckerberg predicts ‘really big year’ in AI", Riley Griffin, Fortune, 1/29/25
... This story also coveed by Wall Street Journal, Reuters, CNBC, ... and Meta
-- Apple .. "Apple shares rise after positive sales outlook signals iPhone recoverya", Stephen Nellis, Reuters, 1/30/25
... This story also covered by The Verge, CNBC, TechCrunch, Bloomberg, Investopedia .. and Apple
-- Alphabet (Google) ... "Alphabet plans massive capex hike, reports cloud revenue growth slowed", Kenrick Cai and Deborah Mary Sophia, Reuters, 2/5/25
... This story also covered by The Verge,Wall Street Journal, Business Insider, Bloomberg, CNBC, ... and Alphabet (pdf)
-- Amazon ... "Amazon shares drop as cloud growth, sales forecast lag", cBy Greg Bensinger and Deborah Mary Sophia, Reuters, 2/6/25 ***
.. This story also covered by TechCrunch, Business Insider, GeekWire, Wall Street Journal, CNBC, ... and Amazon - "Eyeing Higher Margins, AWS and Other Cloud Providers Embrace DeepSeek", Jon Victor, The Information, 2/4/25
- "What DeepSeek? Big Tech Keeps Its A.I. Building Boom Alive.", Karen Weise, NY Times, 2/8/25
- "Thinking Machines Lab is ex-OpenAI CTO Mira Murati’s new startup", Kyle Wiggers, TechCrunch, 2/18/25
-- This story also covered by Reuters, NY Times, Forbes - "Did xAI lie about Grok 3’s benchmarks?", Kyle Wiggers, TechCrunch, 2/22/25
- "DeepMind CEO Says AGI Definition Has Been ‘Watered Down’", Shirin Ghaffary, Bloomberg, 2/27/95
- "Nvidia Stock Wobbles On Q4 Details. Blackwell Sales Look Strong, For Now.", Patrick Seitz, Investors Business Daily, 2/26/25
-- This story also covered by Business Insider, Reuters, CNBC, ... and Nvidia - Magafication ... and the beat goes on
-- "Anthropic quietly removes Biden-era AI policy commitments from its website", Kyle Wiggers, TechCrunch, 3/5/25
-- "AT&T Drops Pronoun Pins, Cancels Pride Programs in DEI Unwind", Kelcee Griffis and Jeff Green, Bloomberg, 3/7/25
-- "Salesforce cuts diversity hiring goals, joining Meta and Google in scaling back DEI initiatives", Effie Webb, Business Insider, 3/6/25 - "Exclusive: Microsoft, Amazon, and Google Race to Use AI to Drive Sales Gains", Aaron Holmes, The Information, 3/4/25
I. Language model flaws, hacks, and remedies
- "Why Two Models Are Better Than One", Stephanie Palazzolo, The Information, 3/28/24
- "Many-shot jailbreaking", Anthropic, 4/2/24
-- This story also covered by video on TechCrunch, - "DeepMind researchers discover impressive learning capabilities in long-context LLMs", Ben Dickson, VentureBeat, 4/24/24
- "[Gen] AI Is a Black Box. Anthropic Figured Out a Way to Look Inside", Steven Levy, Wired, 5/22/24
-- This story also covered by Fast Company, NY Times, Time, ... and Anthropic - "OpenAI Offers a Peek Inside the Guts of ChatGPT", Will Knight, Wired, 6/6/24
-- This story also covered by OpenAI Blog Note ... and underlying OpenAI Research Paper (pdf) co-authored by Dr. Ilya Sutskever and Jan Leike (before they resigned from OpenAI) and others on the recently disbanded “superalignment” team - "MIT researchers release a repository of AI risks", Kyle Wiggers, TechCrunch, 8/14/24
-- This story also covered by VentureBeat, CSO, ZDNet, MIT Tech Review ... and AI Risk Repository - "ChatGPT caught lying to developers: New AI model tries to save itself from being replaced and shut down", Economic Times, 12/9/24
-- This story also covered by Business Insider, ... and Apollo Research (pdf) - "New Anthropic study shows AI really doesn’t want to be forced to change its views", Kyle Wiggers, TechCrunch, 12/18/24
-- This story also covered by Anthropic - "ChatGPT search tool vulnerable to manipulation and deception, tests show", Nick Evershed, The Guardian, 12/24/24
- "Buyer beware: OpenAI’s o1 reasoning model is an entirely different beast", Anthony Diamond, GeekWire, 12/24/24
- "DeepSeek’s Safety Guardrails Failed Every Test Researchers Threw at Its AI Chatbot", Matt Burgess and Lily Hay Newman, Wired, 1/31/25 *
-- Related story ... "Pentagon scrambles to block DeepSeek after employees connect to Chinese servers", Charles Rollet, TechCrunch, 1/30/25
-- Related story ... "UK Warns DeepSeek Users of Data Risks But Won’t Ban It Yet", Ellen Milligan, Ryan Gallagher, and Alex Wickham, Bloomberg, 1/31/25
J. Basics
- "Watch an A.I. Learn to Write by Reading Nothing but Shakespeare or Harry Potter or Jane Austen or Star Trek or Moby Dick", Aatish Bhatia, NY Times, 4/27/23
- "What Is a Large Language Model, the Tech Behind ChatGPT?", Kurt Muehmel, Data Iku, 6/7/23
- "Textbooks Are All You Need II: phi-1.5 technical report", Yuanzhi Li, Sébastien Bubeck, Ronen Eldan, Allie Del Giorno, Suriya Gunasekar, Yin Tat Lee, Microsoft Research, September 2023
- "Phi-2: The surprising power of small language models", Mojan Javaheripi and Sébastien Bubeck , Microsoft Research, 12/12/24
- WTF are diffusion transformers
-- "Stable Diffusion 3.0 debuts new diffusion transformation architecture to reinvent text-to-image gen AI", Sean Michael Kerner, VentureBeat, 2/22/24
-- "Diffusion transformers are the key behind OpenAI’s Sora — and they’re set to upend GenAI", Kyle Wiggers, TechCrunch, 2/28/24
-- "Diffusion Transformer Explained", Mario Namtao Shianti Larcher, Towards Data Science, 2/28/24 - "Compact Guide to Large Language Models", DataBricks, 2023 ... Link to form that enables download of their eBook (9 pages) pdf file.
- "Large language models can do jaw-dropping things. But nobody knows exactly why.", Will Douglas Heaven, MIT Tech Review, 3/4/24
- "Let's learn about artificial intelligence -- A series about AI, machine learning, ChatGPT, and more", Mark Wiemer, Medium, 3/21/23
- "A Glossary for the AI Revolution", Seth Fiegerman and Nate Lanxon ... Illustrations by Mathieu Labrecque, Bloomberg, 10/4/24
- "Transformers (how LLMs work) explained visually | DL5, 3Blue1Brown, ",
___________________________________
Links to some back issues
No comments:
Post a Comment
Your comments will be greatly appreciated ... Or just click the "Like" button above the comments section if you enjoyed this blog note.