AI Pulse News Roundup (December 2024 Edition)

platypus · December 1, 2024, 9:19am

Welcome, everyone, to the first-ever monthly AI Pulse News Roundup Thread!

When we began the AI Pulse News Roundup, our goal was to spark engaging discussions about the latest developments in AI. What we learned along the way is that the most vibrant conversations happened as the news unfolded, not just when the Roundup was released.

So, starting this month, we’re doing things differently. This new monthly format is your space to:

Share breaking news about AI research, applications, policies, and more.
Discuss key developments in real time with the community.
Explore and archive the highlights of the month.

Here’s how it works:

Post any news link, big or small. From groundbreaking papers to policy shifts, product launches, or ethical debates—everything is welcome.

Join the conversation. Ask questions, share insights, and debate the implications with fellow members.

Review the big picture. By the end of the month, this thread will be a snapshot of December’s most important moments in AI.

Let’s make this new format a success. Got a story to share or a topic that’s caught your attention? Post it below and let’s start the conversation!

Happy December, and let’s dive into the news!

platypus · December 1, 2024, 9:29am

I can kick things off with some LLM agent research news.

Yesterday I found a paper from the University of Maryland and Adobe Research, titled “DynaSaur: Large Language Agents Beyond Predefined Actions”. In a nutshell, it’s an unconstrained method of creating agent actions. It is roughly summarized in the diagram below. The standard way of building LLM agent networks (e.g. using Swarm) is to define precise actions ahead of time, and generally the sequence of steps as well; the LLM’s job is then to pick the right predefined action given a query or some input. Here the researchers propose a more dynamic and unconstrained way of doing this. It would be interesting to see how this works in a production setting!
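To make the contrast concrete, here is a minimal Python sketch. It is not the paper's code: all names (`PREDEFINED_ACTIONS`, `run_fixed_agent`, `DynamicAgent`, `count_words`) are hypothetical, no LLM is called, and plain strings stand in for what a model would choose or generate. It assumes the dynamic variant means the model writes new actions as code at run time and keeps them for reuse.

```python
from typing import Callable, Dict

# Fixed-action agent: the full action set is defined ahead of time.
PREDEFINED_ACTIONS: Dict[str, Callable[[str], str]] = {
    "upper": lambda text: text.upper(),
    "reverse": lambda text: text[::-1],
}


def run_fixed_agent(choice: str, query: str) -> str:
    """Run a predefined action; `choice` stands in for the LLM's pick."""
    if choice not in PREDEFINED_ACTIONS:
        raise KeyError(f"no predefined action named {choice!r}")
    return PREDEFINED_ACTIONS[choice](query)


class DynamicAgent:
    """Hypothetical agent whose action library grows at run time."""

    def __init__(self) -> None:
        self.actions: Dict[str, Callable[[str], str]] = {}

    def add_action(self, name: str, source: str) -> None:
        # `source` stands in for code an LLM would generate.
        # exec() on untrusted code is unsafe; a real system would sandbox it.
        namespace: dict = {}
        exec(source, namespace)
        self.actions[name] = namespace[name]

    def run(self, name: str, query: str) -> str:
        return self.actions[name](query)


# Pretend the model wrote this function because no existing action fit.
generated = "def count_words(text):\n    return str(len(text.split()))\n"

agent = DynamicAgent()
agent.add_action("count_words", generated)

print(run_fixed_agent("upper", "hello agents"))         # HELLO AGENTS
print(agent.run("count_words", "hello dynamic agents"))  # 3
```

The fixed agent fails on anything outside its table, while the dynamic one can add `count_words` on demand and reuse it later. The trade-off is that executing generated code needs sandboxing, which is probably the hard part in production.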

PaulBellow · December 1, 2024, 8:59pm

New model “Olympus” from Amazon…

Amazon Introduces “Olympus,” a Generative AI Model with Multi-Modal Capabilities

Amazon has developed a new generative AI model, code-named “Olympus,” which can process images, videos, and text, as reported by The Information. This advancement reduces Amazon’s reliance on the AI startup Anthropic, whose Claude chatbot is a key component of Amazon Web Services (AWS).

Key Features of Olympus:

Multi-Modal Understanding: Processes text, images, and videos to enable advanced scene recognition.

Enhanced Search Capabilities: Users can search for specific moments, such as a “winning basketball shot,” using natural language prompts.

Strategic Implications:

Competition in Generative AI: Olympus positions Amazon as a stronger competitor against Google, Microsoft, and OpenAI in the generative AI space.

Reduced Dependency: With Olympus, Amazon becomes less dependent on Anthropic, despite recent investments totaling $8 billion into the AI startup.

Upcoming Announcement:

Amazon is expected to unveil Olympus during the AWS re:Invent conference next week, highlighting its commitment to innovation in generative AI.

This development reflects Amazon’s ongoing efforts to close the perceived gap with leading AI developers while enhancing its generative AI offerings across e-commerce and cloud services.

platypus · December 1, 2024, 9:17pm

Interesting! MS has its own models so I suppose why not Amazon. I wonder how Olympus compares with Claude?

Side note: I was on Mt Olympus in July this year. Amazing mountain, and the Enipeas gorge is beautiful.

PaulBellow · December 1, 2024, 9:33pm

Yeah, what was interesting for me is that Amazon might not rely on Anthropic as much now? Amazon has a TON of datacenters and logistics, so who knows. Big beasts like them are slow to move, but when they get going, watch out! haha

PaulBellow · December 1, 2024, 9:37pm

I wonder if we should make the first post a Wiki to add a long list of links/headlines?

Also…

OpenAI has filed to trademark its “o1” reasoning models, with an intriguing twist: an earlier trademark application was quietly made in Jamaica months before the models were officially announced. (TechCrunch)

sps · December 2, 2024, 8:33am

IIRC, Amazon also just became the training compute provider for Anthropic. The interesting part of this deal is that Anthropic will use Amazon’s own custom training compute, AWS Trainium, for training its models, and Inferentia for serving them.

PaulBellow · December 2, 2024, 4:17pm

platypus · December 2, 2024, 4:49pm

Very creative naming, Trainium, Finetunium, Promptium, Agentium…

Munna23 · December 2, 2024, 6:17pm

Trainium Finetunium sounds more like a spell lol

platypus · December 2, 2024, 6:28pm

Agentium Leviosa? Levitating agents? Drones?

PaulBellow · December 3, 2024, 4:38pm

gaming is about to change…

PaulBellow · December 3, 2024, 4:40pm

We’ve likely all heard about ChatGPT ads by now, but here’s an answer from Sam Altman earlier this year…

PaulBellow · December 3, 2024, 4:52pm

Smart Browser, you say?

PaulBellow · December 3, 2024, 5:36pm

Speaking of Bezos…

anon10827405 · December 3, 2024, 5:43pm

AI-focused browsers are going to be strange IMO.

Are they going to help automation? Are they just LLM wrappers that can see the page? Will they have constant surveillance? Can they interact with the DOM? What benefit does it bring versus using the built-in AI assistant?

For example, I use Google Workspace for practically everything, and Gemini is getting more useful by the day because it has very specific, fine-tuned knowledge and tooling. Will an AI browser be able to interact with the AI assistants built into these domains?

I know OpenAI is also apparently building this, or maybe this is what OpenAI is building with TBC. It’s gonna be a hard no for me, but maybe that’s because I can easily build the automations and AI tooling myself, and I can already rely on the specialized AI assistants in the platforms I use.

One thing that’s surprised me is how little I have heard regarding the ChatGPT app, and Computer Use by Anthropic. Maybe I’m just not in the right social groups but I really was expecting to see some epic workflows created using these tools. Yet, all I’ve seen has been novelty, and silence.

It’s been very exciting watching companies shove AI into every single little crevice they can find. Reminds me of the early dotcom days.

Fun exaggeration:

Be me

Turn on computer by asking ChatGPT advanced voice mode

Sit down on LLM chAIr, it doesn’t talk, crap, forgot to pay my AI-Company #6 bill. No matter. Now it doesn’t swivel and the arm rests don’t work.

UbAI Ubuntu turns on. Asks me if I’d like to hear The Independent or Breitbart. I get 1,000 free tokens if I do

Push away the ads, I mentioned a headache and now I keep getting doctors’ AI trying to call me and their ads sent directly to my operating system. Had to pay extra for my AI phone to handle the calls and ensure no real calls are missed. Receiving 12 calls per hour

Instinctively try to open up Dia Browser with my mouse, but I forgot that UbAI gives me 10,000 free tokens for taking part in an experiment to not have any mouse or keyboard. “Open browser” I say. OpenAI Advanced Voice mode is heard in the background saying “Sorry, I can’t do that”. Whoops.

“You spent a lot of time yesterday looking at goats, did you know that George the therapist specializes in goat therapy? His AI assistant is on the line waiting to speak, we had a discussion and I have all the plans ready, it cost 10,000 tokens”. “No, thank you”. “That’s okay. Unfortunately I just finished speaking with DrAIke from DrAIke Insurance and this will cost an extra 100 tokens per month if it’s not managed”. “That’s fine”.

Dia browser finally opens. “Please use eye-ball scanner to access the WWW as a human. You will be credited 500 tokens for the inconvenience”. Ugh.

“Okay, let’s check my e-mails”. “You are not signed up for mAIl mail. Would you like to?”. “No no no I meant GMail”. “An AI assistant has been found in the current website. Communicating… Please wait…”

“I’m sorry, you have run out of tokens. Would you like to watch some ads to continue?”

“No… Thank you”

Obviously a completely overblown exaggeration. But it’s becoming almost annoying how everyone is shoving AI into everything so that they can justify that sweet monthly fee & training data.

EricGT · December 4, 2024, 2:37pm

From Hacker News:

My son (9 yrs old) used plain JavaScript to make a game, and wants your feedback (comments)

https://www.armaansahni.com/game/

Why this relates to AI:

https://www.armaansahni.com/how-i-coded-a-game-using-ai/

PaulBellow · December 4, 2024, 5:15pm
