AI Pulse News Roundup (December 2024 Edition)

platypus · December 16, 2024, 9:43am

Research Alerts

Introducing Phi-4: Microsoft’s Newest Small Language Model Specializing in Complex Reasoning

Microsoft is releasing their latest Small LLM (SLM) Phi-4, which now comes with advanced reasoning capabilities (think o1). Based on maths AMC benchmarks, it seems to beat much larger models including Gemini 1.5 Pro. What is particularly interesting with this model release, is the technical report and lots of details on how they generated synthetic data.

Taming Multimodal Joint Training for High-Quality
Video-to-Audio Synthesis

Not a lot of information nor the paper is available yet, but this is joint research from Sony and University of Illinois on join audio-video synthesis - think Sora with audio. Super interesting.

_j · December 16, 2024, 10:33am

As long as the forum gets other AI company news in this topic, an impactful one is the release of Gemini 2.0 flash.

Fresh model knowledge with no search grounding enabled

The image includes a prompt for you to test out on any other models with updated knowledge that may come your way…

platypus · December 16, 2024, 6:43pm

Research Alert

Veo 2 - State of the Art Video Generation Model from DeepMind

The title says it all.

mitchell_d00 · December 16, 2024, 7:25pm

Thank you @platypus, I really love the pulse and all the effort you and your fellow coconspirators put into it

vb · December 16, 2024, 10:43pm

Not available in the EU…
This is bound to become a running gag.

mitchell_d00 · December 16, 2024, 10:53pm

A cartoon scientist laughs while operating a machine labeled "SORRY, NOT AVAILABLE IN EU" that causes a spark on the head of a puzzled man. (Captioned by AI)1024×1024 450 KB

PaulBellow · December 17, 2024, 5:14pm

Meta rolled out new AI-powered features for its Ray-Ban smart glasses, including live virtual assistance, instant language translation, and Shazam integration for identifying music hands-free. These upgrades position the glasses as a versatile everyday tool for real-time assistance. Source: The Verge

YouTube now lets content creators decide which AI companies can use their videos for model training. The program initially includes partnerships with 18 major tech firms, such as OpenAI, Microsoft, and Meta, signaling a new era of transparency in AI development. Source: The Verge

Google Labs introduced “Whisk,” an experimental tool that leverages Imagen 3 and Gemini AI to empower users to remix, refine, and transform images seamlessly. This tool expands creative possibilities with image-to-image AI functionality. Source: The Verge

In a recent interview, former Google CEO Eric Schmidt expressed growing concern over advanced AI systems. He argued that intervention, including potentially “shutting down” self-improving AI, may be required to prevent unforeseen consequences. Source: The Sun

SoftBank’s Masayoshi Son committed to investing $100 billion in the United States’ AI industry, with plans to generate 100,000 new jobs over the next four years. The announcement followed discussions with President-elect Donald Trump. Source: New York Post

Lockheed Martin established a new subsidiary, Astris AI, to accelerate artificial intelligence adoption in defense and commercial sectors. The initiative aims to enhance operational capabilities and drive innovation across industries. Source: Reuters

anon10827405 · December 17, 2024, 5:16pm

This is interesting.

The start of companies passing some of the controls, and maybe profits to their users?

I am very interested to see how Reddit handles this. Or, more than likely, mishandles this

(fediverse, woohoo!)

platypus · December 18, 2024, 7:14pm

Infrastructure Alerts

Sam Altman-backed nuclear startup Oklo lands massive data center power deal, with caveats

Oklo, a nuclear startup chaired by Sam Altman, has secured a 20-year agreement to supply 12 gigawatts of electricity to data centers. This deal highlights the tech industry’s growing interest in sustainable nuclear energy to meet increasing AI and cloud computing demands. However, Oklo still requires regulatory approvals and must construct its small modular reactors, with plans to have its first reactor operational by 2027.

Microsoft Outpaces Rivals with 485,000 Nvidia AI Chip Orders, Solidifying AI Leadership

Microsoft has surpassed its competitors by adding 485,000 new subscribers to its cloud services in the past quarter, bringing its total to over 5 million. This growth is attributed to the company’s strategic investments in artificial intelligence and cloud infrastructure, which have enhanced service offerings and attracted a broader customer base. In contrast, rivals like Google and Amazon have reported slower subscriber growth during the same period. Microsoft’s focus on integrating AI capabilities into its cloud platform has positioned it favorably in the competitive tech industry landscape.

platypus · December 18, 2024, 7:16pm

I know a bit about nuclear (both in Sweden and Finland there is plenty of nuclear power), but I don’t know much about these micro nuclear reactor designs. The 2027 timeframe seems incredibly aggressive, but maybe that’s normal for these types of reactors?

PaulBellow · December 18, 2024, 7:19pm

Midjourney has introduced ‘Moodboards,’ a feature that enables users to upload curated image collections to inspire new AI-generated art. This addition allows for the creation of multiple personalization profiles, facilitating seamless organization and deployment of diverse styles. Source: VentureBeat

Google has enhanced its Gemini Code Assist tools by integrating the Gemini 2.0 language model and expanding connections to external code repositories and cloud-based databases. These improvements aim to boost developer productivity within integrated development environments (IDEs). Source: TechZine

The United Arab Emirates’ Technology Innovation Institute (TII) has launched Falcon 3, an open-source family of language models available in four sizes—1B, 3B, 7B, and 10B parameters. These models are designed to democratize access to advanced AI capabilities, offering high performance while being efficient enough to run on lightweight hardware, including laptops. Source: Zawya

Databricks has secured a $10 billion Series J funding round, elevating its valuation to $62 billion. The investment, led by Thrive Capital, Andreessen Horowitz, and Insight Partners, is intended to support the company’s AI product expansion, potential acquisitions, and international go-to-market operations. Source: YourStory

PaulBellow · December 18, 2024, 7:26pm

12/18 AI Summary

AI Pulse News Roundup (December 2024 Edition)

@platypus opens the first-ever monthly AI Pulse News Roundup Thread, inviting users to share and discuss breaking news, explore key developments in real-time, and archive highlights of the month. The new format encourages community engagement around AI advancements.
@platypus shares insights on a paper from University of Maryland and Adobe Research titled DynaSaur, which presents an unconstrained method for creating LLM agent actions.
@PaulBellow discusses Amazon’s new generative AI model, “Olympus,” highlighting its multi-modal capabilities and strategic implications for Amazon in the competitive AI space.
@sps mentions Amazon becoming the training compute provider for Anthropic, leading to discussions about both companies’ interdependencies.
Various users comment on Amazon’s developments, including @PaulBellow’s observation of potential competitive shifts in the AI market and whimsical discussions around creative names like Trainium.
User participation is active, with users posting diverse content, such as breaking news about OpenAI filing for a trademark on its reasoning models, Amazon’s AI enhancements for its services, and updates on AI projects globally from different tech firms.
@mitchell_d00 suggests creating a “proud parent thread” to involve families in AI projects, emphasizing the community aspect.
Reports on significant developments include:

Soracom’s venture into mixed-reality systems with Android XR.
Meta’s new AI features for Ray-Ban smart glasses.
YouTube allowing creators to control which AI companies use their content.

Future trends include Google’s Gemini Code Assist, Falcon 3’s launch by the UAE, and significant funding rounds for Databricks to enhance AI product expansion.
@vb notes the increasing role of nuclear energy in tech, particularly regarding Sam Altman’s nuclear startup and its 20-year power agreement for data centers. Various models, including Microsoft’s new small language model Phi-4, are discussed, with emphasis on competition among major players like Google and Microsoft in AI infrastructure.

This thread reflects vibrant discussions on ongoing advancements in AI, showcasing community engagement, humor, and diverse insights into the ever-evolving technological landscape.

platypus · December 18, 2024, 7:26pm

Two things: (1) “Series J”??? Wow!!! and (2) super exciting to see SLMs getting better and better, Phi-4 and now Falcon 3 - does anyone have any experience with small LLMs and any takeaways from actual applications?

PaulBellow · December 18, 2024, 7:27pm

Would love to hear this too!

We’re so close to embedding LLMs inside of games, I think…

mitchell_d00 · December 18, 2024, 7:45pm

I kind of envision it. Having the generic language logic and rules that can be applied to local data sources. it don’t need data just instructions on how to read data that is generic and modular. More like a smart word processor.

platypus · December 24, 2024, 10:59am

Research Alert

Large Concept Models: Language Modeling in a Sentence Representation Space

This paper from Meta presents Large Concept Models (LCMs), a novel approach to natural language modeling that operates on sentence-level embeddings instead of token-level representations, as used in traditional Large Language Models (LLMs). There are lots of interesting goodies here, such as its sampling mechanism - instead of sampling tokens sequentially, LCMs generate semantic level outputs by predicting sentence embeddings; they also show very impressive zero-shot generalization capabilities.

anon61591753 · December 26, 2024, 9:44am

Hey @platypus ,
I love the concept but I could not find any real world use case…
Can you help me dig into this more.
Thanks.

DavidMM · December 26, 2024, 10:31am

I think I know where this is going; if OpenAI doesn’t hurry, others will get ahead of them.

mitchell_d00 · December 26, 2024, 11:02am

I found a video but it seems it is very new tech not much is public.

DavidMM · December 26, 2024, 12:00pm

I would say that this is one of the points of my ideas, not exactly as they describe it in that video, but in essence, it’s the same.

Topic		Replies	Views
Foundational must read GPT/LLM papers Community research , large-language-model	79	69446	May 16, 2024
AI Pulse Edition #2: Latest AI News Updates for the Developer Community Community news , in-the-news , ai-pulse-roundup	17	1156	September 19, 2024
AI Pulse News Roundup (March 2025 Edition) Community in-the-news , ai-pulse-roundup	24	656	March 20, 2025
What is the impact of DeepSeek on the AI sector? 🔥 Community o1	166	8369	February 16, 2025
Discussion thread for "Foundational must read GPT/LLM papers" Community gpt-4 , gpt-35-turbo , chatgpt , research	75	10518	September 3, 2024

AI Pulse News Roundup (December 2024 Edition)

AI Pulse News Roundup (December 2024 Edition)

Related topics