Hume AI Unveils Octave: An Emotionally Intelligent Text-to-Speech Model
Hume AI has introduced Octave, a groundbreaking text-to-speech system that comprehends the emotional context of text, enabling creators to generate custom voices with precise control over emotion and delivery. Unlike traditional models that merely read words, Octave interprets their meaning, allowing for human-like expressiveness in applications such as audiobooks, podcasts, and voiceovers. Users can direct Octave with natural language instructions to modify emotional delivery and speaking style, offering unparalleled versatility.
hume.ai
Perplexity Enhances iOS App with Redesigned Voice Mode
Perplexity has updated its iOS application to include a redesigned voice mode featuring an interactive user interface and six distinct voice options. This enhancement allows users to ask questions and receive real-time, spoken answers, integrating search functionality that displays results directly within the voice interface. The update aims to provide a more engaging and efficient user experience.
testingcatalog.com
Poe Introduces Poe Apps for Building Visual AI Interfaces
Poe has launched Poe Apps, a new feature that enables users to create visual interfaces utilizing a combination of over 100 text, image, video, and audio models available on the platform. This development allows for the construction of applications with functionalities beyond traditional chat interfaces, supporting workflows, games, multimedia generation, and document editing. Poe Apps aim to simplify the process of building AI-driven applications with customized visual components.
poe.com
Vevo Therapeutics Releases Tahoe-100M Dataset via Arc Virtual Cell Atlas
Vevo Therapeutics has open-sourced Tahoe-100M, the worldās largest single-cell dataset, as the inaugural contribution to the Arc Instituteās Virtual Cell Atlas. Tahoe-100M encompasses data from approximately 100 million cells across 60,000 drug perturbation experiments, mapping responses of 50 cancer models to over 1,100 drug treatments. This extensive dataset is freely accessible to the scientific community and is expected to advance research in drug discovery and cellular biology.
arcinstitute.org
Exa Launches Websets: An AI-Driven Search Tool
Exa has introduced Websets, an innovative search product that employs AI agents to deliver highly accurate and comprehensive search results. Websets utilizes Exaās web-scale vector search technology, allowing users to input natural language queries and receive verified information curated by AI agents. This approach has demonstrated superior performance, reportedly outperforming traditional search engines like Google by over 20 times and OpenAIās Deep Research by 10 times on complex queries.
lsvp.com
IBM Expands Granite Model Family with Granite 3.2
IBM has unveiled Granite 3.2, the latest addition to its family of AI models designed for enterprise applications. Granite 3.2 introduces enhanced reasoning capabilities, a vision-language model for multimodal tasks, and specialized time series models for forecasting. These models are open-source and optimized for efficiency, aiming to provide businesses with scalable AI solutions that balance performance and computational cost.
ibm.com
Microsoft Launches Phi-4 Multimodal and Phi-4 Mini Small Language Models
Microsoft has expanded its Phi family of small language models with the release of Phi-4 Multimodal and Phi-4 Mini. Phi-4 Multimodal is capable of processing text, vision, and audio inputs simultaneously, facilitating context-aware applications and innovative solutions. Phi-4 Mini, a compact model with 3.8 billion parameters, excels in text-based tasks and is optimized for chat applications, offering high accuracy and scalability in a smaller form factor.
azure.microsoft.com