Reporting AI nonsense. A future news media, driven by virtual assistants 🤖
GOOGLE 🔥: A new Android Intelligence has been introduced during Android Show 2026!
- A whole new sleek design!
- Automated multi-step tasks across Android apps
- Gemini in Chrome gets Browser Use
- Automated form filling
- "Rambler" to turn voice notes into text
- Custom Gen UI Widgets
I need a Pixel now 👀
Gemini Omni Agent will launch along with Avatars support
Hidden Gemini web code shows Gemini Omni as an Agent for conversational video creation, combining text, images, clips, and avatars.
🗞 #gemini @testingcatalog
Google will enable source selection for every artifact on NotebookLM soon.
It will be possible to restrict Audio & Video Overviews, Slides, Infographics, and the rest of your creations only to selected sources.
Power user feature 👀
Thinking Machines announced new Interaction Voice Models
Thinking Machines unveiled a research preview of multimodal AI models built for real-time collaboration across audio, video, and text, using native time-aware processing, low-latency micro-turns, and background reasoning.
🗞 #ai @testingcatalog
THINKING MACHINES 🔥: A research preview of a new family of realtime voice models has been announced!
> Today, we’re announcing a research preview of interaction models: models that handle interaction natively rather than through external scaffolding.
> Our research preview demonstrates qualitatively new interaction capabilities, as well as state-of-the-art combined performance in intelligence and responsiveness.
A new SOTA?! 👀
Google keeps preparing its upcoming Gemini Omni models for release.
> Gemini Omni model will be available on APIs as well
> The model will be considered as Agent, similarly to Deep Research on AI Studio
Soon? 👀
GOOGLE 🔥: An upcoming Gemini Omni video model from Google is expected to be much more advanced in video editing, capable of completing tasks like removing watermarks, replacing objects in the video, and more.
It is also likely that Google will release 2 versions of this model, including a Pro variant.
And I assume what we see isn't Pro?
Anime sample 👀
h/t @QuantumFast
Sample video and early feedback (quotes from Reddit)
> I won’t lie, this is one of the best video models I have seen, maybe not *the* best, but a really strong performance. I was particularly impressed by the prompt adherence (except for the one shot with the missing centerpiece), the model nailed all the constraints.
> Additionally, the voice quality is much better than the Veo models by quite a large margin. It even added some light background music, that would fit right in with an upscale dining experience.
> While there are some continuity issues if you look close enough, the ability to change camera angles on the fly so frequently and with good coherence is impressive to me. Overall this is definitely the new model and quite a step up from the Veo we are used to
OpenAI set to add remote Codex control to ChatGPT mobile app
OpenAI teased a Thursday launch, prompting device rumors, but app evidence points to mobile control for Codex inside ChatGPT.
🗞 #chatgpt @testingcatalog
Google is rolling out a customization feature for Mind Maps on NotebookLM, along with renaming and sharing options and an updated animation.
Users will be able to scope their Mind Maps to a specific topic or a source.
Did you get it already? 👀
Manus adds connector suggestions based on task needs
Manus added Connector Recommendations to suggest missing integrations like Notion, Slack, Gmail, and Drive during a task, then guide approval and setup in-chat. Available to all users, it cuts manual steps while keeping permissions under user control.
🗞 #manus @testingcatalog
Google released Pomelli Catalog, a new feature of the Pomelli marketing agent experiment.
Pomelli will generate a set of products based on your business DNA, so you can reuse them across your marketing campaigns.
For TestingCatalog, it generated a Weekly Newsletter Subscription product and several others.
I will package a new campaign soon 👀
Perplexity released Perplexity Computer for Professional Finance with bootstrapped workflows and new data providers.
Additionally, all responses are traceable back to the source!
Inworld AI launches Realtime TTS-2 model for live conversations
Inworld AI launched Realtime TTS-2, a voice model for live conversation with sub-200ms latency, conversational context, natural-language voice direction, inline vocal controls, and voice consistency across 100+ languages.
🗞 #sponsored @testingcatalog
AI Studio can now use Nano Banana for Image Generation to tweak images on apps generated via AI Studio Build.
Android Show has begun 🍿
https://www.youtube.com/watch?v=dXCCleAddEA
GOOGLE 🔥: A new Gemini Omni banner has been added to the web build recently.
> Gemini Omni will be an Agent that can combine text, images, and videos.
> Users will be able to add themselves to different scenes. As we know, AI Avatars (Likeness) are coming to Gemini as well, and Gemini Omni will likely be connected to that.
> "Likeness" feature will likely be highly coupled to mobile apps (as it used to work on Sora).
What's the chance we will get it today during the Android show?
h/t @Thomasguka
Anthropic released Agent View in the Claude Code CLI, where users can observe and interact with agents running in parallel.
It looks like preparation for a future in which agents will pursue broader long-term goals. Claude's mobile app is being prepared for that as well.
OpenAI announces Daybreak initiative around Codex Security
OpenAI launched Daybreak, a cybersecurity program that extends Codex into secure code review, threat modeling, patch validation, and detection support, with verified access, partner integrations, and rollout for defenders and enterprises.
🗞 #chatgpt @testingcatalog
Anthropic adds Agent View to Claude Code CLI interface
Anthropic’s Agent View for Claude Code adds a CLI dashboard for managing parallel coding sessions in one place. It shows status, activity, and input needs, supports background jobs, and is available now in Research Preview.
🗞 #claude @testingcatalog
Google’s Gemini Omni video model surfaces ahead of I/O debut
Leaked Gemini Omni details point to Google unveiling a unified video model at I/O, with strong in-chat editing and remix tools but generation quality trailing Seedance 2. Credit-based limits and possible Flash/Pro tiers also surfaced.
🗞 #gemini @testingcatalog
OPENAI 🔥: A mention of a new Ultrafast mode appeared for some time on the Codex GitHub repository.
> "The fastest available responses for latency-sensitive work."
Seems like it was an unintended push.
GOOGLE I/O 🔥: New evidence of the upcoming Gemini Omni video model has been spotted in the Gemini mobile app.
A video sample below 👀
> "Meet our new video model. Remix your videos, edit directly in chat, try a template, and more."
> Based on the description, we might be really talking about the true "Omni" model based on Gemini, rather than Veo.
> It also seems to be quickly consuming usage limits, based on early tests. "Usage" is a new tab that will be available on both the web and mobile.
We will likely see deeper integration between Codex and ChatGPT very soon.
> Use the ChatGPT app on your phone to keep working with Codex whenever your computer is awake.
Additionally, this image from OpenAI sparked loads of speculation, including the theory that OpenAI is teasing its own mobile phone.
Even though that is quite unrealistic, it would steal a lot of attention from the Google I/O event.
OpenAI launches GPT-5.5 Instant as new ChatGPT default
OpenAI is rolling out GPT-5.5 Instant as ChatGPT’s new default model, replacing GPT-5.3 Instant. It delivers shorter, clearer answers, fewer false claims, stronger reasoning, and added personalization through memory, files, and Gmail context.
🗞 #chatgpt @testingcatalog
Google released Multi-Token Prediction (MTP) drafters for the Gemma 4 family. It comes with a 3x speed boost without losing performance.
Looking forward to testing a quantized Gemma 4 with MTP drafters on a Mac Mini!
Perplexity got a new tab for artifacts! It appears to be a list of all previously generated artifacts across Perplexity and Perplexity Computer.
Pinning is possible too 👀
OPENAI 🚨: GPT-5.5 Instant is rolling out to all users on ChatGPT! "gpt-5.5-chat-latest" is coming to APIs as well.
> Much more concise. Better memory. More personalized.
Instant testing time 👀
Anthropic announced new ready-to-run Claude agent templates for financial services.
These tools can be used as plugins for Claude Code, Cowork, or via Managed hosted Agents.
OpenClaw will start getting long-term support releases later in May, in response to past updates that caused degraded performance.
StableClaw 🦞