AI News - ### Dec 2024
AI News - ### Dec 2024
A lot is happening in the AI world. It is extremely difficult to followup, learn and implement. Therefore I am planning to write a weekly AI development news summary. This will help me and my readers to understand about the trends and new developement some of these may be interesting to know and someone can pickup to expand as research or as a product, we don’t know.
Google Announcements
Gemini 2.0 Foundation Model
- Released “Gemini 2.0 Flash.”
- Features include structured output, code execution, function calling, grounding, and real-time voice and screen-sharing interactions.
- Gemini 2.0 supports image manipulation, such as transforming or blending images.
- Analyze videos without audio, identifying scenes and key moments.
- Future feature for geographic exploration and context.
- Benchmarks
- Demo - Introducing Gemini 2.0
Project Astra
- Universal AI-powered assistant with advanced vision capabilities, able to recognize objects, read books, and identify environmental details. It is enabled by Gemini 2.### Multimodel memory and realtime information. It is multilingual and can switch languages on the fly.
- Future integration with smart glasses for hands-free interaction.
- Demo - Project Astra
Project Mariner
- Browser agent prototype for automating repetitive tasks, such as gathering contact information from websites.
- Enhanced web research tool that works from chrome as extension. It can work as an agent and do work on your behalf.
- Demo - Project Mariner
Jules (Developer Assistant)
- AI assistant for coding tasks and game assistance.
- Exploring your virtual world in video games.
- Reason objects in ### real world aroud us.
- Demo - Gemini for Games
Some Other Testing of Gemini 2.0
OpenAI Announcements
Sora Turbo
Video generation tool capable of creating 20-second videos and blending multiple video concepts.
ChatGPT Canvas
Now available to all users, featuring Python code execution and an updated user interface for writing and coding tasks.
ChatGPT with Siri Integration
ChatGPT accessible via Siri on iPhone ### and macOS, with features like screen sharing and enhanced intelligence.
Advanced Voice Mode
Combines vision and voice to describe objects and read text from images in real-time.
Santa Claus Interaction
Seasonal feature allowing users to interact with Santa via ChatGPT.
Other Announcements
Anthropic Claude 3.5 Haiku
Faster and cheaper model available for chatbot applications.
Grok’s New Image Generator
Introduced a new image generation model using autoregressive mixture of experts.
MidJourney Patchwork
Collaborative canvas for generating and organizing images.
Adobe Reflection Removal
AI-powered feature to remove reflections from photos taken through glass.
YouTube Automatic Dubbing
Translate and dub videos into multiple languages to expand audience reach.
Cognition Labs’ Devin
AI coding assistant priced at $500/month, designed for large codebase management.
Meta Quest & Windows Integration
Enables virtual desktops and workspaces in the Meta Quest 3 VR headset.
Google Android XR
Augmented and virtual reality platform competing with Apple Vision Pro.
Tesla Optimus Robot Update
Progress in humanoid robots learning to walk on uneven terrain.
Miscellaneous
- Hostinger AI Website Builder: Simplified website creation using AI.
- Virtual Reality and XR developments from Meta and Google.
- AI livestreams starting December ### to explore tools in real-time.
Leave a comment