Sitemap

THIS WEEK IN AI — Week of 9th Feb 25

4 min readFeb 19, 2025
Press enter or click to view image in full size
AI newsletter

This newsletter gives you everything you want to know in AI from News, Jobs, Tools, Projects in a structured way weekly.

Latest AI development

Sam Altman speaks on GPT-5

  • $500B Stargate Project: Aims to enable future models to develop new scientific knowledge.
  • Programming Ranking: OpenAI’s internal model is ranked 50th globally, with potential to become №1 by year’s end.
  • Open Source Shift: OpenAI plans to move towards open source, acknowledging society’s readiness for the associated trade-offs.
  • Rapid AI Progress: Compared to “trying to outrun the calculator,” AI is expected to surpass human abilities in every general domain.
  • Source: https://www.youtube.com/watch?v=8LmfkUb2uIY

GitHub Copilot: The agent awakens

  • Agent Mode in VS Code: GitHub Copilot now offers an autonomous agent mode that iterates on its own code, detects errors, and even suggests terminal commands for self-healing.
  • Copilot Edits GA: The new Copilot Edits feature is generally available in VS Code, enabling natural language-driven, multi-file inline edits in a conversational workflow.
  • Enhanced AI Integration: The tool now integrates Gemini 2.0 Flash in the model picker, improving code completions, chat, and overall AI performance.
  • Project Padawan Preview: A first look at autonomous SWE agents shows Copilot taking on routine development tasks — like generating fully tested pull requests — directly from issues.
  • Source: https://github.blog/news-insights/product-news/github-copilot-the-agent-awakens/

ByteDance unveils Goku AI image and video creation

  • Unified High-Performance Architecture: Goku sets benchmark records in both image and video quality with a unified design.
  • Advanced Rectified Flow Technique: Enables seamless transitions between images and videos, powered by training on 160M images and 36M videos.
  • Enhanced Goku+ for Marketing: Specifically optimized for advertising, it creates photorealistic human avatars and product demos.
  • Specialized + Platform Tools: Offers dedicated features to turn product photos into video clips and facilitate realistic human–product interactions for commercial content.
  • Source: https://arxiv.org/pdf/2502.04896

The Anthropic Economic Index

  • Gradual Integration, Not Replacement: Over 36% of occupations use AI in at least 25% of their tasks, indicating AI is steadily being integrated rather than completely replacing jobs.
  • Augmentation Over Automation: Approximately 57% of tasks are augmented by AI, with only 43% fully automated — emphasizing a collaborative human-AI work model.
  • Tech-Heavy Adoption: Software development and technical writing roles are leading AI usage, particularly in mid-to-high wage positions, highlighting varied readiness across sectors.
  • Transparent Research Approach: The findings are based on millions of anonymized Claude.ai conversations, with the underlying dataset openly sourced for deeper analysis by researchers and policy experts.
  • Source: https://www.anthropic.com/news/the-anthropic-economic-index

Perplexity drops blazing new Sonar model

  • Ultra-Fast Performance: Sonar delivers responses 10x faster than competitors like Gemini 2.0 Flash, powered by Cerebras inference infrastructure for near-instant answer generation.
  • Superior Quality: In tests, Sonar outperformed GPT-4o and Claude 3.5 Sonnet in user satisfaction, factual accuracy, world knowledge, and other key benchmarks.
  • Widespread Availability: All Perplexity Pro subscribers now receive Sonar as their default model, with API access expected soon under the same architecture.
  • Upcoming Voice Mode: Perplexity CEO Aravind Srinivas teased that Voice Mode will be the only product reliably offering real-time voice answers for free.
  • Source: https://www.perplexity.ai/hub/blog/meet-new-sonar

OpenAI roadmap for GPT4.5 & GPT5

  • Integrated Advanced Reasoning: GPT-5 will embed o3’s capabilities along with other OpenAI tech, creating a unified system that dynamically adjusts intelligence levels.
  • Tiered Access Model: Free users get unlimited access to GPT-5 at “standard intelligence,” while Plus and Pro tiers unlock progressively higher performance and advanced tools.
  • Predecessor Release: Before GPT-5, GPT-4.5 (codenamed “Orion”) will be launched as the final non-chain-of-thought model, marking the transition toward more reasoning-based AI.
  • Timeline: According to Altman, GPT-4.5 is expected in weeks, with GPT-5 following in months, and o3 will no longer be released as a standalone model.
  • Source: https://x.com/sama/status/1889755723078443244

Gemini Flash 2.0 leads new AI agent leaderboard

  • Comprehensive Evaluation: 17 top LLMs were benchmarked across 14 tests covering tool usage, long context, complex interactions, and more.
  • Top Performer: Flash 2.0 led the leaderboard with a 0.938 score, outperforming pricier competitors.
  • Open-Source Rise: Models like Mistral’s latest Small release are achieving scores on par with premium offerings at lower costs.
  • Future Inclusion: DeepSeek’s V3 and R1 models were not tested due to missing function calling support, but will be evaluated if updated.
  • Source: https://www.galileo.ai/blog/agent-leaderboard

Trending AI Tools

  • Tough Tongue — Multimodal AI agent for navigating difficult conversations.
  • Pikadditions — New video-to-video feature that enables users to integrate any subject or object into existing footage
  • Le Chat — Mistral’s revamped AI assistant platform with 10x response speed and new iOS and Android apps
  • Memex- Memex is a general-purpose, Level 3 autonomy builder

AI Tutorials

Open Source AI Projects

AI Must Read Papers

We truly value your input. Please share your thoughts in the comments to help us improve.

--

--

Mastering LLM (Large Language Model)
Mastering LLM (Large Language Model)

Written by Mastering LLM (Large Language Model)

MasteringLLM is a AI first EdTech company making learning LLM simplified with its visual contents. Look out for our LLM Interview Prep & AgenticRAG courses.

No responses yet