Language Model

  1. Microsoft’s Phi-4-multimodal AI model handles speech, text, and video

    Microsoft has unveiled a new small language model (SLM) designed to help developers build multimodal AI applications that can operate efficiently on lightweight computing devices. The company emphasizes that this model can process speech, vision, and text locally on-device, consuming significantly less computational power than previous iterations.
    The Rise of Small Language Models (SLMs)
    While much of the innovation in...
  2. OpenAI Launches ChatGPT 4.5. It Is Huge and Compute-Intensive

    OpenAI has officially unveiled GPT-4.5, internally codenamed ‘Orion,’ marking a significant milestone in its language model development. This release represents the final iteration under OpenAI’s traditional scaling paradigm before shifting towards more advanced reasoning-based architectures. CEO Sam Altman described GPT-4.5 as the first model that “feels like talking to a thoughtful person,” emphasizing its enhanced ability to engage users in...
  3. xAI Unveils Grok 3, Claims It Outperforms Leading AI Models

    Elon Musk’s AI startup, xAI, has unveiled its latest artificial intelligence model, Grok 3, claiming it surpasses top competitors, including OpenAI and China’s DeepSeek. According to xAI, early testing, including evaluations in math, science, and coding, suggests Grok 3 has achieved superior performance compared to existing models.
    Grok 3 Launch and Features
    The new AI model will be rolled out starting Tuesday...
  4. Perplexity AI Unveils "Deep Research" Advancement in AI-Powered Knowledge Retrieval

    With rapid advancements in artificial intelligence research, particularly in natural language processing (NLP) and large-scale machine learning models, Perplexity AI has introduced its latest feature, "Deep Research." This new capability is engineered to enable users to conduct extensive, expert-level research within minutes, delivering highly detailed and contextually enriched responses to complex queries.
    The Technical Backbone of Deep Research
    Deep Research...
  5. Google’s Gemini expands access to ‘thinking’ AI models

    Google is taking a significant step forward in artificial intelligence with the introduction of its experimental "reasoning" AI model, now integrated into the Gemini app. This advancement, part of the Gemini 2.0 Flash Thinking update, aims to enhance AI's ability to process and explain complex queries. The rollout is just one of several major updates to Google's AI ecosystem, as...
  6. ChatGPT Developer Unveils AI Agent 'Deep Research' Amid Rising Competition from China's DeepSeek

    OpenAI, the company behind the popular AI chatbot ChatGPT, has taken another major step in the race to develop advanced artificial intelligence agents. The San Francisco-based firm has announced the release of a new tool called "Deep Research," which is designed to produce research reports at a level comparable to that of a professional analyst. This announcement comes at a...
  7. Alibaba Launches Qwen 2.5-Max AI Model That 'Outperforms' DeepSeek

    Alibaba has announced that its latest AI model, Qwen 2.5-Max, has achieved a top score of 89.4 in the Arena-Hard benchmark, a rigorous evaluation framework that assesses AI models based on their ability to respond effectively to human prompts. This development comes as ecommerce giant Alibaba rushes to release Qwen 2.5-Max, aiming to maintain its competitive edge in an industry...
  8. DeepSeek's Janus-Pro-7B Outshines AI Giants in Image Generation, Marking a New Era in Multimodal AI

    In a groundbreaking move that has sent ripples through the artificial intelligence (AI) industry, Chinese AI company DeepSeek has unveiled its latest open-source multimodal model, Janus-Pro-7B, which is already being hailed as a game-changer in the field of image generation and multimodal understanding. This release comes hot on the heels of the company’s earlier open-source model, R1, which caused a frenzy...
  9. Marc Andreessen Declares DeepSeek’s R1 as AI’s “Sputnik Moment,” Signaling a Shift in Global AI Dominance

    DeepSeek, a Chinese AI company, has made waves with its groundbreaking reasoning model, R1, challenging the long-held belief that China merely imitates Western technological advancements. This development not only shatters stereotypes but also positions China to potentially surpass the West in the AI race, marking what tech luminary Marc Andreessen calls “AI’s Sputnik moment.” The release of DeepSeek’s R1, an...
  10. ChatGPT Introduces Tasks for Scheduling and Recurring Actions

    OpenAI is taking a step closer to turning ChatGPT into a full-fledged digital assistant with the launch of a new beta feature called Tasks. This functionality, available to Plus, Team, and Pro subscribers starting today, allows users to set reminders and schedule recurring actions. By integrating this feature, OpenAI aims to extend ChatGPT’s capabilities beyond real-time conversations, making it more...
  11. Google Publishes In-Depth Whitepaper on Generative AI Agents and Their Potential

    Google has released an extensive whitepaper that delves into the development, architecture, and applications of Generative AI agents. This detailed document sheds light on how these advanced AI systems operate, emphasizing their ability to use external tools to expand their functionality far beyond the limits of traditional language models.
    Defining Generative AI Agents
    The whitepaper defines a Generative AI agent...
  12. The Best Generative AI Tools: From Chatbots to Image and Video Generators

    Top AI Tools Transforming the Generative Landscape in 2024, and the Disappointments Along the Way
    The generative AI space has become a competitive and rapidly evolving battlefield in 2024, with a growing number of players challenging the dominance of OpenAI. From language models to image generators, the tools redefining AI are both exciting and, at times, disappointing. In this comprehensive guide...
  13. Stanford's STORM's AI Revolution for Research Writing

    Writing detailed, well-researched articles is often a daunting and time-intensive process, even for seasoned authors. What if artificial intelligence could assist in crafting summaries of complex topics? Enter STORM (Synthesis of Topic Outlines through Retrieval and Multi-perspective Question Asking), an innovative AI system developed by researchers at Stanford University. Designed to generate Wikipedia-style articles on virtually any subject, STORM leverages...
  14. Google Introduces Gemini 2.0

    Over the past year, we have made remarkable strides in the field of artificial intelligence, setting new benchmarks and unlocking possibilities that were previously unimaginable. Today, we are thrilled to announce the release of the first model in the Gemini 2.0 family: Gemini 2.0 Flash. This experimental model is a testament to our commitment to pushing the boundaries of AI...
  15. Google's Genie 2: Transforming Text into Playable Games in Real-Time

    Google DeepMind, a global leader in artificial intelligence innovation, has introduced Genie 2, a revolutionary AI tool capable of converting simple text prompts into fully playable games in real-time. This cutting-edge platform is designed to generate limitless, dynamic 3D environments that can be used for both training and evaluating AI agents, as well as creating novel gaming experiences. Jack Parker-Holder...
  16. Amazon develops video AI model Olympus

    E-commerce giant Amazon (AMZN.O) has reportedly developed a new generative artificial intelligence (AI) model that goes beyond processing text, with capabilities to analyze and interpret images and videos, according to a report by The Information on Wednesday. This development is seen as a significant move towards reducing Amazon’s dependency on Anthropic, an AI startup whose Claude chatbot is a key...
