Breaking 16:21 Brent crude falls below 110 dollars after volatile spike above 120 15:59 Uber CTO shows how Claude Code can blow up AI budgets 15:40 Spain Morocco tunnel faces delays as insurance and technical hurdles mount 15:20 China suspends robotaxi licences after Baidu system failure 14:59 Tanger Med handles 11 million containers, strengthens global port ranking 14:40 Starmer demands swift response after London knife attack 14:17 Morocco joins Artemis Accords to expand role in space cooperation 13:59 Germany backs Morocco autonomy plan as basis for Sahara solution 13:43 Morocco and United States deepen ties after Rabat diplomatic meeting 13:23 Oil crisis accelerates global shift toward renewable energy transition 13:04 Japanese yen crosses 160 level after sharp intervention warning 12:15 Mayor of small village in France placed in custody over rape allegations 12:00 Moody’s relinquishes regulatory licence of South African subsidiary 11:45 Russia rejects Tuareg rebels’ call for withdrawal of troops from Mali 11:30 Irish Supreme Court allows TikTok EU–China data transfers during appeal process 11:20 China’s coal powered grid shields factories amid Asian energy shock 11:15 Fire breaks out at French military camp ahead of Emmanuel Macron’s visit 11:04 Featherless.ai raises 20 million dollars to scale open source AI platform 11:00 Volkswagen expects limited benefit from possible U.S. tariff refunds, CFO says 10:45 Casablanca expands Timeless Festival into a three-day international music event in 2026 10:41 NASA and ISRO satellite maps rapid land subsidence in Mexico City 10:30 California declares state of emergency after golden mussel invasion threatens waterways 10:23 Amazon plans external sales of Trainium artificial intelligence chips 10:15 Teacher and school staff attacked by 15-year-old student in western France 10:00 Casablanca hosts first edition of GITEX Future Health Africa Morocco 10:00 EU finds Meta failed to block under-13 access to Instagram and Facebook 09:45 Italy posts modest growth in first quarter as economic risks increase 09:42 Terror attack in London leaves two Jewish men stabbed 09:30 Canada to host new multilateral defence bank to support global security funding 09:20 France unveils roadmap to end fossil fuels by 2050 09:15 Russia says OPEC+ will endure despite UAE exit, rules out oil price war 09:00 Researchers turn plastic waste into hydrogen fuel pathway 09:00 France’s growth stalls in first quarter amid weak domestic demand 08:45 Hammer attack in Tokyo leaves five injured, including police officers 08:40 Bitcoin faces 80,000 dollar ceiling as options cluster builds resistance 08:30 India approves major banking deal as Emirates NBD moves to take control of RBL Bank 08:22 Anthropic considers funding round valuing firm above $900 billion 08:15 North Korea faces severe and unusual drought threatening crops 08:00 New gene editing method enables full gene replacement in one step 08:00 Macron reshuffles team, opening path for new French central bank chief 07:50 Study finds embryonic epigenome follows universal physical laws 07:45 Inflation rises in April, reaching 2.2% year-on-year 07:40 Google signals plans to introduce ads into Gemini chatbot 07:30 Washington reaffirms recognition of Morocco’s sovereignty over the Sahara 07:20 Tech giants raise ai investment plans to $725 billion in 2026 07:15 Nvidia’s B300 servers surge to $1 million in China amid U.S. export curbs 07:00 UK pledges additional funding and fast-tracks legislation to combat antisemitism

Nvidia launches open source AI model for vision and audio tasks

Wednesday 29 April 2026 - 11:02
By: Dakir Madiha
Nvidia launches open source AI model for vision and audio tasks

Nvidia has released Nemotron 3 Nano Omni, an open source multimodal artificial intelligence model designed to process and connect text, images, audio and video within a single system. The company positions the model as a step toward replacing fragmented AI pipelines commonly used in enterprise environments.

The model can handle a wide range of inputs, including documents, graphics, user interfaces, images, audio and video, while producing text outputs. It is built on a hybrid mixture of experts architecture with 30 billion total parameters, of which around 3 billion are active per inference. Nvidia says this structure allows high level performance while reducing computational cost compared with larger traditional models.

Unlike current systems that rely on separate models for speech recognition, vision processing and language reasoning, Nemotron 3 Nano Omni integrates these functions into one unified architecture. It uses specialized encoders for audio, vision and graphical interfaces, allowing the system to maintain context across different types of data without transferring information between separate modules.

Nvidia claims the model delivers significantly improved efficiency, including up to nine times higher throughput compared with similar open omni models in certain tasks. It also supports a context window of up to 256,000 tokens, enabling long document analysis and complex multimodal reasoning. The company reports strong performance in benchmarks involving document understanding and audio video interpretation.

The model has already attracted adoption and testing from several companies across the technology sector. Nvidia has made it available through multiple platforms, including Hugging Face and cloud providers, along with open weights and training resources. It forms part of the broader Nemotron 3 family, which Nvidia says has reached tens of millions of downloads over the past year.


  • Fajr
  • Sunrise
  • Dhuhr
  • Asr
  • Maghrib
  • Isha

Read more

This website, walaw.press, uses cookies to provide you with a good browsing experience and to continuously improve our services. By continuing to browse this site, you agree to the use of these cookies.