NVIDIA Releases Nemotron 3.5 ASR: A 600M-Parameter Cache-Aware Streaming Model Transcribing 40 Language-Locales in Real Time

NVIDIA’s Nemotron Speech team has released Nemotron 3.5 ASR. It is a 600M-parameter streaming Automatic Speech Recognition (ASR) model. A single checkpoint transcribes 40 language-locales in real time. Punctuation and capitalization are built in natively. The model ships as open weights on Hugging Face. The license is OpenMDW-1.1. The architecture is a Cache-Aware FastConformer-RNNT. What…

Read More

Startup Battlefield 200 applications officially close in 3 days

Founders, your window to enter Startup Battlefield 200 closes in just three short days. Applications for Startup Battlefield 200 officially close on June 8, 11:59 p.m. PT. Do not wait any longer. Secure your shot at competing on the Disrupt Stage at TechCrunch Disrupt 2026 this October at San Francisco’s Moscone West. Thousands of startups have already stepped forward. If you’re building a company…

Read More

Microsoft Fara Tutorial: Run a Browser-Use Agent in Google Colab with a Mock OpenAI-Compatible Endpoint

mock_server_code = r”’ from fastapi import FastAPI, Request import time app = FastAPI() STATE = {“calls”: 0} @app.post(“/v1/chat/completions”) async def chat_completions(request: Request): payload = await request.json() STATE[“calls”] += 1 model_name = payload.get(“model”, “mock-fara-7b”) if STATE[“calls”] == 1: content = ( “I will open a stable public test page so the browser-control loop can be demonstrated.\n”…

Read More

A Hands-On Coding Tutorial on Qualcomm AI Hub Models for Classification, Object Detection, and Hardware-Aware Deployment

In this tutorial, we work through an end-to-end workflow for Qualcomm AI Hub Models. We start by setting up the required package, discovering the available model collection, and loading MobileNet-V2 for local PyTorch inference. We also handle an important input-shape issue by converting NHWC image tensors into the NCHW format expected by the model. From…

Read More