Multi-user, full-stack blogging application

NVIDIA released Nemotron-TwoTower-30B-A3B-Base-BF16, an unusual diffusion-based language model built on Nemotron 3 architecture. Represents alternative approach to traditional autoregressive language modeling.

Reddit r/MachineLearning

Softmax-Free Attention Model with Triton Kernels at GPT-2 Medium Scale

Jun 21, 2026

1 min

Novel attention architecture achieving 354M parameters with long-context VRAM savings via structural sparsity and custom Triton kernels. Open weights released.

GitHub

DeepSeek-R1: Major Update to DeepSeek Reasoning Model

Jun 21, 2026

1 min

DeepSeek-R1 repository shows significant activity with 91,986 stars. Represents latest iteration of Chinese open-source reasoning model gaining substantial adoption.

GitHub

Kimi-K2: Latest Model by Moonshot/Kimi

Jun 20, 2026

1 min

Moonshot's Kimi released Kimi-K2 with 10,866 GitHub stars. Active development with recent update 2026-06-20.

GitHub

Qwen3.6 foundation model released by Alibaba

Jun 19, 2026

1 min

Alibaba releases Qwen3.6 model with 3,587 GitHub stars. Updated foundation model in open-source Qwen family.

GitHub

Qwen3-TTS text-to-speech model released by Alibaba

Jun 19, 2026

1 min

Alibaba releases Qwen3-TTS with 12,029 GitHub stars. Multimodal capability expansion for Qwen model family.

GitHub

Qwen3-VL model reaches 19,406 GitHub stars

Jun 17, 2026

1 min

Alibaba's Qwen3-VL multimodal model gains 19.4k+ GitHub stars. Updated 2026-06-17. Indicates strong adoption of vision-language model.