Image & Video Generation

Diffusion models, Stable Diffusion, ComfyUI, text-to-image/video

289 articles across 104 editions

Articles

  1. Running DeepSeek-V4 locally with 4x legacy RTX 2080 Ti ($2k budget setup). Custom Turing kernels, W8A8 quantization, and 255 prefill tok/s! -- 2026-05-20
  2. Ran the same models across Strix Halo, RTX 3090, and RTX 5070 because I wanted my own numbers -- 2026-05-20
  3. Intel's Crescent Island PCB Leaks, Showing a Massive Xe3P GPU, 16-Pin Connector, 160GB LPDDR5X as Intel Sidesteps the HBM Shortage -- 2026-05-20
  4. Sipeed's K3 RISC-V SBCs can run 30B-parameter LLMs 60 TOPS (INT4), Supports BF16/FP16/INT4 -- 2026-05-20
  5. club-5060ti: practical RTX 5060 Ti local LLM notes and configs -- 2026-05-20
  6. ScioMind: Cognitively Grounded Multi-Agent Social Simulation with Anchoring-Based Belief Dynamics -- 2026-05-15
  7. [MIT] RLCR: Teaching AI models to say "I'm not sure" -- 2026-05-15
  8. CDM: Continuous-Time Distribution Matching for Few-Step Diffusion Distillation -- 2026-05-15
  9. Built an open-source one-prompt-to-cinematic-reel pipeline on a single GPU — FLUX.2 + Wan2.2 + vision critic + music + 9-language narration -- 2026-05-15
  10. HumeAI/tada-3b-ml -- 2026-05-15
  11. PRX Part 3 — Training a Text-to-Image Model in 24h! -- 2026-03-12
  12. New LTX2.3 Tool for OpenWebui -- 2026-03-12
  13. Kotlin creator's new language: a formal way to talk to LLMs instead of English -- 2026-03-12
  14. Building a TB-303 from Scratch -- 2026-03-12
  15. PKU-YuanGroup/Helios: Real Real-Time Long Video Generation Model -- 2026-03-04
  16. StyleStream: Real-Time Zero-Shot Voice Style Conversion -- 2026-03-04
  17. KokoClone: Kokoro TTS, but it clones voices now -- 2026-03-04
  18. Ling-2.5-1T: 1T Parameter Open-Source Instant Model with 1M Context -- 2026-02-16
  19. Qwen3.5-397B-A17B Unsloth GGUFs — Run on Consumer Hardware -- 2026-02-16
  20. Running Qwen3-Coder-Next 80B on 8GB VRAM — 300x Speedup via Custom Expert Caching -- 2026-02-16
  21. Flame Graphs vs Tree Maps vs Sunburst (2017) -- 2025-12-31
  22. 39C3: Recreating Sandstorm -- 2025-12-31
  23. EditMGT — fast, localized image editing with Masked Generative Transformers -- 2025-12-30
  24. Francis-Rings/FlashPortrait -- 2025-12-30
  25. zai-org/GLM-TTS -- 2025-12-30
  26. Flowception: Temporally Expansive Flow Matching for Video Generation -- 2025-12-30
  27. Key Highlights of NVIDIA’s New Model: Nemotron 3 -- 2025-12-17
  28. The Open Evaluation Standard: Benchmarking NVIDIA Nemotron 3 Nano with NeMo Evaluator -- 2025-12-17
  29. ostris/Z-Image-De-Turbo -- 2025-12-12
  30. zai-org/GLM-TTS -- 2025-12-11
  31. openbmb/VoxCPM1.5 -- 2025-12-11
  32. ByteDance-Seed/Depth-Anything-3 -- 2025-12-10
  33. seominseok0429/Upsample-Anything-A-Simple-and-Hard-to-Beat-Baseline-for-Feature-Upsampling -- 2025-12-09
  34. lrzjason/QwenEdit-Anything2Real_Alpha -- 2025-12-08
  35. How Big is Your Video Again? Square vs Rectangular Pixels -- 2025-12-08
  36. shubh-io/DockMate -- 2025-12-08
  37. Comfy-Org/HunyuanVideo_1.5_repackaged -- 2025-12-08
  38. ByteDance/BindWeave -- 2025-12-08
  39. apple/starflow -- 2025-12-04
  40. princepainter/ComfyUI-PainterLongVideo -- 2025-12-04
  41. OpenGPT-4o-Image: A Comprehensive Dataset for Advanced Image Generation and Editing -- 2025-12-03
  42. Z-Image: Powerful and highly efficient image generation model with 6B parameters -- 2025-12-03
  43. FLUX.2: Frontier Visual Intelligence -- 2025-11-28
  44. Diffusers welcomes FLUX-2 -- 2025-11-26
  45. dx8152/Relight -- 2025-11-26
  46. Question About Motherboards -- 2025-11-26
  47. Qwen/Qwen3-VL-4B-Instruct -- 2025-11-20
  48. Soul-AILab/SoulX-Podcast-1.7B -- 2025-11-20
  49. wildminder/ComfyUI-DyPE -- 2025-11-19
  50. lightx2v/Autoencoders -- 2025-11-19
  51. We ran over 600 image generations to compare AI image models -- 2025-11-13
  52. dx8152/Qwen-Image-Edit-2509-Relight -- 2025-11-13
  53. meituan-longcat/LongCat-Video -- 2025-11-05
  54. allenai/olmOCR-2-7B-1025-FP8 -- 2025-11-05
  55. deepseek-ai/DeepSeek-OCR -- 2025-11-04
  56. LiquidAI/LFM2-VL-3B -- 2025-11-04
  57. Qwen/Qwen3-VL-235B-A22B-Thinking -- 2025-11-04
  58. DeepSeek may have found a new way to improve AI’s ability to remember -- 2025-11-02
  59. Qwen/Qwen3-VL-8B-Thinking -- 2025-11-02
  60. nvidia/omnivinci -- 2025-11-02
  61. OpenImagingLab/FlashVSR -- 2025-11-02
  62. Build Your Own Force-Feedback Joystick -- 2025-11-02
  63. ZOZO's Contact Solver for physics-based simulations -- 2025-11-01
  64. valiantcat/Qwen-Image-Edit-MeiTu -- 2025-11-01
  65. ToonComposer: Streamlining Cartoon Production with Generative Post-Keyframing -- 2025-11-01
  66. fireleyfreya/AI-Art-Generator -- 2025-10-30
  67. krea/krea-realtime-video -- 2025-10-30
  68. Cerebras REAP'd GLM4.6: 25%, 30%, 40% pruned FP8 checkpoints on HF! -- 2025-10-28
  69. Qwen/Qwen3-VL-30B-A3B-Instruct-FP8 -- 2025-10-28
  70. lightx2v/Wan2.2-Distill-Loras -- 2025-10-28
  71. GPT-OSS-20b TAKE THE HELM! Further experiments in autopilot. -- 2025-10-28
  72. 5060ti chads... ram overclocking, the phantom menace -- 2025-10-28
  73. Batch inference locally on 4080 -- 2025-10-28
  74. DeepSeek just released a bombshell AI model (DeepSeek AI) so profound it may be as important as the initial release of ChatGPT-3.5/4 ------ Robots can see-------- And nobody is talking about it -- And it's Open Source - If you take this new OCR Compression + Graphicacy = Dual-Graphicacy 2.5x improve -- 2025-10-27
  75. Pico Banana: Large-Scale Dataset for Image Editing by Apple -- 2025-10-27
  76. dvlab-research/DreamOmni2 -- 2025-10-25
  77. bytetriper/RAE -- 2025-10-25
  78. tencent/POINTS-Reader -- 2025-10-25
  79. Stitch: Training-Free Position Control in Multimodal Diffusion Transformers -- 2025-10-25
  80. Llama.cpp is looking for M5 Neural Accelerator performance testers -- 2025-10-24
  81. NVIDIA sent me a 5090 so I can demo Qwen3-VL GGUF -- 2025-10-24
  82. AMD ROCm 7.9 and dwindling GPU support -- 2025-10-24
  83. Show HN: Cuq – Formal Verification of Rust GPU Kernels -- 2025-10-24
  84. [Editorial] https://github.com/DrewThomasson/ebook2audiobook -- 2025-10-23
  85. [Editorial] https://github.com/lfnovo/open-notebook -- 2025-10-23
  86. tencent/Hunyuan3D-Omni -- 2025-10-23
  87. Doby-Xu/WithAnyone -- 2025-10-22
  88. lightx2v/Wan2.2-I2V-A14B-Moe-Distill-Lightx2v -- 2025-10-22
  89. mit-han-lab/streaming-vlm -- 2025-10-22
  90. tencent-ailab/SongPrep -- 2025-10-20
  91. opendatalab/MinerU2.5-2509-1.2B -- 2025-10-20
  92. QuantStack/Qwen-Image-Edit-2509-GGUF -- 2025-10-20
  93. LM Studio and VL models -- 2025-10-19
  94. Qwen/Qwen-Image-Edit-2509 -- 2025-10-19
  95. Alpha-VLLM/Lumina-DiMOO -- 2025-10-19
  96. linkedlist771/SoraWatermarkCleaner -- 2025-10-19
  97. Audio transcription with llama.cpp multimodal -- 2025-10-18
  98. I built a fully automated AI podcast generator that connects to ollama -- 2025-10-18
  99. Paper2Video — turn a research paper into a full presentation video (slides, speech, talking head) -- 2025-10-15
  100. Practical OCR with Nanonets OCR2‑3B -- 2025-10-15
  101. neuphonic/neutts-air -- 2025-10-15
  102. Qwen/Qwen3-VL-235B-A22B-Instruct -- 2025-10-15
  103. XiaomiMiMo/MiMo-Audio-Eval -- 2025-10-15
  104. Very interesting! OmniInsert — mask-free video insertion of any reference -- 2025-10-14
  105. facebookresearch/DepthLM_Official -- 2025-10-14
  106. Built a 1288x RTFx Parakeet Speech-to-Text server... Enjoy! -- 2025-10-13
  107. Novel OpenGL Pixel Shader Dewarping -- 2025-10-13
  108. lovis93/next-scene-qwen-image-lora-2509 -- 2025-10-13
  109. Chinny (iOS/MacOS): offline, on-device voice cloning with an optimized Chatterbox model -- 2025-10-12
  110. herimor/voxtream -- 2025-10-12
  111. microsoft/VibeVoice-Large -- 2025-10-12
  112. chetwinlow1/Ovi -- 2025-10-12
  113. Phr00t/Qwen-Image-Edit-Rapid-AIO -- 2025-10-12
  114. NVlabs/rcm -- 2025-10-11
  115. Qwen3-VL-30B-A3B-Thinking GGUF with llama.cpp patch to run it -- 2025-10-10
  116. Does quantization need training data and will it lower performance for task outside of training data? -- 2025-10-10
  117. Qwen/Qwen3-VL-30B-A3B-Instruct -- 2025-10-10
  118. Project running VLMs on a Pi 5 and NV Jetson Orin Nano -- 2025-10-05
  119. Demo: I made an open-source version of Imagine by Claude (released yesterday) -- 2025-10-05
  120. nunchaku-tech/nunchaku-qwen-image-edit-2509 -- 2025-10-05
  121. cvlab-kaist/VIRAL -- 2025-10-05
  122. Tencent-Hunyuan/Hunyuan3D-Omni -- 2025-10-04
  123. tencent/HunyuanImage-3.0 -- 2025-10-04
  124. For llama.cpp/ggml AMD MI50s are now universally faster than NVIDIA P40s -- 2025-10-03
  125. MSI EdgeXpert Compact AI Supercomputer Based on NVIDIA DGX Spark -- 2025-10-03
  126. Kairos: Immutable Distro for K8s at the Edge -- 2025-10-03
  127. Nvidia Has Been Supplying NDA'ed Docs to Red Hat for Helping NVK Driver -- 2025-10-03
  128. Mini Laptop Needs Custom Kernel -- 2025-10-03
  129. jmanhype/vggt-mps -- 2025-10-02
  130. openbmb/VoxCPM-0.5B -- 2025-10-02
  131. Comfy-Org/Qwen-Image-Edit_ComfyUI -- 2025-10-02
  132. SOTA OCR on-device with Core ML and dots.ocr -- 2025-10-02
  133. Tencent-Hunyuan/SRPO -- 2025-09-30
  134. lodestones/Chroma1-Base -- 2025-09-30
  135. MV-RAG: Retrieval Augmented Multiview Diffusion -- 2025-09-30
  136. Phantom-video/HuMo -- 2025-09-27
  137. Build Your Own 6K Camera -- 2025-09-27
  138. Training-Free Text-Guided Color Editing with Multi-Modal Diffusion Transformer -- 2025-09-27
  139. OPPOer/Qwen-Image-Pruning -- 2025-09-27
  140. We made a new AI interface that is compatible with Ollama -- 2025-09-24
  141. if-ai/ComfyUI_HunyuanVideoFoley -- 2025-09-24
  142. Show HN: Inferencer – Run and deeply control local AI models (macOS release) -- 2025-09-24
  143. tencent/HunyuanWorld-Voyager -- 2025-09-24
  144. FireRedTeam/FireRedTTS2 -- 2025-09-24
  145. Wan-AI/Wan2.2-Animate-14B -- 2025-09-22
  146. decart-ai/Lucy-Edit-Dev -- 2025-09-22
  147. OpenBMB/VoxCPM -- 2025-09-22
  148. voicepowered-ai/VibeVoice-finetuning -- 2025-09-22
  149. alibaba-pai/Wan2.2-VACE-Fun-A14B -- 2025-09-21
  150. VideoGuard: Protecting Video Content from Unauthorized Editing -- 2025-09-21
  151. zli12321/Vision-SR1 -- 2025-09-19
  152. lrzjason/Comfyui-QwenEditUtils -- 2025-09-19
  153. Mini-o3/Mini-o3 -- 2025-09-19
  154. The AI-Scraping Free-for-All Is Coming to an End -- 2025-09-18
  155. Visible Watermarking with Gradio -- 2025-09-18
  156. Tencent-Hunyuan/HunyuanImage-2.1 -- 2025-09-17
  157. xiaomi-research/q-frame -- 2025-09-17
  158. TencentCloudADP/youtu-graphrag -- 2025-09-17
  159. Renting GPUs is hilariously cheap -- 2025-09-09
  160. Tencent-Hunyuan/HunyuanWorld-Voyager -- 2025-09-09
  161. Shipping textures as PNGs is suboptimal -- 2025-09-09
  162. LiquidAI/LFM2-VL-1.6B -- 2025-09-06
  163. A Training-Free, Task-Agnostic Framework for Enhancing MLLM Performance on High-Resolution Images -- 2025-09-06
  164. TencentARC/GenCompositor -- 2025-09-06
  165. TencentARC/ToonComposer -- 2025-09-04
  166. MeiGen-AI/InfiniteTalk -- 2025-09-04
  167. OWUI_File_Gen_Export v0.2.0 is out ! -- 2025-09-04
  168. MCP File Generation tool -- 2025-09-04
  169. tencent/Hunyuan-GameCraft-1.0 -- 2025-09-03
  170. InternVL 3.5 released : Best Open-Sourced Multi-Modal LLM, Ranks 3 overall -- 2025-08-31
  171. HunyuanVideo-Foley is out, an open source text-video-to-audio model -- 2025-08-31
  172. peteromallet/Flux-Kontext-InScene -- 2025-08-31
  173. TTS VibeVoice FastAPI -- 2025-08-30
  174. Microsoft VibeVoice TTS : Open-Sourced, Supports 90 minutes speech, 4 distinct speakers at a time -- 2025-08-29
  175. tencent/HunyuanVideo-Foley -- 2025-08-29
  176. bullerwins/Wan2.2-I2V-A14B-GGUF -- 2025-08-28
  177. QuantStack/Qwen-Image-Edit-GGUF -- 2025-08-28
  178. dvlab-research/MGM-Omni -- 2025-08-28
  179. RTX PRO 6000 MAX-Q Blackwell for LLM -- 2025-08-28
  180. Gemini 2.5 Flash Image -- 2025-08-27
  181. Arrexel/pattern-diffusion -- 2025-08-27
  182. An Alternative to Text-to-SQL -- 2025-08-25
  183. Best model for transcribing videos? -- 2025-08-25
  184. Compute Where It Counts: High Quality Sparsely Activated LLMs -- 2025-08-25
  185. moonshotai/Kimi-K2-Base -- 2025-08-25
  186. unsloth/Qwen3-30B-A3B-Instruct-2507-GGUF -- 2025-08-25
  187. BlueLM-2.5-3B Technical Report -- 2025-08-25
  188. lightx2v/Qwen-Image-Lightning -- 2025-08-25
  189. Qwen-Image-Edit #6 overall on LMArena, best open model image editor -- 2025-08-24
  190. flybirdxx/ComfyUI-SDMatte -- 2025-08-24
  191. Wan-AI/Wan2.2-TI2V-5B -- 2025-08-24
  192. HiDream-ai/HiDream-E1-1 -- 2025-08-24
  193. We built a 12B model that beats Claude 4 Sonnet at video captioning while costing 17x less - fully open source -- 2025-08-15
  194. Francis-Rings/StableAvatar -- 2025-08-15
  195. SparcStation 1+ Finally Gets Attention -- 2025-08-15
  196. OmniSVG/OmniSVG -- 2025-08-15
  197. Phi-Ground Tech Report: Advancing Perception in GUI Grounding -- 2025-08-15
  198. NuMarkdown-8B-Thinking - first reasoning OCR VLM -- 2025-08-11
  199. Geometry Forcing: Marrying Video Diffusion and 3D Representation for Consistent World Modeling -- 2025-08-11
  200. Vision Language Model Alignment in TRL ⚡️ -- 2025-08-11
  201. Explore KittenTTS with Gradio: Easy Text-to-Speech model -- 2025-08-06
  202. [Editorial] AI personality -- 2025-08-05
  203. CUDA-L1: Improving CUDA Optimization via Contrastive Reinforcement Learning -- 2025-08-05
  204. internlm/Intern-S1 -- 2025-08-05
  205. peteromallet/Flux-Kontext-InScene -- 2025-08-02
  206. A Dual-Screen Cyberdeck To Rule Them All -- 2025-08-02
  207. ziangcao0312/PhysX-3D -- 2025-07-29
  208. Playtron's Linux-Based GameOS Hits the Road with 1.0 -- 2025-07-26
  209. Remembering Chiptunes, the Demoscene and the Illegal Music of Keygens -- 2025-07-26
  210. albozes/shotbuddy -- 2025-07-25
  211. uttam-li/dfs -- 2025-07-25
  212. OmniSVG/OmniSVG -- 2025-07-25
  213. Freezer Monitoring: Because Ice Cream Is a Dish Best Served Cold -- 2025-07-25
  214. Fast LoRA inference for Flux with Diffusers and PEFT -- 2025-07-25
  215. boson-ai/higgs-audio -- 2025-07-24
  216. nvidia/canary-qwen-2.5b -- 2025-07-24
  217. TimeScope: How Long Can Your Video Large Multimodal Model Go? -- 2025-07-24
  218. THUDM/GLM-4.1V-Thinking -- 2025-07-23
  219. FunAudioLLM/ThinkSound -- 2025-07-23
  220. RaphaelLiu/PusaV1 -- 2025-07-23
  221. merve/smol-vision -- 2025-07-23
  222. Skywork/Skywork-R1V3-38B -- 2025-07-20
  223. ByteDance-Seed/Seed-X-PPO-7B -- 2025-07-20
  224. ChenDarYen/ComfyUI-NAG -- 2025-07-20
  225. Locality-aware Parallel Decoding for Efficient Autoregressive Image Generation -- 2025-07-20
  226. Introducing KokoroDoki, a Local, Open-Source and Real-Time TTS. -- 2025-07-19
  227. Voxtral – Frontier open source speech understanding models -- 2025-07-19
  228. AI can now translate brain scans to text -- 2025-07-19
  229. quasiblob/ComfyUI-EsesImageEffectBloom -- 2025-07-18
  230. HiDream-ai/HiDream-E1-1 -- 2025-07-18
  231. Autoregressive Image Generation with Linear Complexity: A Spatial-Aware Decay Perspective -- 2025-07-18
  232. runjiali-rl/vmem -- 2025-07-17
  233. Exploring State-Space-Model based Language Model in Music Generation -- 2025-07-16
  234. TextPixs: Glyph-Conditioned Diffusion with Character-Aware Attention and OCR-Guided Supervision -- 2025-07-15
  235. MIDI-VALLE: Improving Expressive Piano Performance Synthesis Through Neural Codec Language Modelling -- 2025-07-14
  236. Need advice on how to improve Handwritten Text Recognition of names using Vision models (for academic research purposes) -- 2025-07-14
  237. DLLM-Cache: Accelerating Diffusion Large Language Models with Adaptive Caching -- 2025-07-08
  238. Efficient MultiModal Data Pipeline -- 2025-07-08
  239. black-forest-labs/FLUX.1-Kontext-dev-onnx -- 2025-07-07
  240. Subpixel Rendering For Impossibly Small Terminal Text -- 2025-07-04
  241. bytedance/ATI -- 2025-07-01
  242. AIDC-AI/Ovis-U1-3B -- 2025-07-01
  243. google/gemma-3n-E4B-it -- 2025-07-01
  244. baidu/ERNIE-4.5-21B-A3B-PT -- 2025-07-01
  245. THU-KEG/LongWriter-Zero-32B -- 2025-06-30
  246. Tencent-Hunyuan/Hunyuan3D-2.1 -- 2025-06-29
  247. bullerwins/FLUX.1-Kontext-dev-GGUF -- 2025-06-28
  248. google/gemma-3n-E2B-it -- 2025-06-28
  249. black-forest-labs/FLUX.1-Kontext-dev -- 2025-06-27
  250. 0.71-Å resolution electron tomography enabled by deep learning aided information recovery -- 2025-06-26
  251. MeiGen-AI/MeiGen-MultiTalk -- 2025-06-26
  252. Tencent-Hunyuan/HunyuanPortrait -- 2025-06-24
  253. (0,2) hybrid models -- 2025-06-24
  254. 0-th Order Pseudo-differential Operator on the Circle -- 2025-06-24
  255. OmniGen2/OmniGen2 -- 2025-06-24
  256. gdhe17/Self-Forcing -- 2025-06-24
  257. Voyager: Real-Time Splatting City-Scale 3D Gaussians on Your Phone -- 2025-06-21
  258. lightx2v/Wan2.1-T2V-14B-StepDistill-CfgDistill -- 2025-06-20
  259. Kijai/WanVideo_comfy -- 2025-06-20
  260. tencent/Hunyuan3D-2.1 -- 2025-06-20
  261. inclusionAI/Ming-Lite-Omni -- 2025-06-15
  262. New method for creating large 3D models of urban areas is faster and cheaper -- 2025-06-15
  263. vrgamedevgirl84/Wan14BT2VFusioniX -- 2025-06-13
  264. rusjoan/streamcrypt -- 2025-06-12
  265. tang-bd/fuse-dit -- 2025-06-12
  266. Show HN: 3DGS implementation in Nvidia Warp: clean, minimal, runs on CPU and GPU -- 2025-06-12
  267. 0.75 atoms improve the clock signal of 10,000 atoms -- 2025-06-12
  268. After Deepfaking YouTube, Google's Veo 3 Could Slop-Ify Video Games Next -- 2025-06-11
  269. manycore-research/SpatialLM -- 2025-06-09
  270. XiaomiMiMo/MiMo-VL-7B-RL -- 2025-06-09
  271. fishaudio/openaudio-s1-mini -- 2025-06-09
  272. rednote-hilab/dots.llm1.inst -- 2025-06-08
  273. tencent/HunyuanPortrait -- 2025-06-08
  274. Better quantization: Yet Another Quantization Algorithm -- 2025-06-08
  275. MCP server to connect LLM agents to any database -- 2025-06-08
  276. new gemma3 abliterated models from mlabonne -- 2025-06-08
  277. Sharing a demo of my tool for easy handwritten fine-tuning dataset creation! -- 2025-06-08
  278. Yess! Open-source strikes back! This is the closest I've seen anything come to competing with @GoogleDeepMind 's Veo 3 native audio and character motion. -- 2025-06-08
  279. For task-specific agents use task-specific LLMs for routing and hand off - NOT semantic techniques. -- 2025-06-08
  280. Face Age Prediction – Achieved Human-Level Accuracy (MAE ≈ 5) -- 2025-06-08
  281. Is there any open source project leveraging genAI to run quality checks on tabular data ? -- 2025-06-08
  282. Precomputing Transparency Order in 3D -- 2025-06-06
  283. nvidia/Llama-3.1-Nemotron-Nano-VL-8B-V1 -- 2025-06-05
  284. hexgrad/Kokoro-82M -- 2025-06-05
  285. Qwen/Qwen3-Embedding-0.6B-GGUF -- 2025-06-05
  286. AMAP-ML/UniVG-R1 -- 2025-06-02
  287. showlab/OmniConsistency -- 2025-06-02
  288. tencent/HunyuanVideo-Avatar -- 2025-06-02
  289. Datadog/Toto-Open-Base-1.0 -- 2025-06-01
  290. facebook/KernelLLM -- 2025-05-31
  291. 0.08 fF, 0.72 nA dark current, 91% Quantum Efficiency, 38 Gb/s Nano-photodetector on a 45 nm CMOS Silicon-Photonic Platform -- 2025-05-30
  292. 1000 FPS HDR Video With a Spike-RGB Hybrid Camera -- 2025-05-30