Best of Computer VisionDecember 2025

  1. 1
    Article
    Avatar of mitMIT News·18w

    Deep-learning model predicts how fruit flies form, cell by cell

    MIT researchers developed a deep-learning model that predicts cell-by-cell development during fruit fly embryo formation with 90% accuracy. The model uses a dual-graph structure representing cells as both point clouds and foam-like bubbles, tracking properties like position, division, and folding minute-by-minute during gastrulation. The approach could eventually predict development in more complex organisms and identify early disease patterns in conditions like asthma and cancer, though high-quality video data remains the primary limitation for broader applications.

  2. 2
    Article
    Avatar of hnHacker News·19w

    I failed to recreate the 1996 Space Jam Website with Claude

    An engineer attempts to use Claude AI to recreate the iconic 1996 Space Jam website from screenshots and assets, but fails despite multiple approaches. The experiment reveals Claude's limitations in spatial reasoning and precise visual measurements. Despite providing grids, comparison tools, and zoomed images, Claude consistently produces inaccurate layouts while confidently claiming success. The author theorizes this stems from how vision models process images in 16x16 patches, losing fine-grained spatial detail. The piece documents the iterative debugging process, Claude's unreliable self-assessment, and the surprising difficulty of a seemingly simple HTML recreation task.

  3. 3
    Article
    Avatar of aiproductsAI Products·19w

    SAM 3 just dropped, and it's a big deal

    Meta released SAM 3, an open-source computer vision model that enables text-based object segmentation in images and videos. The model supports multiple input methods including text prompts, clicks, and bounding boxes, and can track objects across video frames. Trained on over 4 million unique concepts, it reportedly delivers double the accuracy of competing systems on open-vocabulary segmentation tasks. The model is available on GitHub with weights and starter notebooks.

  4. 4
    Article
    Avatar of ghblogGitHub Blog·17w

    This year’s most influential open source projects

    GitHub Universe 2025's Open Source Zone featured twelve influential projects spanning diverse domains: Appwrite (backend platform), GoReleaser (Go release automation), Homebrew (macOS package manager), Ladybird (independent browser), Moondream (lightweight visual AI), Oh My Zsh (shell framework), OpenCV (computer vision library), OSPSB (security baseline), p5.js and Processing (creative coding), PixiJS (2D graphics engine), Spark (3D Gaussian Splatting renderer), and Zulip (threaded team chat). Each project showcases different aspects of open source innovation, from developer tooling to AI and graphics rendering.

  5. 5
    Article
    Avatar of 80lv80 LEVEL·16w

    Particle Simulator Controlled by Hand Gestures

    David Katz created a browser-based particle simulation controlled by hand gestures using only a webcam. The project uses MediaPipe for hand tracking and Three.js for rendering 100,000 particles that react to hand movements in real-time. The setup demonstrates how accessible gesture-controlled interactive graphics have become with modern web technologies.

  6. 6
    Article
    Avatar of ft0is8acgd90jdhvinkgpValdemar·20w

    OpenAGI launched something interesting - Lux

    OpenAGI released Lux, a foundation AI agent that controls computers through screenshots and action sequences rather than text. It outperforms competing solutions from OpenAI, Google, and Anthropic on real-world tasks (83.6% vs 69% for Gemini CUA), operates faster (~1 second per step), and costs 10× less. Unlike browser-only alternatives, Lux works across desktop applications including Excel, Slack, Adobe products, and IDEs. The model is available via API and SDK, with Intel collaboration underway for local laptop optimization.

  7. 7
    Article
    Avatar of searlsJustin Searls·18w

    Seems like nothing interesting happened

    A brief observation about Ring's AI-powered camera event description feature, noting its straightforward and honest assessment of recorded activity. The post is primarily a personal anecdote with minimal technical detail.