Best of Image Processing2025

  1. 1
    Article
    Avatar of hnHacker News·1y

    VERT.sh

    VERT.sh allows you to quickly convert various image, video, and audio files directly on your device with no file size limit, no ads, and all processing done locally. It supports a wide range of file formats and is fully open source. Video conversion can be set up locally by following the guide provided.

  2. 2
    Article
    Avatar of collectionsCollections·48w

    Introducing PNG 3.0: A New Era for the Portable Network Graphics Standard

    PNG 3.0 has been released by the W3C after 20 years, introducing major enhancements including official animated PNG support, improved EXIF metadata storage, HDR imaging capabilities, and CICP color space encoding. The update maintains backward compatibility while providing better image quality and functionality for modern displays and applications.

  3. 3
    Article
    Avatar of hnHacker News·31w

    apple/pico-banana-400k

    Apple released Pico-Banana-400K, a dataset containing approximately 400,000 text-image-edit triplets for training text-guided image editing models. The dataset includes 257K single-turn examples, 56K preference learning samples, and 72K multi-turn conversations, covering 35 edit operations across 8 semantic categories. Built using Gemini-2.5-Flash for instruction generation and the Nano-Banana model for editing, each edit undergoes automated quality evaluation. Source images come from Open Images, with edits spanning object manipulation, scene composition, stylistic changes, and photometric adjustments. The dataset is available under CC BY-NC-ND 4.0 license for non-commercial research use.

  4. 4
    Article
    Avatar of selfhstselfh.st·24w

    Building Slink: Lessons from Creating a Self-Hosted Image Hosting Platform

    A developer shares architectural decisions and lessons learned from building Slink, a self-hosted image hosting platform. Key insights include choosing single-container deployment for simplicity despite scalability trade-offs, evolving features based on community feedback from r/selfhosted and GitHub issues, and tackling technical challenges like animated image processing and real-time notifications. The project prioritizes deployment simplicity and performance over feature richness, with development driven by actual user needs rather than predetermined roadmaps.

  5. 5
    Article
    Avatar of freekFREEK.DEV·1y

    Five levels of handling images in Laravel

    Discover methods for resizing, optimizing images, and enhancing UI in Laravel applications using Spatie's packages to streamline and improve efficiency in image handling.

  6. 6
    Article
    Avatar of arstechnicaArs Technica·49w

    MIT student prints AI polymer masks to restore paintings in hours

    MIT graduate student Alex Kachkine developed a revolutionary art restoration technique using AI-generated polymer films that can restore damaged paintings in hours instead of months. The method creates transparent masks with thousands of precisely color-matched regions that can be applied to artwork and removed when needed, making restoration reversible. An AI model identified damage patterns and generated over 57,000 different colors to restore a 15th-century painting with 5,612 damaged regions in just 3.5 hours. This approach could help make the 70% of institutional art collections currently hidden due to damage accessible to the public again.

  7. 7
    Article
    Avatar of opensourcesquadOpen Source·51w

    Image Editor Pro ChromeExtension - Open-Source

    Image Editor Pro is a new open-source Chrome extension that enables users to capture, annotate, and edit screenshots directly from any web page. The tool offers drawing capabilities, text labels, basic image transformations, filters, and export options in multiple formats. All processing happens locally without uploads or ads, and the source code is available under MIT license on GitHub.

  8. 8
    Article
    Avatar of lnLaravel News·49w

    Manipulate Image URLs in Laravel with the Image Transform Package

    A new Laravel package called laravel-image-transform-url provides a lightweight solution for manipulating images through URL parameters. Inspired by Cloudflare Images, it allows developers to transform images on-the-fly by appending query parameters for width, quality, format, and other modifications. The package includes built-in performance features like automatic caching, storage of processed images, rate limiting, and CDN integration support. It uses Intervention Image for file manipulations and offers a self-hosted alternative to external image processing services.

  9. 9
    Article
    Avatar of ieeespectrumIEEE Spectrum·33w

    Where Was This Photo Taken? AI Knows Instantly

    Researchers developed a machine learning model that matches street-level photos to aerial images for geolocation with 97% accuracy in initial narrowing and 82% for exact location. The system uses deep cross-view hashing with vision transformers to convert images into unique numerical fingerprints, making it twice as fast and using one-third the memory of competing models. The approach could benefit navigation systems when GPS fails, emergency response, and defense applications, though it needs further testing for real-world challenges like seasonal variations and cloud cover.

  10. 10
    Article
    Avatar of hnHacker News·30w

    zserge/grayskull: A tiny, dependency-free computer vision library in C for embedded systems, drones, and robotics.

    Grayskull is a minimalist computer vision library for microcontrollers and resource-constrained devices. Written in pure C99 as a single header file under 1000 lines, it requires no dependencies or dynamic memory allocation. The library provides grayscale image operations including filtering (blur, Sobel edges), thresholding (Otsu, adaptive), morphology (erosion, dilation), connected components, perspective warping, FAST/ORB feature detection for object tracking, and LBP cascades for face/vehicle detection. It includes PGM file I/O and uses integer-based operations optimized for embedded systems.

  11. 11
    Article
    Avatar of communityCommunity Picks·1y

    Shapen

    Shapen allows users to create 3D models from images, offering capabilities to transform photos, sketches, and drawings into detailed 3D representations. It showcases various model examples, highlighting its versatility in producing different 3D objects from diverse image sources.

  12. 12
    Article
    Avatar of baeldungBaeldung·1y

    Extracting Structured Data From Images Using Spring AI

    Learn how to extract structured data from images using the OpenAI chat model with Spring AI. The tutorial provides step-by-step instructions to create a web service that accepts image uploads, analyzes the images using OpenAI, and returns the structured data in JSON format. Key components include configuring Maven dependencies, setting up Spring Boot configurations, and defining POJO classes for structured outputs.

  13. 13
    Article
    Avatar of opensourcesquadOpen Source·47w

    I’ve built an open source image blending tool that lets you create stunning image effects, generate Tailwind CSS code, and more!

    A developer created Image Blender, an open-source web tool that applies CSS blend modes and gradient effects to images. The tool features drag-and-drop image upload, various blend effects like multiply and overlay, customizable gradients, preset collections, and automatic Tailwind CSS code generation. Users can download processed images and contribute community presets to the project.

  14. 14
    Article
    Avatar of appsmithAppsmith·1y

    Building an Image Compression & Resizing Widget with Dropzone.js

    Learn how to build an image compression and resizing widget using Dropzone.js. This tool helps users adjust image sizes before uploading, ensuring they meet backend requirements without needing external solutions.

  15. 15
    Article
    Avatar of vaadinVaadin·1y

    Extracting structured data from images using AI in Java

    Learn how to use Open AI and Spring AI to extract structured data from images into Java records. The tutorial covers configuring Spring AI, handling image uploads, and converting image data into usable Java objects. The guide demonstrates two use cases: extracting data from a receipt and importing a handwritten sign-up sheet.

  16. 16
    Article
    Avatar of hnHacker News·47w

    CyberTimon/RapidRAW: A beautiful, non-destructive, and GPU-accelerated RAW image editor built with performance in mind.

    RapidRAW is an open-source, GPU-accelerated RAW image editor built with Rust and Tauri as a lightweight alternative to Adobe Lightroom. The application features non-destructive editing, AI-powered masking, optional generative AI integration via ComfyUI, and comprehensive photo management tools. Developed by an 18-year-old as a personal challenge, it offers professional-grade adjustments, batch operations, and cross-platform support in under 30MB.

  17. 17
    Article
    Avatar of thevergeThe Verge·37w

    Internet detectives are misusing AI to find Charlie Kirk’s alleged shooter

    Internet users are using AI upscaling tools to enhance blurry FBI surveillance photos of a person of interest in a shooting case. However, these AI enhancements don't reveal hidden details but instead generate fabricated information, potentially creating misleading evidence. Past incidents show AI upscaling has produced inaccurate results, including changing facial features and adding nonexistent details to images.

  18. 18
    Video
    Avatar of aarondfrancisAaron Francis·1y

    I built Cloudflare Images in PHP (to scale & compress images)

    Learn how to recreate key features of Cloudflare Images in Laravel, including image resizing and compression. The guide covers setting up routes, transforming images, caching, and rate limiting, ensuring efficient image processing and delivery using Cloudflare as the CDN.

  19. 19
    Article
    Avatar of nuxtandvueVuejs&Nuxtjs·1y

    I recreated the Text behind Image, using Nuxt & Transformer.js

    A new free tool for removing backgrounds and adding text behind images, built entirely in the browser using transformer.js. The tool is user-friendly and requires no signups or logins. Created as a side project to learn marketing, feedback and improvement ideas are welcomed.