Best of GPUOctober 2024

  1. 1
    Article
    Avatar of notedNoted·2y

    FileFlows - Self-Hosted Media Conversion Guide

    FileFlows is a self-hosted solution to manage and compress media files, saving you substantial disk space. This guide covers setting up FileFlows on Pop-OS with NVIDIA GPU support via Docker Compose, creating custom processing flows, and configuring libraries for media conversion. It highlights the benefits of GPU acceleration for faster file processing, how to add necessary plugins, and offers a detailed example flow for converting MKV to MP4, demonstrating significant disk space savings.

  2. 2
    Article
    Avatar of andersonmanciniThreejs Tips and Inspiration·2y

    Platform for learning GLSL Shaders

    Shader Learning is a platform dedicated to teaching and practicing GPU programming through interactive tasks and theory. It covers topics like fragment and vertex shaders, 2D image manipulation, lighting, shadows, noise functions, texture mapping, and Signed Distance Field functions. The platform also delves into the fundamental mathematical principles behind computer graphics, with support available through a Discord channel.

  3. 3
    Article
    Avatar of hnHacker News·2y

    How I Self-Hosted Llama 3.2 with Coolify on My Home Server: A Step-by-Step Guide

    Inspired by the trend of migrating Next.js applications to self-hosted environments, the author explores self-hosting Llama 3.2 using Coolify on a home server. The main goals include hosting a Next.js website, running Llama 3.2 with GPU acceleration, and setting up a wildcard domain for various services. Key challenges involved configuring the CUDA toolkit for GPU usage and securing the LLM API. The guide provides a detailed walkthrough of the setup process, offering insights into software installations, deployment, and troubleshooting.

  4. 4
    Article
    Avatar of jeffgeerlingJeff Geerling·2y

    Use an External GPU on Raspberry Pi 5 for 4K Gaming

    The post covers the process of setting up and using an external GPU with a Raspberry Pi 5 for enhanced 4K gaming performance. It includes detailed instructions on the necessary hardware setup, choosing compatible graphics cards, and patching the Linux kernel to enable full GPU support. Additionally, it discusses the performance benchmarks achieved and potential applications beyond gaming, such as video transcoding.

  5. 5
    Video
    Avatar of youtubeYouTube·2y

    EASIEST Way to Fine-Tune a LLM and Use It With Ollama

    Learn how to fine-tune a large language model (LLM) on your local machine using Ollama and Unsloth. By using the synthetic text-to-SQL dataset and tools like Anaconda, CUDA libraries, and Jupyter notebook, you'll set up an environment to train a small but effective LLM. Benefits include reduced memory usage and easy deployment with Ollama. Ideal for users with or without a GPU, offering practicality using platforms like Google Colab for cloud-based training.

  6. 6
    Article
    Avatar of hnHacker News·2y

    lifeiteng/OmniSenseVoice: Omni SenseVoice: High-Speed Speech Recognition with words timestamps 🗣️🎯

    Omni SenseVoice, built on SenseVoice, offers high-speed and precise audio transcription with features like automatic language detection, GPU support, and quantized models for faster processing. Users can achieve up to 50x faster processing without sacrificing accuracy. The tool can be easily installed via pip and provides several key options for customization.

  7. 7
    Article
    Avatar of jendrikillnerJendrik Illner·2y

    Graphics Programming weekly - Issue 361 - October 13th, 2024

    The post explores various graphics programming topics, including a basic introduction to mesh shaders, their effects, use-cases, and limitations. It also highlights the high-level implementation details of the mesh shaders used for culling in Alan Wake 2. Additionally, it covers a Python visualization framework, GPU validation systems, Intel's rendering toolkit, and sampling methods for normal Gaussian distributions on GPUs. General Arcade is hiring a Graphics Rendering Engineer.

  8. 8
    Article
    Avatar of hnHacker News·2y

    kelindar/search: Go library for embedded vector search and semantic embeddings using llama.cpp

    The kelindar/search library provides an easy and efficient solution for embedding and vector search in Go applications, designed primarily for small-scale projects. It supports GGUF BERT models, offers GPU acceleration, and includes features such as search index creation from embeddings. While excellent for datasets with fewer than 100,000 entries, it may face performance challenges with larger datasets due to its brute-force search approach. The library also simplifies integration and deployment by avoiding cgo and relying on purego to call shared C libraries directly from Go code.

  9. 9
    Video
    Avatar of communityCommunity Picks·2y

    How do Graphics Cards Work? Exploring GPU Architecture

    Graphics cards, or GPUs, perform trillions of calculations per second to render realistic video game graphics and other computational tasks. With over 10,000 cores, GPUs execute simple arithmetic instructions in parallel, making them ideal for tasks like video game rendering and Bitcoin mining. The GA102 chip featured in many high-end graphics cards is built from billions of transistors and organized into a hierarchical structure of processing clusters and cores. The card's efficiency is further enhanced by its advanced memory architecture, which enables rapid data transfer. This computational power is also leveraged in AI through tensor cores for complex matrix operations.

  10. 10
    Article
    Avatar of phpdevPHP Dev·2y

    PHP Cuda Extension, access Cuda functions in PHP

    A new CUDA extension for PHP has been created to enable GPU functions on NVIDIA cards, bridging the gap between PHP and Python in AI development. The extension is near-production-ready, and feedback from NVIDIA GPU users is welcomed. Future plans include an OpenCL plugin and a Metal plugin for Apple Silicon.

  11. 11
    Article
    Avatar of hnHacker News·2y

    The Best Darn Grid Shader (Yet)

    This post discusses the development of a grid shader that aims to look better than traditional texture-based grids by addressing issues such as aliasing and Moiré patterns. It covers the challenges and solutions encountered in achieving this, including the use of partial derivatives for anti-aliasing and techniques to maintain line widths in perspective. Various iterations and comparisons with existing solutions are presented, culminating in a shader that meets specific visual criteria for real-time rendering.