FauxPilot is an attempt to build a locally hosted version of GitHub Copilot. It uses the SalesForce CodeGen models inside of NVIDIA's Triton Inference Server with the FasterTransformer backend. If you have two NVIDIA RTX 3080 GPUs, you should be able to run the 6B model by putting fauxpilot-triton-1. The container image is governed by the NVIDIA Deep Learning Container License. Don't use it.
4 Comments
Sort: