From-scratch LLM inference engine in C++17/CUDA. Custom kernels, GGUF model loading, quantized inference (Q4/Q8). Runs SmolLM2-135M and Llama 3.2 1B on a 6 GB GPU. - Artemarius/CuInfer ...
Goal: Add a hard second Python mission ("Threat Log Parser") between forensics-timeline and career-boss, where students fix four independent bugs in a firewall log analysis script. Architecture: New ...