gpu-llm

Here are 7 public repositories matching this topic...

A project that adds GPU acceleration for LLaMA models on local computers and AWS, for efficient inference and training.

  • Updated May 19, 2026
  • Python

AdaAttn is a GPU-native attention mechanism that dynamically adapts both numerical precision and matrix rank at runtime. By aligning linear algebra operations with modern GPU hardware characteristics, it reduces memory-bandwidth and computational overhead in large language models without sacrificing model quality.

  • Updated May 19, 2026
  • Python

This project implements PDDL-INSTRUCT with Logical Chain-of-Thought (LCoT), a novel approach to improve Large Language Model (LLM) performance on automated planning tasks. The system enhances planning capabilities through:

  • Updated May 19, 2026
  • Python

CSNePS Knowledge Graph Service is a production-ready enterprise system that bridges symbolic AI reasoning with modern ontology engineering. The system combines CSNePS (Cognitive Systems for Natural language Processing and Structured information) - a powerful semantic network reasoning engine - with comprehensive OWL ontology support, advanced graph

  • Updated May 19, 2026
  • Java
