workflowstacks

The marketplace for AI skills that launch offers, rank in AI search, and automate operations. No coding required.

๐•โšก๐Ÿ’ฌ

Marketplace

  • Browse Skills
  • AI Agents
  • Claude Skills
  • MCP Servers
  • Prompts

Solutions

  • For Founders
  • For Agencies
  • For Ecommerce
  • Agent Builder
  • Starter Packs
  • Playbooks

Learn

  • How It Works
  • What Are Skills
  • What Are Agents
  • What Is MCP
  • For Creators
  • Submit a Tool
  • Security

Company

  • Become a Creator
  • About
  • Enterprise
  • API Docs
  • Terms
  • Privacy
  • Support
Compatible with
๐Ÿค–ChatGPT
โœจClaude
๐Ÿ’ŽGemini
๐Ÿ›๏ธShopify
๐Ÿ”Ahrefs
๐Ÿ“ŠSheets
๐Ÿ’ฌWhatsApp
๐Ÿ“ฑMeta Ads
+50 moreCreator program โ†’

ยฉ 2026 WorkflowStacks. All rights reserved.

TermsPrivacySupport
local-ai

Run LLMs with llama.cpp

Get fast LLM inference with llama.cpp, for founders using C++.
115,047 stars19,266 forksC++Quality 9/10Updated 6/7/2026100% free ยท open source
What it does

Llama.cpp enables fast and efficient inference of large language models in C/C++ applications, allowing developers to integrate AI capabilities into their projects

Install / run
git clone https://github.com/ggml-org/llama.cpp.git
When to use it
  • โ€ขWhen you need to deploy a language model in a resource-constrained environment
  • โ€ขWhen you want to integrate a language model into a C/C++ application
  • โ€ขWhen you need to achieve high-performance inference for large language models
Quick start
  1. 1Compile the project using the command 'mkdir build && cd build && cmake .. && cmake --build .'
  2. 2Download the pre-trained model weights using the provided script 'download-weights.sh'
  3. 3Create a C++ application that includes the 'llama.cpp' header file and links against the 'libllama.so' library
  4. 4Initialize the model using the 'LlamaModel' class and load the pre-trained weights
  5. 5Use the 'generate' function to generate text based on a given prompt
Ready-to-paste prompt
cout << LlamaModel::generate("Tell me a story about a character who", 100) << endl;
Heads up: Ensure that you have the necessary dependencies installed, including a C++ compiler and the CMake build system, and that your system meets the minimum requirements for the pre-trained models, including at least 4GB of RAM
Saves to your device

Topics

ggml
What's inside โ€” free to inspect
No purchase needed

Read the entire source before you build โ€” unlike paid marketplaces that hide it behind a buy button.

31
top-level files
26
folders
400.8M
repo size
MIT
license
Key files
.editorconfig
.pre-commit-config.yaml
AGENTS.md
pyrightconfig.json
README.md
requirements.txt
File tree
.devops/
.gemini/
.github/
.pi/
app/
benches/
ci/
cmake/
common/
conversion/
docs/
examples/
ggml/
gguf-py/
grammars/
include/
licenses/
media/
models/
pocs/
requirements/
scripts/
src/
tests/
Quick Actions
Details
Creator
ggml-org
Language
C++
Category
local-ai
Published
3/10/2023

Are you the creator of this tool? Claim your listing โ†’ and earn 85% of every sale.

Related skills

More local-ai tools founders pair with this one.

local-aiโ˜… 15,386
MNN: Fast On-Device AI
Get high-performance Edge AI with MNN, battle-tested by Alibaba, for startup founders, with 15k+ GitHub stars
local-aiโ˜… 218
Xybrid: AI on-device
Build AI-powered apps with Xybrid, for startup founders, with on-device AI capabilities.