Vllama¶
Vllama ๐ฆ¶
One CLI for everything AI โ locally or on free cloud GPUs.
Image generation ยท AutoML ยท Local LLMs ยท Speech ยท Object Detection ยท 3D ยท VS Code
What is Vllama?¶
Vllama is a single CLI tool that puts state-of-the-art AI at your fingertips โ without needing a powerful GPU or writing any code.
No GPU? No problem.
Vllama can offload heavy models like Stable Diffusion to Kaggle's free GPU and automatically download the results to your machine. All you need is a Kaggle account.
Features¶
Image Generation
Run Stable Diffusion and other diffusion models locally or on Kaggle GPUs.
Local LLMs
Spin up any HuggingFace chat model as a local REST API server in one command.
VS Code Extension
Chat with your locally running LLM directly inside VS Code's native chat panel โ zero API keys.
Object Detection
Run YOLO on images and videos from the terminal instantly.
Video Generation
Generate videos from text prompts using text-to-video models.
Speech
Text-to-speech and speech-to-text using local models โ no cloud API needed.
Image/Video to 3D
Generate 3D .ply models from images or videos via Kaggle GPU.
AutoML
Preprocess any CSV and auto-train 9+ ML models with hyperparameter tuning in two commands.
5-Minute Examples¶
Installation¶
That's it. See Installation for environment-specific setup and optional dependency groups.
Next Steps¶
Full install guide including optional dependencies and environment setup.
Five hands-on examples to get you running in minutes.
Run heavy models for free using Kaggle's GPU โ Vllama's standout feature.
Every command, every flag, every output explained.