Machine Learning and AI Engineer open source stack

Selected repositories matched to this profile using language, topics, activity and real usage patterns.

transformers
⭐ 154636 Python Score 97 Updated 2026-01-06

πŸ€— Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference...

audio deep-learning deepseek gemma glm hacktoberfest
keras
⭐ 63687 Python Score 97 Updated 2026-01-05

Deep Learning for humans

data-science deep-learning jax machine-learning neural-networks python
yolov5
⭐ 56538 Python Score 97 Updated 2025-12-31

YOLOv5 πŸš€ in PyTorch > ONNX > CoreML > TFLite

coreml deep-learning ios machine-learning ml object-detection
ray
⭐ 40630 Python Score 97 Updated 2026-01-06

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

data-science deep-learning deployment distributed hyperparameter-optimization hyperparameter-search
DocsGPT
⭐ 17583 Python Score 97 Updated 2026-01-05

Private AI platform for agents, assistants and enterprise search. Built-in Agent Builder, Deep research, Document analysis, Multi-model support, and API conn...

agent-builder agents ai chatgpt docsgpt hacktoberfest
annotated_deep_learning_paper_implementations
⭐ 65161 Python Score 90 Updated 2025-11-11

πŸ§‘β€πŸ« 60+ Implementations/tutorials of deep learning papers with side-by-side notes πŸ“; including transformers (original, xl, switch, feedback, vit, ...), optim...

attention deep-learning deep-learning-tutorial gan literate-programming lora
ultralytics
⭐ 50744 Python Score 89 Updated 2026-01-06

Ultralytics YOLO πŸš€

cli computer-vision deep-learning hub image-classification instance-segmentation
ragflow
⭐ 70962 Python Score 89 Updated 2026-01-06

RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context...

agent agentic agentic-ai agentic-workflow ai ai-search
Real-Time-Voice-Cloning
⭐ 59155 Python Score 89 Updated 2025-12-15

Clone a voice in 5 seconds to generate arbitrary speech in real-time

deep-learning python pytorch tensorflow tts voice-cloning
ml-engineering
⭐ 16151 Python Score 89 Updated 2025-12-20

Machine Learning Engineering Open Book

ai debugging gpus inference large-language-models llm
wandb
⭐ 10702 Python Score 89 Updated 2026-01-06

The AI developer platform. Use Weights & Biases to train and fine-tune models, and manage models from experimentation to production.

ai collaboration data-science data-versioning deep-learning experiment-track
txtai
⭐ 11981 Python Score 89 Updated 2026-01-05

πŸ’‘ All-in-one AI framework for semantic search, LLM orchestration and language model workflows

agents ai ai-agents embeddings information-retrieval language-model
deeplake
⭐ 8971 C++ Score 87 Updated 2026-01-06

Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time...

ai computer-vision cv data-science datalake datasets
LLMs-from-scratch
⭐ 82392 Jupyter Notebook Score 85 Updated 2026-01-04

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

ai artificial-intelligence chatbot chatgpt deep-learning from-scratch
dify
⭐ 124862 Python Score 81 Updated 2026-01-06

Production-ready platform for agentic workflow development.

agent agentic-ai agentic-framework agentic-workflow ai automation
pytorch
⭐ 96368 Python Score 81 Updated 2026-01-06

Tensors and Dynamic neural networks in Python with strong GPU acceleration

autograd deep-learning gpu machine-learning neural-network numpy
faceswap
⭐ 54845 Python Score 81 Updated 2026-01-05

Deepfakes Software For All

deep-face-swap deep-learning deep-neural-networks deepface deepfakes deeplearning
DeepSpeed
⭐ 41159 Python Score 81 Updated 2026-01-05

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

billion-parameters compression data-parallelism deep-learning gpu inference
stable-diffusion-webui
⭐ 159697 Python Score 81 Updated 2025-12-18

Stable Diffusion web UI

ai ai-art deep-learning diffusion gradio image-generation
langchain
⭐ 123517 Python Score 81 Updated 2026-01-05

πŸ¦œπŸ”— The platform for reliable agents.

agents ai ai-agents anthropic chatgpt deepagents
vllm
⭐ 66907 Python Score 81 Updated 2026-01-06

A high-throughput and memory-efficient inference and serving engine for LLMs

amd blackwell cuda deepseek deepseek-v3 gpt
LlamaFactory
⭐ 64963 Python Score 81 Updated 2026-01-05

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

agent ai deepseek fine-tuning gemma gpt
llama_index
⭐ 46179 Python Score 81 Updated 2026-01-05

LlamaIndex is the leading framework for building LLM-powered agents over your data.

agents application data fine-tuning framework llamaindex
peft
⭐ 20403 Python Score 81 Updated 2025-12-18

πŸ€— PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

adapter diffusion fine-tuning llm lora parameter-efficient-learning
RWKV-LM
⭐ 14264 Python Score 81 Updated 2025-12-19

RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "...

attention-mechanism chatgpt deep-learning gpt gpt-2 gpt-3
speechbrain
⭐ 11005 Python Score 81 Updated 2026-01-05

A PyTorch-based Speech Toolkit

asr audio audio-processing deep-learning huggingface language-model
tensorflow
⭐ 193213 C++ Score 77 Updated 2026-01-06

An Open Source Machine Learning Framework for Everyone

deep-learning deep-neural-networks distributed machine-learning ml neural-network
llm-app
⭐ 53213 Jupyter Notebook Score 77 Updated 2025-12-09

Ready-to-run cloud templates for RAG, AI pipelines, and enterprise search with live data. 🐳Docker-friendly.⚑Always in sync with Sharepoint, Google Drive, S3,...

chatbot hugging-face llm llm-local llm-prompting llm-security
haystack
⭐ 23800 MDX Score 77 Updated 2026-01-05

AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or...

agent agents ai gemini generative-ai gpt-4
metaflow
⭐ 9699 Python Score 75 Updated 2026-01-05

Build, Manage and Deploy AI/ML Systems

agents ai aws azure cost-optimization datascience
BentoML
⭐ 8350 Python Score 75 Updated 2026-01-05

The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!

ai-inference deep-learning generative-ai inference-platform llm llm-inference
LEANN
⭐ 8095 Python Score 75 Updated 2026-01-02

RAG on Everything with LEANN. Enjoy 97% storage savings while running a fast, accurate, and 100% private RAG application on your personal device.

ai faiss gpt-oss langchain llama-index llm
airweave
⭐ 5524 Python Score 75 Updated 2026-01-06

Context retrieval for AI agents across apps and databases

agents knowledge-graph llm llm-agent rag search
scikit-learn
⭐ 64489 Python Score 73 Updated 2026-01-06

scikit-learn: machine learning in Python

data-analysis data-science machine-learning python statistics
OpenBB
⭐ 57268 Python Score 73 Updated 2026-01-05

Financial data platform for analysts, quants and AI agents.

ai crypto derivatives economics equity finance
streamlit
⭐ 42966 Python Score 73 Updated 2026-01-06

Streamlit: A faster way to build and share data apps.

data-analysis data-science data-visualization deep-learning developer-tools machine-learning
gradio
⭐ 41207 Python Score 73 Updated 2026-01-06

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!

data-analysis data-science data-visualization deep-learning deploy gradio
mlflow
⭐ 23560 Python Score 73 Updated 2026-01-06

The open source developer platform to build AI agents and models with confidence. Enhance your AI applications with end-to-end tracking, observability, and e...

agentops agents ai ai-governance apache-spark evaluation
browser-use
⭐ 74689 Python Score 73 Updated 2026-01-05

🌐 Make websites accessible for AI agents. Automate tasks online with ease.

ai-agents ai-tools browser-automation browser-use llm playwright
OpenHands
⭐ 66275 Python Score 73 Updated 2026-01-06

πŸ™Œ OpenHands: AI-Driven Development

agent artificial-intelligence chatgpt claude-ai cli developer-tools
unsloth
⭐ 50386 Python Score 73 Updated 2026-01-05

Fine-tuning & Reinforcement Learning for LLMs. πŸ¦₯ Train OpenAI gpt-oss, DeepSeek, Qwen, Llama, Gemma, TTS 2x faster with 70% less VRAM.

agent deepseek deepseek-r1 fine-tuning gemma gemma3
mem0
⭐ 45032 Python Score 73 Updated 2026-01-03

Universal memory layer for AI Agents

agents ai ai-agents application chatbots chatgpt
PaddleNLP
⭐ 12893 Python Score 73 Updated 2025-12-17

Easy-to-use and powerful LLM and SLM library with awesome model zoo.

bert compression distributed-training document-intelligence embedding ernie
segmentation_models.pytorch
⭐ 11237 Python Score 73 Updated 2025-12-23

Semantic segmentation models with 500+ pretrained convolutional and transformer-based backbones.

computer-vision deeplab-v3-plus deeplabv3 dpt fpn image-processing
cognee
⭐ 10777 Python Score 73 Updated 2026-01-04

Memory for AI Agents in 6 lines of code

ai ai-agents ai-memory cognitive-architecture cognitive-memory context-engineering
ComfyUI
⭐ 99165 Python Score 73 Updated 2026-01-06

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

ai comfy comfyui python pytorch stable-diffusion
nni
⭐ 14324 Python Score 72 Updated 2024-07-03

An open source AutoML toolkit for automating the machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper...

automated-machine-learning automl bayesian-optimization data-science deep-learning deep-neural-network
tensorzero
⭐ 10767 Rust Score 69 Updated 2026-01-06

TensorZero is an open-source stack for industrial-grade LLM applications. It unifies an LLM gateway, observability, optimization, evaluation, and experimentation.

ai ai-engineering anthropic artificial-intelligence deep-learning genai
anything-llm
⭐ 52949 JavaScript Score 69 Updated 2026-01-06

The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, MCP compatibility, and more.

ai-agents custom-ai-agents deepseek kimi llama3 llm
memvid
⭐ 10868 Rust Score 69 Updated 2026-01-05

Memory layer for AI Agents. Replace complex RAG pipelines with a serverless, single-file memory layer. Give your agents instant retrieval and long-term memory.

ai context embedded faiss knowledge-base knowledge-graph