Tools & Frameworks
93 tools, frameworks, and platforms mapped to concepts
extensions
-
Claude Chrome Extension
activeA browser extension that brings Claude AI directly into the Chrome browser. Enables users to interact with Claude while browsing, summarize web pages...
retrieval-augmented-generationtool-use -
Continue
activeAn open-source AI code assistant that runs in VS Code and JetBrains. Continue allows developers to use any LLM (local or cloud) for code completion,...
code-executing-agenttool-using-agentmodel-router -
pgvector
activeA PostgreSQL extension for vector similarity search. pgvector adds vector data types and similarity search operators to PostgreSQL, enabling semantic...
vector-searchhybrid-searchembedding
frameworks
-
Astro
activeA modern web framework for building fast, content-focused websites. Astro pioneered the "islands architecture" where interactive components load ind...
edge-deployment -
AutoGen
activeA Microsoft framework for building multi-agent conversational systems where agents communicate through message passing. AutoGen models agent interacti...
multi-agent-systemagent-orchestrationhuman-in-the-loop +1 more -
AutoGPT
activeAn experimental open-source project that chains LLM calls to autonomously achieve goals defined by the user. AutoGPT pioneered the concept of "autonom...
autonomous-agentagent-looptool-using-agent +1 more -
CrewAI
activeA framework for orchestrating role-playing AI agents that work together as a "crew" to accomplish complex tasks. CrewAI emphasizes agent specializatio...
multi-agent-systemsupervisor-agentagent-orchestration +1 more -
Haystack
activeAn open-source framework by deepset for building production-ready LLM applications, with strong focus on RAG and search. Haystack uses a pipeline abst...
retrieval-augmented-generationretrieverworkflow +1 more -
LangChain
activeA framework for developing applications powered by language models. LangChain provides modular abstractions for building chains (sequential LLM calls...
agent-orchestrationretrieval-augmented-generationtool-binding +2 more -
LangGraph
activeA library for building stateful, multi-actor applications with LLMs, built on top of LangChain. LangGraph models agent workflows as graphs where nodes...
agent-orchestrationmulti-agent-systemagent-state +2 more -
Letta (MemGPT)
activeAn agent framework focused on long-term memory and stateful AI assistants. Originally known as MemGPT, Letta implements a memory hierarchy that allow...
agent-memorylong-term-memoryagent-state +1 more -
LlamaIndex
activeA data framework for LLM applications, specializing in connecting LLMs to external data sources. LlamaIndex provides sophisticated indexing, retrieval...
retrieval-augmented-generationchunkingretriever +2 more -
NeMo Guardrails
activeNVIDIA's toolkit for adding programmable guardrails to LLM applications. NeMo Guardrails uses a domain-specific language (Colang) to define conversa...
guardrailsprompt-injectionalignment -
Oh-My-OpenCode
activeA configuration and enhancement framework for OpenCode that provides opinionated defaults, curated skills, and productivity improvements. Similar to...
skillplugincontext-file -
Playwright
activeMicrosoft's open-source browser automation framework for end-to-end testing and web scraping. Playwright supports Chromium, Firefox, and WebKit brows...
tool-useapi-integration -
RAGAS
activeRetrieval Augmented Generation Assessment - a framework for evaluating RAG pipelines. RAGAS provides reference-free metrics to evaluate retrieval qua...
benchmarkfaithfulnessgrounding -
Semantic Kernel
activeA Microsoft SDK for integrating LLMs into applications with enterprise-grade patterns. Semantic Kernel emphasizes plugins (skills), planners, and memo...
agent-orchestrationtool-bindingagent-memory +1 more -
Superpowers (obra/superpowers)
activeAn open-source AI agent framework by Jesse Vincent (obra) focused on building powerful, composable AI assistants. Superpowers provides a modular arch...
agent-orchestrationskilltool-use
librarys
-
Chroma
activeAn open-source embedding database designed for AI applications. Chroma provides a simple API for storing and querying embeddings, with a focus on deve...
vector-searchknowledge-baseembedding -
Guardrails AI
activeA Python framework for validating LLM outputs. Guardrails AI provides validators for checking response quality, safety, and structure. It can automa...
guardrailsstructured-outputhallucination -
Instructor
activeA Python library for structured output extraction from LLMs using Pydantic models. Instructor patches LLM clients to return validated, typed objects i...
structured-outputfunction-calling -
Llama 3
activeMeta's open-weight large language model family released in 2024. Llama 3 models are available in 8B and 70B parameter sizes with competitive perform...
large-language-modelfoundation-modelchat-completions-api -
Llama Guard
activeMeta's safety classifier model for LLM applications. Llama Guard is a fine-tuned Llama model that classifies prompts and responses according to safe...
guardrailsalignmentred-teaming -
MCP Servers
activeA collection of Model Context Protocol (MCP) server implementations that extend AI assistants with various capabilities. The official repository inc...
model-context-protocoltool-definitiontool-binding -
Meta Llama
activeMeta's family of open-weight LLMs, including Llama 2 and Llama 3. Llama models can be downloaded and run locally or fine-tuned, making them the founda...
large-language-modelfoundation-modelinstruction-tuning -
Unsloth
activeA library for fast and memory-efficient LLM fine-tuning. Unsloth optimizes the training loop with custom CUDA kernels, enabling 2-5x faster fine-tuni...
fine-tuningquantization
methodologys
platforms
-
Anthropic
activeAn AI safety company offering the Claude family of LLMs. Known for research on AI alignment and Constitutional AI. Claude models are competitive with ...
large-language-modeltool-usemultimodal-model +2 more -
Arize AI
activeAn ML observability platform that has expanded to support LLM applications. Arize provides monitoring, tracing, and evaluation for both traditional M...
benchmarkhallucinationretriever -
Cloudflare Pages
activeCloudflare's JAMstack platform for deploying static sites and full-stack applications. Pages provides automatic builds from Git repositories, global ...
edge-deployment -
Cloudflare Workers
activeCloudflare's serverless compute platform that runs JavaScript, TypeScript, and WebAssembly at the edge. Workers execute in V8 isolates with milliseco...
edge-deploymentinference-endpointsandbox -
Cohere
activeAn enterprise-focused AI company offering LLMs and embeddings optimized for business applications. Cohere is known for strong embedding models (Embed ...
large-language-modelembeddingreranking +1 more -
Dify
activeAn open-source LLM application development platform with visual workflow builder. Dify provides a no-code/low-code interface for building chatbots, ...
agent-orchestrationworkflowretrieval-augmented-generation +1 more -
E2B
activeA cloud platform providing secure sandboxed environments for AI agents to execute code. E2B offers isolated, ephemeral virtual machines where AI-gen...
sandboxcode-executing-agent -
Elasticsearch
activeA distributed search and analytics engine that now includes vector search capabilities. Originally built for full-text search, Elasticsearch added d...
hybrid-searchvector-searchretriever -
Flowise
activeAn open-source low-code platform for building LLM applications with a drag-and-drop interface. Flowise is built on top of LangChain and LlamaIndex, ...
agent-orchestrationworkflowretrieval-augmented-generation +1 more -
Google Cloud Platform
activeGoogle's comprehensive cloud computing platform offering infrastructure, compute, storage, databases, AI/ML services, and developer tools. GCP inclu...
inference-endpointmodel-servinglarge-language-model -
Google Cloud Vertex AI
activeGoogle Cloud's unified ML platform that includes access to Gemini models, PaLM, and other Google AI capabilities. Vertex AI provides model serving, ...
inference-endpointmodel-servingretrieval-augmented-generation +2 more -
Google Gemini
activeGoogle's multimodal AI model family, available through Vertex AI and the Gemini API. Gemini models are natively multimodal, processing text, images, a...
large-language-modelmultimodal-modelcontext-window +2 more -
Google Stitch
activeGoogle's AI-powered application development platform that enables rapid prototyping and deployment of AI applications. Stitch provides a visual inte...
agent-orchestrationinference-endpoint -
Helicone
activeAn open-source observability platform for LLM applications. Helicone acts as a proxy layer that logs all LLM API calls, providing analytics, caching,...
api-gatewaycachingrate-limiting -
Hugging Face
activeThe central hub for machine learning models, datasets, and applications. Hugging Face provides model hosting, the Transformers library, dataset mana...
foundation-modelfine-tuninginference-endpoint +1 more -
Labelbox
activeA data-centric AI platform for creating and managing training data. Labelbox provides annotation tools, workflow management, and model-assisted label...
annotationtraining-datahuman-feedback -
Langfuse
activeAn open-source LLM engineering platform for observability, analytics, and evaluation. Langfuse provides tracing for LLM applications, prompt manageme...
benchmarkprompt-templatefaithfulness -
LangSmith
activeA developer platform for debugging, testing, evaluating, and monitoring LLM applications. Built by LangChain, LangSmith provides tracing, prompt pla...
benchmarkprompt-templateagent-orchestration -
Milvus
activeAn open-source vector database built for scalable similarity search and AI applications. Milvus supports billion-scale vector search with high perfor...
vector-searchhybrid-searchembedding -
Mistral AI
activeA French AI company known for efficient, open-weight models that punch above their parameter count. Mistral's models are popular for their quality-to-...
large-language-modelfoundation-modelfunction-calling -
Modal
activeA cloud platform for running compute-intensive Python applications, particularly popular for ML inference, training, and AI agent workloads. Modal p...
sandboxinference-endpointmodel-serving -
n8n
activeA workflow automation platform that has added AI capabilities. n8n connects apps, APIs, and AI models with a visual workflow builder. It includes no...
workflowagent-orchestrationapi-integration -
OpenAI
activeThe leading commercial LLM provider, offering GPT-4, GPT-4o, o1, and other models via API. OpenAI defined many standards in the LLM space—their API fo...
large-language-modelfunction-callingembedding +2 more -
Portkey
activeAn AI gateway and observability platform for LLM applications. Portkey provides a unified API to access 200+ LLMs, with built-in features for reliab...
api-gatewaymodel-routerguardrails +1 more -
Qdrant
activeA high-performance open-source vector database written in Rust. Qdrant focuses on speed and efficiency, with advanced filtering capabilities and suppo...
vector-searchhybrid-searchknowledge-base -
Scale AI
activeA data labeling and AI infrastructure platform. Scale AI provides human-in-the-loop data annotation, evaluation services, and enterprise AI infrastru...
annotationhuman-feedbackbenchmark -
Snorkel AI
activeA data-centric AI platform focused on programmatic labeling and data development. Snorkel pioneered weak supervision - using labeling functions to p...
synthetic-datatraining-dataannotation -
Vercel
activeA cloud platform for deploying and hosting web applications with a focus on frontend frameworks and serverless functions. Vercel pioneered the "deplo...
edge-deploymentinference-endpoint -
Weaviate
activeAn open-source vector database with built-in vectorization modules and GraphQL API. Weaviate emphasizes ease of use with automatic embedding generatio...
vector-searchhybrid-searchknowledge-base +1 more -
Weights & Biases
activeAn ML experiment tracking and model management platform that has expanded to support LLM development. W&B provides experiment logging, hyperparameter...
benchmarkfine-tuningtraining-data
plugins
services
-
Amazon Bedrock
activeA fully managed service from AWS that provides access to foundation models from multiple providers through a single API. Bedrock offers models from ...
inference-endpointmodel-servingretrieval-augmented-generation +2 more -
Azure OpenAI Service
activeMicrosoft's enterprise offering of OpenAI models through Azure. Provides access to GPT-4, GPT-3.5, DALL-E, Whisper, and embedding models with Azure'...
inference-endpointmodel-servingembedding +1 more -
Claude 3
activeAnthropic's third-generation model family released in March 2024, featuring three tiers: Haiku (fast/cheap), Sonnet (balanced), and Opus (most capabl...
large-language-modelmultimodal-modelchat-completions-api +1 more -
Claude 3.5
activeAnthropic's mid-generation update to Claude 3, with Claude 3.5 Sonnet released in June 2024. Despite being a "Sonnet" tier model, 3.5 Sonnet matched...
large-language-modelmultimodal-modelchat-completions-api +1 more -
Claude 4
activeAnthropic's fourth-generation model family representing the latest frontier of Claude capabilities. Claude 4 builds on the foundation of Claude 3 wit...
large-language-modelmultimodal-modelchat-completions-api +1 more -
Claude Cowork
activeA collaborative AI workspace from Anthropic that enables multiple users and Claude instances to work together on shared projects. Cowork provides pe...
multi-agent-systemagent-orchestrationcontext-file -
Codeium
activeAn AI code assistant offering free unlimited autocomplete for individual developers. Codeium provides code completion, chat, and search across 70+ l...
code-executing-agentinference-endpoint -
Context7
activeA documentation retrieval service that provides AI coding assistants with up-to-date library and framework documentation. Context7 indexes official ...
retrieval-augmented-generationknowledge-basetool-use -
Doppler
activeA secrets management platform that provides secure storage, syncing, and injection of environment variables and API keys across development, staging,...
api-integration -
Gemini
activeGoogle's multimodal AI model family, successor to PaLM. Gemini models are natively multimodal, trained from the ground up on text, images, audio, an...
large-language-modelmultimodal-modelchat-completions-api +1 more -
GitHub Copilot
activeAn AI-powered code completion and assistance tool integrated into code editors. GitHub Copilot uses OpenAI Codex (and newer models) to suggest code ...
code-executing-agenttool-using-agentinference-endpoint -
GPT-4
activeOpenAI's flagship large language model family released in March 2023. GPT-4 introduced multimodal capabilities (text and image input), significantly ...
large-language-modelmultimodal-modelchat-completions-api +1 more -
GPT-4 Turbo
activeAn optimized variant of GPT-4 released in November 2023 with a 128K context window, updated knowledge cutoff, and significantly lower pricing. GPT-4 ...
large-language-modelmultimodal-modelchat-completions-api +2 more -
GPT-4o
activeOpenAI's omni model released in May 2024, designed to handle text, audio, image, and video inputs with unified architecture. GPT-4o offers GPT-4-leve...
large-language-modelmultimodal-modelchat-completions-api +1 more -
Jina AI
activeA search AI company providing embedding models, rerankers, and search infrastructure. Jina offers open-weight embedding models (jina-embeddings), a ...
embeddingrerankingretriever -
Mistral Large
activeMistral AI's flagship large language model, competing at the frontier with GPT-4 and Claude. Mistral Large offers strong multilingual performance, p...
large-language-modelchat-completions-apifunction-calling -
o1
activeOpenAI's reasoning-focused model family released in late 2024. o1 models use extended "thinking" time to solve complex problems, particularly excell...
large-language-modelreasoningchain-of-thought -
o3
activeOpenAI's advanced reasoning model announced in late 2024, building on the o1 architecture with enhanced reasoning capabilities. o3 achieved breakthro...
large-language-modelreasoningchain-of-thought -
Pinecone
activeA managed vector database service designed for production AI applications. Pinecone handles the infrastructure complexity of vector search at scale, o...
vector-searchhybrid-searchknowledge-base -
Voyage AI
activeA specialized embedding model provider focused on retrieval-optimized embeddings. Voyage AI offers embedding models trained specifically for search ...
embeddingretriever -
Zep
activeA memory layer for AI assistants and agents. Zep provides long-term memory, conversation history management, and user context for LLM applications. ...
long-term-memoryagent-memorysemantic-memory +1 more
tools
-
Axolotl
activeA tool for fine-tuning LLMs with a focus on ease of use and flexibility. Axolotl supports various fine-tuning methods (full, LoRA, QLoRA), multiple ...
fine-tuninginstruction-tuningrlhf -
Claude Code
activeAnthropic's official CLI-based AI coding assistant. Claude Code provides agentic coding capabilities directly in the terminal, including code generat...
code-executing-agenttool-using-agentiterative-agent-loop +2 more -
Claude Desktop
activeAnthropic's desktop application for Claude that supports the Model Context Protocol (MCP). Claude Desktop allows users to connect Claude to local too...
model-context-protocoltool-using-agenthuman-in-the-loop -
Cursor
activeAn AI-first code editor built as a fork of VS Code. Cursor deeply integrates AI capabilities including code completion, chat, multi-file editing, and...
code-executing-agenttool-using-agentretrieval-augmented-generation -
Ghostty
activeA fast, feature-rich terminal emulator written in Zig by Mitchell Hashimoto (HashiCorp co-founder). Ghostty emphasizes performance, correctness, and ...
tool-use -
LiteLLM
activeA unified interface for calling 100+ LLM providers using the OpenAI API format. LiteLLM acts as a translation layer—write code once using the OpenAI f...
api-gatewayopenai-apimodel-router -
llama.cpp
activeA C/C++ implementation of LLM inference optimized for running on consumer hardware without GPUs. llama.cpp pioneered efficient CPU inference and quant...
inferencequantizationedge-deployment -
LM Studio
activeA desktop application for running LLMs locally with a user-friendly GUI. LM Studio provides model discovery, downloads, chat interface, and local API ...
edge-deploymentinferenceopenai-api -
Ollama
activeA tool for running LLMs locally with a simple CLI and API. Ollama makes local LLM deployment accessible by handling model downloads, quantization, and...
edge-deploymentmodel-servinginference +1 more -
OpenCode
activeAn open-source AI coding assistant that runs in the terminal. OpenCode provides agentic coding capabilities including code generation, editing, file ...
code-executing-agenttool-using-agentiterative-agent-loop +1 more -
Promptfoo
activeA test framework for evaluating and red-teaming LLM applications. Promptfoo enables systematic testing of prompts across different models, comparing ...
benchmarkred-teamingguardrails -
Text Generation Inference
activeHugging Face's production-ready inference server for LLMs. TGI provides high-performance serving with features like continuous batching, tensor parall...
model-servinginferencestreaming -
Visual Studio Code
activeMicrosoft's free, open-source code editor that has become the dominant IDE for modern software development. VS Code features an extensive extension ...
tool-use -
vLLM
activeA high-throughput LLM inference and serving library using PagedAttention for efficient memory management. vLLM is the leading open-source solution for...
model-servinginferenceopenai-api +1 more