Curated Claude Code catalog
Updated 07.05.2026 · 19:39 CET
01 / Skill
scarletkc

vexor

Quality
9.0

Vexor is a semantic search engine that builds reusable indexes over files and code, enabling discovery by intent rather than exact text. It is ideal for developers and AI coding assistants needing to quickly locate relevant files in large repositories or when file names are unknown.

USP

Vexor offers intent-based semantic search across CLI, Python API, and a desktop app, surpassing traditional grep. It's uniquely designed to enhance both human and AI agent workflows, with configurable embedding and reranking providers for…

Use cases

  • 01Quickly finding code files by their function or purpose
  • 02Enabling AI agents to semantically discover relevant files in a codebase
  • 03Improving developer productivity by reducing time spent on file navigation
  • 04Indexing large code repositories for efficient knowledge retrieval

Detected files (2)

  • plugins/vexor/skills/vexor-cli/SKILL.mdskill
    Show content (2798 bytes)
    ---
    name: vexor-cli
    description: Semantic file discovery via `vexor`. Use whenever locating where something is implemented/loaded/defined in a medium or large repo, or when the file location is unclear. Prefer this over manual browsing.
    ---
    
    # Vexor CLI Skill
    
    ## Goal
    
    Find files by intent (what they do), not exact text.
    
    ## Use It Like This
    
    - Use `vexor` first for intent-based file discovery.
    - If `vexor` is missing, follow [references/install-vexor.md](references/install-vexor.md).
    
    ## Command
    
    ```bash
    vexor "<QUERY>" [--path <ROOT>] [--mode <MODE>] [--ext .py,.md] [--exclude-pattern <PATTERN>] [--top 5] [--format rich|porcelain|porcelain-z]
    ```
    
    ## Common Flags
    
    - `--path/-p`: root directory (default: current dir)
    - `--mode/-m`: indexing/search strategy
    - `--ext/-e`: limit file extensions (e.g., `.py,.md`)
    - `--exclude-pattern`: exclude paths by gitignore-style pattern (repeatable; `.js` → `**/*.js`)
    - `--top/-k`: number of results
    - `--include-hidden`: include dotfiles
    - `--no-respect-gitignore`: include ignored files
    - `--no-recursive`: only the top directory
    - `--format`: `rich` (default) or `porcelain`/`porcelain-z` for scripts
    - `--no-cache`: in-memory only, do not read/write index cache
    
    ## Modes (pick the cheapest that works)
    
    - `auto`: routes by file type (default)
    - `name`: filename-only (fastest)
    - `head`: first lines only (fast)
    - `brief`: keyword summary (good for PRDs)
    - `code`: code-aware chunking for `.py/.js/.ts` (best default for codebases)
    - `outline`: Markdown headings/sections (best for docs)
    - `full`: chunk full file contents (slowest, highest recall)
    
    ## Troubleshooting
    
    - Need ignored or hidden files: add `--include-hidden` and/or `--no-respect-gitignore`.
    - Scriptable output: use `--format porcelain` (TSV) or `--format porcelain-z` (NUL-delimited).
    - Get detailed help: `vexor search --help`.
    - Config issues: `vexor doctor` or `vexor config --show` diagnoses API, cache, and connectivity (tell the user to set up).
    
    ## Examples
    
    ```bash
    # Find CLI entrypoints / commands
    vexor search "typer app commands" --top 5
    ```
    
    ```bash
    # Search docs by headings/sections
    vexor search "user authentication flow" --path docs --mode outline --ext .md --format porcelain
    ```
    
    ```bash
    # Locate config loading/validation logic
    vexor search "config loader" --path . --mode code --ext .py
    ```
    
    ```bash
    # Exclude tests and JavaScript files
    vexor search "config loader" --path . --exclude-pattern tests/** --exclude-pattern .js
    ```
    
    ## Tips
    
    - First time search will index files (may take a minute). Subsequent searches are fast. Use longer timeouts if needed.
    - Results return similarity ranking, exact file location, line numbers, and matching snippet preview.
    - Combine `--ext` with `--exclude-pattern` to focus on a subset (exclude rules apply on top).
    
  • .claude-plugin/marketplace.jsonmarketplace
    Show content (497 bytes)
    {
      "name": "vexor-marketplace",
      "metadata": {
        "description": "Marketplace for the Vexor Claude Code plugin (skills for indexing and semantic search)."
      },
      "owner": {
        "name": "scarletkc"
      },
      "plugins": [
        {
          "name": "vexor",
          "source": "./plugins/vexor",
          "description": "A vector-powered CLI for semantic search over files (Vexor skill bundle).",
          "category": "developer-tools",
          "tags": ["search", "indexing", "cli"],
          "strict": true
        }
      ]
    }
    

README

Vexor

Vexor

Python PyPI CI Codecov License Ask DeepWiki


Vexor is a semantic search engine that builds reusable indexes over files and code. It supports configurable embedding and reranking providers, and exposes the same core through a Python API, a CLI tool, and an optional desktop frontend.

Vexor Demo Video

Featured In

Vexor has been recognized and featured by the community:

Why Vexor?

When you remember what a file does but forget its name or location, Vexor finds it instantly—no grep patterns or directory traversal needed.

Designed for both humans and AI coding assistants, enabling semantic file discovery in autonomous agent workflows.

Install

Download standalone binary from releases (no Python required), or:

pip install vexor  # also works with pipx, uv

Quick Start

0. Guided Setup (Recommended)

vexor init

The wizard also runs automatically on first use when no config exists.

1. Search

vexor "api client config"  # defaults to search current directory
# or explicit path:
vexor search "api client config" --path ~/projects/demo --top 5
# in-memory search only:
vexor search "api client config" --no-cache 

Vexor auto-indexes on first search. Example output:

Vexor semantic file search results
──────────────────────────────────
#   Similarity   File path                       Lines   Preview
1   0.923        ./src/config_loader.py          -       config loader entrypoint
2   0.871        ./src/utils/config_parse.py     -       parse config helpers
3   0.809        ./tests/test_config_loader.py   -       tests for config loader

2. Explicit Index (Optional)

vexor index  # indexes current directory
# or explicit path:
vexor index --path ~/projects/demo --mode code

Useful for CI warmup or when auto_index is disabled.

Desktop App (Experimental)

The desktop app is experimental and not actively maintained. It may be unstable. For production use, prefer the CLI.

GUI

Download the desktop app from releases.

Python API

Vexor can also be imported and used directly from Python:

from vexor import index, search

index(path=".", mode="head")
response = search("config loader", path=".", mode="name")

for hit in response.results:
    print(hit.path, hit.score)

By default it reads ~/.vexor/config.json. For runtime config overrides, cache controls, and per-call options, see docs/api/python.md.

AI Agent Skill

This repo includes a skill for AI agents to use Vexor effectively:

vexor install --skills claude  # Claude Code
vexor install --skills codex   # Codex

Skill source: plugins/vexor/skills/vexor-cli

Configuration

vexor config --set-provider openai          # default; also supports gemini/voyageai/custom/local
vexor config --set-model text-embedding-3-small
vexor config --set-provider voyageai        # uses voyage defaults when model/base_url are unset
vexor config --set-batch-size 0             # 0 = single request
vexor config --set-embed-concurrency 4       # parallel embedding requests
vexor config --set-extract-concurrency 4     # parallel file extraction workers
vexor config --set-extract-backend auto      # auto|thread|process (default: auto)
vexor config --set-embedding-dimensions 1024 # optional, model/provider dependent
vexor config --clear-embedding-dimensions    # reset to model default dimension
vexor config --set-auto-index true          # auto-index before search (default)
vexor config --rerank bm25                  # optional BM25 rerank for top-k results
vexor config --rerank flashrank             # FlashRank rerank (requires optional extra)
vexor config --rerank remote                # remote rerank via HTTP endpoint
vexor config --set-flashrank-model ms-marco-MultiBERT-L-12  # multilingual model
vexor config --set-flashrank-model          # reset FlashRank model to default
vexor config --clear-flashrank              # remove cached FlashRank models
vexor config --set-remote-rerank-url https://proxy.example.com/v1/rerank
vexor config --set-remote-rerank-model bge-reranker-v2-m3
vexor config --set-remote-rerank-api-key $VEXOR_REMOTE_RERANK_API_KEY  # or env var
vexor config --clear-remote-rerank          # clear remote rerank config
vexor config --set-base-url https://proxy.example.com  # optional proxy
vexor config --clear-base-url               # reset to official endpoint
vexor config --show                         # view current settings

Rerank defaults to off. It is highly recommended to configure the Reranker in advance to improve search accuracy. FlashRank requires pip install "vexor[flashrank]" and caches models under ~/.vexor/flashrank.

Config stored in ~/.vexor/config.json.

Configure API Key

vexor config --set-api-key "YOUR_KEY"

Or via environment: VEXOR_API_KEY, OPENAI_API_KEY, GOOGLE_GENAI_API_KEY, or VOYAGE_API_KEY.

Rerank

Rerank reorders the semantic results with a secondary ranker. Candidate sizing uses clamp(int(--top * 2), 20, 150).

Recommended defaults:

  • Keep off unless you want extra precision.
  • Use bm25 for lightweight lexical boosts; it is fast and lightweight.
  • BM25 uses a multilingual tokenizer (Bert pre-tokenizer), so it can handle CJK better.
  • Use flashrank for stronger reranking (requires pip install "vexor[flashrank]" and downloads a model to ~/.vexor/flashrank).
  • Use remote to call a hosted reranker that accepts {model, query, documents} and returns ranked indexes.
  • For Chinese or multi-language content, set --set-flashrank-model ms-marco-MultiBERT-L-12.
  • If unset, FlashRank defaults to ms-marco-TinyBERT-L-2-v2.

Providers: Remote vs Local

Vexor supports both remote API providers (openai, gemini, voyageai, custom) and a local provider (local):

  • Remote providers use api_key and optional base_url.
  • voyageai defaults to https://api.voyageai.com/v1 when base_url is not set.
  • custom is OpenAI-compatible and requires both model and base_url.
  • Local provider ignores api_key/base_url and only uses model plus local_cuda (CPU/GPU switch).

Embedding Dimensions

Embedding dimensions are optional. If unset, the provider/model default is used. Custom dimensions are validated for:

  • OpenAI text-embedding-3-*
  • Voyage voyage-3* and voyage-code-3*
vexor config --set-embedding-dimensions 1024
vexor config --clear-embedding-dimensions

If you change dimensions after an index is built, rebuild the index:

vexor index --path .

Local Model (Offline)

Install the lightweight local backend:

pip install "vexor[local]"

GPU backend (requires CUDA drivers):

pip install "vexor[local-cuda]"

Download a local embedding model and auto-configure Vexor:

vexor local --setup --model intfloat/multilingual-e5-small

Then use vexor search / vexor index as usual.

Local models are stored in ~/.vexor/models (clear with vexor local --clean-up).

GPU (optional): install onnxruntime-gpu (or vexor[local-cuda]) and use vexor local --setup --cuda (or vexor local --cuda). Switch back with vexor local --cpu.

Index Modes

Control embedding granularity with --mode:

ModeDescription
autoDefault. Smart routing: Python/JS/TS → code, Markdown → outline, small files → full, large files → head
nameEmbed filename only (fastest, zero content reads)
headExtract first snippet for lightweight semantic context
briefExtract high-frequency keywords from PRDs/requirements docs
fullChunk entire content; long documents searchable end-to-end
codeAST-aware chunking by module/class/function boundaries for Python and JavaScript/TypeScript; other files fall back to full
outlineChunk Markdown by heading hierarchy with breadcrumbs; non-.md falls back to full

Cache Behavior

Index cache keys derive from: --path, --mode, --include-hidden, --no-recursive, --no-respect-gitignore, --ext, --exclude-pattern.

Keep flags consistent to reuse cache; changing flags creates a separate index.

vexor config --show-index-all    # list all cached indexes
vexor config --clear-index-all   # clear all cached indexes
vexor index --path . --clear     # clear index for specific path

Re-running vexor index only re-embeds changed files; >50% changes trigger full rebuild.

Command Reference

CommandDescription
vexor initRun the interactive setup wizard
vexor QUERYShortcut for vexor search QUERY
vexor search QUERY --path PATHSemantic search (auto-indexes if needed)
vexor index --path PATHBuild/refresh index manually
vexor config --showDisplay current configuration
vexor config --clear-flashrankRemove cached FlashRank models under ~/.vexor/flashrank
vexor local --setup [--model MODEL]Download a local model and set provider to local
vexor local --clean-upRemove local model cache under ~/.vexor/models
vexor local --cudaEnable CUDA for local embeddings (requires onnxruntime-gpu)
vexor local --cpuDisable CUDA and use CPU for local embeddings
vexor install --skills claudeInstall Agent Skill for Claude Code
vexor install --skills codexInstall Agent Skill for Codex
vexor doctorRun diagnostic checks (command, config, cache, API key, API connectivity)
vexor update [--upgrade] [--pre]Check for new version (optionally upgrade; --pre includes pre-releases)
vexor feedbackOpen GitHub issue form (or use gh)
vexor aliasPrint a shell alias for vx and optionally apply it

Common Flags

FlagDescription
--path PATHTarget directory (default: current working directory)
--mode MODEIndex mode (auto/name/head/brief/full/code/outline)
--top K / -kNumber of results (default: 5)
--ext .py,.md / -eFilter by extension (repeatable)
--exclude-pattern PATTERNExclude paths by gitignore-style pattern (repeatable; .js treated as **/*.js)
--include-hidden / -iInclude hidden files
--no-recursive / -nDon't recurse into subdirectories
--no-respect-gitignoreInclude gitignored files
--format porcelainScript-friendly TSV output
--format porcelain-zNUL-delimited output
--no-cacheIn-memory only; do not read/write index cache

Porcelain output fields: rank, similarity, path, chunk_index, start_line, end_line, preview (line fields are - when unavailable).

Documentation

See docs for more details.

Contributing

Contributions, issues, and PRs welcome! Star if you find it helpful.

Star History

Star History Chart

License

MIT