1.4 KiB
1.4 KiB
Specification Dilemma Experiment
Files
config.yaml— LMStudio endpoint, model, generation and analysis parametersprompts/sparse.json— 30 sparse promptsprompts/dense.json— 30 dense prompts (matched to sparse by index)smoke_test.py— pre-flight: connectivity, seed-honoring, per-generation latencygenerate.py— runs completions against LMStudioembed.py— sentence embeddingssimilarity.py— pairwise cosine similaritiesstats.py— t-test, Mann-Whitney, bootstrap, Cohen's dplot.py— violin plotrun_all.py— orchestrator (runs the five pipeline scripts in order)pyproject.toml,uv.lock— uv-managed environmentrequirements.txt— pip fallbackoutputs/{sparse,dense}/NN.txt— model completions (generated)embeddings/{sparse,dense}.npy— L2-normalized embedding matrices (generated)results/pairwise.csv,results/stats.json,results/plot.png— analysis artifacts (generated)
Setup
- Install LMStudio, load a strong instruction-tuned model, start the local server.
uv sync- Edit
config.yamlfor your LMStudio host, port, and model name. uv run python smoke_test.py— verifies the endpoint and reports whetherseedis honored.
Run
uv run python run_all.py
Or step-by-step: generate.py → embed.py → similarity.py → stats.py → plot.py.