Web UI Benchmark

A side-by-side comparison of how different models handle the same UI prompts.