
Maker
-
Supporters
-Idea
0.0
Product
0.0
Feedback
0
Roasted
0
Bloom is an open-source framework for automated behavior evaluation of large language models, designed for safety researchers and ML engineers who need reproducible, configurable probing of complex model behaviors. Instead of relying on fixed prompt sets, Bloom grows an evaluation suite from a seed configuration that defines a target behavior, example transcripts, and interaction parameters.
Using a simple seed.yaml and behaviors.json, you can specify behaviors such as sycophancy, political bias, or self-preservation, then automatically generate rich interaction scenarios against models like claude-sonnet-4. Bloom orchestrates multi-turn conversations, logs every pipeline stage to structured JSON, and stores transcripts for further analysis.
Key capabilities include:
.envBy pairing seeds with version-controlled configurations, Bloom enables reproducible safety evaluations, systematic sweeps across behaviors, and rigorous comparison of model variants under evolving behavioral test suites.
Hyperfocal
Photography editing made easy.
Describe any style or idea
Turn it into a Lightroom preset
Awesome styles, in seconds.
Built by Jon·C·Phillips
Weekly Drops: Launches & Deals