
Maker
-
Supporters
-Idea
0.0
Product
0.0
Feedback
0
Roasted
0
Bloom is an open-source framework for automated behavior evaluation of large language models, designed for safety researchers and ML engineers who need reproducible, configurable probing of complex model behaviors. Instead of relying on fixed prompt sets, Bloom grows an evaluation suite from a seed configuration that defines a target behavior, example transcripts, and interaction parameters.
Using a simple seed.yaml and behaviors.json, you can specify behaviors such as sycophancy, political bias, or self-preservation, then automatically generate rich interaction scenarios against models like claude-sonnet-4. Bloom orchestrates multi-turn conversations, logs every pipeline stage to structured JSON, and stores transcripts for further analysis.
Key capabilities include:
.envBy pairing seeds with version-controlled configurations, Bloom enables reproducible safety evaluations, systematic sweeps across behaviors, and rigorous comparison of model variants under evolving behavioral test suites.
Scale globally with less complexity
With Paddle as your Merchant of Record
Compliance? Handled
New country? Done
Local pricing? One click
Payment methods? Tick
Weekly Drops: Launches & Deals