Releases

August 20, 2024

Datasets

Gentrace test cases in pipelines currently work well when only one engineer is working on the pipeline. However, once more than one engineer is working on the same pipeline, it becomes difficult to manage test data in Gentrace. In practice, engineers end up cloning pipelines or overwriting test data, both of which have significant drawbacks.
To solve this, Gentrace has:

Introduced datasets, which organize test data into separate groups within a pipeline
Migrated existing test data into a default "Golden dataset"
Made existing API routes and SDK methods operate on the "Golden dataset," and created optional parameters or new versions which allow you to specify an alternative dataset.

Please give us feedback on how datasets feel.

Test result settings memory

Settings in any of the test result pages (such as hiding evaluators, collapsing inputs, outputs, or metadata, re-ordering fields, etc) are now remembered across test results in the same pipeline.
This makes it easier to see exactly what you want (and only exactly what you want), without having to redo your work every time.

Improvements [6]

Persistent user-specific view settings, which can be saved and overriden from a URL Filter test runs by their input values Added explicit compare button

Releases

Datasets

Test result settings memory

Evaluate

Experiment

Compare