sample operator

Psuedo-randomly selects rows at a rate specified by a parameter. The sampling algorithm is not statistically rigorous but provides a way to explore and understand a dataset. For statistically rigorous analysis summarize should be used.

Syntax

| sample <rate>

Arguments

nametypedescription
Sample RatefloatA number greater than 0 and less than 1 which specifies how many rows are selected for the next operation.

Example

['http-logs']
| sample 0.1 // 10% of the logs go on

Was this page helpful?