framework bucketize gts bucketizer
The Geo Time Series™ kept in the Warp 10 platform grow over time as more measurements are added. Some series have very regular measurements,
others more sporadic ones. But when it comes to manipulating the data, it might be handy to be able to impose some kind of regularity to measurements.
This is exactly what the
BUCKETIZE framework does, it provides the tooling for putting the data of a Geo Time Series™ into regularly spaced
A bucket is a time interval which spans a certain number of time units called the bucketspan, ending at a tick called the lastbucket.
As an example, the bucket spanning 10 time units and ending at time units 20 will contain all measurements taken at the following times:
20, 19, 18, 17, 16, 15, 14, 13, 12, 11
the previous bucket with the same bucketspan ends at 10 and covers ticks 10 down to 1. The next bucket ends at 30 and covers ticks 21 to 30.
A bucketized Geo Time Series™ is characterized by its bucketspan, its bucketcount and the lastbucket. A bucketized Geo Time Series™ has at most one measurement per bucket, there might be buckets with no measurements.
BUCKETIZE framework is used to convert a non bucketized Geo Time Series™ into a bucketized one. The bucketization process collects
the measurements of the original geo time series which fall in each bucket and apply a bucketizer function on those data, thus leading to at most a
single measurement for each bucket.
BUCKETIZE framework comes with a number of bucketizer which implement very common aggregation functions such as SUM, MIN, MAX, MEAN, etc.