You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
feat: allow historical data ingestion based on user defined time (#683)
This PR adds enhancement to use a user provided timestamp for partition
in ingesting logs instead of using server time.
User needs to add custom header X-P-Time-Partition (optional) at stream
creation api to allow ingestion/query using timestamp column from the
log data instead of server time p_timestamp
This is time_partition field name is stored in stream.json and in memory
STREAM_INFO in ingest api. Server checks if timestamp column name exists in
the log event, if not, throw exception. Also, checks if timestamp value can be
parsed into datetime, if not, throw exception arrow file name gets the date,
hr, mm from the timestamp field (if defined in stream) else file name gets
the date, hr, mm from the server time parquet file name gets a random
number attached to it. This is because a lot of log data can have same
date, hr, mm value of the timestamp field and with this random number,
parquet will not get overwritten in the console, query from and to date will
be matched against the value of the timestamp column of the log data (if
defined in the stream), else from and to date will be matched against the
p_timestamp column.
Fixes#671Fixes#685
0 commit comments