The hubExamples package provides three data sets that contain example model output and
target data for an example forecast hub: forecast_outputs, forecast_target_ts, and
forecast_oracle_output.
forecast_outputs contains example model outputs from a forecast hub,
with predictions for three influenza-related targets (wk inc flu hosp, wk flu hosp rate category,
and wk flu hosp rate) for two reference dates in 2022. This dataset reflects forecast outputs as they look
when retrieved from a hub via the hubData package, which differs slightly from how they look
when originally submitted by modelers.
forecast_target_ts contains time series target data from a hub that predicts influenza-related targets.
forecast_oracle_output represents predictions that would have been
generated by an "oracle model" that knew the observed data values in advance.
This example covers the influenza-related targets corresponding to those in
forecast_outputs.
Format
forecast_outputs
A data frame with 10,224 rows and 9 columns:
- model_id
the name of the model
- reference_date
the starting point of the forecast in yyyy-mm-dd format
- target
a unique identifier for the target
- horizon
number of time units ahead being forecasted relative to the
reference_date, in units of weeks
- location
FIPS code identifying a location
- target_end_date
the date of occurrence of the outcome of interest in yyyy-mm-dd format; this can be calculated directly from the
reference_date and horizon as follows: target_end_date = reference_date + 7*horizon
- output_type
the type of representation of the prediction
- output_type_id
more identifying information specific to the output type;
output_type_id is not relevant for every kind of output_type (for example, hubs will not expect output_type_id values when the output_type is mean or median)
- value
the model’s prediction
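The relationship between reference_date, horizon, and target_end_date described above can be checked with a short sketch. It is written in Python using only the standard library and does not use the package's own tooling; the helper name target_end_date_from and the reference date and horizons below are illustrative assumptions, not rows taken from forecast_outputs.

```python
from datetime import date, timedelta


def target_end_date_from(reference_date: date, horizon: int) -> date:
    """Shift reference_date forward by `horizon` weeks (7 * horizon days)."""
    return reference_date + timedelta(days=7 * horizon)


# Hypothetical inputs, chosen only to illustrate the arithmetic.
reference = date(2022, 11, 19)
for horizon in range(0, 4):
    print(horizon, target_end_date_from(reference, horizon).isoformat())
```

A horizon of 0 returns the reference date itself, and each additional unit of horizon moves the target end date one week later.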
forecast_target_ts
A data frame with 10,255 rows and 3 columns:
- date
the date of the target observation in yyyy-mm-dd format
- location
FIPS code identifying a location
- observation
a count of hospital admissions in the given
location in the week ending on the given date.
forecast_oracle_output
A data frame with 200,340 rows and 6 columns:
- location
FIPS code identifying a location
- target_end_date
the target's observation date in yyyy-mm-dd format; this is used to match on the
target_end_date field in model output data submitted to the hub
- target
a unique identifier for the target
- output_type
the type of representation of the prediction
- output_type_id
more identifying information specific to the output type; as in the model output data,
output_type_id is not relevant for output_type of mean and median; target data that represents quantile output_type will not have an output_type_id.
- oracle_value
the predicted value that would be generated if the observed data were known with certainty in advance.
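One way to use oracle output is to line it up with model output rows that share the same location, target_end_date, target, and output_type, and then compare value with oracle_value. The sketch below does this in plain Python with made-up rows. The model name, dates, and numbers are hypothetical, the helper attach_oracle is not part of any hub package, and matching on only these four columns is an assumption that suits the mean output type, where output_type_id is not relevant.

```python
# Hypothetical model output rows; column names follow forecast_outputs.
model_rows = [
    {"model_id": "example-model", "location": "25",
     "target_end_date": "2022-12-03", "target": "wk inc flu hosp",
     "output_type": "mean", "value": 120.0},
    {"model_id": "example-model", "location": "48",
     "target_end_date": "2022-12-03", "target": "wk inc flu hosp",
     "output_type": "mean", "value": 900.0},
]

# Hypothetical oracle rows; column names follow forecast_oracle_output.
oracle_rows = [
    {"location": "25", "target_end_date": "2022-12-03",
     "target": "wk inc flu hosp", "output_type": "mean",
     "oracle_value": 100.0},
]

# Assumed match key for output types that have no output_type_id.
KEY = ("location", "target_end_date", "target", "output_type")


def attach_oracle(model_rows, oracle_rows):
    """Inner-join model rows to oracle rows on KEY and add abs_error."""
    lookup = {tuple(r[k] for k in KEY): r["oracle_value"] for r in oracle_rows}
    joined = []
    for row in model_rows:
        key = tuple(row[k] for k in KEY)
        if key in lookup:  # rows without a matching observation are dropped
            oracle_value = lookup[key]
            joined.append({**row,
                           "oracle_value": oracle_value,
                           "abs_error": abs(row["value"] - oracle_value)})
    return joined


joined = attach_oracle(model_rows, oracle_rows)
for row in joined:
    print(row["model_id"], row["location"], row["abs_error"])
```

Output types that do carry an output_type_id would need that column added to the match key.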
