AWS SAGEMAKER AUTOML PREDICT

Runs batch inference against AWS SageMaker AutoML-trained models by deploying endpoints (or reusing active ones), scoring the prediction dataset, and optionally tearing down endpoints and artifacts to control costs. Use this worker downstream of aws_sagemaker_automl_learn whenever you need to generate predictions from previously trained SageMaker AutoML models.

When to use

Tagged: automl, aws, batch, cloud, endpoint, inference, ml, predict.

Inputs

Label ID Type Default Required Description
Prediction Dataset prediction_dataset dataset Tabular dataset to score; must have the same column structure (features) as the training dataset used in aws_sagemaker_automl_learn — the target column may be omitted.
Models models dataset   Models metadata dataset produced by the aws_sagemaker_automl_learn worker, containing columns model_name, target_feature, input_features, and job_name; required when no active_endpoints dataset is supplied.
Active Endpoints active_endpoints dataset   Previously created and still-live SageMaker endpoint metadata dataset (same schema as the active_endpoints output of this worker); supply instead of models to skip endpoint creation and reuse running endpoints.
Cleanup Endpoints cleanup_endpoints boolean True   Boolean flag (default: true) — when true, deletes the SageMaker endpoints after prediction completes to avoid ongoing inference costs; set to false only when you intend to reuse the endpoints via the active_endpoints output.
Cleanup Models and Files cleanup_models_and_files boolean False   Boolean flag (default: false) — when true, deletes the SageMaker model artifacts and associated S3 files after prediction; enable only if the model will not be needed for future inference runs.

Outputs

Label ID Type Description
Predictions predictions dataset Tabular dataset of model predictions, one row per input record, with columns identifying the target feature and its predicted value for each model.
Model Prediction Information model_prediction_information dataset Metadata dataset summarising each prediction run, including model name, job name, endpoint name, endpoint config name, input/target features, cleanup status, and any per-model errors.
Active Endpoints active_endpoints dataset Dataset describing the SageMaker endpoints that remain live after execution; only populated when cleanup_endpoints is false — pass this to a subsequent invocation of this worker to skip endpoint creation.
Logs logs dataset Execution log dataset with columns log_type, log_message, and log_time (Unix timestamp), capturing info and error events emitted during the prediction workflow.

Disciplines

  • ai_ml.supervised.classification
  • ai_ml.supervised.regression
  • platform.integration

Auto-generated from platform schema. Worker id: aws_sagemaker_automl_predict. Schema hash: e6ed288462c3. Hand-curated docs in workerexamples/ override this page when present.