Bug 1: inference_config ignored for model customization deployments

deploy() accepted inference_config with ResourceRequirements (accelerator count, memory, CPUs) but never passed it to _deploy_model_customization(). Fixed by forwarding inference_config through deploy() and extracting its values into the CreateInferenceComponent API call, as sketched below.
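A minimal sketch of the fix, assuming inference_config exposes the ResourceRequirements fields named above (memory, CPUs, accelerator count) via a resource_requirements attribute; the class and method bodies are illustrative, not the SDK's actual implementation, though create_inference_component is the real boto3 call.

```python
import boto3


class ModelCustomizationDeployer:
    """Illustrative stand-in for the SDK's deployment path."""

    def __init__(self, model_name: str):
        self.model_name = model_name
        self._sm = boto3.client("sagemaker")

    def deploy(self, endpoint_name: str, inference_config=None):
        # Before the fix, inference_config was accepted here but never
        # forwarded; the fix threads it through to the customization path.
        return self._deploy_model_customization(endpoint_name, inference_config)

    def _deploy_model_customization(self, endpoint_name: str, inference_config):
        # MinMemoryRequiredInMb is required by the API; 1024 MB is the safe
        # default mentioned in the fix for Bug 2 below.
        requirements = {"MinMemoryRequiredInMb": 1024}
        if inference_config is not None:
            rr = inference_config.resource_requirements  # assumed attribute
            if rr.min_memory_mb is not None:
                requirements["MinMemoryRequiredInMb"] = rr.min_memory_mb
            if rr.num_cpus is not None:
                requirements["NumberOfCpuCoresRequired"] = rr.num_cpus
            if rr.num_accelerators is not None:
                requirements["NumberOfAcceleratorDevicesRequired"] = rr.num_accelerators

        # The extracted values land in the CreateInferenceComponent API call.
        return self._sm.create_inference_component(
            InferenceComponentName=f"{endpoint_name}-component",
            EndpointName=endpoint_name,
            Specification={
                "ModelName": self.model_name,
                "ComputeResourceRequirements": requirements,
            },
            RuntimeConfig={"CopyCount": 1},
        )
```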
Bug 2: Incorrect compute requirements causing deployment failures

_fetch_and_cache_recipe_config() used metadata memory values that exceeded SageMaker limits, passed accelerator counts to CPU instances, and used a wrong artifact path for fine-tuned models. Fixed by using safe defaults (1024 MB memory), dynamically querying EC2 for GPU detection/counts, stripping accelerators for CPU instances, and returning a None artifact URL for fine-tuned models (see the sketch below).
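A sketch of the corrected logic under the behaviors listed above; the 1024 MB default, the EC2 lookup, the CPU-instance stripping, and the None artifact URL come from the description, while the helper names (_gpu_count, _compute_requirements, _artifact_url) are hypothetical. The EC2 DescribeInstanceTypes call and its GpuInfo response shape are real.

```python
import boto3

SAFE_MIN_MEMORY_MB = 1024  # safe default instead of oversized metadata values


def _gpu_count(instance_type: str) -> int:
    """Query EC2 for the accelerator count of a SageMaker instance type."""
    ec2 = boto3.client("ec2")
    # EC2 instance type names lack SageMaker's "ml." prefix.
    ec2_type = instance_type.removeprefix("ml.")
    resp = ec2.describe_instance_types(InstanceTypes=[ec2_type])
    gpu_info = resp["InstanceTypes"][0].get("GpuInfo")
    if not gpu_info:
        return 0  # CPU instance: no accelerators present
    return sum(gpu["Count"] for gpu in gpu_info["Gpus"])


def _compute_requirements(instance_type: str) -> dict:
    requirements = {"MinMemoryRequiredInMb": SAFE_MIN_MEMORY_MB}
    gpus = _gpu_count(instance_type)
    if gpus > 0:
        requirements["NumberOfAcceleratorDevicesRequired"] = gpus
    # For CPU instances the accelerator field is stripped entirely rather
    # than passing a count the service would reject.
    return requirements


def _artifact_url(is_fine_tuned: bool, base_url: str | None) -> str | None:
    # Fine-tuned models resolve their weights elsewhere, so no separate
    # artifact URL is returned for them.
    return None if is_fine_tuned else base_url
```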
Bug 3: Base model evaluation uses fine-tuned model weights (sagemaker-train)

The EvaluateBaseInferenceModel step in the LLM-as-Judge pipeline template included ModelPackageConfig with SourceModelPackageArn, causing it to load fine-tuned weights instead of the base model. Fixed by removing ModelPackageConfig from the base model evaluation step so it uses only the base model from the public hub, as illustrated below.
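An illustrative before/after of the template step, shown as Python dicts and using only the keys named above (EvaluateBaseInferenceModel, ModelPackageConfig, SourceModelPackageArn); the surrounding structure, the hub reference key, and the ARN value are assumptions for the example, not the template's actual schema.

```python
# Before the fix: the "base" evaluation step carried a ModelPackageConfig,
# so it loaded the fine-tuned package's weights.
base_eval_step_before = {
    "Name": "EvaluateBaseInferenceModel",
    "Model": {
        "HubContentName": "base-model",  # hypothetical hub reference
        "ModelPackageConfig": {
            "SourceModelPackageArn": (
                "arn:aws:sagemaker:us-east-1:111122223333"
                ":model-package/example"  # placeholder ARN
            ),
        },
    },
}

# After the fix: ModelPackageConfig is removed, so the step resolves the
# model from the public hub entry alone.
base_eval_step_after = {
    "Name": "EvaluateBaseInferenceModel",
    "Model": {
        "HubContentName": "base-model",  # hypothetical hub reference
    },
}
```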
More info: https://tiny.amazon.com/1icjlpkmh
The train integ tests seem to be failing with a ResourceLimitExceeded error. We will need to track that fix separately.