Get started with SageMaker Debugger¶
Real-time analysis¶
MXNet¶
PyTorch¶
- Using SageMaker Debugger and SageMaker Experiments for iterative model pruning
- Using SageMaker Debugger and SageMaker Experiments for iterative model pruning
- Using Amazon SageMaker Debugger with your own PyTorch container
TensorFlow¶
TensorFlow 1.x¶
- Amazon SageMaker Debugger - Using built-in rule
- Detect Stalled Training and Stop Training Job Using SageMaker Debugger Rule
- Amazon SageMaker Debugger - Reacting to Cloudwatch Events from Rules
- Amazon SageMaker - Debugging with custom rules
XGBoost¶
- Debugging XGBoost Training Jobs with Amazon SageMaker Debugger Using Rules
- Debugging XGBoost training jobs in real time with Amazon SageMaker Debugger
- Explainability with Amazon SageMaker Debugger
Bring your own container¶
- Build a Custom Training Container and Debug Training Jobs with Amazon SageMaker Debugger
- Step 1: Prepare prerequisites
- Step 2: Prepare a Dockerfile and register the Debugger hook to you training script
- Step 3: Create a Docker image, build the Docker training container, and push to Amazon ECR
- Step 4: Use Amazon SageMaker to set the Debugger hook and rule configuration
- Step 5. Define a SageMaker Estimator object with Debugger and initiate a training job
- Step 6: Retrieve output tensors using the smdebug trials class
- Step 7: Analyze the training job using the smdebug
trialmethods and the Debugger rule job status - Notebook Summary and Other Applications