This post interprets the model problems that show up in loss curves and summarizes the corresponding debugging methods.
Interpreting Loss Curves
Ideal loss curve | Interpretation |
---|---|
*(plot: loss starts high, decays smoothly, and flattens at a minimum)* | As the number of training steps increases, loss begins high, then decreases exponentially, and ultimately flattens out to reach a minimum loss. |
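For reference, here is a minimal sketch, assuming TensorFlow/Keras and matplotlib (the toy regression data and the small model are placeholders), of producing such a loss curve from training history:

```python
import matplotlib.pyplot as plt
import numpy as np
import tensorflow as tf

# Toy regression data (placeholder); any dataset works the same way.
rng = np.random.default_rng(0)
X = rng.normal(size=(1000, 8)).astype("float32")
y = (X @ rng.normal(size=(8,)) + rng.normal(scale=0.1, size=1000)).astype("float32")

model = tf.keras.Sequential([
    tf.keras.layers.Dense(16, activation="relu"),
    tf.keras.layers.Dense(1),
])
model.compile(optimizer="adam", loss="mse")

# Keras records the per-epoch training loss in history.history["loss"].
history = model.fit(X, y, epochs=50, batch_size=32, verbose=0)

plt.plot(history.history["loss"])
plt.xlabel("training epoch")
plt.ylabel("loss")
plt.title("Loss curve")
plt.show()
```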
Debugging Methods
After testing a model, debugging can be approached from two angles: the data and the model.
- Data Debugging
  - Data validation with a data schema: verification of the raw data (see the sketch after this list).
    - Data schema: rules for the expected statistics of the data.
  - Ensure splits are good quality
    - Verify that the train/test sets are statistically equivalent and that the split ratio stays consistent.
  - Test engineered data: verification of the actual data fed into the model.
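A rough illustration of these data-side checks, assuming pandas; the `SCHEMA` rules, column names, and the 10% tolerance are hypothetical examples, not fixed recommendations:

```python
import pandas as pd

# Hypothetical data schema: expected statistics per raw-data column.
SCHEMA = {
    "age":    {"min": 0, "max": 120, "max_null_frac": 0.01},
    "income": {"min": 0, "max": 1e7, "max_null_frac": 0.05},
}

def validate_schema(df: pd.DataFrame) -> list:
    """Return a list of schema violations found in the raw data."""
    problems = []
    for col, rules in SCHEMA.items():
        null_frac = df[col].isna().mean()
        if null_frac > rules["max_null_frac"]:
            problems.append(f"{col}: {null_frac:.1%} null values")
        if df[col].min() < rules["min"] or df[col].max() > rules["max"]:
            problems.append(f"{col}: values outside [{rules['min']}, {rules['max']}]")
    return problems

def splits_look_equivalent(train: pd.Series, test: pd.Series, tol: float = 0.1) -> bool:
    """Crude check that train/test means and stds agree within a tolerance."""
    scale = train.std()
    return (abs(train.mean() - test.mean()) <= tol * scale
            and abs(train.std() - test.std()) <= tol * scale)
```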
- Model Debugging
  - Check that the data can predict the labels
    - Verify that the features used by the model carry predictive signal, e.g., via correlation matrices (see the sketch after this list).
  - Establish a baseline
    - Judge model performance by comparing it against a baseline built from a simple heuristic during model development.
  - Write unit tests for the ML code to detect bugs.
  - Adjust your hyperparameter values (learning rate, regularization, training epochs, batch size, depth/width).
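A minimal sketch of the first two model-side checks, assuming pandas/NumPy; the feature names and the mean-label heuristic are illustrative placeholders:

```python
import numpy as np
import pandas as pd

# Placeholder dataset: two features plus a numeric label column.
rng = np.random.default_rng(0)
df = pd.DataFrame({"feature_a": rng.normal(size=500),
                   "feature_b": rng.normal(size=500)})
df["label"] = 2.0 * df["feature_a"] + rng.normal(scale=0.5, size=500)

# 1) Predictive signal: near-zero correlations for every feature suggest
#    the data cannot predict the labels.
print(df.corr()["label"].drop("label"))

# 2) Heuristic baseline: always predict the mean label. A trained model
#    should beat this MSE; if it does not, debug the model or pipeline.
baseline_mse = ((df["label"] - df["label"].mean()) ** 2).mean()
print("baseline MSE:", baseline_mse)
```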
Loss Curves & Debugging Methods
Loss curves & interpretation | Actions that could fix the problem described |
---|---|
Model is not converging (the loss oscillates): an unstable training process | Data debugging:<br>- Check whether the features can predict the labels.<br>- Simplify your dataset to 10 examples that you know your model can predict, obtain a very low loss on the reduced dataset, then continue debugging your model on the full dataset.<br>Model debugging:<br>- Reduce your learning rate.<br>- Simplify your model and ensure it outperforms your baseline, then incrementally add complexity back to the model. |
An exploding loss: the loss decreases up to a certain number of training steps and then suddenly increases with further training | Data debugging for the raw data:<br>- Check for anomalous values in the input data (NaNs, exploding gradients caused by anomalous data, division by zero, logarithm of zero or of a negative number).<br>Data debugging for the engineered data:<br>- Check for anomalous data in the batches and in the engineered data.<br>- If the cause is outlying data, shuffle the data so that outliers are evenly distributed between batches. |
Contradictory metrics: an ideal loss curve, but recall is stuck at 0 | Model debugging:<br>- The examples' classification probability never exceeds the threshold* for positive classification (this often occurs with a large class imbalance) ⇒ lower your classification threshold.<br>- Check threshold-invariant metrics such as AUC. |
Overfitting: testing loss is much higher than training loss | Data debugging:<br>- Check that the training and test splits are statistically equivalent.<br>Model debugging:<br>- Reduce model capacity.<br>- Add regularization. |
Model gets stuck: repetitive, step-like behavior of the loss | Data debugging:<br>- Check whether the input data itself exhibits repetitive behavior ⇒ shuffle the data to remove it. |
* tf.keras default threshold for positive classification is 0.5.
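The sketches below illustrate the fixes from the table, row by row; all data, layer sizes, and constants are placeholder assumptions. First, for a non-converging loss: reduce the learning rate and confirm the model can drive the loss near zero on a 10-example subset before returning to the full dataset:

```python
import numpy as np
import tensorflow as tf

X_train = np.random.randn(1000, 4).astype("float32")  # placeholder data
y_train = np.random.randn(1000).astype("float32")

model = tf.keras.Sequential([tf.keras.layers.Dense(32, activation="relu"),
                             tf.keras.layers.Dense(1)])
# A reduced learning rate often stabilizes an oscillating loss.
model.compile(optimizer=tf.keras.optimizers.Adam(learning_rate=1e-4), loss="mse")

# Sanity check: obtain a very low loss on just 10 examples first,
# then continue debugging on the full dataset.
X_tiny, y_tiny = X_train[:10], y_train[:10]
model.fit(X_tiny, y_tiny, epochs=500, verbose=0)
print("tiny-set loss:", model.evaluate(X_tiny, y_tiny, verbose=0))
```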
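For an exploding loss, a quick NumPy check for anomalous values in a batch might look like this; the 1e6 magnitude cutoff is an arbitrary example to tune for your data:

```python
import numpy as np

def check_batch(x: np.ndarray, y: np.ndarray) -> None:
    """Flag values that commonly cause an exploding loss."""
    assert np.isfinite(x).all(), "NaN or inf found in features"
    assert np.isfinite(y).all(), "NaN or inf found in labels"
    # Extreme magnitudes are a common source of exploding gradients.
    if np.abs(x).max() > 1e6:
        print("warning: anomalously large feature value:", np.abs(x).max())
```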
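For contradictory metrics, tf.keras lets you track recall at a lowered threshold alongside a threshold-invariant metric; the 0.2 threshold below is only an example:

```python
import tensorflow as tf

# With a large class imbalance, predicted probabilities may never exceed
# the default 0.5 threshold, so recall stays at 0 even when AUC looks fine.
metrics = [
    tf.keras.metrics.Recall(thresholds=0.5, name="recall_default"),
    tf.keras.metrics.Recall(thresholds=0.2, name="recall_lowered"),
    tf.keras.metrics.AUC(name="auc"),  # threshold-invariant
]
model = tf.keras.Sequential([tf.keras.layers.Dense(1, activation="sigmoid")])
model.compile(optimizer="adam", loss="binary_crossentropy", metrics=metrics)
```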
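For overfitting, reducing capacity and adding regularization in tf.keras can be as simple as the following; the layer size and L2 strength are placeholder values:

```python
import tensorflow as tf

# Smaller layers reduce capacity; kernel_regularizer adds an L2 penalty.
model = tf.keras.Sequential([
    tf.keras.layers.Dense(16, activation="relu",
                          kernel_regularizer=tf.keras.regularizers.l2(1e-4)),
    tf.keras.layers.Dense(1, activation="sigmoid"),
])
```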
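Finally, for the stuck, step-like case, shuffling with tf.data is one way to remove repetitive ordering; the buffer size and placeholder arrays are illustrative:

```python
import numpy as np
import tensorflow as tf

features = np.random.randn(10_000, 4).astype("float32")  # placeholder data
labels = np.random.randint(0, 2, size=10_000)

# A shuffle buffer comparable to the dataset size spreads repeated
# patterns and outliers evenly across batches.
dataset = (tf.data.Dataset.from_tensor_slices((features, labels))
           .shuffle(buffer_size=10_000)
           .batch(32))
```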
Source & Reference: Interpreting Loss Curves | Testing and Debugging in Machine Learning