What is Drop-out?
- Drop-out is one of the methods to reduce overfitting in neural networks.
- Drop-out is not the only method to avoid overfitting.
- There are various other methods, including regularization. Weight-decay techniques such as L2 regularization are simple to implement and can suppress overfitting to some extent, but when a neural network model becomes complex, weight decay alone often cannot cope with overfitting.
- This is when the drop-out technique becomes useful.
- You can find more information about overfitting, regularization, and more in my previous post:
- Drop-out randomly deactivates some neurons.
- By randomly deactivating some neurons, drop-out prevents the model from relying too heavily on specific neurons during training (a minimal sketch of this masking appears after this list).
- This can also be understood as a way to strengthen robustness by probabilistically injecting noise into the model's training process.
- To predict targets well even when some neurons are deactivated, all neurons must learn meaningful patterns without overly depending on specific neurons.
- As neurons detect patterns in the training set more evenly, overall generalization performance improves.
- Since drop-out is a technique that is only applied during model training, it is not applied during testing or in production.
- As a result, the output values in testing and production are, on average, higher than the output values during training, so in principle the outputs should be scaled down at test time to compensate.
- This is because more neurons are active, and each active neuron contributes its weighted output to the sum.
- Multiplying the test-time outputs by the keep probability (1 - dropout rate) brings them to a scale similar to the outputs seen during training.
- TensorFlow and most other deep learning frameworks solve this problem in the opposite way.
- That is, during training they scale up the surviving neurons' outputs by dividing them by the keep probability (known as inverted dropout).
- Although in principle the scaling could instead be applied at test time, this approach works just as well and keeps the inference path unchanged.
- Additionally, dropout layers have no learnable weights.
- They simply set a random subset of neurons' outputs to 0 and scale up the remaining outputs by dividing them by the keep probability; both behaviors are sketched below.
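To make the masking and scaling behavior concrete, here is a minimal NumPy sketch of inverted dropout. The batch size, layer width, and 0.5 dropout rate are arbitrary choices for illustration, not values taken from any particular framework:

```python
import numpy as np

rng = np.random.default_rng(42)

dropout_rate = 0.5                      # probability of deactivating a neuron
activations = rng.normal(size=(4, 8))   # a batch of 4 samples, 8 neurons

# During training: randomly deactivate neurons by multiplying with a 0/1 mask.
mask = rng.random(activations.shape) >= dropout_rate
dropped = activations * mask

# Inverted dropout: divide the surviving outputs by the keep probability
# so their expected value matches what we will see at test time.
keep_prob = 1.0 - dropout_rate
train_output = dropped / keep_prob

# During testing / in production: no masking, no scaling -- use activations as-is.
test_output = activations
```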
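For comparison, the same behavior in TensorFlow/Keras: `tf.keras.layers.Dropout` zeroes entries and rescales only when called with `training=True`, and the layer holds no learnable weights. This is just an illustrative snippet with an arbitrary rate of 0.5:

```python
import tensorflow as tf

dropout = tf.keras.layers.Dropout(rate=0.5)  # the dropout layer itself owns no weights
x = tf.ones((1, 8))

train_out = dropout(x, training=True)    # entries are either 0.0 or 1 / (1 - 0.5) = 2.0
test_out = dropout(x, training=False)    # identity: dropout is disabled at inference

print(train_out.numpy())
print(test_out.numpy())
print(dropout.trainable_weights)         # [] -- no learnable parameters
```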
Ensemble learning is closely related to drop-out. This is because drop-out's action of randomly deleting neurons during training can be interpreted as training a different model each time. In other words, drop-out can be thought of as implementing the effect of ensemble learning within a single network.
Win or learn. There is no losing.
- Conor McGregor -