Covariate

1 minute read

Covariate

β€œCovariate” is a continues independent variable in a regression or similar model. For instance, if you were modeling number of animals in a given area, you might have covariates such as temperature, season, latitude, altitude, time of day and so on.

톡계λͺ¨λΈμ—λŠ” 독립 λ³€μˆ˜(independent variable)와 쒅속 λ³€μˆ˜(dependent variable)κ°€ μžˆλ‹€. 처음 μ‹€ν—˜μ„ κ³„νšν•  λ•ŒλŠ” μ •ν™•νžˆ μ–΄λ–€ μš”μΈμ΄ 쒅속 λ³€μˆ˜μ— 영ν–₯을 μ£ΌλŠ”μ§€ λͺ¨λ₯΄κΈ° λ•Œλ¬Έμ— μˆ˜μ§‘ν•  수 μžˆλŠ” λͺ¨λ“  μš”μΈμ„ λͺ¨μœΌκ²Œ λœλ‹€. κ·Έλž˜μ„œ μ˜¨λ„, μœ„λ„, 경도, μ‹œκ°„ λ“±λ“± μ–΄μ©Œλ©΄ 쒅속 λ³€μˆ˜μ—λŠ” μ „ν˜€ μ“Έλͺ¨κ°€ 없을지도 λͺ¨λ₯΄λŠ” 값듀을 μˆ˜μ§‘ν•œλ‹€.

<Covariate>λŠ” κ·Έλ ‡κ²Œ μˆ˜μ§‘ν•œ λͺ¨λ“  independent variable을 λ§ν•œλ‹€. ꡬ체적으둠 continuous independent variable을 <Covariate>라고 ν•œλ‹€.

<Covariate>λŠ” 쒅속 λ³€μˆ˜μ— 영ν–₯을 μ£ΌλŠ” λ³€μˆ˜κ°€ 될 μˆ˜λ„ 있고, λ˜λŠ” μ „ν˜€ μ“Έλͺ¨ μ—†λŠ” λ³€μˆ˜κ°€ 될 μˆ˜λ„ μžˆλ‹€. 그것을 νŒλ‹¨ν•˜κΈ° μœ„ν•΄ μ‹€ν—˜ κ²°κ³Όλ₯Ό λΆ„μ„ν•˜λŠ” 것이닀.

μš”μ•½ν•˜λ©΄, <Covariate>λŠ” β€œfeature”라고 말할 수 μžˆμ„ 것 κ°™λ‹€.

Covariate Shift

λ”₯λŸ¬λ‹ μͺ½μ—μ„œ μ“°λŠ” μš©μ–΄μΈλ°, β€œtraining setκ³Ό test set이 λ‹€λ₯Έ 뢄포λ₯Ό κ°€μ§€λŠ” 상황”을 λ§ν•œλ‹€. 사싀 training setκ³Ό test set을 λ‹¨μˆœν•œ ν…Œν¬λ‹‰μœΌλ‘œ λ‚˜λˆ΄λ‹€λ©΄ 이런 ν˜„μƒμ„ λ§ˆμ£Όν•  κ°€λŠ₯성이 크닀. 또, λͺ¨λΈμ„ ν•™μŠ΅ ν–ˆμ„ λ•Œμ™€ λͺ¨λΈμ„ μ„œλΉ„μŠ€ ν•  λ•Œ μž…λ ₯λ˜λŠ” 데이터 μ‚¬μ΄μ—λŠ” Gap이 μžˆμ„ 수 밖에 μ—†λ‹€.

이런 상황을 ν•΄κ²°ν•˜κΈ° μœ„ν•΄ λ”₯λŸ¬λ‹μ—μ„  Batch Normalization이 λ“±μž₯ν–ˆλŠ”λ°β€¦ 잘 λͺ¨λ₯΄κ² λ‹€λ©΄ μ§€κΈˆμ€ λ„˜μ–΄κ°€μž! πŸ˜‰