끄적끄적 — 끄적끄적

Hold-out vs Cross-validation 차이

2023.07.16

1. Hold-out Hold-out 방법은 데이터를 train, test set으로 나누고 train set에서 학습된 모델을 test set에서 성능을 평가하는 방식이다(held-out method라고도 함) 2. Cross-validation Cross-validation은 k-fold cross validation이라고도 하며, hold-out 방법을 k번 시행하되 매번 새로운 train/test set으로 진행하는 방식이다. 아래 그림에서 처럼 모든 데이터셋이 test set으로 사용될 때 까지 돌린다. * Hold-out vs Cross-validation Hold-out은 데이터셋이 매우 크고 시간이 없거나 PoC 컨셉으로 모델을 테스트해볼 때 유용하다. Cross-validation은 시간..

Reinforcement Learning

[RL] Value Function 구체적으로 생각해보기

2023.07.15

Markov Decision Process(MDP)에서 policy가 존재할 때 value function은 Bellman Equation으로 표현되며, 다음과 같이 두 가지가 있다. The state-value function $\color{red}v_{\pi}(s)$ of an MDP is the expected return from state $s$, and then following policy $\pi$ $$\color{black}v_{\pi}(s) = \mathbb{E}_{\pi}[G_{t}\ |\ S_{t}=s]$$ The action-value function $\color{red}q_{\pi}(s,a)$ is the expected return starting from state $s$,..

개발

Gunicorn & Uvicorn

2023.07.09

1. Gunicorn Gunicorn은 production 레벨에서 사용할 수 있는 WSGI(Web Server Gateway Interface)로 고성능, 쉬운 배포가 특징이다. Flask, Django와 같은 프레임워크들을 서빙할 수 있다. 여러 프로세스로 서버를 돌릴 수 있으며 또한 process manager로도 동작하기 때문에 Uvicorn으로 실행된 어플리케이션을 관리할 수 있다. 2. Uvicorn Uvicorn은 ASGI(Asynchronous Server Gateway Interface) 웹 서버로 Starlette을 추상화하여 개발되었다. ASGI이기 때문에 단일 프로세스에서 비동기로 요청을 처리하기 때문에 WSGI에 비해 매우 많은 요청을 처리할 수 있다. [참고] https://te..

개발

[Nginx] 맥os에 Nginx 설치하기

2023.07.08

맥북에서 Nginx 설치를 해보고자 한다. 맥OS 버전은 13.0 기준이다. 먼저 brew install nginx로 설치해준다. ==> Fetching nginx ==> Downloading https://ghcr.io/v2/homebrew/core/nginx/manifests/1.25.1_1 ... ==> Installing nginx ==> Pouring nginx--1.25.1_1.arm64_ventura.bottle.tar.gz ==> Caveats Docroot is: /opt/homebrew/var/www The default port has been set in /opt/homebrew/etc/nginx/nginx.conf to 8080 so that nginx can run without..

개발

[Nginx] forward & reverse proxy

2023.07.08

Nginx를 사용하기 위해 먼저 proxy server를 이해하고자 한다. 1. Proxy Server 먼저 proxy server는 다음과 같다. A proxy server is a system or router that provides a gateway between users and the internet. Therefore, it helps prevent cyber attackers from entering a private network. It is a server, referred to as an “intermediary” because it goes between end-users and the web pages they visit online... Proxies provide a valua..

python 메모

[pkill] 여러 프로세스 동시에 종료시키기

2023.07.08

Linux 계열에서 pkill을 사용하면 여러 프로세스를 동시에 종료시킬 수 있다. 다음과 같이 3개의 파이썬 스크립트가 돌고 있다고 하자. 이때 pkill -f [파일이름 또는 사용자명]로 다음과 같이 한번에 종료시킬 수 있다. [참고] https://unix.stackexchange.com/questions/57605/i-want-to-kill-all-processes-that-result-from-the-following-command

개발

[fastapi] logging format 설정

2023.07.08

Fastapi에 logging을 설정하는 방법은 다음과 같이 두 가지가 있다. 1. uvicorn으로 설정 log_config = uvicorn.config.LOGGING_CONFIG log_config["formatters"]["access"]["fmt"] = "%(asctime)s - %(levelname)s - %(message)s" log_config["formatters"]["default"]["fmt"] = "%(asctime)s - %(levelname)s - %(message)s" uvicorn.run(app, log_config=log_config) 2. logger로 설정 @app.on_event("startup") async def startup_event(): logger = log..

개발

[Miniconda] CommandNotFoundError: Your shell has not been properly configured to use 'conda activate'. 에러

2023.07.08

Ubuntu에서 Miniconda로 가상환경을 실행시킬 때 다음과 같은 에러가 뜨는 경우가 있다. CommandNotFoundError: Your shell has not been properly configured to use 'conda activate'. To initialize your shell, run $ conda init Currently supported shells are: - bash - fish - tcsh - xonsh - zsh - powershell See 'conda init --help' for more information and options. IMPORTANT: You may need to close and restart your shell after running '..

논문 및 개념 정리

Propensity Score

2023.06.28

Bio Statistics 분야에서 많이 활용되는 개념인 Propensity Score에 대해서 정리하고자 한다. 0. Propensity Score가 필요한 상황 Clinical research 분야에서 Randomized Controlled Trials(RCT)는 꽤 중요한 개념이다. 어떤 집단 A에 treatment에 대한 효과를 통계적으로 검증한다고 했을 때, 가장 이상적인 방법은 집단 A를 반으로 나눠 한쪽은 control group으로, 한쪽은 treatment를 진행한 group으로 나눠 수치를 얻어내는 것이다. 하지만 의료 분야에서는 현실적이지 않을 뿐더러 윤리적으로도 수행할 수 없는 경우가 많다. 흡연자에 대한 treatment를 검증하고자 할 때, 강제적으로 흡연을 시킬 수 없으니 말이..

Reinforcement Learning

[RL] Introduction to Multi-Armed Bandits (1)

2023.06.26

Reinforcement Learning 관련 내용 중 하나인 Multi-Armed Bandits(MAB)에 대한 내용을 정리하고자 한다(논문링크). The Multi-Armed Bandit problem (MAB) is a toy problem that models sequential decision tasks where the learner must simultaneously exploit their knowledge and explore unknown actions to gain knowledge for the future (exploration-exploitation tradeoff)(출처). 0. Introduction: Scope and Motivation 1) Example Multi-arm..

내 블로그 - 관리자 홈 전환	`Q` `Q`
새 글 쓰기	`W` `W`

글 수정 (권한 있는 경우)	`E` `E`
댓글 영역으로 이동	`C` `C`

이 페이지의 URL 복사	`S` `S`
맨 위로 이동	`T` `T`
티스토리 홈 이동	`H` `H`
단축키 안내	`Shift` + `/` `⇧` + `/`

전체 글

Hold-out vs Cross-validation 차이

[RL] Value Function 구체적으로 생각해보기

Gunicorn & Uvicorn

[Nginx] 맥os에 Nginx 설치하기

[Nginx] forward & reverse proxy

[pkill] 여러 프로세스 동시에 종료시키기

[fastapi] logging format 설정

[Miniconda] CommandNotFoundError: Your shell has not been properly configured to use 'conda activate'. 에러

Propensity Score

[RL] Introduction to Multi-Armed Bandits (1)

티스토리툴바

개인정보

단축키

내 블로그

블로그 게시글

모든 영역