다음 단계는 이 장의 시작 부분에서 했던 것처럼 분류 작업에서 모델을 미세 튜닝하는 것입니
다. 다음처럼 모델을 로드하여 수행할 수 있습니다.
from transformers import AutoModelForSequenceClassification
# 분류를 위해 미세 튜닝합니다.
model = AutoModelForSequenceClassification.from_pretrained("mlm", num_labels=2)
tokenizer = AutoTokenizer.from_pretrained("mlm")
11.4
개체명 인식
이 절에서 사전 훈련된 모델을 개체명 인식
named
-
entitiy
recognition
(
NER
) 작업을 위해 미세 튜닝하
는 과정을 자세히 알아보겠습니다. 이 작업은 전체 문서를 분류하는 것이 아니라 사람이나 위
치 등이 포함된 개별 토큰이나 단어를 분류합니다. 민감한 데이터가 있을 때 익명화하는 작업
에 특별히 도움이 됩니다.
개체명 인식은 이 장의 초반에 살펴본 분류 ...
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month, and much more.
O’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.
Julian F.
Head of Cybersecurity
I wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.
Addison B.
Field Engineer
I’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.
Amir M.
Data Platform Tech Lead
I'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.