의미하는데, 이렇게 작은 지진이 실제로 발생했는지 의문스럽기도 합니다. 값들의 개수가 많은
것으로 보아 이 값은 실제로 발생한 지진 규모 값이라기보다 알 수 없는 지진 규모 값을 임의로
저장한 값일 가능성도 있습니다. 따라서, 이처럼 지진 규모 값이 음수인 데이터는 이상값으로
간주해도 좋습니다.
전체 데이터를 정렬하는 대신 하나 이상의 필드를 기준으로 그룹화를 수행해 해당 그룹 내 이
상값을 찾는 방법도 있습니다. 예를 들어,
place
필드의 특정 지역을 기준으로 규모가 가장 큰
지진과 가장 작은 지진을 확인하고, 해당 지역 내 규모별 지진 발생 횟수도 확인해봅시다.
SELECT
place, mag,
count
(*)
FROM
earthquakes
WHERE
mag
is
not
null
and
place = ‘Northern California’
GROUP
BY
1,2
ORDER
BY
1,2
desc
;
place mag count ...
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month, and much more.
O’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.
Julian F.
Head of Cybersecurity
I wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.
Addison B.
Field Engineer
I’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.
Amir M.
Data Platform Tech Lead
I'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.