특정 건물을 인식해보자! - Vision Framework, CreateML 흘겨 보기

Notice

Recent Posts

Recent Comments

Link

« 2025/08 »
일	월	화	수	목	금	토
					1	2
3	4	5	6	7	8	9
10	11	12	13	14	15	16
17	18	19	20	21	22	23
24	25	26	27	28	29	30
31

Tags more

Archives

Today

Total

관리 메뉴

Machineboy空

특정 건물을 인식해보자! - Vision Framework, CreateML 흘겨 보기 본문

언어/iOS

특정 건물을 인식해보자! - Vision Framework, CreateML 흘겨 보기

안녕도라 2024. 10. 3. 18:34

카메라로 특정 건물을 비추면, 그 건물을 인식하여 옆에 시설 정보를 띄울 것이다.

사용될 법한 관련 기술을 조사해보겠다.

Vision Framework

이미지나 영상을 처리하는 알고리즘인 듯하다!

대표 기능 다섯가지를 살펴보자.

1) Tracking human and animal body poses or the trajectory of an object

https://developer.apple.com/kr/videos/play/wwdc2020/10653/

Detect Body and Hand Pose with Vision - WWDC20 - 비디오 - Apple Developer

Explore how the Vision framework can help your app detect body and hand poses in photos and video. With pose detection, your app can...

developer.apple.com

2) Recognizing text in 18 different languages

https://developer.apple.com/documentation/vision/extracting-phone-numbers-from-text-in-images?changes=_1

Extracting phone numbers from text in images | Apple Developer Documentation

Analyze and filter phone numbers from text in live capture by using Vision.

developer.apple.com

https://developer.apple.com/kr/videos/play/wwdc2019/234/

Text Recognition in Vision Framework - WWDC19 - 비디오 - Apple Developer

Document Camera and Text Recognition features in Vision Framework enable you to extract text data from images. Learn how to leverage this...

developer.apple.com

3) Detecting faces and face landmarks, such as eyes, nose, and mouth

https://www.kodeco.com/1163620-face-detection-tutorial-using-the-vision-framework-for-ios/page/2

Face Detection Tutorial Using the Vision Framework for iOS

In this tutorial, you’ll learn how to use Vision for face detection of facial features and overlay the results on the camera feed in real time.

www.kodeco.com

4) Performing hand tracking to enable new device interactions

아마도 visionOS와 같은 new device와도 연동된다는 뜻이려나..? 마땅한 예제를 찾지 못했다.

5) Calculating an aesthetics score to determine how memorable a photo is

https://www.createwithswift.com/scoring-the-aesthetics-of-an-image-with-the-vision-framework/

Scoring the aesthetics of an image with the Vision framework

Learn how to calculate the overall aesthetic score of an image with the Vision framework in a SwiftUI app.

www.createwithswift.com

가장 인상적인 기능인데 미감 점수를 매겨준다. 어떤 기준인지는 더 살펴봐야겠지만 신기방기.

+ 누끼 기능

추가로 Vision Framework을 활용해 구현할 수 있는 대표적인 기능이 누끼이다..

https://developer.apple.com/documentation/vision/applying-matte-effects-to-people-in-images-and-video

Applying Matte Effects to People in Images and Video | Apple Developer Documentation

Generate image masks for people automatically by using semantic person-segmentation.

developer.apple.com

more than 25 requests available

Still-image analysis
Image sequence analysis
Image aesthetics analysis
Saliency analysis : 관심있는 물체를 관심이 없는 배경(background)로 부터 분리시키는 것
Object tracking : https://mr-waguwagu.tistory.com/48
Face and body detection
Body and hand pose detection
3D body pose detection
Text detection
Barcode detection
Trajectory, contour, and horizon detection
- contour: 동일한 색 또는 동일한 색상 강도(color intensity)를 가진 부분의 가장 자리 경계를 연결한 선
Animal detection
Optical flow and rectangle detection
- 빛의 패널의 가시적인 움직임, https://velog.io/@yoorachoi/%EC%BB%B4%ED%93%A8%ED%84%B0-%EB%B9%84%EC%A0%84-Optical-Flow-Lukas-Kanade-Method-%EC%A4%91%EC%8B%AC%EC%9D%98-%EA%B0%9C%EB%85%90-OpenCV-%EA%B5%AC%ED%98%84
Image alignment
Image feature print and background removal
Machine learning image analysis

왠지 쓰게 될 것 같은 기술들에 색표시를 해봤다.

Vision 프레임 워크 재밌는 기술인 것 같다!! 컴퓨터그래픽스 더 공부해보고 싶다.

CreateML

가장 궁금한건 Image classification과 Object detection의 차이점

우선 두 기술의 관련 예제들을 가져왔다!

https://developer.apple.com/documentation/createml/creating-an-image-classifier-model

Creating an Image Classifier Model | Apple Developer Documentation

Train a machine learning model to classify images, and add it to your Core ML app.

developer.apple.com

https://developer.apple.com/documentation/createml/building-an-object-detector-data-source

Building an object detector data source | Apple Developer Documentation

Arrange your training data for an object detector in one of several different structured ways.

developer.apple.com

Image Classification : 이미지 분류

responds with a category label for that image.
Use at least 10 images per category, but keep in mind that an image classifier performs better with a more diverse set of images. Consider including images of each category from multiple angles and in different lighting conditions.
Balance the number of images for each category. For example, don’t use 10 images for one category and then 1000 images for another.

모델을 학습시켜, 새로운 이미지에 label을 붙여 분류하는 작업!

활용하게 된다면 A,B,C 건물을 데이터를 넣어 모델을 만들고 카메라로 건물을 비췄을 때 기존 이미지들과 비교해서 라벨을 붙여주는 프로세스일듯!

Object Detection : 물체 검출

Image: A collection of images that contain objects to train from.
Annotations: Structured text data that defines the bounding box and labels of each object within the data set.

annotation을 custom해서 시설정보를 띄워주면 될 것 같다!

근데 스크린샷이 아니라, 카메라로 실시간으로 비추면서 이것을 인식하려면 어떻게 해야하는거지...wow

우선 기술조사 결과 정리를 해보자면!

image classification 모델을 만들고, object detection을 실행하여 Annotation 정보를 custom해 화면에 띄운다..
input이 사진이 아닌, 실시간 영상을 계속 detect하고 있으려면...? 내일 다시 찾아보리!

'언어 > iOS' 카테고리의 다른 글

[AR Navigation 개발일지#1] ARView의 물체의 위치를 실제 위도, 경도 좌표로 변환해 표시해보자! (2)	2024.10.09
Core Location을 파헤쳐 보자! (3)	2024.10.09
RealityKit SceneReconstruction, 환경 인식 관련 (1)	2024.10.02
Reality Composer Pro Animations, 애니메이션 있는 usdz파일 넣기 (2)	2024.10.01
Augmented Reality: ARKit의 대략적 원리(VIO, 평면 감지, 오브젝트 분류) (0)	2024.07.07

'언어/iOS' Related Articles

Machineboy空

특정 건물을 인식해보자! - Vision Framework, CreateML 흘겨 보기 본문

특정 건물을 인식해보자! - Vision Framework, CreateML 흘겨 보기

Vision Framework

1) Tracking human and animal body poses or the trajectory of an object

2) Recognizing text in 18 different languages

3) Detecting faces and face landmarks, such as eyes, nose, and mouth

4) Performing hand tracking to enable new device interactions

5) Calculating an aesthetics score to determine how memorable a photo is

+ 누끼 기능

CreateML

'언어 > iOS' 카테고리의 다른 글

티스토리툴바