
All posts (92)

[Paper Review] Open-Vocabulary Affordance Detection in 3D Point Clouds This paper addresses the task of affordance detection. Affordance detection means that, as in the figure above, when an object such as a 'bag' is detected, the model finds which part of the 'bag' must be detected in order to carry out a task such as 'grab'. From a computer vision perspective, it is similar to part segmentation. The paper was published at a robotics conference. IROS'23 https://arxiv.org/abs/2303.02401 Open-Vocabulary Affordance Detection in 3D Point Clouds Affordance detection is a challenging problem with a wide variety of robotic applications. Trad.. 2023. 11. 23.

[Paper Review] 3D Compositional Zero-shot Learning with DeCompositional Consensus This paper performs compositional zero-shot learning for part segmentation of 3D point clouds. Its central idea is to first extract parts from an object and then classify the object from that part information. ECCV'22 https://arxiv.org/abs/2111.14673 3D Compositional Zero-shot Learning with DeCompositional Consensus Parts represent a basic unit of geometric and semantic similarity across different objects. We argue that part knowledge should be co.. 2023. 11. 21.

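[Paper Review] Learning to Prompt for Vision-Language Models This paper points out the limitations of CLIP, which is currently attracting a lot of attention, and proposes a method for making CLIP prompts learnable. IJCV'22 https://link.springer.com/article/10.1007/s11263-022-01653-1 Abstract The authors point out the practical problems of prompt engineering. CLIP's prompt "a photo of [CLASS]" alone does not take the image domain into account at all, and hand-crafting a prompt that does account for the image domain is practically impossible. The following example shows the problem with prompts plainly. In the figure above, performance changes substantially depending on whether the article "a" is present. As this shows, for every image a suitable prom.. 2023. 11. 20.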

[Paper Review] PLA: Language-Driven Open-Vocabulary 3D Scene Understanding This paper applies open-vocabulary recognition to 3D segmentation. It focuses on CLIP's text encoder. I first read it with no background knowledge at all, could not understand it, and moved on. CVPR'23 https://arxiv.org/abs/2211.16312 PLA: Language-Driven Open-Vocabulary 3D Scene Understanding Open-vocabulary scene understanding aims to localize and recognize unseen categories beyond the annotated label space. The recent breakthrough of 2D open-vocabulary perce.. 2023. 11. 7.

[Paper Review] Decomposed Soft Prompt Guided Fusion Enhancing for Compositional Zero-Shot Learning This paper uses soft prompts, feature decomposition, and feature fusion. CVPR'23 https://openaccess.thecvf.com/content/CVPR2023/html/Lu_Decomposed_Soft_Prompt_Guided_Fusion_Enhancing_for_Compositional_Zero-Shot_Learning_CVPR_2023_paper.html CVPR 2023 Open Access Repository Decomposed Soft Prompt Guided Fusion Enhancing for Compositional Zero-Shot Learning Xiaocheng Lu, Song Guo, Ziming Liu, Jingcai Guo; Proce.. 2023. 10. 31.

[Paper Review] Towards Large-scale 3D Representation Learning with Multi-dataset Point Prompt Training This paper proposes a prompt tuning approach as a way to use multiple datasets when training 3D segmentation. (I recently entered two competitions back to back, so I have not been able to read papers for quite a while..) arXiv'23 https://arxiv.org/abs/2308.09718 Towards Large-scale 3D Representation Learning with Multi-dataset Point Prompt Training The rapid advancement of deep learning models often attributes to their ability to leverage massive training data. In contrast, such privilege has not ye.. 2023. 10. 16.

