The multi-volume set of LNCS books with volume numbers 15059 up to 15147 constitutes the refereed proceedings of the 18th European Conference on Computer Vision, ECCV 2024, held in Milan, Italy, during September 29-October 4, 2024. The 2387 papers presented in these proceedings were carefully reviewed and selected from a total of 8585 submissions. They deal with topics such as computer vision; machine learning; deep neural networks; reinforcement learning; object recognition; image classification; image processing; object detection; semantic segmentation; human pose estimation; 3d…mehr
The multi-volume set of LNCS books with volume numbers 15059 up to 15147 constitutes the refereed proceedings of the 18th European Conference on Computer Vision, ECCV 2024, held in Milan, Italy, during September 29-October 4, 2024.
The 2387 papers presented in these proceedings were carefully reviewed and selected from a total of 8585 submissions. They deal with topics such as computer vision; machine learning; deep neural networks; reinforcement learning; object recognition; image classification; image processing; object detection; semantic segmentation; human pose estimation; 3d reconstruction; stereo vision; computational photography; neural networks; image coding; image reconstruction; motion estimation.
Generating Physically Realistic and Directable Human Motions from Multi-Modal Inputs.- CoTracker: It is Better to Track Together.- SPHINX: A Mixer of Weights, Visual Embeddings and Image Scales for Multi-modal Large Language Models.- PathMMU: A Massive Multimodal Expert-Level Benchmark for Understanding and Reasoning in Pathology.- Improving Adversarial Transferability via Model Alignment.- RealGen: Retrieval Augmented Generation for Controllable Traffic Scenarios.- ADen: Adaptive Density Representations for Sparse-view Camera Pose Estimation.- Embodied Understanding of Driving Scenarios.- Learning to Drive via Asymmetric Self-Play.- OpenIns3D: Snap and Lookup for 3D Open-vocabulary Instance Segmentation.- ViLA: Efficient Video-Language Alignment for Video Question Answering.- Factorizing Text-to-Video Generation by Explicit Image Conditioning.- MobileDiffusion: Instant Text-to-Image Generation on Mobile Devices.- Open-Set Biometrics: Beyond Good Closed-Set Models.- UNIT: Backdoor Mitigation via Automated Neural Distribution Tightening.- Which Model Generated This Image? A Model-Agnostic Approach for Origin Attribution.- Osmosis: RGBD Diffusion Prior for Underwater Image Restoration.- Towards Adaptive Pseudo-label Learning for Semi-Supervised Temporal Action Localization.- Computing the Lipschitz constant needed for fast scene recovery from CASSI measurements.- DatasetNeRF: Efficient 3D-aware Data Factory with Generative Radiance Fields.- Flowed Time of Flight Radiance Fields.- 3D-GOI: 3D GAN Omni-Inversion for Multifaceted and Multi-object Editing.- Fast Registration of Photorealistic Avatars for VR Facial Animation.- CoPT: Unsupervised Domain Adaptive Segmentation using Domain-Agnostic Text Embeddings.- HiFi-Score: Fine-grained Image Description Evaluation with Hierarchical Parsing Graphs.- Image-to-Lidar Relational Distillation for Autonomous Driving Data.- Thinking Outside the BBox: Unconstrained Generative Object Compositing.
Generating Physically Realistic and Directable Human Motions from Multi-Modal Inputs.- CoTracker: It is Better to Track Together.- SPHINX: A Mixer of Weights, Visual Embeddings and Image Scales for Multi-modal Large Language Models.- PathMMU: A Massive Multimodal Expert-Level Benchmark for Understanding and Reasoning in Pathology.- Improving Adversarial Transferability via Model Alignment.- RealGen: Retrieval Augmented Generation for Controllable Traffic Scenarios.- ADen: Adaptive Density Representations for Sparse-view Camera Pose Estimation.- Embodied Understanding of Driving Scenarios.- Learning to Drive via Asymmetric Self-Play.- OpenIns3D: Snap and Lookup for 3D Open-vocabulary Instance Segmentation.- ViLA: Efficient Video-Language Alignment for Video Question Answering.- Factorizing Text-to-Video Generation by Explicit Image Conditioning.- MobileDiffusion: Instant Text-to-Image Generation on Mobile Devices.- Open-Set Biometrics: Beyond Good Closed-Set Models.- UNIT: Backdoor Mitigation via Automated Neural Distribution Tightening.- Which Model Generated This Image? A Model-Agnostic Approach for Origin Attribution.- Osmosis: RGBD Diffusion Prior for Underwater Image Restoration.- Towards Adaptive Pseudo-label Learning for Semi-Supervised Temporal Action Localization.- Computing the Lipschitz constant needed for fast scene recovery from CASSI measurements.- DatasetNeRF: Efficient 3D-aware Data Factory with Generative Radiance Fields.- Flowed Time of Flight Radiance Fields.- 3D-GOI: 3D GAN Omni-Inversion for Multifaceted and Multi-object Editing.- Fast Registration of Photorealistic Avatars for VR Facial Animation.- CoPT: Unsupervised Domain Adaptive Segmentation using Domain-Agnostic Text Embeddings.- HiFi-Score: Fine-grained Image Description Evaluation with Hierarchical Parsing Graphs.- Image-to-Lidar Relational Distillation for Autonomous Driving Data.- Thinking Outside the BBox: Unconstrained Generative Object Compositing.
Es gelten unsere Allgemeinen Geschäftsbedingungen: www.buecher.de/agb
Impressum
www.buecher.de ist ein Internetauftritt der buecher.de internetstores GmbH
Geschäftsführung: Monica Sawhney | Roland Kölbl | Günter Hilger
Sitz der Gesellschaft: Batheyer Straße 115 - 117, 58099 Hagen
Postanschrift: Bürgermeister-Wegele-Str. 12, 86167 Augsburg
Amtsgericht Hagen HRB 13257
Steuernummer: 321/5800/1497
USt-IdNr: DE450055826