- RSAR: Restricted State Angle Resolver and Rotated SAR Benchmark
- ShapeWords: Guiding Text-to-Image Synthesis with 3D Shape-Aware Prompts
- Cancer Survival Prognosis From Whole Slide Images Using Hopfield Network
- Three-view Focal Length Recovery From Homographies
- Towards Natural Language-Based Document Image Retrieval: New Dataset and Benchmark
- SEEN-DA: SEmantic ENtropy guided Domain-aware Attention for Domain Adaptive Object Detection
- Design Optimization of Synchronous Reluctance Motor for Electric Two Wheeler Application
- MaDCoW: Marginal Distortion Correction for Wide-Angle Photography with Arbitrary Objects
- SOAP: Vision-Centric 3D Semantic Scene Completion with Scene-Adaptive Decoder and Occluded Region-Aware View Projection
- An Intelligent Prediction Method for Safety Margins of Flexible Thermal Power Units Based on Pipeline Creep Life Damage
- Enhancing Scene Coordinate Regression with Efficient Keypoint Detection and Sequential Information
- RoboGround: Robotic Manipulation with Grounded Vision-Language Priors
- From Faces to Voices: Learning Hierarchical Representations for High-quality Video-to-Speech
- Rethinking Vision-Language Model in Face Forensics: Multi-Modal Interpretable Forged Face Detector
- Rashomon Sets for Prototypical-Part Networks: Editing Interpretable Models in Real-Time
- VTON-HandFit: Virtual Try-on for Arbitrary Hand Pose Guided by Hand Priors Embedding
- Bridging Viewpoint Gaps: Geometric Reasoning Boosts Semantic Correspondence
- ShowHowTo: Generating Scene-Conditioned Step-by-Step Visual Instructions
- Interleaved-Modal Chain-of-Thought
- ARKit LabelMaker: A New Scale for Indoor 3D Scene Understanding
- Shading Meets Motion: Self-supervised Indoor 3D Reconstruction Via Simultaneous Shape-from-Shading and Structure-from-Motion
- Methodology for Business Value Analysis of Innovative IT in a Business Sector. The Case of the Material Supply Chain
- The Application Progress of Power Batteries in New Energy Ships
- COFFEE: Mitigating Hallucination in LVLMs via COllaborative Filtering for Enhanced Eyes
- LineArt: A Knowledge-guided Training-free High-quality Appearance Transfer for Design Drawing with Diffusion Model
- Dual Prompting Image Restoration with Diffusion Transformers
- LogiCzsl: Exploring Logic-induced Representation for Compositional Zero-shot Learning
- VILA-M3: Enhancing Vision-Language Models with Medical Expert Knowledge
- Research on the MC/DC Test on Civil Aircraft Software Robust Requirements Based on DO-178C
- Quantization without Tears
- Artificial Intelligence Adoption, Enterprise Capabilities and Performance
- Thin-Shell-SfT: Fine-Grained Monocular Non-rigid 3D Surface Tracking with Neural Deformation Fields
- A mixed-precision quantum-classical algorithm for solving linear systems
- Learning Class Prototypes for Unified Sparse-Supervised 3D Object Detection
- Design and Analysis of a PSFB Current Doubler for VRFB: Impact of Magnetic Components and Snubber Circuit Requirements
- SEC-Prompt: SEmantic Complementary Prompting for Few-Shot Class-Incremental Learning
- Differentially-Fed Harmonic RFID Tag for Multi-Tag Detection
- Object-aware Sound Source Localization via Audio-Visual Scene Understanding
- A Low-Profile Shared-Aperture Antenna Using Electromagnetic Transparent Structure and AMC
- Analytical Study on Fault-Tolerant Control of Five-Phase Induction Motor Drive
- AIpparel: A Multimodal Foundation Model for Digital Garments
- SpatialLLM: A Compound 3D-Informed Design towards Spatially-Intelligent Large Multimodal Models
- Application of AI in Lightning and Thunderstorm Forecasting: A Vision for the Future
- CoSDH: Communication-Efficient Collaborative Perception via Supply-Demand Awareness and Intermediate-Late Hybridization
- Analysing Loss Mechanisms in PSFB Current Doublers for Telecom Tower Applications: Impact of Frequency and Power Level
- A Novel Bidirectional Multiport DC-DC Converter For Hybrid Energy System
- DeformCL: Learning Deformable Centerline Representation for Vessel Extraction in 3D Medical Image
- NoPain: No-box Point Cloud Attack via Optimal Transport Singular Boundary
- Complementary Advantages: Exploiting Cross-Field Frequency Correlation for NIR-Assisted Image Denoising
- Design and Research of a Tea Production Process Interactive Educational Device Based on Arduino
- Scaling up Image Segmentation across Data and Tasks
- SAM2Object: Consolidating View Consistency via SAM2 for Zero-Shot 3D Instance Segmentation
- FALCON: Fairness Learning via Contrastive Attention Approach to Continual Semantic Scene Understanding
- VDocRAG: Retrieval-Augmented Generation over Visually-Rich Documents
- MagicQuill: An Intelligent Interactive Image Editing System
- MonoSplat: Generalizable 3D Gaussian Splatting from Monocular Depth Foundation Models
- DriveGPT4-V2: Harnessing Large Language Model Capabilities for Enhanced Closed-Loop Autonomous Driving
- Feasibility Study and Preliminary Testing of 3D Printing on ASICs for MEMS: A Particulate Sensor Case Study
- Subspace Constraint and Contribution Estimation for Heterogeneous Federated Learning
- Dual Diffusion for Unified Image Generation and Understanding
- Implementing Directive-Based Deferred Execution for Effective Network Aggregation
- SemanticDraw: Towards Real-Time Interactive Content Creation from Image Diffusion Models
- Koala-36M: A Large-Scale Video Dataset Improving Consistency between Fine-Grained Conditions and Video Content
- GIFStream: 4D Gaussian-Based Immersive Video with Feature Stream
- DropGaussian: Structural Regularization for Sparse-view Gaussian Splatting
- A UAV Exploration Planning Method Based on Improved Bio-Inspired Neural Network
- OpenSIEM: A Unified Open Source Security Management Framework
- Knowledge Base Autoencoder Framework: A Novel Approach for Continuous Phase Shift Compression in RIS-Aided Communications
- Explorative Evaluation of Validation Criteria for Validating Needs and Benefits of a Mobility Hub
- Enhancing Manufacturing Training Through VR Simulations
- Progressive Rendering Distillation: Adapting Stable Diffusion for Instant Text-to-Mesh Generation without 3D Data
- C2FNet: Cross-Probabilistic Weak Supervision Learning for High-Resolution Land Cover Enhancement
- MLLM-as-a-Judge for Image Safety without Human Labeling
- MANTA: Diffusion Mamba for Efficient and Effective Stochastic Long-Term Dense Action Anticipation
- Survey App: Rating and Feedback System Application
- Radar Self-Evolution Detection: Two-Stage Knowledge Transfer via Distillation-Fusion Synergy
- FedAWA: Adaptive Optimization of Aggregation Weights in Federated Learning Using Client Vectors
- WeGen: A Unified Model for Interactive Multimodal Generation as We Chat
- Application of a Blockchain-Based Filtering Model to Mitigate Cyber Attacks in a Decentralized Transactive Energy System
- MP-GUI: Modality Perception with MLLMs for GUI Understanding
- Analytical Subdomain Modelling and Analysis of a Single Rotor Induction Assisted IPM Motor for EVs
- HiMoR: Monocular Deformable Gaussian Reconstruction with Hierarchical Motion Representation
- 3D Prior is All You Need: Cross-Task Few-shot 2D Gaze Estimation
- Ev-3DOD: Pushing the Temporal Boundaries of 3D Object Detection with Event Cameras
- ILIAS: Instance-Level Image retrieval At Scale
- SkyMamba: Integrating Transformer and State Space Model for UAV Remote Sensing RGB-D Images Semantic Segmentation
- GEM: A Generalizable Ego-Vision Multimodal World Model for Fine-Grained Ego-Motion, Object Dynamics, and Scene Composition Control
- Research on Torque Optimization of Outer Rotor Permanent Magnet Synchronous Motor Based on Response Surface Methodology
- Multiple Object Tracking as ID Prediction
- Segment Any-Quality Images with Generative Latent Space Enhancement
- Gaussian Splatting for Efficient Satellite Image Photogrammetry
- Instant Adversarial Purification with Adversarial Consistency Distillation
- Distilling Spectral Graph for Object-Context Aware Open-Vocabulary Semantic Segmentation
- ProAPO: Progressively Automatic Prompt Optimization for Visual Classification
- UrbanCAD: Towards Highly Controllable and Photorealistic 3D Vehicles for Urban Scene Simulation
- Enhancing KGCN-Based Recommendation Algorithms via Attention Mechanism Integration
- An Improved Data Fusion Model for Secondary Return Water Temperature of Heating System Employing EKF
- Microfluidic biosensors for biotic and abiotic plant stress monitoring
- Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis
- Parallel Fractal Decomposition Optimization Algorithms on Heterogeneous Architectures