- Enhancing 3D Gaze Estimation in the Wild using Weak Supervision with Gaze Following Labels
- POT: Prototypical Optimal Transport for Weakly Supervised Semantic Segmentation
- Forensics-Bench: A Comprehensive Forgery Detection Benchmark Suite for Large Vision Language Models
- Semantic-guided Cross-Modal Prompt Learning for Skeleton-based Zero-shot Action Recognition
- Boosting Point-Supervised Temporal Action Localization through Integrating Query Reformation and Optimal Transport
- Decouple-Then-Merge: Finetune Diffusion Models as Multi-Task Learning
- SPIPE: Differentiable SPICE-Level Co-Simulation Program for Integrated Photonics and Electronics
- Rumor Detection Based on Supervised Multiprototype Contrastive Learning
- SegEarth-OV: Towards Training-Free Open-Vocabulary Segmentation for Remote Sensing Images
- Rethinking Training for De-biasing Text-to-Image Generation: Unlocking the Potential of Stable Diffusion
- An Offline Loss Minimization Framework in Induction Motor-Based Traction Drives Using Improved Deadbeat Control Method
- Vision-Language Models Do Not Understand Negation
- BlueLM-V-3B: Algorithm and System Co-Design for Multimodal Large Language Models on Mobile Devices
- Pairbot: Enhancing Computational Capabilities by Pairing of Autonomous Mobile Robots
- Adaptive PSO with Orientation Awareness for Robust Object Localization and Bounding Box Refinement
- Towards Efficient Allocation of Tasks in DAG-Based Workflows Across Federated Fog Systems
- Microalgae Density Measurement Using Quantum-Dot-Integrated Sensing and Communication System
- GLC++: Source-Free Universal Domain Adaptation through Global-Local Clustering and Contrastive Affinity Learning
- Research on Multi-Class Component Detection of Transmission Lines Based on Improved YOLO11
- Hyperbolic Uncertainty-Aware Few-Shot Incremental Point Cloud Segmentation
- Research on the Prediction System of State Grid Corporation of China's Bidding Business Based on Ensemble Learning and Reflex
- DRAWER: Digital Reconstruction and Articulation With Environment Realism
- RATE: A Retrieval-Augmented Transformer for Regional Earthquake Early Warning
- Machine Learning Model Coupling PCA and PLSR for Predicting Moisture Content in Different Soil Textures Using Near-Infrared Spectroscopy
- 3DGUT: Enabling Distorted Cameras and Secondary Rays in Gaussian Splatting
- Approach for Long-Time Coherent Integration for High-Speed Maneuvering Target Detection with Nonuniform Sampling
- Finite Time Sliding Mode Control for Chattering Reduction in Unmanned Aerial Vehicles with Dynamic Payloads
- An ERAN-Based Dynamic Graph Neural Network for CSI Prediction in Massive MIMO Systems
- Framework to Support Digital and Sustainable Manufacturing: From Research Environments to Industrial Applications
- Current Balancer Integrated with Impedance Matching Circuit for Megahertz High-power WPT Systems
- Enhanced Network Traffic Monitoring and Anomaly Detection System Utilizing Convolutional Neural Networks
- A Three-Stage Feature Selection Method Based on Particle Swarm Optimization with Dynamic Granularity Clustering
- Improved Fabric Defect Detection Algorithm of YOLOv11n
- PBR-NeRF: Inverse Rendering with Physics-Based Neural Fields
- Unleashing In-context Learning of Autoregressive Models for Few-shot Image Manipulation
- A Hybrid ResNet-BiLSTM Network Approach for Airborne Fire-Control Radar Operational Mode Recognition
- Timescape Museum in Virtual Reality With Blender and Unity
- Experimental Characterization of a Double-Spiral Resistive Temperature Microsensor
- A Back-EMF Based Sensorless Control for a Dual Parallel Surface-mounted Permanent Magnet Synchronous Motor Drives Fed by a Single Inverter
- 3D-MVP: 3D Multiview Pretraining for Manipulation
- A Cascaded 2-Level Z-Source Dual Inverter with Single Source and Reduced Battery Voltage
- Re-examining MPPT Control Dynamics through Limit Cycles in Solar PV Converters
- ViCaS: A Dataset for Combining Holistic and Pixel-level Video Understanding using Captions with Grounded Segmentation
- Ultrasonic waves detection above 1000 degree Celsius with Fiber Bragg Grating sensors
- MoireComm: Secure Screen-camera Communication Based on Moire Cryptography
- A Universal Scale-Adaptive Deformable Transformer for Image Restoration across Diverse Artifacts
- Curriculum Coarse-to-Fine Selection for High-IPC Dataset Distillation
- Non-Cooperative Target Radar RCS Data Generation Based on Transfer Learning
- Improving Diffusion Inverse Problem Solving with Decoupled Noise Annealing
- Believing is Seeing: Unobserved Object Detection using Generative Models
- Design and Implementation of Pre-Litigation Mediation Platform Under Web Technology
- Security Control of SMMS Teleoperation Systems Based on DTOD Scheduling Protocol
- PointSR: Self-Regularized Point Supervision for Drone-View Object Detection
- Research on Financial Statement Anomaly Risk Prediction Based on Generative Adversarial Networks
- Optimum Distance for In-Flight UAV-to-UAV Wireless Charging
- ProbeSDF: Light Field Probes For Neural Surface Reconstruction
- Fish-Vista: A Multi-Purpose Dataset for Understanding & Identification of Traits from Images
- Scaling Graph Neural Networks for Particle Track Reconstruction
- Harnessing Global-Local Collaborative Adversarial Perturbation for Anti-Customization
- Multi-modal Contrastive Learning with Negative Sampling Calibration for Phenotypic Drug Discovery
- Sketchtopia: A Dataset and Foundational Agents for Benchmarking Asynchronous Multimodal Communication with Iconic Feedback
- High-Current Performance Analysis of a Non-Isolated High Step-Down DC-DC Converter and Miniaturization of the Gate Drive Circuit
- TO-LF: A Texture and Occlusion-Oriented Benchmark Dataset for Light Field Disparity Estimation
- DexGrasp Anything: Towards Universal Robotic Dexterous Grasping with Physics Awareness
- Efficient Diffusion as Low Light Enhancer
- Fronthaul Compression and Beamforming Optimization for Secure Cell-free ISAC Systems
- Heterogeneous Memory Pool Tuning
- A Spatial TDMA-Based Medium Access Control Protocol for Underwater Acoustic Backscatter Networks
- Prior-free 3D Object Tracking
- Seeking Consistent Flat Minima for Better Domain Generalization via Refining Loss Landscapes
- Category-Agnostic Neural Object Rigging
- SPC-GS: Gaussian Splatting with Semantic-Prompt Consistency for Indoor Open-World Free-view Synthesis from Sparse Inputs
- CADCrafter: Generating Computer-Aided Design Models from Unconstrained Images
- nnWNet: Rethinking the Use of Transformers in Biomedical Image Segmentation and Calling for a Unified Evaluation Benchmark
- LIRM: Large Inverse Rendering Model for Progressive Reconstruction of Shape, Materials and View-dependent Radiance Fields
- OmniManip: Towards General Robotic Manipulation via Object-Centric Interaction Primitives as Spatial Constraints
- Towards Autonomous Micromobility through Scalable Urban Simulation
- IDEA-Bench: How Far are Generative Models from Professional Designing?
- Learning to Sample Effective and Diverse Prompts for Text-to-Image Generation
- Cost-Efficient Fall Risk Assessment with Attention Augmented Vision Machine Learning on Sit-To-Stand Test Videos
- OpenROAD Agent: An Intelligent Self-Correcting Script Generator for OpenROAD
- LAL: Enhancing 3D Human Motion Prediction with Latency-aware Auxiliary Learning
- Video-Bench: Human-Aligned Video Generation Benchmark
- Exploring Contextual Attribute Density in Referring Expression Counting
- Development and Implementation of a Wide-Range Output Voltage Power Factor Correction
- Goal-oriented Control Strategies for Soft Growing Robots
- Segment Any Motion in Videos
- Assessing Parallel and Distributed Computing Knowledge Through a Card Game
- Is GAN Necessary for Mel-Spectrogram-based Neural Vocoder?
- Challenges and Strategic Choices in Business Model Evolution of European Digital Health Startups: A Qualitative Study
- Millimeter-Wave Scattering Model Based on RCS Distribution: A Simulation and Verification Study
- Scaling Vision Pre-Training to 4K Resolution
- Satellite to GroundScape - Large-scale Consistent Ground View Generation from Satellite Views
- SemAlign3D: Semantic Correspondence between RGB-Images through Aligning 3D Object-Class Representations
- A Study on improvement of single phase PLL algorithm stability and accuracy
- Blood Flow Speed Estimation with Optical Coherence Tomography Angiography Images
- T3-ANFIS: Type-3 Adaptive Neuro-Fuzzy Inference System With a Noniterative Learning Algorithm
- DIO: Decomposable Implicit 4D Occupancy-Flow World Model
- Large-scale Multi-view Tensor Clustering with Implicit Linear Kernels
- Research on SOC Balancing Control Strategies for Multiple Energy Storages in DC Microgrids