- BWFormer: Building Wireframe Reconstruction from Airborne LiDAR Point Cloud with Transformer
- FruitNinja: 3D Object Interior Texture Generation with Gaussian Splatting
- Rethinking Epistemic and Aleatoric Uncertainty for Active Open-Set Annotation: An Energy-Based Approach
- Differentiable Predictive Control for Power Electronic Systems
- Establishing a Bypass Route for Surge Current Using SPD and Sit Part 2
- ORIDa: Object-centric Real-world Image Composition Dataset
- S2Gaussian: Sparse-View Super-Resolution 3D Gaussian Splatting
- VoCo-LLaMA: Towards Vision Compression with Large Language Models
- Design and Analysis of Phase Locked Loop VLSI IC for Carrier Synchronization in Wireless Communications using 45nm Technology
- ReCapture: Generative Video Camera Controls for User-Provided Videos using Masked Video Fine-Tuning
- SimAvatar: Simulation-Ready Avatars with Layered Hair and Clothing
- Limit Cycle-Based Artificial Fields for Obstacle Avoidance in Robot Path Planning
- An Energy-Aware Approach to Stream Processing and Collaborative Offloading in Internet of Vehicles
- Active Gate Driver for SiC MOSFET Based on Voltage Sensing
- Silent Branding Attack: Trigger-free Data Poisoning Attack on Text-to-Image Diffusion Models
- Efficient Attention in Partially Relevant Video Retrieval: A Benchmarking Study on Accuracy-Efficiency Trade-Offs
- Towards Sustainable Machine Learning with Serverless at the Edge
- Transfer Your Perspective: Controllable 3D Generation from Any Viewpoint in a Driving Scene
- FlowRAM: Grounding Flow Matching Policy with Region-Aware Mamba Framework for Robotic Manipulation
- Object-Centric Prompt-Driven Vision-Language-Action Model for Robotic Manipulation
- Leveraging Perturbation Robustness to Enhance Out-of-Distribution Detection
- STPro: Spatial and Temporal Progressive Learning for Weakly Supervised Spatio-Temporal Grounding
- PyTorchGeoNodes: Enabling Differentiable Shape Programs for 3D Shape Reconstruction
- Efficient Long Video Tokenization via Coordinate-based Patch Reconstruction
- A Compact Deep Learning Architecture for Multi-Label Classification of Plant Diseases
- SASep: Saliency-Aware Structured Separation of Geometry and Feature for Open Set Learning on Point Clouds
- Can Text-to-Video Generation help Video-Language Alignment?
- MTMLD-AWSR: A Novel Multi-Teacher Multi-Level Distillation Approach for Class Incremental Learning in Edge-Cloud Systems
- RivuletMLP: An MLP-based Architecture for Efficient Compressed Video Quality Enhancement
- Adapting Text-to-Image Generation with Feature Difference Instruction for Generic Image Restoration
- RASP: Revisiting 3D Anamorphic Art for Shadow-Guided Packing of Irregular Objects
- NitroFusion: High-Fidelity Single-Step Diffusion through Dynamic Adversarial Training
- Concept Replacer: Replacing Sensitive Concepts in Diffusion Models via Precision Localization
- OffsetOPT: Explicit Surface Reconstruction without Normals
- SwiftEdit: Lightning Fast Text-Guided Image Editing via One-Step Diffusion
- Design Aspects of Inverter for 9-ϕ Pole Phase Modulated Induction Motor Drives
- Robust Lipreading Through a Dual-Stream Adaptive Framework with Spatiotemporal Landmark
- Finite Control Set Model Predictive Control of Grid-Connected Inverters with Extended Prediction Horizon
- Gaussian Splashing: Unified Particles for Versatile Motion Synthesis and Rendering
- IoT Enabled Real-Time Vehicle Tracking and Alert System for Educational Transport Service
- Detecting Open World Objects via Partial Attribute Assignment
- EasyHOI: Unleashing the Power of Large Models for Reconstructing Hand-Object Interactions in the Wild
- Variance-Integrated Policy Optimization: A Maximum Entropy Approach for Localization in Energy Interconnection Systems
- Joint LDPC Code and Spreading Optimization for Multi-user Communications
- Evolving High-Quality Rendering and Reconstruction in a Unified Framework with Contribution-Adaptive Regularization
- A Novel Variable-Level ANPC Inverter with Capacitor Voltage Reconfiguration Method for 2 kV Photovoltaic Applications
- Enhancing Agricultural Decision Making with Machine Learning
- Reward Fine-Tuning Two-Step Diffusion Models via Learning Differentiable Latent-Space Surrogate Reward
- Double-Branched and Multi-Magnetic Directions Feature Fusion Network (DB&MDF2-Net) for the Accurate Reconstruction of Magnetic Particle Imaging
- Complex water surface image rain removal lightweight network based on convolution attention
- Alias-Free Latent Diffusion Models: Improving Fractional Shift Equivariance of Diffusion Latent Space
- Toward Greener AI-Based Smart Services: An Original Framework for Identifying Energy Efficiency Measurement Parameters
- AvianGuard-Signal Disruption Drone
- Integrated Sensing and Backscatter Communication with Movable Antennas: State-of-the-Art Survey and A Novel Inverse Scattering Framework
- Generalizable Object Keypoint Localization from Generative Priors
- HumanMM: Global Human Motion Recovery from Multi-shot Videos
- Exploring QUIC Dynamics: A Large-Scale Dataset for Encrypted Traffic Analysis
- Material Anything: Generating Materials for Any 3D Object via Diffusion
- Research on Adaptive Calibration Technique for Noncontact Voltage Sensors Based on Parameter-Independence Architecture
- Robust Remote Heart Rate Estimation Network Based on Spatial-Temporal-Channel Learning From Facial Videos
- Research on PCB Defect Detection Based on Yolo Algorithm
- Imagine and Seek: Improving Composed Image Retrieval with an Imagined Proxy
- DEEP: Edge-Based Dataflow Processing with Hybrid Docker Hub and Regional Registries
- Video Depth without Video Models
- QuartDepth: Post-Training Quantization for Real-Time Depth Estimation on the Edge
- Conformity Assessment of a Multi-Sensor Device for Indoor Environmental Quality Monitoring
- Tension Resolution in Sustainability Working Groups with Behavioral Mimicry
- A Bi-Level Multi-Objective System for Renewable Energy Self-Consumption: A Resident-Aware Approach to Leveraging Energy Flexibility
- Beyond Sight: Towards Cognitive Alignment in LVLM via Enriched Visual Knowledge
- MoFlow: One-Step Flow Matching for Human Trajectory Forecasting via Implicit Maximum Likelihood Estimation based Distillation
- Low-complexity look-up table-based frequency-domain trigonometric nonlinear equalizer for underwater wireless optical communications
- Masked Point-Entity Contrast for Open-Vocabulary 3D Scene Understanding
- Revisiting Audio-Visual Segmentation with Vision-Centric Transformer
- Rethinking End-to-End 2D to 3D Scene Segmentation in Gaussian Splatting
- Taming Teacher Forcing for Masked Autoregressive Video Generation
- CASP: Compression of Large Multimodal Models Based on Attention Sparsity
- Experimental Small-signal Characterization of Frequency Modulated Converters
- Air Quality Predictive Analysis using Empirical Mode Decomposition with Adaptive Noise-Bee Colony Optimization
- AnyEdit: Mastering Unified High-Quality Image Editing for Any Idea
- Improving Personalized Search with Regularized Low-Rank Parameter Updates
- Improved Activity Recognition Through Fusion of Earable Pairs
- Towards Understanding How Knowledge Evolves in Large Vision-Language Models
- A New Systematic Inverse Design Method of Pneumatic Soft Actuator for Precise Motion
- Prompt-CAM: Making Vision Transformers Interpretable for Fine-Grained Analysis
- Understanding Multi-Task Activities from Single-Task Videos
- Dual-Satellite Beam Illuminating Strategy for Load-Balanced LEO Beam Hopping Systems
- Vision Transformer based approach for accurately detecting cervical cancer
- Contrastive Learning-Based Agent Modeling for Deep Reinforcement Learning
- Design and Implementation of Static Wireless Power Transmission System in Electric Vehicles
- Optimal Transformer Turn Design of LLC Resonant Converters for High Efficient Operation
- LibraryX-ASIC: A First Look
- A Novel Formation Control Strategy for USVs With Improved DDPG: Simulation and Field Test
- TLPipe: An Efficient Two-Level Optimization Pipeline Model Parallelism for Large Model Training
- STDD: Spatio-Temporal Dual Diffusion for Video Generation
- EgoPressure: A Dataset for Hand Pressure and Pose Estimation in Egocentric Vision
- Towards Universal AI-Generated Image Detection by Variational Information Bottleneck Network
- Optimizing Traffic Anomaly Detection with Bertcapsule Model
- Investigation into the Reverse Recovery Dynamics of High-Voltage Fast Recovery Diodes
- RNG: Relightable Neural Gaussians
- Mind the Time: Temporally-Controlled Multi-Event Video Generation