- Multi-modal Contrastive Learning with Negative Sampling Calibration for Phenotypic Drug Discovery
- TO-LF: A Texture and Occlusion-Oriented Benchmark Dataset for Light Field Disparity Estimation
- SparseAlign: A Fully Sparse Framework for Cooperative Object Detection
- Hierarchical Flow Diffusion for Efficient Frame Interpolation
- SEAL: SEmantic Attention Learning for Long Video Representation
- Deterministic-to-Stochastic Diverse Latent Feature Mapping for Human Motion Synthesis
- ForestLPR: LiDAR Place Recognition in Forests Attentioning Multiple BEV Density Images
- Free360: Layered Gaussian Splatting for Unbounded 360-Degree View Synthesis from Extremely Sparse and Unposed Views
- Model Predictive Control for Reliable and Efficient Path Tracking in Autonomous Vehicles
- Temporal Separation with Entropy Regularization for Knowledge Distillation in Spiking Neural Networks
- Efficient Diffusion as Low Light Enhancer
- Prior-free 3D Object Tracking
- Seeking Consistent Flat Minima for Better Domain Generalization via Refining Loss Landscapes
- AdMiT: Adaptive Multi-Source Tuning in Dynamic Environments
- Design2GarmentCode: Turning Design Concepts to Tangible Garments Through Program Synthesis
- Category-Agnostic Neural Object Rigging
- Hardware implementation of PDS-PWM for Five-Level Active Neutral Point Clamped Inverter using LAUNCHXL F28379D
- Containerized Deployment of Secure LLM Workflows in Multi-Cloud Infrastructures
- Towards Realistic Example-based Modeling via 3D Gaussian Stitching
- Enhanced Cascaded Object Detection Network Based on Indirect Self-Attention Mechanism
- STINR: Deciphering Spatial Transcriptomics via Implicit Neural Representation
- Cost-Efficient Fall Risk Assessment with Attention Augmented Vision Machine Learning on Sit-To-Stand Test Videos
- Unveiling the Ignorance of MLLMs: Seeing Clearly, Answering Incorrectly
- A Geometric Approach for Cyber-Attack Detection in DC Microgrids
- LumiNet: Latent Intrinsics Meets Diffusion Models for Indoor Scene Relighting
- AR-Diffusion: Asynchronous Video Generation with Auto-Regressive Diffusion
- GoLF-NRT: Integrating Global Context and Local Geometry for Few-Shot View Synthesis *
- Multi-view Hand Reconstruction with a Point-Embedded Transformer
- 2DMamba: Efficient State Space Model for Image Representation with Applications on Giga-Pixel Whole Slide Image Classification
- Camera-Based Human Heart Rate Measurement Method
- Assessing Parallel and Distributed Computing Knowledge Through a Card Game
- A Privacy-Preserving Power Grid Data Aggregation Scheme Based on Blockchain and Homomorphic Encryption
- Is GAN Necessary for Mel-Spectrogram-based Neural Vocoder?
- Imperfect Recognition: A Study of OCR Limitations in the Context of Scientific Documents
- The Devil is in Temporal Token: High Quality Video Reasoning Segmentation
- GEAL: Generalizable 3D Affordance Learning with Cross-Modal Consistency
- Scaling Vision Pre-Training to 4K Resolution
- BOLT: Boost Large Vision-Language Model Without Training for Long-Form Video Understanding
- Lessons and Insights from a Unifying Study of Parameter-Efficient Fine-Tuning (PEFT) in Visual Recognition
- Blood Flow Speed Estimation with Optical Coherence Tomography Angiography Images
- GROVE: A Generalized Reward for Learning Open-Vocabulary Physical Skill
- DIO: Decomposable Implicit 4D Occupancy-Flow World Model
- Efficient Video Face Enhancement with Enhanced Spatial-Temporal Consistency
- Large-scale Multi-view Tensor Clustering with Implicit Linear Kernels
- OmniGuard: Hybrid Manipulation Localization via Augmented Versatile Deep Image Watermarking
- Multi-Resolution Pathology-Language Pre-training Model with Text-Guided Visual Representation
- ManiVideo: Generating Hand-Object Manipulation Video with Dexterous and Generalizable Grasping
- High Availability Design for a Container Cloud Platform Monitoring and Management Module
- HeMoRa: Unsupervised Heuristic Consensus Sampling for Robust Point Cloud Registration
- Leveraging Large Language Models for Cultural Heritage Digitization: A Textual Analysis of Historical Buildings
- Attend to Not Attended: Structure-then-Detail Token Merging for Post-training DiT Acceleration
- Self-Supervised Cross-View Correspondence with Predictive Cycle Consistency
- A Real-Time Monitoring System for User Attentivity on e-learning platforms
- Test-Time Fine-Tuning of Image Compression Models for Multi-Task Adaptability
- Beyond Generation: A Diffusion-based Low-level Feature Extractor for Detecting AI-generated Images
- Synergizing Motion and Appearance: Multi-Scale Compensatory Codebooks for Talking Head Video Generation
- Construction of a Machine Learning-Based Model for Predicting the Mechanical Properties of $\text{Fe}-\mathrm{C}-\text{Mn}-\text{Al}$ System Steels
- DiffusionRenderer: Neural Inverse and Forward Rendering with Video Diffusion Models
- Stable and Efficient I/F Control for Dual Parallel Surface-Mounted Permanent Magnet Synchronous Motor Drives fed by a Single Inverter
- Unified Framework for Evaluating Numerical Integration Methods through Characteristic Root Distortion in Linear Ordinary Differential Equations
- Industrial Machine Data Generation and Artificial Optimisation for Blow Molding Extrusion Machines
- A Multi-mode Adaptive Switching Vibration Control Strategy For Marine Turbines
- Unveiling Differences in Generative Models: A Scalable Differential Clustering Approach
- Fault Detection and Isolation of a Speed Sensorless Quadruplex BLDC Motor for Aerospace Applications
- Go-with-the-Flow: Motion-Controllable Video Diffusion Models Using Real-Time Warped Noise
- Hyperspectral Methane Plume Segmentation Through Foundation Computer Vision Models
- Heuristic Optimization Strategies for Reliable Amplifier Reconfiguration in Autonomous Optical Networks with Field-trial Validation
- A T A: Adaptive Transformation Agent for Text-Guided Subject-Position Variable Background Inpainting
- An Adaptability Analysis Method Based on the Quantification of Terrain Matching Performance
- Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient
- Hypergraph Vision Transformers: Images are More than Nodes, More than Edges
- Efficient Motion-Aware Video MLLM
- Predictive Current Control of a Three-Level Multi-Modular NPC Converter With Mutual Error Compensation and Fault Tolerance
- Multi-View Pose-Agnostic Change Localization with Zero Labels
- Domain-Specific Multi-Document Political News Summarization Using BART and ACT-GAN
- Provoking Multi-modal Few-Shot LVLM via Exploration-Exploitation In-Context Learning
- HLS-Eval: A Benchmark and Framework for Evaluating LLMs on High-Level Synthesis Design Tasks
- DTGBrepGen: A Novel B-rep Generative Model through Decoupling Topology and Geometry
- Research on Smooth Soft-Start Control of Two-Switch Buck-Boost
- Shape Abstraction via Marching Differentiable Support Functions
- A Non-Isolated Hybrid Switched-Capacitor Network Based High-Gain Quadratic DC-DC Boost Converter
- Learning Heterogeneous Tissues with Mixture of Experts for Gigapixel Whole Slide Images
- Self-Learning Hyperspectral and Multispectral Image Fusion via Adaptive Residual Guided Subspace Diffusion Model
- BlenderGym: Benchmarking Foundational Model Systems for Graphics Editing
- PSBD: Prediction Shift Uncertainty Unlocks Backdoor Detection
- NN-Former: Rethinking Graph Structure in Neural Architecture Representation
- Joint Optimal Allocation of Radio and Computational Resources Aiming at Minimizing Global Average Task Offloading Age for Long-Term Multi-Cell MEC Systems
- FedCDC: Efficient Similarity Identification in Clustered Federated Learning via Community Detection on Non-IID Data
- Adaptive Policy-Driven Network Intelligence for Edge-to-Cloud Continuum
- Context Based Analysis of the Impact of Humanism Drivers in Contemporary Management Using AHP
- Research on Dynamic Navigation and Obstacle Avoidance of Mobile Robots in Open Office
- IoT-Enabled Smart Robot for Efficient Banana Harvesting and Quality Assessment
- Rumor Detection Based on Supervised Multiprototype Contrastive Learning
- Spectrum Efficiency Optimization of Hybrid Precoding Based on Adam-PGD Algorithm
- TKG-DM: Training-free Chroma Key Content Generation Diffusion Model
- Simulation of Overvoltage in Photovoltaic Energy Storage System Caused by Lightning Strike
- Research on Heat Transfer Coefficient of Rotor Oil Jet Impingement Cooling of Permanent Magnet Synchronous Motor in Electric Vehicles
- Design, Selection and Implementation of Conditioning Circuits for Digital Control Applications
- EntitySAM: Segment Everything in Video
- HiVeGen – Hierarchical LLM-based Verilog Generation for Scalable Chip Design