- Open Set Label Shift with Test Time Out-of-Distribution Reference
- Unconditionally Stable Leapfrog Complying Divergence Implicit FDTD Method with Lumped Elements
- DPCT: Efficient High-Resolution Depth Prediction via Cross-Covariance Attention Transformers
- Context-Aware Multimodal Pretraining
- SpectroMotion: Dynamic 3D Reconstruction of Specular Scenes
- MaSS13K: A Matting-level Semantic Segmentation Benchmark
- Adaptive Protein Design Protocols and Middleware
- Sharp-It: A Multi-view to Multi-view Diffusion Model for 3D Synthesis and Manipulation
- Efficient GPU Memory Resource Scheduling Algorithm for Vehicle Detection Tasks in High Concurrent Scenarios
- A Learning Algorithm Based on Similarity Identification and Knowledge Transfer for Dynamic Multi-Objective Optimization
- Frequency-Domain Analysis of Contaminant Effects on Leakage Current and Harmonic Distortion for Transmission Line Diagnostics
- Text Embedding is Not All You Need: Attention Control for Text-to-Image Semantic Alignment with Text Self-Attention Maps
- MoEE: Mixture of Emotion Experts for Audio-Driven Portrait Animation
- Design and Feasibility Study of Transverse-flux Double-sided Linear Induction Motor
- De 2 Gaze: Deformable and Decoupled Representation Learning for 3D Gaze Estimation
- Coeff-Tuning: A Graph Filter Subspace View for Tuning Attention-Based Large Models
- Domain Adaptive Diabetic Retinopathy Grading with Model Absence and Flowing Data
- Video Depth Anything: Consistent Depth Estimation for Super-Long Videos
- Attribute-formed Class-specific Concept Space: Endowing Language Bottleneck Model with Better Interpretability and Scalability
- AutoPresent: Designing Structured Visuals from Scratch
- Insightful Instance Features for 3D Instance Segmentation
- A Novel Kernel-Based Hilbert Space Framework for Predictive Modeling of lncRNA-miRNA-Disease Interaction Networks
- Triple-Band Efficiency Improvement of Smartwatch Antennas by Sharing a Zeroth-Order Resonance Patch
- Conversion Formulas between the WLP Spectrum and the Frequency Spectrum for WLP-FDTD Analysis
- Vehicle Re-Identification in Occluded Scenes Based on Vision Transformer
- Research Overview on Moving Object Detection Methods Based on Video Image Analysis
- Construction of an Ethical Monitoring System for Public Space Design Based on Quantitative Models
- CorrMADA: Improving Robustness of ML-Coupled Intrusion Detection Systems
- A Frequency-Adaptive Differential Mode Active Filter for Mitigating Supraharmonics in the 10 kHz to 80 kHz Range
- Test-Time Domain Generalization via Universe Learning: A Multi-Graph Matching Approach for Medical Image Segmentation
- On the Quality Aspects of Digital Twins
- VisionUnite: a Vision-Language Foundation Model for Ophthalmology Enhanced with Clinical Knowledge
- One-Way Ticket: Time-Independent Unified Encoder for Distilling Text-to-Image Diffusion Models
- Learning Hazing to Dehazing: Towards Realistic Haze Generation for Real-World Image Dehazing
- OM-Koop: Online Memorable Koopman Operator Learning for Marine Robots Steering Dynamics
- Fault Joint Detection and Adaptive Fault-Tolerant Control of Legged Robots Under Joint Partial Failures
- Digital Transformation of Healthcare Professionals Trainings: Towards a Design Framework for Creating Autonomous XR Training Platforms
- A Brain Tumor Image Classification Method Based on Improved MobileViT Model
- Edge Computing for Brain Stroke Detection using Deep Learning Techniques
- Research on Improved YOLOv10n Sign Language Recognition Algorithm
- UniPre3D: Unified Pre-training of 3D Point Cloud Models with Cross-Modal Gaussian Splatting
- HoGS: Unified Near and Far Object Reconstruction via Homogeneous Gaussian Splatting
- Double Pancake Spiral Coil based Wireless Power Transfer System for EV Charging
- Generative Multimodal Pretraining with Discrete Diffusion Timestep Tokens
- Radar Cross Section Statistical Characterization & Detection Probability Calculations of Stealth Aircraft
- BioX-CPath: Biologically-driven Explainable Diagnostics for Multistain IHC Computational Pathology
- Fine-Grained Image-Text Correspondence with Cost Aggregation for Open-Vocabulary Part Segmentation
- Noise Calibration and Spatial-Frequency Interactive Network for STEM Image Enhancement
- Can Reasoning Models Reason about Hardware? An Agentic HLS Perspective
- SmartEraser: Remove Anything from Images using Masked-Region Guidance
- Model Predictive Control of Interleaved DC-DC Boost Converter
- Advanced Stroke Prediction Leveraging MRI Data with an Ensemble Convolutional Neural Network Framework
- Backscatter Measurements and Statistical Models for RF Sensing in Indoor Cluttered Environments
- DehazeMist: Research on Image Dehazing System Based on Improved Dark Channel Prior
- Anchor-Aware Similarity Cohesion in Target Frames Enables Predicting Temporal Moment Boundaries in 2D
- KAM-Net: Kilobyte-Scale Ultra-Lightweight Attention-based Network for Glass Defect Detection with Algorithm/Hardware Co-design
- Secure Audio Processing: Facial Encryption with Speech Transcription and Translation
- FoundationStereo: Zero-Shot Stereo Matching
- Decoupled Motion Expression Video Segmentation
- Learning Audio-guided Video Representation with Gated Attention for Video-Text Retrieval
- Reconfigurable Intelligent Surfaces for ISAC: CRB Analysis and Optimization for Joint Angle and Radial Velocity Estimation
- A RISC-V Coprocessor for Seamless Integration of Stream-Based Accelerators
- Research on the Application of Artificial Intelligence Technology in Beam Resource Allocation for Multi-Beam Satellite Communication Systems
- Video-XL: Extra-Long Vision Language Model for Hour-Scale Video Understanding
- InsightEdit: Towards Better Instruction Following for Image Editing
- A Methodology for Analyzing and Diagnosing Renewable Grid Tie Inverter Designs
- EarthDial: Turning Multi-sensory Earth Observations to Interactive Dialogues
- Removing Reflections from RAW Photos
- Frequency-Biased Synergistic Design for Image Compression and Compensation
- Multi-scale Semantically Modulated Mixed Convolutional Networks for Sub-pixel Mapping
- Cross Scale Attention Transformer for Single Image Super-Resolution
- Design Method of Distributed Decoupling Capacitors for Both Voltage Overshoot Suppression and Dynamic Current Sharing in SiC MOSFET Power Module
- Coupling Study in a 2D Gimbal-less Quasi-static Piezoelectrically-Actuated MEMS Mirror
- MegaSaM: Accurate, Fast, and Robust Structure and Motion from Casual Dynamic Videos
- VL-RewardBench: A Challenging Benchmark for Vision-Language Generative Reward Models
- Virtual Wind Tower Siting Method Based on Multi-Objective Mayfly Optimization Algorithm
- Lumbar EMG-Based Motion Intent Recognition for Industrial Exoskeletons
- Secure Access: A Multimodal Authentication and Thread Detection System for Person in Residence Halls
- HybridGS: Decoupling Transients and Statics with 2D and 3D Gaussian Splatting
- S 3 -Face: SSS-Compliant Facial Reflectance Estimation via Diffusion Priors
- Classic Video Denoising in a Machine Learning World: Robust, Fast, and Controllable
- INFP: Audio-Driven Interactive Head Generation in Dyadic Conversations
- IDOL: Instant Photorealistic 3D Human Creation from a Single Image
- Harvesting Energy from Subclavian Artery Motion for Self-Powered Implantable Medical Devices
- Do Visual Imaginations Improve Vision-and-Language Navigation Agents?
- Optimizing Deep Learning Inference on Heterogeneous Devices on the Edge: An ILP-Based Rate Monotonic Scheduling Approach
- ShowUI: One Vision-Language-Action Model for GUI Visual Agent
- Low Latency Depth of Field Fusion System and Method Employing Fpga for Autonomous Driving
- Improving mapping of convolutional neural networks on FPGAs through tailored macro sizes
- Rethinking Decoder Design: Improving Biomarker Segmentation Using Depth-to-Space Restoration and Residual Linear Attention
- GauCho: Gaussian Distributions with Cholesky Decomposition for Oriented Object Detection
- In-Situ Benchmarking of Oxide-Based Leaf Wetness Sensor for Integrated Plant Disease Management
- Strain-Regulated Polarity Switching in Flexible MoTe 2 Transistors
- Method and Implementation of Batch Renaming for Structure Trees Based on CATIA V6 Secondary Development
- Breaking Down LLM Inference: A preliminary performance analysis of sparsified transformers
- Can Machines Understand Composition? Dataset and Benchmark for Photographic Image Composition Embedding and Understanding
- Generalized Few-shot 3D Point Cloud Segmentation with Vision-Language Model
- Investigating Winter Lightning in Kanazawa, Japan: Results from Triggered Experiments and Multi-Parameter Observations
- Causal Composition Diffusion Model for Closed-loop Traffic Generation
- A Linked Stochastic Kriging for Multi-Layer Systems with Noisy Response