- Enhanced Soups for Graph Neural Networks
- 3D-HGS: 3D Half-Gaussian Splatting *
- SoMA: Singular Value Decomposed Minor Components Adaptation for Domain Generalizable Representation Learning
- Robust Multimodal Survival Prediction with Conditional Latent Differentiation Variational AutoEncoder
- Uncertain Multimodal Intention and Emotion Understanding in the Wild
- Event-based Video Super-Resolution via State Space Models
- CLGAIN-KF: A Data Recovery Method for False Data Injection Attacks in Power Systems
- Next-Gen Multimedia Encryption by Combining Symmetric and Asymmetric Cryptographic Techniques
- Deep Learning Integration In Agriculture Framework For Farmer Sustainability
- ArtFormer: Controllable Generation of Diverse 3D Articulated Objects
- Asynchronous Collaborative Graph Representation for Frames and Events
- DINOv2 Meets Text: A Unified Framework for Image- and Pixel-Level Vision-Language Alignment
- Rethinking Spiking Self-Attention Mechanism: Implementing α-XNOR Similarity Calculation in Spiking Transformers
- SKDream: Controllable Multi-view and 3D Generation with Arbitrary Skeletons
- Uniformity Calibration of Mini/Micro Light Emitting Diode Splicing Screen Based on Multi-Image Fusion and Parameter Optimization
- Weakly Supervised Semantic Segmentation via Progressive Confidence Region Expansion
- State-of-the-Art Defense Schemes Against Byzantine Attack in Federated Learning
- Dual-Agent Optimization framework for Cross-Domain Few-Shot Segmentation
- Exploring the AMD ® Deep Learning Processor Unit for Accelerating Selective Sweep Detection
- Towards an Efficient Containerized Cloud Gaming Platform
- Object Detection and Hazard Alert System for Child Safety on Robot using YOLO
- Miniaturized Multi-Band Smartphone Antenna Using Non-Foster Based Active Matching Below 1 GHz
- Recurrent Feature Mining and Keypoint Mixup Padding for Category-Agnostic Pose Estimation
- Design of A Dynamic Decoupling Network for A Pair of Wearable Inverted L-shaped Antennas Based on Common/Differential Mode Theory
- Probabilistic Prompt Distribution Learning for Animal Pose Estimation
- Efficient Routing Congestion Prediction in Chip Design: Integrating Netlist Structure and Design Specifications With Heterogeneous Graph Attention Networks
- RIS-enhanced Semantic-aware Sensing, Communication, Computation and Control for Internet of Things
- Bidirectional Isolated Half Bridge Three-level Resonant DC-DC Converter and Optimized Modulation Strategy
- A Survey on Transformer Architecture Design for Abnormal Behavior Recognition Tasks
- Smart Groundwater Recharge Management using Cloud Computing and Gradient Boosting Machines
- RAAP-CGRA: Placement for CGRAs with Restricted Routing Architectures
- BrepGiff: Lightweight Generation of Complex B-rep with 3D GAT Diffusion
- VideoHandles: Editing 3D Object Compositions in Videos Using Video Generative Priors
- Anomize: Better Open Vocabulary Video Anomaly Detection
- Explaining in Diffusion: Explaining a Classifier with Diffusion Semantics
- Fast Terminal Sliding Mode Control with Nonlinear Disturbance Observer for Buck Converter with Space Vector Pulse Width Amplitude Modulation
- Explainable Saliency: Articulating Reasoning with Contextual Prioritization
- Interactive Medical Image Analysis with Concept-based Similarity Reasoning
- Edge-SD-SR: Low Latency and Parameter Efficient On-device Super-Resolution with Stable Diffusion via Bidirectional Conditioning
- PoseTraj: Pose-Aware Trajectory Control in Video Diffusion
- Driver Fatigue Detection Based on BiLSTM-At and Multi-Source Information Fusion
- Gazing at Rewards: Eye Movements as a Lens into Human and AI Decision-Making in Hybrid Visual Foraging
- VideoScene: Distilling Video Diffusion Model to Generate 3D Scenes in One Step
- OFER: Occluded Face Expression Reconstruction
- Real Time Feedforward Decoupled Control of Triple Active Bridge Converter for Charging of Electric Vehicles
- Latent Space Super-Resolution for Higher-Resolution Image Generation with Diffusion Models
- PosterMaker: Towards High-Quality Product Poster Generation with Accurate Text Rendering
- Learning Visual Composition through Improved Semantic Guidance
- On the Effect of Rolling Sphere Penetration and Rod Spacing Configurations in Optimizing Lightning Protection for Solar Farms
- LesionLocator: Zero-Shot Universal Tumor Segmentation and Tracking in 3D Whole-Body Imaging
- Spatial Equivalent Impedance Model Based Initial Position Detection for PMSM with High-Frequency Injection
- Artificial Neural Network-Based Control for Capacitor Voltage Ripple Balancing in Cascaded H-Bridge Converters
- THz Channels for Short-Range Mobile Networks: Multipath Channel Behavior and Human Body Shadowing Effects
- A Shape-Adjustable BéZier Model and Its De Casteljau-Type Algorithm
- Good Vibes: a PWM-Enabled Covert Channel for Securing UAVs Operations
- Exposure-slot: Exposure-centric representations learning with Slot-in-Slot Attention for Region-aware Exposure Correction
- Digital Twin Catalog: A Large-Scale Photorealistic 3D Object Digital Twin Dataset
- Single phase AC-AC Modular Traction Power Conditioner and Control Strategy for High-Speed Co-phasal Railway Systems
- Self-Supervised Large Scale Point Cloud Completion for Archaeological Site Restoration
- UniPhy: Learning a Unified Constitutive Model for Inverse Physics Simulation
- A Distractor-Aware Memory for Visual Object Tracking with SAM2
- On the Influence of the SNR on the Optimization of Two-Satellite Systems with AOA Receivers
- Lifting Motion to the 3D World via 2D Diffusion
- Black Hole-Driven Identity Absorbing in Diffusion Models
- Post-pre-training for Modality Alignment in Vision-Language Foundation Models
- Cloud-Accelerated Personalized Cancer Treatment Prediction Using Adaptive Quantum-Inspired Evolutionary Algorithm and Graph Neural Networks
- Similarity-Guided Layer-Adaptive Vision Transformer for UAV Tracking
- Yo’Chameleon: Personalized Vision and Language Generation
- A DHTOL Test-Based Methodology to Investigate the Switching Reliability of GaN HEMTs Under Repeated Drain Voltage Ringing
- Masking meets Supervision: A Strong Learning Alliance
- ReSpec: Relevance and Specificity Grounded Online Filtering for Learning on Video-Text Data Streams
- PhD: A ChatGPT-Prompted Visual hallucination Evaluation Dataset
- An Ultra Flexible Quad-port Converter for Hybrid Energy Storage System (HESS) in Fuel Cell Electric Vehicle (FCEV) Powertrains
- Variational Information Inference: An Interpretable Disentangled Transfer Learning Quality Prediction for Multirate Industrial Processes
- EigenGS Representation: From Eigenspace to Gaussian Image Space
- Investigation of Deep Learning Techniques Used in Medicinal Plants Identification and Classification
- Dense-SfM: Structure from Motion with Dense Consistent Matching
- Feature Selection for Latent Factor Models
- Gradient-Guided Annealing for Domain Generalization
- Incorporating Dense Knowledge Alignment into Unified Multimodal Representation Models
- Hierarchical Knowledge Prompt Tuning for Multi-task Test-Time Adaptation
- Towards Million-Scale Adversarial Robustness Evaluation With Stronger Individual Attacks
- LLLaVA-Critic: Learning to Evaluate Multimodal Models
- ViT-CatNet: An End-to-End Vision Transformer Architecture for Instance-Level Feline Identification
- Leveraging Temporal Cues for Semi-Supervised Multi-View 3D Object Detection
- Multi-modal Vision Pre-training for Medical Image Analysis
- Leveraging SD Map to Augment HD Map-based Trajectory Prediction
- Design of a GaN-based Series Resonant Dual Active Bridge DC-DC converter for EV Charging Application
- Cybercrime in Numbers: Data-Driven Cybersecurity Evaluation
- Fuzzy Adaptive Command Filtered Control of Strict-Feedback Fractional-Order Nonlinear Systems With State and Input Quantization
- Show and Tell: Visually Explainable Deep Neural Nets via Spatially-Aware Concept Bottleneck Models
- UNICL-SAM: Uncertainty-Driven In-Context Segmentation with Part Prototype Discovery
- Exploring Spatiotemporal Relational Learning With TimeSformer for Identifying the Severity of the Road Accidents
- DecoupledGaussian: Object-Scene Decoupling for Physics-Based Interaction
- Machine Learning Prediction on User Satisfaction in Human-Robot Interaction (HRI) Tasks
- Comparative Performance Analysis of Classic and Modulated FCS-MPC for Grid-Tied Inverters
- GaPT-DAR: Category-level Garments Pose Tracking via Integrated 2D Deformation and 3D Reconstruction
- Targeted Poisoning Attacks Against Vertical Federated Learning Via Embedding Manipulation
- Integrated multi-format microwave signal generator using thin-film lithium niobite Mach-Zehnder modulator
- Comprehensive Relighting: Generalizable and Consistent Monocular Human Relighting and Harmonization