- A Distractor-Aware Memory for Visual Object Tracking with SAM2
- On the Influence of the SNR on the Optimization of Two-Satellite Systems with AOA Receivers
- Lifting Motion to the 3D World via 2D Diffusion
- Black Hole-Driven Identity Absorbing in Diffusion Models
- Post-pre-training for Modality Alignment in Vision-Language Foundation Models
- Cloud-Accelerated Personalized Cancer Treatment Prediction Using Adaptive Quantum-Inspired Evolutionary Algorithm and Graph Neural Networks
- Similarity-Guided Layer-Adaptive Vision Transformer for UAV Tracking
- Yo’Chameleon: Personalized Vision and Language Generation
- A DHTOL Test-Based Methodology to Investigate the Switching Reliability of GaN HEMTs Under Repeated Drain Voltage Ringing
- Masking meets Supervision: A Strong Learning Alliance
- ReSpec: Relevance and Specificity Grounded Online Filtering for Learning on Video-Text Data Streams
- PhD: A ChatGPT-Prompted Visual hallucination Evaluation Dataset
- An Ultra Flexible Quad-port Converter for Hybrid Energy Storage System (HESS) in Fuel Cell Electric Vehicle (FCEV) Powertrains
- Variational Information Inference: An Interpretable Disentangled Transfer Learning Quality Prediction for Multirate Industrial Processes
- EigenGS Representation: From Eigenspace to Gaussian Image Space
- Investigation of Deep Learning Techniques Used in Medicinal Plants Identification and Classification
- Dense-SfM: Structure from Motion with Dense Consistent Matching
- Feature Selection for Latent Factor Models
- Gradient-Guided Annealing for Domain Generalization
- Incorporating Dense Knowledge Alignment into Unified Multimodal Representation Models
- Hierarchical Knowledge Prompt Tuning for Multi-task Test-Time Adaptation
- Towards Million-Scale Adversarial Robustness Evaluation With Stronger Individual Attacks
- LLaVA-Critic: Learning to Evaluate Multimodal Models
- ViT-CatNet: An End-to-End Vision Transformer Architecture for Instance-Level Feline Identification
- Leveraging Temporal Cues for Semi-Supervised Multi-View 3D Object Detection
- Multi-modal Vision Pre-training for Medical Image Analysis
- Leveraging SD Map to Augment HD Map-based Trajectory Prediction
- Design of a GaN-based Series Resonant Dual Active Bridge DC-DC converter for EV Charging Application
- Cybercrime in Numbers: Data-Driven Cybersecurity Evaluation
- Fuzzy Adaptive Command Filtered Control of Strict-Feedback Fractional-Order Nonlinear Systems With State and Input Quantization
- Show and Tell: Visually Explainable Deep Neural Nets via Spatially-Aware Concept Bottleneck Models
- UNICL-SAM: Uncertainty-Driven In-Context Segmentation with Part Prototype Discovery
- Exploring Spatiotemporal Relational Learning With TimeSformer for Identifying the Severity of the Road Accidents
- DecoupledGaussian: Object-Scene Decoupling for Physics-Based Interaction
- Machine Learning Prediction on User Satisfaction in Human-Robot Interaction (HRI) Tasks
- Comparative Performance Analysis of Classic and Modulated FCS-MPC for Grid-Tied Inverters
- GaPT-DAR: Category-level Garments Pose Tracking via Integrated 2D Deformation and 3D Reconstruction
- Targeted Poisoning Attacks Against Vertical Federated Learning Via Embedding Manipulation
- Integrated multi-format microwave signal generator using thin-film lithium niobate Mach-Zehnder modulator
- Comprehensive Relighting: Generalizable and Consistent Monocular Human Relighting and Harmonization
- Optimizing for the Shortest Path in Denoising Diffusion Model
- UCOD-DPL: Unsupervised Camouflaged Object Detection via Dynamic Pseudo-label Learning
- Bregman-divergence-based Arimoto-Blahut algorithm
- Extending Microservices Performance Optimization Through Horizontal Pod Autoscaling: A Comprehensive Study
- HuMoCon: Concept Discovery for Human Motion Understanding
- An Efficient Hybrid Algorithm Combining Skeletonization MoM-PO and EDM for Solving Electromagnetic Radiation of Large-Scale Targets
- Touch2Shape: Touch-Conditioned 3D Diffusion for Shape Exploration and Reconstruction
- Dual Graph Learning for Multivariate Time Series Anomaly Detection in IoUT
- TANGO: Training-free Embodied AI Agents for Open-world Tasks
- Tooth Instance Segmentation in CBCT Images Using Watershed Algorithm and nnUNet
- OODD: Test-time Out-of-Distribution Detection with Dynamic Dictionary
- Non-intrusive load monitoring using two-point sensors for load measurement, identification and localization
- Indoor Air Quality Control and Intelligent Design Combined with Machine Learning
- Forensic Self-Descriptions Are All You Need for Zero-Shot Detection, Open-Set Source Attribution, and Clustering of AI-generated Images
- Inter turn short circuit fault detection using PWM ripple currents in Brushless DC motor
- Detail-Preserving Latent Diffusion for Stable Shadow Removal
- Plug-and-Play Versatile Compressed Video Enhancement
- Shape My Moves: Text-Driven Shape-Aware Synthesis of Human Motions
- Celebrating the 30th Anniversary of WiC [Member Activities]
- CaricatureBooth: Data-Free Interactive Caricature Generation in a Photo Booth
- Resilient Voltage Restoration Scheme for AC Microgrids Under Cyber Attack
- A Practical Approach to Accurate Modeling of SiC MOSFET Turn-Off Switching Losses
- A New Method for Temperature Measurement of Wire Rod Based on Adaptive Exposure Time
- A Planar Transformer Winding Configuration for High Frequency DAB Converter with Common-Mode EMI Mitigation
- Research on the Security of Time-Sensitive Networking Enabled Industrial Control System
- Lightweight Neural Network With Fixed Classifier for Enhanced Drone RF Signal Recognition
- Co-op: Correspondence-based Novel Object Pose Estimation
- DriveDreamer4D: World Models Are Effective Data Machines for 4D Driving Scene Representation
- Interoperable Tools for Deploying Non-Destructive Inspection Solutions
- MambaIRv2: Attentive State Space Restoration
- Conical Visual Concentration for Efficient Large Vision-Language Models
- Derivative-Free Diffusion Manifold-Constrained Gradient for Unified XAI
- Statistical Fault Attacks on ASCON Using Improved Square Euclidean Imbalance
- Emphasizing Discriminative Features for Dataset Distillation in Complex Scenarios
- Exploring the Use of Neural Networks for Early Detection of Oral Cancer and Other Dental Pathologies
- Attraction Diminishing and Distributing for Few-Shot Class-Incremental Learning
- Low Resource Passive Acoustic Vessel Detectors: Performance and System Design for Challenging Acoustic Environments
- Infinity∞: Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis
- RandAR: Decoder-only Autoregressive Visual Generation in Random Orders
- VISTA3D: A Unified Segmentation Foundation Model For 3D Medical Imaging
- Deep Learning-Driven Vulnerability Detection Models for Software Security
- Diagnosis of Retinal Disorder by using Deep Learning Algorithm
- Chebyshev Attention Depth Permutation Texture Network with Latent Texture Attribute Loss
- CustomKD: Customizing Large Vision Foundation for Edge Model Improvement via Knowledge Distillation
- MambaVO: Deep Visual Odometry Based on Sequential Matching Refinement and Training Smoothing
- Simultaneous Enhancement of Electrochemical Migration Lifetime and Reliability of Sintered Silver
- Security of Dynamically Reconfigurable RISC-V Systems: I/O Attack Focus
- Advanced Charger Placement Strategies in Sensor Networks Using Graph Theory and Evolutionary Algorithms
- A Stitch in Time Saves Nine: Small VLM is a Precise Guidance for Accelerating Large VLMs
- Electrostatic and Electromagnetic Particle-in-Cell Solvers for Electron Beam Device Simulations
- OpenHumanVid: A Large-Scale High-Quality Dataset for Enhancing Human-Centric Video Generation
- Active Harmonic Filter Based Topology and Control of Medium Voltage High Power Traction Motor Drive for Enhanced Performance
- HoVLE: Unleashing the Power of Monolithic Vision-Language Models with Holistic Vision-Language Embedding
- AToM: Aligning Text-to-Motion Model at Event-Level with GPT-4Vision Reward
- Neuromorphic Event Camera-based Object Recognition and Grasping Position Detection Using a Transfer Learning-Enhanced Multi-Task Model
- NightAdapter: Learning a Frequency Adapter for Generalizable Night-time Scene Segmentation
- Energy Efficient Scheduling of AI/ML Workloads on Multi-Instance GPUs with Dynamic Repartitioning
- Dynamic Hard Task-Guided Meta-Transfer Learning for Pathological Cell Segmentation
- Flowing from Words to Pixels: A Noise-Free Framework for Cross-Modality Evolution
- Diffusion-4K: Ultra-High-Resolution Image Synthesis with Latent Diffusion Models