- Difix3D+: Improving 3D Reconstructions with Single-Step Diffusion Models
- Optical Dialogue Photonic Converter for Photon-Driven DC Motor System
- Generation, Analysis and Validation of Retinal Images Associated with Diabetic Retinopathy Using Generative Artificial Intelligence
- Self-Supervised Learning for Color Spike Camera Reconstruction
- Cross-Modal Distillation for 2D/3D Multi-Object Discovery from 2D Motion
- Zero-shot 3D Question Answering via Voxel-based Dynamic Token Compression
- PGC: Physics-Based Gaussian Cloth from a Single Pose
- Infrastructure Resilience in Fast-Growing Cities: Key Challenges and Opportunities – A Review
- AniDoc: Animation Creation Made Easier
- CountLLM: Towards Generalizable Repetitive Action Counting via Large Language Model
- Outlier Detection and other applications of Quantum Matrix Multiplication
- Efficient Intra-node Hierarchical Parallelisms And Dynamic Load Balancing Strategies On Heterogeneous Systems
- Integrated State of Charge and Thermal Active Balancing in Lithium-Ion Batteries: A Finite Set Model Predictive Control Approach
- Processing of Optical Imagery Onboard Earth Observation Satellites: Benchmarking An Embedded Computing Approach
- Navigating Uncertainty: The Evolution of Entrepreneurial Support Networks During the COVID-19 Crisis
- A Novel Continuously Variable Gate Voltage Control Concept for Silicon Carbide Power Modules
- ConMo: Controllable Motion Disentanglement and Recomposition for Zero-Shot Motion Transfer
- Silence is Golden: Leveraging Adversarial Examples to Nullify Audio Control in LDM-based Talking-Head Generation
- EVolSplat: Efficient Volume-based Gaussian Splatting for Urban View Synthesis
- Research on Multimodal MRI Glioma Segmentation Based on Attention Mechanism
- SynerGen-VL: Towards Synergistic Image Understanding and Generation with Vision Experts and Token Folding
- Lightning Warning Based on Extrapolation of Radar Composite Reflectivity Data
- Improving Relation Extraction with Contrastive Learning-Based Named Entity Recognition
- Securing End-to-End Reinforcement Learning-Driven Autonomous Driving: A Control Command Utility-based Intrusion Response System
- Project-Probe-Aggregate: Efficient Fine-Tuning for Group Robustness
- Generative Map Priors for Collaborative BEV Semantic Segmentation
- Align-KD: Distilling Cross-Modal Alignment Knowledge for Mobile Vision-Language Large Model Enhancement
- WeakMCN: Multi-task Collaborative Network for Weakly Supervised Referring Expression Comprehension and Segmentation
- Malware Detection System: Safeguarding Against Evolving Threats
- Trade-Offs in Resource-Constrained Dimensionality Reduction Algorithms
- DynFocus: Dynamic Cooperative Network Empowers LLMs with Video Understanding
- Determining the Efficacy of SENet Integrated YOLO Models For Animal Detection
- SFQ-Driven Pulse-Phase Sequence Generator for Superconducting Qubit Control
- Composing Parts for Expressive Object Generation
- Language Guided Concept Bottleneck Models for Interpretable Continual Learning
- Electromagnetically Actuated Variable Optical Attenuator with Configurable Dynamic Range
- ECBench: Can Multi-modal Foundation Models Understand the Egocentric World? A Holistic Embodied Cognition Benchmark
- Long-Range Dense Mapping with Enhanced Accuracy via a Flying Variable-Baseline Stereo System
- OpticalNet: An Optical Imaging Dataset and Benchmark Beyond the Diffraction Limit
- UNEM: UNrolled Generalized EM for Transductive Few-Shot Learning
- Online Monitoring the Arc Contacts Erosion of HVCBs Based on Radiation Signals
- Performance Improvement of Multi-output Auxiliary Power Supply with Planar Magnetics Design
- Mamba4D: Efficient 4D Point Cloud Video Understanding with Disentangled Spatial-Temporal State Space Models
- ColabSfM: Collaborative Structure-from-Motion by Point Cloud Registration
- Generation and Deep Learning-Based Classification of RF Signals Represented as I/Q Time Series
- EventSplat: 3D Gaussian Splatting from Moving Event Cameras for Real-time Rendering
- InPO: Inversion Preference Optimization with Reparametrized DDIM for Efficient Diffusion Model Alignment
- Research on Application of Gesture Interaction Technology Based on Computer Vision
- Enhanced Visual-Semantic Interaction with Tailored Prompts for Pedestrian Attribute Recognition
- Diffusion Bridge: Leveraging Diffusion Model to Reduce the Modality Gap Between Text and Vision for Zero-Shot Image Captioning
- Mitigating Magnetic Saturation in Coupled Inductor VRMs with a Novel Interleaved Winding Arrangement
- MOS-Attack: A Scalable Multi-Objective Adversarial Attack Framework
- Personalized Preference Fine-tuning of Diffusion Models
- A Three-Phase Synchronous Reference Frame Controller-Based DC Link Voltage Balancing Technique for CHB-Based Modular SST
- ImagineFSL: Self-Supervised Pretraining Matters on Imagined Base Set for VLM-based Few-shot Learning
- GS-2DGS: Geometrically Supervised 2DGS for Reflective Object Reconstruction
- A High Gain Non-Isolated Single-Switch DC-DC Boost Converter: Design and Analysis
- FedSPA: Generalizable Federated Graph Learning under Homophily Heterogeneity
- Progressive Focused Transformer for Single Image Super-Resolution
- SelfSplat: Pose-Free and 3D Prior-Free Generalizable 3D Gaussian Splatting
- Preprocessing Pipelines for OT Network Traffic Capture Data in AI Cybersecurity Applications
- Prompting-in-a-Series: Psychology-Informed Contents and Embeddings for Personality Recognition With Decoder-Only Models
- From Laboratory to Real World: A New Benchmark Towards Privacy-Preserved Visible-Infrared Person Re-Identification
- Enhancing Creative Generation on Stable Diffusion-based Models
- DrivingSphere: Building a High-fidelity 4D World for Closed-loop Simulation
- Deformable Radial Kernel Splatting
- Efficient Net Load Forecasting in Large-scale Power Distribution Systems via Dual-branch Experts Fusion Memory Network
- SeaLion: Semantic Part-Aware Latent Point Diffusion Models for 3D Generation
- TAET: Two-Stage Adversarial Equalization Training on Long-Tailed Distributions
- Testing and Experimentation Facility for AI Decision Support Systems for Energy Solutions
- SDB-YOLO: A Lightweight X-Ray Image Component Detection Algorithm Based on Semantic Dual-Branch Features
- Once-Tuning-Multiple-Variants: Tuning Once and Expanded as Multiple Vision-Language Model Variants
- RORem: Training a Robust Object Remover with Human-in-the-Loop
- Line Graph Neural Network for Drug-Disease Association Prediction
- UA-Pose: Uncertainty-Aware 6D Object Pose Estimation and Online Object Completion with Partial References
- AlphaPre: Amplitude-Phase Disentanglement Model for Precipitation Nowcasting
- Checkpointing Optimisation to Prepare Future Exascale Plasma Turbulence Simulations
- Dynamic Generation Technology of Network Scene Based on Proximal Policy Optimization
- Design and Application of a Supervision Management System for Railway Rolling Stock
- Shadows of Disparity: Unveiling the Asymmetry of Mutual Coupling in Densely-Packed MIMO
- S2D-LFE: Sparse-to-Dense Light Field Event Generation
- Scalable Autoregressive Monocular Depth Estimation
- DPC: Dual-Prompt Collaboration for Tuning Vision-Language Models
- Speech Recognition System Based on Microcontroller
- TokenHSI: Unified Synthesis of Physical Human-Scene Interactions through Task Tokenization
- OpenSDI: Spotting Diffusion-Generated Images in the Open World
- Hearing Anywhere in Any Environment
- On the Zero-shot Adversarial Robustness of Vision-Language Models: A Truly Zero-shot and Training-free Approach
- A Hardware/Software Co-Design Approach for Versal-Based K-means Acceleration
- Let Humanoids Hike! Integrative Skill Development on Complex Trails
- Track4Gen: Teaching Video Diffusion Models to Track Points Improves Video Generation
- M3GYM: A Large-Scale Multimodal Multi-view Multi-person Pose Dataset for Fitness Activity Understanding in Real-world Settings
- Distinguish Then Exploit: Source-free Open Set Domain Adaptation via Weight Barcode Estimation and Sparse Label Assignment
- Real-time Voltage Control in Smart Distribution Network through Multi-agent Cooperative Optimization
- DynamicScaler: Seamless and Scalable Video Generation for Panoramic Scenes
- Progressive Correspondence Regenerator for Robust 3D Registration
- LamRA: Large Multimodal Model as Your Advanced Retrieval Assistant
- SnowMaster: Comprehensive Real-world Image Desnowing via MLLM with Multi-Model Feedback Optimization
- Cooperative Algorithms for Multi-Agent Multi-Armed Bandits: Integrating $\varepsilon$-Greedy Optimization
- GOAL: Global-local Object Alignment Learning