- SYCL for HPC: Adapting to Diverse CPU Architecture
- TSP-Mamba: The Travelling Salesman Problem Meets Mamba for Image Super-resolution and Beyond
- Remaining Useful Life Prediction of Bearings under Complex Operating Conditions: A DAENet-XLSTM Transfer Learning Model
- PanDA: Towards Panoramic Depth Anything with Unlabeled Panoramas and Möbius Spatial Augmentation
- Reconstructing Animals and the Wild
- Towards Natural Language-Based Document Image Retrieval: New Dataset and Benchmark
- Scaling Down Text Encoders of Text-to-Image Diffusion Models
- Design of Transformer Parameters for Energy Efficiency Enhancement of Semi-Active Bridge Converter
- Research on Semantic Segmentation Algorithm Based on Enhanced DeeplabV3+
- Spatial Transport Optimization by Repositioning Attention Map for Training-Free Text-to-Image Synthesis
- COFFEE: Mitigating Hallucination in LVLMs via COllaborative Filtering for Enhanced Eyes
- LineArt: A Knowledge-guided Training-free High-quality Appearance Transfer for Design Drawing with Diffusion Model
- Examining Recent Entrepreneurial Ecosystem Research in Emerging Economies – A Bibliometric Analysis
- Dual Prompting Image Restoration with Diffusion Transformers
- A Novel Circulating Current Control Technique in Onboard Integrated Charger
- g3D-LF: Generalizable 3D-Language Feature Fields for Embodied Tasks
- LogiCzsl: Exploring Logic-induced Representation for Compositional Zero-shot Learning
- VILA-M3: Enhancing Vision-Language Models with Medical Expert Knowledge
- Thin-Shell-SfT: Fine-Grained Monocular Non-rigid 3D Surface Tracking with Neural Deformation Fields
- Robust-MVTON: Learning Cross-Pose Feature Alignment and Fusion for Robust Multi-View Virtual Try-On
- Visual Consensus Prompting for Co-Salient Object Detection
- Sonata: Self-Supervised Learning of Reliable Point Representations
- Generalized Received Signal Power Models for Multi-hop RIS and its Practical Analysis
- A New Statistical Model of Star Speckles for Learning to Detect and Characterize Exoplanets in Direct Imaging Observations
- Learning Class Prototypes for Unified Sparse-Supervised 3D Object Detection
- DeRS: Towards Extremely Efficient Upcycled Mixture-of-Experts Models
- Object-aware Sound Source Localization via Audio-Visual Scene Understanding
- A Low-Profile Shared-Aperture Antenna Using Electromagnetic Transparent Structure and AMC
- CDI: Copyrighted Data Identification in Diffusion Models
- SpatialLLM: A Compound 3D-Informed Design towards Spatially-Intelligent Large Multimodal Models
- Distilling Monocular Foundation Model for Fine-grained Depth Completion
- Parametric Point Cloud Completion for Polygonal Surface Reconstruction
- Multifrequency Model of Sinusoidal PWM by DIDR Methodology
- Application of AI in Lightning and Thunderstorm Forecasting: A Vision for the Future
- Extrapolating and Decoupling Image-to-Video Generation Models: Motion Modeling is Easier Than You Think
- Memories of Forgotten Concepts
- Blind Beamforming via Deep Learning-Based Signal Classification and Transfer Learning
- DualTalk: Dual-Speaker Interaction for 3D Talking Head Conversations
- Electromyography-Informed Facial Expression Reconstruction for Physiological-Based Synthesis and Analysis
- Label Shift Meets Online Learning: Ensuring Consistent Adaptation with Universal Dynamic Regret
- Enhancing Manufacturing Training Through VR Simulations
- Instruction-based Image Manipulation by Watching How Things Move
- C2FNet: Cross-Probabilistic Weak Supervision Learning for High-Resolution Land Cover Enhancement
- Is Your World Simulator a Good Story Presenter? A Consecutive Events-Based Benchmark for Future Long Video Generation
- Performance Characterization of Parallel Combination Generators on CPU and GPU Systems
- TCM Control in High-frequency Inverter Without Bottom Current Detection
- Survey App: Rating and Feedback System Application
- Rationales and Expectations with Upcoming New Line Surge Arrester (LSA) Standard IEC/IEEE 60099-11
- HADES: Hardware Accelerated Decoding for Efficient Speculation in Large Language Models
- MP-GUI: Modality Perception with MLLMs for GUI Understanding
- HumanDreamer: Generating Controllable Human-Motion Videos via Decoupled Generation
- Analytical Subdomain Modelling and Analysis of a Single Rotor Induction Assisted IPM Motor for EVs
- GEM: A Generalizable Ego-Vision Multimodal World Model for Fine-Grained Ego-Motion, Object Dynamics, and Scene Composition Control
- Neural Inverse Rendering from Propagating Light
- Star with Bilinear Mapping
- A Single-Stage Admittance Control Network Based Misalignment Tolerant Inductive Power Transfer System for EV Application
- Question-Aware Gaussian Experts for Audio-Visual Question Answering
- CrossOver: 3D Scene Cross-Modal Alignment
- An Improved Data Fusion Model for Secondary Return Water Temperature of Heating System Employing EKF
- Optimal Transport-Guided Source-Free Adaptation for Face Anti-Spoofing
- A Method for Detecting Dangerous Behaviors of Power Operation Personnel Based on Paddledetection
- A Parallel and Highly-Portable HPC Poisson Solver: Preconditioned Bi-CGSTAB with alpaka
- Microfluidic biosensors for biotic and abiotic plant stress monitoring
- Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis
- Research on Neural Network Hyper-Parameters Optimization Based on Firefly Algorithm
- DKDM: Data-Free Knowledge Distillation for Diffusion Models with Any Architecture
- STEREO: A Two-Stage Framework for Adversarially Robust Concept Erasing from Text-to-Image Diffusion Models
- Energy-Efficient Design for Downlink Pinching-Antenna Systems with QoS Guarantee
- Exact: Exploring Space-Time Perceptive Clues for Weakly Supervised Satellite Image Time Series Semantic Segmentation
- Understanding multi-layered transmission matrices
- Deep Reinforcement Learning for Adaptive Beamforming in 6G Massive MIMO Systems Using DeepMIMO
- Neuron: Learning Context-Aware Evolving Representations for Zero-Shot Skeleton Action Recognition
- Acc3D: Accelerating Single Image to 3D Diffusion Models via Edge Consistency Guided Score Distillation
- MultiMorph: On-Demand Atlas Construction
- Exploring Near-Optimal Contraction Strategies for the Scalar Product in the Tensor-Train Format
- Continuous Space-Time Video Resampling with Invertible Motion Steganography
- Scene4U: Hierarchical Layered 3D Scene Reconstruction from Single Panoramic Image for Your Immerse Exploration
- Empowering Large Language Models with 3D Situation Awareness
- DNF: Unconditional 4D Generation with Dictionary-based Neural Fields
- Embracing Load Imbalance for Energy Optimizations: a Case-Study
- Open-Vocabulary Functional 3D Scene Graphs for Real-World Indoor Spaces
- Study on Insulator Local Arc Development Considering Energy Level Transition and Surface Particles
- Robust Methodology Design to Predict Opioid Overdose System based on AI Assisted Deep Learning Principles
- Conformal Prediction and MLLM aided Uncertainty Quantification in Scene Graph Generation
- DeSplat: Decomposed Gaussian Splatting for Distractor-Free Rendering
- SpiralShard: Highly Concurrent and Secure Blockchain Sharding via Linked Cross-Shard Endorsement
- T2V-CompBench: A Comprehensive Benchmark for Compositional Text-to-video Generation
- Sensing depth analysis of different permittivity materials based on open-ended coaxial probes at different input powers
- Improving Transferable Targeted Attacks with Feature Tuning Mixup
- Real-time Measurement of Aeolian Sand Transport by means of LoRaWAN-based Sand Traps
- HPPC-Based ECM Parameter Optimisation of Lithium-ion Battery: A Comparative Analysis of Non-Linear Least Squares Methods
- MonoInstance: Enhancing Monocular Priors via Multi-view Instance Alignment for Neural Rendering and Reconstruction
- Unveil Inversion and Invariance in Flow Transformer for Versatile Image Editing
- Demystifying Chains, Trees, and Graphs of Thoughts
- BIOMEDICA: An Open Biomedical Image-Caption Archive, Dataset, and Vision-Language Models Derived from Scientific Literature
- Dr. Splat: Directly Referring 3D Gaussian Splatting via Direct Language Embedding Registration
- Shift the Lens: Environment-Aware Unsupervised Camouflaged Object Detection
- Joint retrieval of ozone profile in near-space based on the atmospheric and near infrared atmospheric bands of O₂ airglow
- DDoS Protection System for Cloud using AWS and Machine Learning
- Calico: Part-Focused Semantic Co-Segmentation with Large Vision-Language Models