- Joint UAV 3D Trajectory and Resource Allocation for Integrated LEO Satellite and Multi-UAV-Enabled Marine IoT Networks: A Federated Multi-Agent Deep Reinforcement Learning Approach
- Distributed Cooperative Positioning in Mobile Wireless Networks: A GNN-Aided Joint Model- and Data-Driven Framework With High-Accuracy Closed-Form Message Representation
- A Hierarchical Hybrid PSO for Single-Objective Numerical Optimization
- DeCafNet: Delegate and Conquer for Efficient Temporal Grounding in Long Videos
- MOVIS: Enhancing Multi-Object Novel View Synthesis for Indoor Scenes
- ν-LPA: Fast GPU-based Label Propagation Algorithm (LPA) for Community Detection
- Three-Phase Matrix-Based High-Power AC-DC Fast Charger for Low-Voltage Electric Vehicles
- Unleashing the Potential of Multi-modal Foundation Models and Video Diffusion for 4D Dynamic Physical Scene Simulation
- Learning Person-Specific Animatable Face Models from In-the-Wild Images via a Shared Base Model
- MonoTAKD: Teaching Assistant Knowledge Distillation for Monocular 3D Object Detection
- Conformity Assessment of a Multi-Sensor Device for Indoor Environmental Quality Monitoring
- LC-Mamba: Local and Continuous Mamba with Shifted Windows for Frame Interpolation
- CNN-based Data Processing for Enhanced Detection of Small Targets in Sea Clutter
- Automated Parking Trajectory Generation Using Deep Reinforcement Learning
- AnyEdit: Mastering Unified High-Quality Image Editing for Any Idea
- LookCloser: Frequency-aware Radiance Field for Tiny-Detail Scene
- MEGA: Masked Generative Autoencoder for Human Mesh Recovery
- Research on Information Fusion Algorithm for Wireless Sensor Homogeneous Networks
- Three-level Boost integrated Five-level Active Neutral Point Clamped Inverter for improved DC-link utilisation
- Research on the Application of Multi Factor Authentication Technology for 5G Terminals in the Coal Industry
- FruitNinja: 3D Object Interior Texture Generation with Gaussian Splatting
- EgoPressure: A Dataset for Hand Pressure and Pose Estimation in Egocentric Vision
- Towards Universal AI-Generated Image Detection by Variational Information Bottleneck Network
- YOLOv11-ASD: Improved YOLOv11 for Multi-Scale and Small Object Detection
- InsTaG: Learning Personalized 3D Talking Head from Few-Second Video
- Camera resection from known line pencils and a radially distorted scanline
- ZeroVO: Visual Odometry with Minimal Assumptions
- An Effective Action Recognition Method Based on Image Coding and a Dual-Channel Fusion Network
- Prosody-Enhanced Acoustic Pre-training and Acoustic-Disentangled Prosody Adapting for Movie Dubbing
- Decision SpikeFormer: Spike-Driven Transformer for Decision Making
- FDS: Frequency-Aware Denoising Score for Text-Guided Latent Diffusion Image Editing
- An Improved Control Method of a Resonant-Inductive Wireless Charger with Input Power Factor Correction
- From Alexnet to Transformers: Measuring the Non-linearity of Deep Neural Networks with Affine Optimal Transport
- DivPrune: Diversity-based Visual Token Pruning for Large Multimodal Models
- Inst3D-LMM: Instance-Aware 3D Scene Understanding with Multi-modal Instruction Tuning
- Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Vision-Language Models
- Traversing Distortion-Perception Tradeoff using a Single Score-Based Generative Model
- ISL-based Multi-Satellite Collaborative Computation Offloading and Resource Allocation in ISTN
- Diff-Palm: Realistic Palmprint Generation with Polynomial Creases and Intra-Class Variation Controllable Diffusion Models
- IoT Enabled Real-Time Vehicle Tracking and Alert System for Educational Transport Service
- Data-Driven Adaptive Open-Circuit Fault Localization Method With Reduced Sensors for Magnetic-Network Energy Router
- Optimizing House Price Prediction Models: A Hybrid Approach using GWO-Feature Selection and CatBoost Tuning
- Tightening Robustness Verification of MaxPool-based Neural Networks via Minimizing the Over-Approximation Zone
- PRaDA: Projective Radial Distortion Averaging
- An Adaptive Weighted Metric Learning Network Based on Fractional Domain Decoupling for Hyperspectral Change Detection
- Balancing Profit and Purpose in Business Model Innovation: A 6C Analysis of an Emerging Circular Ecosystem
- EasyHOI: Unleashing the Power of Large Models for Reconstructing Hand-Object Interactions in the Wild
- Variance-Integrated Policy Optimization: A Maximum Entropy Approach for Localization in Energy Interconnection Systems
- A Macro-Physical Model for Narrow Bipolar Events
- Cloud-Enabled ML Techniques for PTSD Assessment and Management
- Test-time augmentation improves efficiency in conformal prediction
- DAB-based Swiss Rectifier for Wide-range Voltage Output with Universal Input
- PassionSR: Post-Training Quantization with Adaptive Scale in One-Step Diffusion based Image Super-Resolution
- Active Power Control Method for Voltage Support with Three-phase Series PV Inverter in Low-Voltage Distribution Networks
- Advanced Analysis of Optoelectronic System Signals to Assess Postural Behavior in Parkinson’s Disease
- Sparse Point Cloud Patches Rendering via Splitting 2D Gaussians
- Speed Control of Six Step Commutation Trapezoidal by Fuzzy Logic Control of BLDC Motor for E-Vehicle
- Enhanced Fault Diagnosis in Transformer Oil using Duval Pentagon Method
- PI-HMR: Towards Robust In-bed Temporal Human Shape Reconstruction with Contact Pressure Sensing
- RefPose: Leveraging Reference Geometric Correspondences for Accurate 6D Pose Estimation of Unseen Objects
- CAP4D: Creating Animatable 4D Portrait Avatars with Morphable Multi-View Diffusion Models
- DOF-GS: Adjustable Depth-of-Field 3D Gaussian Splatting for Post-Capture Refocusing, Defocus Rendering and Blur Removal
- A Method for Detecting Citrus Leaf Diseases Improved Based on YOLOv10
- Few-shot Personalized Scanpath Prediction
- Incomplete Multi-modal Brain Tumor Segmentation via Learnable Sorting State Space Model
- SP3D: Boosting Sparsely-Supervised 3D Object Detection via Accurate Cross-Modal Semantic Prompts
- Towards Sustainable Machine Learning with Serverless at the Edge
- Charon: An End-to-End Infrastructure for Connecting AI@Edge to HPC
- PillarHist: A Quantization-aware Pillar Feature Encoder based on Height-aware Histogram
- Auto Cherry-Picker: Learning from High-quality Generative Data Driven by Language
- Real-Time Environmental Monitoring using ESP-NOW-based Wireless Sensor Network for Sustainable Agriculture
- High-Performance Computing for Graph AI: A Top-Down Perspective
- ShowMak3r: Compositional TV Show Reconstruction
- SACB-Net: Spatial-Awareness Convolutions for Medical Image Registration
- FSHNet: Fully Sparse Hybrid Network for 3D Object Detection
- HistoFS: Non-IID Histopathologic Whole Slide Image Classification via Federated Style Transfer with RoI-Preserving
- Design Optimization of a 3kW Bi-Directional Dual Active Bridge Converter for Battery Energy Storage Application
- Sonic: Shifting Focus to Global Audio Perception in Portrait Animation
- Design, Analysis and Fabrication of a High Speed Inner-Hollow Outer Rotor Brushless DC Motor for Yarn Feeding Textile Machinery
- DucDiff: Dual-consistent Diffusion for Uncertainty-aware Information Diffusion Prediction
- SemiDAViL: Semi-supervised Domain Adaptation with Vision-Language Guidance for Semantic Segmentation
- UniAlign: Scaling Multimodal Alignment within One Unified Model
- VEU-Bench: Towards Comprehensive Understanding of Video Editing
- A U-Net Framework with Dice Loss for High-Precision Retinal Vessel Segmentation
- RL-RC-DoT: A Block-level RL agent for Task-Aware Video Compression
- Bootstrap Your Own Views: Masked Ego-Exo Modeling for Fine-grained View-invariant Video Representations
- Guiding Human-Object Interactions with Rich Geometry and Relations
- QanDe: The Power of Binary Emulation for Obfuscation Analysis
- Development and Evaluation of Secure Data Transmission in Microgrid Secondary Controller
- Learning Conditional Space-Time Prompt Distributions for Video Class-Incremental Learning
- FRAMES-VQA: Benchmarking Fine-Tuning Robustness across Multi-Modal Shifts in Visual Question Answering
- VISCO: Benchmarking Fine-Grained Critique and Correction Towards Self-Improvement in Visual Reasoning
- Learning to Normalize on the SPD Manifold under Bures-Wasserstein Geometry
- Study of Photovoltaic Parameters of Efficient Bulk-Heterojunction Organic Solar Cells for Indoor Applications Under Varied Intensities and Indoor LED Lighting
- Real-Time Rat Pose Estimation System via Miniature Stereo Vision for Robot-Rat Interaction
- DefMamba: Deformable Visual State Space Model
- Unbalanced AC Grid Operation of a Power-Dense, Cost-Effective, and Efficient Hybrid Modular Multilevel Converter
- Single Domain Generalization for Few-Shot Counting via Universal Representation Matching
- An Interleaved Soft-Switching Bidirectional Converter for EV Application
- A Lightweight Radio Frequency Fingerprint Recognition Method Based on Spatial Synergy Enhancing Attention