- Artificial Intelligence Based Inverter Fault Detection System
- Dual-Criteria Active Learning for Medical Image Segmentation
- Community Forensics: Using Thousands of Generators to Train Fake Image Detectors
- Polar Dense Ice Layer Ship Path Planning Based on DI-IVYA-A* Algorithm
- Shadow Generation Using Diffusion Model with Geometry Prior
- Sub-THz and THz Channel Measurements and Characteristic Analysis in Indoor and Outdoor Environments for 6G Wireless Systems
- BIP3D: Bridging 2D Images and 3D Perception for Embodied Intelligence
- Towards Consistent Multi-Task Learning: Unlocking the Potential of Task-Specific Parameters
- GFlowVLM: Enhancing Multi-step Reasoning in Vision-Language Models with Generative Flow Networks
- Matrix-Free Shared Intrinsics Bundle Adjustment
- Three Cars Approaching within 100m! Enhancing Distant Geometry by Tri-Axis Voxel Scanning for Camera-based Semantic Scene Completion
- VidHalluc: Evaluating Temporal Hallucinations in Multimodal Large Language Models for Video Understanding
- EVOS: Efficient Implicit Neural Training via EVOlutionary Selector
- TMPGait: Hybrid Architecture for Multi-Scale Spatiotemporal Feature Learning in Gait Recognition
- Spectral State Space Model for Rotation-Invariant Visual Representation Learning
- Unlocking Energy-Efficient and High-Throughput Secure Data Communication in IoT with Memory-Centric Computing
- Towards Long-Horizon Vision-Language Navigation: Platform, Benchmark and Method
- Open Prototyping – A Toolkit for Open Engineering?: The Case of The New Real Observatory, an Environmentally-Conscious Generative AI Platform
- Ornstein-Uhlenbeck Noise Driven Diffusion Model
- Research on Floating Ice Recognition and Wave Reconstruction Using Binocular Vision
- Novel View Synthesis with Pixel-Space Diffusion Models
- A Flag Decomposition for Hierarchical Datasets
- VidSeg: Training-free Video Semantic Segmentation based on Diffusion Models
- Comparative Study of Planar Ag/AgCl Quasi-Reference Electrodes Developed on PCB
- Novel LPP-Constructed Topology Integrated with Knowledge based GCN for Fault Diagnosis
- SALAD: Skeleton-aware Latent Diffusion for Text-driven Motion Generation and Editing
- SocialMOIF: Multi-Order Intention Fusion for Pedestrian Trajectory Prediction
- Generative Image Layer Decomposition with Visual Effects
- Dynamic Coordination to Supplementary Damping Controllers in Heterogeneous Wind Farm to Suppress Oscillations Among Synchronous Generators
- Seeing Speech and Sound: Distinguishing and Locating Audio Sources in Visual Scenes
- Annotation Ambiguity Aware Semi-Supervised Medical Image Segmentation
- Universal Scene Graph Generation
- Spatial457: A Diagnostic Benchmark for 6D Spatial Reasoning of Large Multimodal Models
- Simulation Framework for Assessing VWC Performance in Low-Cost Smart Agriculture Sensors
- h-Edit: Effective and Flexible Diffusion-Based Editing via Doob’s h-Transform
- LEBR-YOLO: Accurate Fruit and Vegetable Detection Method for Space Station Cargo Bay
- Digital Transformation of Healthcare Professionals Trainings: Towards a Design Framework for Creating Autonomous XR Training Platforms
- IMFine: 3D Inpainting via Geometry-guided Multi-view Refinement
- A Brain Tumor Image Classification Method Based on Improved MobileViT Model
- Causal Composition Diffusion Model for Closed-loop Traffic Generation
- Method for Image Restoration in Oil Well Drilling Fluid
- BadToken: Token-level Backdoor Attacks to Multi-modal Large Language Models
- A Novel Three Port Multi-Input Single Inductor DC-DC Bidirectional Boost Converter
- Intelligent Image Classification and Emergency Detection for Enhanced CCTV Surveillance Systems using CNN
- SCSGuardian: A Practical Hardware Defense against Speculative Cache Side-Channel Attacks
- Aesthetic Post-Training Diffusion Models from Generic Preferences with Step-by-step Preference Optimization
- The PanAf-FGBG Dataset: Understanding the Impact of Backgrounds in Wildlife Behaviour Recognition
- Mask-Enhanced Edge-Aware Knowledge Distillation for Medical Image Segmentation
- Dataset Distillation with Neural Characteristic Function: A Minmax Perspective
- Detection-Friendly Nonuniformity Correction: A Union Framework for Infrared UAV Target Detection
- Sensorless Implementation of Pulse Density Modulation in Direct AC/AC SST
- EGS-SLAM: RGB-D Gaussian Splatting SLAM with Events
- MICAS: Multi-grained In-Context Adaptive Sampling for 3D Point Cloud Processing
- AnyDressing: Customizable Multi-Garment Virtual Dressing via Latent Diffusion Models
- Research on the Design of a Multi-User Interactive System for Chu Music Based on Multimodal Collaboration
- GUI-Xplore: Empowering Generalizable GUI Agents with One Exploration
- RigGS: Rigging of 3D Gaussians for Modeling Articulated Objects in Videos
- DPU: Dynamic Prototype Updating for Multimodal Out-of-Distribution Detection
- Study and Validation of a Novel dq-axes Equivalent Circuit Model for PMSM Considering the Iron Loss
- ML Enabled Parallel R-C Sensor for Level and Electrical Conductivity Measurement
- PerLA: Perceptive 3D language assistant
- Introducing Time-lag and Bi-LSTM Neural Network for In-Operando Surface Temperature Estimation in Lithium-ion Batteries
- FedMIA: An Effective Membership Inference Attack Exploiting "All for One" Principle in Federated Learning
- GSM based LPG Leakage Detector with SMS Alert System
- A Cost-effective Occupancy Estimation System for Energy-efficient Buildings in Africa
- Setchain Algorithms for Blockchain Scalability (Extended Abstract)
- Design of Intelligent Home Environment Monitoring System Based on Deep Learning
- Optimized Gas Sensor Array with AI for Distinguishing and Classifying Similar Odorants
- PrEditor3D: Fast and Precise 3D Shape Editing
- Improved Video VAE for Latent Video Diffusion Model
- Physical Constraints into Deep Learning for Enhanced Snow Depth Retrieval over the Third Pole
- HC-PCL: A Hierarchical Cross-Camera Prototypical Contrastive Learning Framework for Unsupervised Object Re-Identification
- Model Poisoning Attacks to Federated Learning via Multi-Round Consistency
- Lightweight Hybrid Attention Network for Edge Deployment in Crop Disease Recognition
- 3D Modeling of Coal Bunkers Based on LiDAR
- Double-Frequency Control of Multi-Active Bridge Converters for Soft-Switching Range Extension
- Uplink Spectral Efficiency Performance of Cell-Free RAN System Under Imperfect CSI
- Unveiling collective value creation behavior in public projects: The stakeholder value network approach
- Joint Semantic Detection and Dissemination Control of Phishing Attacks on Social Media Via Llamabased Modeling
- Multitwine: Multi-Object Compositing with Text and Layout Control
- Modeling Thousands of Human Annotators for Generalizable Text-to-Image Person Re-identification
- Towards community-based influence spread prediction (CIP) for edge changes in large-scale dynamic social networks
- One-Minute Video Generation with Test-Time Training
- Contemporary Landscape of Fire Safety Monitoring and Control Practices in Buildings
- Predicting the Predictor: Linear Metamodeling for Evolving User Response Prediction
- DDMG: A New Discrete Diffusion Model Developed for Molecular Graph Generation
- CCIFE: Channel-Resilient Ensemble Adversarial Attack Against DNN-Based Modulation Classifiers
- Lumbar EMG-Based Motion Intent Recognition for Industrial Exoskeletons
- Smart Metering for Real-Time Power Prediction Price Forecasting and Intelligent Alerts in Modern Living Spaces
- VSNet: Focusing on the Linguistic Characteristics of Sign Language
- YOLO-Poppy: Opium Poppy Detection Algorithm for Complex Aerial Scenes
- MoEE: Mixture of Emotion Experts for Audio-Driven Portrait Animation
- Activating Sparse Part Concepts for 3D Class Incremental Learning
- Feature Information Driven Position Gaussian Distribution Estimation for Tiny Object Detection
- Diagnosis of Retinal Disorder by using Deep Learning Algorithm
- SoftShadow: Leveraging Soft Masks for Penumbra-Aware Shadow Removal
- Security of Dynamically Reconfigurable RISC-V Systems: I/O Attack Focus
- Classic Video Denoising in a Machine Learning World: Robust, Fast, and Controllable
- Overcoming Shortcut Problem in VLM for Robust Out-of-Distribution Detection
- Electrostatic and Electromagnetic Particle-in-Cell Solvers for Electron Beam Device Simulations