- DPSeg: Dual-Prompt Cost Volume Learning for Open-Vocabulary Semantic Segmentation
- A Survey on All-in-One Image Restoration: Taxonomy, Evaluation and Future Trends
- Active Event-based Stereo Vision
- Graph Neural Network Combining Event Stream and Periodic Aggregation for Low-Latency Event-Based Vision
- Pothole Detection Using Deep Learning Methods
- Review of the Drone Applications in Urban Underground Environments
- A Short Survey on Large Language Models' Application in Manufacturing Domain
- Coordinated Control of Deformation and Flight for Morphing Aircraft via Meta-Learning and Coupled State-Dependent Riccati Equations
- Structured Superposition of Autoencoders for UEP Codes at Intermediate Blocklengths
- Benefit Assessment Model and Method for Virtual Control Unit Participation in Frequency Response Control of Provincial Networks
- Training Data Provenance Verification: Did Your Model Use Synthetic Data from My Generative Model for Training?
- Composing Parts for Expressive Object Generation
- LLMDet: Learning Strong Open-Vocabulary Object Detectors under the Supervision of Large Language Models
- Language Guided Concept Bottleneck Models for Interpretable Continual Learning
- Understanding and Mitigating Lightning-Related Animal Fatalities: Case Studies, Injury Pathways, and Protection Measures
- ActiveGAMER: Active GAussian Mapping through Efficient Rendering
- Semiconductor Loss Balancing of a 9-level Cascaded H-Bridge multilevel Inverter through Novel Carrier-reassignment PWM Scheme
- Investigating the Role of Weight Decay in Enhancing Nonconvex SGD
- SeqMvRL: A Sequential Fusion Framework for Multi-view Representation Learning
- RestorGS: Depth-aware Gaussian Splatting for Efficient 3D Scene Restoration
- Advanced Traffic Object Detection in Complex Road Conditions Using an Optimized YOLOv8 Algorithm
- Using Powerful Prior Knowledge of Diffusion Model in Deep Unfolding Networks for Image Compressive Sensing
- Learning to Highlight Audio by Watching Movies
- Towards a Model-Based Framework for Automated Traceable Systems and Probabilistic Model Checking
- Spec2RTL-Agent: Automated Hardware Code Generation from Complex Specifications Using LLM Agent Systems
- VERA: Explainable Video Anomaly Detection via Verbalized Learning of Vision-Language Models
- Integrating Physics-Informed Neural Networks and GRU for SciML-based Surface Temperature Prediction Li-ion Battery
- Optimized Cloud Performance Through Secure Data Detachment and Reproduction
- ChannelGuard: A DIRS-based Location Privacy-Protecting Mechanism for Integrated Sensing and Communication Systems
- EventSplat: 3D Gaussian Splatting from Moving Event Cameras for Real-time Rendering
- Scalable Runtime Architecture for Data-driven, Hybrid HPC and ML Workflow Applications
- Diffusion Bridge: Leveraging Diffusion Model to Reduce the Modality Gap Between Text and Vision for Zero-Shot Image Captioning
- A Three-Phase Synchronous Reference Frame Controller-Based DC Link Voltage Balancing Technique for CHB-Based Modular SST
- DarkGAN-Enhanced Low-Light Detection and Localization of Low-Voltage Electricity Meters
- Recognizing Abnormalities in Fundus Images Using Vision Transformer for Ocular Diseases
- Enhancing Creative Generation on Stable Diffusion-based Models
- AdaptCMVC: Robust Adaption to Incremental Views in Continual Multi-view Clustering
- DrivingSphere: Building a High-fidelity 4D World for Closed-loop Simulation
- Binarized Mamba-Transformer for Lightweight Quad Bayer HybridEVS Demosaicing
- Large Language Model for Verilog Generation with Code-Structure-Guided Reinforcement Learning
- Epidemic-Behavior Coevolutionary Vaccination Game Dynamics Under Prospect Theory
- Mr. DETR: Instructive Multi-Route Training for Detection Transformers
- DPC: Dual-Prompt Collaboration for Tuning Vision-Language Models
- Towards In-the-wild 3D Plane Reconstruction from a Single Image
- On the Zero-shot Adversarial Robustness of Vision-Language Models: A Truly Zero-shot and Training-free Approach
- FedChip: Federated LLM for Artificial Intelligence Accelerator Chip Design
- Let Humanoids Hike! Integrative Skill Development on Complex Trails
- Blurred LiDAR for Sharper 3D: Robust Handheld 3D Scanning with Diffuse LiDAR and RGB
- MFogHub: Bridging Multi-Regional and Multi-Satellite Data for Global Marine Fog Detection and Forecasting
- GIF: Generative Inspiration for Face Recognition at Scale
- LERFSNet: A Lightweight SAR Ship Detection Model with Enhanced Receptive Field and Shared Decoupling Head
- Light Weight Apple Leaf Disease Detection Method Based on Improved YOLOv8
- VLsI: Verbalized Layers-to-Interactions from Large to Small Vision Language Models
- TIMotion: Temporal and Interactive Framework for Efficient Human-Human Motion Generation
- Design and Analysis of 39GHz 5G Array Antennas for WBAN Applications
- FedBiP: Heterogeneous One-Shot Federated Learning with Personalized Latent Diffusion Models
- QCLAB: A Matlab Toolbox for Quantum Computing
- ASIC-Agent: An Autonomous Multi-Agent System for ASIC Design with Benchmark Evaluation
- Fusing CNN and LSTM Networks for Residential Electricity Load Forecasting
- UNEM: UNrolled Generalized EM for Transductive Few-Shot Learning
- Exploration and Application of Desktop Cloud Technology Based on Fusionaccess in Geological Survey Service
- From Laboratory to Real World: A New Benchmark Towards Privacy-Preserved Visible-Infrared Person Re-Identification
- Inversion Circle Interpolation: Diffusion-based Image Augmentation for Data-scarce Classification
- SeaLion: Semantic Part-Aware Latent Point Diffusion Models for 3D Generation
- SDB-YOLO: A Lightweight X-Ray Image Component Detection Algorithm Based on Semantic Dual-Branch Features
- A Single-Phase Single-Stage Boost Inverter with Sensorless Balancing of Capacitors
- Can Generative Video Models Help Pose Estimation?
- MaIR: A Locality-and Continuity-Preserving Mamba for Image Restoration
- Research on Optimizing Data Preprocessing Using Clustering Algorithms
- ScaleLSD: Scalable Deep Line Segment Detection Streamlined
- Exploring Deep Learning Models for Hyperspectral Image Classification
- Analytical Model for Eccentric IPMSM via a New Equivalent Transformation Method and Its Rotor Vibration Calculation Considering the Coupling Effect
- A Hardware/Software Co-Design Approach for Versal-Based K-means Acceleration
- Track4Gen: Teaching Video Diffusion Models to Track Points Improves Video Generation
- Real-time Voltage Control in Smart Distribution Network through Multi-agent Cooperative Optimization
- DynamicScaler: Seamless and Scalable Video Generation for Panoramic Scenes
- Generation, Analysis and Validation of Retinal Images Associated with Diabetic Retinopathy Using Generative Artificial Intelligence
- APHQ-ViT: Post-Training Quantization with Average Perturbation Hessian Based Reconstruction for Vision Transformers
- Enhancing Safety in Industry 5.0: Human-Computer Collaboration Benefits through a Dataset of Protective Equipment Detection
- Towards Stable and Storage-efficient Dataset Distillation: Matching Convexified Trajectory
- DeClotH: Decomposable 3D Cloth and Human Body Reconstruction from a Single Image
- GroomLight: Hybrid Inverse Rendering for Relightable Human Hair Appearance Modeling
- ITA-MDT: Image-Timestep-Adaptive Masked Diffusion Transformer Framework for Image-Based Virtual Try-On
- OccMamba: Semantic Occupancy Prediction with State Space Models
- A Novel Continuously Variable Gate Voltage Control Concept for Silicon Carbide Power Modules
- Active Hyperspectral Imaging Using an Event Camera
- Photovoltaic Powered Autonomous Wireless Power Transfer charging for Marine Electric Vehicle
- A Generic Process Mining Framework for Uncovering Hierarchical Process Model
- ArcPro: Architectural Programs for Structured 3D Abstraction of Sparse Points
- Semantic and Sequential Alignment for Referring Video Object Segmentation
- Novel Three-Phase Efficient Power Supply for Hydrogen Production Using Active Voltage Injection Method
- A Lightweight UDF Learning Framework for 3D Reconstruction Based on Local Shape Functions
- Peachy Parallel Assignments (EduPar 2025)
- Surgeon: Memory-Adaptive Fully Test-Time Adaptation via Dynamic Activation Sparsity
- SPARS3R: Semantic Prior Alignment and Regularization for Sparse 3D Reconstruction
- Motion Modes: What Could Happen Next?
- SFS: A Simple File System for Teaching Parallelism in Computer Systems
- Fault Feature Extraction and Diagnosis Method for Marine Diesel Engine Based on Time-delay Embedded manifold Learning
- SimLTD: Simple Supervised and Semi-Supervised Long-Tailed Object Detection
- OrgoFarm : Harnessing Artificial Intelligence and Machine Learning to Revolutionize Organic Farming and Direct-to-Consumer Marketing