- MP-GUI: Modality Perception with MLLMs for GUI Understanding
- Analytical Subdomain Modelling and Analysis of a Single Rotor Induction Assisted IPM Motor for EVs
- HiMoR: Monocular Deformable Gaussian Reconstruction with Hierarchical Motion Representation
- 3D Prior is All You Need: Cross-Task Few-shot 2D Gaze Estimation
- Ev-3DOD: Pushing the Temporal Boundaries of 3D Object Detection with Event Cameras
- ILIAS: Instance-Level Image retrieval At Scale
- SkyMamba: Integrating Transformer and State Space Model for UAV Remote Sensing RGB-D Images Semantic Segmentation
- GEM: A Generalizable Ego-Vision Multimodal World Model for Fine-Grained Ego-Motion, Object Dynamics, and Scene Composition Control
- Research on Torque Optimization of Outer Rotor Permanent Magnet Synchronous Motor Based on Response Surface Methodology
- Multiple Object Tracking as ID Prediction
- Segment Any-Quality Images with Generative Latent Space Enhancement
- Gaussian Splatting for Efficient Satellite Image Photogrammetry
- Instant Adversarial Purification with Adversarial Consistency Distillation
- Distilling Spectral Graph for Object-Context Aware Open-Vocabulary Semantic Segmentation
- ProAPO: Progressively Automatic Prompt Optimization for Visual Classification
- UrbanCAD: Towards Highly Controllable and Photorealistic 3D Vehicles for Urban Scene Simulation
- Enhancing KGCN-Based Recommendation Algorithms via Attention Mechanism Integration
- An Improved Data Fusion Model for Secondary Return Water Temperature of Heating System Employing EKF
- Microfluidic biosensors for biotic and abiotic plant stress monitoring
- Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis
- Parallel Fractal Decomposition Optimization Algorithms on Heterogeneous Architectures
- Lifeline Connect: A Web-based Multi-Feature System for Mental Health Support
- Development of a next-generation thermopile detector for cold-body space applications
- Noise-Consistent Siamese-Diffusion for Medical Image Synthesis and Segmentation
- SLADE: Shielding against Dual Exploits in Large Vision-Language Models
- Multi-feature Collaborative Attention Dynamic Hypergraph Convolutional Network for Hyperspectral Image Classification
- MaskGaussian: Adaptive 3D Gaussian Representation from Probabilistic Masks
- UCM-VeID V2: A Richer Dataset and A Pre-Training Method for UAV Cross-Modality Vehicle Re-Identification
- TSD-SR: One-Step Diffusion with Target Score Distillation for Real-World Image Super-Resolution
- DirectTriGS: Triplane-based Gaussian Splatting Field Representation for 3D Generation
- Self-Sustained Oscillation Analysis of Grid-Interfaced Converters by Frequency-Domain Method Considering Harmonic Balance Principle
- Smart Eye: A Surveillance system
- MAGE : Single Image to Material-Aware 3D via the Multi-View G-Buffer Estimation Model
- Object-Shot Enhanced Grounding Network for Egocentric Video
- AssertionForge: Enhancing Formal Verification Assertion Generation with Structured Representation of Specifications and RTL
- iG-6DoF: Model-Free 6DoF Pose Estimation for Unseen Object via Iterative 3D Gaussian Splatting
- FSDFormer: a Frequency-Selected Differential Fusion Transformer for Remote Sensing Image Spatiotemporal Fusion
- Stacking Brick by Brick: Aligned Feature Isolation for Incremental Face Forgery Detection
- PRISTINE: PRIority-Aware Smart Resource Orchestration eNginE for Cloud-Native Applications
- SceneDiffuser++: City-Scale Traffic Simulation via a Generative World Model
- Empowering Large Language Models with 3D Situation Awareness
- GLAD-TL: A Time-Sensitive Crowdsourced Model for Robust Detection of Fake Taxis in Urban Traffic Surveillance
- SCORPIO: A Parallel I/O library for Exascale Earth System Models
- Study on Insulator Local Arc Development Considering Energy Level Transition and Surface Particles
- Towards Practical Real-Time Neural Video Compression
- Floxels: Fast Unsupervised Voxel Based Scene Flow Estimation
- NFC in Health Monitoring : A New Era of Medical Cards and Application
- Curriculum Direct Preference Optimization for Diffusion and Consistency Models
- Sufficient Invariant Learning for Distribution Shift
- ICE Over the Years - A Keyword Analysis
- TaskSimLF: Efficient Leader-Follower Multi-Agent Path Finding With Clustered Pickup and Delivery
- Conformal Prediction and MLLM aided Uncertainty Quantification in Scene Graph Generation
- DeSplat: Decomposed Gaussian Splatting for Distractor-Free Rendering
- Estimate Crowd Flow Including Side-trip Behavior When Exiting from Large-scale Event Venues
- AI Driven Self-Healing Cybersecurity Systems with Agentic AI for Adaptive Threat Response and Resilience
- Learning Compatible Multi-Prize Subnetworks for Asymmetric Retrieval
- A Review on Digital Product Passports as Drivers of Digital Transformation in Industry
- HELVIPAD: A Real-World Dataset for Omnidirectional Stereo Depth Estimation
- nbshmem: Enabling GPU-Initiated Multi-GPU Communication in Python
- StyleSSP: Sampling StartPoint Enhancement for Training-free Diffusion-based Method for Style Transfer
- FFaceNeRF: Few-shot Face Editing in Neural Radiance Fields
- Accelerating CRS Format Conversion for Sparse Matrix Computation with an FPGA
- Demystifying Chains, Trees, and Graphs of Thoughts
- Robust Safety Critical Control of Uncertain Nonlinear Systems with DoS Attacks
- Remote Current Sensing Using Reflectometry for Bioelectric Applications
- Shift the Lens: Environment-Aware Unsupervised Camouflaged Object Detection
- CraterID-Loc: End-to-End Crater Identification for Lunar Image Localization
- Cyber Laws & Emerging Trends of Artificial Intelligence: An Analytical Study
- PanDA: Towards Panoramic Depth Anything with Unlabeled Panoramas and Möbius Spatial Augmentation
- Reconstructing Animals and the Wild
- Modeling and Analysis of a Multipole Permanent Magnet Assisted Synchronous Reluctance Machine for Electric Vehicles
- SVG-IR: Spatially-Varying Gaussian Splatting for Inverse Rendering
- Scaling Down Text Encoders of Text-to-Image Diffusion Models
- Unsupervised Predictive Maintenance on Industrial Electric Motors Based on Self-Sustainable IoT Wireless Sensor Nodes
- RTL++: Graph-enhanced LLM for RTL Code Generation
- A Novel Circulating Current Control Technique in Onboard Integrated Charger
- g3D-LF: Generalizable 3D-Language Feature Fields for Embodied Tasks
- Apple vs. Oranges: Evaluating the Apple Silicon M-Series SoCs for HPC Performance and Efficiency
- SharpDepth: Sharpening Metric Depth Predictions Using Diffusion Distillation
- Curing Cycle of Thermoset Epoxy Resin: Modeling and Simulation of a Drone Body Cover Geometry to Demonstrate the Usage of a Digital Tool and Its Application in a Real Case Scenario
- Iceberg: Enhancing HLS Modeling with Synthetic Data
- Sparse Voxels Rasterization: Real-time High-fidelity Radiance Field Rendering
- Faster Parameter-Efficient Tuning with Token Redundancy Reduction
- Control Strategy of Permanent Magnet Synchronous Motor Based on Improved Sliding Mode Controller
- LATENT: LLM-Augmented Trojan Insertion and Evaluation Framework for Analog Netlist Topologies
- Robust-MVTON: Learning Cross-Pose Feature Alignment and Fusion for Robust Multi-View Virtual Try-On
- Optimized Battery Thermal Management using Smart Hybrid Cooling Techniques for Electric Vehicle Applications
- SpatialCLIP: Learning 3D-aware Image Representations from Spatially Discriminative Language
- PMNI: Pose-free Multi-view Normal Integration for Reflective and Textureless Surface Reconstruction
- Generalized Received Signal Power Models for Multi-hop RIS and its Practical Analysis
- PatchDEMUX: A Certifiably Robust Framework for Multi-label Classifiers Against Adversarial Patches
- Stretching Each Dollar: Diffusion Training from Scratch on a Micro-Budget
- Dynamic Event-Triggered Optimal Control for Fuzzy Multiagent Systems With DoS Attacks
- Mixture of Submodules for Domain Adaptive Person Search
- Seeing Far and Clearly: Mitigating Hallucinations in MLLMs with Attention Causal Decoding
- Lightning Current Distribution in an Electric Vehicle
- Viewpoint Rosetta Stone: Unlocking Unpaired Ego-Exo Videos for View-invariant Representation Learning
- Blind Beamforming via Deep Learning-Based Signal Classification and Transfer Learning
- Through-The-Mask: Mask-based Motion Trajectories for Image-to-Video Generation
- Concept Drift Mitigation on Resource-Constrained IoT Devices via Self-Learning