- KAC: Kolmogorov-Arnold Classifier for Continual Learning
- ROD-MLLM: Towards More Reliable Object Detection in Multimodal Large Language Models
- Timestep Embedding Tells: It’s Time to Cache for Video Diffusion Model
- NoT: Federated Unlearning via Weight Negation
- Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward Pass
- T2ISafety: Benchmark for Assessing Fairness, Toxicity, and Privacy in Image Generation
- Personalized Diabetes Diet Recommendation System with Knowledge Graph and Incremental Learning
- Data Analysis for Structural Health Monitoring of a Steel Jacket Offshore Platform
- Generalized Zero-Shot Classification via Semantics-Free Inter-Class Feature Generation
- ClimbingCap: Multi-Modal Dataset and Method for Rock Climbing in World Coordinate
- Effect of Field Contaminants on rGO-coated Flexible Leaf Wetness Sensors for In-Situ Agriculture Applications
- Doppelgangers++: Improved Visual Disambiguation with Geometric 3D Features
- Impact of Mutual Flux on Rotor Position Estimation Using the Reluctance Equivalent Back-EMF Model for Synchronous Reluctance Motors
- Wonderland: Navigating 3D Scenes From a Single Image
- GCC: Generative Color Constancy via Diffusing a Color Checker
- ArtiFade: Learning to Generate High-quality Subject from Blemished Images
- Study of TEG-Heatsink Pairs for Indoor Thermal Energy Harvesting Applications
- Any-Resolution AI-Generated Image Detection by Spectral Learning
- Poster: A Scalable and Fault-Tolerant Decentralized Middleware for CI/CD Workflow
- MCCD: Multi-Agent Collaboration-based Compositional Diffusion for Complex Text-to-Image Generation
- SemiETS: Integrating Spatial and Content Consistencies for Semi-Supervised End-to-end Text Spotting
- Protection Measures for Lightning Overvoltages in a Low-Voltage Dc Power Supply System
- Zero-Shot Image Restoration Using Few-Step Guidance of Consistency Models (and Beyond)
- A Structured Tool Landscape for Data-Driven ProductManagernent
- Let’s Chorus: Partner-aware Hybrid Song-Driven 3D Head Animation
- A Unified, Resilient, and Explainable Adversarial Patch Detector
- Application of Deep Learning in Vehicle Classification: Boosted by AugMix Image Enhancement
- SOH Estimation of Lithium-ion Batteries using LSTM Model with Deconvoluted EIS Parameters
- ByTheWay: Boost Your Text-to-Video Generation Model to Higher Quality in a Training-free Way
- Omnia de EgoTempo: Benchmarking Temporal Understanding of Multi-Modal LLMs in Egocentric Videos
- ETAP: Event-based Tracking of Any Point
- Confidence-Aware 3D Spatial Compounding of 2D Ultrasound Images for Needle Shadow Removal
- Traction-Based Topology Reconstruction for UAV Anti-Jamming Networking: A Game-Theoretical Strategy
- Multi-Scaled Lightweight Neural Network Based ARC Fault Diagnosis: Feature Combination and Classification
- MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling
- Detecting Adversarial Data Using Perturbation Forgery
- BG-Triangle: Bézier Gaussian Triangle for 3D Vectorization and Rendering
- Video Summarization using 3D CNNs: A Convolutional Approach to Spatial-Temporal Feature Extraction
- The Language of Motion: Unifying Verbal and Non-verbal Language of 3D Human Motion
- Initial Set Point Prediction of Adaptive PI Controller using Machine Learning Algorithm
- Visual Prompting for One-shot Controllable Video Editing without Inversion
- Research on Fishing Vessel Operation Type Recognition Based on CNN-LSTM
- Video-ColBERT: Contextualized Late Interaction for Text-to-Video Retrieval
- Flash-Split: 2D Reflection Removal with Flash Cues and Latent Diffusion Separation
- When the Future Becomes the Past: Taming Temporal Correspondence for Self-supervised Video Representation Learning
- Autoregressive Sequential Pretraining for Visual Tracking
- Model Predictive Control based Adaptive Phase Shift Modulation for Neutral Point Clamped Dual Active Bridge Converter System
- Evaluating the Use of NORM Residues in Building Restoration: A Risk Assessment Approach Using ERICA and NORMALYSA
- V-Stylist: Video Stylization via Collaboration and Reflection of MLLM Agents
- Q-PART: Quasi-Periodic Adaptive Regression with Test-time Training for Pediatric Left Ventricular Ejection Fraction Regression
- Triple Switch Flexible Step-Up Converter for Fuel Cell Electric Vehicle
- Honey Bee-Inspired Energy-Efficient Cluster Head Optimization for Large-Scale Cloud-based IoT Applications
- FlexDrive: Toward Trajectory Flexibility in Driving Scene Gaussian Splatting Reconstruction and Rendering
- DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos
- Rethinking Noisy Video-Text Retrieval via Relation-aware Alignment
- Testbench analysis using non-invasive fault injection
- Adapting to the Unknown: Training-Free Audio-Visual Event Perception with Dynamic Thresholds
- HyperNet Fields: Efficiently Training Hypernetworks without Ground Truth by Learning Weight Trajectories
- Millions of Matrix-Multiplications: GEMM Variations on Aurora
- Your Scale Factors are My Weapon: Targeted Bit-Flip Attacks on Vision Transformers via Scale Factor Manipulation
- GaussianWorld: Gaussian World Model for Streaming 3D Occupancy Prediction
- Lost in Translation, Found in Context: Sign Language Translation with Contextual Cues
- Simulator HC: Regression-based Online Simulation of Starting Problem-Solution Pairs for Homotopy Continuation in Geometric Vision
- Simplified Power Semiconductor Loss Evaluation With SPICE Models in PLECS
- Towards Tactile Communication of English Language: A Visual Handbook Enhances Letter Learning
- FAM Diffusion: Frequency and Attention Modulation for High-Resolution Image Generation with Stable Diffusion
- VisionPAD: A Vision-Centric Pre-training Paradigm for Autonomous Driving
- Experimental Analysis of Multipath Characteristics in Indoor Distributed Massive MIMO Channels
- Breaking the Low-Rank Dilemma of Linear Attention
- An Information-Theoretic Framework for Out-of-Distribution Generalization with Applications to Stochastic Gradient Langevin Dynamics
- PIT: A Plug-and-Play Image Translator for Making Off-the-Shelf Models Adapt to Corruptions
- Expert System in the Construction of Personalized Football Training Program Model
- ManipTrans: Efficient Dexterous Bimanual Manipulation Transfer via Residual Learning
- A Novel Deep Learning Approach for Automatic Indian Classical Dance Style Classification
- CraftsMan3D: High-fidelity Mesh Generation with 3D Native Diffusion and Interactive Geometry Refiner
- Performance Comparison of Scalar and Direct Vector Control For Five-Phase Induction Motor Drive
- Rethinking Reconstruction and Denoising in the Dark: New Perspective, General Architecture and Beyond
- Heuristic Methods for Checking the Normality of Measurement Data with Graphical and Numerical Tests
- Data-Based Adaptive Asymptotic Tracking Control for High-Speed Train: A Feedback Linearization Approach
- Intelligent Coordination System for Autonomous Domestic Heating: An AI-Driven Test-Bench
- Building Vision Models upon Heat Conduction
- Biomechanical Effects of Arm Endpoint Stiffness During Three-Dimensional Isometric Force Maintenance
- Open-Canopy: Towards Very High Resolution Forest Monitoring
- TreeMeshGPT: Artistic Mesh Generation with Autoregressive Tree Sequencing
- A Reduced Switch Series Topology 15-level and 27level Multilevel Inverter
- Enhancing Network Security with Intrusion Detection Systems in IoT Devices
- CUPIDO: An Analog Ultra-Low-Power and Contactless Eye Blink Detector for Smart Glasses
- Multi-party Collaborative Attention Control for Image Customization
- MC 2 : Multi-concept Guidance for Customized Multi-concept Generation
- Just Dance with π! A Poly-modal Inductor for Weakly-supervised Video Anomaly Detection
- M3-RAG: Unified Multimodal and Multilingual Retrieval-Augmented Generation
- Seed Sources for Spectral Beam Combining via Brillouin De-interleaving of 1 μm Frequency Combs
- RICCARDO: Radar Hit Prediction and Convolution for Camera-Radar 3D Object Detection
- DALNN: Dual Attention Learning Neural Network for Diagnosis of Autism Spectrum Disorders and Exploration of Lesion Brain Regions
- A Metaheuristic-Enhanced Optimization Approach for Facility Location in the Wood Supply Chain
- DELTA: Directional-Aware Encoding and Local Transformer for Thangka Style Transfer
- Conceptualization and Validation of a Novel Power Electronics Transformer without High-frequency AC Link
- SAM-REF: Introducing Image-Prompt Synergy during Interaction for Detail Enhancement in the Segment Anything Model
- Crash Course on Quantum Computing for Engineering Students
- MUST: The First Dataset and Unified Framework for Multispectral UAV Single Object Tracking