- TAGA: Self-supervised Learning for Template-free Animatable Gaussian Articulated Model
- STOP: Integrated Spatial-Temporal Dynamic Prompting for Video Understanding
- Conductive Noise Modeling using GA Parameter Fitting and Effective Validation of Noise Reduction Filter
- Text Augmented Correlation Transformer For Few-shot Classification & Segmentation
- A 27-nW Wake-Up Receiver With a Quartz Transformer Matching Network Achieving −71.9-dBm Sensitivity and −46-dB SIR at 0.8% Offset
- Understanding Multi-Task Activities from Single-Task Videos
- StyleStudio: Text-Driven Style Transfer with Selective Control of Style Elements
- LibraryX-ASIC: A First Look
- TLPipe: An Efficient Two-Level Optimization Pipeline Model Parallelism for Large Model Training
- Feat2GS: Probing Visual Foundation Models with Gaussian Splatting
- Observability and Incident Response in Managed Serverless Environments Using Ontology-Based Log Monitoring
- Automatic Spectral Calibration of Hyperspectral Images: Method, Dataset and Benchmark
- Design and Analysis of Phase Locked Loop VLSI IC for Carrier Synchronization in Wireless Communications using 45nm Technology
- Closed-loop feedback scheme for multi-channel fiber optic current sensor based on time-delay staircase waveform
- Omnidirectional Multi-Object Tracking
- Tartan IMU: A Light Foundation Model for Inertial Positioning in Robotics
- SuperLightNet: Lightweight Parameter Aggregation Network for Multimodal Brain Tumor Segmentation
- Layered Image Vectorization via Semantic Simplification
- Two by Two: Learning Multi-Task Pairwise Objects Assembly for Generalizable Robot Manipulation
- A Systematic Literature Review of Innovations, Challenges, and Future Directions in Telemonitoring and Wearable Health Technologies
- A Robust and Efficient 13-Level Common-Ground SCMLI with Sextuple Voltage Gain and Fault-Tolerant Design for Sustainable Power Applications
- Research on Hybrid Intelligent Decision-Making and Pattern Recognition Methods in Autonomous Driving Systems
- VBSF: Vulnerability Behavior Scanning Framework for Intelligent Autonomous Transport Systems
- DualPM: Dual Posed-Canonical Point Maps for 3D Shape and Pose Reconstruction
- Subnet-Aware Dynamic Supernet Training for Neural Architecture Search
- Gameplay with a Socially Supportive Virtual Robot Enhances Children's Global Self-Esteem, Peer Relationships, Interest and Engagement
- Period-LLM: Extending the Periodic Capability of Multimodal Large Language Model
- Efficient Decoupled Feature 3D Gaussian Splatting via Hierarchical Compression
- Practical solutions to the relative pose of three calibrated cameras
- PS-EIP: Robust Photometric Stereo Based on Event Interval Profile
- NeISF++: Neural Incident Stokes Field for Polarized Inverse Rendering of Conductors and Dielectrics
- Enhancing Physical Layer Security in Cognitive Radio-Enabled NTNs with Beyond Diagonal RIS
- Early Diagnosis of Pancreatic Cancer Using CA 19-9 and LRG-1 Proteins Through Surface Enhanced Raman Spectroscopy
- Integration of Machine Learning Models to Predict and Prevent Security Breaches in IOD
- Efficient Long Video Tokenization via Coordinate-based Patch Reconstruction
- LSceneLLM: Enhancing Large 3D Scene Understanding Using Adaptive Visual Preferences
- Minimal Interaction Separated Tuning: A New Paradigm for Visual Adaptation
- Entity Recognition for Power Equipment Data Based on Optional Word Vectors and Feature Fusion
- MPDrive: Improving Spatial Understanding with Marker-Based Prompt Learning for Autonomous Driving
- On-Device Self-Supervised Learning of Low-Latency Monocular Depth from Only Events
- Change3D: Revisiting Change Detection and Captioning from A Video Modeling Perspective
- SASep: Saliency-Aware Structured Separation of Geometry and Feature for Open Set Learning on Point Clouds
- Enhancing Vertical Handover Provisioning of Heterogeneous Network in Marine Pasture
- Effects of Electron Irradiation and Thermal Cycling on Electrical Properties of SiC MOSFET
- The Dynamics of Laser-Driven Ionisation in High-Voltage Circuit Switching
- Adapting Text-to-Image Generation with Feature Difference Instruction for Generic Image Restoration
- LoRA Recycle: Unlocking Tuning-Free Few-Shot Adaptability in Visual Foundation Models by Recycling Pre-Tuned LoRAs
- An AC–DC Bridgeless Step-Down PFC Converter With Elimination of Dead Zones
- LEDiff: Latent Exposure Diffusion for HDR Generation
- Protein database search using Processing-in-Memory architecture
- AMR-Transformer: Enabling Efficient Long-range Interaction for Complex Neural Fluid Simulation
- ConceptGuard: Continual Personalized Text-to-Image Generation with Forgetting and Confusion Mitigation
- A Contrast-Source Inversion-Assisted Attention-Unet for Microwave Imaging
- Towards Conceptual Framework for Analysing the Impact of Artificial Intelligence on Fresh-Food-Produce Wholesalers' Transport Logistics Performance
- Continuous Integration, Deployment and Validation: Supporting Scalable Industrial Ecosystems
- Gaussian Splashing: Unified Particles for Versatile Motion Synthesis and Rendering
- Event Ellipsometer: Event-based Mueller-Matrix Video Imaging
- Online Task-Free Continual Learning via Dynamic Expansionable Memory Distribution
- Efficient Event-Based Object Detection: A Hybrid Neural Network with Spatial and Temporal Attention
- Queueing Delay Minimization in Overloaded Networks via Rate Control
- Mosaic of Modalities: A Comprehensive Benchmark for Multimodal Graph Learning
- A Property-Aware Framework for Molecular Property Prediction Via Substructure Contrastive Learning
- A Novel Variable-Level ANPC Inverter with Capacitor Voltage Reconfiguration Method for 2 kV Photovoltaic Applications
- Multi-Objective Tertiary Layer Optimization for DC Microgrids
- Amplified OFF State Voltage Stress across SiC MOSFETs of 4-Quadrant Switch
- DEFOM-Stereo: Depth Foundation Model Based Stereo Matching
- TriTex: Learning Texture from a Single Mesh via Triplane Semantic Features
- CamPoint: Boosting Point Cloud Segmentation with Virtual Camera
- Effect of the read-out electronics on QCM-D measurements
- Toward Efficient Power Scene Detection via Topology-Preserved Knowledge Distillation
- Automated Assembly of Magnetic Soft Microrobots with Chopstick-Like Two-Fingered Microhand
- Integrated Sensing and Backscatter Communication with Movable Antennas: State-of-the-Art Survey and A Novel Inverse Scattering Framework
- Generalizable Object Keypoint Localization from Generative Priors
- Enhancing Virtual Try-On with Synthetic Pairs and Error-Aware Noise Scheduling
- Flash3D: Super-scaling Point Transformers through Joint Hardware-Geometry Locality
- Enhancing Video-LLM Reasoning via Agent-of-Thoughts Distillation
- The Photographer’s Eye: Teaching Multimodal Large Language Models to See and Critique like Photographers
- Every SAM Drop Counts: Embracing Semantic Priors for Multi-Modality Image Fusion and Beyond
- CARL: A Framework for Equivariant Image Registration
- Limit Cycle-Based Artificial Fields for Obstacle Avoidance in Robot Path Planning
- A 70–160-GHz Ultrawideband Amplifier Utilizing Reverse-Side Grounded Balun With Balance Compensation in 130-nm SiGe Process
- VidMuse: A Simple Video-to-Music Generation Framework with Long-Short-Term Modeling
- A Compressed QUBO Format for Traveling Salesperson Problems
- Unbiased Video Scene Graph Generation via Visual and Semantic Dual Debiasing
- A Hybrid MobileNet-LSTM Model for Enhanced Detection of Deepfake Media in Real-Time Image and Video Analysis
- Constrained Stochastic Recursive Momentum Successive Convex Approximation
- TSC-FL: Communication-Efficient Federated Learning Based on Three-Stage Compression Mechanism for Internet of Vehicles
- Efficient Attention in Partially Relevant Video Retrieval: A Benchmarking Study on Accuracy-Efficiency Trade-Offs
- LSNet: See Large, Focus Small
- Unsupervised 3D Object Detection by Commonsense Clue
- Chat2SVG: Vector Graphics Generation with Large Language Models and Image Diffusion Models
- R-SCoRe: Revisiting Scene Coordinate Regression for Robust Large-Scale Visual Localization
- Leveraging Perturbation Robustness to Enhance Out-of-Distribution Detection
- STPro: Spatial and Temporal Progressive Learning for Weakly Supervised Spatio-Temporal Grounding
- PyTorchGeoNodes: Enabling Differentiable Shape Programs for 3D Shape Reconstruction
- Instant Gaussian Stream: Fast and Generalizable Streaming of Dynamic Scene Reconstruction via Gaussian Splatting
- When Domain Generalization meets Generalized Category Discovery: An Adaptive Task-Arithmetic Driven Approach
- Cache-Aware Transformer-Based Scheduling for LLM-Driven IoT Workflows in Multi-Clouds
- STEP: Enhancing Video-LLMs’ Compositional Reasoning by Spatio-Temporal Graph-guided Self-Training
- Robust HV JFET With Split-PBL and Resistive Field Plate for Flexible VP Design and Enhanced Current Capability