- Periodic Event-Triggered Output-Feedback Control of Stochastic Nonlinear Systems With Flexible Tracking Performance
- Flash-Split: 2D Reflection Removal with Flash Cues and Latent Diffusion Separation
- When the Future Becomes the Past: Taming Temporal Correspondence for Self-supervised Video Representation Learning
- Design And Validation Of Low Power Low Area High Speed Mixed Logic 4 Bit Comparator
- Nanofabrication of SiNW Sensors with Au and Ag
- VODiff: Controlling Object Visibility Order in Text-to-Image Generation
- Towards a Unified Charging Infrastructure: Integrating Conductive and Wireless Charging Methods
- Improving Visual and Downstream Performance of Low-Light Enhancer with Vision Foundation Models Collaboration
- Evaluating the Use of NORM Residues in Building Restoration: A Risk Assessment Approach Using ERICA and NORMALYSA
- Video-Guided Foley Sound Generation with Multimodal Controls
- SAM-I2V: Upgrading SAM to Support Promptable Video Segmentation with Less than 0.2% Training Cost
- Temperature-adaptive Smart Electric Vehicle Fast Charging System
- FRESA: Feedforward Reconstruction of Personalized Skinned Avatars from Few Images
- Human Motion Instruction Tuning
- Enhanced Doa Estimation for Lightning Sources Using Music and Coherent Signal Subspace Method
- Geometry-guided Online 3D Video Synthesis with Multi-View Temporal Consistency
- Enduring, Efficient and Robust Trajectory Prediction Attack in Autonomous Driving via Optimization-Driven Multi-Frame Perturbation Framework
- STCOcc: Sparse Spatial-Temporal Cascade Renovation for 3D Occupancy and Scene Flow Prediction
- Heterogeneity-Aware Client Scheduling for Split Federated Learning in Resource-Scarce Networks
- Rectification-specific Supervision and Constrained Estimator for Online Stereo Rectification
- Targeted Forgetting of Image Subgroups in CLIP Models
- Frequency Dynamic Convolution for Dense Image Prediction
- Pos3R: 6D Pose Estimation for Unseen Objects Made Easy
- Apply Hierarchical-Chain-of-Generation to Complex Attributes Text-to-3D Generation
- EchoTraffic: Enhancing Traffic Anomaly Understanding with Audio-Visual Insights
- SnapGen: Taming High-Resolution Text-To-Image Models for Mobile Devices with Efficient Architectures and Training
- EffiDec3D: An Optimized Decoder for High-Performance and Efficient 3D Medical Image Segmentation
- An FPGA-Accelerated Framework for Optimizing Decision Tree Ensembles in Supervised Learning
- Development of a Hybrid Experimental Environment using PHIL for Multi-Unit Power Converter Networks
- A Theory of Learning Unified Model via Knowledge Integration from Label Space Varying Domains
- X-Dyna: Expressive Dynamic Human Image Animation
- Joint Optimization of Neural Radiance Fields and Continuous Camera Motion from a Monocular Video
- Prometheus: 3D-Aware Latent Diffusion Models for Feed-Forward Text-to-3D Scene Generation
- Disentangling Safe and Unsafe Image Corruptions via Anisotropy and Locality
- Vid2Sim: Generalizable, Video-based Reconstruction of Appearance, Geometry and Physics for Mesh-free Simulation
- GIVEPose: Gradual Intra-class Variation Elimination for RGB-based Category-Level Object Pose Estimation
- Robust Multi-Object 4D Generation for In-the-wild Videos
- Three/Single-Phase Switchable DAB Matrix Converter and Active Power Decoupling Method with Center-Tapped Transformer
- MoACNN-XGNet: Interpretable Multi-Omics Convolutional Network for Breast Cancer Subtyping and Prognostic Genes Identification
- Pippo: High-Resolution Multi-View Humans from a Single Image
- LightLoc: Learning Outdoor LiDAR Localization at Light Speed
- StickMotion : Generating 3D Human Motions by Drawing a Stickman
- FeSATLock: An Energy Efficient and SAT Attack Resilient Logic Locking Design with FeFET LUT Architecture for Enhanced Hardware Security
- Thermal Management and Electronic Packaging of a 3.3 kW High Frequency On-Board Charger for EV Applications
- Applying Monolithic to Microservices Strategy for Elastic Container Deployment for AI Applications
- AG-VPReID: A Challenging Large-Scale Benchmark for Aerial-Ground Video-based Person Re-Identification
- Arc2Avatar: Generating Expressive 3D Avatars from a Single Image via ID Guidance
- Recognition-Synergistic Scene Text Editing
- Track Any Anomalous Object: A Granular Video Anomaly Detection Pipeline
- Training-free Neural Architecture Search through Variance of Knowledge of Deep Network Weights
- RaNT-Graph: A Scalable Approach to Sampling Billions of Walks or Paths from Weighted Graphs
- Wearable Sensors and Systems for Personalized Healthcare Monitoring
- Binarized Neural Network for Multi-spectral Image Fusion
- Collaborative Framework for Life Cycle Sustainability & Circularity Assessment and Advisory: An Automotive Electronics use Case
- Immune: Improving Safety Against Jailbreaks in Multi-modal LLMs via Inference-Time Alignment
- Advanced Double Discontinuous Pulse Width Modulated Three-Phase Power Converter
- Scene-agnostic Pose Regression for Visual Localization
- A Label-Free and Non-Monotonic Metric for Evaluating Denoising in Event Cameras
- SEFR: A Mashup Recommendation Approach for Crossover Service Convergence
- Enhancing Graph Transformer Training through Adaptive Graph Parallelism
- SMILE-VLM: Self-Supervised Multi-View Representation Learning using Vision-Language Model for 3D/4D Facial Expression Recognition
- Spherical Manifold Guided Diffusion Model for Panoramic Image Generation
- VideoDPO: Omni-Preference Alignment for Video Diffusion Generation
- Any-Resolution AI-Generated Image Detection by Spectral Learning
- Symbolic Representation for Any-to-Any Generative Tasks
- Good, Cheap, and Fast: Overfitted Image Compression with Wasserstein Distortion
- Robust 3D Shape Reconstruction in Zero-Shot from a Single Image in the Wild
- Hand-held Object Reconstruction from RGB Video with Dynamic Interaction
- Language-Guided Audio-Visual Learning for Long-Term Sports Assessment
- Repurposing Stable Diffusion Attention for Training-Free Unsupervised Interactive Segmentation
- Intelligent Human Gait Phase Classification Using Machine Learning Models
- Conformal Prediction for Zero-Shot Models
- RoomTour3D: Geometry-Aware Video-Instruction Tuning for Embodied Navigation
- Improving the Transferability of Adversarial Attacks on Face Recognition with Diverse Parameters Augmentation
- Solar Powered Battery Charger for Light Commercial Electric Vehicles
- Resilient Sensor Fusion under Adverse Sensor Failures via Multi-Modal Expert Fusion
- Research on Power-Computing Coordinated Scheduling Based on Dual Demand Coupling and Multi-Agent Learning
- Language-Assisted Debiasing and Smoothing for Foundation Model-Based Semi-Supervised Learning
- Research on Lightweight Detection Network for Fog Cannon Vehicles Based on Multi Scale Feature Fusion
- BIMBA: Selective-Scan Compression for Long-Range Video Question Answering
- Depth Any Camera: Zero-Shot Metric Depth Estimation from Any Camera
- Updates of Lightning Current Measurements on Shenzhen Meteorological Gradient Tower (2017-2023)
- Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models
- Descriptor-In-Pixel : Point-Feature Tracking for Pixel Processor Arrays
- MultimodalStudio: A Heterogeneous Sensor Dataset and Framework for Neural Rendering across Multiple Imaging Modalities
- Microwave Based Non-Invasive Blood Glucose Sensors: Key Design Parameters and Case-Informed Evaluation
- Following Is All You Need: Robot Crowd Navigation Using People As Planners
- DSTNet: Dynamic Trajectory Prediction for Autonomous Vehicles via Spatio-Temporal Attention
- Assessing the Environmental Impact of IoT Devices - Hotspots and Guidelines for a Better Understanding
- DEThresh: A Hybrid Evolutionary and Threshold Algorithm for Cloud Optimization
- Thermodynamic Modeling of Hashtag Dynamics for Social Media Clustering: A Maxwell-Boltzmann Approach
- A Comprehensive Tutorial and Survey of O-RAN: Exploring Slicing-Aware Architecture, Deployment Options, Use Cases, and Challenges
- M3-RAG: Unified Multimodal and Multilingual Retrieval-Augmented Generation
- RICCARDO: Radar Hit Prediction and Convolution for Camera-Radar 3D Object Detection
- Beamforming in Secure Integrated Sensing and Communication Systems with Antenna Allocation
- Mosaic3D: Foundation Dataset and Model for Open-Vocabulary 3D Segmentation
- Conceptualization and Validation of a Novel Power Electronics Transformer without High-frequency AC Link
- Crash Course on Quantum Computing for Engineering Students
- ASHiTA: Automatic Scene-Grounded HIerarchical Task Analysis
- MUST: The First Dataset and Unified Framework for Multispectral UAV Single Object Tracking