- Accurate Scene Text Recognition with Efficient Model Scaling and Cloze Self-Distillation
- Leveraging SD Map to Augment HD Map-based Trajectory Prediction
- ONDA-Pose: Occlusion-Aware Neural Domain Adaptation for Self-Supervised 6D Object Pose Estimation
- Design of a GaN-based Series Resonant Dual Active Bridge DC-DC converter for EV Charging Application
- Application of Data-Driven Method in Fault Prediction of Intelligent Operation and Maintenance System of Photovoltaic Power Station
- A Physics-Informed Blur Learning Framework for Imaging Systems
- Show and Tell: Visually Explainable Deep Neural Nets via Spatially-Aware Concept Bottleneck Models
- UNICL-SAM: Uncertainty-Driven In-Context Segmentation with Part Prototype Discovery
- A Design Methodology for a Partial Power PSFB DC-DC Converter for Battery Charging
- BIGS: Bimanual Category-agnostic Interaction Reconstruction from Monocular Videos via 3D Gaussian Splatting
- Real Time RFID-GPS Integrated Package Tracking and Monitoring System
- Classification Prediction Model of Students' College English Scores Based on Online Course Learning Data
- Empowering Communication: A Word-to-GIF Approach for Sign Language Accessibility
- GeoMM: On Geodesic Perspective for Multi-modal Learning
- Bias for Action: Video Implicit Neural Representations with Bias Modulation
- Digital Transformation in Small and Medium Size Enterprises in Germany - A Use Case. Digital Twins for Virtuel Commissioning
- Complex Valued Linear Discriminant Analysis on mmWave Radar Face Signatures for Task-Oriented Semantic Communication
- DA-VPT: Semantic-Guided Visual Prompt Tuning for Vision Transformers
- Overview of NURBS curve technology
- Mimic In-Context Learning for Multimodal Tasks
- Integrated multi-format microwave signal generator using thin-film lithium niobite Mach-Zehnder modulator
- Deciphering the Critical Role of Doping on P-Type Ohmic Contact Formation in MoS 2 FETs
- Cross Platform Lightweight and Efficient Digital Twin Model Interaction Component for Equipment Inventory Management
- T 2 SG: Traffic Topology Scene Graph for Topology Reasoning in Autonomous Driving
- An Efficient Hybrid Algorithm Combining Skeletonization MoM-PO and EDM for Solving Electromagnetic Radiation of Large-Scale Targets
- SIR-DIFF: Sparse Image Sets Restoration with Multi-View Diffusion Model
- Touch2Shape: Touch-Conditioned 3D Diffusion for Shape Exploration and Reconstruction
- LP-Diff: Towards Improved Restoration of Real-World Degraded License Plate
- FluidNexus: 3D Fluid Reconstruction and Prediction from a Single Video
- Effortless Active Labeling for Long-Term Test-Time Adaptation
- Tooth Instance Segmentation in CBCT Images Using Watershed Algorithm and nnUNet
- Non-intrusive load monitoring using two-point sensors for load measurement, identification and localization
- Research on a Short-Term Traffic Flow Prediction Model Using Enhanced Genetic Algorithms and Neural Network Collaboration
- Associative Transformer
- Celebrating the 30th Anniversary of WiC [Member Activities]
- Flexible DC grid protection scheme based on fitting slope magnitude
- Resilient Voltage Restoration Scheme for AC Microgrids Under Cyber Attack
- A Planar Transformer Winding Configuration for High Frequency DAB Converter with Common-Mode EMI Mitigation
- VideoICL: Confidence-based Iterative In-context Learning for Out-of-Distribution Video Understanding
- An Advanced Zero-Error Continuous Control Set Model Predictive Controller for Low Voltage Ride Through of Grid-Connected Power Converters
- EEG-Driven Machine Learning for Stroke Detection in High-Risk Patients
- Multiport Fast-charging Station Architecture based on Hexverter with High-frequency Isolation
- Methane Release Rate Estimation using Model-based Gas Tomography
- RoboBrain: A Unified Brain Model for Robotic Manipulation from Abstract to Concrete
- High-Performance Surface Plasmon Resonance Sensor for Pathogenic Cancer Detection Using Ag/Si/EuS Layers
- OmniMMI: A Comprehensive Multi-modal Interaction Benchmark in Streaming Video Contexts
- Failure Detection in De-energized GaN-HEMT Switching Cells using Gate Driver-Induced Residual Voltage
- A ResNet-Based Classification Method for Ship Viewpoint Estimation
- PhyS-EdiT: Physics-aware Semantic Image Editing with Text Description
- Adaptive Resource Allocation in Cloud Computing Using Advanced AI Techniques
- Data-Free Group-Wise Fully Quantized Winograd Convolution via Learnable Scales
- PFLIO-SAM: Tightly-coupled Polarization Camera/Optical Flow/LiDAR/IMU Odometry via Smoothing and Mapping
- Across-Array LDPC Codes Design for Resistive Random-Access Memories
- Weakly Supervised Semantic Segmentation via Progressive Confidence Region Expansion
- Learning-enabled Polynomial Lyapunov Function Synthesis via High-Accuracy Counterexample-Guided Framework
- Bilateral Tensor Ring Decomposition for Thick Cloud Removal in Multitemporal Remote Sensing Images
- Recurrent Feature Mining and Keypoint Mixup Padding for Category-Agnostic Pose Estimation
- Layer-Wise Dual Attention Network with Adaptive Feature Fusion for Apple Disease Classification
- RIS-enhanced Semantic-aware Sensing, Communication, Computation and Control for Internet of Things
- Is In-Context Learning Feasible for HPC Performance Autotuning?
- RAAP-CGRA: Placement for CGRAs with Restricted Routing Architectures
- BrepGiff: Lightweight Generation of Complex B-rep with 3D GAT Diffusion
- Investigations on Achieving Zero Energy Hill-Hold Condition in an Electric Vehicle with a Doubly Salient Parallel Path Magnetic Motor
- Finer-CAM: Spotting the Difference Reveals Finer Details for Visual Explanation
- MeshGen: Generating PBR Textured Mesh with Render-Enhanced Auto-Encoder and Generative Data Augmentation
- Fundamental limits via CRB of semi-blind channel estimation in Massive MIMO systems
- Low Resource Passive Acoustic Vessel Detectors: Performance and System Design for Challenging Acoustic Environments
- Transmit Beamforming Self-Interference Cancellation Design and Experiment for STAR Phased Array
- Multi-Sensor Environmental Monitoring System for Smart Health Care
- Domain Generalization in CLIP via Learning with Diverse Text Prompts
- Q-Eval-100K: Evaluating Visual Quality and Alignment Level for Text-to-Vision Content
- Infinity∞: Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis
- AnyMoLe: Any Character Motion In-betweening Leveraging Video Diffusion Models
- Multi-view Reconstruction via SfM-guided Monocular Depth Estimation
- Learning Causal Structure Distributions for Robust Planning
- CheckManual: A New Challenge and Benchmark for Manual-based Appliance Manipulation
- Enhanced OoD Detection through Cross-Modal Alignment of Multi-Modal Representations
- EmoDubber: Towards High Quality and Emotion Controllable Movie Dubbing
- MoGe: Unlocking Accurate Monocular Geometry Estimation for Open-Domain Images with Optimal Training Supervision
- Hierarchical Fusion Estimation With Feedback for Clustered Sensor Networks Subject to Leader and Subordinate Sensors
- A Unified Framework for Heterogeneous Semi-supervised Learning
- DiffPortrait360: Consistent Portrait Diffusion for 360 View Synthesis
- Corrections to “Quantum Dot DBR Lasers Monolithically Integrated on Silicon Photonics by In-Pocket Heteroepitaxy”
- Boosting Adversarial Transferability through Augmentation in Hypothesis Space
- Zero-Shot Novel View and Depth Synthesis with Multi-View Geometric Diffusion
- Identity-Clothing Similarity Modeling for Unsupervised Clothing Change Person Re-Identification
- CryptoFace: End-To-End Encrypted Face Recognition
- Adaptive Fault Tolerance Mechanisms of Large Language Models in Cloud Computing Environments
- Integrated Synchronous Machine Emulation in Enhanced Droop Control for Grid-Forming Inverter-Based PV Plant Management
- Gradient-Guided Annealing for Domain Generalization
- Golden Cudgel Network for Real-Time Semantic Segmentation
- DesignDiffusion: High-Quality Text-to-Design Image Generation with Diffusion Models
- Leveraging Temporal Cues for Semi-Supervised Multi-View 3D Object Detection
- TimeTracker: Event-based Continuous Point Tracking for Video Frame Interpolation with Non-linear Motion
- ODE: Open-Set Evaluation of Hallucinations in Multimodal Large Language Models
- Improving Editability in Image Generation with Layer-wise Memory
- Adaptive Lightweight Framework for the Brain MRI Segmentation Task
- Forecasting the Unpredictable: Deep Neural Networks in the Volatile World of Cryptocurrencies
- Real Time Object Recognition with Voice Guided Navigation for Visually Impaired using OpenCV
- Power Meter Architecture with AC/DC TMR Current Sensors for Smart Grid Applications