- Modular DAB-based Isolated Bidirectional 1-Stage DC-AC Converter With 3-ϕ/1-ϕ Capability
- Collaborative Bandwidth-Efficient Intra-Node Allreduce
- A Collaborative Framework for Image Enhancement and Feature Extraction Based on the Joint Training Mechanism
- K-LoRA: Unlocking Training-Free Fusion of Any Subject and Style LoRAs
- Consistency-aware Self-Training for Iterative-based Stereo Matching
- Generalized Concentration-Based Performance Guarantees on Sensor Selection for State Estimation
- GAF: Gaussian Avatar Reconstruction from Monocular Videos via Multi-view Diffusion
- PSBD: Prediction Shift Uncertainty Unlocks Backdoor Detection
- Mask-Adapter: The Devil is in the Masks for Open-Vocabulary Segmentation
- A Comprehensive Review of Edge Detection Techniques
- Integrating Neural Networks and Big Data for Public Opinion Monitoring and Decision Support in Social Network
- Integrating Large Language Models (LLMs) and Vector Databases into Healthcare Operations
- Mani-GS: Gaussian Splatting Manipulation with Triangular Mesh
- LLaVA-ST: A Multimodal Large Language Model for Fine-Grained Spatial-Temporal Understanding
- PIDSR: Complementary Polarized Image Demosaicing and Super-Resolution
- EventPSR: Surface Normal and Reflectance Estimation from Photometric Stereo Using an Event Camera
- You See it, You Got it: Learning 3D Creation on Pose-Free Videos at Scale
- DocSAM: Unified Document Image Segmentation via Query Decomposition and Heterogeneous Mixed Learning
- CPath-Omni: A Unified Multimodal Foundation Model for Patch and Whole Slide Image Analysis in Computational Pathology
- MirrorVerse: Pushing Diffusion Models to Realistically Reflect the World
- The Illusion of Unlearning: The Unstable Nature of Machine Unlearning in Text-to-Image Diffusion Models
- NN-Former: Rethinking Graph Structure in Neural Architecture Representation
- Mind the Gap: Confidence Discrepancy Can Guide Federated Semi-Supervised Learning Across Pseudo-Mismatch
- Joint Optimal Allocation of Radio and Computational Resources Aiming at Minimizing Global Average Task Offloading Age for Long-Term Multi-Cell MEC Systems
- Machine Learning-Powered Detection of FASTag Scams: A Proactive Fraud Identification Approach
- Alignment, Mining and Fusion: Representation Alignment with Hard Negative Mining and Selective Knowledge Fusion for Medical Visual Question Answering
- Research on Cucumber Knowledge QA Systems with Integrated Large Language Models
- A Bio-Nano Systems Interconnection Hierarchical Network Model for Targeted Drug Delivery
- Maiden Application of 2DOFPID Controller for IPMSG based Wind Energy Conversion System Integrated with Islanded Microgrid
- Terahertz bandgap modulation of plasmonic crystal waveguide for highly sensitive liquid detection
- Continuous Adverse Weather Removal via Degradation-Aware Distillation
- One Diffusion to Generate Them All
- Performance of Directional Relay in the Presence of Grid-Forming Inverter
- Cross-Modal 3D Representation with Multi-View Images and Point Clouds
- DEIM: DETR with Improved Matching for Fast Convergence
- Zero Current Switching Based Current-Source Inverter With Reduced Switches in IPT Application
- TopoCellGen: Generating Histopathology Cell Topology with a Diffusion Model
- Task-Aware Clustering for Prompting Vision-Language Models
- VL2Lite: Task-Specific Knowledge Distillation from Large Vision-Language Models to Lightweight Networks
- MASt3R-SLAM: Real-Time Dense SLAM with 3D Reconstruction Priors
- Data Synthesis with Diverse Styles for Face Recognition via 3DMM-Guided Diffusion
- LayoutFusion: A Data-Layout-Focused Operator Fusion Framework
- 20 MVA Wind Turbine Power Conversion System with PM Vernier Generators
- PV based Sensors for Smart Street Light Fault Detection and Tracking
- LoTUS: Large-Scale Machine Unlearning with a Taste of Uncertainty
- A Generalized Analytical Gain Model for CLLC Resonant Converter with Asymmetric Parameters
- Massive MIMO Beam ID-Based Positioning Method With Low Earth Orbit Satellite Mega Constellations
- Novel Synthesis Method for Wideband BPF With Additional Insertion Phase Shift and True Time Delay
- GarmentPile: Point-Level Visual Affordance Guided Retrieval and Adaptation for Cluttered Garments Manipulation
- Textured Gaussians for Enhanced 3D Scene Appearance Modeling
- Exploration of LLM Lossless Compression on Scientific Data
- Coupled Electromagnetic and Heat Analysis of a ZnO Disk with the FDTD Method in the 2D Cylindrical Coordinate System
- IM-Portrait: Learning 3D-aware Video Diffusion for Photorealistic Talking Heads from Monocular Videos
- Observations of Rocket Triggered Lightning Discharge in Winter Thunderstorms in Japan Using a Broadband VHF Interferometer
- Multi-Scale Perceptual Learning for Skin Lesion Image Segmentation
- Study of Current Control for High-Speed Motor Drive Systems
- Diabetes Prediction Model Based on SVM Optimized with RF Feature Selection and GWO
- Research on Automatic Extraction of Key Parameters of Lightning Risk Assessment Based on Laser Point Cloud of Transmission Line
- DATAWiSE: A Scalable Big Data Reference Architecture for Smart Building
- Advancing Interference Cancellation for In-Band Full-Duplex In-Home Broadband PLC Systems
- VideoDirector: Precise Video Editing via Text-to-Video Models
- Ref-GS: Directional Factorization for 2D Gaussian Splatting
- SAIST: Segment Any Infrared Small Target Model Guided by Contrastive Language-Image Pretraining
- All-directional Disparity Estimation for Real-world QPD Images
- Reconstructing Humans with a Biomechanically Accurate Skeleton
- Study on the Characteristics of Intense Lightning Activity During a Spring Severe Thunderstorm Process in Southern China
- Perception Tokens Enhance Visual Reasoning in Multimodal Language Models
- CustAny: Customizing Anything from A Single Example
- Enhancing Online Continual Learning with Plug-and-Play State Space Model and Class-Conditional Mixture of Discretization
- Improving LLM-Powered EDA Assistants with RAFT
- Argus: A Compact and Versatile Foundation Model for Vision
- A Closer Look at Time Steps is Worthy of Triple Speed-Up for Diffusion Model Training
- M3amba: Memory Mamba is All You Need for Whole Slide Image Classification
- Characterizing the Influence of Circuit Parasitics and Operating Conditions on a Passive Regenerative Snubber for Phase-Shifted Full-Bridge Converter
- Relative Pose Estimation through Affine Corrections of Monocular Depth Priors
- SphereUFormer: A U-Shaped Transformer for Spherical 360 Perception
- Green Edge Computing Based IoV Dynamic Task Collaborative Strategy
- Vehicle Routing Incorporating Implicit Preferences: An Omnidimensional Human–Algorithm Collaboration Approach
- Any6D: Model-free 6D Pose Estimation of Novel Objects
- Cheb-GR: Rethinking k-nearest neighbor search in Re-ranking for Person Re-identification
- DDIP: Mutual-Regularized Dual Deep Image Prior for Self-Supervised Compressive Spectral Imaging
- AdaMMS: Model Merging for Heterogeneous Multimodal Large Language Models with Unsupervised Coefficient Optimization
- Hardware-Rasterized Ray-Based Gaussian Splatting
- Control of Integrated Magnetic-Based Active Harmonic Filter for Three-Phase Standalone Application
- Development and Deployment of a Genomic Cancer Data Extraction Pipeline on the Cloud
- OSV: One Step is Enough for High-Quality Image to Video Generation
- COSMIC: Clique-Oriented Semantic Multi-space Integration for Robust CLIP Test-Time Adaptation
- Directional Label Diffusion Model for Learning from Noisy Labels
- AdMiT: Adaptive Multi-Source Tuning in Dynamic Environments
- Design2GarmentCode: Turning Design Concepts to Tangible Garments Through Program Synthesis
- Fine-Grained Skin Wound Segmentation Based on Machine Learning with Scribble Annotations
- Combined Model for P-S or S-P Configured Lithium-ion Batteries and Equalization Electronics for Spacecraft
- RayFlow: Instance-Aware Diffusion Acceleration via Adaptive Flow Trajectories
- ImViD: Immersive Volumetric Videos for Enhanced VR Engagement
- Reanimating Images using Neural Representations of Dynamic Stimuli
- Gen-AI in a Bottle: Experiments with LLMs to Generate HPC Kernels
- Assembly of FETI dual operator using CUDA
- SketchFusion: Learning Universal Sketch Features through Fusing Foundation Models
- GoLF-NRT: Integrating Global Context and Local Geometry for Few-Shot View Synthesis *
- A Performance Comparison of Chiller Models for Energy Optimization in Commercial Buildings