- Removing Reflections from RAW Photos
- HiFi-Portrait: Zero-shot Identity-preserved Portrait Generation with High-fidelity Multi-face Fusion
- Design Method of Distributed Decoupling Capacitors for Both Voltage Overshoot Suppression and Dynamic Current Sharing in SiC MOSFET Power Module
- Towards Better Alignment: Training Diffusion Models with Reinforcement Learning Against Sparse Rewards
- CATANet: Efficient Content-Aware Token Aggregation for Lightweight Image Super-Resolution
- UVGS: Reimagining Unstructured 3D Gaussian Splatting using UV Mapping
- HybridGS: Decoupling Transients and Statics with 2D and 3D Gaussian Splatting
- A Regularization-Guided Equivariant Approach for Image Restoration
- EventGPT: Event Stream Understanding with Multimodal Large Language Models
- Harvesting Energy from Subclavian Artery Motion for Self-Powered Implantable Medical Devices
- AToM: Aligning Text-to-Motion Model at Event-Level with GPT-4Vision Reward
- Parkinson’s Disease Detection Using Multi-Scale Frequency-Sharing Channel Attention Network With Smartwatch Movement Recordings
- Do Visual Imaginations Improve Vision-and-Language Navigation Agents?
- Teaching Large Language Models to Regress Accurate Image Quality Scores Using Score Distribution
- A Novel Kernel-Based Hilbert Space Framework for Predictive Modeling of lncRNA-miRNA-Disease Interaction Networks
- Improving mapping of convolutional neural networks on FPGAs through tailored macro sizes
- Rethinking Decoder Design: Improving Biomarker Segmentation Using Depth-to-Space Restoration and Residual Linear Attention
- Image Generation Diversity Issues and How to Tame Them
- GauCho: Gaussian Distributions with Cholesky Decomposition for Oriented Object Detection
- Embedding Generative AI into Products – 10 Design Principles for Building Intelligent Systems
- Can Machines Understand Composition? Dataset and Benchmark for Photographic Image Composition Embedding and Understanding
- MoVE-KD: Knowledge Distillation for VLMs with Mixture of Visual Encoders
- SimVS: Simulating World Inconsistencies for Robust View Synthesis
- Video-3D LLM: Learning Position-Aware Video Representation for 3D Scene Understanding
- Investigating CNN Models Efficacy in Spotting Lung Conditions using X-Ray Images
- SDGOCC: Semantic and Depth-Guided Bird’s-Eye View Transformation for 3D Multimodal Occupancy Prediction
- Show and Segment: Universal Medical Image Segmentation via In-Context Learning
- MambaVision: A Hybrid Mamba-Transformer Vision Backbone
- Optimus-2: Multimodal Minecraft Agent with Goal-Observation-Action Conditioned Policy
- Efficient Dynamic Scene Editing via 4D Gaussian-based Static-Dynamic Separation
- Towards Automated Certification Framework of Composite Systems: A SWRL-Based Approach
- VinTAGe: Joint Video and Text Conditioning for Holistic Audio Generation
- VisionUnite: a Vision-Language Foundation Model for Ophthalmology Enhanced with Clinical Knowledge
- POMP: Physics-constrainable Motion Generative Model through Phase Manifolds
- FIRE: Robust Detection of Diffusion-Generated Images via Frequency-Guided Reconstruction Error
- STING-BEE: Towards Vision-Language Model for Real-World X-ray Baggage Security Inspection
- Supervising Sound Localization by In-the-wild Egomotion
- Assessing the Impact of Industrial Energy Reductions on Electric Truck Adoption
- HoGS: Unified Near and Far Object Reconstruction via Homogeneous Gaussian Splatting
- Double Pancake Spiral Coil based Wireless Power Transfer System for EV Charging
- CARE Transformer: Mobile-Friendly Linear Visual Transformer via Decoupled Dual Interaction
- Simple Derivation of Approximate Crosstalk Expressions for Multicore Fibers With Core-Dependent Loss
- Ultra-Efficient Three-Phase Integrated-Active-Filter Isolated Rectifier for AI Data Center Applications
- AI-Enhanced Detection of Dynamic Structural Changes in Inflammatory Protein Interfaces: A Case Study of CD11b/Mac-1 Interactions
- Channel-wise Noise Scheduled Diffusion for Inverse Rendering in Indoor Scenes
- Volt-VAR Control of Grid Connected PV Inverters to Increase PV Penetration in CIGRE European LV Distribution Network
- Advanced Preventive External Lightning Protection System to Mitigate Lightning Fire in Indonesia Oil Refinery
- Towards a Predictive Model to Forecast Competency Needs in Organizations: Integrating Competency Cluster Associations and Their Future Relevance
- Collaborative Forecasting with Reinforcement Learning to Enhance Resilience
- RainyGS: Efficient Rain Synthesis with Physically-Based Gaussian Splatting
- Noise Attenuation Performance Improvement of Active EMI Filter based on Impedance Mismatch
- SUM Parts: Benchmarking Part-Level Semantic Segmentation of Urban Meshes
- Impact of Image Resolution on Controlling Drones Using Remote VR Headset Visualization and a Cloud Architecture
- One-for-More: Continual Diffusion Model for Anomaly Detection
- TinyFusion: Diffusion Transformers Learned Shallow
- Black Start Strategy for Modern Power Systems Using Inverter-Based Resources
- Learning Visual Generative Priors without Text
- The Standardization Framework of Product Traceability and Process Performance Monitoring in Interoperable Agroindustry Systems
- Diff2Flow: Training Flow Matching Models via Diffusion Model Alignment
- Optimal Modulation of Penta-Phase Shifted Multi-Active-Bridge for EV Charging
- Optical Intelligent Reflecting Surface Enhanced Secure Communications in NOMA-based VLC
- SCFlow2: Plug-and-Play Object Pose Refiner with Shape-Constraint Scene Flow
- Empirical Design of a Robotic Arm Control System based on Flex Sensors with Artificial Intelligence (AI) Association
- A Real Time Data Acquisition of Photovoltaic Solar Panel Monitoring System based on Internet of Things using Arduino UNO
- Profile Least Squares Estimation in Networks with Covariates
- Enhancing Time-Domain Shielding Effectiveness of Cables Using Metal-Coated Aramid-Fiber Composites
- SALOVA: Segment-Augmented Long Video Assistant for Targeted Retrieval and Routing in Long-Form Video Analysis
- Liquid Crystal Mimics Your Heart: A Physical Spoofing Attack against PPG-based Systems
- SceneTAP: Scene-Coherent Typographic Adversarial Planner against Vision-Language Models in Real-World Environments
- AMSnet 2.0: A Large AMS Database with AI Segmentation for Net Detection
- Analysis of higher-order Lotka-Volterra models: Application of S-tensors and the polynomial complementarity problem
- Optimized Credit Card Fraud Detection using Stacking and Grid Search Techniques for Enhanced Anomaly Detection
- COMPGS: Unleashing 2D Compositionality for Compositional Text-to-3D via Dynamically Optimizing 3D Gaussians
- Ges3ViG: Incorporating Pointing Gestures into Language-Based 3D Visual Grounding for Embodied Reference Understanding
- OmniSplat: Taming Feed-Forward 3D Gaussian Splatting for Omnidirectional Images with Editable Capabilities
- On Denoising Walking Videos for Gait Recognition
- Efficient Test-Time Adaptive Object Detection via Sensitivity-Guided Pruning
- IncEventGS: Pose-Free Gaussian Splatting from a Single Event Camera
- FASTer: Focal Token Acquiring-and-Scaling Transformer for Long-term 3D Object Detection
- Research on Fault Diagnosis of Track Circuit Based on Optimized Variational Mode Decomposition
- From Sparse to Dense: Camera Relocalization with Scene-Specific Detector from Feature Gaussian Splatting
- EDM: Equirectangular Projection-Oriented Dense Kernelized Feature Matching
- Fragment-Driven Progressive Alternating Diffusion for De Novo Molecular Design
- Virtual Sensing Model for Power Prediction of Satellite Solar Array: J₂ Perturbation Dynamics Resolution via Dual-Coordinate Transformation and Adaptive Runge-Kutta
- Audio-Visual Semantic Graph Network for Audio-Visual Event Localization
- Text Embedding is Not All You Need: Attention Control for Text-to-Image Semantic Alignment with Text Self-Attention Maps
- Cross-Sector, Efficient, Trusted Data Sharing in Dataspaces
- Convex Relaxation for Robust Vanishing Point Estimation in Manhattan World
- Text-guided Sparse Voxel Pruning for Efficient 3D Visual Grounding
- The Scene Language: Representing Scenes with Programs, Words, and Embeddings
- Multimodal Meta-Learning for Early Rumor Detection Based on Few-Shot Learning
- A Switchable Transmissive-Reflective Metasurface Unit for Full-Space Continuous Phase Modulation
- Comprehensive Analysis and Wide Range Operation of ZVS and Quasi-ZPA in Wireless Power Transfer System
- Text-Driven Fashion Image Editing with Compositional Concept Learning and Counterfactual Abduction
- Noise Calibration and Spatial-Frequency Interactive Network for STEM Image Enhancement
- Underwater Image Recovery Using Low-Frequency Filtering and Polarization Imaging Modeling
- Investigation of Stability Challenges in MEA Onboard DC Microgrids using MTPA based Direct Torque Control
- Benchmarking Sustainability Assessment Tools for SMEs: An AHP–TOPSIS Framework and a Vision–Execution Quadrant Approach
- WonderWorld: Interactive 3D Scene Generation from a Single Image
- Servitization in the B2B Manufacturing Context: A Practice-Based Research Agenda