- TPU-Gen: LLM-Driven Custom Tensor Processing Unit Generator
- Mitigating Lightning Hazards in Open Mining Areas: A Mobile Lightning Protection System Approach
- OSDFace: One-Step Diffusion Model for Face Restoration
- Named Entity Recognition in Educational Psychology Based on the Bert Model
- Enhancing Safety in Industry 5.0: Human-Computer Collaboration Benefits through a Dataset of Protective Equipment Detection
- SplatAD: Real-Time Lidar and Camera Rendering with 3D Gaussian Splatting for Autonomous Driving
- On the recognition of tremor severity in Parkinson’s disease by means of inertial measurements-based ML algorithm
- A Space Vector PWM based Speed-range Extension Scheme for a Split-phase Machine under OC Fault
- DeClotH: Decomposable 3D Cloth and Human Body Reconstruction from a Single Image
- GroomLight: Hybrid Inverse Rendering for Relightable Human Hair Appearance Modeling
- Research on Cutting Control System of Insulation Layer of Insulation Pipe Based on Particle Swarm Impedance Control
- OccMamba: Semantic Occupancy Prediction with State Space Models
- Sea-Ing in Low-Light
- VLsI: Verbalized Layers-to-Interactions from Large to Small Vision Language Models
- Sintered Silver Based Direct-Cooled IGBTs With High Output Power and Thermal Reliability
- TF-MCRN: A Lightweight Speech Enhancement Algorithm Based on Mel Spectrogram for Speech Recognition
- Magnetic sensor arrays for the detection of 3D displacements of cracks in Structural Health Monitoring
- MobilePortrait: Real-Time One-Shot Neural Head Avatars on Mobile Devices
- Improving Semi-Supervised Semantic Segmentation with Sliced-Wasserstein Feature Alignment and Uniformity
- Active Hyperspectral Imaging Using an Event Camera
- FedBiP: Heterogeneous One-Shot Federated Learning with Personalized Latent Diffusion Models
- Interactive Medical Image Segmentation: A Benchmark Dataset and Baseline
- Semi-Supervised Federated Learning Based Non-IID Jamming Recognition Against Poisoning Attacks
- Memory Efficient WebAssembly Containers
- Tra-MoE: Learning Trajectory Prediction Model from Multiple Domains for Adaptive Policy Conditioning
- Distilled Prompt Learning for Incomplete Multimodal Survival Prediction
- SeriesBench: A Benchmark for Narrative-Driven Drama Series Understanding
- Container Scheduling Strategy Based on Container Clustering and Deep Reinforcement Learning
- AIM-Fair: Advancing Algorithmic Fairness via Selectively Fine-Tuning Biased Models with Contextual Synthetic Data
- Integration of On-board and Wireless Charging for Electric Vehicles with Single Stage Resonant Converter
- Image Quality Assessment: Investigating Causal Perceptual Effects with Abductive Counterfactual Inference
- Learning Phase Distortion with Selective State Space Models for Video Turbulence Mitigation
- Pseudo Visible Feature Fine-Grained Fusion for Thermal Object Detection
- State of Health Estimation of Lithium Batteries Based on Electrochemical Impedance Spectroscopy and Data-Driven Approaches
- Black-box Dynamic Model Identification for a Quadruple-Active-Bridge Converter
- VideoGLaMM : A Large Multimodal Model for Pixel-Level Visual Grounding in Videos
- POPEN: Preference-Based Optimization and Ensemble for LVLM-Based Reasoning Segmentation
- An Improved Compensation Circuit Design for Efficient Wireless Power Transfer Using EF 2 Resonant Inverter
- Optimization Approaches in Wood Supply Chain Management: A Multi-Problem Perspective Based on Sustainability and Circular Economy
- Analysis of Inherent Damping Mechanism and Its Contribution to Stability in DCM Grid-Tied Inverters with LCL Filters
- Dynamic Resource Allocation of Virtualized GPUs and CPUs for Scalable AI Workloads in Containerized Environments
- Motion Modes: What Could Happen Next?
- Fault Feature Extraction and Diagnosis Method for Marine Diesel Engine Based on Time-delay Embedded manifold Learning
- An Error Compensation Method for Parallel Coordinate Measuring Machines Based on Kinematic Model and Kriging Interpolation
- SimLTD: Simple Supervised and Semi-Supervised Long-Tailed Object Detection
- Arbitrary-steps Image Super-resolution via Diffusion Inversion
- Dynamic and Forecast-Based Containers Autoscaling for Kubernetes with Reinforcement Learning
- V 2 Dial: Unification of Video and Visual Dialog via Multimodal Experts
- Study on the control strategy of Solid Oxide Fuel Cell-Lithium DC microgrid based on adaptive droop control
- ASIC-Agent: An Autonomous Multi-Agent System for ASIC Design with Benchmark Evaluation
- Cyber-Attack Analysis and Investigation on PMSM Drive System in Battery Electrical Vehicles
- StyleGAN3-Based Lung Nodule Sample Generation and Classification Study
- Towards High-fidelity 3D Talking Avatar with Personalized Dynamic Texture
- Two-Step Microphone Array Fusion Algorithm for Enhanced Indoor Sound Source Localization
- Fuse Before Transmit: A Multimodal Semantic Communication Method
- Blockchain consensus mechanisms for democratic voting environments
- An Advanced Type-2 Fuzzy Inference for Rapid Convergence in Adaptive Communication Control
- Calibrated Multi-Preference Optimization for Aligning Diffusion Models
- Mind the Trojan Horse: Image Prompt Adapter Enabling Scalable and Deceptive Jailbreaking
- AMO Sampler: Enhancing Text Rendering with Overshooting
- A Bidirectional Linear Piezoelectric Actuator With High Resolution and Output Maintained Capability
- Divide, Conquer, and Match: A Distributed and Asynchronous Approach for Subgraph Isomorphism
- A Simple yet Effective Layout Token in Large Language Models for Document Understanding
- Early Detection of Lung Cancer using DenseNet with AI Support
- ESG Reporting and Digitalization in Financial Services: A Scoping Review of Emerging Trends and Gaps
- Monocular and Generalizable Gaussian Talking Head Animation
- Splatter-360: Generalizable 360° Gaussian Splatting for Wide-baseline Panoramic Images
- AesthetiQ: Enhancing Graphic Layout Design via Aesthetic-Aware Preference Alignment of Multi-modal Large Language Models
- VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with Video LLM
- From Sparse Signal to Smooth Motion: Real-Time Motion Generation with Rolling Prediction Models
- MoManipVLA: Transferring Vision-language-action Models for General Mobile Manipulation
- Enhancing Testing-Time Robustness for Trusted Multi-View Classification in the Wild
- SGFormer: Satellite-Ground Fusion for 3D Semantic Scene Completion
- MEET: Towards Memory-Efficient Temporal Sparse Deep Neural Networks
- Development of a Simulation Model to Design Lightning Protection Measures for an Overhead Railway Traction System in South Africa
- Conditional Balance: Improving Multi-Conditioning Trade-Offs in Image Generation
- Scalable Runtime Architecture for Data-driven, Hybrid HPC and ML Workflow Applications
- Attribute-Missing Multi-view Graph Clustering
- Improving Wireless Federated Learning via Joint Downlink-Uplink Beamforming over Analog Transmission
- Lightning Activity Patterns Across Regions Using the Chi-Square Test
- Research on Optimizing Data Preprocessing Using Clustering Algorithms
- Artificial Intelligence to Improve Security of Cloud Networks
- ScaleLSD: Scalable Deep Line Segment Detection Streamlined
- Analytical Model for Eccentric IPMSM via a New Equivalent Transformation Method and Its Rotor Vibration Calculation Considering the Coupling Effect
- Quad-Pixel Image Defocus Deblurring: A New Benchmark and Model
- DriveScape: High-Resolution Driving Video Generation by Multi-View Feature Fusion
- Control of Grid-Connected Dual-VSI DFIG-dc System With Series Connection at DC-link
- An Adaptive High-Accuracy Terrain Phase Error Compensation Algorithm in GNSS-based InBSAR via Multi-Satellite Collaborative Observations
- Insider Threat Detection using Machine Learning Models for User Behavior Analysis
- PVC: Progressive Visual Token Compression for Unified Image and Video Processing in Large Vision-Language Models
- FedChip: Federated LLM for Artificial Intelligence Accelerator Chip Design
- Generation of Dispersive Waves in Double-Ring Core Fiber for High-Order OAM Modes
- DeepCompress-ViT: Rethinking Model Compression to Enhance Efficiency of Vision Transformers at the Edge
- GIF: Generative Inspiration for Face Recognition at Scale
- Real-IAD D 3 : A Real-World 2D/Pseudo-3D/3D Dataset for Industrial Anomaly Detection
- LERFSNet: A Lightweight SAR Ship Detection Model with Enhanced Receptive Field and Shared Decoupling Head
- Detecting Out-of-distribution through the Lens of Neural Collapse
- Consistent Normal Orientation for 3D Point Clouds via Least Squares on Delaunay Graph
- Towards Precise Embodied Dialogue Localization via Causality Guided Diffusion
- FaceBench: A Multi-View Multi-Level Facial Attribute VQA Dataset for Benchmarking Face Perception MLLMs