- A Compliant Mandrel-Based Fiber Optic Hydrophone for Underwater Acoustic Sensing
- CoA: Towards Real Image Dehazing via Compression-and-Adaptation
- Cloud-Edge Collaborative Transcoding for Adaptive Video Streaming: Enhancing QoE in Wireless Networks
- FisherTune: Fisher-Guided Robust Tuning of Vision Foundation Models for Domain Generalized Segmentation
- Exploiting Temporal State Space Sharing for Video Semantic Segmentation
- Impact assessment of optimally integrated green energy resources on microgrid loss allocation using an efficient distribution power flow algorithm
- Towards Explainable and Unprecedented Accuracy in Matching Challenging Finger Crease Patterns
- Research on Automotive Motors Suitable for LCA, Including Motors with Aluminum Windings
- Improving Adversarial Transferability on Vision Transformers via Forward Propagation Refinement
- UNIC-Adapter: Unified Image-Instruction Adapter with Multi-Modal Transformer for Image Generation
- InteractionMap: Improving Online Vectorized HDMap Construction with Interaction
- Advancing Generalizable Tumor Segmentation with Anomaly-Aware Open-Vocabulary Attention Maps and Frozen Foundation Diffusion Models
- From Head to Tail: Efficient Black-box Model Inversion Attack via Long-tailed Learning
- Delay Optimizing Based Passivity Enhancement of Converter-Side Current Controlled LCL-Type Grid Converters
- Towards the Integration of FPGA-based Deep Learning Edge Computing on SmallSats for Low-Latency Autonomous Decision-Making
- Optimizing Pega Healthcare Solutions with Hyperscalers: A Scalable Cloud-First Approach to Digital Transformation
- Artificial Intelligence Algorithm Decision and Optimization Methods for Complex Production Systems
- A Multiphysics Reservoir Computing System with Mass-Spring Metamaterials and Spintronic Readout for Vibration Analysis
- DTOS: Dynamic Time Object Sensing with Large Multimodal Model
- Outage Probability of UAV-Assisted Dual-Hop RF-UWOC Systems Leveraging NOMA
- Teaching Accelerated Computing with Hands-on Experience
- Joint Optimization of Multi-UAV Assisted Computation Offloading and Topological Task Routing for Consumer IoT Emerging Businesses
- Research on Technical Solution and Implementation Mechanism of Wechat Ecosystem-Based Air Quality Program
- Exploiting CoRSMA-ISAC in Multi-UAV System for Emergency Response
- Mono-InternVL: Pushing the Boundaries of Monolithic Multimodal Large Language Models with Endogenous Visual Pre-training
- LL-Localizer: A Life-Long Localization System based on Dynamic i-Octree
- Brain Tumor Classification with ResNeXt and its Comprehensive Evaluation
- Multi-Agent Hierarchical Deep Reinforcement Learning for HVAC Control With Flexible DERs
- Adversarial Domain Prompt Tuning and Generation for Single Domain Generalization
- Enhancing Dataset Distillation via Non-Critical Region Refinement
- Bridging Modalities: Improving Universal Multimodal Retrieval by Multimodal Large Language Models
- Synthetic Visual Genome
- OmniStereo: Real-Time Omnidireactional Depth Estimation with Multiview Fisheye Cameras
- Parameter-efficient Fine-tuning in Hyperspherical Space for Open-vocabulary Semantic Segmentation
- HUNet: Homotopy Unfolding Network for Image Compressive Sensing
- CMMLoc: Advancing Text-to-PointCloud Localization with Cauchy-Mixture-Model Based Framework
- ESC: Erasing Space Concept for Knowledge Deletion
- On the Consistency of Video Large Language Models in Temporal Comprehension
- The Body in Affective Robotics: A Survey and Conceptual Positioning Using the Performing Arts as a Scaffold for Understanding Bodily Expressed Emotion
- Subpixel-Level Dynamic Displacement Tracking in Pile Driving Using Computer Vision and Frequency-Domain Matching
- UniK3D: Universal Camera Monocular 3D Estimation
- Cropper: Vision-Language Model for Image Cropping through In-Context Learning
- Unlabeled Samples Improve Few-Shot Underwater Acoustic Target Recognition
- A System for DNS Over HTTPS Deployment and Security Measurement
- ZoomLDM: Latent Diffusion Model for multi-scale image generation
- WAAD: A Web Vulnerability Attack Behavior Identification Method Based on Large Language Model
- Driving Industry 5.0 Success: How Theory Meets Practice in Europe's Industrial Evolution
- Gray Scale Image Colorization using Convolutional Neural Network and PyTorch
- A Yolov8-Based Object Detection Framework with Moiré Pattern Removal
- HERA: Hybrid Explicit Representation for Ultra-Realistic Head Avatars
- Learnable Infinite Taylor Gaussian for Dynamic View Rendering
- Lska-Yolo: Improved Yolo Framework Tailored for Cervical Cell Detection
- An Enhanced Control Strategy to Alleviate Weak Grid Oscillations in Type-4 Wind Farms
- Power Mismatch Elimination in Three Phase Grid Connected Modular Multilevel Converters using Quadruple Active Bridges
- DeepCircuitX: A Comprehensive Repository-Level Dataset for RTL Code Understanding, Generation, and PPA Analysis
- Seeing more with less: human-like representations in vision models
- LeviTor: 3D Trajectory Oriented Image-to-Video Synthesis
- Speedy-Splat: Fast 3D Gaussian Splatting with Sparse Pixels and Sparse Primitives
- Interpretable Image Classification via Non-parametric Part Prototype Learning
- HVI: A New Color Space for Low-light Image Enhancement
- RC-AutoCalib: An End-to-End Radar-Camera Automatic Calibration Network
- Research on the Prediction System of State Grid Corporation of China's Bidding Business Based on Ensemble Learning and Reflex
- CLIP Under the Microscope: A Fine-Grained Analysis of Multi-Object Representation
- CLIP is Almost All You Need: Towards Parameter-Efficient Scene Text Retrieval without OCR
- A Fully Soft-Switched AC/DC Converter With ZVT Cell And Magnetic Integration for 800V On-Board Charger
- High-Voltage-Design and Ultrafast-Switching Issues of an UWBG Vertical Ga 2 O 3 MOSFET
- GraphI2P: Image-to-Point Cloud Registration with Exploring Pattern of Correspondence via Graph Learning
- HOIGen-1M: A Large-scale Dataset for Human-Object Interaction Video Generation
- A Data-Centric Revisit of Pre-Trained Vision Models for Robot Learning
- Reconstructing People, Places, and Cameras
- A Cascaded 2-Level Z-Source Dual Inverter with Single Source and Reduced Battery Voltage
- Hardware Acceleration of LDPC Encoding Based on CGRA
- Non-Cooperative Target Radar RCS Data Generation Based on Transfer Learning
- A Study on HMI Design of Intelligent Networked Vehicles Based on KJ-AHP
- Synthetic Data is an Elegant GIFT for Continual Vision-Language Models
- Relation-Rich Visual Document Generator for Visual Information Extraction
- Sketchtopia: A Dataset and Foundational Agents for Benchmarking Asynchronous Multimodal Communication with Iconic Feedback
- Multi-modal Contrastive Learning with Negative Sampling Calibration for Phenotypic Drug Discovery
- TO-LF: A Texture and Occlusion-Oriented Benchmark Dataset for Light Field Disparity Estimation
- SparseAlign: A Fully Sparse Framework for Cooperative Object Detection
- Hierarchical Flow Diffusion for Efficient Frame Interpolation
- SEAL: SEmantic Attention Learning for Long Video Representation
- Deterministic-to-Stochastic Diverse Latent Feature Mapping for Human Motion Synthesis
- ForestLPR: LiDAR Place Recognition in Forests Attentioning Multiple BEV Density Images
- Free360: Layered Gaussian Splatting for Unbounded 360-Degree View Synthesis from Extremely Sparse and Unposed Views
- Model Predictive Control for Reliable and Efficient Path Tracking in Autonomous Vehicles
- Temporal Separation with Entropy Regularization for Knowledge Distillation in Spiking Neural Networks
- Efficient Diffusion as Low Light Enhancer
- Prior-free 3D Object Tracking
- Seeking Consistent Flat Minima for Better Domain Generalization via Refining Loss Landscapes
- AdMiT: Adaptive Multi-Source Tuning in Dynamic Environments
- Design2GarmentCode: Turning Design Concepts to Tangible Garments Through Program Synthesis
- Category-Agnostic Neural Object Rigging
- Hardware implementation of PDS-PWM for Five-Level Active Neutral Point Clamped Inverter using LAUNCHXL F28379D
- Containerized Deployment of Secure LLM Workflows in Multi-Cloud Infrastructures
- Towards Realistic Example-based Modeling via 3D Gaussian Stitching
- Enhanced Cascaded Object Detection Network Based on Indirect Self-Attention Mechanism
- STINR: Deciphering Spatial Transcriptomics via Implicit Neural Representation
- Unveiling the Ignorance of MLLMs: Seeing Clearly, Answering Incorrectly
- A Geometric Approach for Cyber-Attack Detection in DC Microgrids