- Analytical Model for Ballistic 2D Nanotransistors
- MonoPlace3D: Learning 3D-Aware Object Placement for 3D Monocular Detection
- Generalizing Deepfake Video Detection with Plug-and-Play: Video-Level Blending and Spatiotemporal Adapter Tuning
- On the Singularity of SYCL
- DarkGAN-Enhanced Low-Light Detection and Localization of Low-Voltage Electricity Meters
- Recognizing Abnormalities in Fundus Images Using Vision Transformer for Ocular Diseases
- ROS-SAM: High-Quality Interactive Segmentation for Remote Sensing Moving Object
- A Reference Architecture for Digital Transformation of SMEs in the Manufacturing Domain
- Design of High-Precision Network Timing System Based on RT1064
- EfficientLLaVA: Generalizable Auto-Pruning for Large Vision-language Models
- Don’t Shake the Wheel: Momentum-Aware Planning in End-to-End Autonomous Driving
- EDEN: Enhanced Diffusion for High-quality Large-motion Video Frame Interpolation
- Shadows of Disparity: Unveiling the Asymmetry of Mutual Coupling in Densely-Packed MIMO
- A YOLO-ESAM Algorithm with Occlusion Immunity for Small Object Detection
- Robust Partially-Observed VIoT Data Sensing via Half Quadratic Loss With Flexible Weighted Groupwise Relaxed Label Margins
- DPC: Dual-Prompt Collaboration for Tuning Vision-Language Models
- AI-Face: A Million-Scale Demographically Annotated AI-Generated Face Dataset and Fairness Benchmark
- Artificial Intelligence to Improve Security of Cloud Networks
- Analytical Model for Eccentric IPMSM via a New Equivalent Transformation Method and Its Rotor Vibration Calculation Considering the Coupling Effect
- Hearing Anywhere in Any Environment
- FedChip: Federated LLM for Artificial Intelligence Accelerator Chip Design
- Let Humanoids Hike! Integrative Skill Development on Complex Trails
- Align-A-Video: Deterministic Reward Tuning of Image Diffusion Models for Consistent Video Editing
- Evaluation of the Mutual Lightning Shielding Effect on Onshore Wind Farms
- Dynamic Derivation and Elimination: Audio Visual Segmentation with Enhanced Audio Semantics
- ExpertAF: Expert Actionable Feedback from Video
- An Integrated API and WebSocket-Based Framework for Emergency Vehicle Priority Routing
- Retrieving Semantics from the Deep: an RAG Solution for Gesture Synthesis
- DV-Matcher: Deformation-based Non-Rigid Point Cloud Matching Guided by Pre-trained Visual Features
- AI Personalized Language Learning Application
- Effect of hardening on the magnetic behavior of AISI 1045 steel
- Real-IAD D 3 : A Real-World 2D/Pseudo-3D/3D Dataset for Industrial Anomaly Detection
- Detecting Out-of-distribution through the Lens of Neural Collapse
- Towards Transformer-Based Aligned Generation with Self-Coherence Guidance
- Progressive Correspondence Regenerator for Robust 3D Registration
- LamRA: Large Multimodal Model as Your Advanced Retrieval Assistant
- Language-Guided Salient Object Ranking
- Any3DIS: Class-Agnostic 3D Instance Segmentation by 2D Mask Tracking
- Self-supervised ControlNet with Spatio-Temporal Mamba for Real-world Video Super-resolution
- A Comprehensive Machine Learning Framework for Phishing URL Detection: Dataset Integration, Feature Extraction, and Model Evaluation
- Flying Vines: Design, Modeling, and Control of a Soft Aerial Robotic Arm
- SnowMaster: Comprehensive Real-world Image Desnowing via MLLM with Multi-Model Feedback Optimization
- FaceBench: A Multi-View Multi-Level Facial Attribute VQA Dataset for Benchmarking Face Perception MLLMs
- Difix3D+: Improving 3D Reconstructions with Single-Step Diffusion Models
- Optical Dialogue Photonic Converter for Photon-Driven DC Motor System
- Difference Inversion: Interpolate and Isolate the Difference with Token Consistency for Image Analogy Generation
- A General Adaptive Dual-level Weighting Mechanism for Remote Sensing Pansharpening
- Enhancing Safety in Industry 5.0: Human-Computer Collaboration Benefits through a Dataset of Protective Equipment Detection
- EntropyMark: Towards More Harmless Backdoor Watermark via Entropy-based Constraint for Open-source Dataset Copyright Protection
- ITA-MDT: Image-Timestep-Adaptive Masked Diffusion Transformer Framework for Image-Based Virtual Try-On
- PGC: Physics-Based Gaussian Cloth from a Single Pose
- Sea-Ing in Low-Light
- VLsI: Verbalized Layers-to-Interactions from Large to Small Vision Language Models
- PEACE: Empowering Geologic Map Holistic Understanding with MLLMs
- Automated Proof of Polynomial Inequalities via Reinforcement Learning
- Improving Semi-Supervised Semantic Segmentation with Sliced-Wasserstein Feature Alignment and Uniformity
- Prolog-RAG: A Symbolic Reasoning Approach to Retrieval-Augmented Generation
- A Novel Continuously Variable Gate Voltage Control Concept for Silicon Carbide Power Modules
- ConMo: Controllable Motion Disentanglement and Recomposition for Zero-Shot Motion Transfer
- VideoWorld: Exploring Knowledge Learning from Unlabeled Videos
- DeepFakeGuard: Real-Time Deepfake Video Detection Leveraging Celeb-DF Dataset and CNN-LSTM Framework
- Deep Learning - Driver Edge Registration for Enhanced Home Health Monitoring
- DNN-Based Precoding in RIS-Aided mmWave MIMO Systems With Practical Phase Shift
- GOAL: Global-local Object Alignment Learning
- Modeling and Analysis of the Effect of Grounding and Bonding of Floating Roof Tanks on Their Performance Against the Direct Lightning Strikes
- On the Usability and Energy Efficiency of High-Level Synthesis for FPGA-based Network-Attached Accelerators
- A Simplified Model and Parameterization Method for Variable Frequency Drive Loads in Phasor-Level Power System Studies
- Exploring Simple Open-Vocabulary Semantic Segmentation
- ArcPro: Architectural Programs for Structured 3D Abstraction of Sparse Points
- SeriesBench: A Benchmark for Narrative-Driven Drama Series Understanding
- Container Scheduling Strategy Based on Container Clustering and Deep Reinforcement Learning
- Volumetrically Consistent 3D Gaussian Rasterization
- Analysis of Inherent Damping Mechanism and Its Contribution to Stability in DCM Grid-Tied Inverters with LCL Filters
- FactCheXcker: Mitigating Measurement Hallucinations in Chest X-ray Report Generation Models
- Generative Multiview Relighting for 3D Reconstruction under Extreme Illumination Variation
- COB-GS: Clear Object Boundaries in 3DGS Segmentation Based on Boundary-Adaptive Gaussian Splitting
- Adversarial Attack Against 3D Shapes Utilizing Their Common Points
- HumanRig: Learning Automatic Rigging for Humanoid Character in a Large Scale Dataset
- Fuse Before Transmit: A Multimodal Semantic Communication Method
- Mind the Trojan Horse: Image Prompt Adapter Enabling Scalable and Deceptive Jailbreaking
- A Bidirectional Linear Piezoelectric Actuator With High Resolution and Output Maintained Capability
- Sound Bridge: Associating Egocentric and Exocentric Videos via Audio Cues
- SkePU-DNN: Algorithmic Skeleton Programming for Deep Learning on Heterogeneous Systems
- Multi-Modal Contrastive Masked Autoencoders: A Two-Stage Progressive Pre-training Approach for RGBD Datasets
- Computer Audition: From Task-Specific Machine Learning to Foundation Models
- A Context Sensitive Method for Complex Event Processing
- Robustness Analysis of Temperature-Sensitive Electrical Parameters of SiC MOSFETs
- WeakMCN: Multi-task Collaborative Network for Weakly Supervised Referring Expression Comprehension and Segmentation
- Active Data Curation Effectively Distills Large-Scale Multimodal Models
- Sample-Efficient Reinforcement Learning from Human Feedback via Information-Directed Sampling
- Your Large Vision-Language Model Only Needs A Few Attention Heads For Visual Grounding
- MuTri: Multi-view Tri-alignment for OCT to OCTA 3D Image Translation
- Reinforcement Learning-Based Predefined-Time Tracking Control for Input-Saturated Nonlinear Systems With Performance Guarantees
- Dual Energy-Based Model with Open-World Uncertainty Estimation for Out-of-distribution Detection
- Low-Cost and Low-Frequency Interface for Soil Moisture Monitoring
- MobileH2R: Learning Generalizable Human to Mobile Robot Handover Exclusively from Scalable and Diverse Synthetic Data
- Independent Pole Arm Current Control of Hybrid MMC Under DC Grid Fault
- VASparse: Towards Efficient Visual Hallucination Mitigation via Visual-Aware Token Sparsification
- Reloc3r: Large-Scale Training of Relative Camera Pose Regression for Generalizable, Fast, and Accurate Visual Localization
- The Role of Governmental Support in Strengthening Venture Capital Ecosystems in Developing Economies: Case of Kazakhstan