- Equivalent Sampling Frequency Offset in Transceivers: Minimization and Compensation for Broadband Photonics-Aided THz Wireless Transmission Systems
- DiffusionDrive: Truncated Diffusion Model for End-to-End Autonomous Driving
- AC Capacitor Dynamics-Based Synchronous Control for Grid-Following Operations
- Task-Specific Gradient Adaptation for Few-Shot One-Class Classification
- Integrating LSTM Autoencoders in Medical Monitoring Systems for Early Diabetes Detection
- AI-Face: A Million-Scale Demographically Annotated AI-Generated Face Dataset and Fairness Benchmark
- Lunar Crater Matching with Triangle-Based Global Second-Order Similarity for Precision Navigation
- Marten: Visual Question Answering with Mask Generation for Multi-modal Document Understanding
- Latent Drifting in Diffusion Models for Counterfactual Medical Image Synthesis
- Blurred LiDAR for Sharper 3D: Robust Handheld 3D Scanning with Diffuse LiDAR and RGB
- Research on Game Theory Based on Accurate Evaluation Function
- Align-A-Video: Deterministic Reward Tuning of Image Diffusion Models for Consistent Video Editing
- Evaluation of the Mutual Lightning Shielding Effect on Onshore Wind Farms
- Dynamic Derivation and Elimination: Audio Visual Segmentation with Enhanced Audio Semantics
- ExpertAF: Expert Actionable Feedback from Video
- GaussianSpa: An "Optimizing-Sparsifying" Simplification Framework for Compact and High-Quality 3D Gaussian Splatting
- Self-supervised ControlNet with Spatio-Temporal Mamba for Real-world Video Super-resolution
- Is There a Path Backward If the Cloud is Compromised?
- AniMo: Species-Aware Model for Text-Driven Animal Motion Generation
- Influences of the Earth Conductivity on Electric Fields Radiated from Lightning Strikes to Tall Towers
- DIV-FF: Dynamic Image-Video Feature Fields For Environment Understanding in Egocentric Videos
- AerialMegaDepth: Learning Aerial-Ground Reconstruction and View Synthesis
- Your Large Vision-Language Model Only Needs A Few Attention Heads For Visual Grounding
- MuTri: Multi-view Tri-alignment for OCT to OCTA 3D Image Translation
- Multi-focal Conditioned Latent Diffusion for Person Image Synthesis
- Implicit Bias Injection Attacks against Text-to-Image Diffusion Models
- Omni-RGPT: Unifying Image and Video Region-level Understanding via Token Marks
- StableAnimator: High-Quality Identity-Preserving Human Image Animation
- Learning to Highlight Audio by Watching Movies
- Integrating Physics-Informed Neural Networks and GRU for SciML-based Surface Temperature Prediction Li-ion Battery
- 4D LangSplat: 4D Language Gaussian Splatting via Multimodal Large Language Models
- A Three-Party Batch Authentication Based on Twisted Edwards-Curve in Mobile Edge Computing
- ChannelGuard: A DIRS-based Location Privacy-Protecting Mechanism for Integrated Sensing and Communication Systems
- TASTE-Rob: Advancing Video Generation of Task-Oriented Hand-Object Interaction for Generalizable Robotic Manipulation
- Secure Cloud AI: Leveraging Artificial Intelligence to Safeguard Cloud Data Sharing Against Fault Injection Attacks
- Pow3R: Empowering Unconstrained 3D Reconstruction with Camera and Scene Priors
- Enhancing Productivity and Performance of HClib-Actor with Efficient Task Termination
- An Image-like Diffusion Method for Human-Object Interaction Detection
- Comparative Analysis of Defect Detection Models for Wind Turbine Structures
- A FESI Domain Decomposition Method for EM Scattering by Multiscale Objects With Multiple Interior/Exterior Spherical Couplings
- DFormerv2: Geometry Self-Attention for RGBD Semantic Segmentation
- Exploring Simple Open-Vocabulary Semantic Segmentation
- Mitigation of Voltage Imbalance and Improving the Reliability of a Bipolar DC Microgrid using a Multiport Compensator
- Research on Rapid Prediction of Disaster Factors in Mine Tunnel Fires Based on WOA-BP
- Passive Multi-Target Visible Light Positioning Based on Multi-Camera Joint Optimization
- Optimized LCC-Series Resonant DC-DC Converter for Efficient Inductive Charging of EV’s with 400V and 800V Battery Systems
- A Generic Process Mining Framework for Uncovering Hierarchical Process Model
- Fault-Tolerant Synchronization Control of Switched Complex Networks by a Proportional-Integral Intermediate Observer Approach
- Zero-Shot Blind-spot Image Denoising via Implicit Neural Sampling
- Bridging Gait Recognition and Large Language Models Sequence Modeling
- A Multimodal Narrative Analysis Framework for University Ceremony Live Streaming Based on Deep Vision and Speech Models
- FactCheXcker: Mitigating Measurement Hallucinations in Chest X-ray Report Generation Models
- Adversarial Attack Against 3D Shapes Utilizing Their Common Points
- All-day Retrieval of Cloud Physical Properties from Meteosat Second Generation Satellite
- Design and Assembly of a Low-Parasitic-Capacitance Medium-Frequency Medium-Voltage Transformer
- Augmenting Multimodal LLMs with Self-Reflective Tokens for Knowledge-based Visual Question Answering
- Uncertainty-Instructed Structure Injection for Generalizable HD Map Construction
- SPARS3R: Semantic Prior Alignment and Regularization for Sparse 3D Reconstruction
- FreeCloth: Free-form Generation Enhances Challenging Clothed Human Modeling
- Navigating Image Restoration with VAR’s Distribution Alignment Prior
- Consistency Posterior Sampling for Diverse Image Synthesis
- Q-CASA: Concluding Remarks From Theory to Execution: System-Level Challenges and Innovations in Scalable Quantum Computing *
- Research on Cruise Control System of Waterborne Unmanned Vessel Based on Kalman Filtering
- Exploring Sparse MoE in GANs for Text-conditioned Image Synthesis
- Low-Complexity Human Detection and Localization via Channel State Information of a Wi-Fi Signal
- 4DTAM: Non-Rigid Tracking and Mapping via Dynamic Surface Gaussians
- Seurat: From Moving Points to Depth
- Digital Twin Modeling of the Blast Furnace CO Composition Field
- Exploiting Deblurring Networks for Radiance Fields
- A Machine Learning Based Method for Situational Target Threat Assessment
- Switching Loss of Power MOSFET in Switched-Capacitor Converters
- EEE-Bench: A Comprehensive Multimodal Electrical And Electronics Engineering Benchmark
- AnySat: One Earth Observation Model for Many Resolutions, Scales, and Modalities
- GenDeg: Diffusion-based Degradation Synthesis for Generalizable All-In-One Image Restoration
- A Comparative Study of Emerging Technologies for Smart Attendance Systems in the Digital Age
- Research on Cooperative Motion Planning of Mobile Manipulator Based on Hierarchical Switching Strategy
- A Memetic Algorithm Integrating Argument Acceptability for Stable Extension Solving in Abstract Argumentation Frameworks
- Light Transport-aware Diffusion Posterior Sampling for Single-View Reconstruction of 3D Volumes
- A Context Sensitive Method for Complex Event Processing
- Consistent and Controllable Image Animation with Motion Diffusion Models
- Modeling Multiple Normal Action Representations for Error Detection in Procedural Tasks
- Image Over Text: Transforming Formula Recognition Evaluation with Character Detection Matching
- Speed Sensorless Direct Torque Control of Asymmetric Six-Phase Induction Motor
- Reusable Object-Oriented Parallelization of Branch-and-Bound Algorithms
- Customizable Screen-Printed Triboelectric Sensor Tape for Immersive Exoskeleton-Based Virtual Interaction
- Based on Symplectic Geometry Decomposition Multimodal Symmetric Dot Pattern Marine Diesel Engines Fault Diagnosis Method
- ChatGarment: Garment Estimation, Generation and Editing via Large Language Models
- Make It Count: Text-to-Image Generation with an Accurate Number of Objects
- GUARD: A GNN-Based Tool for Automated Unit Test Case Generation and Code Defect Prediction
- Dual Energy-Based Model with Open-World Uncertainty Estimation for Out-of-distribution Detection
- Low-Cost and Low-Frequency Interface for Soil Moisture Monitoring
- MobileH2R: Learning Generalizable Human to Mobile Robot Handover Exclusively from Scalable and Diverse Synthetic Data
- Uncertainty-guided Perturbation for Image Super-Resolution Diffusion Model
- NAPEH: An Asynchronous and NUMA-Aware KV Store Based on Non-Volatile Memory Architectures
- DreamOmni: Unified Image Generation and Editing
- The Role of Governmental Support in Strengthening Venture Capital Ecosystems in Developing Economies: Case of Kazakhstan
- LLMDet: Learning Strong Open-Vocabulary Object Detectors under the Supervision of Large Language Models
- POp-GS: Next Best View in 3D-Gaussian Splatting with P-Optimality
- Understanding and Mitigating Lightning-Related Animal Fatalities: Case Studies, Injury Pathways, and Protection Measures
- ActiveGAMER: Active GAussian Mapping through Efficient Rendering