- Order-One Rolling Shutter Cameras
- Development of a Hybrid Experimental Environment using PHIL for Multi-Unit Power Converter Networks
- HMAR: Efficient Hierarchical Masked Auto-Regressive Image Generation
- Leveraging RAG for Enhanced Business Intelligence with Local LLMs
- Jailbreaking the Non-Transferable Barrier via Test-Time Data Disguising
- BHViT: Binarized Hybrid Vision Transformer
- MM-OR: A Large Multimodal Operating Room Dataset for Semantic Understanding of High-Intensity Surgical Environments
- Effect of Field Contaminants on rGO-coated Flexible Leaf Wetness Sensors for In-Situ Agriculture Applications
- Heuristic Methods for Checking the Normality of Measurement Data with Graphical and Numerical Tests
- Data Analysis for Structural Health Monitoring of a Steel Jacket Offshore Platform
- Low-Rank Adaptation in Multilinear Operator Networks for Security-Preserving Incremental Learning
- Protecting Your Video Content: Disrupting Automated Video-based LLM Annotations
- GCC: Generative Color Constancy via Diffusing a Color Checker
- AI in Public Procurement: Potential and Adoption in the Competitive Tendering Process
- Unseen Visual Anomaly Generation
- Ouroboros3D: Image-to-3D Generation via 3D-aware Recursive Diffusion
- SOH Estimation of Lithium-ion Batteries using LSTM Model with Deconvoluted EIS Parameters
- SplatFlow: Multi-View Rectified Flow Model for 3D Gaussian Splatting Synthesis
- A Robust Cascade Controller Based Phase Shifted Full Bridge Converter for Electric Vehicle Applications
- An AST-guided LLM Approach for SVRF Code Synthesis
- Conformal Prediction for Zero-Shot Models
- Moving Towards Measuring Spatial Hearing Using Consumer-grade Headband EEG
- Investigating Efficient Edge Offloading Architectures for Serverless Systems
- Resilient Sensor Fusion under Adverse Sensor Failures via Multi-Modal Expert Fusion
- Reconfigurable Coding Design for Programmable Metasurface-Based DOA Estimation via Riemannian Manifold Optimization
- Sensify: A Learning-Based Budget-Aware Task Assignment in Mobile Crowdsensing
- Interpreting Object-level Foundation Models via Visual Precision Search
- TAPT: Test-Time Adversarial Prompt Tuning for Robust Inference in Vision-Language Models
- DiskVPS: Vanishing Point Detector via Hough Transform in a Disk Region
- Open-Canopy: Towards Very High Resolution Forest Monitoring
- 4Deform: Neural Surface Deformation for Robust Shape Interpolation
- From Elements to Design: A Layered Approach for Automatic Graphic Design Composition
- Efficient Dynamic mmWave Beam Selection Using Multimodal Attention-Based Approach
- Fault Detection for Train-Controlled On-Board Equipment Using a Hybrid CNN-LSTM Model
- LIM: Large Interpolator Model for Dynamic Reconstruction
- SAM-REF: Introducing Image-Prompt Synergy during Interaction for Detail Enhancement in the Segment Anything Model
- Crash Course on Quantum Computing for Engineering Students
- STF-GCN: A Multi-Domain Graph Convolution Network Method for Automatic Modulation Recognition via Adaptive Correlation
- A Systematic Approach for Continuous Monitoring and Validation of Product Properties in the Product Engineering Process
- A Multi-Time Selection Framework for Machine Translation Based on Large Language Models
- Perceptual Video Compression with Neural Wrapping
- High Dynamic Range Video Compression: A Large-Scale Benchmark Dataset and A Learned Bit-depth Scalable Compression Algorithm
- Repurposing Pre-trained Video Diffusion Models for Event-based Video Interpolation
- Point2RBox-v2: Rethinking Point-supervised Oriented Object Detection with Spatial Layout Among Instances
- SceneCrafter: Controllable Multi-View Driving Scene Editing
- ObjectMover: Generative Object Movement with Video Prior
- Partial Discharge Fault Detection Method of CNN-LSTM Based on Fusion Attention Mechanism
- FedCALM: Conflict-aware Layer-wise Mitigation for Selective Aggregation in Deeper Personalized Federated Learning
- DarkIR: Robust Low-Light Image Restoration
- Analysis of Students Stress Level using Machine Learning Algorithms
- Testing of a Concept for In-situ Detection of Humidity-Driven Degradation of IGBT Modules under Accelerated Aging
- Wavelet and Prototype Augmented Query-based Transformer for Pixel-level Surface Defect Detection
- GlyphMastero: A Glyph Encoder for High-Fidelity Scene Text Editing
- Latent Space Imaging
- Scaling Inference Time Compute for Diffusion Models
- FlashGS: Efficient 3D Gaussian Splatting for Large-scale and High-resolution Rendering
- Feature4X: Bridging Any Monocular Video to 4D Agentic AI with Versatile Gaussian Feature Fields
- FoundHand: Large-Scale Domain-Specific Learning for Controllable Hand Image Generation
- Video-ColBERT: Contextualized Late Interaction for Text-to-Video Retrieval
- Flash-Split: 2D Reflection Removal with Flash Cues and Latent Diffusion Separation
- When the Future Becomes the Past: Taming Temporal Correspondence for Self-supervised Video Representation Learning
- Prototype-Based Image Prompting for Weakly Supervised Histopathological Image Segmentation
- Multi-View Multi-Scale Network for 3D Object Recognition and Retrieval
- Homogeneous Dynamics Space for Heterogeneous Humans
- Enhanced DOA Estimation for Lightning Sources Using MUSIC and Coherent Signal Subspace Method
- STCOcc: Sparse Spatial-Temporal Cascade Renovation for 3D Occupancy and Scene Flow Prediction
- Honey Bee-Inspired Energy-Efficient Cluster Head Optimization for Large-Scale Cloud-based IoT Applications
- Prior Does Matter: Visual Navigation via Denoising Diffusion Bridge Models
- RoboSpatial: Teaching Spatial Understanding to 2D and 3D Vision-Language Models for Robotics
- Pos3R: 6D Pose Estimation for Unseen Objects Made Easy
- CoMatcher: Multi-View Collaborative Feature Matching
- VisionPAD: A Vision-Centric Pre-training Paradigm for Autonomous Driving
- FineLIP: Extending CLIP’s Reach via Fine-Grained Alignment with Longer Text Inputs
- Experimental Analysis of Multipath Characteristics in Indoor Distributed Massive MIMO Channels
- Breaking the Low-Rank Dilemma of Linear Attention
- Expert System in the Construction of Personalized Football Training Program Model
- Error Compensation-Based Fusion Algorithm for Drone-image Color Correction
- Process Simulation as a Basis for Process Mining in Intralogistics
- ManipTrans: Efficient Dexterous Bimanual Manipulation Transfer via Residual Learning
- Performance Comparison of Scalar and Direct Vector Control For Five-Phase Induction Motor Drive
- Towards Precise Scaling Laws for Video Diffusion Transformers
- A Theory of Learning Unified Model via Knowledge Integration from Label Space Varying Domains
- X-Dyna: Expressive Dynamic Human Image Animation
- Joint Optimization of Neural Radiance Fields and Continuous Camera Motion from a Monocular Video
- High-fidelity 3D Object Generation from Single Image with RGBN-Volume Gaussian Reconstruction Model
- Attention Distillation: A Unified Approach to Visual Characteristics Transfer
- AG-VPReID: A Challenging Large-Scale Benchmark for Aerial-Ground Video-based Person Re-Identification
- Classifier-Free Guidance inside the Attraction Basin May Cause Memorization
- Toward Robust Neural Reconstruction from Sparse Point Sets
- Design and Stability Optimization of 6T FinFET SRAM Cell using Strategic Multi-Fin Configuration
- ICP: Immediate Compensation Pruning for Mid-to-high Sparsity
- Semantic Entity Recognition Model Identification Method Based on Multiple Modes
- WF-VAE: Enhancing Video VAE by Wavelet-Driven Energy Flow for Latent Video Diffusion Model
- Improved Wavelet Threshold Denoising Approach for Air Duct Leakage Signal Processing
- Predicting Performance Variability
- MuCHEx: A Multimodal Conversational Debugging Tool for Interactive Visual Exploration of Hierarchical Object Classification
- Study on Spam SMS Detection Based on TF-IDF and Random Forest
- BACON: Improving Clarity of Image Captions via Bag-of-Concept Graphs
- Identifying and Mitigating Spurious Correlation in Multi-Task Learning
- A Thin Film-Based Lab-on-Chip for In-Field Analysis in Agriculture