- Design and Research of a Tea Production Process Interactive Educational Device Based on Arduino
- Scaling up Image Segmentation across Data and Tasks
- SAM2Object: Consolidating View Consistency via SAM2 for Zero-Shot 3D Instance Segmentation
- FALCON: Fairness Learning via Contrastive Attention Approach to Continual Semantic Scene Understanding
- VDocRAG: Retrieval-Augmented Generation over Visually-Rich Documents
- MagicQuill: An Intelligent Interactive Image Editing System
- MonoSplat: Generalizable 3D Gaussian Splatting from Monocular Depth Foundation Models
- DriveGPT4-V2: Harnessing Large Language Model Capabilities for Enhanced Closed-Loop Autonomous Driving
- Feasibility Study and Preliminary Testing of 3D Printing on ASICs for MEMS: A Particulate Sensor Case Study
- Subspace Constraint and Contribution Estimation for Heterogeneous Federated Learning
- Dual Diffusion for Unified Image Generation and Understanding
- Implementing Directive-Based Deferred Execution for Effective Network Aggregation
- SemanticDraw: Towards Real-Time Interactive Content Creation from Image Diffusion Models
- Koala-36M: A Large-Scale Video Dataset Improving Consistency between Fine-Grained Conditions and Video Content
- GIFStream: 4D Gaussian-Based Immersive Video with Feature Stream
- DropGaussian: Structural Regularization for Sparse-view Gaussian Splatting
- A UAV Exploration Planning Method Based on Improved Bio-Inspired Neural Network
- OpenSIEM: A Unified Open Source Security Management Framework
- Knowledge Base Autoencoder Framework: A Novel Approach for Continuous Phase Shift Compression in RIS-Aided Communications
- Explorative Evaluation of Validation Criteria for Validating Needs and Benefits of a Mobility Hub
- Enhancing Manufacturing Training Through VR Simulations
- Progressive Rendering Distillation: Adapting Stable Diffusion for Instant Text-to-Mesh Generation without 3D Data
- C2FNet: Cross-Probabilistic Weak Supervision Learning for High-Resolution Land Cover Enhancement
- MLLM-as-a-Judge for Image Safety without Human Labeling
- MANTA: Diffusion Mamba for Efficient and Effective Stochastic Long-Term Dense Action Anticipation
- Survey App: Rating and Feedback System Application
- Radar Self-Evolution Detection: Two-Stage Knowledge Transfer via Distillation-Fusion Synergy
- FedAWA: Adaptive Optimization of Aggregation Weights in Federated Learning Using Client Vectors
- WeGen: A Unified Model for Interactive Multimodal Generation as We Chat
- Application of a Blockchain-Based Filtering Model to Mitigate Cyber Attacks in a Decentralized Transactive Energy System
- MP-GUI: Modality Perception with MLLMs for GUI Understanding
- Analytical Subdomain Modelling and Analysis of a Single Rotor Induction Assisted IPM Motor for EVs
- HiMoR: Monocular Deformable Gaussian Reconstruction with Hierarchical Motion Representation
- 3D Prior is All You Need: Cross-Task Few-shot 2D Gaze Estimation
- Ev-3DOD: Pushing the Temporal Boundaries of 3D Object Detection with Event Cameras
- ILIAS: Instance-Level Image retrieval At Scale
- SkyMamba: Integrating Transformer and State Space Model for UAV Remote Sensing RGB-D Images Semantic Segmentation
- GEM: A Generalizable Ego-Vision Multimodal World Model for Fine-Grained Ego-Motion, Object Dynamics, and Scene Composition Control
- Research on Torque Optimization of Outer Rotor Permanent Magnet Synchronous Motor Based on Response Surface Methodology
- Multiple Object Tracking as ID Prediction
- Segment Any-Quality Images with Generative Latent Space Enhancement
- Gaussian Splatting for Efficient Satellite Image Photogrammetry
- Instant Adversarial Purification with Adversarial Consistency Distillation
- Distilling Spectral Graph for Object-Context Aware Open-Vocabulary Semantic Segmentation
- ProAPO: Progressively Automatic Prompt Optimization for Visual Classification
- UrbanCAD: Towards Highly Controllable and Photorealistic 3D Vehicles for Urban Scene Simulation
- Enhancing KGCN-Based Recommendation Algorithms via Attention Mechanism Integration
- An Improved Data Fusion Model for Secondary Return Water Temperature of Heating System Employing EKF
- Microfluidic biosensors for biotic and abiotic plant stress monitoring
- Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis
- Parallel Fractal Decomposition Optimization Algorithms on Heterogeneous Architectures
- Lifeline Connect: A Web-based Multi-Feature System for Mental Health Support
- Noise-Consistent Siamese-Diffusion for Medical Image Synthesis and Segmentation
- SLADE: Shielding against Dual Exploits in Large Vision-Language Models
- Multi-feature Collaborative Attention Dynamic Hypergraph Convolutional Network for Hyperspectral Image Classification
- MaskGaussian: Adaptive 3D Gaussian Representation from Probabilistic Masks
- UCM-VeID V2: A Richer Dataset and A Pre-Training Method for UAV Cross-Modality Vehicle Re-Identification
- TSD-SR: One-Step Diffusion with Target Score Distillation for Real-World Image Super-Resolution
- DirectTriGS: Triplane-based Gaussian Splatting Field Representation for 3D Generation
- Self-Sustained Oscillation Analysis of Grid-Interfaced Converters by Frequency-Domain Method Considering Harmonic Balance Principle
- Smart Eye: A Surveillance System
- MAGE: Single Image to Material-Aware 3D via the Multi-View G-Buffer Estimation Model
- Object-Shot Enhanced Grounding Network for Egocentric Video
- AssertionForge: Enhancing Formal Verification Assertion Generation with Structured Representation of Specifications and RTL
- iG-6DoF: Model-Free 6DoF Pose Estimation for Unseen Object via Iterative 3D Gaussian Splatting
- FSDFormer: a Frequency-Selected Differential Fusion Transformer for Remote Sensing Image Spatiotemporal Fusion
- Stacking Brick by Brick: Aligned Feature Isolation for Incremental Face Forgery Detection
- PRISTINE: PRIority-Aware Smart Resource Orchestration eNginE for Cloud-Native Applications
- SceneDiffuser++: City-Scale Traffic Simulation via a Generative World Model
- Empowering Large Language Models with 3D Situation Awareness
- GLAD-TL: A Time-Sensitive Crowdsourced Model for Robust Detection of Fake Taxis in Urban Traffic Surveillance
- SCORPIO: A Parallel I/O library for Exascale Earth System Models
- Study on Insulator Local Arc Development Considering Energy Level Transition and Surface Particles
- Towards Practical Real-Time Neural Video Compression
- Floxels: Fast Unsupervised Voxel Based Scene Flow Estimation
- NFC in Health Monitoring: A New Era of Medical Cards and Application
- Curriculum Direct Preference Optimization for Diffusion and Consistency Models
- Sufficient Invariant Learning for Distribution Shift
- ICE Over the Years - A Keyword Analysis
- TaskSimLF: Efficient Leader-Follower Multi-Agent Path Finding With Clustered Pickup and Delivery
- Conformal Prediction and MLLM aided Uncertainty Quantification in Scene Graph Generation
- DeSplat: Decomposed Gaussian Splatting for Distractor-Free Rendering
- AI Driven Self-Healing Cybersecurity Systems with Agentic AI for Adaptive Threat Response and Resilience
- Learning Compatible Multi-Prize Subnetworks for Asymmetric Retrieval
- A Review on Digital Product Passports as Drivers of Digital Transformation in Industry
- HELVIPAD: A Real-World Dataset for Omnidirectional Stereo Depth Estimation
- nbshmem: Enabling GPU-Initiated Multi-GPU Communication in Python
- StyleSSP: Sampling StartPoint Enhancement for Training-free Diffusion-based Method for Style Transfer
- FFaceNeRF: Few-shot Face Editing in Neural Radiance Fields
- Accelerating CRS Format Conversion for Sparse Matrix Computation with an FPGA
- Demystifying Chains, Trees, and Graphs of Thoughts
- Robust Safety Critical Control of Uncertain Nonlinear Systems with DoS Attacks
- Remote Current Sensing Using Reflectometry for Bioelectric Applications
- Shift the Lens: Environment-Aware Unsupervised Camouflaged Object Detection
- CraterID-Loc: End-to-End Crater Identification for Lunar Image Localization
- Cyber Laws & Emerging Trends of Artificial Intelligence: An Analytical Study
- PanDA: Towards Panoramic Depth Anything with Unlabeled Panoramas and Möbius Spatial Augmentation
- Reconstructing Animals and the Wild
- Modeling and Analysis of a Multipole Permanent Magnet Assisted Synchronous Reluctance Machine for Electric Vehicles
- SVG-IR: Spatially-Varying Gaussian Splatting for Inverse Rendering