- VoxelSplat: Dynamic Gaussian Splatting as an Effective Loss for Occupancy and Flow Prediction
- SET: Spectral Enhancement for Tiny Object Detection
- Towards Satellite Image Road Graph Extraction: A Global-Scale Dataset and A Novel Method
- MaRI: Material Retrieval Integration across Domains
- Comparison of Classical Controllers in DTC of PMSG-Based Wind Energy Conversion System
- Effectivity of Insulator Replacement to Avoiding Disturbance Caused by Lightning
- StoryGPT-V: Large Language Models as Consistent Story Visualizers
- OCRT: Boosting Foundation Models in the Open World with Object-Concept-Relation Triad
- Vehicular Communication Security: Multi-Channel and Multi-Factor Authentication
- Interference Exploitation in ISAC Systems: Finite-Alphabet Precoding with Low Resolution DACs and PSs
- Test-Time Visual In-Context Tuning
- The design of a multi-parameter intelligent monitoring system for marine ranching based on cloud platforms
- RealEdit: Reddit Edits As a Large-scale Empirical Dataset for Image Transformations
- Bridge the Gap: From Weak to Full Supervision for Temporal Action Localization with PseudoFormer
- Panorama Generation From NFoV Image Done Right
- Data Analysis for Structural Health Monitoring of a Steel Jacket Offshore Platform
- ClimbingCap: Multi-Modal Dataset and Method for Rock Climbing in World Coordinate
- Generalized Zero-Shot Classification via Semantics-Free Inter-Class Feature Generation
- Intelligent Coordination System for Autonomous Domestic Heating: An AI-Driven Test-Bench
- Personalized Diabetes Diet Recommendation System with Knowledge Graph and Incremental Learning
- Low-Rank Adaptation in Multilinear Operator Networks for Security-Preserving Incremental Learning
- Building Vision Models upon Heat Conduction
- Study of TEG-Heatsink Pairs for Indoor Thermal Energy Harvesting Applications
- Exploring NCCL Tuning Strategies for Distributed Deep Learning
- One2Any: One-Reference 6D Pose Estimation for Any Object
- Poster: A Scalable and Fault-Tolerant Decentralized Middleware for CI/CD Workflow
- HeatFormer: A Neural Optimizer for Multiview Human Mesh Recovery
- Number it: Temporal Grounding Videos like Flipping Manga
- SOH Estimation of Lithium-ion Batteries using LSTM Model with Deconvoluted EIS Parameters
- Global-Edge Dual-Path Semantic Image Segmentation for Transparent Objects
- A Network of Influence: Agency and Stakeholder Relationships in Sustainability Strategy Implementation
- Analysis of Project Management Models: An Investigation of Structure-, Process- and Function-Oriented Elements for the Tailoring of Project Design
- PreciseCam: Precise Camera Control for Text-to-Image Generation
- A Hybrid Technique for Detecting Cyber Threats Through Network Traffic Analysis
- A Sophisticated Authentication Coupled with A Multi Modal Security Integration and Role based Access Control for Secure File Management
- Investigating Efficient Edge Offloading Architectures for Serverless Systems
- PMA: Towards Parameter-Efficient Point Cloud Understanding via Point Mamba Adapter
- Commonsense Video Question Answering through Video-Grounded Entailment Tree Reasoning
- Reconfigurable Coding Design for Programmable Metasurface-Based DOA Estimation via Riemannian Manifold Optimization
- Unveiling the Mist over 3D Vision-Language Understanding: Object-centric Evaluation with Chain-of-Analysis
- Design and Verification of SiC Amplifiers for Extreme Temperature Applications Based on ANN Modeling of 4H-SiC MOSFETs
- PAVE: Patching and Adapting Video Large Language Models
- Augmenting Advertiser Decision Support with Generative AI and Interactive Analytics
- Interpreting Object-level Foundation Models via Visual Precision Search
- T-FAKE: Synthesizing Thermal Images for Facial Landmarking
- Design of a Solid-State Circuit Beaker (SSCB) Integrated DC Fast Charger with Flexible Power Processing Capability
- Channel Consistency Prior and Self-Reconstruction Strategy Based Unsupervised Image Deraining
- DashGaussian: Optimizing 3D Gaussian Splatting in 200 Seconds
- Reconfigurable Processor-Centric Accelerators for Safety-Critical Applications
- Design Method of Solder Joint in Surge Protective Device by Simulation
- TAPT: Test-Time Adversarial Prompt Tuning for Robust Inference in Vision-Language Models
- Research and Design of Triple Modular Redundancy Technology for Processors Oriented to RISC-V Architecture
- Channel Modeling and Accelerated Ray-Tracing Simulation for RIS-Assisted MIMO Systems
- Automated Canary Analysis for Kubernetes Deployments
- Open-Canopy: Towards Very High Resolution Forest Monitoring
- RelationField: Relate Anything in Radiance Fields
- DELTA: Directional-Aware Encoding and Local Transformer for Thangka Style Transfer
- Dual-Point Grounded Five-Level T-type Inverter for Photovoltaic Applications
- Mission Abort Policy for Coherent Systems With Heterogeneous Components
- R2C: Mapping Room to Chessboard to Unlock LLM As Low-Level Action Planner
- DynRefer: Delving into Region-level Multimodal Tasks via Dynamic Resolution
- An Active Filter Compensation Solution for High Power Energy Sources
- Florence-VL: Enhancing Vision-Language Models with Generative Vision Encoder and Depth-Breadth Fusion
- 3D Gaussian Head Avatars with Expressive Dynamic Appearances by Compact Tensorial Representations
- Learning 4D Panoptic Scene Graph Generation from Rich 2D Visual Scene
- COLNet: A Chinese Online Learning Analysis Model
- CareerAlly: An Intelligent NLP-Driven Chatbot
- Distilling Spatially-Heterogeneous Distortion Perception for Blind Image Quality Assessment
- COSMOS: Cross-Modality Self-Distillation for Vision Language Pre-training
- Event fields: Capturing light fields at high speed, resolution, and dynamic range
- Towards Universal Dataset Distillation via Task-Driven Diffusion
- Does Matter: Visual Navigation via Denoising Diffusion Bridge Models
- Quantized Graph-Based Personalized DRL for Dependency-Aware Task Offloading in Heterogeneous Edge Networks
- Boosting the Dual-Stream Architecture in Ultra-High Resolution Segmentation with Resolution-Biased Uncertainty Estimation
- Testbench analysis using non-invasive fault injection
- HyperNet Fields: Efficiently Training Hypernetworks without Ground Truth by Learning Weight Trajectories
- Your Scale Factors are My Weapon: Targeted Bit-Flip Attacks on Vision Transformers via Scale Factor Manipulation
- Assessment of Insider Threats in Computer Networks and Mitigation Techniques
- Quadratic Switched Inductor-Capacitor Multi-Port Converter for DC Microgrid Application
- Decoupling Training-Free Guided Diffusion by ADMM
- Lost in Translation, Found in Context: Sign Language Translation with Contextual Cues
- Simulator HC: Regression-based Online Simulation of Starting Problem-Solution Pairs for Homotopy Continuation in Geometric Vision
- Simplified Power Semiconductor Loss Evaluation With SPICE Models in PLECS
- Cooperative Localization Via Semidefinite Programming for Transmitters and Reflectors
- Towards Tactile Communication of English Language: A Visual Handbook Enhances Letter Learning
- MambaOut: Do We Really Need Mamba for Vision?*
- VisionPAD: A Vision-Centric Pre-training Paradigm for Autonomous Driving
- FineLIP: Extending CLIP’s Reach via Fine-Grained Alignment with Longer Text Inputs
- D2SP: Dynamic Dual-Stage Purification Framework for Dual Noise Mitigation in Vision-Based Affective Recognition
- ManipTrans: Efficient Dexterous Bimanual Manipulation Transfer via Residual Learning
- CraftsMan3D: High-fidelity Mesh Generation with 3D Native Diffusion and Interactive Geometry Refiner
- HMAR: Efficient Hierarchical Masked Auto-Regressive Image Generation
- Video Summarization with Large Language Models
- Rethinking Correspondence-based Category-Level Object Pose Estimation
- AgentBlock: Infrastructure for Integrating Blockchain and Multi-Agent Robotic Systems for Optimising Industrial Production and Logistics
- Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models
- Pathology-Guided AI System for Accurate Segmentation and Diagnosis of Cervical Spondylosis
- Attention Distillation: A Unified Approach to Visual Characteristics Transfer
- Classifier-Free Guidance inside the Attraction Basin May Cause Memorization
- Fast start-up procedure for two-dimensional MEMS micromirror