- Smart Eye: A Surveillance system
- Speech Prediction in ANC Headphones for Improved Attenuation: New Methods and Perceptual Study
- AssertionForge: Enhancing Formal Verification Assertion Generation with Structured Representation of Specifications and RTL
- Observation and Analysis of a Multiple Lightning Strike Based on Dynamic Vision
- Face Forgery Video Detection via Temporal Forgery Cue Unraveling
- Scattering Center Modeling Of Complex Targets Under Cross-polarization
- Rolling-Capacitors Topology: A Simplified Phase-modular Solution to Obtain Stepped-up Three-Phase Five-level AC from Single DC Source
- Obstacle Avoidance Distributed Tracking of Networked UAVs with Online Path Planning
- Watermarking One for All: A Robust Watermarking Scheme Against Partial Image Theft
- OmniFlow: Any-to-Any Generation with Multi-Modal Rectified Flows
- 4D-Fly: Fast 4D Reconstruction from a Single Monocular Video
- Make them Socialites: Supporting Social Entrepreneurs
- Automatic Joint Structured Pruning and Quantization for Efficient Neural Network Training and Compression
- Adapting to Observation Length of Trajectory Prediction via Contrastive Learning
- Twinner: Shining Light on Digital Twins in a Few Snaps
- Hybrid Concept Bottleneck Models
- Skip Tuning: Pre-trained Vision-Language Models are Effective and Efficient Adapters Themselves
- Lightning-Terrain Association Mining Based on Improved Apriori Algorithm
- Research on the Construction and Operation Mode of Power Wireless Internet of Things Card Operation Management Platform
- ESCAPE: Equivariant Shape Completion via Anchor Point Encoding
- Compensation of a longitudinal excitation electromagnetic system for the detection of foreign bodies flowing in a pipe
- Knowledge-Aligned Counterfactual-Enhancement Diffusion Perception for Unsupervised Cross-Domain Visual Emotion Recognition
- ProtoDepth: Unsupervised Continual Depth Completion with Prototypes
- PhyT2V: LLM-Guided Iterative Self-Refinement for Physics-Grounded Text-to-Video Generation
- Edit Away and My Face Will not Stay: Personal Biometric Defense against Malicious Generative Editing
- Free-viewpoint Human Animation with Pose-correlated Reference Selection
- Medusa: A Multi-Scale High-order Contrastive Dual-Diffusion Approach for Multi-View Clustering
- Powertrain Architecture and Control System Design for Overhead Line and Battery Powered Railway Tower Car
- Development of a BIM-Data Mining Integrated Digital Twin and Its Use for Lifecycle Management Tools
- MenTeR: A fully-automated Multi-agenT workflow for end-to-end RF/Analog Circuits Netlist Design
- CoSER: Towards Consistent Dense Multiview Text-To-Image Generator for 3D Creation
- Smart Charge Scheduling of EVs based on User Behaviour Predicted using Machine Learning
- Detecting Fake News in Social Media using Natural Language Processing by Fake Polarity Detection
- GBlobs: Explicit Local Structure via Gaussian Blobs for Improved Cross-Domain LiDAR-based 3D Object Detection
- Progressive Rendering Distillation: Adapting Stable Diffusion for Instant Text-to-Mesh Generation without 3D Data
- AI-Driven Optimization of Passenger Flow: Integrating Computer Vision, Machine Learning, and Simulation for Enhanced Efficiency and Revenue Generation
- A Novel MIMO Arc SAR Imaging System for FOD Detection
- Unified Medical Lesion Segmentation via Self-referring Indicator
- TopV: Compatible Token Pruning with Inference Time Optimization for Fast and Low-Memory Multimodal Vision Language Model
- Machine Learning Models for Predicting the Performance of Risk Management Applications Running in Cloud
- Towards Universal Soccer Video Understanding
- Image Reconstruction from Readout-Multiplexed Single-Photon Detector Arrays
- Continual SFT Matches Multimodal RLHF with Negative Supervision
- Deep Multimodal Imitation Learning-Based Framework for Robot-Assisted Medical Examination
- Prof. Robot: Differentiable Robot Rendering Without Static and Self-Collisions
- Radar Self-Evolution Detection: Two-Stage Knowledge Transfer via Distillation-Fusion Synergy
- Serially Concatenated PPM for Deep Space Optical Communications over Poisson Channel
- FedAWA: Adaptive Optimization of Aggregation Weights in Federated Learning Using Client Vectors
- Robust Audio-Visual Segmentation via Audio-Guided Visual Convergent Alignment
- Recurrence-Enhanced Vision-and-Language Transformers for Robust Multimodal Document Retrieval
- Enhancing Obstacle Detection and Control in Autonomous Robotic Vehicles Through Edge Computing Integration
- PIAD: Pose and Illumination agnostic Anomaly Detection
- AeroGen: Enhancing Remote Sensing Object Detection with Diffusion-Driven Data Generation
- Ev-3DOD: Pushing the Temporal Boundaries of 3D Object Detection with Event Cameras
- DistinctAD: Distinctive Audio Description Generation in Contexts
- Multiple Object Tracking as ID Prediction
- Segment Any-Quality Images with Generative Latent Space Enhancement
- MetaShadow: Object-Centered Shadow Detection, Removal, and Synthesis
- Chain of Attack: On the Robustness of Vision-Language Models Against Transfer-Based Adversarial Attacks
- Temporal Score Analysis for Understanding and Correcting Diffusion Artifacts
- A History of the VHF Broadband Digital Interferometer of Lightning Research Group Osaka University
- Investigation of Oscillating Micro U-tube Based Fluid Density Sensor
- An Improved RFR Method for Enhancing Large-Signal Stability of Grid-Following Inverter Under Weak and Faulty Grid Conditions
- Dynamic Estimation of Mental Workload and Operator Accuracy for Time-Constrained Binary Classification Tasks
- SPAR3D: Stable Point-Aware Reconstruction of 3D Objects from Single Images
- Enhancing Generalization in Video Anomaly Detection through Multimodal Data Mixing
- Statistical Model Limitations of Ground Flash Density for Lightning Risk Assessment
- Less is More: Efficient Image Vectorization with Adaptive Parameterization
- Continuous 3D Perception Model with Persistent State
- Zero-1-to-A: Zero-Shot One Image to Animatable Head Avatars Using Video Diffusion
- Unlocking Generalization Power in LiDAR Point Cloud Registration
- SPA-VL: A Comprehensive Safety Preference Alignment Dataset for Vision Language Models
- Design and Implementation of a Cloud-Based Water Environment Monitoring System Using Internet of Things
- TCFG: Tangential Damping Classifier-free Guidance
- DSV-LFS: Unifying LLM-Driven Semantic Cues with Visual Features for Robust Few-Shot Segmentation
- MotionBench: Benchmarking and Improving Fine-Grained Video Motion Understanding for Vision Language Models
- Crab: A Unified Audio-Visual Scene Understanding Model with Explicit Cooperation
- Black Swan: Abductive and Defeasible Video Reasoning in Unpredictable Events
- Digital Products Based on Large Language Models for the Exploration of Graph-Databases in Materials Science and Manufacturing
- Pioneering 4-Bit FP Quantization for Diffusion Models: Mixup-Sign Quantization and Timestep-Aware Fine-Tuning
- Depth Dynamics via One-Bit Frequency Probing in Embedded Direct Time-of-Flight Sensing
- AD-LDB: A Modality-Incomplete Learning Model for Alzheimer's Disease Diagnosis
- Multi-Modal Aerial-Ground Cross-View Place Recognition with Neural ODEs
- A Review on Digital Product Passports as Drivers of Digital Transformation in Industry
- AlignMamba: Enhancing Multimodal Mamba with Local and Global Cross-Modal Alignment
- Calibrated Uncertainty Estimation for Trustworthy Deep IoT Attack Detection
- Generalized Recorrupted-to-Recorrupted: Self-Supervised Learning Beyond Gaussian Noise
- Robust Safety Critical Control of Uncertain Nonlinear Systems with DoS Attacks
- Remote Current Sensing Using Reflectometry for Bioelectric Applications
- RSAR: Restricted State Angle Resolver and Rotated SAR Benchmark
- PersonaHOI: Effortlessly Improving Face Personalization in Human-Object Interaction Generation
- Object Detection using Event Camera: A MoE Heat Conduction based Detector and A New Benchmark Dataset
- EnvPoser: Environment-aware Realistic Human Motion Estimation from Sparse Observations with Uncertainty Modeling
- Three-view Focal Length Recovery From Homographies
- ARKit LabelMaker: A New Scale for Indoor 3D Scene Understanding
- Intelligent and Autonomous Systems in Government
- Shading Meets Motion: Self-supervised Indoor 3D Reconstruction Via Simultaneous Shape-from-Shading and Structure-from-Motion
- Tripartite Weight-Space Ensemble for Few-Shot Class-Incremental Learning
- The Application Progress of Power Batteries in New Energy Ships
- Galois Hulls of a kind of Goppa Codes with Applications to EAQECCs