- HyperGLM: HyperGraph for Video Scene Graph Generation and Anticipation
- Design and Implementation of Mobile Emergency Communication Terminal Based on Tiantong Satellite
- Hybrid Global-Local Representation with Augmented Spatial Guidance for Zero-Shot Referring Image Segmentation
- LLM-driven Multimodal and Multi-Identity Listening Head Generation
- CoSDH: Communication-Efficient Collaborative Perception via Supply-Demand Awareness and Intermediate-Late Hybridization
- Ultra-Wide Voltage Range Reconfigurable DAB Converter for Universal PEV Charging Stations
- SAM2Object: Consolidating View Consistency via SAM2 for Zero-Shot 3D Instance Segmentation
- Design and Analysis of the Influence of Station Number, Distance, and Configuration on the Location Error of the 3-Dimensional Lightning Mapping System
- Acquire and then Adapt: Squeezing out Text-to-Image Model for Image Restoration
- SplatFlow: Self-Supervised Dynamic Gaussian Splatting in Neural Motion Flow Field for Autonomous Driving
- EditAR: Unified Conditional Generation with Autoregressive Models
- Horizon-Gs: Unified 3D Gaussian Splatting for Large-Scale Aerial-To-Ground Scenes
- VideoSPatS: Video SPatiotemporal Splines for Disentangled Occlusion, Appearance and Motion Modeling and Editing
- An Acquisition Circuit Based on the Transimpedance Amplifier for Triboelectric Nanogenerator Sensors
- Dual Diffusion for Unified Image Generation and Understanding
- ScaMo: Exploring the Scaling Law in Autoregressive Motion Generation Model
- Koala-36M : A Large-Scale Video Dataset Improving Consistency between Fine-Grained Conditions and Video Content
- DropGaussian: Structural Regularization for Sparse-view Gaussian Splatting
- GENIUS: A Generative Framework for Universal Multimodal Search
- FSFM: A Generalizable Face Security Foundation Model via Self-Supervised Facial Representation Learning
- Manufacturing and Reliability of Low Parasitic Capacitance Flip Chip SiC Power Module
- MobileMamba: Lightweight Multi-Receptive Visual Mamba Network
- LLAVIDAL : A Large LAnguage VIsion Model for Daily Activities of Living
- Unraveling Normal Anatomy via Fluid-Driven Anomaly Randomization
- O-TPT: Orthogonality Constraints for Calibrating Test-time Prompt Tuning in Vision-Language Models
- Opportunistic Single-Photon Time of Flight
- Interpretable Generative Models through Post-Hoc Concept Bottlenecks
- PersonaBooth: Personalized Text-to-Motion Generation
- MAP: Unleashing Hybrid Mamba-Transformer Vision Backbone’s Potential with Masked Autoregressive Pretraining
- InstanceCap: Improving Text-to-Video Generation via Instance-aware Structured Caption
- Scalable Graph-Guided Transformer for Point Cloud Geometry Coding
- SecDV: A Lightweight Secure Deep Neural Network Inference Service with Dynamic Verification
- Radar compound deception jamming recognition based on fast-slow time-frequency distributions
- WISNet: Pseudo Label Generation on Unbalanced and Patch Annotated Waste Images
- Detecting Fake News in Social Media using Natural Language Processing by Fake Polarity Detection
- Open-World Objectness Modeling Unifies Novel Object Detection
- AI-Driven Optimization of Passenger Flow: Integrating Computer Vision, Machine Learning, and Simulation for Enhanced Efficiency and Revenue Generation
- HumanDreamer: Generating Controllable Human-Motion Videos via Decoupled Generation
- Efficient Detection of Relaxed Maximal Cliques in Large-Scale IoT Networks
- Citrus Sorting Dynamic Control Using Multispectral Computer Vision
- Multiple Object Tracking as ID Prediction
- Segment Any-Quality Images with Generative Latent Space Enhancement
- Gaussian Splatting for Efficient Satellite Image Photogrammetry
- Instant Adversarial Purification with Adversarial Consistency Distillation
- Mamba as a Bridge: Where Vision Foundation Models Meet Vision Language Models for Domain-Generalized Semantic Segmentation
- Rectified Diffusion Guidance for Conditional Generation
- Rethinking Temporal Fusion with a Unified Gradient Descent View for 3D Semantic Occupancy Prediction
- No Thing, Nothing: Highlighting Safety-Critical Classes for Robust LiDAR Semantic Segmentation in Adverse Weather
- MetaShadow: Object-Centered Shadow Detection, Removal, and Synthesis
- A Single-Stage Admittance Control Network Based Misalignment Tolerant Inductive Power Transfer System for EV Application
- CrossOver: 3D Scene Cross-Modal Alignment
- MambaIC: State Space Models for High-Performance Learned Image Compression
- MaskGaussian: Adaptive 3D Gaussian Representation from Probabilistic Masks
- Exact: Exploring Space-Time Perceptive Clues for Weakly Supervised Satellite Image Time Series Semantic Segmentation
- Understanding multi-layered transmission matrices
- Comparative Analysis of PI and Fuzzy Logic Control in High-Efficiency Triple-Output DC-DC Converters
- Towards Explicit Geometry-Reflectance Collaboration for Generalized LiDAR Segmentation in Adverse Weather
- UCM-VeID V2: A Richer Dataset and A Pre-Training Method for UAV Cross-Modality Vehicle Re-Identification
- Deep Fair Multi-View Clustering with Attention KAN
- HyperLoRA: Parameter-Efficient Adaptive Generation for Portrait Synthesis
- Enhancing Cluster Scheduling in HPC: A Continuous Transfer Learning for Real-Time Optimization
- Soft Switched Interleaved Buck Converter for High Power Applications
- Investigation of Oscillating Micro U-tube Based Fluid Density Sensor
- CoT-VLA: Visual Chain-of-Thought Reasoning for Vision-Language-Action Models
- Learning to Detect Objects from Multi-Agent LiDAR Scans without Manual Labels
- DirectTriGS: Triplane-based Gaussian Splatting Field Representation for 3D Generation
- Dispider: Enabling Video LLMs with Active Real-Time Interaction via Disentangled Perception, Decision, and Reaction
- Dynamic Estimation of Mental Workload and Operator Accuracy for Time-Constrained Binary Classification Tasks
- FGI: Fast GNN Inference on Multi-Core Systems
- Smart Eye: A Surveillance system
- Power Quality Enhancement Using Diffusion-Probabilistic Least Mean Square Technique
- AssertionForge: Enhancing Formal Verification Assertion Generation with Structured Representation of Specifications and RTL
- Machine Learning-based Trajectory Planning for Single-loop Flatness-based Control of PMSMs
- Navigating the Unseen: Zero-shot Scene Graph Generation via Capsule-Based Equivariant Features
- Octopus: Alleviating Hallucination via Dynamic Contrastive Decoding
- NTFR: A Network Traffic Feature Reduction Method Based on Relational Analysis
- Automated Calculation of Algorithm Statement Execution Frequency Based on Abstract Syntax Tree
- MotionBench: Benchmarking and Improving Fine-Grained Video Motion Understanding for Vision Language Models
- Reduced Common Mode Voltage SVDPWM Strategy with Switching Loss Minimization in Four-Level NPC Inverter
- Digital Products Based on Large Language Models for the Exploration of Graph-Databases in Materials Science and Manufacturing
- NFC in Health Monitoring : A New Era of Medical Cards and Application
- TaskSimLF: Efficient Leader-Follower Multi-Agent Path Finding With Clustered Pickup and Delivery
- Conformal Prediction and MLLM aided Uncertainty Quantification in Scene Graph Generation
- SpiralShard: Highly Concurrent and Secure Blockchain Sharding via Linked Cross-Shard Endorsement
- Sensing depth analysis of different permittivity materials based on open-ended coaxial probes at different input powers
- Methodology for GPU Frequency Switching Latency Measurement
- AI Driven Self-Healing Cybersecurity Systems with Agentic AI for Adaptive Threat Response and Resilience
- StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text
- CoLLM: A Large Language Model for Composed Image Retrieval
- GeoDepth: From Point-to-Depth to Plane-to-Depth Modeling for Self-Supervised Monocular Depth Estimation
- CGMatch: A Different Perspective of Semi-supervised Learning
- Dyn-HaMR: Recovering 4D Interacting Hand Motion from a Dynamic Camera
- Dr. Splat: Directly Referring 3D Gaussian Splatting via Direct Language Embedding Registration
- Exploring the Implications of Digital Tools for Participatory Ergonomics: Reflections Based on Three Case Studies
- Joint retrieval of ozone profile in near-space based on the atmospheric and near infrared atmospheric bands of O 2 airglow
- DDoS Protection System for Cloud using AWS and Machine Learning
- Calico: Part-Focused Semantic Co-Segmentation with Large Vision-Language Models
- Cancer Survival Prognosis From Whole Slide Images Using Hopfield Network
- PanDA: Towards Panoramic Depth Anything with Unlabeled Panoramas and Möbius Spatial Augmentation
- Reconstructing Animals and the Wild