- Textured Gaussians for Enhanced 3D Scene Appearance Modeling
- Exploration of LLM Lossless Compression on Scientific Data
- Coupled Electromagnetic and Heat Analysis of a ZnO Disk with the FDTD Method in the 2D Cylindrical Coordinate System
- IM-Portrait: Learning 3D-aware Video Diffusion for Photorealistic Talking Heads from Monocular Videos
- Observations of Rocket Triggered Lightning Discharge in Winter Thunderstorms in Japan Using a Broadband VHF Interferometer
- Multi-Scale Perceptual Learning for Skin Lesion Image Segmentation
- Study of Current Control for High-Speed Motor Drive Systems
- Diabetes Prediction Model Based on SVM Optimized with RF Feature Selection and GWO
- Research on Automatic Extraction of Key Parameters of Lightning Risk Assessment Based on Laser Point Cloud of Transmission Line
- DATAWiSE: A Scalable Big Data Reference Architecture for Smart Building
- VideoDirector: Precise Video Editing via Text-to-Video Models
- Ref-GS: Directional Factorization for 2D Gaussian Splatting
- SAIST: Segment Any Infrared Small Target Model Guided by Contrastive Language-Image Pretraining
- All-directional Disparity Estimation for Real-world QPD Images
- Reconstructing Humans with a Biomechanically Accurate Skeleton
- Study on the Characteristics of Intense Lightning Activity During a Spring Severe Thunderstorm Process in Southern China
- Perception Tokens Enhance Visual Reasoning in Multimodal Language Models
- CustAny: Customizing Anything from A Single Example
- Enhancing Online Continual Learning with Plug-and-Play State Space Model and Class-Conditional Mixture of Discretization
- Improving LLM-Powered EDA Assistants with RAFT
- Argus: A Compact and Versatile Foundation Model for Vision
- A Closer Look at Time Steps is Worthy of Triple Speed-Up for Diffusion Model Training
- M3amba: Memory Mamba is All You Need for Whole Slide Image Classification
- Characterizing the Influence of Circuit Parasitics and Operating Conditions on a Passive Regenerative Snubber for Phase-Shifted Full-Bridge Converter
- Relative Pose Estimation through Affine Corrections of Monocular Depth Priors
- SphereUFormer: A U-Shaped Transformer for Spherical 360 Perception
- Green Edge Computing Based IoV Dynamic Task Collaborative Strategy
- Vehicle Routing Incorporating Implicit Preferences: An Omnidimensional Human–Algorithm Collaboration Approach
- Any6D: Model-free 6D Pose Estimation of Novel Objects
- Cheb-GR: Rethinking k-nearest neighbor search in Re-ranking for Person Re-identification
- DDIP: Mutual-Regularized Dual Deep Image Prior for Self-Supervised Compressive Spectral Imaging
- AdaMMS: Model Merging for Heterogeneous Multimodal Large Language Models with Unsupervised Coefficient Optimization
- Hardware-Rasterized Ray-Based Gaussian Splatting
- Control of Integrated Magnetic-Based Active Harmonic Filter for Three-Phase Standalone Application
- Development and Deployment of a Genomic Cancer Data Extraction Pipeline on the Cloud
- OSV: One Step is Enough for High-Quality Image to Video Generation
- COSMIC: Clique-Oriented Semantic Multi-space Integration for Robust CLIP Test-Time Adaptation
- Directional Label Diffusion Model for Learning from Noisy Labels
- AdMiT: Adaptive Multi-Source Tuning in Dynamic Environments
- Design2GarmentCode: Turning Design Concepts to Tangible Garments Through Program Synthesis
- Fine-Grained Skin Wound Segmentation Based on Machine Learning with Scribble Annotations
- Combined Model for P-S or S-P Configured Lithium-ion Batteries and Equalization Electronics for Spacecraft
- RayFlow: Instance-Aware Diffusion Acceleration via Adaptive Flow Trajectories
- ImViD: Immersive Volumetric Videos for Enhanced VR Engagement
- Reanimating Images using Neural Representations of Dynamic Stimuli
- Gen-AI in a Bottle: Experiments with LLMs to Generate HPC Kernels
- Assembly of FETI dual operator using CUDA
- SketchFusion: Learning Universal Sketch Features through Fusing Foundation Models
- GoLF-NRT: Integrating Global Context and Local Geometry for Few-Shot View Synthesis *
- A Performance Comparison of Chiller Models for Energy Optimization in Commercial Buildings
- Industrial Digitalization in the Metaverse: Enhancing Technical Services Through Immersive Technologies
- CoA: Towards Real Image Dehazing via Compression-and-Adaptation
- Exploiting CoRSMA-ISAC in Multi-UAV System for Emergency Response
- Industrial Machine Data Generation and Artificial Optimisation for Blow Molding Extrusion Machines
- Multi-Agent Hierarchical Deep Reinforcement Learning for HVAC Control With Flexible DERs
- Adversarial Domain Prompt Tuning and Generation for Single Domain Generalization
- Two is Better than One: Efficient Ensemble Defense for Robust and Compact Models
- SecureBERT and L lama 2 Empowered Control Area Network Intrusion Detection and Classification
- Research on Safety Design of Waste Heat Recovery System (WHR) for Ships
- Impedance Analysis of Dimmable LED Lighting and Its Impact on Residential Distribution Grids
- Synthetic Visual Genome
- Volumetric Surfaces: Representing Fuzzy Geometries with Layered Meshes
- CMMLoc: Advancing Text-to-PointCloud Localization with Cauchy-Mixture-Model Based Framework
- ESC: Erasing Space Concept for Knowledge Deletion
- Embracing Collaboration Over Competition: Condensing Multiple Prompts for Visual In-Context Learning
- Heuristic Optimization Strategies for Reliable Amplifier Reconfiguration in Autonomous Optical Networks with Field-trial Validation
- Experimental Evaluation of Efficiency and Power Distribution Control by 3-Level Inverter Drive for DC-inputs Direct Electric Power Converter (D-EPC)
- Efficient Motion-Aware Video MLLM
- Predictive Current Control of a Three-Level Multi-Modular NPC Converter With Mutual Error Compensation and Fault Tolerance
- Quantum Multi-Proxy Signature Scheme Based on W State
- HLS-Eval: A Benchmark and Framework for Evaluating LLMs on High-Level Synthesis Design Tasks
- DTGBrepGen: A Novel B-rep Generative Model through Decoupling Topology and Geometry
- MExD: An Expert-Infused Diffusion Model for Whole-Slide Image Classification
- Shape Abstraction via Marching Differentiable Support Functions
- A Non-Isolated Hybrid Switched-Capacitor Network Based High-Gain Quadratic DC-DC Boost Converter
- Learning Heterogeneous Tissues with Mixture of Experts for Gigapixel Whole Slide Images
- Self-Learning Hyperspectral and Multispectral Image Fusion via Adaptive Residual Guided Subspace Diffusion Model
- BlenderGym: Benchmarking Foundational Model Systems for Graphics Editing
- EnvGS: Modeling View-Dependent Appearance with Environment Gaussian
- Multi-View Clustering Model Based on Self-Supervised Comparative Learning for Bank Customer Segmentation and Risk Assessment
- Scaling Properties of Diffusion Models For Perceptual Tasks
- RCP-Bench: Benchmarking Robustness for Collaborative Perception Under Diverse Corruptions
- SkySense-O: Towards Open-World Remote Sensing Interpretation with Vision-Centric Visual-Language Modeling
- A Semantic Knowledge Complementarity based Decoupling Framework for Semi-supervised Class-imbalanced Medical Image Segmentation
- Balanced Direction from Multifarious Choices: Arithmetic Meta-Learning for Domain Generalization
- EA-HAS-Bench and Language-Enhanced Shrinkage Search for Energy-aware NAS
- RDD: Robust Feature Detector and Descriptor using Deformable Transformer
- PoseBH: Prototypical Multi-Dataset Training Beyond Human Pose Estimation
- A Data-Selection Framework for Data-Efficient Battery Parameter Estimation
- SAFE: Semantic Adaptive Feature Extraction with Rate Control for 6G Wireless Communications
- GeoAvatar: Geometrically-Consistent Multi-Person Avatar Reconstruction from Sparse Multi-View Videos
- DiffusionSfM: Predicting Structure and Motion via Ray Origin and Endpoint Diffusion
- Thermochromic Temperature Measurement: Towards an Alternative to Thermal Cameras
- Color Alignment in Diffusion
- A Simple Tiled Approach to Teaching Parallel Computing
- DFM: Differentiable Feature Matching for Anomaly Detection
- MegaSynth: Scaling Up 3D Scene Reconstruction with Synthesized Data
- FreqDebias: Towards Generalizable Deepfake Detection via Consistency-Driven Frequency Debiasing
- Augmenting Perceptual Super-Resolution via Image Quality Predictors
- FADE: Frequency-Aware Diffusion Model Factorization for Video Editing