- Research and Development of Rotating Structure Analysis Module Based on SiPESC.FEMS
- A Bias-Free Training Paradigm for More General AI-generated Image Detection
- Development of a Remote Monitoring System for Intelligent All-Electric Ships Based on 4G Network and Web Technology
- Robotic Visual Instruction
- Lux Post Facto: Learning Portrait Performance Relighting with Conditional Video Diffusion and a Hybrid Dataset
- UniScene: Unified Occupancy-centric Driving Scene Generation
- MeshGen: Generating PBR Textured Mesh with Render-Enhanced Auto-Encoder and Generative Data Augmentation
- Hierarchical Features Matter: A Deep Exploration of Progressive Parameterization Method for Dataset Distillation
- Coil-reinforced Flat Tube Actuators for Robotic Applications
- PointLoRA: Low-Rank Adaptation with Token Selection for Point Cloud Learning
- A capacity configuration strategy for marine hybrid power storage system
- SleeperMark: Towards Robust Watermark against Fine-Tuning Text-to-image Diffusion Models
- X-type Planar Winding Arrangement Having Low Intrawinding Capacitance
- Alternating Iterative Optimization with Matrix Decomposition for Calibration-Free Rotation Angle Estimation and Polarization Reconstruction
- Multi-Sensor Environmental Monitoring System for Smart Health Care
- Finer-CAM: Spotting the Difference Reveals Finer Details for Visual Explanation
- Numerical Simulation Analysis of the Direct Effects of Lightning on Electric Vehicles
- 3D-LLaVA: Towards Generalist 3D LMMs with Omni Superpoint Transformer
- DXA-Net: Dual-task Cross-lingual Alignment Network for Zero-shot Cross-lingual Spoken Language Understanding
- Smart Hybrid Battery: Integrating Active Cell Balancing and Peak Power Enhancement in Lithium-ion Batteries
- Feature Optimization-Based Multiple Instance Learning for Whole Slide Image Classification
- Sex Robots and the AI Act: Opening the Regulatory Discussion
- MLVU: Benchmarking Multi-task Long Video Understanding
- Computational Speedup of Simulated Annealing with Nested Monte Carlo Loop
- DeCLIP: Decoupled Learning for Open-Vocabulary Dense Perception
- Learning Causal Structure Distributions for Robust Planning
- UniRestore: Unified Perceptual and Task-Oriented Image Restoration Model Using Diffusion Prior
- GO-N3RDet: Geometry Optimized NeRF-enhanced 3D Object Detector
- Enhanced OoD Detection through Cross-Modal Alignment of Multi-Modal Representations
- Design and Optimization of 1-MHz 48-12 V LLC Resonant Converter with GaN Devices and FPCB Planar Transformer
- Where the Devil Hides: Deepfake Detectors Can No Longer Be Trusted
- Protection of transmission systems with Hilbert Huang Transform
- A High Step-Up Soft-Switched Converter Based on Coupled Inductor and Current-Fed Voltage Multiplier
- ChatGen: Automatic Text-to-Image Generation From FreeStyle Chatting
- Point-Cache: Test-time Dynamic and Hierarchical Cache for Robust and Generalizable Point Cloud Analysis
- Corrections to “Quantum Dot DBR Lasers Monolithically Integrated on Silicon Photonics by In-Pocket Heteroepitaxy”
- Goal-Driven building automation using serverless computing
- PACT: Pruning and Clustering-Based Token Reduction for Faster Visual Language Models
- Enhanced Contrastive Learning with Multi-view Longitudinal Data for Chest X-ray Report Generation
- Boosting Adversarial Transferability through Augmentation in Hypothesis Space
- Zero-Shot Novel View and Depth Synthesis with Multi-View Geometric Diffusion
- Identity-Clothing Similarity Modeling for Unsupervised Clothing Change Person Re-Identification
- Multi-Modal Synergistic Implicit Image Enhancement for Efficient Optical Flow Estimation
- Topological Identicality Between Converters of Otherwise Dissimilar SMPCs: Current Doubler Rectifier and Interleaved Buck Converter
- Adaptive Fault Tolerance Mechanisms of Large Language Models in Cloud Computing Environments
- Towards Human-Understandable Multi-Dimensional Concept Discovery
- Golden Cudgel Network for Real-Time Semantic Segmentation
- An End-to-End Robust Point Cloud Semantic Segmentation Network with Single-Step Conditional Diffusion Models
- Big Data-Based Analysis and Prediction Model for Disease Risk Factors
- LoRASculpt: Sculpting LoRA for Harmonizing General and Specialized Knowledge in Multimodal Large Language Models
- S4-Driver: Scalable Self-Supervised Driving Multimodal Large Language Model with Spatio-Temporal Visual Representation
- MVGenMaster: Scaling Multi-View Generation from Any Image via 3D Priors Enhanced Diffusion Model
- TopNet: Transformer-Efficient Occupancy Prediction Network for Octree-Structured Point Cloud Geometry Compression
- Coordinated Control Architecture of Dual Converters in Grid-forming DFIG Wind Turbine: Modeling, Analysis and Comparison
- Real Time Object Recognition with Voice Guided Navigation for Visually Impaired using OpenCV
- Digital Twin-Supported BESS Decision Support of RUL-Based Maintenance
- Classification Prediction Model of Students' College English Scores Based on Online Course Learning Data
- DeDe: Detecting Backdoor Samples for SSL Encoders via Decoders
- MovieBench: A Hierarchical Movie Level Dataset for Long Video Generation
- Cloud based DevOps Framework for Identifying Risk Factors of Hospital Utilization
- UMotion: Uncertainty-driven Human Motion Estimation from Inertial and Ultra-wideband Units
- Mind the Gap: Detecting Black-box Adversarial Attacks in the Making through Query Update Analysis
- LibraGrad: Balancing Gradient Flow for Universally Better Vision Transformer Attributions
- Learning Extremely High Density Crowds as Active Matters
- MonSter: Marry Monodepth to Stereo Unleashes Power
- Research on MEMS Anomaly Data Detection Model Based on IF Algorithm
- CARLA2Real: A Tool for Reducing the Sim2real Appearance Gap in CARLA Simulator
- Think Small, Act Big: Primitive Prompt Learning for Lifelong Robot Manipulation
- Pf-Sds: A Framework for Crop Yield Prediction Based on Deep Forest and Interpretability
- VeriLeaky: Navigating IP Protection vs Utility in Fine-Tuning for LLM-Driven Verilog Coding
- Learning Occlusion-Robust Vision Transformers for Real-Time UAV Tracking
- ZeroGrasp: Zero-Shot Shape Reconstruction Enabled Robotic Grasping
- Game-Theoretic Approach to Autonomous and Human-Driven Vehicle Interactions
- Toward Few-Shot Leakage Detection in Fresh Air Ducts: A CB-UNet-Enabled Data Augmentation Approach
- RipVIS: Rip Currents Video Instance Segmentation Benchmark for Beach Monitoring and Safety
- DiffLO: Semantic-Aware LiDAR Odometry with Diffusion-Based Refinement
- ATP-LLaVA: Adaptive Token Pruning for Large Vision Language Models
- LumiPane: Intelligent Interaction through Gesture Sensing and Ambient Light Communication
- Latency-Sensitive Covert Federated Learning via UAV
- Miniaturized Dual-band Circularly Polarized Full-duplex Antenna in VHF Band with Improved Radiation Efficiency and Isolation
- Improve GaN-based Green Micro-LEDs Performance with AlGaN/InGaN Multiple Quantum Wells Structure by V-pits Engineering
- Beyond Image Classification: A Video Benchmark and Dual-Branch Hybrid Discrimination Framework for Compositional Zero-Shot Learning
- Dynamic Content Prediction with Motion-aware Priors for Blind Face Video Restoration
- Explicit Depth-Aware Blurry Video Frame Interpolation Guided by Differential Curves
- Digital Voltage-Mode Control of a Mixed-Type SIMO Converter Under Time-Multiplexing Scheme
- Financing the Future: Overcoming Venture Capital Barriers for Sustainable Startups
- PID Parameter Optimization Based on an Improved Educational Competition Optimization Algorithm
- MEAT: Multiview Diffusion Model for Human Generation on Megapixels with Mesh Attention
- Cryptocurrency Market Price Prediction using Data Science and Machine Learning Techniques
- Locally Orderless Images for Optimization in Differentiable Rendering
- SegMAN: Omni-scale Context Modeling with State Space Models and Local Attention for Semantic Segmentation
- Do ImageNet-trained models learn shortcuts? The impact of frequency shortcuts on generalization
- Building a Mind Palace: Structuring Environment-Grounded Semantic Graphs for Effective Long Video Analysis with LLMs
- Optimizing Speech Emotion Recognition with Dynamic Dilation Rates for Efficient Edge Deployment
- CLIP-driven Coarse-to-fine Semantic Guidance for Fine-grained Open-set Semi-supervised Learning
- Novel Coil Pitch Configuration of Rectangular Coil for Improved Misalignment Tolerance in Wireless Charging of EVs
- AI-Assisted Liver Transplant Outcome Prediction
- ANN Development and Testing for Fault Detection, Classification, and Location in Solar Array
- MatAnyone: Stable Video Matting with Consistent Memory Propagation
- Spiking Transformer: Introducing Accurate Addition-Only Spiking Self-Attention for Transformer