Xiaoyang Qu

Publications

Vista: Scene-Aware Optimization for Streaming Video Question Answering under Post-Hoc Queries (2025), In AAAI2026 (CCF-A)
Turbo-TTS: Enhancing Diffusion Model TTS with an Improved ODE Solver (2025), In ICONIP2025 (CCF-C)
EMO-RL: Emotion-Rule-Based Reinforcement Learning Enhanced Audio-Language Model for Generalized Speech Emotion Recognition (2025), In EMNLP2025 (CCF-B)
Knowledge Distillation for Financial Large Language Models: A Systematic Review of Strategies, Applications, and Evaluation (2025), In [J], Frontiers of Information Technology & Electronic Engineering (FITEE) (SCI IF=2.9)
Federated Domain Generalization with Domain-specific Soft Prompts Generation (2025), In ICCV2025 (CCF-A)
Publicly Verifiable Private Information Retrieval Protocols Based on Function Secret Sharing (2025), In Inscrypt2025 (CCF-C)
Hierarchical-Task-Aware Multi-modal Mixture of Incremental LoRA Experts for Embodied Continual Learning (2025), In ACL2025 (CCF-A)
MoQAE: Mixed-Precision Quantization for Long-Context LLM Inference via Mixture of Quantization-Aware Experts (2025), In ACL2025 (CCF-A)
RATE-Nav: Region-Aware Termination Enhancement for Zero-shot Object Navigation with Vision-Language Models (2025), In ACL2025 (CCF-A)
BAGNet: A Boundary-Aware Graph Attention Network for 3D Point Cloud Semantic Segmentation (2025), In IJCNN2025 (CCF-C)
Bridging the Modality Gap: Semantic-Calibrated Zero-shot Speech Emotion Captioning (2025), In IJCNN2025 (CCF-C)
Data-free Black-box Knowledge Amalgamation (2025), In IJCNN2025 (CCF-C)
Generalized Audio Deepfake Detection Using Frame-level Latent Information Entropy (2025), In ICME2025 (CCF-B)
MADLLM: Multivariate Anomaly Detection via Pre-trained LLMs (2025), In ICME2025 (CCF-B)
Rano: Restorable Speaker Anonymization via Conditional Invertible Neural Network (2025), In IJCNN2025 (CCF-C)
Enhancing Multi-Agent Systems via Reinforcement Learning with LLM-based Planner and Graph-based Policy (2025), In ICRA2025 (CCF-B)
CycleFlow: Leveraging Cycle Consistency in Flow Matching for Speaker Style Adaptation (2025), In ICASSP2025 (CCF-B)
Graph Contrastive Learning with Decoupled Augmentation (2025), In ICASSP2025 (CCF-B)
PointActionCLIP: Preventing Transfer Degradation in Point Cloud Action Recognition with a Triple-Path CLIP (2025), In ICASSP2025 (CCF-B)
VisTa: Visual-contextual and Text-augmented Zero-shot Object-level OOD Detection (2025), In ICASSP2025 (CCF-B)
ACCon: Angle-Compensated Contrastive Regularizer for Deep Regression (2025), In AAAI2025 (CCF-A)
RUNA: Object-level Out-of-Distribution Detection via Regional Uncertainty Alignment of Multimodal Representations (2025), In AAAI2025 (CCF-A)
Cocktail: Chunk-Adaptive Mixed-Precision Quantization for Long-Context LLM Inference (2025), In DATE2025 (CCF-B) (Best Paper Award)
Incremental Label Distribution Learning With Scalable Graph Convolutional Networks (2024), In HPCC2024 (CCF-C)
FormerReckoning: Physics Inspired Transformer for Accurate Inertial Navigation (2024), In PICASSO2024
Beyond Aggregation: Efficient Federated Model Consolidation with Heterogeneity-Adaptive Weights Diffusion (2024), In CIKM2024 (CCF-B)
Retrieval-Augmented Audio Deepfake Detection (2024), In ICMR2024 (CCF-B)
Enhancing Anomalous Sound Detection with Multi-Level Memory Bank (2024), In IJCNN2024 (CCF-C)
PRENet: A Plane-Fit Redundancy Encoding Point Cloud Sequence Network for Real-Time 3D Action Recognition (2024), In IJCNN2024 (CCF-C)
Task-Agnostic Decision Transformer for Multi-Type Agent Control with Federated Split Training (2024), In IJCNN2024 (CCF-C)
Gecko: Resource-Efficient and Accurate Queries in Real-Time Video Streams at the Edge (2024), In INFOCOM2024 (CCF-A)
INCPrompt: Task-Aware Incremental Prompting for Rehearsal-Free Class-incremental Learning (2024), In ICASSP2024 (CCF-B)
P2DT: Mitigating Forgetting in Task-Incremental Learning with Progressive Prompt Decision Transformer (2024), In ICASSP2024 (CCF-B)
Value-Driven Mixed-Precision Quantization for Patch-Based Inference on Microcontrollers (2024), In DATE2024 (CCF-B)
FedET: A Communication-Efficient Federated Class-Incremental Learning Framework Based on Enhanced Transformer (2023), In IJCAI2023 (CCF-A)
GAIA: Delving into Gradient-based Attribution Abnormality for Out-of-distribution Detection (2023), In NeurIPS2023 (CCF-A)
Shoggoth: Towards Efficient Edge-Cloud Collaborative Real-Time Video Inference via Adaptive Online Learning (2023), In DAC2023 (CCF-A)
EdgeMA: Model Adaptation System for Real-Time Video Analytics on Edge Devices (2023), In ICONIP2023 (CCF-C)
Feature-Rich Audio Model Inversion for Data-Free Knowledge Distillation Towards General Sound Classification (2023), In ICASSP2023 (CCF-B)
Boosting Star-GANs for Voice Conversion with Contrastive Discriminator (2022), In ICONIP2022 (CCF-C)
Adaptive Few-Shot Learning Algorithm for Rare Sound Event Detection (2022), In IJCNN2022 (CCF-C)
Adaptive Sparse and Monotonic Attention for Transformer-based Automatic Speech Recognition (2022), In DSAA2022 (CCF-C)
Blur the Linguistic Boundary: Interpreting Chinese Buddhist Sutra in English via Neural Machine Translation (2022), In ICTAI2022 (CCF-C)
DT-SV: A Transformer-based Time-domain Approach for Speaker Verification (2022), In IJCNN2022 (CCF-C)
Learning Invariant Representation and Risk Minimized for Unsupervised Accent Domain Adaptation (2022), In SLT2022
Leveraging Causal Inference for Explainable Automatic Program Repair (2022), In IJCNN2022 (CCF-C)
Pose Guided Human Image Synthesis with Partially Decoupled GAN (2022), In ACML2022 (CCF-C)
QSpeech: Low-Qubit Quantum Speech Application Toolkit (2022), In IJCNN2022 (CCF-C)
r-G2P: Evaluating and Enhancing Robustness of Grapheme to Phoneme Conversion by Controlled Noise Introducing and Contextual Information Incorporation (2022), In ICASSP2022 (CCF-B)
Supervised Contrastive Meta-learning for Few-Shot Classification (2022), In HPCC2022 (CCF-C)
Speech2Video: Cross-Modal Distillation for Speech to Video Generation (2021), In INTERSPEECH2021 (CCF-C)
Variational Information Bottleneck for Effective Low-Resource Audio Classification (2021), In INTERSPEECH2021 (CCF-C)
CACnet: Cube Attentional CNN for Automatic Speech Recognition (2021), In IJCNN2021 (CCF-C)
Automatic Joint Optimization of Algorithm-Level Compression and Compiler-Based Acceleration with Reinforcement Learning for DNN in Edge Devices (2021), In IJCNN2021 (CCF-C)
Case Study of Few-Shot Learning in Text Recognition Models (2021), In WISE2021 (CCF-C)
Communication-Memory-Efficient Decentralized Learning For Audio Representation (2021), In IJCNN2021 (CCF-C)
Contrastive Learning for improving End-to-end Speaker Verification (2021), In IJCNN2021 (CCF-C)
Effective Phase Encoding for End-To-End Speaker Verification (2021), In INTERSPEECH2021 (CCF-C) (Best Student Paper Award)
Enhancing Data-Free Adversarial Distillation with Activation Regularization and Virtual Interpolation (2021), In ICASSP2021 (CCF-B)
Enhancing Neural Architecture Search by Upgrading Weak Components (2021), In IJCNN2021 (CCF-C)
Federated Learning with Dynamic Transformer for Text to Speech (2021), In INTERSPEECH2021 (CCF-C)
ICSpk: Interpretable Complex Speaker Embedding Extractor from Raw Waveform (2021), In INTERSPEECH2021 (CCF-C)
Neural Architecture Search as Self-assessor in Semi-supervised Learning (2021), In BigData2021 (CCF-C)
Quantum Convolutional Neural Network on Protein Distance Prediction (2021), In IJCNN2021 (CCF-C)
When Hearing the Voice, Who Will Come to Your Mind (2021), In IJCNN2021 (CCF-C)
3D Point Cloud Segmentation for Complex Structure Based on PointSIFT (2020), In PRCV2020 (CCF-C)
D-GHNAS for Joint Intent Classification and Slot Filling (2020), In APWeb-WAIM2020 (CCF-C)
Evolutionary Algorithm Enhanced Neural Architecture Search for Text-Independent Speaker Verification (2020), In INTERSPEECH2020 (CCF-C)
Image Compressed Sensing Using Neural Architecture Search (2020), In BigData2020
Multi-objective Cuckoo Algorithm for Mobile Devices Network Architecture Search (2020), In ICANN2020 (CCF-C)
ParallelNAS: A Parallel and Distributed System for Neural Architecture Search (2020), In HPCC2020 (CCF-C)
Quantization and Knowledge Distillation for Efficient Federated Learning on Edge Devices (2020), In HPCC2020 (CCF-C)