Xiaoyang Qu
Publications
- Turbo-TTS: Enhancing Diffusion Model TTS with an Improved ODE Solver (2025), In ICONIP2025 (CCF-C)
- EMO-RL: Emotion-Rule-Based Reinforcement Learning Enhanced Audio-Language Model for Generalized Speech Emotion Recognition (2025), In EMNLP2025 (CCF-B)
- Knowledge Distillation for Financial Large Language Models: A Systematic Review of Strategies, Applications, and Evaluation (2025), In FITEE (CCF-C, IF=2.9)
- Federated Domain Generalization with Domain-specific Soft Prompts Generation (2025), In ICCV2025 (CCF-A)
- Publicly Verifiable Private Information Retrieval Protocols Based on Function Secret Sharing (2025), In Inscrypt2025 (CCF-C)
- Hierarchical-Task-Aware Multi-modal Mixture of Incremental LoRA Experts for Embodied Continual Learning (2025), In ACL2025 (CCF-A)
- MoQAE: Mixed-Precision Quantization for Long-Context LLM Inference via Mixture of Quantization-Aware Experts (2025), In ACL2025 (CCF-A)
- RATE-Nav: Region-Aware Termination Enhancement for Zero-shot Object Navigation with Vision-Language Models (2025), In ACL2025 (CCF-A)
- BAGNet: A Boundary-Aware Graph Attention Network for 3D Point Cloud Semantic Segmentation (2025), In IJCNN2025 (CCF-C)
- Bridging the Modality Gap: Semantic-Calibrated Zero-shot Speech Emotion Captioning (2025), In IJCNN2025 (CCF-C)
- Data-free Black-box Knowledge Amalgamation (2025), In IJCNN2025 (CCF-C)
- Generalized Audio Deepfake Detection Using Frame-level Latent Information Entropy (2025), In ICME2025 (CCF-B)
- MADLLM: Multivariate Anomaly Detection via Pre-trained LLMs (2025), In ICME2025 (CCF-B)
- Rano: Restorable Speaker Anonymization via Conditional Invertible Neural Network (2025), In IJCNN2025 (CCF-C)
- Enhancing Multi-Agent Systems via Reinforcement Learning with LLM-based Planner and Graph-based Policy (2025), In ICRA2025 (CCF-B)
- CycleFlow: Leveraging Cycle Consistency in Flow Matching for Speaker Style Adaptation (2025), In ICASSP2025 (CCF-B)
- Graph Contrastive Learning with Decoupled Augmentation (2025), In ICASSP2025 (CCF-B)
- PointActionCLIP: Preventing Transfer Degradation in Point Cloud Action Recognition with a Triple-Path CLIP (2025), In ICASSP2025 (CCF-B)
- VisTa: Visual-contextual and Text-augmented Zero-shot Object-level OOD Detection (2025), In ICASSP2025 (CCF-B)
- ACCon: Angle-Compensated Contrastive Regularizer for Deep Regression (2025), In AAAI2025 (CCF-A)
- RUNA: Object-level Out-of-Distribution Detection via Regional Uncertainty Alignment of Multimodal Representations (2025), In AAAI2025 (CCF-A)
- Cocktail: Chunk-Adaptive Mixed-Precision Quantization for Long-Context LLM Inference (2025), In DATE2025 (CCF-B) (Best Paper Award)
- Incremental Label Distribution Learning With Scalable Graph Convolutional Networks (2024), In HPCC2024 (CCF-C)
- FormerReckoning: Physics Inspired Transformer for Accurate Inertial Navigation (2024), In PICASSO2024
- Beyond Aggregation: Efficient Federated Model Consolidation with Heterogeneity-Adaptive Weights Diffusion (2024), In CIKM2024 (CCF-B)
- Retrieval-Augmented Audio Deepfake Detection (2024), In ICMR2024 (CCF-B)
- Enhancing Anomalous Sound Detection with Multi-Level Memory Bank (2024), In IJCNN2024 (CCF-C)
- PRENet: A Plane-Fit Redundancy Encoding Point Cloud Sequence Network for Real-Time 3D Action Recognition (2024), In IJCNN2024 (CCF-C)
- Task-Agnostic Decision Transformer for Multi-Type Agent Control with Federated Split Training (2024), In IJCNN2024 (CCF-C)
- Gecko: Resource-Efficient and Accurate Queries in Real-Time Video Streams at the Edge (2024), In INFOCOM2024 (CCF-A)
- INCPrompt: Task-Aware Incremental Prompting for Rehearsal-Free Class-incremental Learning (2024), In ICASSP2024 (CCF-B)
- P2DT: Mitigating Forgetting in Task-Incremental Learning with Progressive Prompt Decision Transformer (2024), In ICASSP2024 (CCF-B)
- Value-Driven Mixed-Precision Quantization for Patch-Based Inference on Microcontrollers (2024), In DATE2024 (CCF-B)
- FedET: A Communication-Efficient Federated Class-Incremental Learning Framework Based on Enhanced Transformer (2023), In IJCAI2023 (CCF-A)
- GAIA: Delving into Gradient-based Attribution Abnormality for Out-of-distribution Detection (2023), In NeurIPS2023 (CCF-A)
- Shoggoth: Towards Efficient Edge-Cloud Collaborative Real-Time Video Inference via Adaptive Online Learning (2023), In DAC2023 (CCF-A)
- EdgeMA: Model Adaptation System for Real-Time Video Analytics on Edge Devices (2023), In ICONIP2023 (CCF-C)
- Feature-Rich Audio Model Inversion for Data-Free Knowledge Distillation Towards General Sound Classification (2023), In ICASSP2023 (CCF-B)
- Boosting Star-GANs for Voice Conversion with Contrastive Discriminator (2022), In ICONIP2022 (CCF-C)
- Adaptive Few-Shot Learning Algorithm for Rare Sound Event Detection (2022), In IJCNN2022 (CCF-C)
- Adaptive Sparse and Monotonic Attention for Transformer-based Automatic Speech Recognition (2022), In DSAA2022 (CCF-C)
- Blur the Linguistic Boundary: Interpreting Chinese Buddhist Sutra in English via Neural Machine Translation (2022), In ICTAI2022 (CCF-C)
- DT-SV: A Transformer-based Time-domain Approach for Speaker Verification (2022), In IJCNN2022 (CCF-C)
- Learning Invariant Representation and Risk Minimized for Unsupervised Accent Domain Adaptation (2022), In SLT2022
- Leveraging Causal Inference for Explainable Automatic Program Repair (2022), In IJCNN2022 (CCF-C)
- Pose Guided Human Image Synthesis with Partially Decoupled GAN (2022), In ACML2022 (CCF-C)
- QSpeech: Low-Qubit Quantum Speech Application Toolkit (2022), In IJCNN2022 (CCF-C)
- r-G2P: Evaluating and Enhancing Robustness of Grapheme to Phoneme Conversion by Controlled Noise Introducing and Contextual Information Incorporation (2022), In ICASSP2022 (CCF-B)
- Supervised Contrastive Meta-learning for Few-Shot Classification (2022), In HPCC2022 (CCF-C)
- Speech2Video: Cross-Modal Distillation for Speech to Video Generation (2021), In INTERSPEECH2021 (CCF-C)
- Variational Information Bottleneck for Effective Low-Resource Audio Classification (2021), In INTERSPEECH2021 (CCF-C)
- CACnet: Cube Attentional CNN for Automatic Speech Recognition (2021), In IJCNN2021 (CCF-C)
- Automatic Joint Optimization of Algorithm-Level Compression and Compiler-Based Acceleration with Reinforcement Learning for DNN in Edge Devices (2021), In IJCNN2021 (CCF-C)
- Case Study of Few-Shot Learning in Text Recognition Models (2021), In WISE2021 (CCF-C)
- Communication-Memory-Efficient Decentralized Learning For Audio Representation (2021), In IJCNN2021 (CCF-C)
- Contrastive Learning for improving End-to-end Speaker Verification (2021), In IJCNN2021 (CCF-C)
- Effective Phase Encoding for End-To-End Speaker Verification (2021), In INTERSPEECH2021 (CCF-C) (Best Student Paper Award)
- Enhancing Data-Free Adversarial Distillation with Activation Regularization and Virtual Interpolation (2021), In ICASSP2021 (CCF-B)
- Enhancing Neural Architecture Search by Upgrading Weak Components (2021), In IJCNN2021 (CCF-C)
- Federated Learning with Dynamic Transformer for Text to Speech (2021), In INTERSPEECH2021 (CCF-C)
- ICSpk: Interpretable Complex Speaker Embedding Extractor from Raw Waveform (2021), In INTERSPEECH2021 (CCF-C)
- Neural Architecture Search as Self-assessor in Semi-supervised Learning (2021), In BigData2021 (CCF-C)
- Quantum Convolutional Neural Network on Protein Distance Prediction (2021), In IJCNN2021 (CCF-C)
- When Hearing the Voice, Who Will Come to Your Mind (2021), In IJCNN2021 (CCF-C)
- 3D Point Cloud Segmentation for Complex Structure Based on PointSIFT (2020), In PRCV2020 (CCF-C)
- D-GHNAS for Joint Intent Classification and Slot Filling (2020), In APWeb-WAIM2020 (CCF-C)
- Evolutionary Algorithm Enhanced Neural Architecture Search for Text-Independent Speaker Verification (2020), In INTERSPEECH2020 (CCF-C)
- Image Compressed Sensing Using Neural Architecture Search (2020), In BigData2020
- Multi-objective Cuckoo Algorithm for Mobile Devices Network Architecture Search (2020), In ICANN2020 (CCF-C)
- ParallelNAS: A Parallel and Distributed System for Neural Architecture Search (2020), In HPCC2020 (CCF-C)
- Quantization and Knowledge Distillation for Efficient Federated Learning on Edge Devices (2020), In HPCC2020 (CCF-C)