Xiaoyang Qu

Publications

  1. Turbo-TTS: Enhancing Diffusion Model TTS with an Improved ODE Solver (2025), In ICONIP2025 (CCF-C)
  2. EMO-RL: Emotion-Rule-Based Reinforcement Learning Enhanced Audio-Language Model for Generalized Speech Emotion Recognition (2025), In EMNLP2025 (CCF-B)
  3. Knowledge Distillation for Financial Large Language Models: A Systematic Review of Strategies, Applications, and Evaluation (2025), In FITEE (CCF-C, IF=2.9)
  4. Federated Domain Generalization with Domain-specific Soft Prompts Generation (2025), In ICCV2025 (CCF-A)
  5. Publicly Verifiable Private Information Retrieval Protocols Based on Function Secret Sharing (2025), In Inscrypt2025 (CCF-C)
  6. Hierarchical-Task-Aware Multi-modal Mixture of Incremental LoRA Experts for Embodied Continual Learning (2025), In ACL2025 (CCF-A)
  7. MoQAE: Mixed-Precision Quantization for Long-Context LLM Inference via Mixture of Quantization-Aware Experts (2025), In ACL2025 (CCF-A)
  8. RATE-Nav: Region-Aware Termination Enhancement for Zero-shot Object Navigation with Vision-Language Models (2025), In ACL2025 (CCF-A)
  9. BAGNet: A Boundary-Aware Graph Attention Network for 3D Point Cloud Semantic Segmentation (2025), In IJCNN2025 (CCF-C)
  10. Bridging the Modality Gap: Semantic-Calibrated Zero-shot Speech Emotion Captioning (2025), In IJCNN2025 (CCF-C)
  11. Data-free Black-box Knowledge Amalgamation (2025), In IJCNN2025 (CCF-C)
  12. Generalized Audio Deepfake Detection Using Frame-level Latent Information Entropy (2025), In ICME2025 (CCF-B)
  13. MADLLM: Multivariate Anomaly Detection via Pre-trained LLMs (2025), In ICME2025 (CCF-B)
  14. Rano: Restorable Speaker Anonymization via Conditional Invertible Neural Network (2025), In IJCNN2025 (CCF-C)
  15. Enhancing Multi-Agent Systems via Reinforcement Learning with LLM-based Planner and Graph-based Policy (2025), In ICRA2025 (CCF-B)
  16. CycleFlow: Leveraging Cycle Consistency in Flow Matching for Speaker Style Adaptation (2025), In ICASSP2025 (CCF-B)
  17. Graph Contrastive Learning with Decoupled Augmentation (2025), In ICASSP2025 (CCF-B)
  18. PointActionCLIP: Preventing Transfer Degradation in Point Cloud Action Recognition with a Triple-Path CLIP (2025), In ICASSP2025 (CCF-B)
  19. VisTa: Visual-contextual and Text-augmented Zero-shot Object-level OOD Detection (2025), In ICASSP2025 (CCF-B)
  20. ACCon: Angle-Compensated Contrastive Regularizer for Deep Regression (2025), In AAAI2025 (CCF-A)
  21. RUNA: Object-level Out-of-Distribution Detection via Regional Uncertainty Alignment of Multimodal Representations (2025), In AAAI2025 (CCF-A)
  22. Cocktail: Chunk-Adaptive Mixed-Precision Quantization for Long-Context LLM Inference (2025), In DATE2025 (CCF-B) (Best Paper Award)
  23. Incremental Label Distribution Learning With Scalable Graph Convolutional Networks (2024), In HPCC2024 (CCF-C)
  24. FormerReckoning: Physics Inspired Transformer for Accurate Inertial Navigation (2024), In PICASSO2024
  25. Beyond Aggregation: Efficient Federated Model Consolidation with Heterogeneity-Adaptive Weights Diffusion (2024), In CIKM2024 (CCF-B)
  26. Retrieval-Augmented Audio Deepfake Detection (2024), In ICMR2024 (CCF-B)
  27. Enhancing Anomalous Sound Detection with Multi-Level Memory Bank (2024), In IJCNN2024 (CCF-C)
  28. PRENet: A Plane-Fit Redundancy Encoding Point Cloud Sequence Network for Real-Time 3D Action Recognition (2024), In IJCNN2024 (CCF-C)
  29. Task-Agnostic Decision Transformer for Multi-Type Agent Control with Federated Split Training (2024), In IJCNN2024 (CCF-C)
  30. Gecko: Resource-Efficient and Accurate Queries in Real-Time Video Streams at the Edge (2024), In INFOCOM2024 (CCF-A)
  31. INCPrompt: Task-Aware Incremental Prompting for Rehearsal-Free Class-incremental Learning (2024), In ICASSP2024 (CCF-B)
  32. P2DT: Mitigating Forgetting in Task-Incremental Learning with Progressive Prompt Decision Transformer (2024), In ICASSP2024 (CCF-B)
  33. Value-Driven Mixed-Precision Quantization for Patch-Based Inference on Microcontrollers (2024), In DATE2024 (CCF-B)
  34. FedET: A Communication-Efficient Federated Class-Incremental Learning Framework Based on Enhanced Transformer (2023), In IJCAI2023 (CCF-A)
  35. GAIA: Delving into Gradient-based Attribution Abnormality for Out-of-distribution Detection (2023), In NeurIPS2023 (CCF-A)
  36. Shoggoth: Towards Efficient Edge-Cloud Collaborative Real-Time Video Inference via Adaptive Online Learning (2023), In DAC2023 (CCF-A)
  37. EdgeMA: Model Adaptation System for Real-Time Video Analytics on Edge Devices (2023), In ICONIP2023 (CCF-C)
  38. Feature-Rich Audio Model Inversion for Data-Free Knowledge Distillation Towards General Sound Classification (2023), In ICASSP2023 (CCF-B)
  39. Boosting Star-GANs for Voice Conversion with Contrastive Discriminator (2022), In ICONIP2022 (CCF-C)
  40. Adaptive Few-Shot Learning Algorithm for Rare Sound Event Detection (2022), In IJCNN2022 (CCF-C)
  41. Adaptive Sparse and Monotonic Attention for Transformer-based Automatic Speech Recognition (2022), In DSAA2022 (CCF-C)
  42. Blur the Linguistic Boundary: Interpreting Chinese Buddhist Sutra in English via Neural Machine Translation (2022), In ICTAI2022 (CCF-C)
  43. DT-SV: A Transformer-based Time-domain Approach for Speaker Verification (2022), In IJCNN2022 (CCF-C)
  44. Learning Invariant Representation and Risk Minimized for Unsupervised Accent Domain Adaptation (2022), In SLT2022
  45. Leveraging Causal Inference for Explainable Automatic Program Repair (2022), In IJCNN2022 (CCF-C)
  46. Pose Guided Human Image Synthesis with Partially Decoupled GAN (2022), In ACML2022 (CCF-C)
  47. QSpeech: Low-Qubit Quantum Speech Application Toolkit (2022), In IJCNN2022 (CCF-C)
  48. r-G2P: Evaluating and Enhancing Robustness of Grapheme to Phoneme Conversion by Controlled Noise Introducing and Contextual Information Incorporation (2022), In ICASSP2022 (CCF-B)
  49. Supervised Contrastive Meta-learning for Few-Shot Classification (2022), In HPCC2022 (CCF-C)
  50. Speech2Video: Cross-Modal Distillation for Speech to Video Generation (2021), In INTERSPEECH2021 (CCF-C)
  51. Variational Information Bottleneck for Effective Low-Resource Audio Classification (2021), In INTERSPEECH2021 (CCF-C)
  52. CACnet: Cube Attentional CNN for Automatic Speech Recognition (2021), In IJCNN2021 (CCF-C)
  53. Automatic Joint Optimization of Algorithm-Level Compression and Compiler-Based Acceleration with Reinforcement Learning for DNN in Edge Devices (2021), In IJCNN2021 (CCF-C)
  54. Case Study of Few-Shot Learning in Text Recognition Models (2021), In WISE2021 (CCF-C)
  55. Communication-Memory-Efficient Decentralized Learning For Audio Representation (2021), In IJCNN2021 (CCF-C)
  56. Contrastive Learning for improving End-to-end Speaker Verification (2021), In IJCNN2021 (CCF-C)
  57. Effective Phase Encoding for End-To-End Speaker Verification (2021), In INTERSPEECH2021 (CCF-C) (Best Student Paper Award)
  58. Enhancing Data-Free Adversarial Distillation with Activation Regularization and Virtual Interpolation (2021), In ICASSP2021 (CCF-B)
  59. Enhancing Neural Architecture Search by Upgrading Weak Components (2021), In IJCNN2021 (CCF-C)
  60. Federated Learning with Dynamic Transformer for Text to Speech (2021), In INTERSPEECH2021 (CCF-C)
  61. ICSpk: Interpretable Complex Speaker Embedding Extractor from Raw Waveform (2021), In INTERSPEECH2021 (CCF-C)
  62. Neural Architecture Search as Self-assessor in Semi-supervised Learning (2021), In BigData2021 (CCF-C)
  63. Quantum Convolutional Neural Network on Protein Distance Prediction (2021), In IJCNN2021 (CCF-C)
  64. When Hearing the Voice, Who Will Come to Your Mind (2021), In IJCNN2021 (CCF-C)
  65. 3D Point Cloud Segmentation for Complex Structure Based on PointSIFT (2020), In PRCV2020 (CCF-C)
  66. D-GHNAS for Joint Intent Classification and Slot Filling (2020), In APWeb-WAIM2020 (CCF-C)
  67. Evolutionary Algorithm Enhanced Neural Architecture Search for Text-Independent Speaker Verification (2020), In INTERSPEECH2020 (CCF-C)
  68. Image Compressed Sensing Using Neural Architecture Search (2020), In BigData2020
  69. Multi-objective Cuckoo Algorithm for Mobile Devices Network Architecture Search (2020), In ICANN2020 (CCF-C)
  70. ParallelNAS: A Parallel and Distributed System for Neural Architecture Search (2020), In HPCC2020 (CCF-C)
  71. Quantization and Knowledge Distillation for Efficient Federated Learning on Edge Devices (2020), In HPCC2020 (CCF-C)

Chinese Journal Articles

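  1. An Overview of AI-Generated Content Technologies (2025), In 《大数据》 (Big Data Research) (CCF-T2)
  2. A Robust Audio Watermarking Method Based on Invertible Network Dual Embedding and an Attack Layer (2025), In 《大数据》 (Big Data Research) (CCF-T2)
  3. Research Progress and Prospects of Embodied Agents Based on Multimodal Large Models (2025), In 《大数据》 (Big Data Research) (CCF-T2)
  4. Research on Large-Model-Based Task Planning for Embodied Intelligence: From Single-Agent to Multi-Agent (2025), In 《大数据》 (Big Data Research) (CCF-T2)
  5. An End-to-End Seismic Wave Denoising Method Based on Deep Convolution and Self-Attention (2025), In 《大数据》 (Big Data Research) (CCF-T2)
  6. A Survey of Optimization Techniques for Long-Text Inference in Large Language Models (2025), In 《大数据》 (Big Data Research) (CCF-T2)
  7. A Survey of Deepfake Audio Generation and Detection Techniques (2025), In 《大数据》 (Big Data Research) (CCF-T2)
  8. Deep Graph Representation Learning: Methods, Applications, and Challenges (2025), In 《大数据》 (Big Data Research) (CCF-T2)
  9. The Generalization Problem in Video Deepfake Detection: Methods, Challenges, and Technical Advances (2025), In 《大数据》 (Big Data Research) (CCF-T2)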

Events