Leveraging Biases in Large Language Models: bias-kNN for Effective Few-Shot Learning

Yong Zhang, Hanzhang Li, Zhitao Li, Ning Cheng, Ming Li, Jing Xiao, Jianzong Wang

January 2024

The architecture of our proposed model

Abstract

Large Language Models (LLMs) have shown significant promise in various applications, including zero-shot and few-shot learning. However, their performance can be hampered by inherent biases. Instead of traditionally sought methods that aim to minimize or correct these biases, this study introduces a novel methodology named bias-kNN. This approach capitalizes on the biased outputs, harnessing them as primary features for kNN and supplementing with gold labels. Our comprehensive evaluations, spanning diverse domain text classification datasets and different GPT-2 model sizes, indicate the adaptability and efficacy of the bias-kNN method. Remarkably, this approach not only outperforms conventional in-context learning in few-shot scenarios but also demonstrates robustness across a spectrum of samples, templates and verbalizers. This study, therefore, presents a unique perspective on harnessing biases, transforming them into assets for enhanced model performance.

Type

Publication

In 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing

Click the Cite button above to demo the feature to enable visitors to import publication metadata into their reference management software.

LLM

Leveraging Biases in Large Language Models: bias-kNN for Effective Few-Shot Learning

Abstract

Jianzong Wang

Honorary Director