Additionally, the strong dependency among in-context examples makes it an NP-hard combinatorial optimization problem and enumerating all permutations is infeasible. Hence we propose LENS, a fiLter-thEN-Search method to tackle this challenge in two stages: First we filter the dataset to obtain informative in-context examples individually. Specifically, we propose a novel metric, InfoScore, to evaluate the example's in-context informativeness based on the language model's feedback, and further propose a progressive filtering process to filter out uninformative examples. Then we propose diversity-guided example search which iteratively refines and evaluates the selected example permutations, to find examples that fully depict the task. The experimental results show that LENS significantly outperforms a wide range of baselines.

Through-the-lens metering

10.48550/arxiv.2302.13539

Cite

Citations (7)

EduChat: A Large-Scale Language Model-based Chatbot System for Intelligent Education

arXiv (Cornell University) (2023)

Yuhao Dan Zhikai Lei Yiyang Gu Yongbo Li Jianghao Yin

EduChat (https://www.educhat.top/) is a large-scale language model (LLM)-based chatbot system in the education domain. Its goal is to support personalized, fair, and compassionate intelligent education, serving teachers, students, and parents. Guided by theories from psychology and education, it further strengthens educational functions such as open question answering, essay assessment, Socratic teaching, and emotional support based on the existing basic LLMs. Particularly, we learn domain-specific knowledge by pre-training on the educational corpus and stimulate various skills with tool use by fine-tuning on designed system prompts and instructions. Currently, EduChat is available online as an open-source project, with its code, data, and model parameters available on platforms (e.g., GitHub https://github.com/icalk-nlp/EduChat, Hugging Face https://huggingface.co/ecnu-icalk ). We also prepare a demonstration of its capabilities online (https://vimeo.com/851004454). This initiative aims to promote research and applications of LLMs for intelligent education.

Chatbot

Open domain

Dialog system

Code (set theory)

10.48550/arxiv.2308.02773

Cite

Citations (15)

Enhancing EEG-to-Text Decoding through Transferable Representations from Pre-trained Contrastive EEG-Text Masked Autoencoder

Jiaqi Wang Zhenxi Song Zhengyu Ma Xipeng Qiu Min Zhang

Autoencoder

10.18653/v1/2024.acl-long.393

Cite

Citations (1)

F-Eval: Asssessing Fundamental Abilities with Refined Evaluation Methods

Yu Sun Keyuchen Keyuchen Shujie Wang Peiji Li Qipeng Guo

10.18653/v1/2024.acl-long.507

Cite

Citations (0)

Face recognition with info-margin maximization

Xipeng Qiu Youdong Miao Lide Wu

We propose face recognition method with info-margin maximization (InfoMargin) from information theoretic viewpoint. It aims to achieve a low generalization error by maximizing the information divergence between the distributions of different classes while minimizing the entropy of the distribution in each single class. Experimental results show that our method outperforms the traditional face recognition methods.

Margin (machine learning)

Maximization

Entropy maximization

Kullback–Leibler divergence

Divergence (linguistics)

10.1109/icme.2009.5202681

Cite

Citations (0)

PerturbScore: Connecting Discrete and Continuous Perturbations in NLP

Linyang Li Ke Ren Yunfan Shao Pengyu Wang Xipeng Qiu

With the rapid development of neural network applications in NLP, model robustness problem is gaining more attention. Different from computer vision, the discrete nature of texts makes it more challenging to explore robustness in NLP. Therefore, in this paper, we aim to connect discrete perturbations with continuous perturbations, therefore we can use such connections as a bridge to help understand discrete perturbations in NLP models. Specifically, we first explore how to connect and measure the correlation between discrete perturbations and continuous perturbations. Then we design a regression task as a PerturbScore to learn the correlation automatically. Through experimental results, we find that we can build a connection between discrete and continuous perturbations and use the proposed PerturbScore to learn such correlation, surpassing previous methods used in discrete perturbation measuring. Further, the proposed PerturbScore can be well generalized to different datasets, perturbation methods, indicating that we can use it as a powerful tool to study model robustness in NLP.

Robustness

10.18653/v1/2023.findings-emnlp.442

Cite

Citations (2)

Towards Efficient NLP: A Standard Evaluation and A Strong Baseline

arXiv (Cornell University) (2021)

Xiangyang Liu Tianxiang Sun Junliang He Jiawen Wu Lingling Wu

Supersized pre-trained language models have pushed the accuracy of various natural language processing (NLP) tasks to a new state-of-the-art (SOTA). Rather than pursuing the reachless SOTA accuracy, more and more researchers start paying attention on model efficiency and usability. Different from accuracy, the metric for efficiency varies across different studies, making them hard to be fairly compared. To that end, this work presents ELUE (Efficient Language Understanding Evaluation), a standard evaluation, and a public leaderboard for efficient NLP models. ELUE is dedicated to depict the Pareto Frontier for various language understanding tasks, such that it can tell whether and how much a method achieves Pareto improvement. Along with the benchmark, we also release a strong baseline, ElasticBERT, which allows BERT to exit at any layer in both static and dynamic ways. We demonstrate the ElasticBERT, despite its simplicity, outperforms or performs on par with SOTA compressed and early exiting models. With ElasticBERT, the proposed ELUE has a strong Pareto Frontier and makes a better evaluation for efficient NLP models.

Baseline (sea)

Benchmark (surveying)

10.48550/arxiv.2110.07038

Cite

Citations (15)

CycleGT: Unsupervised Graph-to-Text and Text-to-Graph Generation via Cycle Training

arXiv (Cornell University) (2020)

Qipeng Guo Zhijing Jin Xipeng Qiu Weinan Zhang David Wipf

Two important tasks at the intersection of knowledge graphs and natural language processing are graph-to-text (G2T) and text-to-graph (T2G) conversion. Due to the difficulty and high cost of data collection, the supervised data available in the two fields are usually on the magnitude of tens of thousands, for example, 18K in the WebNLG~2017 dataset after preprocessing, which is far fewer than the millions of data for other tasks such as machine translation. Consequently, deep learning models for G2T and T2G suffer largely from scarce training data. We present CycleGT, an unsupervised training method that can bootstrap from fully non-parallel graph and text data, and iteratively back translate between the two forms. Experiments on WebNLG datasets show that our unsupervised model trained on the same number of data achieves performance on par with several fully supervised models. Further experiments on the non-parallel GenWiki dataset verify that our method performs the best among unsupervised baselines. This validates our framework as an effective approach to overcome the data scarcity problem in the fields of G2T and T2G. Our code is available at https://github.com/QipengGuo/CycleGT.

Labeled data

10.48550/arxiv.2006.04702

Cite

Citations (31)