Large language models (LLMs) like ChatGPT and GPT-4 have attracted great attention given their surprising performance on a wide range of NLP tasks. Length-controlled generation with LLMs has emerged as an important topic, enabling users to fully leverage the capability of LLMs in more real-world scenarios, such as generating an answer or essay of a desired length. In addition, autoregressive generation in LLMs is extremely time-consuming, and the ability to control the generated length can reduce inference cost by limiting the output length. We therefore propose a prompt-based length-control method to achieve high-accuracy length-controlled generation. In particular, we adopt reinforcement learning with a reward signal given by either trainable or rule-based reward models, which further enhances the length-control ability of LLMs by rewarding outputs that follow the pre-defined control instructions. To enable rule-based inference, we also introduce a standard prompt extractor that collects the standard control information from users' input. Experiments show that our method significantly improves the accuracy of prompt-based length control for the summarization task on popular datasets such as CNNDM and NYT. Both the standard prompt extractor and the RL-tuned model show strong generalization to unseen control-prompt templates.
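The abstract does not give the form of the rule-based reward; as a minimal sketch, assuming the control instruction extracted by the prompt extractor is of the type "at most / at least / about N words", such a reward might look like the following (function and parameter names are hypothetical, not from the paper):

```python
def length_reward(output_text: str, control: dict, tolerance: int = 5) -> float:
    """Hypothetical rule-based reward for length control.

    `control` is assumed to be the output of a standard prompt extractor,
    e.g. {"relation": "at most", "target": 50} (target length in words).
    Returns 1.0 when the constraint is satisfied, otherwise a value that
    decreases with the size of the violation.
    """
    n = len(output_text.split())                 # generated length in words
    target, relation = control["target"], control["relation"]

    if relation == "at most":
        violation = max(0, n - target)
    elif relation == "at least":
        violation = max(0, target - n)
    else:  # "about": allow a small tolerance around the target
        violation = max(0, abs(n - target) - tolerance)

    # Smoothly map the violation to a reward in (0, 1]
    return 1.0 / (1.0 + violation)
```

A reward of this shape gives the RL objective a dense signal, so outputs that only slightly miss the requested length are still preferred over ones that miss it badly.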
This correspondence presents a coarse-to-fine binary-image-thinning algorithm based on a proposed template-based pulse-coupled neural-network (PCNN) model. Under the control of coupled templates, the algorithm iteratively skeletonizes a binary image by changing the load signals of the pulse neurons. A direction-constraining scheme for avoiding fingerprint ridge spikes is also discussed. Experiments show that the algorithm is effective for thinning fingerprints as well as other common images. Moreover, it can be coupled with a fingerprint identification system to improve recognition performance.
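The abstract does not specify the template-based coupling or the load-signal update used for thinning; as a rough sketch of the underlying pulse-coupled neuron dynamics only (the standard discrete PCNN equations, not the paper's thinning scheme), one iteration might be implemented as:

```python
import numpy as np
from scipy.ndimage import convolve

def pcnn_step(S, F, L, E, Y, beta=0.2, aF=0.1, aL=0.3, aE=0.2,
              VF=0.5, VL=0.5, VE=10.0):
    """One standard discrete PCNN iteration (illustrative only).

    S: stimulus (e.g. the binary image as floats), F/L: feeding and
    linking inputs, E: dynamic threshold, Y: binary pulse output.
    The 3x3 kernel here plays the role of a coupling template.
    """
    K = np.array([[0.5, 1.0, 0.5],
                  [1.0, 0.0, 1.0],
                  [0.5, 1.0, 0.5]])
    F = np.exp(-aF) * F + VF * convolve(Y, K, mode="constant") + S
    L = np.exp(-aL) * L + VL * convolve(Y, K, mode="constant")
    U = F * (1.0 + beta * L)           # internal activity
    Y = (U > E).astype(float)          # neurons fire when activity exceeds threshold
    E = np.exp(-aE) * E + VE * Y       # fired neurons raise their own threshold
    return F, L, E, Y
```

In the paper's setting, the coupling templates and load signals would additionally steer which boundary neurons are allowed to fire at each pass, so that firing strips pixels layer by layer until only the skeleton remains.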
Yichun Yin, Cheng Chen, Lifeng Shang, Xin Jiang, Xiao Chen, Qun Liu. Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers). 2021.
Zhiqi Huang, Lu Hou, Lifeng Shang, Xin Jiang, Xiao Chen, Qun Liu. Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers). 2021.
A generalized Gaussian process model (GGPM) is a unifying framework that encompasses many existing Gaussian process (GP) models, such as GP regression, classification, and counting. In the GGPM framework, the observation likelihood of the GP model is itself parameterized using the exponential family distribution (EFD). In this paper, we consider efficient algorithms for approximate inference on GGPMs using the general form of the EFD. A particular GP model and its associated inference algorithms can then be formed by changing the parameters of the EFD, thus greatly simplifying the creation of GP models for task-specific output domains. We demonstrate the efficacy of this framework by creating several new GP models for regressing to non-negative reals and to real intervals. We also consider a closed-form Taylor approximation for efficient inference on GGPMs, and elaborate on its connections with other model-specific heuristic closed-form approximations. Finally, we present a comprehensive set of experiments to compare approximate inference algorithms on a wide variety of GGPMs.
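The abstract invokes the general form of the EFD without stating it; in its standard textbook form (the notation here is an assumption, not necessarily the paper's), the observation likelihood would be parameterized as

```latex
% Standard exponential-family form assumed for the observation likelihood:
p(y \mid \theta, \phi)
  = \exp\!\left( \frac{y\,\theta - b(\theta)}{a(\phi)} + c(y, \phi) \right),
\qquad
\mathbb{E}[y] = b'(\theta),
\quad
\operatorname{Var}[y] = a(\phi)\, b''(\theta).
```

Choosing the functions a, b, c then recovers familiar GP models, e.g. a Gaussian likelihood with b(theta) = theta^2 / 2 and a(phi) = sigma^2 gives GP regression, while a Poisson likelihood with b(theta) = exp(theta) gives GP counting, which is the sense in which a specific model is "formed by changing the parameters of the EFD".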
Chaofan Tao, Lu Hou, Wei Zhang, Lifeng Shang, Xin Jiang, Qun Liu, Ping Luo, Ngai Wong. Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). 2022.
In this paper, we study a new graph learning problem: learning to count subgraph isomorphisms. Unlike traditional graph learning problems such as node classification and link prediction, subgraph isomorphism counting is NP-complete and requires global reasoning over the whole graph. To make it scalable to large graphs and patterns, we propose a learning framework that augments different representation learning architectures and iteratively attends over the pattern and the target data graph to memorize intermediate states of the subgraph isomorphism search for global counting. We construct both a small-graph set (<= 1,024 subgraph isomorphisms per instance) and a large-graph set (<= 4,096 subgraph isomorphisms per instance) to evaluate different representation and interaction modules. A mutagenic compound dataset, MUTAG, is also used to evaluate the neural models and demonstrate the success of transfer learning. While the learning-based approach is inexact, it generalizes to counting on large patterns and data graphs in linear time, compared to the exponential time of the original NP-complete problem. Experimental results show that learning-based subgraph isomorphism counting can speed up the traditional VF2 algorithm by 10-1,000 times with acceptable errors. Domain adaptation based on fine-tuning further demonstrates the usefulness of our approach in real-world applications.
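The abstract only outlines the framework; as a minimal sketch of "iteratively attending pattern and target data graphs" for count regression, assuming precomputed node encodings and a single cross-attention memory (module and parameter names are hypothetical, not the paper's architecture):

```python
import torch
import torch.nn as nn

class IterativeCountSketch(nn.Module):
    """Illustrative sketch: encode pattern and data graph, repeatedly attend
    from the graph to the pattern, and regress a non-negative count."""

    def __init__(self, dim: int = 128, steps: int = 3):
        super().__init__()
        self.steps = steps
        self.attn = nn.MultiheadAttention(dim, num_heads=4, batch_first=True)
        self.update = nn.GRUCell(dim, dim)
        self.readout = nn.Sequential(nn.Linear(dim, dim), nn.ReLU(),
                                     nn.Linear(dim, 1), nn.Softplus())

    def forward(self, graph_repr, pattern_repr):
        # graph_repr: (B, N, dim) node encodings of the data graph
        # pattern_repr: (B, M, dim) node encodings of the pattern
        h = graph_repr
        B, N, D = h.shape
        for _ in range(self.steps):
            # attend to the pattern to gather matching evidence
            ctx, _ = self.attn(h, pattern_repr, pattern_repr)
            # memorize intermediate matching states with a recurrent update
            h = self.update(ctx.reshape(B * N, D),
                            h.reshape(B * N, D)).reshape(B, N, D)
        return self.readout(h.mean(dim=1))   # predicted isomorphism count
```

The recurrent update is one simple way to realize the "memorize intermediate states" idea, and the Softplus readout keeps the predicted count non-negative; inference cost is linear in the number of nodes per attention step, in contrast to the exponential worst case of exact search.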