Qi Zhang

Fudan University

Author Statistics

Papers

Citation

H-Index

i-10 index

Research Field

Discourse Relations Detection via a Mixed Generative-Discriminative Framework

Proceedings of the AAAI Conference on Artificial Intelligence (2016)

Jifan Chen Qi Zhang Pengfei Liu Xuanjing Huang

Word embeddings, which can better capture the fine-grained semantics of words, have proven to be useful for a variety of natural language processing tasks. However, because discourse structures describe the relationships between segments of discourse, word embeddings cannot be directly integrated to perform the task. In this paper, we introduce a mixed generative-discriminative framework, in which we use vector offsets between embeddings of words to represent the semantic relations between text segments and Fisher kernel framework to convert a variable number of vector offsets into a fixed length vector. In order to incorporate the weights of these offsets into the vector, we also propose the Weighted Fisher Vector. Experimental results on two different datasets show that the proposed method without using manually designed features can achieve better performance on recognizing the discourse level relations in most cases.

Discriminative model

Fisher kernel

Kernel (algebra)

Distributional semantics

Generative model

10.1609/aaai.v30i1.10366

Cite

Citations (13)

Distantly Supervised Named Entity Recognition using Positive-Unlabeled Learning

arXiv (Cornell University) (2019)

Minlong Peng Xiaoyu Xing Qi Zhang Jinlan Fu Xuanjing Huang

In this work, we explore the way to perform named entity recognition (NER) using only unlabeled data and named entity dictionaries. To this end, we formulate the task as a positive-unlabeled (PU) learning problem and accordingly propose a novel PU learning algorithm to perform the task. We prove that the proposed algorithm can unbiasedly and consistently estimate the task loss as if there is fully labeled data. A key feature of the proposed method is that it does not require the dictionaries to label every entity within a sentence, and it even does not require the dictionaries to label all of the words constituting an entity. This greatly reduces the requirement on the quality of the dictionaries and makes our method generalize well with quite simple dictionaries. Empirical studies on four public NER datasets demonstrate the effectiveness of our proposed method. We have published the source code at \url{https://github.com/v-mipeng/LexiconNER}.

Named Entity Recognition

Labeled data

Feature (linguistics)

Code (set theory)

Entity linking

10.48550/arxiv.1906.01378

Cite

Citations (7)

Hashtag recommendation using attention-based convolutional neural network

International Joint Conference on Artificial Intelligence (2016)

Yuyun Gong Qi Zhang

Along with the increasing requirements, the hashtag recommendation task for microblogs has been receiving considerable attention in recent years. Various researchers have studied the problem from different aspects. However, most of these methods usually need handcrafted features. Motivated by the successful use of convolutional neural networks (CNNs) for many natural language processing tasks, in this paper, we adopt CNNs to perform the hashtag recommendation problem. To incorporate the trigger words whose effectiveness have been experimentally evaluated in several previous works, we propose a novel architecture with an attention mechanism. The results of experiments on the data collected from a real world microblogging service demonstrated that the proposed model outperforms state-of-the-art methods. By incorporating trigger words into the consideration, the relative improvement of the proposed method over the state-of-the-art method is around 9.4% in the F1-score.

Microblogging

Task Analysis

Source

Cite

Citations (178)

Natural Language Processing Of Disfluent Sentences

The Journal of the Acoustical Society of America (2011)

Fuliang Weng Qi Zhang

10.1121/1.3650357

Cite

Citations (1)

Learning "O" Helps for Learning More: Handling the Concealed Entity Problem for Class-incremental NER

arXiv (Cornell University) (2022)

Ruotian Ma Xuanting Chen Lin Zhang Tao Gui Qi Zhang

As the categories of named entities rapidly increase, the deployed NER models are required to keep updating toward recognizing more entity types, creating a demand for class-incremental learning for NER. Considering the privacy concerns and storage constraints, the standard paradigm for class-incremental NER updates the models with training data only annotated with the new classes, yet the entities from other entity classes are unlabeled, regarded as "Non-entity" (or "O"). In this work, we conduct an empirical study on the "Unlabeled Entity Problem" and find that it leads to severe confusion between "O" and entities, decreasing class discrimination of old classes and declining the model's ability to learn new classes. To solve the Unlabeled Entity Problem, we propose a novel representation learning method to learn discriminative representations for the entity classes and "O". Specifically, we propose an entity-aware contrastive learning method that adaptively detects entity clusters in "O". Furthermore, we propose two effective distance-based relabeling strategies for better learning the old classes. We introduce a more realistic and challenging benchmark for class-incremental NER, and the proposed method achieves up to 10.62\% improvement over the baseline methods.

Named Entity Recognition

Benchmark (surveying)

Discriminative model

Entity linking

Confusion

Representation

Baseline (sea)

10.48550/arxiv.2210.04676

Cite

Citations (0)

MINER: Improving Out-of-Vocabulary Named Entity Recognition from an Information Theoretic Perspective

arXiv (Cornell University) (2022)

Xiao Wang Shihan Dou Limao Xiong Yicheng Zou Qi Zhang

NER model has achieved promising performance on standard NER benchmarks. However, recent studies show that previous approaches may over-rely on entity mention information, resulting in poor performance on out-of-vocabulary (OOV) entity recognition. In this work, we propose MINER, a novel NER learning framework, to remedy this issue from an information-theoretic perspective. The proposed approach contains two mutual information-based training objectives: i) generalizing information maximization, which enhances representation via deep understanding of context and entity surface forms; ii) superfluous information minimization, which discourages representation from rote memorizing entity names or exploiting biased cues in data. Experiments on various settings and datasets demonstrate that it achieves better performance in predicting OOV entities.

Named Entity Recognition

Representation

Maximization

Memorization

10.48550/arxiv.2204.04391

Cite

Citations (1)

Navigating the OverKill in Large Language Models

Chenyu Shi Xiao Wang Qiming Ge Songyang Gao Xianjun Yang

10.18653/v1/2024.acl-long.253

Cite

Citations (0)

Weighed Domain-Invariant Representation Learning for Cross-domain Sentiment Analysis

Proceedings of the 17th international conference on Computational linguistics - (2020)

Minlong Peng Qi Zhang

Cross-domain sentiment analysis is currently a hot topic in both the research and industrial areas. One of the most popular framework for the task is domain-invariant representation learning (DIRL), which aims to learn a distribution-invariant feature representation across domains. However, in this work, we find out that applying DIRL may degrade domain adaptation performance when the label distribution P(Y) changes across domains. To address this problem, we propose a modification to DIRL, obtaining a novel weighted domain-invariant representation learning (WDIRL) framework. We show that it is easy to transfer existing models of the DIRL framework to the WDIRL framework. Empirical studies on extensive cross-domain sentiment analysis tasks verified our statements and showed the effectiveness of our proposed solution.

Domain Adaptation

Representation

Transfer of learning

Sentiment Analysis

Feature Learning

10.18653/v1/2020.coling-main.22

Cite

Citations (9)

Policy choice and economic growth under factional politics: Evidence from a Chinese Province

China Economic Review (2017)

Daniel Z. Li Qi Zhang

Grassroots

Autocracy

Robustness

10.1016/j.chieco.2017.11.003

Cite

Citations (32)

Evaluation of an artificial intelligence-based clinical trial matching system in Chinese patients with hepatocellular carcinoma: A retrospective study

Research Square (Research Square) (2023)

Kunyuan Wang Hao Cui Yun Zhu Xiao Hu Chang Hong

Abstract Background: Artificial intelligence (AI)-assisted clinical trial screening is a promising prospect, although previous matching systems were developed in English, and relevant studies have only been conducted in Western countries. Therefore, we evaluated an AI-based clinical trial matching system (CTMS) that extracts medical data from the electronic health record system and matches them to clinical trials automatically. Methods: This study included 1,053 consecutive inpatients primarily diagnosed with hepatocellular carcinoma who were referred to the liver tumor center of an academic medical center in China between January and December 2019. The eligibility criteria extracted from two clinical trials, patient attributes, and gold standard were decided manually. We evaluated the performance of the CTMS against the established gold standard by measuring the accuracy, sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), and run time required. Results: The manual reviewers demonstrated acceptable interrater reliability (Cohen’s kappa 0.65–0.88). The performance results for the CTMS were as follows: accuracy, 92.9%–98.0%; sensitivity, 51.9%–83.5%; specificity, 99.0%–99.1%; PPV, 75.7%–85.1%; and NPV, 97.4%–98.9%. The time required for eligibility determination by the CTMS and manual reviewers was 2 hours and 150 hours, respectively. Conclusions: We found that the CTMS is particularly reliable in excluding ineligible patients in a significantly reduced amount of time. The CTMS excluded ineligible patients for clinical trials with good performance, reducing 98.7% of the work time. Thus, such AI-based systems with natural language processing and machine learning have potential utility in Chinese clinical trials.

Gold standard (test)

Medical record

Inter-Rater Reliability

10.21203/rs.3.rs-3311980/v1

Cite

Citations (1)