Natural-Language-Inference

Since September 2018, I have been working as a research intern in the Alibaba Search Group, where my main research field is closely related to NLI. This project reproduces some current NLI models and introduces the related work.

Introduction

Natural Language Inference models can be applied to many problems in different fields, such as question answering and text entailment inference. This repository mainly focuses on text entailment inference.

This task usually involves three types of data: a Premise, a Hypothesis, and a label describing the relationship between them, as in the example below.
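As an illustration, a single training example can be represented as a simple record. The field names below are my own choice, and the sentences are one well-known pair from the SNLI corpus:

```python
# An illustrative (premise, hypothesis, label) triple in SNLI style.
example = {
    "premise": "A man inspects the uniform of a figure in some East Asian country.",
    "hypothesis": "The man is sleeping.",
    "label": "contradiction",  # one of: entailment / contradiction / neutral
}
```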

In the following sections, I will introduce the feature-based model, the encoder-based model, and the attention-based model.

Feature-based model

The feature-based model is a traditional method with limited performance. However, the features used in this method are still very interesting. I will list the features from the following paper:

Bowman S R, Angeli G, Potts C, et al. A large annotated corpus for learning natural language inference[J]. arXiv preprint arXiv:1508.05326, 2015.

I wrote the code to build the following features (a sketch covering a subset of them appears after the list):

  1. The BLEU score of the hypothesis with respect to the premise, using an n-gram length between 1 and 4.
  2. The length difference between the hypothesis and the premise, as a real-valued feature.
  3. The overlap between words in the premise and hypothesis, both as an absolute count and a percentage of possible overlap, and both over all words and over just nouns, verbs, adjectives, and adverbs.
  4. An indicator for every unigram and bigram in the hypothesis.
  5. Cross-unigrams: for every pair of words across the premise and hypothesis which share a POS tag, an indicator feature over the two words.
  6. Cross-bigrams: for every pair of bigrams across the premise and hypothesis which share a POS tag on the second word, an indicator feature over the two bigrams.
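Here is a minimal sketch of how features 1-3 could be computed, assuming pre-tokenized sentences and using NLTK's sentence-level BLEU. The `extract_features` helper and its feature names are hypothetical, not the exact code in this repository:

```python
from collections import Counter
from nltk.translate.bleu_score import sentence_bleu, SmoothingFunction

def extract_features(premise_tokens, hypothesis_tokens):
    """Compute a subset of the hand-crafted features (features 1-3 above)."""
    feats = {}

    # Feature 1: BLEU of the hypothesis against the premise, n-grams 1-4.
    smooth = SmoothingFunction().method1
    feats["bleu"] = sentence_bleu(
        [premise_tokens], hypothesis_tokens,
        weights=(0.25, 0.25, 0.25, 0.25),
        smoothing_function=smooth,
    )

    # Feature 2: length difference between hypothesis and premise.
    feats["len_diff"] = len(hypothesis_tokens) - len(premise_tokens)

    # Feature 3: word overlap, as an absolute count and as a percentage
    # of the possible overlap (POS-restricted variants are omitted here).
    p_counts, h_counts = Counter(premise_tokens), Counter(hypothesis_tokens)
    overlap = sum((p_counts & h_counts).values())
    feats["overlap_count"] = overlap
    feats["overlap_pct"] = overlap / max(len(hypothesis_tokens), 1)

    return feats

print(extract_features("a man is sleeping".split(),
                       "a man sleeps on a couch".split()))
```

The POS-restricted overlap variants in feature 3 and the indicator features 4-6 would additionally require a POS tagger such as `nltk.pos_tag`.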

Encoder-based model

Encoder-based models are mainly built on LSTM, CNN, Tree-LSTM, and similar architectures. In this section, I will introduce the following representative models and papers.

  1. LSTM: Bowman S R, Angeli G, Potts C, et al. A large annotated corpus for learning natural language inference[J]. arXiv preprint arXiv:1508.05326, 2015.

This method is very simple: the encoding layer uses an LSTM network. Detailed information about the model can be found in the paper. I built this model and improved it by replacing the LSTM component with a BiLSTM component (a sketch of the BiLSTM variant appears after this list). tensorflow code link

  2. TBCNN: Mou L, Men R, Li G, et al. Natural language inference by tree-based convolution and heuristic matching[J]. arXiv preprint arXiv:1512.08422, 2015. (The heuristic matching layer from this paper is sketched after this list.)
  3. Tree-LSTM: todo..
  4. Match-LSTM: todo..
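The BiLSTM variant from item 1 can be sketched with tf.keras as follows. This is a minimal sketch rather than the repository's actual code: the vocabulary size, hidden sizes, and the two tanh layers before the softmax are assumptions loosely following Bowman et al. (2015).

```python
import tensorflow as tf

# Hypothetical hyperparameters; the repository's exact settings may differ.
VOCAB_SIZE = 20000
EMBED_DIM = 300
HIDDEN_DIM = 100
NUM_CLASSES = 3   # entailment / contradiction / neutral
MAX_LEN = 50

def build_bilstm_nli_model():
    premise = tf.keras.Input(shape=(MAX_LEN,), dtype="int32", name="premise")
    hypothesis = tf.keras.Input(shape=(MAX_LEN,), dtype="int32", name="hypothesis")

    # Shared embedding and a shared BiLSTM sentence encoder.
    embed = tf.keras.layers.Embedding(VOCAB_SIZE, EMBED_DIM, mask_zero=True)
    encoder = tf.keras.layers.Bidirectional(tf.keras.layers.LSTM(HIDDEN_DIM))

    p_vec = encoder(embed(premise))      # shape: (batch, 2 * HIDDEN_DIM)
    h_vec = encoder(embed(hypothesis))   # shape: (batch, 2 * HIDDEN_DIM)

    # Concatenate the two sentence vectors and classify, in the spirit of
    # the sentence-embedding model of Bowman et al. (2015).
    pair = tf.keras.layers.Concatenate()([p_vec, h_vec])
    x = tf.keras.layers.Dense(200, activation="tanh")(pair)
    x = tf.keras.layers.Dense(200, activation="tanh")(x)
    probs = tf.keras.layers.Dense(NUM_CLASSES, activation="softmax")(x)

    return tf.keras.Model([premise, hypothesis], probs)

model = build_bilstm_nli_model()
model.compile(optimizer="adam",
              loss="sparse_categorical_crossentropy",
              metrics=["accuracy"])
```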
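For TBCNN (item 2), the heuristic matching step combines the two sentence vectors through concatenation, element-wise difference, and element-wise product before the classifier. A minimal sketch of that combination layer:

```python
import tensorflow as tf

def heuristic_matching(h1, h2):
    """Heuristic matching from Mou et al. (2015): concatenate the two
    sentence vectors with their element-wise difference and product."""
    return tf.concat([h1, h2, h1 - h2, h1 * h2], axis=-1)

# Example: a batch of 32 sentence-vector pairs of dimension 200 each.
p_vec = tf.random.normal([32, 200])
h_vec = tf.random.normal([32, 200])
matched = heuristic_matching(p_vec, h_vec)  # shape: (32, 800)
```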

Attention-based model

todo..
