Scalable Micro-planned Generation of Discourse from Structured Data | Zendy

Anirban Laha | Zendy; Parag Jain | Zendy; Abhijit Mishra | Zendy; Karthik Sankaranarayanan | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

Scalable Micro-planned Generation of Discourse from Structured Data

Author(s) -

Anirban Laha,

Parag Jain,

Abhijit Mishra,

Karthik Sankaranarayanan

Publication year - 2019

Publication title -

computational linguistics

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 0.314

H-Index - 98

eISSN - 1530-9312

pISSN - 0891-2017

DOI - 10.1162/coli_a_00363

Subject(s) - computer science , natural language processing , interpretability , scalability , artificial intelligence , natural language generation , sentence , pipeline (software) , paragraph , fluency , natural language understanding , data manipulation language , natural language , information retrieval , programming language , database , world wide web , linguistics , philosophy

We present a framework for generating natural language description from structured data such as tables. Motivated by the need to approach this problem in a manner that is scalable and easily adaptable to newer domains, unlike existing related systems, our system does not require parallel data; it rather relies on monolingual corpora and basic NLP tools which are easily accessible. The system employs a 3-staged pipeline that: (i) converts entries in the structured data to canonical form, (ii) generates simple sentences for each atomic entry in the canonicalized representation, and (iii) combines the sentences to produce a coherent, fluent and adequate paragraph description through sentence compounding and co-reference replacement modules. Experiments on a benchmark mixed-domain dataset curated for paragraph description from tables reveals the superiority of our system over existing data-to-text approaches. We also demonstrate the robustness of our system in accepting other data types such as Knowledge-Graphs and Key-Value dictionaries.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research