Alibaba Group Holding Limited
TRAINED SEQUENCE-TO-SEQUENCE CONVERSION OF DATABASE QUERIES
Last updated:
Abstract:
Methods and systems are provided for sequence-to-sequence conversion from unstructured search queries to structured database queries, so that lay persons may retrieve information from relational databases without specialized knowledge of database query languages. An encoder module and a decoder module of a learning model are trained to convert an unstructured search query to an intermediate feature vector by computing co-attention and self-attention based on a context string and a database schema, encoding the database schema in the context string by application of self-attention between the context string containing tokens of the database schema with learned structural attention heads which relate the token to logic of the database. Training is performed using labeled training datasets which include structured database queries which are normalized by parsing into a semantic representation thereof, followed by linearization.
Utility
6 Mar 2020
9 Sep 2021