Palantir Technologies Inc.
Inferring a dataset schema from input files
Last updated:
Abstract:
Techniques for generating a schema for a data input file are described herein. In an embodiment, a server computer receives a data input file. The server computer system selects a sample excerpt from the data input which comprises a subset of the data input file. The server computer system analyzes the sample excerpt to determine a row delimiter for the data input file, a column delimiter for the data input file, and a plurality of data format types. Using the column delimiter, row delimiter, and plurality of data format types, the server computer system generates a candidate schema for the data input file.
Status:
Grant
Type:
Utility
Filling date:
5 Dec 2018
Issue date:
21 Jan 2020