Palantir Technologies Inc.
INFERRING A DATASET SCHEMA FROM INPUT FILES
Last updated:
Abstract:
Techniques for generating a schema for a data input file are described herein. In an embodiment, a server computer receives a data input file. The server computer selects a sample excerpt comprising a subset of the data input file. The server computer analyzes the sample excerpt to determine a row delimiter for the data input file, a column delimiter for the data input file, and a plurality of data format types. Using the column delimiter, the row delimiter, and the plurality of data format types, the server computer generates a candidate schema for the data input file.
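The steps named in the abstract (sample, detect delimiters, detect format types, build a candidate schema) can be illustrated with a minimal Python sketch. It is not the method claimed in the application. The function names, the candidate delimiter list, the 4096-character sample size, the type labels, and the assumption that the first row is a header are all hypothetical choices made for illustration, and quoted fields are not handled.

```python
import re
from collections import Counter

# Hypothetical candidate column delimiters; the source does not list any.
COLUMN_DELIMITERS = [",", "\t", ";", "|"]

INT_RE = re.compile(r"[+-]?\d+")
FLOAT_RE = re.compile(r"[+-]?(\d+\.\d*|\.\d+|\d+)([eE][+-]?\d+)?")
DATE_RE = re.compile(r"\d{4}-\d{2}-\d{2}")


def detect_row_delimiter(sample):
    # Check "\r\n" first so it is not mistaken for a bare "\n" or "\r".
    for candidate in ("\r\n", "\n", "\r"):
        if candidate in sample:
            return candidate
    return "\n"


def detect_column_delimiter(rows):
    # Prefer the delimiter whose per-row count is nonzero and shared by
    # the most rows; ties go to the delimiter that appears more often.
    best, best_score = ",", (0, 0)
    for delim in COLUMN_DELIMITERS:
        count, freq = Counter(r.count(delim) for r in rows).most_common(1)[0]
        if count > 0 and (freq, count) > best_score:
            best, best_score = delim, (freq, count)
    return best


def infer_type(values):
    # Pick the narrowest type label that every non-empty value satisfies.
    cells = [v.strip() for v in values if v.strip()]
    if not cells:
        return "string"
    if all(INT_RE.fullmatch(c) for c in cells):
        return "integer"
    if all(FLOAT_RE.fullmatch(c) for c in cells):
        return "double"
    if all(DATE_RE.fullmatch(c) for c in cells):
        return "date"
    return "string"


def infer_schema(text, sample_chars=4096):
    # Work on a sample excerpt, i.e. a subset of the input file.
    sample = text[:sample_chars]
    row_delim = detect_row_delimiter(sample)
    rows = [r for r in sample.split(row_delim) if r]
    # If the sample was cut mid-file, the last row may be partial: drop it.
    if len(text) > len(sample) and len(rows) > 1:
        rows = rows[:-1]
    if not rows:
        return {"row_delimiter": row_delim, "column_delimiter": ",", "columns": []}
    col_delim = detect_column_delimiter(rows)
    header = rows[0].split(col_delim)  # assumption: first row is a header
    body = [r.split(col_delim) for r in rows[1:]]
    columns = []
    for i, name in enumerate(header):
        values = [row[i] for row in body if i < len(row)]
        columns.append((name.strip(), infer_type(values)))
    return {"row_delimiter": row_delim, "column_delimiter": col_delim, "columns": columns}


example = "id;name;price;created\r\n1;apple;0.5;2020-01-21\r\n2;pear;1.25;2020-05-21\r\n"
schema = infer_schema(example)
```

For the semicolon-separated example above, the sketch reports `"\r\n"` as the row delimiter, `";"` as the column delimiter, and the column types integer, string, double, and date. The result is only a candidate schema: a production system would also need to handle quoted fields, missing headers, and ambiguous types.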
Status:
Application
Type:
Utility
Filing date:
21 Jan 2020
Issue date:
21 May 2020