Palantir Technologies Inc.
INFERRING A DATASET SCHEMA FROM INPUT FILES

Abstract:

Techniques for generating a schema for a data input file are described herein. In an embodiment, a server computer system receives a data input file. The server computer system selects a sample excerpt from the data input file, the sample excerpt comprising a subset of the data input file. The server computer system analyzes the sample excerpt to determine a row delimiter for the data input file, a column delimiter for the data input file, and a plurality of data format types. Using the column delimiter, row delimiter, and plurality of data format types, the server computer system generates a candidate schema for the data input file.
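
The steps named in the abstract (sample an excerpt, determine the row delimiter, determine the column delimiter, determine data format types, generate a candidate schema) can be illustrated with a minimal Python sketch. This is not the claimed implementation. The function names, the candidate delimiter list, the 4096-character sample size, the type names, and the assumption that the first row is a header are all hypothetical choices made for illustration, and quoted fields are deliberately not handled.

```python
from collections import Counter
from datetime import datetime

# Hypothetical candidate column delimiters; the source does not list any.
COLUMN_DELIMITERS = [",", "\t", ";", "|"]

# Hypothetical data format types, tried from most to least specific.
PARSERS = [
    ("integer", int),
    ("float", float),
    ("date", lambda v: datetime.strptime(v, "%Y-%m-%d")),
]


def detect_row_delimiter(sample):
    """Return the line ending used in the sample, checking CRLF first."""
    for candidate in ("\r\n", "\n", "\r"):
        if candidate in sample:
            return candidate
    return "\n"


def detect_column_delimiter(rows):
    """Pick the candidate whose per-row count is nonzero and most consistent."""
    best, best_score = ",", (0, 0)
    for candidate in COLUMN_DELIMITERS:
        counts = [row.count(candidate) for row in rows]
        count, frequency = Counter(counts).most_common(1)[0]
        if count == 0:
            continue  # the most common case is "delimiter absent"
        score = (frequency, count)
        if score > best_score:
            best, best_score = candidate, score
    return best


def parses(parser, value):
    """Return True if the parser accepts the value."""
    try:
        parser(value)
        return True
    except ValueError:
        return False


def infer_type(values):
    """Return the first format type that every non-empty value satisfies."""
    non_empty = [v for v in values if v != ""]
    if not non_empty:
        return "string"
    for name, parser in PARSERS:
        if all(parses(parser, v) for v in non_empty):
            return name
    return "string"


def infer_schema(text, max_chars=4096):
    """Build a candidate schema from a leading excerpt of the input text."""
    sample = text[:max_chars]
    row_delimiter = detect_row_delimiter(sample)
    rows = sample.split(row_delimiter)
    if len(sample) < len(text):
        rows = rows[:-1]  # the last row may have been cut mid-record
    rows = [row for row in rows if row != ""]
    if not rows:
        raise ValueError("sample excerpt contains no complete rows")
    column_delimiter = detect_column_delimiter(rows)
    # Assumption: the first row holds column names.
    header = rows[0].split(column_delimiter)
    records = [row.split(column_delimiter) for row in rows[1:]]
    columns = []
    for index, name in enumerate(header):
        values = [rec[index] for rec in records if index < len(rec)]
        columns.append((name, infer_type(values)))
    return {
        "row_delimiter": row_delimiter,
        "column_delimiter": column_delimiter,
        "columns": columns,
    }
```

Calling `infer_schema` on the text of a delimited file returns a dictionary holding the detected row delimiter, the detected column delimiter, and a list of `(column name, format type)` pairs. Working from an excerpt keeps the analysis cost independent of file size, at the price that a format type inferred from the sample may not hold for later rows, which is one reason the result is best treated as a candidate schema.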

Status:

Application

Type:

Utility

Filing date:

21 Jan 2020

Issue date:

21 May 2020