Palantir Technologies Inc.
Propagated deletion of database records and derived data
Last updated:
Abstract:
Using a distributed database system that manages a plurality of different raw datasets and a plurality of derived datasets that have been derived from the raw datasets based on a plurality of derivation relationships that link the raw datasets to the derived datasets, a subset of records that are candidates for propagated deletion of specified data values is determined. One or more particular raw datasets that contain the subset of records are determined. The specified data values are deleted from the particular raw datasets. Based on the plurality of derivation relationships and the particular raw datasets, one or more particular derived datasets that have been derived from the particular raw datasets are identified. A build of the one or more particular derived datasets is generated and executed, which creates and stores the one or more particular derived datasets without the specified data values that were deleted from the particular raw datasets.
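The flow described in the abstract can be illustrated with a minimal in-memory sketch. It is not the patented implementation: the function name propagate_deletion, the dictionaries datasets, parents, and builders, the is_candidate predicate, and the sample data are all hypothetical, and the sketch deletes whole matching records rather than individual data values. It assumes the derivation relationships form an acyclic graph and uses Python's standard graphlib module (Python 3.9+) to rebuild derived datasets after their inputs.

```python
from graphlib import TopologicalSorter  # standard library, Python 3.9+


def propagate_deletion(datasets, parents, builders, is_candidate):
    """Delete matching records from raw datasets, then rebuild derived ones.

    datasets: name -> list of record dicts (raw and derived together)
    parents:  derived name -> names of the datasets it is derived from
    builders: derived name -> function(list of parent record lists) -> records
    All of these names are illustrative assumptions, not from the source.
    """
    raw_names = [n for n in datasets if n not in parents]
    # Find the raw datasets that contain candidate records.
    affected_raw = {
        n for n in raw_names if any(is_candidate(r) for r in datasets[n])
    }
    # Delete the candidate records from those raw datasets.
    for n in affected_raw:
        datasets[n] = [r for r in datasets[n] if not is_candidate(r)]
    # Visit datasets with inputs first; a derived dataset is affected
    # when at least one of its parents is affected, so it is rebuilt.
    affected = set(affected_raw)
    rebuilt = []
    for name in TopologicalSorter(parents).static_order():
        if name in parents and affected.intersection(parents[name]):
            datasets[name] = builders[name]([datasets[p] for p in parents[name]])
            affected.add(name)
            rebuilt.append(name)
    return sorted(affected_raw), rebuilt


# Hypothetical example: two raw datasets and three derived datasets.
datasets = {
    "crm": [{"subject": "alice", "city": "Oslo"},
            {"subject": "bob", "city": "Rome"}],
    "logs": [{"subject": "carol", "event": "login"}],
    "cities": [{"subject": "alice", "city": "Oslo"},
               {"subject": "bob", "city": "Rome"}],
    "report": [{"n_subjects": 3}],
    "log_counts": [{"count": 1}],
}
parents = {
    "cities": ["crm"],
    "report": ["cities", "logs"],
    "log_counts": ["logs"],
}
builders = {
    "cities": lambda ps: [dict(r) for r in ps[0]],
    "report": lambda ps: [{"n_subjects": len(ps[0]) + len(ps[1])}],
    "log_counts": lambda ps: [{"count": len(ps[0])}],
}

affected_raw, rebuilt = propagate_deletion(
    datasets, parents, builders, lambda r: r.get("subject") == "bob"
)
print(affected_raw, rebuilt)
```

In this sketch only "cities" and "report" are rebuilt, because "log_counts" does not descend from the raw dataset that held the deleted record. A production system would also need to handle versioned or distributed storage, which the sketch leaves out.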
Utility
25 May 2018
23 Mar 2021